Spaces:
Running
Running
SERPent
SERP results scrapping
SERPent exposes an unified API to query SERP (Search Engine Result Pages) for a few common search engines, namely:
- DuckDuckGo
- Brave
- Bing
- Google Patents
The application uses the playwright
library to control a headless web browser, to simulate normal user activity, to fool the anti-bot measures often present on those sites. See the /serp/
endpoints for search results scrapping.
Website sources scrapping
SERPent also exposes a few endpoints to scrap the contents of certain sources (patents, scholar). See the /scrap/
endpoints for supported website sources scrapping.