Hassle-free web scraping service.
- Render client-side-rendered web pages
- Auto extract metadata and article content
- Extract DOM elements via CSS selectors
- Domain blocking (when the BLOCKLIST_URL environment variable is provided)
- Forward request headers such as user-agent, cookies, ...
- HTTP proxy
- Bundled with a blocklist of over 57,000 adware and malware domains
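To illustrate the domain-blocking feature listed above, here is a minimal sketch of how a blocklist lookup could work. It is not taken from this project's source: the helper names `parseBlocklist` and `isBlocked` are hypothetical, and the blocklist format (one domain per line, optionally in hosts-file style with `#` comments) is an assumption.

```typescript
// Hypothetical helpers for illustration only; not this service's actual code.
// Assumed blocklist format: one domain per line, optionally hosts-file style
// ("0.0.0.0 example.com"), with "#" starting a comment.
function parseBlocklist(text: string): Set<string> {
  const domains = new Set<string>();
  for (const rawLine of text.split(/\r?\n/)) {
    const line = rawLine.replace(/#.*$/, "").trim(); // strip comments and whitespace
    if (line === "") continue;
    const fields = line.split(/\s+/);
    // The domain is the last field, whether or not an IP address precedes it.
    domains.add(fields[fields.length - 1].toLowerCase());
  }
  return domains;
}

// Returns true when the URL's host, or any parent domain of it, is on the list.
function isBlocked(url: string, blocklist: Set<string>): boolean {
  let host = new URL(url).hostname.toLowerCase();
  while (host.includes(".")) {
    if (blocklist.has(host)) return true;
    host = host.slice(host.indexOf(".") + 1); // drop the leftmost label
  }
  return blocklist.has(host);
}

const blocklist = parseBlocklist(
  "# sample list\n0.0.0.0 ads.example.com\ntracker.example.net\n"
);
console.log(isBlocked("https://cdn.ads.example.com/banner.js", blocklist)); // true
console.log(isBlocked("https://example.com/article", blocklist)); // false
```

Storing the domains in a `Set` keeps each lookup constant-time, which matters for a list with tens of thousands of entries; walking up the parent domains lets a single entry cover all of its subdomains.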
- Node.js >= 14
- Environment variables specified in .env.example
$ npm i # yarn install
$ npm run start:dev # yarn start:dev
$ npm run docker:build:app # yarn docker:build:app
$ npm run docker:start:prod # yarn docker:start:prod
Start the app and go to /docs for interactive API documentation.
Read more here.