List of libraries, tools and APIs for web scraping and data processing.
-
Updated
Oct 27, 2024 - Makefile
List of libraries, tools and APIs for web scraping and data processing.
The All in One Framework to build Awesome Scrapers.
a reliable high-level web crawling & scraping framework for Node.js.
Lightweight web scraping toolkit for documents and structured data.
Pythonic HTML Parsing for Humans™
Sneakpeek is a framework that helps to quickly and conviniently develop scrapers. It’s the best choice for scrapers that have some specific complex scraping logic that needs to be run on a constant basis
Serritor is an open source web crawler framework built upon Selenium and written in Java. It can be used to crawl dynamic web pages that require JavaScript to render data.
ProxyCrawl PHP library for scraping and crawling websites
🚀 OFFICIAL STARTER TEMPLATE FOR BOTASAURUS SCRAPING FRAMEWORK 🤖
An API wrapper for Scrappey.com written in Python (cloudflare, datadome bypass & solver)
🚀 OMKAR TEMP MAIL HELPS YOU USE TEMPORARY EMAILS. 🤖
A simple, easy to use, scalable scraping framework written in PHP
An API wrapper for Scrappey.com written in Node.js (cloudflare bypass & solver)
🚀 THIS WEB SCRAPING TEMPLATE PROVIDES YOU WITH A GREAT STARTING POINT WHEN CREATING WEB SCRAPING BOTS. 🤖
Web scraping API to outsource tons of GET & xpath to cloud computing
🚀 FINAL CODE FOR TUTORIAL ON HOW TO SOLVE CAPTCHA IN SELENIUM USING 2CAPTCHA 🤖
✨ NodeJs crawling & scraping framework heavily inspired by Scrapy
M.A. Thesis work, news scraping framework/pipeline using python, beautifulsoup, newspaper3k, flask and mongodb with a custom api.
🚀 SCRAPE 1000'S OF PRODUCTS FROM DENTALKART 🤖
Scrape Amazon products
Add a description, image, and links to the scraping-framework topic page so that developers can more easily learn about it.
To associate your repository with the scraping-framework topic, visit your repo's landing page and select "manage topics."