🕷️ Enable AI agents to scrape and crawl the web effortlessly with this lightweight Model Context Protocol server, integrating seamlessly into your workflows.
-
Updated
Mar 28, 2026 - Python
🕷️ Enable AI agents to scrape and crawl the web effortlessly with this lightweight Model Context Protocol server, integrating seamlessly into your workflows.
🕷️ Automate web scraping with OmniCrawler, a powerful tool that builds large datasets by discovering and downloading relevant content effortlessly.
🌐 Use Crawlee to streamline web scraping with Node.js, featuring sessions management, proxy rotation, and dynamic content handling for efficient data extraction.
🔍 Automate dynamic web scraping with Scraping Browser, a full-host solution using Puppeteer, Selenium, and Playwright for seamless data collection.
🛒 Extract and analyze Amazon product data effortlessly with this lightweight Python scraper, ideal for price tracking and competitor research.
🏥 Scrape detailed hospital information from the Deutsches Krankenhaus Verzeichnis website, gathering structured data like contact details and addresses efficiently.
🕷️ Build efficient web crawlers with Vncz-Test-Actor-Scraper, a TypeScript template using Puppeteer for JavaScript-heavy pages and structured data storage.
📊 Extract Instagram Reels and insights efficiently with this high-performance scraper, turning public data into structured datasets for analysis and marketing.
📧 Scrape names and emails from US state bar websites to enhance legal networking and research with accurate, public contact data.
🕵️♂️ Scrape attorney data from major U.S. directories while ensuring compliance, covering 20 cities and five practice areas efficiently.
🛒 Scrape Amazon product data efficiently with AI and Playwright, designed for developers and data analysts seeking structured information.
🕵️♂️ Perform robust web security scanning and reconnaissance with PhantomCrawler, designed for researchers and pen testers to enhance application security.
🤖 Build and interact with Claude Agent using this Python SDK for seamless integration and efficient asynchronous querying.
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
CLI for Olostep - The fastest way to get clean web data into your AI workflows. Search, scrape, and crawl the web from your terminal with Olostep — no headless browsers, no anti-bot headaches, no infra.
what is the best web scraping API service? Research through benchmarks
오하아사 순위를 아침마다 알려주는 디스코드 봇
Spider n8n community node — crawl, scrape, and extract structured data from any website inside your n8n workflows.
Open product-intelligence engine that turns messy retail and manufacturer page data into clean, canonical, comparable product records.
Add a description, image, and links to the web-crawling topic page so that developers can more easily learn about it.
To associate your repository with the web-crawling topic, visit your repo's landing page and select "manage topics."