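Hands-on 🐍 crawlers 🕷 for a variety of websites and e-commerce data. Includes 🕸: Taobao products, WeChat official accounts, Dianping, Qichacha, job sites, Xianyu, Ali tasks, Cnblogs, Weibo, Baidu Tieba, Douban Movies, Ibaotu, Quanjing, Douban Music, a provincial drug administration, Sohu News, machine-learning text collection, fofa asset collection, Autohome, National Bureau of Statistics, Baidu keyword index counts, spider pan-directory, Toutiao, Douban film reviews, Ctrip, Xiaomi App Store, Anjuke, Tujia homestays ❤️❤️❤️. WeChat crawler showcase project: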

Python 5,336 1,420 Updated May 22, 2024

nghuyong / WeiboSpider

An actively maintained Sina Weibo collection tool 🚀🚀🚀

Python 3,996 842 Updated Aug 23, 2025

LiuXingMing / SinaSpider

Sina Weibo crawler (Scrapy, Redis)

Python 3,280 1,508 Updated Sep 5, 2018

Gerapy / Gerapy

Distributed Crawler Management Framework Based on Scrapy, Scrapyd, Django and Vue.js

Python 3,493 649 Updated Oct 29, 2024

librauee / Reptile

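🏀 Python3 web crawlers in practice (some with detailed tutorials): Maoyan, Tencent Video, Douban, Yanzhaowang, Weibo, Biquge novels, Baidu Hot Topics, Bilibili, CSDN, NetEase Cloud Reading, Ali Literature, Baidu Stocks, Toutiao, WeChat official accounts, NetEase Cloud Music, Lagou, Youdao, unsplash, Shixiseng, Autohome, League of Legends Box, Dianping, Lianjia, LPL schedule, typhoons, Fantasy Westward Journey and Onmyoji Cangbaoge, weather, Nowcoder, Baidu Wenku, bedtime stories, Zhihu, Wish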

Python 1,712 512 Updated Apr 19, 2021

scrapy / quotesbot

This is a sample Scrapy project for educational purposes

Python 1,349 779 Updated Nov 29, 2023

clemfromspace / scrapy-selenium

Scrapy middleware to handle javascript pages using selenium

Python 957 360 Updated Jul 8, 2024

lb2281075105 / Python-Spider

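Douban Movie Top 250, Douyu JSON data scraping and scraping pictures of beautiful women, Taobao, Youyuan, CrawlSpider scraping of basic profile information from the Hongniang matchmaking site plus distributed Hongniang crawling with Redis storage, small crawler demos, Selenium, scraping Dmall, Django API development, scraping Youyuan data, simulated Zhihu login, simulated GitHub login, simulated Tuchong login, scraping the full Dmall store site, scraping WeChat official account article history, scraping articles shared in WeChat groups or by WeChat friends, itchat listening…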

Python 784 275 Updated Aug 27, 2022

WSOL12 / Solana-Arbitrage-Bot

Solana Arbitrage Bot on pump.fun, Meteora, Raydium and Orca using Jito bundling, RPC and gRPC. …

TypeScript 487 210 Updated Nov 21, 2025

54xingzhe / weixin_crawler

An efficient crawler for WeChat official account article history and read-count data, powered by scrapy

JavaScript 473 714 Updated Dec 6, 2018

zhanghe06 / news_spider

News scraping (WeChat, Weibo, Toutiao...)

Python 225 44 Updated Dec 8, 2022

clemfromspace / scrapy-cloudflare-middleware

A Scrapy middleware to bypass CloudFlare's anti-bot protection

Python 112 24 Updated Jun 20, 2021

monkey-hjy / python-spider

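Small Python crawler projects [continuously updated] [Biquge novel downloads, Tweet data scraping, weather lookup, NetEase Cloud Music reverse engineering, Tiantian Fund lookup, Weibo data scraping (cookie generation), Youdao Translate reverse engineering, Qichacha crawler without login, Dianping SVG encryption cracking, Bilibili user crawler, Lagou crawler without login, Ziroom rental font encryption, Zhihu Q&A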

JavaScript 112 50 Updated Jan 5, 2024

WSOL12 / Solana-Relayer

A high-performance Solana transaction relayer written in Rust that connects your RPC to Jito’s block engine, enabling low-latency, reliable routing of transactions.

Rust 49 18 Updated Nov 18, 2025

woxcab / scrapy_rss

Tools to easily generate an RSS feed that contains each scraped item, using the Scrapy framework.

Python 33 4 Updated Nov 19, 2025

Ingram7 / Weibo

Scraping Weibo with Scrapy (parses the m.weibo.cn API to extract information)

Python 24 9 Updated Sep 26, 2019

bingo-zh / scrapy-amac

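To make it easier for investors to look up private fund information, the Asset Management Association of China built a classified disclosure platform for private funds on its official website, which discloses private funds by category according to the information registered by private fund managers. To get a fuller picture of the relevant institutions and products, this project uses the Scrapy framework to retrieve part of that information as a learning exercise.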

Python 18 4 Updated Apr 20, 2020

Apocally / WeiboWebSpider

A Weibo crawler based on Python 3.6 (scrapy)

Python 12 5 Updated Jan 22, 2017

Ingram7 / WeiboSpider

Scraping Weibo with Scrapy (weibo.cn static pages; regex and XPath parsing)

Python 9 2 Updated Aug 26, 2019

croqaz / scrapy-count-filter

Scrapy🕷 middleware for limiting requests based on a counter

Python 7 1 Updated Dec 12, 2019

Sp4rr0w / GitHub-Scrapy-2

Scrapes user information from GitHub (an upgraded version of the GitHub-Scrapy project; the rules changed)

Python 4 2 Updated Nov 19, 2015

xueyuzi / ScapSpider

A CVE vulnerability database crawler implemented with Scrapy

Python 3 3 Updated Jan 22, 2017

leonardohra / PacktPub_Scrapy

Generate a CSV file with information about a person's PacktPub books.

Python 1 1 Updated Feb 26, 2018

firecrawl / firecrawl

🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data

TypeScript 69,957 5,493 Updated Dec 17, 2025

apify / crawlee

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, an…

TypeScript 20,842 1,121 Updated Dec 17, 2025

zwvc

Crawler

crawlab-team / crawlab

ScrapeGraphAI / Scrapegraph-ai

scrapy / scrapy

scrapinghub / portia

chyroc / WechatSogou

DropsDevopsOrg / ECommerceCrawlers