Skip to content

MR-workaholic/QHgsxt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

青海工商系统爬虫

爬取的网址为http://qh.gsxt.gov.cn/

采用的是Scrapy爬虫框架,学习网址有Scrapy1.0中文文档

爬取之前安装requirements.txt里面的库,并进入相应的虚拟环境中

开始爬虫:scrapy crawl qh_gsxt -o result.json --logfile=debugmsg.txt --loglevel=DEBUG

About

青海工商局爬虫

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors