Parallelized web scraper for Github
-
Updated
Jan 3, 2021 - Python
Parallelized web scraper for Github
python script to find top 10 starred repos of user
Scrapes github profiles and stores data in json format
A multi-threaded GitHub scraper to collect Python code with docstrings from public repositories, creating a well-documented dataset for the JaraConverse LLM model.
Scrapes a whole github users page and searches for name leaks. Creates a list of emails from all non github api commits to a users personal(non forked) repos
This is a simple flask api for getting someone's GitHub profile details such as Name, No. of public repositories, No. of followers, No. of following etc, made by scraping GitHub
It allows you to search sensitive credentials (Based on your keyword file) in Git repositories and create a Json and HTML report. Tested on GitHub, GitLab and Gerrit
Advanced github scraper with switching proxies that will help ya find a keyword by searching the target's repos, folders, files.
Add a description, image, and links to the github-scraper topic page so that developers can more easily learn about it.
To associate your repository with the github-scraper topic, visit your repo's landing page and select "manage topics."