Stars
A fully offline voice assistant that combines LM Studio and Applio. Uses two methods of TTS and STT, and also has some extra features.
Official source code of FreeCAD, a free and open-source multiplatform 3D parametric modeler.
A Python module to scrape several search engines (like Google, Yandex, Bing, DuckDuckGo, ...). Includes asynchronous networking support.
Apache Nutch is an extensible and scalable web crawler
All-in-one tool for Information Gathering, Vulnerability Scanning and Crawling. A must-have tool for all penetration testers
Intelligent proxy pool for Humans™ to extract content from the internet and build your own Large Language Models in this new AI era
DotnetSpider, a .NET Standard web crawling library. It is a lightweight, efficient, and fast high-level web crawling & scraping framework
Analysis of Bot Protection systems with available countermeasures 🚿. How to defeat anti-bot systems 👻 and get around browser fingerprinting scripts 🕵️‍♂️ when scraping the web?
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
A collection of awesome web crawlers and spiders in different languages
Incredibly fast crawler designed for OSINT.
A next-generation crawling and spidering framework.
Scrapy, a fast high-level web crawling & scraping framework for Python.
Crawl a site to generate knowledge files to create your own custom GPT from a URL
An easy-to-use, powerful crawler implemented in PHP. Can execute JavaScript.
AWX provides a web-based user interface, REST API, and task engine built on top of Ansible. It is one of the upstream projects for Red Hat Ansible Automation Platform.
newspaper3k provides news, full-text, and article metadata extraction in Python 3. Advanced docs:
🕵️‍♂️ Collect a dossier on a person by username from thousands of sites
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
Easily train a good VC model with voice data <= 10 mins!
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Free ChatGPT & DeepSeek API key. Free access to the DeepSeek API and GPT-4 API, with support for top-ranked, commonly used large models such as gpt | deepseek | claude | gemini | grok.
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.
An API wrapper for Discord written in Python.
Download market data from Yahoo! Finance's API