Starred repositories
📚 Freely available programming books
All Algorithms implemented in Python
Stable Diffusion web UI
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal tasks, for both inference and training.
A feature-rich command-line audio/video downloader
Share interesting, entry-level open source projects on GitHub.
Robust Speech Recognition via Large-Scale Weak Supervision
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
One minute of voice data is enough to train a good TTS model (few-shot voice cloning).
Generate high-definition short videos with one click using large AI models.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents with the layout fully preserved; supports Google/DeepL/Ollama/OpenAI and other services; provides CLI/GUI/MCP/Docker/Zotero
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
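Python web-crawler tutorial series: learn Python crawling from zero to one, including browser packet capture and mobile app packet capture (e.g. fiddler, mitmproxy); use of the modules crawlers rely on, such as requests, beautifulSoup, selenium, appium, and scrapy; plus IP proxies, CAPTCHA recognition, using MySQL and MongoDB from Python, multithreaded and multiprocess crawlers, reverse engineering of CSS-based crawler encryption, JS reverse engineering for crawlers, …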
WebUI extension for ControlNet
PyTorch implementations of Generative Adversarial Networks.
Bringing Old Photos Back to Life (CVPR 2020 oral)
A very simple framework for state-of-the-art Natural Language Processing (NLP)
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, try Sync Labs.
🍀 PyTorch implementations of various attention mechanisms, MLPs, re-parameterization, and convolution modules, helpful for understanding the papers in more depth. ⭐⭐⭐
Python APIs for web automation, testing, and bypassing bot-detection with ease.
Generic automation framework for acceptance testing and RPA