Starred repositories
๐ค Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
A feature-rich command-line audio/video downloader
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Robust Speech Recognition via Large-Scale Weak Supervision
ใๅจๆๅญฆๆทฑๅบฆๅญฆไน ใ๏ผ้ขๅไธญๆ่ฏป่ ใ่ฝ่ฟ่กใๅฏ่ฎจ่ฎบใไธญ่ฑๆ็่ขซ70ๅคไธชๅฝๅฎถ็500ๅคๆๅคงๅญฆ็จไบๆๅญฆใ
๐ OpenHands: AI-Driven Development
๐งโ๐ซ 60+ Implementations/tutorials of deep learning papers with side-by-side notes ๐; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gaโฆ
๐๐ค Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
AI agents running research on single-GPU nanochat training automatically
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
YOLOv5 ๐ in PyTorch > ONNX > CoreML > TFLite
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
The simplest, fastest repository for training/finetuning medium-sized GPTs.
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels ofโฆ
A Simple and Universal Swarm Intelligence Engine, Predicting Anything. ็ฎๆด้็จ็็พคไฝๆบ่ฝๅผๆ๏ผ้ขๆตไธ็ฉ
Streamlit โ A faster way to build and share data apps.
Build and share delightful machine learning apps, all in Python. ๐ Star to support our work!
A generative speech model for daily dialogue.
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
We write your reusable computer vision tools. ๐
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (Vโฆ
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
๐ค Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
OpenMMLab Detection Toolbox and Benchmark