Starred repositories
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
🦜🔗 The platform for reliable agents.
Robust Speech Recognition via Large-Scale Weak Supervision
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
The world's simplest facial recognition api for Python and the command line
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
State-of-the-art 2D and 3D Face Analysis Project
A complete and graceful API for Wechat. 微信个人号接口、微信机器人及命令行微信,三十行即可自定义个人号机器人。
Industry leading face manipulation platform
Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.
Turn (almost) any Python command line program into a full GUI application with one line
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
The official Python SDK for Model Context Protocol servers and clients
An API wrapper for Discord written in Python.
Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!