Starred repositories
A collection of guides and examples for the Gemma open models from Google.
Integrate cutting-edge LLM technology quickly and easily into your apps
🔊 Text-Prompted Generative Audio Model
HeartMuLa Official Repo: The Most Powerful Open-Source Music Generation Model of 2026
Open neural machine translation models and web services
Algorithm powering the For You feed on X
⏰ AI conference deadline countdowns
Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages
Sharp Monocular View Synthesis in Less Than a Second
Blendshape and kinematics calculator for MediaPipe/TensorFlow.js Face, Eyes, Pose, and Finger tracking models.
Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.
LLM quantization (compression) toolkit with hardware acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.
Introduction to Machine Learning Systems
State-of-the-art paired encoder and decoder models (17M-1B params)
SkyRL: A Modular Full-stack RL Library for LLMs
Qwen3-Omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
🤗 A PyTorch-native and Flexible Inference Engine with Hybrid Cache Acceleration and Parallelism for DiTs.
Evaluation software used in the Text Retrieval Conference