-
Peking University
- Deneb
-
12:41
(UTC +08:00)
Stars
Stable Diffusion web UI
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
LlamaIndex is the leading document agent and OCR platform
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
Fully open reproduction of DeepSeek-R1
SGLang is a high-performance serving framework for large language models and multimodal models.
An open-source RAG-based tool for chatting with your documents.
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
The official Python SDK for Model Context Protocol servers and clients
verl: Volcano Engine Reinforcement Learning for LLMs
Automatic headphone equalization from frequency responses
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
An open source implementation of CLIP.
Hierarchical Reasoning Model Official Release
vits2 backbone with multilingual-bert
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
A debugging and profiling tool that can trace and visualize python code execution
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
Understand Human Behavior to Align True Needs
Python Sorted Container Types: Sorted List, Sorted Dict, and Sorted Set