Stars
real time face swap and one-click video deepfake with only a single image
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
Fully open reproduction of DeepSeek-R1
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
GUI for a Vocal Remover that uses Deep Neural Networks.
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
An open-source tool-augmented conversational language model from Fudan University
Wan: Open and Advanced Large-Scale Video Generative Models
Enjoy the magic of Diffusion models!
A framework for few-shot evaluation of language models.
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
Kronos: A Foundation Model for the Language of Financial Markets
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Chronos: Pretrained Models for Time Series Forecasting
Simple, scalable AI model deployment on GPU clusters
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
The state-of-the-art image restoration model without nonlinear activation functions.
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
"MiniRAG: Making RAG Simpler with Small and Open-Sourced Language Models"
Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting