-
National University of Singapore
- Singapore
- https://czg1225.github.io/chenzigeng99/
Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Open-Sora: Democratizing Efficient Video Production for All
Generative Models by Stability AI
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Fully open reproduction of DeepSeek-R1
Official inference repo for FLUX.1 models
Fast and memory-efficient exact attention
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
verl: Volcano Engine Reinforcement Learning for LLMs
Janus-Series: Unified Multimodal Understanding and Generation Models
Train transformer language models with reinforcement learning.
Lets make video diffusion practical!
Wan: Open and Advanced Large-Scale Video Generative Models
Wan: Open and Advanced Large-Scale Video Generative Models
Minimal reproduction of DeepSeek R1-Zero
Hierarchical Reasoning Model Official Release
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Enjoy the magic of Diffusion models!
HunyuanVideo: A Systematic Framework For Large Video Generation Model
A framework for few-shot evaluation of language models.
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.