Stars
Stable Diffusion web UI
A programmer's guide to cooking at home (Simplified Chinese only).
Examples and guides for using the OpenAI API
Clone a voice in 5 seconds to generate arbitrary speech in real-time
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
🚀 AI voice cloning: clone your voice in 5 seconds and generate arbitrary speech in real-time
State-of-the-art 2D and 3D Face Analysis Project
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Software that can generate photos from paintings, turn horses into zebras, perform style transfer, and more.
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance.
DLRover: An Automatic Distributed Deep Learning System
JethroChow / HunyuanDiT
Forked from Tencent-Hunyuan/HunyuanDiT
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding