Stars
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Official inference repo for FLUX.1 models
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Nuitka is a Python compiler written in Python. It's fully compatible with Python 2.6, 2.7, 3.4-3.13. You feed it your Python app, it does a lot of clever things, and spits out an executable or exte…
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
ChatRWKV is like ChatGPT but powered by the RWKV (100% RNN) language model, and open source.
The most powerful local music generation model that outperforms almost all commercial alternatives, supporting Mac, AMD, Intel, and CUDA devices.
🚀 [Large models] Train a 67M-parameter visual multimodal VLM from scratch in just 1 hour! 🌏
Zenless Zone Zero OneDragon | fully automatic | auto dodge | auto dailies | auto Hollow | controller support
Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.
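Grok2API, built on FastAPI and fully adapted to the OpenAI-compatible call format. Supports streaming and non-streaming chat, image generation, image editing, video generation, tool calling, voice chat, one-click NSFW, and integrated account-pool concurrency with automatic load balancing.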
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
A Foundation Model for Generalist Gaming Agents
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
PyPy is a very fast and compliant implementation of the Python language.
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
Independent set of GDScript tools - parser, linter, formatter, and more
An open-source package that gives video game creators, AI researchers, and hobbyists the opportunity to learn complex behaviors for their non-player characters or agents
[NeurIPS 2025] Direct3D‑S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention
Added vLLM support to IndexTTS for faster inference.