Lists (5)
Sort Name ascending (A-Z)
Stars
A Cross-Platform Backend for High-Performance Sparse Convolutions
The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free
Official inference repo for FLUX.2 models
Kandinsky 5.0: A family of diffusion models for Video & Image generation
Momentum Human Rig is an anatomically-inspired parametric full-body digital human model developed at Meta. It includes: A parametric body skeletal model; A realistic 3D mesh skinned to the skeleton…
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
VLAC: A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning
Open-source framework for the research and development of foundation models.
Open-source Trading OS with pluggable AI brain | From market data → AI reasoning → Trade execution | Self-hosted & Multi-exchange
Autonomous GPU Kernel Generation via Deep Agents
Testing adaptation of the DINOv2/3 encoders for vision tasks with Low-Rank Adaptation (LoRA)
微舆:人人可用的多Agent舆情分析助手,打破信息茧房,还原舆情原貌,预测未来走向,辅助决策!从0实现,不依赖任何框架。
A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
[Neurips DB 2025] PartNeXt: A Next-Generation Dataset for Fine-Grained and Hierarchical 3D Part Understanding
LLM agents built for control. Designed for real-world use. Deployed in minutes.
An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
Qwen Code is a coding agent that lives in the digital world.