Highlights
- Pro
Stars
An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.
A framework for few-shot evaluation of language models.
A reproduction of the Deepseek-OCR model including training
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
[ICLR 2026] Official repo for paper "Video-As-Prompt: Unified Semantic Control for Video Generation"
official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"
Fast and memory-efficient exact attention
A high-throughput and memory-efficient inference and serving engine for LLMs
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
An open-source AI agent that brings the power of Gemini directly into your terminal.
Pytorch Implementation (unofficial) of the paper "Mean Flows for One-step Generative Modeling" by Geng et al.
A curated list for awesome discrete diffusion models resources.
[NeurIPS 2025] Open-source Multi-agent Poster Generation from Papers
✨✨Latest Advances on Multimodal Large Language Models
Enable Comprehensive LLM Evaluation on Graph Reasoning
PyTorch implementation for RPO https://arxiv.org/abs/2407.12164
Minimal reproduction of DeepSeek R1-Zero
Janus-Series: Unified Multimodal Understanding and Generation Models
EDM2 and Autoguidance -- Official PyTorch implementation
This repo contains the code for 1D tokenizer and generator
Implementation of MagViT2 Tokenizer in Pytorch
Vector (and Scalar) Quantization, in Pytorch
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
High-Resolution Image Synthesis with Latent Diffusion Models
A latent text-to-image diffusion model
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner.
PyTorch implementation of RCG https://arxiv.org/abs/2312.03701