Python 16 Updated Apr 8, 2026

A PyTorch native platform for training generative AI models

Python 5,225 781 Updated Apr 11, 2026

This repository contains the training code from the paper "SpidR: Learning Fast and Stable Linguistic Units for Spoken Language Models Without Supervision". SpidR is a self-supervised speech representat…

Python 56 6 Updated Mar 30, 2026

A simple, unified training engine for multimodal models. Lean, flexible, and built for hacking at scale.

Python 758 35 Updated Apr 11, 2026

MiroThinker is a deep research agent optimized for complex research and prediction tasks. Our latest models, MiroThinker-1.7 and MiroThinker-H1, achieve 74.0 and 88.2 on BrowseComp, respectively.

Python 8,099 603 Updated Apr 10, 2026

Official JAX implementation of End-to-End Test-Time Training for Long Context

Python 583 41 Updated Feb 15, 2026

Qwen-Image-Layered: Layered Decomposition for Inherent Editability

Python 1,766 137 Updated Dec 31, 2025

An Extensible Deep Learning Library

Python 2,340 402 Updated Feb 18, 2026

🚀 A curated list of awesome resources focusing on context compression techniques for Large Language Models (LLMs).

HTML 69 1 Updated Jan 17, 2026

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 2,241 156 Updated Dec 8, 2025

[NeurIPS 2025 Oral] Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation

Python 747 27 Updated Nov 27, 2025

Native Multimodal Models are World Learners

Python 1,497 61 Updated Dec 30, 2025

A language-model–powered compressor for natural language text

Python 49 3 Updated Oct 23, 2025

Training Large Language Models to Reason in a Continuous Latent Space

Python 1,566 172 Updated Apr 8, 2026

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 984 69 Updated Jul 31, 2025

Code for MetaMorph: Multimodal Understanding and Generation via Instruction Tuning

Python 236 9 Updated Jan 22, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,594 3,627 Updated Apr 10, 2026

PyTorch implementation of the paper "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"

Python 429 24 Updated Jun 20, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 2,245 153 Updated Mar 12, 2026

Fully open reproduction of DeepSeek-R1

Python 25,977 2,411 Updated Apr 2, 2026

Train transformer language models with reinforcement learning.

Python 18,001 2,633 Updated Apr 11, 2026

Solve Visual Understanding with Reinforced VLMs

Python 5,936 377 Updated Mar 12, 2026

[arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation

Jupyter Notebook 96 3 Updated Mar 1, 2025

COYO-700M: Large-scale Image-Text Pair Dataset

Python 1,252 38 Updated Nov 30, 2022

[CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,558 93 Updated Nov 10, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,756 271 Updated Jul 18, 2025

[ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark

Python 141 6 Updated Jun 4, 2025

Next-Token Prediction is All You Need

Python 2,393 95 Updated Jan 12, 2026

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 33,299 6,911 Updated Apr 11, 2026