Skip to content
View jiequancui's full-sized avatar

Block or report jiequancui

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Single-stage End-to-End Training for Tokenization and Generation

Python 89 1 Updated Mar 24, 2026

Code and website for Self-Flow: Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis

Python 436 17 Updated Mar 15, 2026

[🚀 ICLR 2026 Oral] NextStep-1: SOTA Autogressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s Multimodal Intelligence team.

Python 657 23 Updated Feb 27, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,789 2,579 Updated Mar 5, 2026

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 5,058 342 Updated Apr 10, 2026

[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding

Python 519 11 Updated Nov 14, 2025

dLLM: Simple Diffusion Language Modeling

Python 2,365 236 Updated Feb 27, 2026

Reinforcement Learning via Self-Distillation (SDPO)

Python 760 81 Updated Feb 18, 2026

General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes.

Python 3,309 608 Updated Mar 24, 2026

Official code for IFDL-VLM: decoupled image forgery detection, localization, and explanation with vision-language models.

Python 5 Updated Mar 20, 2026

An awesome LaTeX template for NSFC proposal.

TeX 501 201 Updated Mar 16, 2026

[ICLR 2026] Reducing class-wise performance disparity via margin regularization

Python 3 Updated Feb 5, 2026
BibTeX Style 1,527 372 Updated Mar 19, 2026
Python 334 31 Updated Dec 17, 2024

[ICLR 2026] Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?

Python 235 11 Updated Dec 15, 2025

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 1,441 57 Updated Dec 16, 2025

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,418 220 Updated May 19, 2025

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 2,242 156 Updated Dec 8, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,849 75 Updated Feb 25, 2026

Official Repository of Absolute Zero Reasoner

Python 1,842 298 Updated Aug 24, 2025

Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"

Jupyter Notebook 181 4 Updated Feb 24, 2026

LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.

Python 570 32 Updated Jun 29, 2025

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 5,868 692 Updated Mar 23, 2025

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Python 766 58 Updated Feb 1, 2024

Generative Agents: Interactive Simulacra of Human Behavior

21,097 2,955 Updated Aug 5, 2024

12 Lessons to Get Started Building AI Agents

Jupyter Notebook 56,505 19,539 Updated Apr 12, 2026

A list of AI autonomous agents

27,216 2,695 Updated Feb 26, 2025

A curated list of reinforcement learning with human feedback resources (continually updated)

4,343 252 Updated Dec 9, 2025
Next