-
NTU
- Singapore
- https://jiequancui.github.io/
Lists (1)
Sort Name ascending (A-Z)
Stars
Single-stage End-to-End Training for Tokenization and Generation
Code and website for Self-Flow: Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis
[🚀 ICLR 2026 Oral] NextStep-1: SOTA Autogressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s Multimodal Intelligence team.
Wan: Open and Advanced Large-Scale Video Generative Models
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
Reinforcement Learning via Self-Distillation (SDPO)
General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes.
Official code for IFDL-VLM: decoupled image forgery detection, localization, and explanation with vision-language models.
[ICLR 2026] Reducing class-wise performance disparity via margin regularization
[ICLR 2026] Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
Official Repository of Absolute Zero Reasoner
Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"
LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.
openvla / openvla
Forked from TRI-ML/prismatic-vlmsOpenVLA: An open-source vision-language-action model for robotic manipulation.
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
Generative Agents: Interactive Simulacra of Human Behavior
12 Lessons to Get Started Building AI Agents
A curated list of reinforcement learning with human feedback resources (continually updated)