Skip to content
View jiequancui's full-sized avatar

Block or report jiequancui

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
378 results for source starred repositories
Clear filter
BibTeX Style 1,444 355 Updated Jan 22, 2026
Python 320 28 Updated Dec 17, 2024

Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?

Python 216 9 Updated Dec 15, 2025

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 1,388 51 Updated Dec 16, 2025

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,347 208 Updated May 19, 2025

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 2,094 138 Updated Dec 8, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,753 65 Updated Jan 20, 2026

Official Repository of Absolute Zero Reasoner

Python 1,809 293 Updated Aug 24, 2025

Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"

Jupyter Notebook 172 4 Updated Dec 17, 2025

LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.

Python 561 28 Updated Jun 29, 2025

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Python 763 58 Updated Feb 1, 2024

Generative Agents: Interactive Simulacra of Human Behavior

20,517 2,834 Updated Aug 5, 2024

12 Lessons to Get Started Building AI Agents

Jupyter Notebook 50,153 17,546 Updated Feb 6, 2026

A list of AI autonomous agents

25,644 2,188 Updated Feb 26, 2025

A curated list of reinforcement learning with human feedback resources (continually updated)

4,289 251 Updated Dec 9, 2025

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 4,099 300 Updated Jan 5, 2026

Official repository of paper “IML-ViT: Benchmarking Image manipulation localization by Vision Transformer”

Jupyter Notebook 304 38 Updated Jun 29, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,551 239 Updated Nov 12, 2025

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python 1,570 83 Updated Nov 16, 2025

CODA: Repurposing Continuous VAEs for Discrete Tokenization

Python 35 1 Updated Jul 4, 2025
Python 43 5 Updated May 30, 2025

[ICLR2025] Halton Scheduler for Masked Generative Image Transformer

Python 280 31 Updated Oct 28, 2025

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 13,833 1,712 Updated Feb 29, 2024

NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024

Python 1,810 75 Updated Nov 27, 2025

[ICLR 2024 Spotlight 🔥 ] - [ Best Paper Award SoCal NLP 2023 🏆] - Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language Models

Python 79 5 Updated Jun 6, 2024

[NeurIPS25 & ICML25 Workshop on Reliable and Responsible Foundation Models] A Simple Baseline Achieving Over 90% Success Rate Against the Strong Black-box Models of GPT-4.5/4o/o1. Paper at: https:/…

Python 86 6 Updated Feb 3, 2026

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 1,220 67 Updated Feb 25, 2025

The Next Step Forward in Multimodal LLM Alignment

Python 196 9 Updated May 1, 2025

MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency

Python 136 6 Updated Aug 5, 2025
Next