LLaVA-Mini is a unified large multimodal model (LMM) that efficiently supports the understanding of images, high-resolution images, and videos.

Python 561 28 Updated Jun 29, 2025

LLaVA-VL / LLaVA-Plus-Codebase

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Python 763 58 Updated Feb 1, 2024

joonspk-research / generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

20,517 2,834 Updated Aug 5, 2024

microsoft / ai-agents-for-beginners

12 Lessons to Get Started Building AI Agents

Jupyter Notebook 50,153 17,546 Updated Feb 6, 2026

e2b-dev / awesome-ai-agents

A list of AI autonomous agents

25,644 2,188 Updated Feb 26, 2025

opendilab / awesome-RLHF

A curated list of reinforcement learning from human feedback (RLHF) resources (continually updated)

4,289 251 Updated Dec 9, 2025

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 4,099 300 Updated Jan 5, 2026

SunnyHaze / IML-ViT

Official repository of the paper “IML-ViT: Benchmarking Image Manipulation Localization by Vision Transformer”

Jupyter Notebook 304 38 Updated Jun 29, 2025

ML-GSAI / LLaDA

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,551 239 Updated Nov 12, 2025

Gen-Verse / MMaDA

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python 1,570 83 Updated Nov 16, 2025

LeapLabTHU / CODA

CODA: Repurposing Continuous VAEs for Discrete Tokenization

Python 35 1 Updated Jul 4, 2025

Qinyu-Allen-Zhao / Arinar

Python 43 5 Updated May 30, 2025

valeoai / Halton-MaskGIT

[ICLR2025] Halton Scheduler for Masked Generative Image Transformer

Python 280 31 Updated Oct 28, 2025

CompVis / latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 13,833 1,712 Updated Feb 29, 2024

facebookresearch / MetaCLIP

NeurIPS 2025 Spotlight; ICLR 2024 Spotlight; CVPR 2024; EMNLP 2024

Python 1,810 75 Updated Nov 27, 2025

erfanshayegani / Jailbreak-In-Pieces

[ICLR 2024 Spotlight 🔥] - [Best Paper Award SoCal NLP 2023 🏆] - Jailbreak in Pieces: Compositional Adversarial Attacks on Multi-Modal Language Models

Python 79 5 Updated Jun 6, 2024

VILA-Lab / M-Attack

[NeurIPS25 & ICML25 Workshop on Reliable and Responsible Foundation Models] A Simple Baseline Achieving Over 90% Success Rate Against the Strong Black-box Models of GPT-4.5/4o/o1. Paper at: https:/…

Python 86 6 Updated Feb 3, 2026

LTH14 / fractalgen

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 1,220 67 Updated Feb 25, 2025

Kwai-YuanQi / MM-RLHF

The Next Step Forward in Multimodal LLM Alignment

Python 196 9 Updated May 1, 2025

MME-Benchmarks / MME-CoT

MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency

Python 136 6 Updated Aug 5, 2025

Jiequan jiequancui

Lists (1)

✨ Inspiration

Stars

MCG-NKU / NSFC-LaTex

apple / ml-tarflow

End2End-Diffusion / iREPA

hustvl / LightningDiT

google-research / big_vision

LTH14 / JiT

bytetriper / RAE

LeapLabTHU / Absolute-Zero-Reasoner

Jiawei-Yang / DeTok

jiequancui / Generative-Distribution-Distillation

ictnlp / LLaVA-Mini