BradyFU

👋

Chaoyou Fu BradyFU

👋

南京大学-研究员|助理教授|博导-中国科协青年人才托举工程|中科院院长特别奖 Lead MME & VITA & Awesome-MLLM

709 followers · 4 following

Lead NJU-MiG (Multimodal intelligence Group, 南京大学米格小组)
https://bradyfu.github.io/

Achievements

Organizations

Stars

PaperDecision / PaperDecision

Python 171 1 Updated Jan 19, 2026

guanweifan / awesome-efficient-vla

🔥 A curated roadmap to the Efficient VLA landscape. We’re keeping this list live—contribute your latest work!

74 4 Updated Jan 29, 2026

NVlabs / QeRL

QeRL enables RL for 32B LLMs on a single H100 GPU.

Python 482 48 Updated Nov 27, 2025

Tencent / VITA

The official implement of VITA, VITA15, LongVITA, VITA-Audio, VITA-VLA, and VITA-E.

Python 147 2 Updated Oct 28, 2025

NVlabs / LongLive

[ICLR 2026] LongLive: Real-time Interactive Long Video Generation

Python 1,037 86 Updated Jan 27, 2026

yfzhang114 / Thyme

✨✨ [ICLR 2026] Think Beyond Images

Python 576 35 Updated Sep 23, 2025

dreamtheater123 / Awesome-SpeechLM-Survey

Github repository for ACL 2025 paper: Recent Advances in Speech Language Models: A Survey.

175 6 Updated Jun 17, 2025

shilinyan99 / CrossLMM

CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms

25 Updated Dec 21, 2025

yfzhang114 / r1_reward

✨✨ [ICLR 2026] R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Python 281 22 Updated May 9, 2025

VITA-MLLM / VITA-Audio

✨✨[NeurIPS 2025] VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model

Python 673 60 Updated May 24, 2025

MME-Benchmarks / MME-Unify

✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models

Python 43 4 Updated Apr 10, 2025

LengSicong / MMR1

MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

Python 214 9 Updated Sep 26, 2025

VITA-MLLM / Sparrow

Sparrow: Data-Efficient Video-LLM with Text-to-Image Augmentation

Jupyter Notebook 31 1 Updated Mar 28, 2025

Leon1207 / Video-RAG-master

✨✨[NeurIPS 2025] This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"

Python 396 39 Updated Jan 14, 2026

MAC-AutoML / QuoTA

✨✨[AAAI 2026] This is the official implementation of our paper "QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension"

Python 77 2 Updated Apr 28, 2025

Phantom-video / Phantom

Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment

Python 1,478 94 Updated Sep 11, 2025

Kwai-YuanQi / MM-RLHF

The Next Step Forward in Multimodal LLM Alignment

Python 196 9 Updated May 1, 2025

MME-Benchmarks / MME-CoT

MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency

Python 136 6 Updated Aug 5, 2025

VITA-MLLM / LUCY

LUCY: Linguistic Understanding and Control Yielding Early Stage of Her

Python 58 3 Updated Apr 14, 2025

VITA-MLLM / Long-VITA

✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy

Python 305 29 Updated May 14, 2025

xjtupanda / Sparrow

Repo for paper "T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs"

48 Updated Sep 3, 2025

HiThink-Research / MME-Finance

[MM 2025] A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning

Python 44 4 Updated Jan 8, 2026

VITA-MLLM / Freeze-Omni

✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

Python 363 24 Updated May 27, 2025

NVlabs / Eagle

Eagle: Frontier Vision-Language Models with Data-Centric Strategies

Python 926 48 Updated Oct 25, 2025

MME-Benchmarks / MME-RealWorld

✨✨ [ICLR 2025] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Python 151 11 Updated Oct 21, 2025

VITA-MLLM / VITA

✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,486 182 Updated Mar 28, 2025

jinzhuoran / RWKU

RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024

Python 90 9 Updated Sep 30, 2024

yfzhang114 / SliME

✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models

Python 164 7 Updated Dec 26, 2024

seanzhuh / Awesome-Open-Vocabulary-Detection-and-Segmentation

Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future

215 8 Updated Apr 3, 2025

YifanXu74 / Libra

Simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted by ICML 2024)

Python 161 Updated Nov 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chaoyou Fu BradyFU

Achievements

Achievements

Organizations

Block or report BradyFU

Stars

PaperDecision / PaperDecision

guanweifan / awesome-efficient-vla

NVlabs / QeRL

Tencent / VITA

NVlabs / LongLive

yfzhang114 / Thyme

dreamtheater123 / Awesome-SpeechLM-Survey

shilinyan99 / CrossLMM

yfzhang114 / r1_reward

VITA-MLLM / VITA-Audio

MME-Benchmarks / MME-Unify

LengSicong / MMR1

VITA-MLLM / Sparrow

Leon1207 / Video-RAG-master

MAC-AutoML / QuoTA

Phantom-video / Phantom

Kwai-YuanQi / MM-RLHF

MME-Benchmarks / MME-CoT

VITA-MLLM / LUCY

VITA-MLLM / Long-VITA

xjtupanda / Sparrow

HiThink-Research / MME-Finance

VITA-MLLM / Freeze-Omni

NVlabs / Eagle

MME-Benchmarks / MME-RealWorld

VITA-MLLM / VITA

jinzhuoran / RWKU

yfzhang114 / SliME

seanzhuh / Awesome-Open-Vocabulary-Detection-and-Segmentation

YifanXu74 / Libra