Jianhan Jin JOKERTONIGHT

Nanjing University

Lists (1)

MLLMs learning

4 repositories

Stars

MME-Benchmarks / Video-MME-v2

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Python 365 3 Updated May 24, 2026

ultraworkers / claw-code

An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.

Rust 194,032 109,950 Updated Jun 8, 2026

SciYu / HiPhO

The first high school physics Olympiad benchmark for evaluating (M)LLMs with step-level grading and human-level comparison.

25 1 Updated Dec 19, 2025

akazemipour / PPO-RND

Random network distillation on Montezuma's Revenge and Super Mario Bros.

Python 55 10 Updated May 12, 2025

yfzhang114 / r1_reward

✨✨ [ICLR 2026] R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Python 289 22 Updated May 9, 2025

CaraJ7 / T2I-R1

[NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT

Python 432 26 Updated Sep 18, 2025

zjunlp / Deco

[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation

Python 146 13 Updated Sep 11, 2025

DAMO-NLP-SG / VCD

[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

Python 406 25 Updated Oct 7, 2024

MAC-AutoML / QuoTA

✨✨[AAAI 2026] This is the official implementation of our paper "QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension"

Python 78 2 Updated Apr 28, 2025

Leon1207 / Video-RAG-master

✨✨[NeurIPS 2025] This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"

Python 438 40 Updated Jan 14, 2026

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 97,366 14,908 Updated Jun 2, 2026

Kwai-YuanQi / MM-RLHF

The Next Step Forward in Multimodal LLM Alignment

Python 200 9 Updated May 1, 2025

MME-Benchmarks / MME-CoT

MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency

Python 135 7 Updated Aug 5, 2025

VITA-MLLM / Long-VITA

✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy

Python 305 29 Updated May 14, 2025

VITA-MLLM / Woodpecker

✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models

Python 650 29 Updated Dec 23, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

17,899 1,128 Updated Jun 18, 2026

HKUST-Aerial-Robotics / VINS-Mono

A Robust and Versatile Monocular Visual-Inertial State Estimator

C++ 5,935 2,220 Updated Aug 14, 2024

Baekalfen / PyBoy

Game Boy emulator written in Python

Python 5,156 534 Updated Jun 5, 2026

sicara / easy-few-shot-learning

Ready-to-use code and tutorial notebooks to boost your way into few-shot learning for image classification.

Python 1,312 172 Updated Nov 13, 2024

RL-VIG / LibFewShot

[TPAMI 2023] LibFewShot: A Comprehensive Library for Few-shot Learning.

Python 1,069 200 Updated Oct 27, 2025
