Skip to content
View JOKERTONIGHT's full-sized avatar
  • Nanjing University

Block or report JOKERTONIGHT

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Python 365 3 Updated May 24, 2026

An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.

Rust 194,032 109,950 Updated Jun 8, 2026

The first high school physics Olympiad benchmark for evaluating (M)LLMs with step-level grading and human-level comparison.

25 1 Updated Dec 19, 2025

Random network distillation on Montezuma's Revenge and Super Mario Bros.

Python 55 10 Updated May 12, 2025

✨✨ [ICLR 2026] R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Python 289 22 Updated May 9, 2025

[NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT

Python 432 26 Updated Sep 18, 2025

[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation

Python 146 13 Updated Sep 11, 2025

[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

Python 406 25 Updated Oct 7, 2024

✨✨[AAAI 2026] This is the official implementation of our paper "QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension"

Python 78 2 Updated Apr 28, 2025

✨✨[NeurIPS 2025] This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"

Python 438 40 Updated Jan 14, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 97,366 14,908 Updated Jun 2, 2026

The Next Step Forward in Multimodal LLM Alignment

Python 200 9 Updated May 1, 2025

MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency

Python 135 7 Updated Aug 5, 2025

✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy

Python 305 29 Updated May 14, 2025

✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models

Python 650 29 Updated Dec 23, 2024

✨✨Latest Advances on Multimodal Large Language Models

17,899 1,128 Updated Jun 18, 2026

A Robust and Versatile Monocular Visual-Inertial State Estimator

C++ 5,935 2,220 Updated Aug 14, 2024

Game Boy emulator written in Python

Python 5,156 534 Updated Jun 5, 2026

Ready-to-use code and tutorial notebooks to boost your way into few-shot learning for image classification.

Python 1,312 172 Updated Nov 13, 2024

[TPAMI 2023] LibFewShot: A Comprehensive Library for Few-shot Learning.

Python 1,069 200 Updated Oct 27, 2025