
This repository is a curated collection of the most exciting and influential CVPR 2026 papers. 🔥 [Paper + Code + Demo]

Python 490 27 Updated Jun 6, 2026
JavaScript 1 Updated Jun 2, 2026

🔥🔥🔥 [Awesome] Latest Papers, Codes & Datasets on Streaming / Online Video Understanding — Building Always-on, Real-time Video AI 🤖

294 27 Updated Jun 2, 2026

A simple video streaming baseline that outperforms SOTAs.

Python 143 8 Updated May 1, 2026

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Python 1,021 62 Updated Oct 15, 2025

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Python 365 3 Updated May 24, 2026

[CVPR 2026] LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding

Python 49 2 Updated Feb 28, 2026
JavaScript 5 3 Updated May 4, 2026

vHeat: Building Vision Models upon Heat Conduction

Python 281 11 Updated Jun 12, 2025

[ICML 2026 Spotlight] Code for miXed Discrete Diffusion Language Model

Python 25 Updated Mar 16, 2026

LLaDA Continue Pretraining with XDLM

Python 6 Updated Feb 11, 2026

[ICLR 2026] VideoAnchor: Reinforcing Subspace-Structured Visual Cues for Coherent Visual-Spatial Reasoning

Python 5 Updated Feb 28, 2026

Official repo of From Indoor to Open World: Revealing the Spatial Reasoning Gap in MLLMs

Python 24 1 Updated Feb 27, 2026

[ICLR 2026] An official implementation of "CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning"

Python 217 8 Updated Jun 10, 2026

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 2,020 197 Updated Jun 9, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 72,257 8,843 Updated Jun 17, 2026

A comprehensive and up-to-date compilation of datasets, tools, methods, review papers, and competitions for remote sensing change detection.

2,250 403 Updated Apr 16, 2026

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 19,409 1,791 Updated Jan 30, 2026

Reinforcement Learning of Vision Language Models with Self Visual Perception Reward

Python 174 17 Updated Mar 14, 2026

REverse-Engineered Reasoning for Open-Ended Generation

Python 97 7 Updated Sep 10, 2025

personal homepage of tangxi

HTML 1 Updated Apr 18, 2025

Fast and memory-efficient exact attention

Python 24,171 2,838 Updated Jun 17, 2026

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 5,018 373 Updated Apr 6, 2026

Official Repo for Open-Reasoner-Zero

Python 2,097 120 Updated Jun 2, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,986 380 Updated Mar 12, 2026

This repository provides a valuable reference for researchers in the field of multimodality. Start your exploration of RL-based reasoning MLLMs here!

1,422 62 Updated May 11, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,022 4,091 Updated Jun 17, 2026

The official implementation of the **ReDDiT: Rehashing Noise for Discrete Visual Generation** paper.

Python 12 Updated Sep 27, 2025

[ICLR 2026] Geometric-Mean Policy Optimization

Python 105 11 Updated Jan 26, 2026

Implementation of paper "CC-Diff: Enhancing Contextual Coherence in Remote Sensing Image Synthesis"

Python 28 1 Updated Dec 19, 2025