Skip to content
View FingerRec's full-sized avatar
:bowtie:
Focusing
:bowtie:
Focusing

Block or report FingerRec

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2024] Data and benchmark code for the EgoExoLearn dataset

Python 85 2 Updated Aug 26, 2025

Official repository of FlowInOne: Unifying Multimodal Generation as Image-In Image-Out Flow Matching

Python 53 3 Updated Apr 25, 2026

Bash is all you need - A nano claude code–like 「agent harness」, built from 0 to 1

Python 66,626 10,846 Updated Jun 7, 2026
Python 20 2 Updated Jun 10, 2026

The first open-domain closed-loop revisited benchmark for evaluating memory consistency and action control in world models.

Python 66 1 Updated May 25, 2026
Python 12 Updated Dec 9, 2025

Glance: Accelerating Diffusion Models with 1 Sample

Python 155 3 Updated Apr 15, 2026

Vision Bridge Transformer at Scale

Python 146 7 Updated Dec 1, 2025

[CVPR 2026] An official implementation of Adv-GRPO. The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation.

Python 84 1 Updated Feb 26, 2026

[ICLR 2026] The official repository for paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"

Jupyter Notebook 188 16 Updated May 1, 2026

Computer-Use Agents as Judges for Generative UI

Python 45 5 Updated Nov 27, 2025

VCode: SVG as Symbolic Visual Representation

Python 134 6 Updated Feb 21, 2026

Native Multimodal Models are World Learners

Python 1,524 66 Updated Dec 30, 2025

[ACL-main-2026]We introduce Chart2Code, the first user-driven, hierarchical benchmark that systematically evaluates Large Multimodal Models on chart-to-code tasks of increasing difficulty.

Python 28 Updated Jan 27, 2026

PyTorch implementation of "UNIT: Unifying Image and Text Recognition in One Vision Encoder", NeurlPS 2024.

Python 34 2 Updated Sep 26, 2024

Contexts Optical Compression

Python 23,288 2,151 Updated Jan 27, 2026

[ICML 2026] Video generation via code

Python 1,795 252 Updated May 31, 2026

Automatic Video Generation from Scientific Papers

Python 2,313 328 Updated Mar 5, 2026

Code for Data Collection & Training in Sim+Real Envs: [RSS 2024] Natural Language Can Help Bridge the Sim2Real Gap

Python 11 Updated Oct 25, 2025

Official Repository for MolmoAct

Python 369 41 Updated May 11, 2026

Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).

Python 425 11 Updated Aug 26, 2025

📖 This is a repository for organizing papers, codes and other resources related to Visual Reinforcement Learning.

449 22 Updated Apr 28, 2026

[ICCV 2025] Balanced Image Stylization with Style Matching Score

Python 70 2 Updated Mar 9, 2026

Seeing is Not Reasoning: MVPBench for Graph-based Evaluation of Multi-path Visual Physical CoT

Python 15 Updated Jul 30, 2025

[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Mu…

Jupyter Notebook 622 31 Updated Sep 5, 2025

[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding

Python 527 12 Updated Nov 14, 2025

[ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"

Python 204 2 Updated Jan 7, 2026

Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling

Python 1,084 51 Updated Nov 3, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,747 2,231 Updated Feb 1, 2025

Unified layout planning and image generation, ICCV2025

Python 45 3 Updated Jan 19, 2026
Next