Skip to content
View yyyybq's full-sized avatar

Block or report yyyybq

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A collection of papers on semantic correspondence, organized by year.

18 1 Updated Dec 10, 2025

IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction

Python 306 10 Updated Dec 1, 2025

[Awesome-Spatial-VLMs] This repository is the official, community-maintained resource for the survey paper: Spatial Intelligence in Vision-Language Models: A Comprehensive Survey;

Python 39 2 Updated Dec 18, 2025

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,279 257 Updated Dec 25, 2025

Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"

Python 262 31 Updated Oct 29, 2025

Official repo of paper "SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models". A post-training framework that creates a cost-effective, self-iterative optimization loop.

Python 88 6 Updated Nov 26, 2025

Open-source unified multimodal model

Python 5,505 481 Updated Oct 27, 2025

https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT

Python 110 7 Updated Nov 1, 2025

Official Repository for “CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models" [CVPR2025]

Python 4 Updated Dec 14, 2025

InteriorGS: 3D Gaussian Splatting Dataset of Semantically Labeled Indoor Scenes

182 7 Updated Aug 4, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Jupyter Notebook 2,454 196 Updated Dec 3, 2025

Training VLM agents with multi-turn reinforcement learning

Python 356 42 Updated Dec 1, 2025
Python 114 3 Updated Nov 1, 2025

The first collection of academic iKUN papers in the world

4 1 Updated Jun 21, 2024

Github repository for "Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas" (ICML 2025)

Python 65 3 Updated May 2, 2025

Official code for Paper "Mantis: Multi-Image Instruction Tuning" [TMLR 2024]

Python 237 22 Updated Mar 23, 2025

A paper list for spatial reasoning

553 32 Updated Dec 24, 2025

本人的科研经验

9,304 507 Updated Dec 12, 2025

Kolmogorov Arnold Networks

Jupyter Notebook 16,063 1,541 Updated Jan 19, 2025
Python 8 1 Updated Apr 30, 2024

TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation

Python 67 7 Updated Sep 26, 2024

[ICME 2024] Implementation of the paper “HDBN: A Novel Hybrid Dual-branch Network for Robust Skeleton-based Action Recognition“.

Python 43 7 Updated Dec 25, 2024

(CVPR2024)RMT: Retentive Networks Meet Vision Transformer

Python 372 27 Updated Jul 29, 2024

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,738 267 Updated Feb 13, 2025

Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models

Python 106 8 Updated Mar 5, 2024

Deep reinforcement learning (PPO) apply in FrozenLakev1

Python 11 Updated Jul 25, 2024

😎 基于知识的文本生成相关文章总结与个人笔记

21 Updated Oct 5, 2024

List of papers on hallucination detection in LLMs.

1,010 77 Updated Nov 14, 2025
Next