Skip to content
View Yuqin-G's full-sized avatar

Block or report Yuqin-G

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A self-hosted ML coding practice platform. 68 problems from ReLU to flow matching — attention, training, RLHF, diffusion, and more. Instant feedback in the browser.

Python 97 5 Updated Apr 9, 2026

Official implementation of Seeing with You: Perception-Reasoning Co-evolution for Multimodal Reasoning.

Python 33 1 Updated Apr 6, 2026

Awesome-Parallel-Reasoning: Unlocking the reasoning potential of LLMs. Papers, Code, Resources & Survey.

HTML 51 4 Updated Mar 8, 2026

(CVPR 26) Explore with Long-term Memory: A Benchmark and Multimodal LLM-based Reinforcement Learning Framework for Embodied Exploration

Python 16 Updated Mar 8, 2026

A project implementing various agentic RL based on the Slime post-training framework

Python 251 6 Updated Apr 3, 2026

Synchronize Codex session provider metadata across rollout files and SQLite state.

C# 168 14 Updated Apr 3, 2026

ALMA (Automated meta-Learning of Memory designs for Agentic systems) is a framework that meta-learns memory designs to replace human-engineered designs for agentic system.

Python 196 22 Updated Apr 8, 2026

Official Repository of "Learning to Reason under Off-Policy Guidance"

Python 436 57 Updated Mar 20, 2026

A Claude Code hook plugin for IP-based access control · 防 Claude 封号 · Claude IP 检测 · IP 地理位置拦截 · Claude 账号保护

Shell 58 6 Updated Apr 1, 2026

MLLM hallucination, LVLM, LLM, Hallucination Mitigation, Training-free hallucination mitigation

8 1 Updated Apr 7, 2026
Python 76 4 Updated Apr 9, 2026

Self-hosted AI assistant with tool use, multi-agent orchestration, coding copilot and a lightweight Flask + vanilla JS stack.

Python 104 15 Updated Apr 9, 2026

Official repo for ”Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought“

Python 25 Updated Mar 29, 2026

Create, Evaluate, and Connect AI Skills

Python 658 64 Updated Apr 8, 2026

AutoSkill: Experience-Driven Lifelong Learning via Skill Self-Evolution

Python 295 32 Updated Apr 9, 2026

MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents

Python 402 24 Updated Mar 31, 2026
Python 76 5 Updated Mar 13, 2026

Code for UNO.

Python 3 Updated Jan 22, 2026

Official Implementation of Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training

Python 168 5 Updated Mar 13, 2026

Code release for paper "Test-Time Training Done Right"

Python 430 24 Updated Jan 5, 2026

[CVPR 2026] Official codes of "Monet: Reasoning in Latent Visual Space Beyond Image and Language"

Python 161 2 Updated Mar 19, 2026

Official repository for paper Auto-scaling Continuous Memory for GUI Agent

Python 25 3 Updated Feb 2, 2026

Official code for PEARL: Personalized Streaming Video Understanding Model

Python 49 3 Updated Mar 24, 2026

Official Implementation of "Geometrically-Constrained Agent for Spatial Reasoning"

Python 74 2 Updated Apr 7, 2026

[CVPR 2025] RAP: Retrieval-Augmented Personalization

Python 83 4 Updated Nov 23, 2025

"Parallel Test-Time Scaling for Latent Reasoning Models"

Python 17 1 Updated Apr 7, 2026

Test-Time Mixture of World Models for Embodied Agents in Dynamic Environments [ICLR 2026]

Python 6 Updated Jan 30, 2026

Official code for the paper “Look Where It Matters: Training-Free Ultra-HR Remote Sensing VQA via Adaptive Zoom Search”.

Python 28 1 Updated Dec 8, 2025
Next