Skip to content
View Yuqin-G's full-sized avatar

Block or report Yuqin-G

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AgenticPay: A Multi-Agent LLM Negotiation System for Buyer–Seller Transactions

Python 25 9 Updated Apr 28, 2026

Code for "Variational Reasoning for Language Models"

Python 60 1 Updated Sep 29, 2025

[CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"

Python 37 2 Updated Nov 11, 2025

Official Code for paper "Active Video Perception: Iterative Evidence Seeking for Agentic Long Video Understanding""

Python 10 2 Updated Feb 6, 2026

[ACL '26] Lang2Act: Fine-Grained Visual Reasoning through Self-Emergent Linguistic Toolchains

Python 19 Updated Apr 7, 2026

A curated collection of papers, technical reports, frameworks, and tools for on-policy distillation of large language models

30 2 Updated Apr 29, 2026

A user-friendly & efficient knowledge distillation framework for LLMs, supporting off-policy, on-policy (OPD), cross-tokenizer, multimodal, and on-policy self-distillation.

Python 119 10 Updated Apr 30, 2026

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Python 184 5 Updated Apr 29, 2026

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,432 43 Updated Mar 9, 2026

Research of DeepSeek Engram Architecture based on Qwen-3 and Stable Diffusion series.

Python 63 5 Updated Apr 7, 2026

EngramX — the cached context spine for AI coding agents. 9 built-in providers + any MCP server as a 10-line plugin, pre-mortem mistake-guard, bi-temporal memory, Anthropic Auto-Memory bridge, SSE s…

TypeScript 108 10 Updated Apr 24, 2026

[CVPR 2026] LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

Python 225 13 Updated Apr 10, 2026

⚒ Evolutionary self-improvement for Hermes Agent — optimize skills, prompts, and code using DSPy + GEPA

Python 2,589 273 Updated Mar 29, 2026

Semi-automated research assistant for academic research and software development. Supports Claude Code, OpenCode, and Codex CLI across ideation, coding, experiments, writing, and publication.

Python 3,511 324 Updated Apr 29, 2026
Python 53 3 Updated Feb 12, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,533 760 Updated Apr 30, 2026

Agentic Learning Powered by AWorld

Python 103 10 Updated Apr 16, 2026

[ICLR 2026]🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal reinforcement learning, and text-only reinforcement l…

Python 211 3 Updated Dec 10, 2025
Python 112 8 Updated Apr 19, 2026

KEEP:Official code for "KEEP: A KV-Cache-Centric Memory Management System for Efficient Embodied Planning". A memory management system optimized for embodied planning via static-dynamic memory cons…

Jupyter Notebook 7 Updated Mar 11, 2026
Python 143 13 Updated Apr 9, 2026

A research agent system deeply rooted in your own Zotero library.

TypeScript 998 54 Updated Apr 30, 2026

[NeurIPS 2025] Ask a Strong LLM Judge when Your Reward Model is Uncertain

Python 9 Updated Oct 23, 2025

A professional research suite for conducting rigorous academic research using specialized agents and multi-platform CLI commands. Compatible with Claude Code, Gemini CLI, OpenAI Codex, and OpenCode.

HTML 83 10 Updated Apr 16, 2026

A ready-to-fork Claude Code template for academics using LaTeX/Beamer + R. Multi-agent review, quality gates, adversarial QA, and replication protocols.

HTML 1,014 2,067 Updated Apr 27, 2026

tmux sidebar for coding agents — Amp, Claude Code, Codex, OpenCode. Per-thread markers, local HTTP API, live session state.

TypeScript 998 53 Updated Apr 25, 2026

A Claude Code plugin that shows what's happening - context usage, active tools, running agents, and todo progress

JavaScript 21,242 938 Updated Apr 29, 2026

Tempo: Small Vision-Language Models are Smart Compressors for Long Video Understanding

Python 63 2 Updated Apr 29, 2026

AI Scientist by Chicago Human+AI Lab

Python 125 22 Updated Apr 27, 2026
Next