Fangxu Yu (Yu-Fangxu)

Ph.D. Student @ UMD

51 followers · 36 following

University of Maryland
College Park, MD, US
https://yu-fangxu.github.io/

Stars

chrisliu298 / awesome-on-policy-distillation

A curated collection of papers, technical reports, frameworks, and tools for on-policy distillation (OPD) of large language models

392 stars · 8 forks · Updated Jun 23, 2026

thinkwee / AwesomeOPD

Awesome List for On-Policy Distillation

670 stars · 12 forks · Updated Jun 19, 2026

nick7nlp / Awesome-LLM-On-Policy-Distillation

A curated collection of papers and resources on On-Policy Distillation for Large Language Models.

Python · 352 stars · 6 forks · Updated Jun 21, 2026

FreedomIntelligence / Awesome-Rubrics

A curated list of resources (surveys, papers, benchmarks, and open-source projects) on Rubrics

90 stars · 3 forks · Updated Jun 15, 2026

scaleapi / SWE-bench_Pro-os

SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?

Python · 452 stars · 83 forks · Updated May 18, 2026

OpenSenseNova / SenseNova-U1

SenseNova-U series: Native Unified Paradigm with NEO-unify from the First Principles

Python · 3,336 stars · 292 forks · Updated Jun 15, 2026

thunlp / OPD

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Python · 696 stars · 44 forks · Updated May 30, 2026

AgenticScience / Awesome-Agent-Scientists

Paper list of agents for science

269 stars · 23 forks · Updated Mar 12, 2026

huaixuheqing / VPPO-RL

[ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"

Python · 68 stars · 6 forks · Updated Apr 3, 2026

Yu-Fangxu / ArrowGEV

[Findings of ACL 2026] ArrowGEV: Grounding Events in Video via Learning the Arrow of Time

Python · 4 stars · Updated Apr 19, 2026

MiroMindAI / MiroEval

MiroEval: A benchmark and evaluation framework for deep research agents — 100 tasks (70 text, 30 multimodal) assessed across synthesis quality, factuality, and research process. 13 systems evaluated.

Python · 43 stars · 7 forks · Updated Apr 6, 2026

AIDC-AI / Awesome-Unified-Multimodal-Models

Awesome Unified Multimodal Models

1,286 stars · 40 forks · Updated Mar 24, 2026

AIFrontierLab / TorchUMM

A unified multimodal model toolkit

Python · 131 stars · 9 forks · Updated May 18, 2026

XSkill-Agent / XSkill

[ICML 2026] XSkill: Continual Learning from Experience and Skills in Multimodal Agents

Python · 225 stars · 27 forks · Updated May 13, 2026

RUCBM / G-OPD

Official repository for the paper "Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation"

Python · 240 stars · 13 forks · Updated May 28, 2026

AI9Stars / Cheers

Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation

Python · 255 stars · 21 forks · Updated Apr 13, 2026

aiming-lab / SkillRL

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Python · 850 stars · 64 forks · Updated May 17, 2026

SimWorld-AI / SimWorld-Studio

Automatic Environment Generation with Evolving Coding Agent for Embodied Agent Learning

Python · 128 stars · 17 forks · Updated Jun 23, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript · 380,051 stars · 79,583 forks · Updated Jun 23, 2026

ulab-uiuc / GraphRouter

[ICLR 2025] "GraphRouter: A Graph-based Router for LLM Selections", Tao Feng, Yanzhen Shen, Jiaxuan You

Python · 73 stars · 7 forks · Updated Dec 30, 2025

QwenLM / Qwen3.6

Qwen3.6 is the large language model series developed by Qwen team, Alibaba Group.

3,608 stars · 241 forks · Updated Jun 3, 2026

dingdongwang / EmotionThinker

ICLR 2026 (Oral) | EmotionThinker: Prosody-Aware Reinforcement Learning for Explainable Speech Emotion Reasoning

Python · 54 stars · 4 forks · Updated Feb 12, 2026

sappho-x / Flow-of-Spans

3 stars · Updated Jan 30, 2026

openai / simple-evals

Python · 4,534 stars · 491 forks · Updated Apr 22, 2026

TIGER-AI-Lab / OpenResearcher

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Python · 787 stars · 79 forks · Updated Jun 10, 2026

SimWorld-AI / DeliveryBench

DeliveryBench: Can Agents Earn Profit in Real World?

Python · 18 stars · 1 fork · Updated Feb 11, 2026

DoYangTan / verl-rubric

Python · 28 stars · 1 fork · Updated Jan 31, 2026

IANNXANG / RuscaRL

Python · 48 stars · 4 forks · Updated Jan 30, 2026

Osilly / Vision-DeepResearch

[ICML 2026] Multimodal deep-research MLLM and benchmark. The first long-horizon multimodal deep-research MLLM, extending the number of reasoning turns to dozens and the number of search-engine inte…

Python · 648 stars · 56 forks · Updated Jun 8, 2026

lasgroup / SDPO

Reinforcement Learning via Self-Distillation (SDPO)

Python · 967 stars · 109 forks · Updated Feb 18, 2026
