-
Fudan University
- Fudan University, Shanghai
- https://yjyddq.github.io/
- https://scholar.google.com/citations?user=3NSeUiwAAAAJ&hl=zh-CN
Highlights
- Pro
Stars
Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"
Benchmark for proactive personal assistant agents in long-horizon workflows.
ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents
A benchmark for LLMs on complicated tasks in the terminal
Official Repo of "Flow-OPD: On-Policy Distillation for Flow Matching Models"
[ICLR 2026] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe
👾 Open Computer Use – Open-Source Alternative to Codex Computer Use
分享AI Infra知识&代码练习:PyTorch/vLLM/SGLang框架入门⚡️、性能加速🚀、大模型基础🧠、AI软硬件🔧等
OpenTinker is an RL-as-a-Service infrastructure for foundation models
An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.
An in-the-wild benchmark for AI agents in the OpenClaw Environment.
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
AI agents running research on single-GPU nanochat training automatically
Official Repository of "ρ-𝙴𝙾𝚂: Training-free Bidirectional Variable-Length Control for Masked Diffusion LLMs"
WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups over vLLM-optimized baselines.
Official Implementation of "Geometrically-Constrained Agent for Spatial Reasoning"
Official repository of DARE: Diffusion Large Language Models Alignment and Reinforcement Executor
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo
[ACL26] Experimental resources for paper titled "LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions"
[ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.
Official implementation of Selective Entropy Regularization (SIREN), proposed by paper 'Rethinking Entropy Regularization in Large Reasoning Models'.