Skip to content
View yjyddq's full-sized avatar

Highlights

  • Pro

Block or report yjyddq

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"

Python 593 50 Updated Nov 4, 2025

Agents' Last Exam

Python 719 29 Updated Jun 22, 2026

The codebase of Cola DLM

Python 236 13 Updated Jun 11, 2026

Benchmark for proactive personal assistant agents in long-horizon workflows.

Python 48 Updated Jun 4, 2026

Awesome List for On-Policy Distillation

667 12 Updated Jun 19, 2026

Xray、Tuic、hysteria2、sing-box 八合一一键脚本

Shell 21,184 5,490 Updated Jun 7, 2026

ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents

Python 53 1 Updated May 13, 2026

A benchmark for LLMs on complicated tasks in the terminal

Python 2,375 544 Updated Jan 22, 2026

Official Repo of "Flow-OPD: On-Policy Distillation for Flow Matching Models"

Python 244 2 Updated Jun 7, 2026

[ICLR 2026] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 580 24 Updated Jan 4, 2026

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Python 695 44 Updated May 30, 2026

👾 Open Computer Use – Open-Source Alternative to Codex Computer Use

Swift 1,158 116 Updated Jun 18, 2026

分享AI Infra知识&代码练习:PyTorch/vLLM/SGLang框架入门⚡️、性能加速🚀、大模型基础🧠、AI软硬件🔧等

Jupyter Notebook 2,660 239 Updated Jun 22, 2026

OpenTinker is an RL-as-a-Service infrastructure for foundation models

Python 675 63 Updated Mar 21, 2026

An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.

Rust 194,155 109,910 Updated Jun 8, 2026

An in-the-wild benchmark for AI agents in the OpenClaw Environment.

Python 449 44 Updated May 19, 2026

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

JavaScript 219,735 33,674 Updated Jun 22, 2026

AI agents running research on single-GPU nanochat training automatically

Python 88,088 12,756 Updated Mar 26, 2026

Official Repository of "ρ-𝙴𝙾𝚂: Training-free Bidirectional Variable-Length Control for Masked Diffusion LLMs"

Python 7 Updated May 8, 2026

Diagnostic Framework for LLMs and MLLMs

Python 38 Updated Mar 2, 2026

All-in-One Safety Evaluation Framwork

Python 50 Updated Apr 21, 2026

WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups over vLLM-optimized baselines.

Python 644 45 Updated Mar 3, 2026

Official Implementation of "Geometrically-Constrained Agent for Spatial Reasoning"

Python 82 2 Updated Apr 7, 2026
Python 54 7 Updated Jan 23, 2026

Official repository of DARE: Diffusion Large Language Models Alignment and Reinforcement Executor

Python 211 6 Updated Jun 11, 2026

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 2,035 214 Updated Jun 19, 2026

[ACL26] Experimental resources for paper titled "LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions"

Python 10 Updated Jan 22, 2026

[ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.

Python 510 43 Updated Jan 28, 2026

Official implementation of Selective Entropy Regularization (SIREN), proposed by paper 'Rethinking Entropy Regularization in Large Reasoning Models'.

Python 32 Updated Dec 10, 2025
Next