Skip to content
View yyyouy's full-sized avatar
🏠
Working from home
🏠
Working from home
  • renmin university of china
  • beijing
  • 11:33 (UTC +08:00)

Highlights

  • Pro

Organizations

@ML-GSAI

Block or report yyyouy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Structured deep research skill for Claude Code/Open Code/Codex with human-in-the-loop control

Python 1,267 107 Updated May 7, 2026

UniRL is a Framework for Unified Multimodal Model Reinforcement Learning

Python 691 43 Updated Jun 22, 2026

Local AI filmmaking studio — skills, canvas, timeline — driven from your coding agent.

JavaScript 303 27 Updated Jun 20, 2026

Official PyTorch implementation for "Principled RL for Diffusion LLMs Emerges from a Sequence-Level Perspective"

Python 38 2 Updated Jan 25, 2026

Awesome List for On-Policy Distillation

667 12 Updated Jun 19, 2026

SenseNova-U series: Native Unified Paradigm with NEO-unify from the First Principles

Python 3,330 292 Updated Jun 15, 2026

21 writing rules for AI coding and writing agents. Drop-in for Claude Code, Codex, Copilot, Cursor, and Aider, so their output reads like a tech pro.

Python 523 29 Updated Jun 13, 2026

A Curated List of Awesome Video World Models with AR Diffusion: Covering Algorithms, Applications, and Infrastructure, Aimed at Serving as a Comprehensive Resource for Researchers, Practitioners, a…

TeX 609 17 Updated Jun 4, 2026

One config to rule all your AI agents: portable (every project, every session), effective (curated writing, routing, skills), and safer (destructive-command guard).

Python 183 20 Updated Jun 15, 2026

Agent skill for harness engineering — memory, permissions, context engineering, multi-agent coordination. Distilled from Claude Code, with Codex CLI and Gemini CLI on the roadmap. EN/ZH. Install vi…

285 48 Updated Apr 2, 2026

slime is an LLM post-training framework for RL Scaling.

Python 6,690 965 Updated Jun 23, 2026

[NeurIPS 2025] Beyond Masked and Unmasked: Discrete Diffusion Models via Partial Masking

Python 31 1 Updated Jun 15, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,641 576 Updated Jun 23, 2026

Build your own Claude Code from scratch. 🔍 Claude Code 开源了 50 万行代码,读不动?用 ~4000 行 TypeScript / Python 从零复现核心架构,11 章分步教程带你理解 coding agent 精髓

Python 1,446 397 Updated May 12, 2026

Deep dive into Claude Code internals — architecture, agent loop, context engineering, and more. / 深入解析 Claude Code 源码:架构、Agent 循环、上下文工程、工具系统等

2,699 617 Updated May 5, 2026

OmX - Oh My codeX: Your codex is not alone. Add hooks, agent teams, HUDs, and so much more.

TypeScript 31,245 2,445 Updated Jun 22, 2026

An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.

Rust 194,176 109,905 Updated Jun 8, 2026

Kimi Code CLI is your next CLI agent.

Python 9,050 1,127 Updated Jun 22, 2026

📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程

Python 61,039 7,528 Updated Jun 18, 2026

给 Claude Code 装上完整联网能力的 skill:三层通道调度 + 浏览器 CDP + 并行分治

JavaScript 7,801 562 Updated May 16, 2026

Public repository for Agent Skills

Python 154,008 18,156 Updated Jun 9, 2026

Open source repository of plugins primarily intended for knowledge workers to use in Claude Cowork

Python 21,763 2,543 Updated Jun 23, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,089 4,110 Updated Jun 23, 2026

Official repository of DARE: Diffusion Large Language Models Alignment and Reinforcement Executor

Python 211 6 Updated Jun 11, 2026

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…

Python 12,493 1,133 Updated Jun 21, 2026
Python 50 3 Updated May 16, 2026

A unified framework for easy reinforcement learning in Flow-Matching models

Python 584 47 Updated Jun 18, 2026

Persistent Context Across Sessions for Every Agent – Captures everything your agent does during sessions, compresses it with AI, and injects relevant context back into future sessions. Works with C…

JavaScript 83,785 7,238 Updated Jun 22, 2026
Next