Skip to content
View dongyh20's full-sized avatar

Block or report dongyh20

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

From Vision-Language-Action Models to a Real-World Robot Learning Stack

Python 120 7 Updated Jun 17, 2026

UniRL is a Framework for Unified Multimodal Model Reinforcement Learning

Python 646 35 Updated Jun 18, 2026

Agents' Last Exam

Python 696 27 Updated Jun 18, 2026

Kimi Code CLI — The Starting Point for Next-Gen Agents

TypeScript 2,540 299 Updated Jun 18, 2026

A collection of skills for AI financial analysis.

JavaScript 2,840 317 Updated Jun 14, 2026

My learning notes for ML SYS.

Python 6,539 445 Updated Jun 18, 2026

[ECCV 2026] Official code of GEM: Generative Supervision Helps Embodied Intelligence

Python 82 1 Updated May 30, 2026

PhysX-Omni: Unified Simulation-Ready Physical 3D Generation for Rigid, Deformable, and Articulated Objects

Jupyter Notebook 255 12 Updated Jun 11, 2026

Skill package for ML/CV/NLP paper writing, curated and adapted from Prof. Peng Sida's open notes for Codex, Claude Code, and Gemini.

4,009 204 Updated Apr 23, 2026

Can Language Models Rebuild Programs From Scratch?

Python 768 51 Updated Jun 18, 2026

Beyond SFT-to-RL: Pre-alignment via Black-BoxOn-Policy Distillation for Multimodal RL

Python 90 2 Updated May 6, 2026

A benchmark for evaluating LLMs on Chinese traditional fortune telling — Bazi (八字) and Ziwei Doushu (紫微斗数).

Python 1,840 318 Updated May 9, 2026

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

2,125 96 Updated Jun 13, 2026

Extracted system prompts from Anthropic - Claude Fable 5, Opus 4.8, Claude Code, Claude Design. OpenAI - ChatGPT 5.5 Thinking, GPT 5.5 Instant, Codex. Google - Gemini 3.5 Flash, 3.1 Pro, Antigravit…

JavaScript 43,294 7,178 Updated Jun 18, 2026

SenseNova-U series: Native Unified Paradigm with NEO-unify from the First Principles

Python 3,249 284 Updated Jun 15, 2026

Reference code for the Meta-Harness paper.

Python 1,118 106 Updated Apr 29, 2026

Terrarium: Multi-turn data engine for evaluating and optimizing LLM agents in living environments.

Python 45 3 Updated Jun 17, 2026

🦞 ClawMark: A Living-World Benchmark for Multi-Day, Multimodal Coworker Agents

Python 110 9 Updated May 28, 2026

A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.

178,125 18,188 Updated Apr 20, 2026

The agent that grows with you

Python 196,690 34,674 Updated Jun 18, 2026

HY-Embodied: Embodied Foundation Models for Real-World Agents

Python 749 14 Updated Jun 18, 2026

The best-benchmarked open-source AI memory system. And it's free.

Python 55,912 7,245 Updated Jun 15, 2026

Kimi Code CLI is your next CLI agent.

Python 9,018 1,120 Updated Jun 10, 2026

FileGram: Grounding Agent Personalization in File-System Behavioral Traces

Python 65 6 Updated Apr 12, 2026

Your behavior is the signal. Not your words. — Behavioral intelligence for AI agents, built into your MacBook notch.

8 Updated Apr 7, 2026

Production-grade engineering skills for AI coding agents.

Shell 62,694 6,815 Updated Jun 16, 2026

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Python 365 3 Updated May 24, 2026

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,584 265 Updated Jun 18, 2026

A benchmark for evaluating contextual agents on realistic multimodal personal-computer environments with profiling and factual-retention tasks.

Python 28 1 Updated Apr 2, 2026

SkillsBench evaluates how well skills work and how effective agents are at using them.

PDDL 1,372 317 Updated Jun 18, 2026
Next