Skip to content
View Purshow's full-sized avatar

Organizations

@PKU-YuanGroup

Block or report Purshow

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

Python 985 65 Updated Apr 16, 2026

Welcome to GR00T Whole-Body Control (WBC)! This is a unified platform for developing and deploying advanced humanoid controllers. This includes: Decoupled WBC models used in NVIDIA Isaac-Gr00t, Gr0…

Python 1,499 186 Updated Apr 14, 2026
Python 4 Updated Apr 17, 2026

NucleusImage training recipe

44 1 Updated Apr 9, 2026

Terrarium: Multi-turn data engine for evaluating and optimizing LLM agents in living environments.

Python 22 Updated Apr 17, 2026

Official code of Motus: A Unified Latent Action World Model

Python 968 46 Updated Jan 5, 2026

A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.

54,232 4,578 Updated Apr 15, 2026

🦞 ClawMark: A Living-World Benchmark for Multi-Day, Multimodal Coworker Agents

Python 69 5 Updated Apr 15, 2026

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 782 49 Updated Apr 8, 2026
Python 470 39 Updated Apr 17, 2026

HY-Embodied: Embodied Foundation Models for Real-World Agents

Python 561 11 Updated Apr 14, 2026

Elevate your AI research writing, no more tedious polishing ✨

18,098 1,456 Updated Mar 25, 2026

科研写作助手 (Research Writing Assistant)

Python 702 63 Updated Mar 29, 2026

Gen-Searcher: Reinforcing Agentic Search for Image Generation

Python 283 27 Updated Apr 7, 2026

(ICCV 2025) "Principal Components" Enable A New Language of Images

Jupyter Notebook 82 6 Updated Jul 28, 2025

[CVPR 2026 Highlight] A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens

Python 80 2 Updated Apr 14, 2026

Vero: An Open RL Recipe for General Visual Reasoning

Python 108 8 Updated Apr 13, 2026

📚 A curated collection of papers and open-source code repositories dedicated to the application of Vision-Language Models (VLMs) for streaming video.

126 3 Updated Apr 13, 2026

A simple video streaming baseline that outperforms SOTAs.

Python 98 5 Updated Apr 16, 2026

Your behavior is the signal. Not your words. — Behavioral intelligence for AI agents, built into your MacBook notch.

8 Updated Apr 7, 2026

FileGram: Grounding Agent Personalization in File-System Behavioral Traces

Python 23 2 Updated Apr 12, 2026

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Python 380 1 Updated Apr 14, 2026

JoyAI-Image is the unified multimodal foundation model for image understanding, text-to-image generation, and instruction-guided image editing.

Python 1,894 136 Updated Apr 15, 2026

paper collection: alignment of diffusion models

29 Updated Mar 6, 2026

A benchmark for evaluating contextual agents on realistic multimodal personal-computer environments with profiling and factual-retention tasks.

Python 25 1 Updated Apr 2, 2026

🐧 Unify-Agent: An end-to-end unified multimodal agent for faithful, knowledge-grounded image generation.

Python 57 2 Updated Apr 11, 2026

Compile agent conversations!

Python 286 18 Updated Apr 3, 2026

将冰冷的离别化为温暖的 Skill,欢迎加入数字生命1.0!Transforming cold farewells into warm skills? It's giving rebirth era. Welcome to Digital Life 1.0. 🫶

Python 14,796 1,446 Updated Apr 17, 2026

Codebase for InfoTok: Adaptive Discrete Video Tokenizer via Information-Theoretic Compression

Jupyter Notebook 39 1 Updated Mar 18, 2026
Next