Skip to content
View YihanHu-2022's full-sized avatar

Highlights

  • Pro

Block or report YihanHu-2022

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

GPT-Image-2 PPT Generator Skill for Creating Image-Based PowerPoint Presentations in Codex and Other Skill-Compatible Agents

Python 1,624 87 Updated Jun 12, 2026

StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing

Python 2,815 350 Updated Jun 12, 2026

Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.

Python 7,252 1,172 Updated May 28, 2026

[RSS 2026] Causal video-action world model for generalist robot control

Python 1,328 112 Updated Apr 29, 2026

Official repo for "Let ViT Speak: Generative Language-Image Pre-training"

Python 126 4 Updated Jun 10, 2026

Unified Codebase for Advanced World Models.

Python 814 43 Updated Jun 11, 2026

An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.

Rust 193,695 109,960 Updated Jun 8, 2026

Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals

Python 2,252 194 Updated Apr 19, 2026

Robometer: Scaling General-Purpose Robotic Reward Models via Trajectory Comparisons

Python 201 44 Updated May 18, 2026

Official code of Motus: A Unified Latent Action World Model

Python 1,138 64 Updated Jan 5, 2026

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 3,768 525 Updated Jun 12, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21,936 4,072 Updated Jun 12, 2026

[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 1,726 112 Updated Jan 6, 2026

[ICML 2026] Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation" & Causal Forcing++

Python 779 44 Updated Jun 5, 2026

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

JavaScript 214,304 32,931 Updated Jun 11, 2026

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 4,448 340 Updated Jan 14, 2026

Light Image Video Generation Inference Framework

Python 2,377 213 Updated Jun 12, 2026

The agent engineering platform.

Python 139,143 23,068 Updated Jun 12, 2026

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

Python 4,832 542 Updated Jun 10, 2026

ThinkGen: Generalized Thinking for Visual Generation

Python 58 Updated Dec 30, 2025

Your image is almost there!

Python 7,618 437 Updated Jul 26, 2024

[CVPR2026] Efficient Long Video Generation via Next-Frame-Rate Prediction

Python 7 1 Updated Dec 19, 2025

Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation

Python 131 7 Updated Apr 28, 2026

Official JAX implementation of End-to-End Test-Time Training for Long Context

Python 620 47 Updated Feb 15, 2026

Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch

Python 1,384 140 Updated May 3, 2024

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 3,394 274 Updated Sep 12, 2025

[DEIMv2] Real Time Object Detection Meets DINOv3

Jupyter Notebook 1,850 194 Updated Mar 24, 2026

You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.

Python 575 25 Updated Jan 17, 2026

This project is the official implementation of 'DreamOmni2: Multimodal Instruction-based Editing and Generation (CVPR2026 Highlight)''

Python 2,026 175 Updated Apr 11, 2026
Next