Skip to content
View zengbohan0217's full-sized avatar

Block or report zengbohan0217

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

Python 1,090 74 Updated Apr 16, 2026

A flexible framework for orchestrating deep learning models with Ray . It dynamically schedules and serves multiple models — from NLP (e.g., FastText) to CV (e.g., YOLO, SAM) — enabling scalable, d…

Python 5 5 Updated Apr 5, 2026

同事.skill、老板.skill、前任.skill、自己.skill、永生.skill、女娲.skill……

1,057 136 Updated Apr 15, 2026

ByteDance's All-in-One Video Generation Model for Human-Object Interaction Video Generation

186 9 Updated Apr 15, 2026
Python 7 1 Updated Apr 13, 2026

Awesome Multimodal Modeling [Covers MLLM, UMM, and NMM]

250 16 Updated Apr 17, 2026
Python 858 67 Updated Apr 13, 2026

Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?

Python 527 42 Updated Apr 3, 2026

Enjoy the magic of Diffusion models!

Python 12,253 1,186 Updated Apr 16, 2026

A Curated List of Awesome Video World Models with AR Diffusion: Covering Algorithms, Applications, and Infrastructure, Aimed at Serving as a Comprehensive Resource for Researchers, Practitioners, a…

TeX 415 15 Updated Apr 18, 2026

仅需Python基础,从0构建自己的具身智能机器人;从0逐步构建VLA/OpenVLA/SmolVLA/Pi0, 深入理解具身智能

Python 1,465 162 Updated Apr 14, 2026

Vero: An Open RL Recipe for General Visual Reasoning

Python 108 8 Updated Apr 13, 2026

Light Image Video Generation Inference Framework

Python 2,190 187 Updated Apr 18, 2026

ZGI is an open-source platform for building AI applications. Its intuitive interface combines workflow design, agent orchestration, dataset management, and model integration—allowing you to quickly…

31 9 Updated Nov 6, 2025

把前任蒸馏成 AI Skill,用ta的方式跟你说话。

Python 4,600 456 Updated Apr 8, 2026

Make Any Website & Tool Your CLI. A universal CLI Hub and AI-native runtime. Transform any website, Electron app, or local binary into a standardized command-line interface. Built for AI Agents to …

JavaScript 16,299 1,580 Updated Apr 17, 2026

Official implementation of "OmniForcing: Unleashing Real-time Joint Audio-Visual Generation"[arXiv:2603.11647]. OmniForcing is the first framework to distill bidirectional audio-visual diffusion mo…

Python 132 1 Updated Mar 29, 2026

Lightweight coding agent that runs in your terminal

Rust 76,056 10,796 Updated Apr 18, 2026

Research on Coding Agents

11,673 19,737 Updated Apr 1, 2026

The repo is finally unlocked. enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.

Rust 185,855 108,662 Updated Apr 17, 2026

[ICLR 2026] LongLive: Real-time Interactive Long Video Generation

Python 1,157 106 Updated Feb 26, 2026

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,306 77 Updated Aug 7, 2025

An agentic skills framework & software development methodology that works.

Shell 158,305 13,776 Updated Apr 16, 2026

A diffusion-based framework for document OCR that replaces autoregressive decoding with block-level parallel diffusion decoding.

Python 558 35 Updated Mar 31, 2026

Try X-Dub to sync any character in a video with any audio you like | Official repository for "From Inpainting to Editing: Unlocking Robust Mask-Free Visual Dubbing via Generative Bootstrapping"

Python 185 2 Updated Mar 19, 2026

Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory

Python 154 5 Updated Feb 9, 2026

Automated system for LLM evaluation via agents.

Python 49 7 Updated Apr 13, 2026

AI Agent Assistant that integrates lots of IM platforms, LLMs, plugins and AI feature, and can be your openclaw alternative. ✨

Python 30,184 2,047 Updated Apr 17, 2026

mjlab-native port of InstinctLab for humanoid RL and Project-Instinct workflows.

Python 135 15 Updated Apr 17, 2026
Next