Skip to content
View zengbohan0217's full-sized avatar

Block or report zengbohan0217

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?

Python 445 32 Updated Apr 3, 2026

Enjoy the magic of Diffusion models!

Python 12,207 1,188 Updated Apr 8, 2026

A Curated List of Awesome Video World Models with AR Diffusion: Covering Algorithms, Applications, and Infrastructure, Aimed at Serving as a Comprehensive Resource for Researchers, Practitioners, a…

TeX 371 13 Updated Apr 8, 2026

仅需Python基础,从0构建自己的具身智能机器人;从0逐步构建VLA/OpenVLA/SmolVLA/Pi0, 深入理解具身智能

Jupyter Notebook 1,291 143 Updated Apr 8, 2026

Vero: An Open RL Recipe for General Visual Reasoning

Python 73 4 Updated Apr 9, 2026

Light Image Video Generation Inference Framework

Python 2,150 183 Updated Apr 9, 2026

ZGI is an open-source platform for building AI applications. Its intuitive interface combines workflow design, agent orchestration, dataset management, and model integration—allowing you to quickly…

31 9 Updated Nov 6, 2025

把前任蒸馏成 AI Skill,用ta的方式跟你说话。Inspired by colleague-skill(同事skill).

Python 4,008 406 Updated Apr 8, 2026

Make Any Website & Tool Your CLI. A universal CLI Hub and AI-native runtime. Transform any website, Electron app, or local binary into a standardized command-line interface. Built for AI Agents to …

TypeScript 14,688 1,381 Updated Apr 9, 2026

Official implementation of "OmniForcing: Unleashing Real-time Joint Audio-Visual Generation"[arXiv:2603.11647]. OmniForcing is the first framework to distill bidirectional audio-visual diffusion mo…

Python 129 1 Updated Mar 29, 2026

Lightweight coding agent that runs in your terminal

Rust 74,169 10,470 Updated Apr 9, 2026

Research on Coding Agents

11,526 19,728 Updated Apr 1, 2026

The repo is finally unlocked. enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.

Rust 179,715 106,542 Updated Apr 9, 2026

[ICLR 2026] LongLive: Real-time Interactive Long Video Generation

Python 1,147 105 Updated Feb 26, 2026

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,296 76 Updated Aug 7, 2025

An agentic skills framework & software development methodology that works.

Shell 143,680 12,277 Updated Apr 6, 2026

A diffusion-based framework for document OCR that replaces autoregressive decoding with block-level parallel diffusion decoding.

Python 460 23 Updated Mar 31, 2026

Try X-Dub to sync any character in a video with any audio you like | Official repository for "From Inpainting to Editing: Unlocking Robust Mask-Free Visual Dubbing via Generative Bootstrapping"

Python 181 1 Updated Mar 19, 2026

Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory

Python 147 5 Updated Feb 9, 2026

Automated system for LLM evaluation via agents.

Python 49 7 Updated Mar 31, 2026

Agentic IM Chatbot infrastructure that integrates lots of IM platforms, LLMs, plugins and AI feature, and can be your openclaw alternative. ✨

Python 29,466 1,989 Updated Apr 9, 2026

mjlab-native port of InstinctLab for humanoid RL and Project-Instinct workflows.

Python 130 14 Updated Apr 8, 2026

Unified Operator on Interactive World Model is a unified frontend for interactive world models. It lets users select a model, choose a dataset (e.g., CSGO) or directly upload an image, and immediat…

Python 41 1 Updated Apr 2, 2026

Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞

Python 10,857 1,232 Updated Apr 9, 2026

OpenClaw-RL: Train any agent simply by talking

Python 4,778 494 Updated Apr 8, 2026
Python 99 3 Updated Mar 30, 2026

The official code repository for LeVo: High-Quality Song Generation with Multi-Preference Alignment

Python 1,552 185 Updated Mar 12, 2026

The First Unified Agent Data Synthesis Framework for Custom Agentic Task with all-in-one envrionment

Python 74 5 Updated Apr 9, 2026

A curated collection of research papers, models, and resources tracing the evolution from specialized models to unified world models.

129 10 Updated Mar 19, 2026
Next