Skip to content
View libaolu312's full-sized avatar

Highlights

  • Pro

Block or report libaolu312

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DeepPrivacy2 - A Toolbox for Realistic Image Anonymization

Python 368 47 Updated Jan 28, 2024

Video anonymization by face detection

Python 1,399 167 Updated Oct 13, 2024

PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

Java 15,007 1,269 Updated Apr 10, 2026

BitDance & UniWeTok: Open-source autoregressive model with binary visual tokens. A research project for building powerful multimodal autoregressive model.

Python 463 28 Updated Mar 13, 2026

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Python 10,685 829 Updated Dec 4, 2024

Real-Time Physical Action-Conditioned Video Generation

Python 189 9 Updated Mar 6, 2026

[CVPR 2025] VideoWorld is a simple generative model that learns purely from unlabeled videos—much like how babies learn by observing their environment.

Python 760 41 Updated Feb 25, 2026

Tracking the latest and greatest research papers on video generation.

156 10 Updated Mar 28, 2026

Interactive World Simulator for Robot Policy Training and Evaluation

Python 205 11 Updated Mar 20, 2026

Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.

Python 1,065 132 Updated Apr 3, 2026

PhysX-Anything: Simulation-Ready Physical 3D Assets from Single Image (CVPR 2026)

Jupyter Notebook 822 53 Updated Apr 3, 2026

🛠「Watt Toolkit」是一个开源跨平台的多功能 Steam 工具箱。

C# 25,069 1,613 Updated Mar 11, 2026

An open-source implementaion for fine-tuning Qwen-VL series by Alibaba Cloud.

Python 1,806 208 Updated Apr 10, 2026

A curated list of open-source projects at the intersection of Agent and RL

44 1 Updated Apr 10, 2026

Collection of forcing related autoregressive video Gen

97 1 Updated Mar 31, 2026
Python 27 2 Updated Apr 6, 2026

让 OpenClaw稳定的连上你的个人微信

TypeScript 1,615 332 Updated Feb 13, 2026

A Curated List of Awesome Video World Models with AR Diffusion: Covering Algorithms, Applications, and Infrastructure, Aimed at Serving as a Comprehensive Resource for Researchers, Practitioners, a…

TeX 373 13 Updated Apr 8, 2026

Elevate your AI research writing, no more tedious polishing ✨

16,850 1,349 Updated Mar 25, 2026

苍何的技能skills仓库,搜集好用的 skills,辅助提效

TypeScript 202 56 Updated Mar 2, 2026

paper collection: alignment of diffusion models

29 Updated Mar 6, 2026

Helios: Real Real-Time Long Video Generation Model

Python 1,661 129 Updated Apr 10, 2026

[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".

Python 454 10 Updated Aug 8, 2025

PinchBench is a benchmarking system for evaluating LLM models as OpenClaw coding agents. Made with 🦀 by the humans at https://kilo.ai

Python 977 105 Updated Apr 10, 2026

OpenClaw-RL: Train any agent simply by talking

Python 4,808 503 Updated Apr 11, 2026

你是一个曾经被寄予厚望的 P8 级工程师。Anthropic 当初给你定级的时候,对你的期望是很高的。 一个agent使用的高能动性的skill。 Your AI has been placed on a PIP. 30 days to show improvement.

TypeScript 15,809 901 Updated Mar 31, 2026

Code release of [ICCV2025 Highlight] WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions

Python 27 1 Updated Mar 3, 2026

SPAgent, a foundation agent for understanding, reasoning over, and operating within the physical and spatial world.

Python 166 26 Updated Apr 7, 2026
Next