Source code for 👏🏻"CLAP: Contrastive Latent Action Pretraining for Learning Vision-Language-Action Models from Human Videos"

Python 35 Updated Jun 14, 2026

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 3,855 543 Updated Jun 22, 2026

Continuous-Time Distribution Matching for Few-Step Diffusion Distillation👏

Python 142 6 Updated May 11, 2026

Implementation for the paper "StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction".

Python 37 8 Updated May 8, 2026

Public repository for Agent Skills

Python 153,788 18,134 Updated Jun 9, 2026
Python 529 13 Updated May 1, 2026
Python 339 16 Updated Apr 24, 2026

StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing

Python 2,890 363 Updated Jun 22, 2026

Gen-Searcher: Reinforcing Agentic Search for Image Generation

Python 368 32 Updated Apr 7, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,254 290 Updated Jun 22, 2026

[ICML 2026] Multimodal deep-research MLLM and benchmark. The first long-horizon multimodal deep-research MLLM, extending the number of reasoning turns to dozens and the number of search-engine inte…

Python 648 56 Updated Jun 8, 2026

Robust recipes to align language models with human and AI preferences

Python 5,614 492 Updated May 26, 2026

Code for "Diffusion Model Alignment Using Direct Preference Optimization"

Python 701 53 Updated Jun 2, 2026

[ICML 2026] Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation" & Causal Forcing++

Python 796 46 Updated Jun 17, 2026

Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…

TeX 9,948 743 Updated Jun 16, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 379,914 79,535 Updated Jun 22, 2026

NeurIPS 2025 Spotlight; ICLR 2024 Spotlight; CVPR 2024; EMNLP 2024

Python 1,840 77 Updated Nov 27, 2025

Open-source SOTA multi-image editing model

Python 872 43 Updated Jan 24, 2026

GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation.

Python 943 78 Updated Mar 20, 2026
Jupyter Notebook 132 5 Updated Aug 19, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 2,302 156 Updated Apr 13, 2026

HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency

Python 1,528 141 Updated Jun 10, 2026

The official code of Yume

Python 673 44 Updated Jan 14, 2026

[CVPR 2026] SpatialVID: A Large-Scale Video Dataset with Spatial Annotations

Python 568 20 Updated Apr 22, 2026

[ICLR 2026] Taming large-scale few-step training with self-adversarial flows! 👏🏻

Python 533 27 Updated Feb 24, 2026

The official implementation for [NeurIPS 2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Jupyter Notebook 963 61 Updated Dec 20, 2025
Python 11,595 790 Updated Feb 9, 2026