Skip to content
View yujinhanml's full-sized avatar
🌕
🌕

Highlights

  • Pro

Block or report yujinhanml

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The fastest repo in history to surpass 50K stars ⭐, reaching the milestone in just 2 hours after publication. Better Harness Tools, not merely storing the archive of leaked Claude Code but make rea…

Rust 72,977 73,407 Updated Apr 1, 2026

The official code of Yume

Python 639 38 Updated Jan 14, 2026

InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions

Python 133 9 Updated Feb 7, 2024

[ICLR 2026] LongLive: Real-time Interactive Long Video Generation

Python 1,141 105 Updated Feb 26, 2026

HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency

Python 1,362 120 Updated Mar 24, 2026

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 4,891 359 Updated Mar 3, 2026

Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.

Python 5,453 821 Updated Mar 30, 2026

ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning

Python 14 Updated Dec 15, 2025

A survey for visual generation alignment

131 7 Updated Nov 9, 2025

✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,500 183 Updated Mar 28, 2025

Official implementation of Continuous 3D Perception Model with Persistent State

Python 1,379 85 Updated Aug 27, 2025

[ICLR2026] The official code of "Routing Matters in MoE: Scaling Diffusion Transformers with Explicit Routing Guidance"

Python 33 1 Updated Mar 23, 2026

Official Code Repo for UniVA: Universal Video Agents

TypeScript 432 65 Updated Jan 27, 2026

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Jupyter Notebook 6,404 649 Updated Sep 26, 2024

(ICCV2025) Official repository of paper "ViSpeak: Visual Instruction Feedback in Streaming Videos"

Python 47 2 Updated Jul 1, 2025

The official implementation of paper "Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation Generation"

19 1 Updated Mar 26, 2026

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 7,681 470 Updated Feb 10, 2026

Automatic Metric for Evaluating Generated Videos

Python 38 1 Updated Dec 8, 2025

UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture

Python 105 Updated Feb 5, 2026

Official Implementation of VideoDPO

Python 163 3 Updated Jun 1, 2025

official implementation of VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning (COLM 2024)

179 8 Updated Aug 7, 2024

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Python 954 38 Updated Mar 19, 2025

[ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation

115 1 Updated Oct 7, 2025

[CVPR 2026] Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO

Python 104 Updated Feb 28, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 74,863 15,015 Updated Apr 1, 2026

[ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Python 749 30 Updated Feb 10, 2026

[NeurIPS 2025] Improving Video Generation with Human Feedback

Python 441 12 Updated Sep 24, 2025

BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models

Python 41 Updated Oct 30, 2025

[ICLR 2026] EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling

Python 227 6 Updated Mar 20, 2026
Next