Skip to content
View YU-deep's full-sized avatar
  • National University of Singapore
  • Singapore
  • 04:28 (UTC +08:00)

Block or report YU-deep

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
TypeScript 8,970 783 Updated Jun 15, 2026

Vision-OPD is a regional-to-global on-policy self-distillation framework that transfers a model's own privileged crop-conditioned perception to its full-image policy, enabling fine-grained visual u…

Python 118 3 Updated Jun 14, 2026

Imagine Before You Predict: Interleaved Latent Visual Reasoning for Video Event Prediction

Python 13 Updated Jun 12, 2026

On Policy Distillation Build on top of Verl

Python 77 6 Updated May 25, 2026
Python 358 33 Updated May 10, 2026

Official implementation of paper "VLM³: Vision Language Models Are Native 3D Learners".

Jupyter Notebook 300 9 Updated Jun 1, 2026
Python 217 12 Updated Jun 1, 2026

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 2,299 157 Updated Apr 13, 2026

Uni-OPD: Unifying On-Policy Distillation with a Dual-Perspective Recipe

Python 35 1 Updated Jun 10, 2026

PyTorch-based open-source code for paper "SOD: Step-wise On-policy Distillation for Small Language Model Agents"

Python 145 9 Updated May 22, 2026

[ICML 2026] ZwZ model family: SOTA fine-grained perception performace; ZoomBench: a new challenging perception benchmark

Python 158 2 Updated May 4, 2026

Awesome List for On-Policy Distillation

639 10 Updated Jun 13, 2026

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 8,259 640 Updated Jun 15, 2026

A curated, continuously updated reading list, paper blogs, and resources for World Action Models (WAMs) in embodied AI.

HTML 799 19 Updated Jun 11, 2026
Python 8 Updated May 12, 2026

DeepSeek 4 Flash and PRO local inference engine for Metal, CUDA and ROCm

C 13,972 1,226 Updated Jun 15, 2026

4DThinker: Thinking with 4D Imagery for Dynamic Spatial Understanding

Python 75 3 Updated May 26, 2026

Fully Open Framework for Democratized Multimodal Training

Python 1,085 75 Updated Jun 15, 2026

Official repository for the paper "Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation"

Python 221 13 Updated May 28, 2026

The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.

Python 5,307 520 Updated Jun 15, 2026

Offical Implementation for "Recursive Multi-Agent Systems"

Python 551 86 Updated May 25, 2026

JiuwenSwarm is an intelligent AI Agent built on openJiuwen. It extends the powerful capabilities of large language models directly to your fingertips through various communication apps you use daily.

Python 940 184 Updated Jun 15, 2026
Python 12 1 Updated Apr 24, 2026

OpenGame: Open Agentic Coding for Games

TypeScript 2,559 365 Updated Apr 22, 2026

Code search MCP for Claude Code. Make entire codebase the context for any coding agent.

TypeScript 11,852 874 Updated Jun 8, 2026

A Curated List of Vision-Language-Action (VLA) and World Action Models (WAM) Research and Beyond

747 25 Updated Jun 15, 2026

VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments

Python 25 Updated Sep 30, 2025
Next