ylzIng
  • Tianjin

Python 4 Updated Dec 2, 2025

PyTorch implementation of Reliable Thinking with Images.

Python 23 2 Updated May 3, 2026

[ACL'26 Main] Beyond Majority Voting: Towards Fine-grained and More Reliable Reward Signal for Test-Time Reinforcement

Python 8 Updated Apr 6, 2026

Collection of latest papers and materials in the area of RLVR!

Python 121 6 Updated Jun 15, 2026

[ACL'26] Official Repository for The Paper: What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time

Python 15 Updated Apr 7, 2026

[CVPR2026] Chain of World: World Model Thinking in Latent Motion

Python 58 1 Updated Mar 4, 2026

Official code of Motus: A Unified Latent Action World Model

Python 1,152 65 Updated Jan 5, 2026

Official code base for LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels

Python 3,893 538 Updated May 26, 2026

Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?

Python 1,001 106 Updated Apr 3, 2026
Python 2,009 391 Updated Jul 23, 2024

[RSS 2023] Diffusion Policy: Visuomotor Policy Learning via Action Diffusion

Python 4,284 786 Updated Dec 24, 2024

UniRL is a Framework for Unified Multimodal Model Reinforcement Learning

Python 648 35 Updated Jun 19, 2026
Python 15 1 Updated Jun 10, 2026

[ICML 2026] ZwZ model family: SOTA fine-grained perception performance; ZoomBench: a new challenging perception benchmark

Python 160 2 Updated May 4, 2026

NVIDIA Isaac GR00T N1.7 - A Foundation Model for Generalist Robots.

Python 7,381 1,270 Updated Jun 19, 2026

Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals

Python 2,293 194 Updated Apr 19, 2026

Can VLA Models Learn from Real-World Data Continually without Forgetting?

Python 6 Updated Jun 12, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 16,283 2,022 Updated Mar 17, 2026

[ICLR'26 Oral] Beyond Prompt-Induced Lies: Investigating LLM Deception on Benign Prompts

Python 12 Updated Feb 10, 2026

[ICML 2026] LaST$_0$: Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model

Python 78 6 Updated Apr 30, 2026

Implementation of Unified-Action-Model

8 1 Updated May 20, 2026

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Python 2,209 200 Updated Mar 19, 2026

[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 1,732 113 Updated Jan 6, 2026

A curated collection of papers and resources on On-Policy Distillation for Large Language Models.

Python 334 6 Updated Jun 16, 2026

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 5,020 372 Updated Apr 6, 2026

MCITlib: Multimodal Continual Instruction Tuning Library and Benchmark

Python 91 9 Updated Jun 7, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,040 4,098 Updated Jun 18, 2026

Official Repository of Absolute Zero Reasoner

Python 1,869 298 Updated Aug 24, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,466 131 Updated Nov 9, 2025

A cross-platform desktop All-in-One assistant for Claude Code, Codex, OpenCode, OpenClaw, Gemini CLI & Hermes Agent. Only official website: ccswitch.io

Rust 104,469 6,904 Updated Jun 18, 2026