ylzIng

Follow

yezhang_99 ylzIng

Follow

AI for Science(AI4S)

Tianjin

Highlights

Pro

Lists (3)

Sort

LLM

🚀 My stack

Postgrad

Stars

mm-deception / debate-with-images

Python 4 Updated Dec 2, 2025

XLearning-SCU / Reliable_TWI

Pytorch Implementation of Reliable Thinking with Images.

Python 23 2 Updated May 3, 2026

szu-tera / SCOPE

[ACL'26 Main] Beyond Majority Voting: Towards Fine-grained and More Reliable Reward Signal for Test-Time Reinforcement

Python 8 Updated Apr 6, 2026

smiles724 / Awesome-LLM-RLVR

Collection of latest papers and materials in the area of RLVR!

Python 121 6 Updated Jun 15, 2026

Jasper-Yan / SCRL

[ACL'26] Official Repository for The Paper: What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time

Python 15 Updated Apr 7, 2026

fx-hit / CoWVLA

[CVPR2026] Chain of World: World Model Thinking in Latent Motion

Python 58 1 Updated Mar 4, 2026

thu-ml / Motus

Official code of Motus: A Unified Latent Action World Model

Python 1,152 65 Updated Jan 5, 2026

lucas-maes / le-wm

Official code base for LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels

Python 3,893 538 Updated May 26, 2026

yuantianyuan01 / FastWAM

Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?

Python 1,001 106 Updated Apr 3, 2026

tonyzhaozh / act

Python 2,009 391 Updated Jul 23, 2024

real-stanford / diffusion_policy

[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion

Python 4,284 786 Updated Dec 24, 2024

Tencent-Hunyuan / UniRL

UniRL is a Framework for Unified Multimodal Model Reinforcement Learning

Python 648 35 Updated Jun 19, 2026

Warrenustc1958 / UniVLR

Python 15 1 Updated Jun 10, 2026

inclusionAI / Zooming-without-Zooming

[ICML 2026] ZwZ model family: SOTA fine-grained perception performace; ZoomBench: a new challenging perception benchmark

Python 160 2 Updated May 4, 2026

NVIDIA / Isaac-GR00T

NVIDIA Isaac GR00T N1.7 - A Foundation Model for Generalist Robots.

Python 7,381 1,270 Updated Jun 19, 2026

dreamzero0 / dreamzero

Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals

Python 2,293 194 Updated Apr 19, 2026

Agentic-Intelligence-Lab / ContinualVLA

Can VLA Models Learn from Real-World Data Continually without Forgetting?

Python 6 Updated Jun 12, 2026

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 16,283 2,022 Updated Mar 17, 2026

Xtra-Computing / LLM-Deception

[ICLR'26 Oral] Beyond Prompt-Induced Lies: Investigating LLM Deception on Benign Prompts

Python 12 Updated Feb 10, 2026

ZhuoyangLiu2005 / last0

[ICML 2026] LaST$_0$: Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model

Python 78 6 Updated Apr 30, 2026

CladernyJorn / Unified-Action-Model

Implementation of Unified-Action-Model

8 1 Updated May 20, 2026

OpenHelix-Team / VLA-Adapter

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Python 2,209 200 Updated Mar 19, 2026

PRIME-RL / SimpleVLA-RL

[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 1,732 113 Updated Jan 6, 2026

nick7nlp / Awesome-LLM-On-Policy-Distillation

A curated collection of papers and resources on On-Policy Distillation for Large Language Models.

Python 334 6 Updated Jun 16, 2026

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 5,020 372 Updated Apr 6, 2026

Ghy0501 / MCITlib

MCITlib: Multimodal Continual Instruction Tuning Library and Benchmark

Python 91 9 Updated Jun 7, 2026

verl-project / verl

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,040 4,098 Updated Jun 18, 2026

LeapLabTHU / Absolute-Zero-Reasoner

Official Repository of Absolute Zero Reasoner

Python 1,869 298 Updated Aug 24, 2025

TsinghuaC3I / Awesome-RL-for-LRMs

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,466 131 Updated Nov 9, 2025

farion1231 / cc-switch

A cross-platform desktop All-in-One assistant for Claude Code, Codex, OpenCode, OpenClaw, Gemini CLI & Hermes Agent. Only official website: ccswitch.io

Rust 104,469 6,904 Updated Jun 18, 2026