YiyangZhou

The agent that grows with you

Python 126,138 18,858 Updated Apr 30, 2026
Python 47 Updated Apr 7, 2026

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 2,233 155 Updated Nov 4, 2025

Official implementation and experiment code for the paper "PETS: Principled and Efficient Test-Time Scaling via Optimal Trajectory Allocation".

Python 8 Updated Feb 20, 2026

BitDance & UniWeTok: Open-source autoregressive model with binary visual tokens. A research project for building powerful multimodal autoregressive model.

Python 472 29 Updated Apr 20, 2026

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,919 91 Updated Jan 8, 2026

The best ChatGPT that $100 can buy.

Python 52,723 7,054 Updated Apr 14, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 366,655 75,275 Updated Apr 30, 2026

A curated collection of fun and creative examples generated with Nano Banana & Nano Banana Pro🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the commu…

22,485 2,321 Updated Dec 12, 2025

Scalable toolkit for efficient model reinforcement

Python 1,585 357 Updated Apr 30, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21,030 3,777 Updated Apr 30, 2026

When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought

Python 31 Updated Feb 14, 2026

When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought

JavaScript 2 Updated Nov 6, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 18,778 1,447 Updated Feb 27, 2026

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,433 43 Updated Mar 9, 2026

Open-source unified multimodal model

Python 5,881 522 Updated Oct 27, 2025

[EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Python 90 5 Updated Jun 10, 2025

[NeurIPS'25] ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding

Python 52 2 Updated Sep 21, 2025

A comprehensive list of excellent research papers, models, datasets, and other resources on Vision-Language-Action (VLA) models in robotics.

473 15 Updated Mar 23, 2026

Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]

Python 858 44 Updated Dec 14, 2025

Witness the aha moment of VLM with less than $3.

Python 4,056 285 Updated May 19, 2025

A simple pip-installable Python tool to generate your HTML citation world map from your Google Scholar ID.

Python 703 63 Updated Mar 14, 2026

Web3.0 knowledge collection: web3.0 knowledge, web3.0 learning materials, web3 jobs, web3 books, web3job, blockchain knowledge, blockchain

1,532 186 Updated Apr 4, 2026

Oh my tmux! My self-contained, pretty & versatile tmux configuration made with 💛🩷💙🖤❤️🤍

Shell 24,807 3,574 Updated Apr 24, 2026

(NeurIPS 2025) Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"

Jupyter Notebook 50 5 Updated Jun 3, 2025

[NeurIPS'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models

Python 80 8 Updated Dec 4, 2024

Enhancing Large Vision Language Models with Self-Training on Image Comprehension.

Python 68 4 Updated May 31, 2024

Official implementation for "MJ-BENCH: Is Your Multimodal Reward Model Really a Good Judge?"

Jupyter Notebook 8 Updated Jun 7, 2024

[NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models

Python 87 2 Updated Oct 26, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 91,748 14,136 Updated Apr 16, 2026