Skip to content
View czczup's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Organizations

@OpenGVLab

Block or report czczup

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MiMo-VL

565 27 Updated Aug 21, 2025

ZeroGUI: Automating Online GUI Learning at Zero Human Cost

Python 92 6 Updated Jul 17, 2025

Open-source unified multimodal model

Python 5,120 455 Updated Aug 22, 2025

Implementation for "The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer"

Python 65 4 Updated Aug 28, 2025

Collection of Highlight papers

41 1 Updated May 24, 2024

The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.

26 1 Updated Feb 22, 2024

Official Repo for Open-Reasoner-Zero

Python 2,045 117 Updated Jun 2, 2025

学习笔记 - 码云:https://gitee.com/wanzheng_96/Modules-Learn)

Jupyter Notebook 495 149 Updated Nov 17, 2024

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,432 416 Updated Oct 9, 2025

A light-weight tool for evaluating LLMs in rule-based ways.

Python 69 5 Updated Jun 19, 2025
Python 958 45 Updated Jul 2, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,118 2,516 Updated Oct 9, 2025

A curated list of Multi-Modal Reinforcement Learning resources (continually updated)

529 20 Updated Sep 12, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 18,705 3,097 Updated Oct 9, 2025

MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning

Python 737 28 Updated Sep 7, 2025

A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.

68 7 Updated Mar 18, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.

Python 145 7 Updated Oct 6, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,424 544 Updated May 18, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,013 201 Updated Sep 30, 2025

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 820 53 Updated May 14, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,097 789 Updated Oct 9, 2025

[🏆AAAI2025] Official Repo for ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area.

Python 55 6 Updated Aug 30, 2025

GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

Python 15,434 2,272 Updated Aug 15, 2025

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high …

Python 701 221 Updated Sep 3, 2025

LLM&VLM Tutorial

Python 1,885 1,546 Updated May 5, 2025
Next