An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,320 807 Updated Oct 31, 2025

amueller / word_cloud

A little word cloud generator in Python

Python 10,458 2,338 Updated Aug 31, 2025

dw-dengwei / daily-arXiv-ai-enhanced

Automatically crawl arXiv papers daily, summarize them using AI, and present them using GitHub Pages.

JavaScript 2,024 650 Updated Nov 5, 2025

sinaptik-ai / pandas-ai

Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.

Python 22,516 2,200 Updated Oct 28, 2025

beixiaocai / xclabel

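xclabel is a tool that supports multi-user collaboration and covers sample import, sample annotation, model training, model management, model testing, and model export.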

173 47 Updated Oct 3, 2025

windingwind / zotero-better-notes

Everything about note management. All in Zotero.

TypeScript 6,963 223 Updated Oct 20, 2025

datawhalechina / easy-rl

A Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄). Read online: https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 12,924 2,147 Updated Sep 6, 2025

immersive-translate / zotero-immersivetranslate

Zotero BabelDOC plugin, for Immersive Translate Pro members.

TypeScript 186 15 Updated Oct 27, 2025

IDEA-Research / Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 17,095 1,548 Updated Sep 5, 2024

linkangheng / PR1

[NeurIPS 2025] Official code implementation of Perception R1: Pioneering Perception Policy with Reinforcement Learning

Python 266 10 Updated Jul 15, 2025

jingyaogong / minimind-v

🚀 [Large models] Train a 26M-parameter visual multimodal VLM from scratch in just 1 hour!

Python 5,209 546 Updated Oct 30, 2025

jingyaogong / minimind

🚀🚀 [Large models] Train a small 26M-parameter GPT entirely from scratch in just 2 hours!

Python 32,529 3,763 Updated Nov 2, 2025

SAMMiCA / ChangeSim

A multi-modal, photo-realistic dataset for online end-to-end scene change detection and more (accepted to IROS2021).

Python 125 14 Updated Oct 28, 2022

Liuziyu77 / Visual-RFT

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'

Jupyter Notebook 2,243 100 Updated Oct 29, 2025

CherryHQ / cherry-studio

🍒 Cherry Studio is a desktop client that supports multiple LLM providers.

TypeScript 35,073 3,179 Updated Nov 6, 2025

mlabonne / llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

66,658 7,516 Updated Jun 4, 2025

cheung zhouchanggeng

Stars

RobvanGastel / dinov3-finetune

EvolvingLMMs-Lab / LLaVA-OneVision-1.5

OschAI / VisioFirm

we0091234 / crnn_plate_recognition

MathFoundationRL / Book-Mathematical-Foundation-of-Reinforcement-Learning

cuixing158 / 360-VR-Demo

unclecode / crawl4ai

ShaohonChen / Qwen3-SmVL

miquel-espinosa / no-time-to-train

karminski / one-small-step

Dao-AILab / flash-attention

huggingface / smollm

HKUDS / LightRAG

amusi / CVPR2025-Papers-with-Code

OpenRLHF / OpenRLHF