  • EPFL
  • Lausanne, Switzerland


Starred repositories

Build and install Paddle Inference GPU 3. from source on NVIDIA Jetson (JetPack 6.x, CUDA 12), with TensorRT support and known issue fixes.

10 Updated Jan 26, 2026

MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation Model

1,272 55 Updated Jan 8, 2026

Building large models from scratch: a complete hands-on guide from pre-training to RLHF

Python 2,602 199 Updated Mar 19, 2026

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 23,227 4,252 Updated Apr 14, 2026

AvaLee's collection of product manager study materials

76 34 Updated Sep 4, 2018

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…

Python 43,321 7,248 Updated Apr 15, 2026

Synthetic data curation for post-training and structured data extraction

Python 1,664 136 Updated Mar 28, 2026

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 101,269 13,109 Updated Apr 15, 2026

Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

Go 169,017 15,589 Updated Apr 15, 2026

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.5, DeepSeek, gpt-oss locally.

Python 61,561 5,335 Updated Apr 14, 2026

Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"

Python 164 7 Updated Oct 23, 2025

A curated list of Multi-Modal Reinforcement Learning resources (continually updated)

598 22 Updated Dec 15, 2025

A curated collection of papers on Vision-Language Models for Image Understanding from CVPR 2025

6 Updated Jun 9, 2025

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,419 42 Updated Mar 9, 2026

Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen

438 23 Updated Mar 8, 2025

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 15,312 1,430 Updated Mar 26, 2026

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 2,621 205 Updated Feb 16, 2025

This is the Repository for Geometry Problem Solving Method Evaluation

Python 26 1 Updated Oct 8, 2024

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'

Jupyter Notebook 2,260 106 Updated Oct 29, 2025

The development and future prospects of large multimodal reasoning models.

603 21 Updated Jan 9, 2026

[NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Python 425 42 Updated Mar 11, 2026
Python 11 2 Updated Apr 15, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,687 3,656 Updated Apr 15, 2026
Python 9 Updated May 16, 2025

Official code of FDS (CVPR 2025)

Python 10 4 Updated Apr 20, 2025

Reproductions of large-model-related algorithms, plus some study notes

Python 3,255 439 Updated Mar 21, 2026

TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models

Python 1,589 169 Updated Apr 18, 2025
Python 355 30 Updated Apr 9, 2025

A curated list of Large Language Model resources, covering model training, serving, fine-tuning, and building LLM applications.

4,877 689 Updated Aug 18, 2025