Skip to content
View ysjprojects's full-sized avatar

Highlights

  • Pro

Block or report ysjprojects

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Scalable toolkit for efficient model reinforcement

Python 1,170 201 Updated Dec 24, 2025

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,276 257 Updated Dec 25, 2025

Async RL Training at Scale

Python 955 165 Updated Dec 24, 2025

health care management system frontend: react backend: flask

JavaScript 1 Updated Dec 2, 2025

Democratizing Reinforcement Learning for LLMs

Python 4,908 469 Updated Dec 24, 2025
Python 3 Updated Oct 27, 2025

NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.

Jupyter Notebook 5,683 893 Updated Dec 18, 2025

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,315 117 Updated Dec 11, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,777 2,894 Updated Dec 25, 2025

Vision infrastructure to turn complex documents into RAG/LLM-ready data

Rust 2,925 188 Updated Sep 24, 2025

Train transformer language models with reinforcement learning.

Python 16,771 2,375 Updated Dec 24, 2025

The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama.

TypeScript 18,981 1,365 Updated Dec 25, 2025

Every AI Agent deserves a wallet.

TypeScript 983 578 Updated Dec 23, 2025

Build, enrich, and transform datasets using AI models with no code

TypeScript 1,610 137 Updated Oct 23, 2025

⚡A CLI tool for code structural search, lint and rewriting. Written in Rust

Rust 11,732 294 Updated Dec 24, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 49,875 4,112 Updated Dec 23, 2025

ClickHouse® is a real-time analytics database management system

C++ 44,826 7,922 Updated Dec 25, 2025

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Python 41,737 5,584 Updated Dec 25, 2025

Universal memory layer for AI Agents

Python 44,649 4,854 Updated Dec 17, 2025

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 8,070 644 Updated Dec 24, 2025

Save, load, host, and share AI model checkpoints without slowing down training. Host on Lightning AI or your own cloud with enterprise-grade access controls.

Python 40 7 Updated Dec 16, 2025

Machine Learning Engineering Open Book

Python 16,089 988 Updated Dec 20, 2025

A curated list of Large Language Model (LLM) Interpretability resources.

1,456 107 Updated Jun 22, 2025

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,832 135 Updated Jan 17, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 51,388 8,612 Updated Nov 12, 2025

Fully open reproduction of DeepSeek-R1

Python 25,754 2,407 Updated Nov 24, 2025

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 27,744 2,513 Updated Sep 30, 2025

A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.

Python 3,746 263 Updated Dec 23, 2025

Curated list of datasets and tools for post-training.

4,110 335 Updated Nov 10, 2025
Next