ysjprojects

SJ Yu ysjprojects

17 followers · 10 following

www.sjyu.ai

Achievements

x2 x2

Achievements

x2 x2

Highlights

Stars

NVIDIA-NeMo / RL

Scalable toolkit for efficient model reinforcement

Python 1,170 201 Updated Dec 24, 2025

inclusionAI / AReaL

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,276 257 Updated Dec 25, 2025

PrimeIntellect-ai / prime-rl

Async RL Training at Scale

Python 955 165 Updated Dec 24, 2025

Peng-y-x / HealthcareManagement

health care management system frontend: react backend: flask

JavaScript 1 Updated Dec 2, 2025

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 4,908 469 Updated Dec 24, 2025

sureenheer / lendx

Python 3 Updated Oct 27, 2025

NVIDIA / Isaac-GR00T

NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.

Jupyter Notebook 5,683 893 Updated Dec 18, 2025

langfengQ / verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,315 117 Updated Dec 11, 2025

austinhuang0131 / oss-fall2025

Forked from ivanearisty/oss-taapp

Python 1 2 Updated Dec 20, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,777 2,894 Updated Dec 25, 2025

lumina-ai-inc / chunkr

Vision infrastructure to turn complex documents into RAG/LLM-ready data

Rust 2,925 188 Updated Sep 24, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 16,771 2,375 Updated Dec 24, 2025

mastra-ai / mastra

The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama.

TypeScript 18,981 1,365 Updated Dec 25, 2025

coinbase / agentkit

Every AI Agent deserves a wallet.

TypeScript 983 578 Updated Dec 23, 2025

huggingface / aisheets

Build, enrich, and transform datasets using AI models with no code

TypeScript 1,610 137 Updated Oct 23, 2025

ast-grep / ast-grep

⚡A CLI tool for code structural search, lint and rewriting. Written in Rust

Rust 11,732 294 Updated Dec 24, 2025

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 49,875 4,112 Updated Dec 23, 2025

ClickHouse / ClickHouse

ClickHouse® is a real-time analytics database management system

C++ 44,826 7,922 Updated Dec 25, 2025

crewAIInc / crewAI

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Python 41,737 5,584 Updated Dec 25, 2025

mem0ai / mem0

Universal memory layer for AI Agents

Python 44,649 4,854 Updated Dec 17, 2025

OpenPipe / ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 8,070 644 Updated Dec 24, 2025

Lightning-AI / LitModels

Save, load, host, and share AI model checkpoints without slowing down training. Host on Lightning AI or your own cloud with enterprise-grade access controls.

Python 40 7 Updated Dec 16, 2025