Jinx JinxIsPerfect

🎯

Focusing

5 followers · 43 following

Stars

189 results for source starred repositories

anthropics / claude-code

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 105,983 16,877 Updated Apr 1, 2026

thu-ml / tianshou

An elegant PyTorch deep reinforcement learning library.

Python 10,463 1,285 Updated Mar 29, 2026

ReTool-RL / ReTool

Python 323 24 Updated Aug 12, 2025

aiming-lab / AutoResearchClaw

Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞

Python 10,019 1,104 Updated Apr 1, 2026

datawhalechina / diy-llm

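🎓 A systematic course on building large language models | 🛠️ Covers pretraining data engineering, Tokenizer, Transformer, MoE, GPU programming (CUDA/Triton), distributed training, Scaling Laws, inference optimization, and alignment (SFT/RLHF/GRPO) | 🚀 6 progressive, code-driven assignments that build a full-stack understanding of LLMs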

Jupyter Notebook 284 34 Updated Apr 1, 2026

microsoft / qlib

Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, i…

Python 40,017 6,253 Updated Mar 10, 2026

karpathy / autoresearch

AI agents running research on single-GPU nanochat training automatically

Python 64,048 9,043 Updated Mar 26, 2026

qiancheng0 / ToolRL

Python 472 36 Updated Oct 16, 2025

NVlabs / GDPO

Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Python 432 29 Updated Feb 17, 2026

bytedance / SandboxFusion

Python 968 93 Updated Dec 11, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 17,885 2,601 Updated Apr 2, 2026

NovaSky-AI / SkyRL

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,728 290 Updated Apr 1, 2026

huggingface / Math-Verify

Python 1,122 53 Updated Jan 10, 2026

rui-ye / FedLLM-Bench

Python 122 19 Updated Aug 14, 2024

wyf3 / llm_related

Reproductions of LLM-related algorithms, plus some study notes

Python 3,207 428 Updated Mar 21, 2026

Gen-Verse / Open-AgentRL

RLAnything & DemyAgent: General and scalable agentic RL algorithms across terminal, GUI, SWE, and tool-call settings

Python 429 48 Updated Feb 27, 2026

microsoft / CodeBERT

CodeBERT

Python 2,752 501 Updated Jul 9, 2023

lblankl / Short-RL

Short RL

Python 18 1 Updated May 26, 2025

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,444 164 Updated Mar 20, 2025

ganler / code-r1

Reproducing R1 for Code with Reliable Rewards

Python 302 18 Updated May 5, 2025

Necolizer / awesome-rl-for-agents

A curated list of reinforcement learning (RL) for agents.

88 2 Updated Mar 30, 2026

KaleabTessera / Research-Paper-Reading-Template

A markdown template for taking notes to summarize research papers.

78 22 Updated Feb 19, 2024

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 13,013 1,585 Updated Feb 27, 2026

openai / codex

Lightweight coding agent that runs in your terminal

Rust 72,504 10,147 Updated Apr 2, 2026

huggingface / picotron

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 2,130 175 Updated Aug 26, 2025

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 345,696 68,737 Updated Apr 2, 2026

rui-ye / OpenFedLLM

Python 467 77 Updated Dec 12, 2024

huggingface / nanotron

Minimalistic large language model 3D-parallelism training

Python 2,631 291 Updated Apr 2, 2026

datawhalechina / all-in-rag

🔍 Hands-on LLM application development, part 1: a full-stack guide to RAG techniques. Read online: https://datawhalechina.github.io/all-in-rag/

Python 5,633 2,776 Updated Mar 17, 2026

Agent-RL / ReCall

ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning

Python 1,359 79 Updated May 16, 2025
