Skip to content
View JinxIsPerfect's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report JinxIsPerfect

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
189 results for source starred repositories
Clear filter

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 105,983 16,877 Updated Apr 1, 2026

An elegant PyTorch deep reinforcement learning library.

Python 10,463 1,285 Updated Mar 29, 2026
Python 323 24 Updated Aug 12, 2025

Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞

Python 10,019 1,104 Updated Apr 1, 2026

🎓 系统性大语言模型构建课程|🛠️ 覆盖预训练数据工程、Tokenizer、Transformer、MoE、GPU 编程 (CUDA/Triton)、分布式训练、Scaling Laws、推理优化及对齐 (SFT/RLHF/GRPO)|🚀 6 个渐进式作业 + 代码驱动,建立 LLM 全栈认知体系

Jupyter Notebook 284 34 Updated Apr 1, 2026

Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, i…

Python 40,017 6,253 Updated Mar 10, 2026

AI agents running research on single-GPU nanochat training automatically

Python 64,048 9,043 Updated Mar 26, 2026
Python 472 36 Updated Oct 16, 2025

Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Python 432 29 Updated Feb 17, 2026
Python 968 93 Updated Dec 11, 2025

Train transformer language models with reinforcement learning.

Python 17,885 2,601 Updated Apr 2, 2026

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,728 290 Updated Apr 1, 2026
Python 1,122 53 Updated Jan 10, 2026
Python 122 19 Updated Aug 14, 2024

复现大模型相关算法及一些学习记录

Python 3,207 428 Updated Mar 21, 2026

RLAnything & DemyAgent: General and scalable agentic RL algorithms across terminal, GUI, SWE, and tool-call settings

Python 429 48 Updated Feb 27, 2026

CodeBERT

Python 2,752 501 Updated Jul 9, 2023

Short RL

Python 18 1 Updated May 26, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,444 164 Updated Mar 20, 2025

Reproducing R1 for Code with Reliable Rewards

Python 302 18 Updated May 5, 2025

A curated list of reinforcement learning (RL) for agents.

88 2 Updated Mar 30, 2026

A markdown template for taking notes to summarize research papers.

78 22 Updated Feb 19, 2024

Minimal reproduction of DeepSeek R1-Zero

Python 13,013 1,585 Updated Feb 27, 2026

Lightweight coding agent that runs in your terminal

Rust 72,504 10,147 Updated Apr 2, 2026

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 2,130 175 Updated Aug 26, 2025

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 345,696 68,737 Updated Apr 2, 2026
Python 467 77 Updated Dec 12, 2024

Minimalistic large language model 3D-parallelism training

Python 2,631 291 Updated Apr 2, 2026

🔍大模型应用开发实战一:RAG 技术全栈指南,在线阅读地址:https://datawhalechina.github.io/all-in-rag/

Python 5,633 2,776 Updated Mar 17, 2026

ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning

Python 1,359 79 Updated May 16, 2025
Next