
Organizations

@sisl @JuliaPOMDP @StanfordVL

Starred repositories

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 1,481 101 Updated Jun 15, 2026

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 594 38 Updated Nov 26, 2025

MiroThinker is a deep research agent optimized for complex research and prediction tasks. Our latest models, MiroThinker-1.7, achieve 74.0 and 75.3 on BrowseComp and BrowseComp Zh, respectively.

Python 8,296 639 Updated Apr 25, 2026

A Tree Search Library with Flexible API for LLM Inference-Time Scaling

Python 551 72 Updated Feb 5, 2026
Python 115 18 Updated Jun 30, 2025

OpenClaw-RL: Train any agent simply by talking

Python 5,503 597 Updated May 23, 2026

The official repository of "A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications".

263 10 Updated Jun 8, 2026

Self-referential self-improving agents that can optimize for any computable task

Python 2,583 337 Updated May 9, 2026

Research on Coding Agents

12,001 19,701 Updated Apr 1, 2026

Curated academic CV templates and guidelines for PhD students, researchers, and faculty job applicants.

TeX 1,148 129 Updated Apr 1, 2026

AI agents running research on single-GPU nanochat training automatically

Python 87,280 12,638 Updated Mar 26, 2026

LLM Chess - evaluating Large Language Models' reasoning and instruction-following abilities by simulating chess games

Python 104 10 Updated Jun 13, 2026

A collection of LLM pruning implementations, training code for GPUs and TPUs, and evaluation scripts.

Python 67 8 Updated Apr 20, 2026

CATArena is an engineering-level tournament evaluation platform for Large Language Model-driven code agents (LLM-driven code agents), based on an iterative competitive peer learning framework.

Python 67 10 Updated Dec 25, 2025

AI-Trader: 100% Fully-Automated Agent-Native Trading

Python 19,805 3,028 Updated Jun 11, 2026

Synthetic data curation for post-training and structured data extraction

Python 1,687 142 Updated Jun 14, 2026

Benchmark LLM reasoning capability by solving chess puzzles.

Python 91 5 Updated Apr 26, 2025

Training VLM agents with multi-turn reinforcement learning

Python 476 58 Updated May 11, 2026

Harsh Jhamtani*, Varun Gangal*, Eduard Hovy, Graham Neubig, Taylor Berg-Kirkpatrick. Learning to Generate Move-by-Move Commentary for Chess Games from Large-Scale Social Forum Data. ACL 2018

OpenEdge ABL 48 11 Updated Jul 21, 2022

Open source neural network chess engine with GPU acceleration and broad hardware support.

C++ 3,129 598 Updated May 5, 2026

A Text-Based Environment for Interactive Debugging

Python 298 40 Updated Jun 15, 2026

This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language Models".

Python 198 6 Updated Apr 29, 2026

Fully open reproduction of DeepSeek-R1

Python 26,329 2,444 Updated Apr 2, 2026

[ICLR 2026] Learning to Reason without External Rewards

Python 411 44 Updated Jan 26, 2026

A library for generative social simulation

Python 1,481 335 Updated Jun 16, 2026

AI paper trading project inspired by nof1 Alpha Arena, using ccxt for price quotes.

Python 592 147 Updated Nov 21, 2025

Procgen Benchmark: Procedurally-Generated Game-Like Gym-Environments

C++ 1,167 221 Updated Mar 27, 2026

Defeating the Training-Inference Mismatch via FP16

Python 196 17 Updated Nov 14, 2025

Natural Language Reinforcement Learning

Python 101 7 Updated Jul 30, 2025
Python 16 3 Updated Jul 10, 2025