kykim0

kykim0

Learn. Understand. Serve.

14 followers · 8 following

@google
Bay Area / Seoul

Achievements

Organizations

Starred repositories

AgentR1 / Agent-R1

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 1,481 101 Updated Jun 15, 2026

inclusionAI / ASearcher

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 594 38 Updated Nov 26, 2025

MiroMindAI / MiroThinker

MiroThinker is a deep research agent optimized for complex research and prediction tasks. Our latest models, MiroThinker-1.7, achieves 74.0 and 75.3 on the BrowseComp and BrowseComp Zh, respectively.

Python 8,296 639 Updated Apr 25, 2026

SakanaAI / treequest

A Tree Search Library with Flexible API for LLM Inference-Time Scaling

Python 551 72 Updated Feb 5, 2026

SakanaAI / ab-mcts-arc2

Python 115 18 Updated Jun 30, 2025

Gen-Verse / OpenClaw-RL

OpenClaw-RL: Train any agent simply by talking

Python 5,503 597 Updated May 23, 2026

ventr1c / Awesome-RL-based-Agentic-Search-Papers

The official repository of "A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications".

263 10 Updated Jun 8, 2026

facebookresearch / HyperAgents

Self-referential self-improving agents that can optimize for any computable task

Python 2,583 337 Updated May 9, 2026

sanbuphy / learn-coding-agent

Research on Coding Agents

12,001 19,701 Updated Apr 1, 2026

LimHyungTae / Awesome-PhD-CV

Curated academic CV templates and guidelines for PhD students, researchers, and faculty job applicants.

TeX 1,148 129 Updated Apr 1, 2026

karpathy / autoresearch

AI agents running research on single-GPU nanochat training automatically

Python 87,280 12,638 Updated Mar 26, 2026

maxim-saplin / llm_chess

LLM Chess - evaluating Large Language Models' reasoning and instruction-following abilities by simulating chess games

Python 104 10 Updated Jun 13, 2026

zlab-princeton / llm-pruning-collection

A collection of various llm pruning implementations, training code for GPUs & TPUs, and evaluation script.

Python 67 8 Updated Apr 20, 2026

AGI-Eval-Official / CATArena

CATArena is an engineering-level tournament evaluation platform for Large Language Model-driven code agents (LLM-driven code agents), based on an iterative competitive peer learning framework.

Python 67 10 Updated Dec 25, 2025

HKUDS / AI-Trader

"AI-Trader: 100% Fully-Automated Agent-Native Trading"

Python 19,805 3,028 Updated Jun 11, 2026

bespokelabsai / curator

Synthetic data curation for post-training and structured data extraction

Python 1,687 142 Updated Jun 14, 2026

kagisearch / llm-chess-puzzles

Benchmark LLM reasoning capability by solving chess puzzles.

Python 91 5 Updated Apr 26, 2025

mll-lab-nu / VAGEN

Training VLM agents with multi-turn reinforcement learning

Python 476 58 Updated May 11, 2026

harsh19 / ChessCommentaryGeneration

Harsh Jhamtani*, Varun Gangal*, Eduard Hovy, Graham Neubig, Taylor Berg-Kirkpatrick. Learning to Generate Move-by-Move Commentary for Chess Games from Large-Scale Social Forum Data. ACL 2018

OpenEdge ABL 48 11 Updated Jul 21, 2022