Mirix is a multi-agent personal assistant designed to track on-screen activities and answer user questions intelligently. By capturing real-time visual data and consolidating it into structured mem…

Python 3,514 279 Updated Apr 17, 2026

Memento-Teams / Memento

Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs

Python 2,399 282 Updated Oct 5, 2025

bowen-upenn / PersonaMem

[COLM 2025] Know Me, Respond to Me: Benchmarking LLMs for Dynamic User Profiling and Personalized Responses at Scale

Python 134 7 Updated Mar 19, 2026

younggyoseo / FastTD3

Python 441 47 Updated Oct 12, 2025

microsoft / magentic-ui

A research prototype of a human-centered web agent

Python 9,778 973 Updated Apr 15, 2026

NVIDIA / Isaac-GR00T

NVIDIA Isaac GR00T N1.7 - A Foundation Model for Generalist Robots.

Python 6,723 1,137 Updated Apr 18, 2026

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 4,508 394 Updated Nov 13, 2025

woominsong / Simba

Official implementation of Sparsified State-Space Models are Efficient Highway Networks (TMLR 2025).

2 Updated Mar 6, 2025

verl-project / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,780 3,691 Updated Apr 17, 2026

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 77,225 15,791 Updated Apr 18, 2026

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 2,091 119 Updated Jun 2, 2025

goddoe / RLYX

A hackable, simple, and research-friendly GRPO training framework with high-speed weight synchronization in a multinode environment.

Python 37 4 Updated Aug 27, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,991 2,416 Updated Apr 2, 2026

microsoft / OmniParser

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 24,661 2,160 Updated Apr 13, 2026

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 5,441 543 Updated Apr 18, 2026

seal-rg / recurrent-pretraining

Pretraining and inference code for a large-scale depth-recurrent language model

Python 872 78 Updated Dec 29, 2025

gregorbachmann / Next-Token-Failures

Python 109 12 Updated Mar 12, 2024

simplescaling / s1

s1: Simple test-time scaling

Python 6,646 762 Updated Jun 25, 2025

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 13,053 1,583 Updated Feb 27, 2026

deepseek-ai / DeepSeek-R1

91,963 11,725 Updated Jun 27, 2025

Jihoon Tack jihoontack

Stars

sierra-research / tau2-bench

Zhiyuan-Zeng / RLVE

Wuyxin / collabllm

facebookresearch / sweet_rl

sunnweiwei / PPP-Agent

facebookresearch / collaborative-reasoner

furiosa-ai / ParallelBench

thinking-machines-lab / tinker-cookbook

koreainvestment / open-trading-api

Alibaba-NLP / DeepResearch

Mirix-AI / MIRIX