Mirix is a multi-agent personal assistant designed to track on-screen activities and answer user questions intelligently. By capturing real-time visual data and consolidating it into structured mem…

Python 3,512 280 Updated Mar 12, 2026

Memento-Teams / Memento

Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs

Python 2,355 275 Updated Oct 5, 2025

bowen-upenn / PersonaMem

[COLM 2025] Know Me, Respond to Me: Benchmarking LLMs for Dynamic User Profiling and Personalized Responses at Scale

Python 126 7 Updated Mar 19, 2026

younggyoseo / FastTD3

Python 433 47 Updated Oct 12, 2025

microsoft / magentic-ui

A research prototype of a human-centered web agent

Python 9,749 977 Updated Mar 21, 2026

NVIDIA / Isaac-GR00T

NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.

Jupyter Notebook 6,497 1,084 Updated Mar 16, 2026

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 4,288 369 Updated Nov 13, 2025

woominsong / Simba

Official implementation of Sparsified State-Space Models are Efficient Highway Networks (TMLR 2025).

2 Updated Mar 6, 2025

verl-project / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,162 3,489 Updated Mar 24, 2026

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 74,147 14,707 Updated Mar 24, 2026

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 2,088 119 Updated Jun 2, 2025

goddoe / RLYX

A hackable, simple, and research-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.

Python 37 4 Updated Aug 27, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,962 2,416 Updated Nov 24, 2025

microsoft / OmniParser

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 24,567 2,147 Updated Sep 12, 2025

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 5,276 523 Updated Mar 24, 2026

seal-rg / recurrent-pretraining

Pretraining and inference code for a large-scale depth-recurrent language model

Python 867 77 Updated Dec 29, 2025

gregorbachmann / Next-Token-Failures

Python 109 11 Updated Mar 12, 2024

simplescaling / s1

s1: Simple test-time scaling

Python 6,650 765 Updated Jun 25, 2025

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 12,975 1,583 Updated Feb 27, 2026

deepseek-ai / DeepSeek-R1

91,980 11,747 Updated Jun 27, 2025

Jihoon Tack jihoontack

Stars

sierra-research / tau2-bench

Zhiyuan-Zeng / RLVE

Wuyxin / collabllm

facebookresearch / sweet_rl

sunnweiwei / PPP-Agent

facebookresearch / collaborative-reasoner

furiosa-ai / ParallelBench

thinking-machines-lab / tinker-cookbook

koreainvestment / open-trading-api

Alibaba-NLP / DeepResearch

Mirix-AI / MIRIX