Mirix is a multi-agent personal assistant designed to track on-screen activities and answer user questions intelligently. By capturing real-time visual data and consolidating it into structured mem…

Python 3,528 281 Updated Apr 28, 2026

Memento-Teams / Memento

Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs

Python 2,411 285 Updated Oct 5, 2025

bowen-upenn / PersonaMem

[COLM 2025] Know Me, Respond to Me: Benchmarking LLMs for Dynamic User Profiling and Personalized Responses at Scale

Python 136 8 Updated Mar 19, 2026

younggyoseo / FastTD3

Python 442 47 Updated Oct 12, 2025

microsoft / magentic-ui

A research prototype of a human-centered web agent

Python 9,802 976 Updated Apr 15, 2026

NVIDIA / Isaac-GR00T

NVIDIA Isaac GR00T N1.7 - A Foundation Model for Generalist Robots.

Python 6,905 1,167 Updated Apr 26, 2026

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 4,612 411 Updated Nov 13, 2025

woominsong / Simba

Official implementation of Sparsified State-Space Models are Efficient Highway Networks (TMLR 2025).

3 Updated Mar 6, 2025

verl-project / verl

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21,034 3,782 Updated Apr 30, 2026

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 78,704 16,302 Updated Apr 30, 2026

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 2,093 119 Updated Jun 2, 2025

goddoe / RLYX

A hackable, simple, and reseach-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.

Python 37 4 Updated Aug 27, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 26,012 2,421 Updated Apr 2, 2026

microsoft / OmniParser

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 24,707 2,163 Updated Apr 13, 2026

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 5,463 547 Updated Apr 30, 2026

seal-rg / recurrent-pretraining

Pretraining and inference code for a large-scale depth-recurrent language model

Python 879 80 Updated Dec 29, 2025

gregorbachmann / Next-Token-Failures

Python 111 12 Updated Mar 12, 2024

simplescaling / s1

s1: Simple test-time scaling

Python 6,650 761 Updated Jun 25, 2025

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 13,078 1,588 Updated Feb 27, 2026

deepseek-ai / DeepSeek-R1

92,011 11,732 Updated Jun 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jihoon Tack jihoontack

Achievements

Achievements

Highlights

Block or report jihoontack

Stars

sierra-research / tau2-bench

Zhiyuan-Zeng / RLVE

Wuyxin / collabllm

facebookresearch / sweet_rl

sunnweiwei / PPP-Agent

facebookresearch / collaborative-reasoner

furiosa-ai / ParallelBench

thinking-machines-lab / tinker-cookbook

koreainvestment / open-trading-api

Alibaba-NLP / DeepResearch

Mirix-AI / MIRIX