Find the local LLM that actually runs and performs best on your hardware. Ranked by real, recency-aware benchmarks, not parameter count. One command, run it instantly.

Python 1,265 54 Updated May 18, 2026

A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗

Python 1,459 113 Updated May 18, 2026

Algorithm powering the For You feed on X

Rust 25,268 4,310 Updated May 15, 2026

🚀2.3x faster than MinIO for 4KB object payloads. RustFS is an open-source, S3-compatible high-performance object storage system supporting migration and coexistence with other S3-compatible platfor…

Rust 27,710 1,201 Updated May 18, 2026

Development repository for the Triton language and compiler

MLIR 19,210 2,856 Updated May 18, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,671 2,386 Updated May 18, 2026

Open source framework to vibecode and prototype voice agents with Gradium APIs

Rust 74 17 Updated May 18, 2026

Review-first terminal diff viewer for agentic coders

TypeScript 4,181 94 Updated May 18, 2026

🎨 NeMo Data Designer: Generate high-quality synthetic data from scratch or from seed data.

Python 1,865 170 Updated May 18, 2026

Agentic RL Training at Scale

Python 1,383 292 Updated May 18, 2026

High-performance linear attention kernel library built on TileLang

Python 492 38 Updated May 7, 2026

Model Express is a Rust-based component meant to be placed next to existing model inference systems to speed up their startup times and improve overall performance.

Rust 63 23 Updated May 18, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,807 1,112 Updated May 18, 2026

Desktop app to manage markdown knowledge bases

TypeScript 11,015 795 Updated May 18, 2026

Evaluate and improve models and agents using environments

Python 903 147 Updated May 18, 2026

Harbor is a framework for running agent evaluations and creating and using RL environments.

Python 1,993 1,032 Updated May 18, 2026

A benchmark for LLMs on complicated tasks in the terminal

Python 2,218 515 Updated Jan 22, 2026

Scalable toolkit for efficient model reinforcement

Python 1,636 386 Updated May 18, 2026

Agentic RL on Any Harness at Scale

Python 141 22 Updated May 18, 2026

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 2,118 148 Updated Apr 3, 2025

🎥 Make videos programmatically with React

TypeScript 47,261 3,270 Updated May 18, 2026

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,901 1,047 Updated May 7, 2026

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,954 325 Updated Jan 14, 2026

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 954 84 Updated Feb 28, 2026

PyTorch building blocks for the OLMo ecosystem

Python 1,226 241 Updated May 18, 2026

Open-source framework for the research and development of foundation models.

Python 979 116 Updated May 18, 2026

Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, datasets, and full end-to-end reference examples to build with Nemotron models

Jupyter Notebook 1,104 238 Updated May 18, 2026

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 2,686 402 Updated May 18, 2026

The agent that grows with you

Python 156,352 25,135 Updated May 18, 2026