sarckk

🎯

Yong Hoon Shin sarckk

🎯

optimist

51 followers · 83 following

Achievements

x3 x2

Achievements

x3 x2

Organizations

Stars

facebookresearch / tribev2

This repository contains the code to train and evaluate TRIBE v2, a multimodal model for brain response prediction

Jupyter Notebook 2,623 583 Updated May 11, 2026

qlabs-eng / slowrun

100M tokens. Infinite compute. Lowest val loss wins.

Python 467 66 Updated May 14, 2026

GeeeekExplorer / nano-vllm

Nano vLLM

Python 13,472 2,101 Updated Apr 26, 2026

Noumena-Network / nmoe

MoE training for Me and You and maybe other people

Python 386 33 Updated Mar 15, 2026

gpu-mode / resource-stream

GPU programming related news and material links

2,133 126 Updated Mar 8, 2026

ModelEngine-Group / unified-cache-management

Persist and reuse KV Cache to speedup your LLM.

Python 277 74 Updated May 15, 2026

NVIDIA-developer-blog / code-samples

Source code examples from the Parallel Forall Blog

HTML 1,330 643 Updated Sep 23, 2025

ISEEKYAN / mbridge

Bridge Megatron-Core to Hugging Face/Reinforcement Learning

Python 211 72 Updated May 17, 2026

thinking-machines-lab / batch_invariant_ops

Python 1,012 77 Updated Nov 4, 2025

meta-pytorch / monarch

PyTorch Single Controller

Rust 1,033 162 Updated May 18, 2026

a-ghorbani / pocketpal-ai

An app that brings language models directly to your phone.

TypeScript 6,975 699 Updated May 17, 2026

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 20,100 2,078 Updated Mar 27, 2026

pytorch / helion

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 864 146 Updated May 17, 2026

maxbbraun / accent

Accent Smart Picture Frame

Python 207 17 Updated May 7, 2026

LMCache / LMCache

Supercharge Your LLM with the Fastest KV Cache Layer

Python 8,283 1,179 Updated May 18, 2026

Infini-AI-Lab / TriForce

[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

Python 278 20 Updated Aug 31, 2024

facebookresearch / locate-3d

Open source repo for Locate 3D Model, 3D-JEPA and Locate 3D Dataset

Python 443 52 Updated Jun 3, 2025

shadowpa0327 / Palu

[ICLR 2025] Palu: Compressing KV-Cache with Low-Rank Projection

Python 157 16 Updated Feb 20, 2025

DreamLM / Dream

Dream 7B, a large diffusion language model

Python 1,238 77 Updated Nov 21, 2025

Zefan-Cai / KVCache-Factory

Unified KV Cache Compression Methods for Auto-Regressive Models

Python 1,335 170 Updated Jan 4, 2025

sgl-project / sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 27,933 5,958 Updated May 18, 2026

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 80,283 16,883 Updated May 18, 2026

microsoft / vattention

Dynamic Memory Management for Serving LLMs without PagedAttention

C 485 41 Updated May 30, 2025

HuangOwen / Awesome-LLM-Compression

Awesome LLM compression research papers and tools.

1,832 126 Updated Feb 23, 2026

facebookresearch / vggt

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 13,119 1,460 Updated May 16, 2026

hemingkx / SpeculativeDecodingPapers

📰 Must-read papers and blogs on Speculative Decoding ⚡️

1,217 76 Updated May 11, 2026

facebookresearch / HolisticTraceAnalysis

A library to analyze PyTorch traces.

Python 517 92 Updated May 13, 2026

horseee / Awesome-Efficient-LLM

A curated list for Efficient Large Language Models

Python 2,009 166 Updated Jun 17, 2025

facebookresearch / dlrm

An implementation of a deep learning recommendation model (DLRM)

Python 4,037 865 Updated Jan 12, 2026

NVIDIA / Cosmos

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

8,097 512 Updated Jan 6, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Yong Hoon Shin sarckk

Achievements

Achievements

Organizations

Block or report sarckk

Stars

facebookresearch / tribev2

qlabs-eng / slowrun

GeeeekExplorer / nano-vllm

Noumena-Network / nmoe

gpu-mode / resource-stream

ModelEngine-Group / unified-cache-management

NVIDIA-developer-blog / code-samples

ISEEKYAN / mbridge

thinking-machines-lab / batch_invariant_ops

meta-pytorch / monarch

a-ghorbani / pocketpal-ai

openai / gpt-oss

pytorch / helion

maxbbraun / accent

LMCache / LMCache

Infini-AI-Lab / TriForce

facebookresearch / locate-3d

shadowpa0327 / Palu

DreamLM / Dream

Zefan-Cai / KVCache-Factory

sgl-project / sglang

vllm-project / vllm

microsoft / vattention

HuangOwen / Awesome-LLM-Compression

facebookresearch / vggt

hemingkx / SpeculativeDecodingPapers

facebookresearch / HolisticTraceAnalysis

horseee / Awesome-Efficient-LLM

facebookresearch / dlrm

NVIDIA / Cosmos