llsj14

Sungjae Lee llsj14

I'm a software engineer at NAVER Cloud

32 followers · 28 following

llsj14.github.io

Stars

rtk-ai / rtk

CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies

Rust 63,512 3,907 Updated Jun 17, 2026

onyx-dot-app / onyx

Open Source AI Platform - AI Chat with advanced features that works with every LLM

Python 30,403 4,158 Updated Jun 18, 2026

LMCache / LMCache

LMCache: Supercharge Your LLM with the Fastest KV Cache Layer

Python 9,308 1,344 Updated Jun 18, 2026

stanfordnlp / dspy

DSPy: The framework for programming—not prompting—language models

Python 35,114 2,979 Updated Jun 16, 2026

wanshuiyin / Auto-claude-code-research-in-sleep

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…

Python 12,303 1,126 Updated Jun 18, 2026

vllm-project / recipes

Common recipes to run vLLM

JavaScript 864 306 Updated Jun 18, 2026

ai-dynamo / aiconfigurator

Offline optimization of your disaggregated Dynamo graph

Python 341 128 Updated Jun 18, 2026

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust 7,288 1,257 Updated Jun 18, 2026

osayamenja / FlashMoE

Distributed MoE in a Single Kernel [NeurIPS '25]

Cuda 268 38 Updated May 5, 2026

xcena-dev / maru

High-Performance KV Cache Storage Engine on CXL Shared Memory for LLM Inference

Python 52 4 Updated Jun 15, 2026

taco-project / FlexKV

Python 287 53 Updated Jun 18, 2026

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,608 859 Updated Jun 18, 2026

0xeb / TheBigPromptLibrary

A collection of prompts, system prompts and LLM instructions

HTML 5,140 696 Updated Feb 21, 2026

asgeirtj / system_prompts_leaks

Extracted system prompts from Anthropic - Claude Fable 5, Opus 4.8, Claude Code, Claude Design. OpenAI - ChatGPT 5.5 Thinking, GPT 5.5 Instant, Codex. Google - Gemini 3.5 Flash, 3.1 Pro, Antigravit…

JavaScript 43,264 7,171 Updated Jun 18, 2026

langchain-ai / open_deep_research

Python 11,745 1,674 Updated Jun 7, 2026

Dao-AILab / causal-conv1d

Causal depthwise conv1d in CUDA, with a PyTorch interface

Cuda 906 192 Updated May 9, 2026

fla-org / flash-linear-attention

🚀 Efficient implementations for emerging model architectures

Python 5,232 560 Updated Jun 18, 2026

gpu-mode / lectures

Material for gpu-mode lectures

Jupyter Notebook 6,189 623 Updated Jun 15, 2026

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,654 971 Updated Jun 17, 2026

vllm-project / tpu-inference

TPU inference for vLLM, with unified JAX and PyTorch support.

Python 355 215 Updated Jun 18, 2026

cornserve-ai / cornserve

Easy, Fast, and Scalable Multimodal AI

Python 126 10 Updated Jun 2, 2026

snu-mllab / KVzip

[NeurIPS'25 Oral] Query-agnostic KV cache eviction: 3–4× reduction in memory and 2× decrease in latency (Qwen3/2.5, Gemma3, LLaMA3)

Python 221 13 Updated Feb 11, 2026

wafer-ai / gpu-perf-engineering-resources

A curriculum for learning about gpu performance engineering, from scratch to what the frontier AI labs do

827 102 Updated Apr 27, 2026

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 9,741 1,289 Updated Jun 15, 2026

langchain-ai / open-swe

An Open-Source Asynchronous Coding Agent

Python 10,003 1,137 Updated Jun 18, 2026

microsoft / TinyTroupe

LLM-powered multiagent persona simulation for imagination enhancement and business insights.

Jupyter Notebook 7,476 662 Updated May 7, 2026

NVlabs / ToolOrchestra

ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.

Python 740 102 Updated Mar 25, 2026

snowflakedb / ArcticInference

ArcticInference: vLLM plugin for high-throughput, low-latency inference

Python 451 64 Updated Jun 17, 2026

openelb / openelb

Load Balancer Implementation for Kubernetes in Bare-Metal, Edge, and Virtualization

Go 1,776 209 Updated May 26, 2025

aws-neuron / deep-learning-containers

AWS Neuron Deep Learning Containers (DLCs) are a set of Docker images for training and serving models on AWS Trainium and Inferentia instances using AWS Neuron SDK.

Python 22 12 Updated May 22, 2026
