jaeyong-song

🧭

Visiting

Jaeyong Song jaeyong-song

🧭

Visiting

Deep Learning and System

22 followers · 54 following

Grad Student@AISys, Seoul National University
Seoul, Korea
https://aisys.snu.ac.kr/members/JaeyongSong.html

Achievements

Highlights

Stars

NVIDIA-NeMo / Megatron-Bridge

Training library for Megatron-based models with bidirectional Hugging Face conversion capability

Python 732 365 Updated Jun 16, 2026

RyanCodrai / turbovec

A vector index built on TurboQuant, written in Rust with Python bindings

Python 11,733 1,018 Updated Jun 10, 2026

chnlee / TeamKorea

TeamKorea agent-reasoning solution for the MLSys 2026 scheduling contest (Track B)

Python 5 1 Updated May 26, 2026

TongmingLAIC / AKO4ALL

Agentic Kernel Optimization for All — automated GPU kernel optimization for any kernel, any hardware, any language

Python 294 20 Updated May 31, 2026

Fission-AI / OpenSpec

Spec-driven development (SDD) for AI coding assistants.

TypeScript 55,087 3,859 Updated Jun 13, 2026

HazyResearch / HipKittens

Fast and Furious AMD Kernels

C++ 433 66 Updated Jun 13, 2026

StarTrail-org / LEANN

[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

Python 11,983 1,072 Updated Jun 16, 2026

colbymchenry / codegraph

Pre-indexed code knowledge graph, auto syncs on code changes, for Claude Code, Codex, Gemini, Cursor, OpenCode, AntiGravity, Kiro, and Hermes Agent — fewer tokens, fewer tool calls, 100% local

TypeScript 50,129 3,066 Updated Jun 16, 2026

hemingkx / SpeculativeDecodingPapers

📰 Must-read papers and blogs on Speculative Decoding ⚡️

1,255 80 Updated Jun 2, 2026

AIS-SNU / GriNNder

[MLSys '26] GriNNder: Breaking the Memory Capacity Wall in Full-Graph GNN Training with Storage Offloading

Python 5 Updated May 10, 2026

RUC-NLPIR / FlashRAG

⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)

Python 3,505 306 Updated Apr 10, 2026

thustorage / GustANN

High-Throughput, Cost-Effective Billion-Scale Vector Search with a Single GPU [SIGMOD'26]

Cuda 27 5 Updated Apr 22, 2026

thc1006 / qwen3.6-speculative-decoding-rtx3090

First public benchmark of llama.cpp speculative decoding on Qwen3.6-35B-A3B with a single RTX 3090 (post PR #19493 merge, 2026-04-19). 19 configurations covering ngram-cache, ngram-mod, and classic…

Python 28 1 Updated May 16, 2026

mattpocock / skills

Skills for Real Engineers. Straight from my .claude directory.

Shell 131,322 11,436 Updated Jun 12, 2026

Alishahryar1 / free-claude-code

Use claude-code for free in the terminal, VSCode extension or discord like OpenClaw (voice supported)

Python 34,834 5,364 Updated Jun 12, 2026

qdrant / qdrant

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

Rust 32,371 2,384 Updated Jun 16, 2026

TUM-DSE / proteus

Heterogeneous FPGA virtualization

Shell 6 Updated Mar 3, 2026

scale-snu / layered-prefill

Layered prefill changes the scheduling axis from tokens to layers and removes redundant MoE weight reloads while keeping decode stall free. The result is lower TTFT, lower end-to-end latency, and l…

Python 17 2 Updated Mar 9, 2026

VIA-Research / SwarmIO

SwarmIO is an SSD emulation framework for next-generation GPU-centric storage systems research

C 49 1 Updated May 24, 2026

PKU-SEC-Lab / HybriMoE

[DAC'25] Official implement of "HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference"

Python 117 17 Updated Dec 15, 2025

opendatalab / MinerU

Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

Python 67,726 5,704 Updated Jun 15, 2026

sspec-project / SparseSpec

Accelerating Large-Scale Reasoning Model Inference with Sparse Self-Speculative Decoding

Python 109 8 Updated Dec 2, 2025

aloth / olcli

Overleaf CLI, library & MCP server — pull, push, sync, compile LaTeX projects. Use from terminal, import as TypeScript library, or connect AI agents via Model Context Protocol.

TypeScript 82 16 Updated Jun 13, 2026

ybq22 / supervisor

JavaScript 165 17 Updated Apr 7, 2026

gaoj0017 / RaBitQ

The repo has been moved to https://github.com/VectorDB-NTU/RaBitQ-Library. [SIGMOD 2024] RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor …

C++ 249 38 Updated Apr 22, 2026

ultraworkers / claw-code

An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.

Rust 193,900 109,957 Updated Jun 8, 2026

Huttysam / samsung-galaxybook-linux-unified

Unified configuration and drivers for running Linux on Samsung Galaxy Book with complete functionality. Combines galaxy-book2-pro-linux and samsung-galaxybook-extras repositories.

ASL 17 1 Updated Sep 7, 2025

alibaba / ServeGen

A framework for generating realistic LLM serving workloads

Python 153 14 Updated May 11, 2026

NVIDIA / tilus

Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.

Python 488 26 Updated Jun 11, 2026

Egonex-AI / Understand-Anything

Graphs that teach > graphs that impress. Turn any code into an interactive knowledge graph you can explore, search, and ask questions about. Works with Claude Code, Codex, Cursor, Copilot, Gemini C…

TypeScript 61,097 5,045 Updated Jun 16, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jaeyong Song jaeyong-song

Achievements

Achievements

Highlights

Block or report jaeyong-song

Stars

NVIDIA-NeMo / Megatron-Bridge

RyanCodrai / turbovec

chnlee / TeamKorea

TongmingLAIC / AKO4ALL

Fission-AI / OpenSpec

HazyResearch / HipKittens

StarTrail-org / LEANN

colbymchenry / codegraph

hemingkx / SpeculativeDecodingPapers

AIS-SNU / GriNNder

RUC-NLPIR / FlashRAG

thustorage / GustANN

thc1006 / qwen3.6-speculative-decoding-rtx3090

mattpocock / skills

Alishahryar1 / free-claude-code

qdrant / qdrant

TUM-DSE / proteus

scale-snu / layered-prefill

VIA-Research / SwarmIO

PKU-SEC-Lab / HybriMoE

opendatalab / MinerU

sspec-project / SparseSpec

aloth / olcli

ybq22 / supervisor

gaoj0017 / RaBitQ

ultraworkers / claw-code

Huttysam / samsung-galaxybook-linux-unified

alibaba / ServeGen

NVIDIA / tilus

Egonex-AI / Understand-Anything