Stars
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
A lightweight `vLLM-Omni`-style diffusion implementation built around `Wan2.2-TI2V-5B-Diffusers`, inspired by nano-vllm
Skills for Real Engineers. Straight from my .claude directory.
eLLM runs LLM inference on CPUs faster than on GPUs
Puzzles for learning Triton
Use Garry Tan's exact Claude Code setup: 23 opinionated tools that serve as CEO, Designer, Eng Manager, Release Manager, Doc Engineer, and QA
The agent that grows with you
Deploy intelligence. Open-source infrastructure for AI agents in production.
Download market data from Yahoo! Finance's API
A framework for efficient model inference with omni-modality models
Chrome DevTools for coding agents
Production-grade engineering skills for AI coding agents.
AI agents that automatically run research on single-GPU nanochat training
Autoresearch for GPU kernels. Give it any PyTorch model, go to sleep, wake up to optimized Triton kernels.
🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× vs cuBLAS
A curriculum for learning GPU performance engineering, from scratch to what the frontier AI labs do
Small-scale distributed training of sequential deep learning models, built on NumPy and MPI.
Build compute kernels and load them from the Hub.
Implement a reasoning LLM in PyTorch from scratch, step by step
Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.
TTS model capable of streaming conversational audio in realtime.
MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.
AI-Trader: 100% Fully-Automated Agent-Native Trading
FlashInfer: Kernel Library for LLM Serving