Skip to content
View pongib's full-sized avatar

Block or report pongib

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,340 210 Updated May 17, 2026

A lightweight `vLLM-Omni`-style diffusion implementation built around `Wan2.2-TI2V-5B-Diffusers` inspired from nano-vllm

Python 37 3 Updated May 1, 2026

Skills for Real Engineers. Straight from my .claude directory.

Shell 88,326 7,707 Updated May 13, 2026

eLLM can infer LLM on CPUs faster than on GPUs

Rust 411 41 Updated May 17, 2026

Puzzles for learning Triton

Jupyter Notebook 2,440 229 Updated Apr 1, 2026

Use Garry Tan's exact Claude Code setup: 23 opinionated tools that serve as CEO, Designer, Eng Manager, Release Manager, Doc Engineer, and QA

TypeScript 98,375 14,642 Updated May 17, 2026

The agent that grows with you

Python 154,370 24,711 Updated May 17, 2026

Deploy intelligence. Open-source infrastructure for AI agents in production.

18 4 Updated May 3, 2026

Download market data from Yahoo! Finance's API

Python 23,684 3,260 Updated May 14, 2026

A framework for efficient model inference with omni-modality models

Python 4,787 935 Updated May 17, 2026

Chrome DevTools for coding agents

TypeScript 39,821 2,533 Updated May 17, 2026

Production-grade engineering skills for AI coding agents.

Shell 42,763 4,706 Updated May 16, 2026

AI agents running research on single-GPU nanochat training automatically

Python 81,492 11,848 Updated Mar 26, 2026

Autoresearch for GPU kernels. Give it any PyTorch model, go to sleep, wake up to optimized Triton kernels.

Python 1,364 133 Updated Mar 19, 2026

🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× vs cuBLAS

Cuda 109 2 Updated Sep 8, 2025

A curriculum for learning about gpu performance engineering, from scratch to what the frontier AI labs do

674 76 Updated Apr 27, 2026

An ML Systems Onboarding list

1,069 41 Updated Feb 19, 2026

Tri-cognitive Agentic Framework

Go 3 Updated Mar 20, 2026

Small scale distributed training of sequential deep learning models, built on Numpy and MPI.

Python 165 10 Updated Oct 19, 2023

Build compute kernels and load them from the Hub.

Python 641 89 Updated May 17, 2026

Implement a reasoning LLM in PyTorch from scratch, step by step

Jupyter Notebook 4,357 633 Updated May 17, 2026
Python 1 Updated Jan 11, 2026

Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.

Python 941 102 Updated Feb 27, 2026

Learn CUDA with PyTorch

Cuda 301 44 Updated May 13, 2026

TTS model capable of streaming conversational audio in realtime.

Python 1,121 97 Updated Nov 29, 2025

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.

Python 3,151 281 Updated Jul 7, 2025

"AI-Trader: 100% Fully-Automated Agent-Native Trading"

Python 17,750 2,721 Updated May 13, 2026

FlashInfer: Kernel Library for LLM Serving

Python 5,624 977 Updated May 17, 2026
Next