Skip to content
View pongib's full-sized avatar

Block or report pongib

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,586 267 Updated Jun 19, 2026

A lightweight `vLLM-Omni`-style diffusion implementation built around `Wan2.2-TI2V-5B-Diffusers` inspired from nano-vllm

Python 51 5 Updated May 25, 2026

Skills for Real Engineers. Straight from my .claude directory.

Shell 135,948 11,781 Updated Jun 18, 2026

eLLM can infer LLM on CPUs faster than on GPUs

Rust 426 43 Updated Jun 18, 2026

Puzzles for learning Triton

Jupyter Notebook 2,492 238 Updated Apr 1, 2026

Use Garry Tan's exact Claude Code setup: 23 opinionated tools that serve as CEO, Designer, Eng Manager, Release Manager, Doc Engineer, and QA

TypeScript 111,307 16,554 Updated Jun 18, 2026

The agent that grows with you

Python 197,235 34,869 Updated Jun 19, 2026

Deploy intelligence. Open-source infrastructure for AI agents in production.

21 4 Updated Jun 17, 2026

Download market data from Yahoo! Finance's API

Python 24,327 3,328 Updated Jun 17, 2026

A framework for efficient model inference with omni-modality models

Python 5,204 1,139 Updated Jun 19, 2026

Chrome DevTools for coding agents

TypeScript 43,976 2,831 Updated Jun 19, 2026

Production-grade engineering skills for AI coding agents.

Shell 63,099 6,845 Updated Jun 19, 2026

AI agents running research on single-GPU nanochat training automatically

Python 87,616 12,670 Updated Mar 26, 2026

Autoresearch for GPU kernels. Give it any PyTorch model, go to sleep, wake up to optimized Triton kernels.

Python 1,417 143 Updated Mar 19, 2026

🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× vs cuBLAS

Cuda 111 2 Updated Sep 8, 2025

A curriculum for learning about gpu performance engineering, from scratch to what the frontier AI labs do

830 102 Updated Apr 27, 2026

An ML Systems Onboarding list

1,087 43 Updated Feb 19, 2026

Tri-cognitive Agentic Framework

Go 3 Updated Mar 20, 2026

Small scale distributed training of sequential deep learning models, built on Numpy and MPI.

Python 165 10 Updated Oct 19, 2023

Build compute kernels and load them from the Hub.

Python 697 105 Updated Jun 19, 2026

Implement a reasoning LLM in PyTorch from scratch, step by step

Jupyter Notebook 4,537 669 Updated Jun 12, 2026
Python 1 Updated Jan 11, 2026

Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.

Python 971 105 Updated Feb 27, 2026

Learn CUDA with PyTorch

Cuda 336 50 Updated Jun 1, 2026

TTS model capable of streaming conversational audio in realtime.

Python 1,145 98 Updated Nov 29, 2025

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.

Python 3,159 284 Updated Jul 7, 2025

"AI-Trader: 100% Fully-Automated Agent-Native Trading"

Python 19,858 3,031 Updated Jun 11, 2026

FlashInfer: Kernel Library for LLM Serving

Python 5,823 1,061 Updated Jun 19, 2026
Next