Skip to content
View omkaark's full-sized avatar
  • 11:59 (UTC -05:00)

Highlights

  • Pro

Block or report omkaark

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for "EgoX: Egocentric Video Generation from a Single Exocentric Video"

Python 608 36 Updated Jan 15, 2026

GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 training.

Python 323 29 Updated Nov 11, 2025

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Python 10,644 825 Updated Dec 4, 2024

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 9,658 957 Updated Feb 13, 2026

slime is an LLM post-training framework for RL Scaling.

Python 4,037 522 Updated Feb 13, 2026

TheBoringNotch: Not so boring notch That Rocks 🎸🎶

Swift 7,033 492 Updated Feb 8, 2026

Anthropic's original performance take-home, now open for you to try!

Python 3,434 753 Updated Jan 22, 2026

Fetch source code for npm packages to give AI coding agents deeper context

TypeScript 384 20 Updated Jan 26, 2026

Browser automation CLI for AI agents

TypeScript 13,976 810 Updated Feb 13, 2026

Minimal Claude Code alternative. Single Python file, zero dependencies, ~250 lines.

Python 1,959 175 Updated Jan 14, 2026

~950 line, minimal, extensible LLM inference engine built from scratch.

Python 420 33 Updated Jan 9, 2026

MoE training for Me and You and maybe other people

Python 354 29 Updated Feb 7, 2026

My learning notes for ML SYS.

Python 5,326 345 Updated Jan 30, 2026

kernels, of the mega variety

Python 672 44 Updated Jan 29, 2026

A framework for the evaluation of autoregressive code generation language models.

Python 1,019 253 Updated Jul 22, 2025

Code for the paper "Efficient Training of Language Models to Fill in the Middle"

Python 197 45 Updated Apr 2, 2023

Code for the paper "Evaluating Large Language Models Trained on Code"

Python 3,129 437 Updated Jan 17, 2025

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 1,450 121 Updated Nov 13, 2025

NanoGPT (124M) in 2 minutes

Python 4,619 623 Updated Feb 11, 2026

Official repository for LiteTracker: Leveraging Temporal Causality for Accurate Low-latency Tissue Tracking; published at MICCAI 2025.

Python 204 12 Updated Nov 12, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,172 818 Updated Feb 3, 2026

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 97,382 26,851 Updated Feb 13, 2026

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,613 4,719 Updated Feb 13, 2026

a small protein language model based off of nanochat

Python 2 Updated Oct 20, 2025

PyTorch native quantization and sparsity for training and inference

Python 2,685 430 Updated Feb 13, 2026

FlashAttention written in metal-cpp headers

Makefile 5 1 Updated Feb 12, 2026

The Modular Platform (includes MAX & Mojo)

Mojo 25,580 2,776 Updated Feb 13, 2026

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,651 464 Updated Oct 27, 2025

A PyTorch native platform for training generative AI models

Python 5,066 703 Updated Feb 13, 2026

PyTorch building blocks for the OLMo ecosystem

Python 798 139 Updated Feb 13, 2026
Next