Skip to content
View gohar94's full-sized avatar

Highlights

  • Pro

Block or report gohar94

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 32,026 4,012 Updated Mar 31, 2026

NVIDIA Linux open GPU kernel module source

C 16,845 1,636 Updated Mar 24, 2026

Official JAX implementation of End-to-End Test-Time Training for Long Context

Python 573 38 Updated Feb 15, 2026
316 28 Updated Feb 26, 2026

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C++ 1,360 183 Updated Mar 12, 2026
Jupyter Notebook 23 2 Updated May 18, 2025

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 1,345 85 Updated Jul 14, 2024

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,883 542 Updated Mar 13, 2026

Nano vLLM

Python 12,611 1,833 Updated Nov 3, 2025

Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.

Python 58,750 4,981 Updated Mar 31, 2026

Resource Multiplexing in Tuning and Serving Large Language Models (USENIX ATC 2025)

Python 8 5 Updated May 16, 2025

Naive attempt at implementing TTT paper by letting autograd do the heavy lifting

Python 8 Updated Feb 20, 2026

Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch

Python 1,936 203 Updated Feb 9, 2026

dspy-cli is a tool for creating, developing, testing, and deploying DSPy programs as HTTP APIs.

Python 123 9 Updated Mar 3, 2026

Kernel Tuner

Python 387 64 Updated Mar 31, 2026

NVIDIA Linux open GPU with P2P support

C 12 1 Updated Jan 6, 2026

Artifact from "Hardware Compute Partitioning on NVIDIA GPUs". THIS IS A FORK OF BAKITAS REPO. I AM NOT ONE OF THE AUTHORS OF THE PAPER.

C 59 5 Updated Nov 24, 2025

Dynamic Memory Management for Serving LLMs without PagedAttention

C 466 39 Updated May 30, 2025

The official implementation of the ICML 2024 paper "MemoryLLM: Towards Self-Updatable Large Language Models" and "M+: Extending MemoryLLM with Scalable Long-Term Memory"

Python 303 28 Updated Jul 28, 2025

The open-source RAG platform: built-in citations, deep research, 22+ file formats, partitions, MCP server, and more.

TypeScript 1,934 172 Updated Mar 21, 2026

The best ChatGPT that $100 can buy.

Python 50,756 6,658 Updated Mar 27, 2026

Contexts Optical Compression

Python 22,773 2,095 Updated Jan 27, 2026

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,393 774 Updated Mar 30, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,451 976 Updated Mar 31, 2026

Train transformer language models with reinforcement learning.

Python 17,855 2,597 Updated Mar 31, 2026

A framework for optimizing DSPy programs with RL

Python 329 28 Updated Jan 12, 2026

Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"

Python 345 32 Updated Nov 10, 2025
Next