Skip to content
View gmittal's full-sized avatar

Block or report gmittal

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.

C++ 1,786 76 Updated Jun 16, 2025

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 2,117 228 Updated Aug 17, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,219 985 Updated Jul 1, 2024

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 896 50 Updated Sep 30, 2025

MLX: An array framework for Apple silicon

C++ 23,180 1,425 Updated Dec 21, 2025

The official Porsche Design System repository, offering fundamental UXI guidelines and a library of reusable web components to enable designers and developers to build consistent, intuitive, and hi…

TypeScript 559 39 Updated Dec 19, 2025
Python 31 7 Updated Jan 9, 2025

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 2,025 229 Updated Oct 16, 2025

Inference Llama 2 in one file of pure 🔥

Mojo 2,115 136 Updated Nov 30, 2025

Python pdb for multiple processes

Python 73 9 Updated May 24, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,837 583 Updated May 3, 2024

Inference code for CodeLlama models

Python 16,373 1,944 Updated Aug 12, 2024

[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674

Python 196 13 Updated Jun 14, 2023

Supercharge Your LLM Application Evaluations 🚀

Python 11,800 1,178 Updated Dec 20, 2025

commaVQ is a dataset of compressed driving video

Jupyter Notebook 339 60 Updated Oct 31, 2025

Tools for building GPU clusters

Shell 1,404 352 Updated Jun 30, 2025

LLMs for your CLI

Python 1,352 77 Updated May 29, 2024

A Data Streaming Library for Efficient Neural Network Training

Python 1,433 181 Updated Oct 27, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,871 12,102 Updated Dec 21, 2025

Universal memory layer for AI Agents

Python 44,517 4,836 Updated Dec 17, 2025

Write scalable load tests in plain Python 🚗💨

Python 27,250 3,151 Updated Dec 19, 2025

CUDA on non-NVIDIA GPUs

Rust 13,682 880 Updated Dec 19, 2025

It's React, but in Python

Python 8,145 333 Updated Dec 15, 2025

Implementation of Flash Attention in Jax

Python 222 24 Updated Mar 1, 2024

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Python 980 57 Updated Jan 30, 2024

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Python 839 63 Updated Jul 1, 2024

Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.

Python 712 121 Updated Dec 14, 2025

The Modular Platform (includes MAX & Mojo)

Mojo 25,369 2,742 Updated Dec 21, 2025

The Official Python Client for Lamini's API

Python 2,540 154 Updated Apr 7, 2025

Tiny data-over-sound library

C++ 7,393 431 Updated Aug 26, 2025
Next