Skip to content
View gmittal's full-sized avatar

Block or report gmittal

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.

C++ 1,849 83 Updated Jan 4, 2026

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 2,234 239 Updated Aug 17, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,408 1,029 Updated Jul 1, 2024

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 906 50 Updated Sep 30, 2025

MLX: An array framework for Apple silicon

C++ 25,036 1,634 Updated Apr 3, 2026

The official Porsche Design System repository, offering fundamental UXI guidelines and a library of reusable web components to enable designers and developers to build consistent, intuitive, and hi…

TypeScript 593 42 Updated Apr 2, 2026
Python 31 7 Updated Jan 9, 2025

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 2,133 237 Updated Oct 16, 2025

Inference Llama 2 in one file of pure 🔥

Mojo 2,121 136 Updated Feb 9, 2026

Python pdb for multiple processes

Python 81 9 Updated May 24, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,931 605 Updated May 3, 2024

Inference code for CodeLlama models

Python 16,333 1,939 Updated Aug 12, 2024

[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674

Python 195 13 Updated Jun 14, 2023

Supercharge Your LLM Application Evaluations 🚀

Python 13,212 1,333 Updated Feb 24, 2026

commaVQ is a dataset of compressed driving video

Jupyter Notebook 366 70 Updated Mar 31, 2026

Tools for building GPU clusters

Shell 1,428 352 Updated Feb 23, 2026

LLMs for your CLI

Python 1,359 79 Updated May 29, 2024

A Data Streaming Library for Efficient Neural Network Training

Python 1,485 188 Updated Feb 2, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 75,173 15,133 Updated Apr 3, 2026

Universal memory layer for AI Agents

Python 51,881 5,804 Updated Apr 3, 2026

Write scalable load tests in plain Python 🚗💨

Python 27,676 3,195 Updated Apr 2, 2026

CUDA on non-NVIDIA GPUs

Rust 14,058 898 Updated Apr 1, 2026

It's React, but in Python

Python 8,146 330 Updated Feb 17, 2026

Implementation of Flash Attention in Jax

Python 227 25 Updated Mar 1, 2024

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Python 991 57 Updated Jan 30, 2024

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Python 844 63 Updated Jul 1, 2024

Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.

Python 733 126 Updated Jan 26, 2026

The Modular Platform (includes MAX & Mojo)

Mojo 25,833 2,793 Updated Apr 3, 2026

The Official Python Client for Lamini's API

Python 2,540 154 Updated Apr 7, 2025

Tiny data-over-sound library

C++ 7,583 453 Updated Mar 21, 2026
Next