Skip to content
View gmittal's full-sized avatar

Block or report gmittal

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.

C++ 1,858 82 Updated Jan 4, 2026

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 2,275 242 Updated Aug 17, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,458 1,038 Updated Jul 1, 2024

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 904 51 Updated Sep 30, 2025

MLX: An array framework for Apple silicon

C++ 25,825 1,731 Updated Apr 28, 2026

The official Porsche Design System repository, offering fundamental UXI guidelines and a library of reusable web components to enable designers and developers to build consistent, intuitive, and hi…

TypeScript 602 44 Updated Apr 28, 2026
Python 31 7 Updated Jan 9, 2025

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 2,163 242 Updated Oct 16, 2025

Inference Llama 2 in one file of pure 🔥

Mojo 2,120 136 Updated Feb 9, 2026

Python pdb for multiple processes

Python 82 9 Updated May 24, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,951 609 Updated May 3, 2024

Inference code for CodeLlama models

Python 16,335 1,941 Updated Aug 12, 2024

[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674

Python 195 13 Updated Jun 14, 2023

Supercharge Your LLM Application Evaluations 🚀

Python 13,702 1,381 Updated Feb 24, 2026

commaVQ is a dataset of compressed driving video

Jupyter Notebook 370 74 Updated Mar 31, 2026

Tools for building GPU clusters

Shell 1,430 350 Updated Apr 27, 2026

LLMs for your CLI

Python 1,362 79 Updated May 29, 2024

A Data Streaming Library for Efficient Neural Network Training

Python 1,499 188 Updated Feb 2, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 78,463 16,202 Updated Apr 28, 2026

Universal memory layer for AI Agents

Python 54,308 6,118 Updated Apr 28, 2026

Write scalable load tests in plain Python 🚗💨

Python 27,742 3,205 Updated Apr 28, 2026

CUDA on non-NVIDIA GPUs

Rust 14,156 901 Updated Apr 28, 2026

It's React, but in Python

Python 8,143 328 Updated Apr 18, 2026

Implementation of Flash Attention in Jax

Python 228 25 Updated Mar 1, 2024

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Python 998 58 Updated Jan 30, 2024

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Python 843 64 Updated Jul 1, 2024

Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.

Python 735 127 Updated Jan 26, 2026

The Modular Platform (includes MAX & Mojo)

Mojo 25,913 2,804 Updated Apr 27, 2026

The Official Python Client for Lamini's API

Python 2,538 153 Updated Apr 7, 2025

Tiny data-over-sound library

C++ 7,629 454 Updated Apr 16, 2026
Next