Skip to content
View gmittal's full-sized avatar

Block or report gmittal

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
2907 results for source starred repositories
Clear filter

SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.

C++ 1,790 77 Updated Jun 16, 2025

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 2,121 228 Updated Aug 17, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,225 986 Updated Jul 1, 2024

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 897 50 Updated Sep 30, 2025

MLX: An array framework for Apple silicon

C++ 23,245 1,432 Updated Dec 24, 2025

The official Porsche Design System repository, offering fundamental UXI guidelines and a library of reusable web components to enable designers and developers to build consistent, intuitive, and hi…

TypeScript 560 39 Updated Dec 22, 2025
Python 31 7 Updated Jan 9, 2025

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 2,032 230 Updated Oct 16, 2025

Inference Llama 2 in one file of pure 🔥

Mojo 2,115 136 Updated Nov 30, 2025

Python pdb for multiple processes

Python 73 9 Updated May 24, 2025

[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674

Python 196 13 Updated Jun 14, 2023

Supercharge Your LLM Application Evaluations 🚀

Python 11,833 1,180 Updated Dec 24, 2025

commaVQ is a dataset of compressed driving video

Jupyter Notebook 339 61 Updated Oct 31, 2025

Tools for building GPU clusters

Shell 1,405 352 Updated Jun 30, 2025

LLMs for your CLI

Python 1,355 77 Updated May 29, 2024

A Data Streaming Library for Efficient Neural Network Training

Python 1,434 182 Updated Oct 27, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,109 12,166 Updated Dec 24, 2025

Universal memory layer for AI Agents

Python 44,637 4,853 Updated Dec 17, 2025

Write scalable load tests in plain Python 🚗💨

Python 27,271 3,155 Updated Dec 23, 2025

CUDA on non-NVIDIA GPUs

Rust 13,696 882 Updated Dec 19, 2025

It's React, but in Python

Python 8,148 333 Updated Dec 22, 2025

Implementation of Flash Attention in Jax

Python 222 24 Updated Mar 1, 2024

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Python 980 57 Updated Jan 30, 2024

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Python 840 64 Updated Jul 1, 2024

Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.

Python 712 121 Updated Dec 14, 2025

The Modular Platform (includes MAX & Mojo)

Mojo 25,380 2,745 Updated Dec 24, 2025

The Official Python Client for Lamini's API

Python 2,540 154 Updated Apr 7, 2025

Tiny data-over-sound library

C++ 7,397 431 Updated Aug 26, 2025

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,905 368 Updated Dec 7, 2024

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Python 948 38 Updated Mar 19, 2025
Next