View jeffra's full-sized avatar

Highlights

  • Pro

Organizations

@brownsys


ArcticInference: vLLM plugin for high-throughput, low-latency inference

Python 451 64 Updated Jun 17, 2026

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 161,704 33,540 Updated Jun 18, 2026

ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)

Python 288 40 Updated Jun 12, 2026
Jupyter Notebook 93 25 Updated Mar 8, 2025

Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines

Python 197 13 Updated May 6, 2024

Machine Learning Engineering Open Book

Python 18,142 1,152 Updated May 18, 2026

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 21,284 2,337 Updated Jun 18, 2026

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 2,105 191 Updated Jun 30, 2025

Pretrained language model with 100B parameters

Python 3,755 290 Updated Jul 10, 2023

Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2

Python 3,385 681 Updated Dec 16, 2025

Code release for SLIP Self-supervision meets Language-Image Pre-training

Python 792 74 Updated Feb 9, 2023

Azure HPC/AI VM Images

Shell 128 109 Updated Jun 17, 2026

Library for 8-bit optimizers and quantization routines.

780 47 Updated Aug 18, 2022

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,446 226 Updated Mar 20, 2024

Distribution transparent Machine Learning experiments on Apache Spark

Python 91 13 Updated Feb 21, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,442 1,116 Updated Jun 11, 2026

Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpeed

Python 436 74 Updated Jun 14, 2023

Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.

Python 485 93 Updated Nov 24, 2025

RDMA and SHARP plugins for nccl library

C 232 43 Updated Apr 3, 2026

Example models using DeepSpeed

Python 6,823 1,119 Updated May 20, 2026

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 42,541 4,860 Updated Jun 18, 2026

A minimal & modern LaTeX template for your (bachelor's | master's | doctoral) thesis

TeX 1,228 137 Updated May 18, 2026

Find the smallest number of switches necessary to build topologies of a given number of hosts and bisection bandwidth for the EGFT, HyperX, and Jellyfish topologies.

Python 2 Updated Jul 24, 2013