Skip to content
View seshurajup's full-sized avatar

Block or report seshurajup

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
24 results for forked starred repositories written in Python
Clear filter

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,188 365 Updated Aug 14, 2025

Quantized inference code for LLaMA models

Python 1,046 100 Updated Mar 17, 2023

FuseAI Project

Python 583 37 Updated Jan 25, 2025

A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick

Python 293 25 Updated Nov 25, 2023

Fine-tuning "ImageBind One Embedding Space to Bind Them All" with LoRA

Python 192 17 Updated Dec 11, 2023

A PyTorch implementation of EfficientNet

Python 178 41 Updated Mar 25, 2023

Experimental fork of Facebooks LLaMa model which runs it with GPU acceleration on Apple Silicon M1/M2

Python 86 5 Updated Aug 29, 2023

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Python 75 16 Updated Aug 17, 2024

Fast and memory-efficient exact attention

Python 20 5 Updated Jul 22, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 13 2 Updated Oct 11, 2022

CE-GPPO: Controlling Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning

Python 12 Updated Oct 10, 2025

A super-fast Python implementation of seam carving algorithm for intelligent image resizing.

Python 9 1 Updated Aug 11, 2022

πŸ€— GIFT-SW for PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 6 1 Updated May 17, 2025

Retrieval via attention

Python 6 Updated Apr 22, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 6 6 Updated Apr 12, 2024

label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful

Python 4 Updated May 25, 2022

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Python 3 Updated Jan 2, 2025

πŸš€ Accelerate training and inference of πŸ€— Transformers and πŸ€— Diffusers with easy to use hardware optimization tools

Python 2 Updated Jul 5, 2023

DROJ, a DRO-inspired embedding-level jailbreak method.

Python 2 Updated Jan 26, 2025

Code for paper LAFITE: Towards Language-Free Training for Text-to-Image Generation (CVPR 2022)

Python 1 Updated Apr 17, 2022

8-bit CUDA functions for PyTorch

Python 1 Updated Oct 31, 2023

Optimizing Hyperparameters with Conformal Quantile Regression

Python 1 Updated May 15, 2023