seshurajup

SeshurajuP seshurajup

56 followers · 497 following

@dolcera
Hyderabad

Lists (2)

Sort

Information Retrieval

1 repository

Tuning

1 repository

Stars

24 results for forked starred repositories written in Python

Clear filter

deepspeedai / Megatron-DeepSpeed

Forked from NVIDIA/Megatron-LM

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,188 365 Updated Aug 14, 2025

tloen / llama-int8

Forked from meta-llama/llama

Quantized inference code for LLaMA models

Python 1,046 100 Updated Mar 17, 2023

fanqiwan / FuseAI

Forked from 18907305772/FuseAI

FuseAI Project

Python 583 37 Updated Jan 25, 2025

sanjeevanahilan / nanoChatGPT

Forked from karpathy/nanoGPT

A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick

Python 293 25 Updated Nov 25, 2023

fabawi / ImageBind-LoRA

Forked from facebookresearch/ImageBind

Fine-tuning "ImageBind One Embedding Space to Bind Them All" with LoRA

Python 192 17 Updated Dec 11, 2023

shijianjian / EfficientNet-PyTorch-3D

Forked from lukemelas/EfficientNet-PyTorch

A PyTorch implementation of EfficientNet

Python 178 41 Updated Mar 25, 2023

remixer-dec / llama-mps

Forked from markasoftware/llama-cpu

Experimental fork of Facebooks LLaMa model which runs it with GPU acceleration on Apple Silicon M1/M2

Python 86 5 Updated Aug 29, 2023

hamishivi / EasyLM

Forked from young-geng/EasyLM

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Python 75 16 Updated Aug 17, 2024

GuanghaoYe / Emergence-of-Thinking

Forked from OpenRLHF/OpenRLHF

Python 53 4 Updated Feb 11, 2025

lucidrains / flash-attention

Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python 20 5 Updated Jul 22, 2024

B06901052 / DeepSpeed

Forked from deepspeedai/DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 13 2 Updated Oct 11, 2022

Kwai-Klear / CE-GPPO

Forked from Kwai-Klear/KlearReasoner

CE-GPPO: Controlling Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning

Python 12 Updated Oct 10, 2025

yu4u / seam-carving

Forked from li-plus/seam-carving

A super-fast Python implementation of seam carving algorithm for intelligent image resizing.

Python 9 1 Updated Aug 11, 2022

On-Point-RND / GIFT_SW

Forked from huggingface/peft

🤗 GIFT-SW for PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 6 1 Updated May 17, 2025

manthan2305 / Efficient-G-Retriever

Forked from XiaoxinHe/G-Retriever

Retrieval via attention

Python 6 Updated Apr 22, 2025

OlivierDehaene / vllm

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 6 6 Updated Apr 12, 2024

chenghuige / pytorch-loss

Forked from CoinCheung/pytorch-loss

label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful

Python 4 Updated May 25, 2022

Timothyxxx / aguvis

Forked from xlang-ai/aguvis

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Python 3 Updated Jan 2, 2025

NielsRogge / optimum

Forked from huggingface/optimum

🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools

Python 2 Updated Jul 5, 2023

Leon-Leyang / embedding-level-jailbreak

Forked from chujiezheng/LLM-Safeguard

DROJ, a DRO-inspired embedding-level jailbreak method.

Python 2 Updated Jan 26, 2025

darraghdog / Kaggle-Carvana-Image-Masking-Challenge

Forked from petrosgk/Kaggle-Carvana-Image-Masking-Challenge

Python 1 Updated Sep 14, 2017

LouisCastricato / Lafite

Forked from drboog/Lafite

Code for paper LAFITE: Towards Language-Free Training for Text-to-Image Generation (CVPR 2022)

Python 1 Updated Apr 17, 2022

yxli2123 / bitsandbytes

Forked from bitsandbytes-foundation/bitsandbytes

8-bit CUDA functions for PyTorch

Python 1 Updated Oct 31, 2023

jgolebiowski / syne-tune-icml

Forked from geoalgo/syne-tune

Optimizing Hyperparameters with Conformal Quantile Regression

Python 1 Updated May 15, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly