Stars
deepspeedai / Megatron-DeepSpeed
Forked from NVIDIA/Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
tloen / llama-int8
Forked from meta-llama/llama
Quantized inference code for LLaMA models
sanjeevanahilan / nanoChatGPT
Forked from karpathy/nanoGPT
A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick
fabawi / ImageBind-LoRA
Forked from facebookresearch/ImageBind
Fine-tuning "ImageBind One Embedding Space to Bind Them All" with LoRA
A PyTorch implementation of EfficientNet
remixer-dec / llama-mps
Forked from markasoftware/llama-cpu
Experimental fork of Facebook's LLaMA model which runs it with GPU acceleration on Apple Silicon M1/M2
hamishivi / EasyLM
Forked from young-geng/EasyLM
Large language models (LLMs) made easy. EasyLM is a one-stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
Fast and memory-efficient exact attention
B06901052 / DeepSpeed
Forked from deepspeedai/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Kwai-Klear / CE-GPPO
Forked from Kwai-Klear/KlearReasoner
CE-GPPO: Controlling Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning
yu4u / seam-carving
Forked from li-plus/seam-carving
A super-fast Python implementation of the seam carving algorithm for intelligent image resizing.
On-Point-RND / GIFT_SW
Forked from huggingface/peft
🤗 GIFT-SW for PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Retrieval via attention
OlivierDehaene / vllm
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
chenghuige / pytorch-loss
Forked from CoinCheung/pytorch-loss
label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful
Timothyxxx / aguvis
Forked from xlang-ai/aguvis
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
NielsRogge / optimum
Forked from huggingface/optimum
Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy-to-use hardware optimization tools
DROJ, a DRO-inspired embedding-level jailbreak method.
darraghdog / Kaggle-Carvana-Image-Masking-Challenge
Forked from petrosgk/Kaggle-Carvana-Image-Masking-Challenge
LouisCastricato / Lafite
Forked from drboog/Lafite
Code for paper LAFITE: Towards Language-Free Training for Text-to-Image Generation (CVPR 2022)
8-bit CUDA functions for PyTorch
jgolebiowski / syne-tune-icml
Forked from geoalgo/syne-tune
Optimizing Hyperparameters with Conformal Quantile Regression