-
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
-
ao Public
Forked from pytorch/aoPyTorch native quantization and sparsity for training and inference
Python BSD 3-Clause "New" or "Revised" License UpdatedJul 18, 2025 -
torchtitan Public
Forked from pytorch/torchtitanA native PyTorch Library for large model training
Python BSD 3-Clause "New" or "Revised" License UpdatedMay 22, 2025 -
neptune-fetcher Public
Forked from neptune-ai/neptune-fetcherNeptune Fetcher is designed to separate data retrieval capabilities from the regular neptune package. This separation makes data fetching more efficient and improves performance.
Python Apache License 2.0 UpdatedMar 20, 2025 -
-
flogging Public
Forked from FragileTech/floggingLogging module with nicer formatting
Python MIT License UpdatedJan 15, 2025 -
litgpt Public
Forked from Lightning-AI/litgptPretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.
Python Apache License 2.0 UpdatedMay 24, 2024 -
toolbox Public
Forked from stas00/ml-engineeringEssential guides and programming tools in my toolbox (with focus on ML Training)
Python Creative Commons Attribution Share Alike 4.0 International UpdatedApr 22, 2024 -
litdata Public
Forked from Lightning-AI/litDataBlazingly fast, distributed streaming of training data from any cloud storage for training AI models
Python Apache License 2.0 UpdatedApr 3, 2024 -
lightning-thunder Public
Forked from Lightning-AI/lightning-thunderSource to source compiler for PyTorch. It makes PyTorch programs faster on single accelerators and distributed.
Python Apache License 2.0 UpdatedMar 21, 2024 -
Fuser Public
Forked from NVIDIA/FuserA Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
C++ Other UpdatedOct 24, 2023 -
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harnessA framework for few-shot evaluation of autoregressive language models.
Python MIT License UpdatedOct 6, 2023 -
neurips_llm_efficiency_challenge Public
Forked from llm-efficiency-challenge/neurips_llm_efficiency_challengeNeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
Python UpdatedSep 23, 2023 -
TransformerEngine Public
Forked from NVIDIA/TransformerEngineA library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in bot…
Python Apache License 2.0 UpdatedJun 7, 2023 -
-
DeepSpeed Public
Forked from deepspeedai/DeepSpeedDeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Python MIT License UpdatedApr 25, 2023 -
faster-pytorch-blog Public
Forked from rasbt/faster-pytorch-blogOutlining techniques for improving the training performance of your PyTorch model without compromising its accuracy
Python UpdatedMar 29, 2023 -
DALI Public
Forked from NVIDIA/DALIA GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
C++ Apache License 2.0 UpdatedMar 15, 2023 -
lightning-quick-start Public
Forked from Lightning-Universe/lightning-quick-startPython UpdatedJan 18, 2023 -
ffcv Public
Forked from libffcv/ffcvFFCV: Fast Forward Computer Vision (and other ML workloads!)
Python Apache License 2.0 UpdatedNov 22, 2022 -
lightning Public
Forked from Lightning-AI/pytorch-lightningBuild and train PyTorch models and connect them to the ML lifecycle using Lightning App templates, without handling DIY infrastructure, cost management, scaling, and other headaches.
Python Apache License 2.0 UpdatedNov 12, 2022 -
stable-diffusion Public
Forked from CompVis/stable-diffusionA latent text-to-image diffusion model
Jupyter Notebook Other UpdatedNov 7, 2022 -
taming-transformers Public
Forked from CompVis/taming-transformersTaming Transformers for High-Resolution Image Synthesis
Jupyter Notebook MIT License UpdatedNov 7, 2022 -
-
nnutils Public
Forked from jpuigcerver/nnutilsCPU & CUDA implementation of several neural network utils
-
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Python Other UpdatedOct 3, 2022 -
PyLaia Public
Forked from jpuigcerver/PyLaiaA deep learning toolkit for handwritten document analysis
-
PyLaia-examples Public
A set of experiments using PyLaia on different datasets
-
UVA Public
UVA programming challenges
-