- Microsoft Research
- China
- https://addf400.github.io/
- @HangboBao
Stars
Implementation of "Hyperspherical Latents Improve Continuous-Token Autoregressive Modeling"
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…
Official inference framework for 1-bit LLMs
The leading native Python SSHv2 protocol library.
A PyTorch native platform for training generative AI models
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
A Data Streaming Library for Efficient Neural Network Training
Official pre-processing library for Mistral models
Official inference library for Mistral models
[TMLR 2025🔥] A survey of autoregressive models in vision.
🔥🔥🔥 [IEEE TCSVT] Latest papers, code, and datasets on Vid-LLMs.
A Python package extending official PyTorch to easily obtain performance gains on Intel platforms
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine (see the usage sketch after this list).
This repo contains the code for the 1D tokenizer and generator
SEED-Voken: A Series of Powerful Visual Tokenizers
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
A highly efficient compression algorithm for the N1 implant (Neuralink's compression challenge)
Making large AI models cheaper, faster and more accessible
Hackable and optimized Transformers building blocks, supporting a composable construction.
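
As a concrete example, img2dataset (starred above) exposes a one-call Python entry point. Here is a minimal sketch; it assumes the `download` function and the keyword arguments shown in the project's README (`url_list`, `image_size`, `output_folder`), whose exact names may differ across versions, and `urls.txt` is a hypothetical input file:

```python
# Minimal img2dataset sketch: download, resize, and shard a list of image URLs.
# Assumes the Python entry point `img2dataset.download` from the project README;
# argument names may vary by version, and "urls.txt" is a hypothetical file
# containing one image URL per line.
from img2dataset import download

download(
    url_list="urls.txt",         # input: one image URL per line
    output_folder="images",      # output directory for the packaged shards
    processes_count=16,          # parallel worker processes
    thread_count=32,             # download threads per process
    image_size=256,              # resize every image to 256x256
    output_format="webdataset",  # write .tar shards suitable for streaming
)
```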