Skip to content
View rexxxx1234's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report rexxxx1234

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
165 stars written in Python
Clear filter

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,446 11,109 Updated Nov 7, 2025

Inference code for Llama models

Python 58,906 9,812 Updated Jan 26, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 49,101 8,226 Updated Dec 9, 2024

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,020 3,931 Updated Nov 7, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,632 4,613 Updated Nov 7, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 39,708 6,871 Updated Nov 7, 2025

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,987 3,445 Updated May 18, 2024

Let us control diffusion models!

Python 33,263 2,979 Updated Feb 25, 2024

Fully open reproduction of DeepSeek-R1

Python 25,618 2,400 Updated Sep 8, 2025

Graph Neural Network Library for PyTorch

Python 23,106 3,910 Updated Nov 7, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,818 2,666 Updated Jul 3, 2025

Contexts Optical Compression

Python 19,811 1,413 Updated Oct 25, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,126 1,909 Updated Nov 1, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,602 2,233 Updated Feb 1, 2025

Mamba SSM architecture

Python 16,356 1,482 Updated Oct 10, 2025

Convert Machine Learning Code Between Frameworks

Python 14,235 5,588 Updated Oct 17, 2025

Python package built to ease deep learning on graph, on top of existing DL frameworks.

Python 14,126 3,057 Updated Jul 31, 2025

Open source code for AlphaFold 2.

Python 13,939 2,503 Updated Oct 31, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,547 1,988 Updated Nov 3, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,370 1,522 Updated Apr 24, 2025

Go ahead and axolotl questions

Python 10,745 1,184 Updated Nov 7, 2025

A framework for few-shot evaluation of language models.

Python 10,554 2,832 Updated Oct 29, 2025

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Python 9,263 387 Updated Aug 12, 2025

ImageBind One Embedding Space to Bind Them All

Python 8,853 827 Updated Oct 3, 2025
Python 7,556 2,202 Updated Oct 23, 2025

A Collection of Variational Autoencoders (VAE) in PyTorch.

Python 7,419 1,175 Updated Mar 21, 2025

Large World Model -- Modeling Text and Video with Millions Context

Python 7,365 560 Updated Oct 19, 2024

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 6,082 524 Updated Jul 1, 2025

Adding guardrails to large language models.

Python 5,925 472 Updated Nov 6, 2025
Next