Skip to content
View rexxxx1234's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report rexxxx1234

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
165 stars written in Python
Clear filter

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,316 11,076 Updated Nov 6, 2025

Inference code for Llama models

Python 58,905 9,813 Updated Jan 26, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 49,040 8,215 Updated Dec 9, 2024

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 47,981 3,924 Updated Nov 6, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,624 4,613 Updated Nov 6, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 39,694 6,869 Updated Nov 6, 2025

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,986 3,443 Updated May 18, 2024

Let us control diffusion models!

Python 33,260 2,978 Updated Feb 25, 2024

Fully open reproduction of DeepSeek-R1

Python 25,614 2,401 Updated Sep 8, 2025

Graph Neural Network Library for PyTorch

Python 23,100 3,909 Updated Nov 3, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,815 2,665 Updated Jul 3, 2025

Contexts Optical Compression

Python 19,704 1,389 Updated Oct 25, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,118 1,906 Updated Nov 1, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,599 2,234 Updated Feb 1, 2025

Mamba SSM architecture

Python 16,343 1,481 Updated Oct 10, 2025

Convert Machine Learning Code Between Frameworks

Python 14,234 5,588 Updated Oct 17, 2025

Python package built to ease deep learning on graph, on top of existing DL frameworks.

Python 14,123 3,058 Updated Jul 31, 2025

Open source code for AlphaFold 2.

Python 13,933 2,501 Updated Oct 31, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,541 1,988 Updated Nov 3, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,362 1,523 Updated Apr 24, 2025

Go ahead and axolotl questions

Python 10,740 1,183 Updated Nov 6, 2025

A framework for few-shot evaluation of language models.

Python 10,546 2,831 Updated Oct 29, 2025

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Python 9,262 386 Updated Aug 12, 2025

ImageBind One Embedding Space to Bind Them All

Python 8,852 828 Updated Oct 3, 2025
Python 7,550 2,199 Updated Oct 23, 2025

A Collection of Variational Autoencoders (VAE) in PyTorch.

Python 7,418 1,176 Updated Mar 21, 2025

Large World Model -- Modeling Text and Video with Millions Context

Python 7,364 560 Updated Oct 19, 2024

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 6,082 524 Updated Jul 1, 2025

Adding guardrails to large language models.

Python 5,920 472 Updated Nov 6, 2025
Next