Stars
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
A high-throughput and memory-efficient inference and serving engine for LLMs
You like pytorch? You like micrograd? You love tinygrad! ❤️
Fully open reproduction of DeepSeek-R1
Image-to-Image Translation in PyTorch
A TTS model capable of generating ultra-realistic dialogue in one pass.
An elegant PyTorch deep reinforcement learning library.
Keras implementations of Generative Adversarial Networks.
Utilities intended for use with Llama models.
Tools for merging pretrained large language models.
Entropy Based Sampling and Parallel CoT Decoding
The official repo of MiniMax-Text-01 and MiniMax-VL-01, a large language model and a vision-language model based on linear attention
Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/
Tool for robust segmentation of >100 important anatomical structures in CT and MR images
Muon is an optimizer for hidden layers in neural networks
Large language models (LLMs) made easy. EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving LLMs in JAX/Flax.
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
A simple, performant, and scalable JAX LLM!
Schedule-Free Optimization in PyTorch
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
SkyRL: A Modular Full-stack RL Library for LLMs
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
Cramming the training of a (BERT-type) language model into limited compute.