High-performance ML
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
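Treating differentiation as a program transformation, as JAX does, can be illustrated without JAX itself. The sketch below is a minimal, hypothetical forward-mode autodiff via dual numbers; the `Dual` class and `grad` helper are illustrative stand-ins, not JAX's implementation.

```python
# Illustrative sketch (NOT JAX's implementation): differentiation as a
# composable transformation of an ordinary Python function, using
# forward-mode dual numbers.

class Dual:
    """Carries a value and its derivative through arithmetic."""
    def __init__(self, val, tan=0.0):
        self.val, self.tan = val, tan

    def __add__(self, other):
        other = other if isinstance(other, Dual) else Dual(other)
        return Dual(self.val + other.val, self.tan + other.tan)
    __radd__ = __add__

    def __mul__(self, other):
        other = other if isinstance(other, Dual) else Dual(other)
        return Dual(self.val * other.val,
                    self.val * other.tan + self.tan * other.val)
    __rmul__ = __mul__

def grad(f):
    """Transform f: R -> R into a function computing df/dx."""
    def df(x):
        return f(Dual(x, 1.0)).tan
    return df

f = lambda x: 3 * x * x + 2 * x   # f'(x) = 6x + 2
print(grad(f)(4.0))               # 26.0
```

Because `grad` returns an ordinary function, transformations like this compose, which is the property the JAX tagline refers to.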
Flax is a neural network library for JAX that is designed for flexibility.
Fast and memory-efficient exact attention
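The core trick behind memory-efficient exact attention can be sketched in NumPy: process keys/values in blocks while maintaining a running row max and softmax denominator, so the full attention matrix is never materialized. This is a simplified single-head sketch of the online-softmax idea, not the library's fused GPU kernel.

```python
import numpy as np

def naive_attention(Q, K, V):
    """Reference: materializes the full attention matrix."""
    S = Q @ K.T / np.sqrt(Q.shape[-1])
    P = np.exp(S - S.max(axis=-1, keepdims=True))
    P /= P.sum(axis=-1, keepdims=True)
    return P @ V

def tiled_attention(Q, K, V, block=4):
    """Sketch of the memory-saving idea: consume K/V in blocks, keeping
    only a running max and running softmax denominator per query row."""
    d = Q.shape[-1]
    out = np.zeros_like(Q)
    m = np.full(Q.shape[0], -np.inf)   # running row max
    l = np.zeros(Q.shape[0])           # running softmax denominator
    for j in range(0, K.shape[0], block):
        Kb, Vb = K[j:j + block], V[j:j + block]
        S = Q @ Kb.T / np.sqrt(d)
        m_new = np.maximum(m, S.max(axis=-1))
        scale = np.exp(m - m_new)      # rescale previous partial results
        P = np.exp(S - m_new[:, None])
        out = out * scale[:, None] + P @ Vb
        l = l * scale + P.sum(axis=-1)
        m = m_new
    return out / l[:, None]

rng = np.random.default_rng(0)
Q, K, V = (rng.normal(size=(8, 16)) for _ in range(3))
assert np.allclose(naive_attention(Q, K, V), tiled_attention(Q, K, V))
```

The tiled version is exact (not an approximation): the rescaling by `exp(m - m_new)` keeps the partial numerator and denominator consistent as the running max is updated.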
Accessible large language models via k-bit quantization for PyTorch.
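To make the "k-bit quantization" idea concrete, here is a minimal NumPy sketch of symmetric absmax quantization to signed 8-bit integers. It illustrates the general technique only; the function names are hypothetical and this is not bitsandbytes' blockwise CUDA implementation.

```python
import numpy as np

def quantize_absmax(x, bits=8):
    """Sketch of symmetric absmax quantization: scale floats so the
    largest magnitude maps to the edge of the k-bit signed range."""
    qmax = 2 ** (bits - 1) - 1          # e.g. 127 for 8-bit
    scale = np.abs(x).max() / qmax
    q = np.round(x / scale).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Recover approximate float values from the integer codes."""
    return q.astype(np.float32) * scale

w = np.array([0.5, -1.0, 0.25, 0.99], dtype=np.float32)
q, s = quantize_absmax(w)
w_hat = dequantize(q, s)
# reconstruction error is bounded by half a quantization step
assert np.max(np.abs(w - w_hat)) <= s / 2 + 1e-6
```

Storing `q` (1 byte per weight) plus one `scale` per tensor is what shrinks memory roughly 4x versus float32; real implementations quantize in small blocks to tighten the error bound.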
Code repository for the paper "Matryoshka Representation Learning"
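The Matryoshka idea is that an encoder trained with MRL produces embeddings whose prefixes (e.g. the first 64 of 256 dimensions) are themselves usable embeddings. The sketch below shows only the inference-side truncation with stand-in random embeddings, not the paper's trained model or loss.

```python
import numpy as np

def truncate(emb, dim):
    """Keep the first `dim` coordinates and re-normalize (assumed
    inference-time use of a Matryoshka-style embedding)."""
    e = emb[..., :dim]
    return e / np.linalg.norm(e, axis=-1, keepdims=True)

rng = np.random.default_rng(0)
docs = rng.normal(size=(100, 256))               # stand-in document embeddings
docs /= np.linalg.norm(docs, axis=-1, keepdims=True)
query = docs[42] + 0.01 * rng.normal(size=256)   # near-duplicate of doc 42

for dim in (256, 64, 16):                        # nested "matryoshka" sizes
    scores = truncate(docs, dim) @ truncate(query, dim)
    print(dim, scores.argmax())                  # low-dim search still finds doc 42
```

The practical payoff is adaptive retrieval: shortlist candidates with a cheap low-dimensional prefix, then re-rank the shortlist with the full embedding.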
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Efficient Triton Kernels for LLM Training
Development repository for the Triton language and compiler
A high-throughput and memory-efficient inference and serving engine for LLMs
Large Language Model Text Generation Inference
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
SGLang is a fast serving framework for large language models and vision language models.
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
On-the-fly conversions between Jax and NumPy tensors