Docker configuration for running vLLM on dual DGX Sparks

Shell 1,634 297 Updated Jun 18, 2026

Agents, and RL environment, for optimizing GPU kernels on AMD ROCm using LLM agents. Benchmarks LLM serving workloads end-to-end, profiles bottleneck kernels, optimizes them via Claude Code or Code…

Python 68 9 Updated Jun 12, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,425 704 Updated May 17, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 83,277 18,201 Updated Jun 18, 2026

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

Python 66,782 5,998 Updated Jun 18, 2026

Standards for all 50 states, organizations, schools, & districts. Sponsored by Common Curriculum

JavaScript 40 8 Updated May 22, 2026

Nano vLLM

Python 14,091 2,231 Updated Apr 26, 2026

The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

Python 1,704 227 Updated Sep 3, 2024

Build compute kernels and load them from the Hub.

Python 697 105 Updated Jun 18, 2026

A high-performance acceleration library dedicated to large-scale model training on AMD GPUs

Python 64 23 Updated Jun 18, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 7,292 1,258 Updated Jun 18, 2026

Development repository for the Triton language and compiler

MLIR 19,470 2,945 Updated Jun 18, 2026

Solve puzzles. Learn CUDA.

Jupyter Notebook 12,236 932 Updated Sep 1, 2024

Applied AI experiments and examples for PyTorch

Python 323 33 Updated Aug 22, 2025

A PyTorch native platform for training generative AI models

Python 5,450 863 Updated Jun 18, 2026

Visualize ONNX models with model-explorer

Python 72 7 Updated May 19, 2026

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 42,541 4,860 Updated Jun 18, 2026

Automatically split your PyTorch models on multiple GPUs for training & inference

Python 656 44 Updated Jan 2, 2024

Code review assistant powered by LLM

Python 193 29 Updated Apr 26, 2026

llm-export can export LLM models to ONNX.

Python 352 40 Updated May 8, 2026

A tool for parsing, editing, optimizing, and profiling ONNX models.

Python 490 68 Updated Jun 8, 2026

How React works

JavaScript 172 11 Updated Dec 20, 2023

A Multi-Paradigm React State Management Library

TypeScript 118 11 Updated Sep 18, 2024

LLM training in simple, raw C/CUDA

Cuda 30,255 3,653 Updated Jun 26, 2025
Python 251 28 Updated Jul 25, 2024

GPU monitor for Python.

Python 7 1 Updated Dec 9, 2023

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 21,283 2,337 Updated Jun 18, 2026

A Lightweight Recommendation System

Python 9,311 718 Updated Oct 13, 2025

🐥 A code review bot powered by ChatGPT

JavaScript 4,444 462 Updated Feb 7, 2026