Skip to content
View exantrius's full-sized avatar
  • USA, SF Bay Area
  • 05:50 (UTC -08:00)

Block or report exantrius

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)

Jupyter Notebook 11,020 1,580 Updated Feb 12, 2025

Jellyfin Server/Web packaging and release workflows

Python 48 59 Updated Nov 3, 2025

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 155,412 13,542 Updated Nov 5, 2025

Materials for learning SGLang

631 49 Updated Oct 26, 2025

TensorRT LLM Benchmark Configuration

Python 13 4 Updated Jul 26, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 19,734 3,272 Updated Nov 5, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,091 11,038 Updated Nov 5, 2025

Generative Models by Stability AI

Python 26,564 2,975 Updated Nov 3, 2025

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 92,806 10,443 Updated Nov 5, 2025

The macOS & iOS file archiver

PHP 6,006 272 Updated Oct 14, 2025

Primary Git Repository for the Zephyr Project. Zephyr is a new generation, scalable, optimized, secure RTOS for multiple hardware architectures.

C 13,611 8,185 Updated Nov 5, 2025

Primary Git Repository for the Zephyr Project. Zephyr is a new generation, scalable, optimized, secure RTOS for multiple hardware architectures.

C 15 2 Updated Oct 8, 2025

Inference Llama 2 in one file of pure C

C 18,912 2,399 Updated Aug 6, 2024

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 13,630 2,006 Updated Aug 8, 2024

Deep Learning in Javascript. Train Convolutional Neural Networks (or ordinary ones) in your browser.

JavaScript 11,069 2,059 Updated Jan 7, 2023

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 48,965 8,196 Updated Dec 9, 2024

LLM training in simple, raw C/CUDA

Cuda 28,070 3,264 Updated Jun 26, 2025

LLM101n: Let's build a Storyteller

35,453 1,929 Updated Aug 1, 2024

The sassy UML diagram renderer

TypeScript 2,797 211 Updated Mar 31, 2025

A curated collection of diagramming tools used by leading software engineering teams

3,148 94 Updated Aug 25, 2024

Linux sources

C 1,426 305 Updated Jul 19, 2024

For developers, who are building real-time data-driven applications, Redis is the preferred, fastest, and most feature-rich cache, data structure server, and document and vector query engine.

C 71,569 24,304 Updated Nov 5, 2025

Generate types and converters from JSON, Schema, and GraphQL

TypeScript 13,412 1,155 Updated Oct 27, 2025

Handlebars for golang

Go 643 118 Updated Jun 19, 2025

The definitive Web UI for local AI, with powerful features and easy setup.

Python 45,320 5,818 Updated Nov 5, 2025

LLM inference in C/C++

C++ 89,036 13,542 Updated Nov 5, 2025

An open source, offline capable, mind mapping application leveraging HTML5 technologies

JavaScript 2,865 605 Updated Feb 5, 2023
Next