Skip to content
View tuanavu's full-sized avatar

Highlights

  • Pro

Block or report tuanavu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Quantized LLM training in pure CUDA/C++.

C++ 214 14 Updated Nov 5, 2025

A Distributed, Fault-Tolerant Message Queue from Scratch. Inspired by Apache Kafka

Go 67 6 Updated Oct 23, 2025

The best ChatGPT that $100 can buy.

Python 35,765 4,111 Updated Nov 5, 2025

1st Place Team Crane: @aswinkumar1999 @rathull @kyolebu

Jupyter Notebook 26 2 Updated Sep 8, 2025

KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA (+ more DSLs)

Python 643 79 Updated Nov 5, 2025

Intelligent automation and multi-agent orchestration for Claude Code

Python 19,934 2,221 Updated Nov 1, 2025

KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale

Go 891 101 Updated Nov 5, 2025

CUDA Python: Performance meets Productivity

Python 3,019 217 Updated Nov 5, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 19,755 3,273 Updated Nov 5, 2025

Distributed Compiler based on Triton for Parallel Systems

Python 1,213 104 Updated Oct 17, 2025

Simple, complete, correct, optimal and industrial quality solutions for MIT 6.824 distributed systems course

Go 4 Updated Sep 2, 2024

A uniform interface to run deep learning models from multiple frameworks

C++ 939 75 Updated Jan 3, 2024

LevelCache is an ephemeral embedded cache with TTL support built on top of LevelDB.

C 54 9 Updated Jul 11, 2025

Python tool for converting files and office documents to Markdown.

Python 82,570 4,657 Updated Oct 20, 2025

A lightweight, powerful framework for multi-agent workflows

Python 17,113 2,818 Updated Nov 5, 2025

ACI.dev is the open source tool-calling platform that hooks up 600+ tools into any agentic IDE or custom AI agent through direct function calling or a unified MCP server. The birthplace of VibeOps.

Python 4,671 452 Updated Sep 24, 2025

A Go implementation of the Model Context Protocol (MCP), enabling seamless integration between LLM applications and external data sources and tools.

Go 7,566 711 Updated Nov 2, 2025

An open protocol enabling communication and interoperability between opaque agentic applications.

TypeScript 20,558 2,086 Updated Nov 5, 2025

GitHub's official MCP Server

Go 24,248 2,954 Updated Nov 5, 2025

MCP PubMed Search Server

Python 46 11 Updated Dec 12, 2024

The book "Performance Analysis and Tuning on Modern CPU"

TeX 3,354 235 Updated Jun 9, 2025

Implementing the 4 agentic patterns from scratch

Jupyter Notebook 1,612 300 Updated Mar 18, 2025

Deploy your agentic worfklows to production

Python 2,059 228 Updated Aug 31, 2025

Flexible and powerful framework for managing multiple AI agents and handling complex conversations

Python 7,030 639 Updated Oct 21, 2025

Class materials for a distributed systems lecture series

9,231 686 Updated Mar 18, 2025

A toolkit to run Ray applications on Kubernetes

Go 2,121 648 Updated Nov 4, 2025

Some CUDA example code with READMEs.

Cuda 176 26 Updated Mar 2, 2025

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 1,907 313 Updated Nov 5, 2025

A lightweight data processing framework built on DuckDB and 3FS.

Python 4,832 430 Updated Mar 5, 2025
Next