View tattocau's full-sized avatar

Open-source, community-driven agent harness

Rust 38,188 3,282 Updated Jun 13, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 82,734 18,004 Updated Jun 13, 2026

Experimental implementation of DeepSeek v4 Flash in llama.cpp

C++ 303 53 Updated Apr 27, 2026

DeepSeek 4 Flash and PRO local inference engine for Metal, CUDA and ROCm

C 13,548 1,194 Updated Jun 11, 2026

LLM inference in C/C++

C++ 116,302 19,524 Updated Jun 13, 2026

Lossless DFlash speculative decoding for MLX on Apple Silicon

Python 730 55 Updated Jun 11, 2026

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 4,448 340 Updated Jan 14, 2026

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,704 1,058 Updated Apr 30, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 7,370 1,043 Updated Jun 4, 2026

DeepEP: an efficient expert-parallel communication library

Cuda 9,725 1,283 Updated Jun 11, 2026

DeepSeek Coder: Let the Code Write Itself

Python 23,673 2,849 Updated Nov 11, 2025

A kernel library written in tilelang

Python 1,585 138 Updated Apr 23, 2026

Docker configuration for running vLLM on dual DGX Sparks

Shell 1,597 289 Updated Jun 12, 2026

The agent that grows with you

Python 192,218 33,508 Updated Jun 13, 2026

Fast LLM speculative inference server for consumer hardware.

C++ 2,422 221 Updated Jun 12, 2026

ClashFX — macOS proxy tool with Enhanced Mode (TUN)

Swift 137 12 Updated Jun 12, 2026

The open source coding agent.

TypeScript 173,823 20,946 Updated Jun 13, 2026

Chinese translation of Real-Time Rendering, 4th Edition (RTR4)

3,208 409 Updated Mar 13, 2025

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 378,459 79,158 Updated Jun 13, 2026

Your Personal AI Assistant; easy to install, deploy on your own machine or on the cloud; supports multiple chat apps with easily extensible capabilities.

Python 17,483 2,602 Updated Jun 12, 2026

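A close reading of the HarmonyOS kernel source, with over a million Chinese characters of annotation and analysis; a hundred blog posts dissect the kernel in depth, down to its foundations. Annotations are kept in sync with the official source, tools and documentation are complete, and the work is published on multiple sites. weharmonyos.com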

C 427 83 Updated Nov 9, 2025

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

Python 66,386 5,945 Updated Jun 13, 2026

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 17,269 1,313 Updated Jun 7, 2026

Run compilers interactively from your web browser and interact with the assembly

TypeScript 18,834 2,055 Updated Jun 13, 2026

Stop renting your intelligence. Own it with AnythingLLM. Everything you need for a powerful local-first agent experience

JavaScript 61,515 6,693 Updated Jun 13, 2026

Powerful AI Client

TypeScript 40,445 4,102 Updated Jun 12, 2026

Lightweight Lenovo Vantage and Hotkeys replacement for Lenovo Legion laptops.

C# 7,508 368 Updated Jul 24, 2025

Guides, Tricks, and Tips to get the Legion Go running best on Linux

Shell 253 8 Updated May 11, 2026