Neilblaze
╰( ▀ ͜͞ʖ▀)つ──☆*:・゚

Starred repositories

Fast, lossless LLM inference via dual-view diffusion decoding.

Python 422 17 Updated May 18, 2026

Fast LLM speculative inference server for consumer hardware.

C++ 2,428 223 Updated Jun 13, 2026

🤗 ml-intern: an open-source ML engineer that reads papers, trains models, and ships ML models

Python 10,441 1,104 Updated Jun 13, 2026

Normalized Transformer (nGPT)

Python 208 24 Updated Nov 19, 2024

CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies

Rust 62,109 3,827 Updated Jun 12, 2026

Communicate with an LLM provider using a single interface

Python 2,061 179 Updated Jun 12, 2026

Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.

TypeScript 13,279 1,394 Updated Nov 24, 2025

LLMRouter: An Open-Source Library for LLM Routing

Python 1,956 189 Updated May 13, 2026

Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs

Python 2,460 285 Updated Oct 5, 2025

Autonomous GPU Kernel Generation & Optimization via Deep Agents

Python 446 74 Updated Jun 6, 2026

FlashInfer: Kernel Library for LLM Serving

Python 5,787 1,047 Updated Jun 13, 2026

Universal memory layer for AI Agents

Python 58,484 6,720 Updated Jun 13, 2026

Dynamic Memory Management for Serving LLMs without PagedAttention

C 490 42 Updated Jun 10, 2026

MemSifter: Offloading LLM Memory Retrieval via Outcome-Driven Proxy Reasoning

Python 67 4 Updated Apr 2, 2026

Ghostty-based macOS terminal with vertical tabs and notifications for AI coding agents

Swift 21,960 1,692 Updated Jun 13, 2026

A collection of various LLM pruning implementations, training code for GPUs & TPUs, and evaluation scripts.

Python 63 8 Updated Apr 20, 2026

Stanford NLP Python library for Representation Finetuning (ReFT)

Python 1,570 134 Updated Mar 5, 2026

Tiny, Fast, and Deployable anywhere — automate the mundane, unleash your creativity

Go 29,385 4,210 Updated Jun 13, 2026

Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.

Python 1,304 150 Updated Jun 12, 2026

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 11,942 1,549 Updated Mar 17, 2026

Compression for unit-norm embedding vectors using spherical coordinates

C 83 8 Updated Jan 23, 2026

SimpleMem: Efficient Lifelong Memory for LLM Agents — Text & Multimodal

Python 3,504 361 Updated May 21, 2026

Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments

Python 1,279 370 Updated Jun 8, 2026

Open-Source Frontier Voice AI

Python 49,314 5,479 Updated May 6, 2026

Renderer for the harmony response format to be used with gpt-oss

Rust 4,404 285 Updated Apr 8, 2026

A censorship-resistant platform where voices can’t be silenced, yet every report is verified with zero-knowledge proofs

TypeScript 8 Updated Sep 7, 2025

Exploratory analysis of Bayesian models with Python

TeX 1,827 496 Updated Jun 12, 2026

A browser extension for insights into GitHub, Gitee projects and developers.

TypeScript 405 105 Updated Jun 10, 2026

An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.

Python 43,897 4,585 Updated Jun 5, 2026