Skip to content
View ashwin's full-sized avatar

Block or report ashwin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Pure Rust Inference Engine

Rust 511 77 Updated Jun 18, 2026

More useful firefox searching extension than Built-in features. You can search words with various search engines in the popup.

JavaScript 2 Updated Oct 26, 2025

Search extension for the chrome web browser

JavaScript 216 31 Updated Jun 2, 2026

Lightweight harness for replaying inference traffic against an endpoint

Python 21 3 Updated Jun 17, 2026

Tile-Based Runtime for Ultra-Low-Latency LLM Inference

Python 1,435 89 Updated Jun 8, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 6,517 606 Updated Jun 18, 2026

LLM inference in C/C++

C++ 117,207 19,710 Updated Jun 19, 2026

Fast LLM speculative inference server for consumer hardware.

C++ 2,567 240 Updated Jun 18, 2026

The Definitive AI Agent Benchmark

Python 175 20 Updated Jun 17, 2026
Jupyter Notebook 8 1 Updated Mar 26, 2026

Use Codex from Claude Code to review code or delegate tasks.

JavaScript 21,262 1,287 Updated Jun 14, 2026

Modern HTTP benchmarking tool

C 40,329 3,031 Updated Dec 30, 2023

An extremely fast Python package and project manager, written in Rust.

Rust 86,541 3,218 Updated Jun 18, 2026

Browserino is a tiny browser selector for MacOS written in SwiftUI.

Swift 403 29 Updated Jan 8, 2026

A free, self-hostable news aggregator…

PHP 15,350 1,201 Updated Jun 17, 2026

High-performance, light-weight C++ LLM and VLM Inference Software for Physical AI

Python 443 80 Updated Jun 3, 2026

Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.

Python 2,334 304 Updated Jun 19, 2026

Examples for Recommenders - easy to train and deploy on accelerated infrastructure.

Python 281 71 Updated Jun 17, 2026

Vim - the text editor - for macOS

Vim Script 7,860 693 Updated Jun 18, 2026

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 33,882 7,065 Updated Jun 18, 2026

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 161,706 33,542 Updated Jun 18, 2026

A lightweight window border system for macOS

C 3,625 81 Updated May 14, 2026

A pleasant feed reader

TypeScript 28 1 Updated May 30, 2026

The automatic work journal. Privately turns your screen into a timeline of what you actually accomplished. Open-source and local-first.

Swift 6,113 335 Updated Jun 11, 2026

pytest plugin for distributed testing and loop-on-failures testing modes.

Python 1,868 267 Updated Jun 16, 2026

Image recognition for chess positions

Jupyter Notebook 126 25 Updated Dec 28, 2020

Predict chessboard FEN layouts from images using TensorFlow

Jupyter Notebook 565 108 Updated Jul 25, 2023

Expert Parallelism Load Balancer

Python 1,389 203 Updated Mar 24, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 3,419 545 Updated Jun 18, 2026

UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)

C++ 1,418 158 Updated Jun 18, 2026
Next