- Los Gatos, CA
- http://codeyarns.com
- @codeyarns@mastodon.social
- @codeyarns.bsky.social
- in/ashwinn
Stars
A Firefox search extension that is more useful than the built-in features. You can search for words with various search engines from the popup.
Search extension for the Chrome web browser
Lightweight harness for replaying inference traffic against an endpoint
Tile-Based Runtime for Ultra-Low-Latency LLM Inference
Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels
Fast LLM speculative inference server for consumer hardware.
Use Codex from Claude Code to review code or delegate tasks.
An extremely fast Python package and project manager, written in Rust.
Browserino is a tiny browser selector for macOS written in SwiftUI.
High-performance, light-weight C++ LLM and VLM Inference Software for Physical AI
Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.
Examples for Recommenders - easy to train and deploy on accelerated infrastructure.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
A lightweight window border system for macOS
The automatic work journal. Privately turns your screen into a timeline of what you actually accomplished. Open-source and local-first.
pytest plugin for distributed testing and loop-on-failures testing modes.
Image recognition for chess positions
Predict chessboard FEN layouts from images using TensorFlow
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)