- Zurich, Switzerland
- https://www.linkedin.com/in/bamert/
Starred repositories
VIP cheatsheet for Stanford's CME 295 Transformers and Large Language Models
A high-throughput and memory-efficient inference and serving engine for LLMs
Production-grade client-side tracing, profiling, and analysis for complex software systems.
Algorithm and data structure articles for https://cp-algorithms.com (based on http://e-maxx.ru)
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
Performance-portable, length-agnostic SIMD with runtime dispatch
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
An extremely fast Python package and project manager, written in Rust.
A simple MEMS I2S microphone and audio processing library for ESP32.
The official Python SDK for Model Context Protocol servers and clients
A community-driven registry service for Model Context Protocol (MCP) servers.
A Trimap-Free Portrait Matting Solution in Real Time [AAAI 2022]
This repository contains the training code of ParetoQ introduced in our work "ParetoQ: Scaling Laws in Extremely Low-bit LLM Quantization"
Single and multiple view camera calibration tool
A TypeScript-like language for WebAssembly.
An experimental HTTP VFS driver for SQLite WASM
A library for efficient similarity search and clustering of dense vectors.
An alternative macOS application icon for the wonderful Kitty terminal emulator.
🦊 A highly customizable theme for vim and neovim with support for lsp, treesitter and a variety of plugins.
Extension to mason.nvim that makes it easier to use lspconfig with mason.nvim.
Aquantia AQC10x multigigabit PCIe NIC Linux driver (atlantic) - VMware ESXi port
This repository contains a PyTorch implementation of the paper "The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks" by Jonathan Frankle and Michael Carbin that can be easily a…
Port of OpenAI's Whisper model in C/C++