

AI prompts for accelerating the research workflow.

119 16 Updated Mar 10, 2026
Jinja 19 2 Updated Dec 4, 2025

A language-model–powered compressor for natural language text

Python 49 3 Updated Oct 23, 2025

A Streaming-Native Serving Engine for TTS/STS Models

Python 61 8 Updated Feb 22, 2026

AGENTS.md — a simple, open format for guiding coding agents

TypeScript 19,429 1,398 Updated Mar 12, 2026

Release repo for our SLAM Handbook

TeX 4,400 288 Updated Nov 25, 2025

Translate C macros and conditional compilation to Rust

C++ 11 2 Updated Mar 20, 2026

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 15,377 12,865 Updated Mar 25, 2026

Dotfiles. Managed by YADM

Vim Script 305 48 Updated Mar 22, 2026

Access large language models from the command-line

Python 11,443 779 Updated Mar 17, 2026

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,692 543 Updated Mar 26, 2026

A throughput-oriented high-performance serving framework for LLMs

Jupyter Notebook 951 47 Updated Oct 29, 2025
Python 1 3 Updated May 22, 2025

[EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation

Python 149 5 Updated May 18, 2025

📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools lik…

TypeScript 22,672 1,055 Updated Mar 27, 2026

A concise but complete full-attention transformer with a set of promising experimental features from various papers

Python 5,807 506 Updated Mar 26, 2026

Numbers every LLM developer should know

4,292 140 Updated Jan 16, 2024

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

JavaScript 7,003 747 Updated Mar 26, 2026

[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

Cuda 381 40 Updated Jul 10, 2025

A comprehensive guide on buying and owning a Tesla

926 189 Updated Sep 9, 2024

Learn LeetCode and prepare for coding interviews with free resources.

3,785 410 Updated Feb 14, 2025

DCPerf benchmark suite for hyperscale cloud applications

Python 237 78 Updated Mar 26, 2026

[ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestration

Python 262 33 Updated Nov 18, 2024

[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

Cuda 336 30 Updated Jul 2, 2024

Benchmarking suite for Google workloads

C++ 142 17 Updated Mar 24, 2026
SystemVerilog 20 8 Updated Jun 12, 2024

OSS-Fuzz - continuous fuzzing for open source software.

Shell 11,994 2,670 Updated Mar 26, 2026

Avatars for Zoom, Skype and other video-conferencing apps.

Python 16,538 4,315 Updated Aug 30, 2024

Resources for conference program chairs, especially in systems/PL areas of computer science.

Python 11 1 Updated May 14, 2023

[LLVM Static Slicer] Various program analyses, construction of dependence graphs and program slicing of LLVM bitcode.

C++ 525 141 Updated May 21, 2025