stefan-it
🤓
hacking 🎧

Highlights

  • Pro

Organizations

@flairNLP @Hugging-Face-Supporter @GermanT5 @Hugging-Face-Helping-Hand @LEL-A


A vector index built on TurboQuant, written in Rust with Python bindings

Rust 1,018 91 Updated May 17, 2026

Repository for scripts affiliated with training and classification of German political texts

Jupyter Notebook 1 Updated Sep 15, 2025

Tools for merging pretrained large language models.

Python 7,083 715 Updated May 6, 2026

26m function call model that runs on incredibly small devices

Python 2,103 120 Updated May 16, 2026

Key Value Means paper code repository

Python 5 2 Updated May 12, 2026

Experiments and examples using the Marin framework

Python 3 Updated May 8, 2026

Open-source framework for the research and development of foundation models.

Python 975 116 Updated May 17, 2026

Aurora optimizer release

Python 119 5 Updated May 8, 2026

Copy Fail 2: Electric Boogaloo

C 311 31 Updated May 8, 2026
2 Updated Apr 25, 2026

TurboQuant: Near-optimal KV cache quantization for LLM inference (3-bit keys, 2-bit values) with Triton kernels + vLLM integration

Python 1,394 172 Updated Mar 27, 2026

Open source interface for iCUE LINK Hub and other Corsair AIOs, Hubs for Linux. Manage RGB lighting, fan speeds, system metrics, as well as keyboards, mice, headsets via a web dashboard.

Go 1,003 76 Updated May 9, 2026
Shell 293 39 Updated May 2, 2026

Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax"

Python 93 5 Updated Sep 12, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 6,229 570 Updated May 17, 2026

Code for the paper "On the Expressivity Role of LayerNorm in Transformers' Attention" (Findings of ACL'2023)

Python 60 3 Updated Sep 27, 2024

Toxic Data Augmentation via LLM-Guided Directional Adversarial Generation

Python 1 1 Updated Apr 15, 2026

OpenAI Privacy Filter

Python 2,170 191 Updated Apr 22, 2026

🤗 ml-intern: an open-source ML engineer that reads papers, trains models, and ships ML models

Python 9,589 1,020 Updated May 15, 2026

Official code for the paper Detoxification for LLM: From Dataset Itself. The code is based on transformers.

Python 1 Updated Apr 22, 2026

[ACL 2026 Findings] E2E-GMNER: End-to-End Generative Grounded Multimodal Named Entity Recognition

Python 4 Updated Apr 22, 2026

Official PyTorch implementation of Sessa: Selective State Space Attention for long-context sequence modeling.

Python 13 Updated Apr 28, 2026

[ACL 2026] How Tokenization Limits Phonological Knowledge Representation in Language Models and How to Improve Them

Python 2 Updated Apr 8, 2026

GLUE for Luxembourgish

Python 2 Updated Apr 9, 2026

Lucebox: LLM inference server built for speed for specific consumer hardware.

C++ 2,134 200 Updated May 17, 2026

BenGER - Research-grade benchmarking for LLMs in the German legal domain

TypeScript 2 Updated May 16, 2026

🪨 why use many token when few token do trick — Claude Code skill that cuts 65% of tokens by talking like caveman

JavaScript 61,253 3,411 Updated May 12, 2026