- Bavarian Oberland, Germany
- https://schweter.ml
A vector index built on TurboQuant, written in Rust with Python bindings
Repository of scripts for training and classification of German political texts
Tools for merging pretrained large language models.
26M-parameter function-calling model that runs on incredibly small devices
Experiments and examples using the Marin framework
Open-source framework for the research and development of foundation models.
Copy Fail 2: Electric Boogaloo
TurboQuant: Near-optimal KV cache quantization for LLM inference (3-bit keys, 2-bit values) with Triton kernels + vLLM integration
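To illustrate what "3-bit keys, 2-bit values" means in practice, here is a minimal sketch of uniform low-bit quantization in Python. This is not TurboQuant's actual algorithm (which uses near-optimal quantizers and Triton kernels); the function names and the per-tensor min/max scaling are assumptions made for the sketch:

```python
import numpy as np

def quantize_uniform(x, bits):
    # Map values onto 2**bits evenly spaced levels spanning the
    # tensor's observed range, storing only small integer codes.
    levels = 2 ** bits - 1
    lo, hi = float(x.min()), float(x.max())
    scale = (hi - lo) / levels if hi > lo else 1.0
    q = np.round((x - lo) / scale).astype(np.uint8)  # integer codes
    return q, lo, scale

def dequantize(q, lo, scale):
    # Reconstruct approximate floats from the codes.
    return q.astype(np.float32) * scale + lo

rng = np.random.default_rng(0)
keys = rng.standard_normal((4, 8)).astype(np.float32)
q, lo, scale = quantize_uniform(keys, bits=3)   # 3-bit codes: 0..7
keys_hat = dequantize(q, lo, scale)
err = float(np.abs(keys - keys_hat).max())      # bounded by scale / 2
```

At 3 bits each code fits in 0..7, and the worst-case reconstruction error of uniform quantization is half the step size; real KV-cache schemes add per-channel scaling and fused kernels on top of this idea.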
Open-source interface for the iCUE LINK Hub and other Corsair AIOs and hubs on Linux. Manage RGB lighting, fan speeds, and system metrics, as well as keyboards, mice, and headsets, via a web dashboard.
Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax"
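A minimal sketch of the rectified softmax the Softpick title refers to, assuming the relu(e^x − 1) numerator and |e^x − 1| normalizer; the eps term and the example scores are my own assumptions, and the paper's numerically stable formulation is omitted:

```python
import numpy as np

def softpick(x, eps=1e-8):
    # Rectified softmax: scores at or below zero get exactly zero
    # weight, so a head can attend "nowhere" instead of being forced
    # to dump probability mass on a sink token as softmax must.
    shifted = np.exp(x) - 1.0
    num = np.maximum(shifted, 0.0)        # relu(e^x - 1)
    den = np.abs(shifted).sum() + eps     # sum of |e^x - 1|
    return num / den

scores = np.array([2.0, 0.0, -1.0])
weights = softpick(scores)                # zero weight at indices 1 and 2
```

Unlike softmax, the weights need not sum to one, which is exactly what lets attention "switch off" without a sink.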
Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels
Code for the paper "On the Expressivity Role of LayerNorm in Transformers' Attention" (Findings of ACL 2023)
Toxic Data Augmentation via LLM-Guided Directional Adversarial Generation
🤗 ml-intern: an open-source ML engineer that reads papers, trains models, and ships them
Official code for the paper "Detoxification for LLM: From Dataset Itself", built on transformers.
[ACL 2026 Findings] E2E-GMNER: End-to-End Generative Grounded Multimodal Named Entity Recognition
Official PyTorch implementation of Sessa: Selective State Space Attention for long-context sequence modeling.
[ACL 2026] How Tokenization Limits Phonological Knowledge Representation in Language Models and How to Improve Them
Lucebox: an LLM inference server built for speed on specific consumer hardware.
BenGER - Research-grade benchmarking for LLMs in the German legal domain
🪨 why use many token when few token do trick — Claude Code skill that cuts 65% of tokens by talking like caveman