89 tools for Korean law — statutes, precedents, ordinances, interpretations | MCP Server · CLI · npm

TypeScript 1,048 155 Updated Apr 3, 2026

REAP: Router-weighted Expert Activation Pruning for SMoE compression

Python 312 56 Updated Apr 1, 2026

🍀 Codebase for CloverLM

Python 6 Updated Apr 2, 2026

Implementation of Fast Weight Attention

Python 24 1 Updated Mar 25, 2026
Jupyter Notebook 1 Updated Mar 30, 2026

Official implementation for Training LLMs with MXFP4

Python 123 16 Updated Apr 25, 2025

Implements harmful/harmless refusal removal using pure HF Transformers

Python 1,752 277 Updated Nov 27, 2025

Train the smallest LM you can that fits in 16MB. Best model wins!

Python 1 Updated Mar 19, 2026
Python 619 46 Updated Apr 2, 2026
Python 1 Updated Mar 30, 2026

Comparative study and experimentation on standard vs mHC vs attention residual (full and block)

Python 13 2 Updated Mar 17, 2026

A light-weight and powerful meta-prompting, context engineering and spec-driven development system for Claude Code by TÂCHES.

JavaScript 47,294 3,860 Updated Apr 3, 2026

Original reference implementation of the CUDA rasterizer from the paper "StopThePop: Sorted Gaussian Splatting for View-Consistent Real-time Rendering"

Cuda 57 8 Updated Jun 20, 2024

An unofficial implementation of absGS

Python 124 5 Updated Apr 21, 2024

Open-source framework for turning expert knowledge into PII-free synthetic conversational data and production-ready LoRA adapters.

Python 55 1 Updated Mar 24, 2026

An agent for CUDA compute-communication kernel co-design

Cuda 33 3 Updated Mar 24, 2026

A lightweight inference engine supporting speculative speculative decoding (SSD).

Python 845 64 Updated Mar 22, 2026

Repo for FastAPI

Jupyter Notebook 1 Updated Mar 2, 2024

Open-source CUDA compiler targeting multiple GPU architectures. Compiles .cu to AMD and Tenstorrent GPUs.

C 1,537 67 Updated Mar 25, 2026

Voice-to-text app for macOS to transcribe what you say to text almost instantly

Swift 4,433 600 Updated Apr 1, 2026

Agent harness to publish your history from Claude Code et al. as Hugging Face datasets.

Python 2,029 234 Updated Apr 3, 2026

Accelerated Asm Python

Assembly 125 6 Updated Mar 18, 2026

A collection of research papers on low-precision training methods

64 2 Updated May 10, 2025

The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".

924 36 Updated Mar 10, 2026

Kagi Small Web

HTML 1,448 604 Updated Apr 3, 2026

Shared Middle-Layer for Triton Compilation

MLIR 329 93 Updated Dec 5, 2025

Qwen3-0.6B megakernel: 527 tok/s decode on RTX 3090 (3.8x faster than PyTorch)

Cuda 84 6 Updated Feb 10, 2026

Polish Dataset of Banned Harmful and Offensive Content from Wykop.pl web service

Python 62 5 Updated Jan 29, 2025

Pure C inference of Mistral Voxtral Realtime 4B speech to text model

C 1,590 108 Updated Feb 15, 2026

Implementation of the fast weight product key memory from Sakana AI

Python 16 1 Updated Apr 1, 2026