Skip to content
View appoose's full-sized avatar

Block or report appoose

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Easy to use, High Performant Knowledge Distillation for LLMs

Python 97 8 Updated May 5, 2025

Train transformer language models with reinforcement learning.

Python 17,933 2,616 Updated Apr 5, 2026

Structured Outputs

Python 13,632 676 Updated Mar 26, 2026

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 5,475 500 Updated Feb 23, 2026

Building a quick conversation-based search demo with Lepton AI.

TypeScript 8,103 1,009 Updated Dec 2, 2025

Fast low-bit matmul kernels in Triton

Python 443 33 Updated Apr 4, 2026

Efficient Triton Kernels for LLM Training

Python 6,260 509 Updated Apr 3, 2026

Aana SDK is a powerful framework for building AI enabled multimodal applications.

Python 58 8 Updated Aug 22, 2025

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 1,048 87 Updated Sep 4, 2024

Training LLMs with QLoRA + FSDP

Jupyter Notebook 1,538 202 Updated Nov 9, 2024

Script for HQQification of mixtral from HF's shards

Jupyter Notebook 2 1 Updated Mar 2, 2024

Run Mixtral-8x7B models in Colab or consumer desktops

Python 2,330 230 Updated Apr 8, 2024

Official implementation of Half-Quadratic Quantization (HQQ)

Python 924 90 Updated Feb 26, 2026

llama.cpp with BakLLaVA model describes what does it see

Python 379 42 Updated Nov 8, 2023

Streamlit Component, for a Chatbot UI

JavaScript 1,098 273 Updated Aug 19, 2024

A tiny Scala library to send events and entities to Apache Flume.

Scala 6 1 Updated May 7, 2020

Crawl popular image search engines ( google, bing, 500px , flickr ) for images given a query

Python 12 4 Updated Feb 24, 2013

A collection of routines for building web APIs on top of Tornado web server.

Python 13 1 Updated Feb 15, 2013