Skip to content
View vovler's full-sized avatar

Highlights

  • Pro

Block or report vovler

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

sqlite3 driver for go using database/sql

C 5 Updated Dec 4, 2025

A framework for efficient model inference with omni-modality models

Python 1,692 212 Updated Dec 25, 2025

Turso is an in-process SQL database, compatible with SQLite.

Rust 15,973 660 Updated Dec 25, 2025

Execution Time Analysis, Reroute Enhancement, Remote Python Logs, For ComfyUI developers.

JavaScript 170 14 Updated Nov 1, 2025

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 1,723 222 Updated Dec 25, 2025

High-performance C++/CUDA SDK for running Audio2Emotion and Audio2Face inference with integrated post-processing.

C++ 122 20 Updated Aug 28, 2025

ComfyUI-TBG-SAM3 A plug-and-play ComfyUI extension providing production-ready nodes for Meta’s SAM3 (Segment Anything Model 3) for text- or point-based segmentation, exhaustive mask generation, and…

Python 145 9 Updated Nov 29, 2025

TTS model capable of streaming conversational audio in realtime.

Python 930 77 Updated Nov 29, 2025

[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

Python 5,703 340 Updated Oct 28, 2025

Model Compression Toolbox for Large Language Models and Diffusion Models

Python 6 3 Updated Dec 5, 2025

Model Compression Toolbox for Large Language Models and Diffusion Models

Python 720 78 Updated Aug 14, 2025

SD.Next: All-in-one WebUI for AI generative image and video creation

Python 6,846 528 Updated Dec 25, 2025

https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

Python 1,295 89 Updated Mar 27, 2025

ComfyUI Plugin of Nunchaku

Python 2,612 123 Updated Dec 24, 2025

Optimized Go Compression Packages

Go 5,351 356 Updated Dec 2, 2025

Go middleware to compress HTTP responses with Gzip, Deflate, Brotli, Zstandard, XZ/LZMA2, LZ4, and more..

Go 83 9 Updated Dec 18, 2025

LLM inference in C/C++

C++ 91,990 14,244 Updated Dec 25, 2025

A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux.

Python 3,524 495 Updated Dec 24, 2025

Open-Source Frontier Voice AI

Python 19,036 2,101 Updated Dec 17, 2025

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 30,932 3,793 Updated Dec 25, 2025

Burn is a next generation tensor library and Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability.

Rust 13,736 758 Updated Dec 24, 2025

AMD's graph optimization engine.

C++ 268 111 Updated Dec 24, 2025

Specify a github or local repo, github pull request, arXiv or Sci-Hub paper, Youtube transcript or documentation URL on the web and scrape into a text file and clipboard for easier LLM ingestion

Python 1,867 172 Updated Nov 14, 2025

project for skyreels-a3

JavaScript 78 4 Updated Aug 9, 2025

A fast inference library for running LLMs locally on modern consumer-class GPUs

Python 4,397 325 Updated Dec 9, 2025

Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference in pure C/C++

C++ 4,965 483 Updated Dec 25, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 49,903 4,116 Updated Dec 23, 2025

Generate ARKit expression from audio in realtime

Python 172 30 Updated Oct 24, 2025

A lightweight WebGL Render for LAM and LAM_Audio2Expression

TypeScript 45 6 Updated Dec 25, 2025
Next