  • ElevenLabs
  • Trondheim, Norway
  • X @iver56

Organizations

@ninjadev

Starred repositories

DFlash: Block Diffusion for Flash Speculative Decoding

Python 5,194 373 Updated May 10, 2026

Disambiguate Japanese heteronyms

Python 34 7 Updated Oct 3, 2023

Official PyTorch implementation of the fundamental frequency estimator described in "Robust and Lightweight F0 Estimation Through Mid-Level Fusion of DSP-Informed Features", ICASSP 2026.

Python 14 2 Updated Apr 30, 2026

Rust bindings to libopus

C 20 14 Updated May 2, 2026

Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation

Python 720 29 Updated Jun 9, 2026

Adapters for external AAC and Opus decoders to be used with Symphonia

Rust 26 6 Updated Jun 20, 2026

JAX codebase for Evolutionary Strategies at the Hyperscale

Python 340 35 Updated Feb 27, 2026

Python toolkit for high-quality time and pitch processing

Python 69 1 Updated May 30, 2026

Bonsai Demo

Shell 868 96 Updated May 31, 2026

A library for panning and zooming elements using CSS transforms 🔍

TypeScript 2,435 421 Updated Jun 7, 2026

Perfect Green Screen Keys

Python 13,938 859 Updated May 28, 2026

AI agents running research on single-GPU nanochat training automatically

Python 88,141 12,759 Updated Mar 26, 2026

A SOTA Industrial-Grade Voice Activity Detection & Audio Event Detection, supporting 100+ languages, outperforming Silero-VAD, TEN-VAD, FunASR-VAD and WebRTC-VAD

Python 427 29 Updated May 6, 2026

🌋LavaSR: Fast Speech restoration and enhancement

Python 554 49 Updated Jun 19, 2026

The CMU Pronouncing Dictionary converted to IPA

96 12 Updated Jun 29, 2019

A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems

Python 251 44 Updated May 19, 2026

The most powerful local music generation model that outperforms almost all commercial alternatives, supporting Mac, AMD, Intel, and CUDA devices.

Python 11,245 1,367 Updated May 27, 2026

Variations of L1 SNR Loss function for training audio source separation machine learning models

Python 44 Updated May 1, 2026

On-device AI across mobile, embedded and edge for PyTorch

Python 4,747 1,039 Updated Jun 22, 2026

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…

Python 51,144 9,072 Updated Jun 22, 2026

HeartMuLa Official Repo: The Most Powerful Open-Source Music Generation Model of 2026

Python 3,710 412 Updated Apr 10, 2026

A lightning fast audio upsampler.

Python 775 73 Updated Feb 26, 2026

Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.

Swift 12,593 1,297 Updated May 22, 2026

Pure Mojo tokenizer for LLM inference - BPE, tiktoken, HuggingFace compatible

Mojo 4 1 Updated Jan 9, 2026

Python 2,971 340 Updated Jun 15, 2026

Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative modeling.

Python 100 12 Updated Jun 19, 2026

A highly compressive and high-quality neural audio codec for speech models.

Python 268 26 Updated Jan 23, 2026

Runs 405B LLMs on 8GB VRAM

Jupyter Notebook 3,022 235 Updated Apr 2, 2026

[CVPR 2025] Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

Python 1,583 93 Updated May 7, 2025

Elasto Mania Classic Source Code

C++ 5 9 Updated Jun 22, 2026