Skip to content
View voidful's full-sized avatar
๐ŸŽฏ
Focusing
๐ŸŽฏ
Focusing

Sponsors

@ga642381

Block or report voidful

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DACVAE

Python 102 10 Updated Dec 19, 2025

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how tโ€ฆ

Python 2,064 145 Updated Dec 19, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,924 125 Updated Dec 18, 2025

Production ready toolkit to run AI locally

Kotlin 3,627 88 Updated Dec 19, 2025

๐Ÿฆ„๏ธ ๐ŸŽƒ ๐Ÿ‘ป Clash Premium ่ง„ๅˆ™้›†(RULE-SET)๏ผŒๅ…ผๅฎน ClashX Proใ€Clash for Windows ็ญ‰ๅŸบไบŽ Clash Premium ๅ†…ๆ ธ็š„ๅฎขๆˆท็ซฏใ€‚

23,533 1,978 Updated Dec 19, 2025

Roo Code gives you a whole dev team of AI agents in your code editor.

TypeScript 21,295 2,679 Updated Dec 19, 2025
Python 425 28 Updated Nov 27, 2025

Hong Kong Location Knowledge Base

1 Updated Nov 20, 2025

Mustango: Toward Controllable Text-to-Music Generation

Python 385 32 Updated Jun 2, 2025

Ultralytics YOLO ๐Ÿš€

Python 50,125 9,678 Updated Dec 19, 2025

A Cloudflare Worker that integrates with a Telegram Bot to filter spam and manage silence consensus polls.

TypeScript 6 1 Updated May 6, 2025

[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. ๐Ÿ”ฅ ๐Ÿ”ฅ ๐Ÿ”ฅ

Python 4,611 542 Updated Dec 3, 2025
Jupyter Notebook 59 5 Updated Oct 22, 2025

Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages

Python 2,487 213 Updated Dec 16, 2025

This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Lanโ€ฆ

Python 70 3 Updated Sep 21, 2025

FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates

Python 33 3 Updated Nov 4, 2025

Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation

Python 406 28 Updated Nov 27, 2025

A Conversational Speech Generation Model

Python 14,368 1,458 Updated May 27, 2025

Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"

Jupyter Notebook 165 4 Updated Dec 17, 2025

Official implementation of "Continuous Autoregressive Language Models"

Python 672 80 Updated Dec 1, 2025

Metrics for evaluating music and audio generative models โ€“ with a focus on long-form, full-band, and stereo generations.

Python 271 23 Updated Dec 6, 2025
Python 5 4 Updated Dec 19, 2025

This is the official repo for the paper "LongCat-Flash-Omni Technical Report"

Python 443 24 Updated Dec 15, 2025

A list of tools, papers and code related to Fake Audio Detection.

206 11 Updated Dec 10, 2025

"AI-Trader: Can AI Beat the Market?" Live Trading Bench: https://ai4trade.ai Tech Report Link: https://arxiv.org/abs/2512.10971

Python 10,112 1,607 Updated Dec 19, 2025

NOF0 - ๅผ€ๆบ็š„ AI ไบคๆ˜“็ซžๆŠ€ๅœบ

Go 2,773 439 Updated Dec 7, 2025
Python 248 26 Updated May 19, 2025

Official code of ConfTuner: Training Large Language Models to Express Their Confidence Verbally

Python 17 Updated Sep 26, 2025

Trainging, inference, and testing of the SAC speech codec model.

Python 92 6 Updated Nov 1, 2025

An All-in-One Speech, Sound, Music Codec with Single Nested Codebook

Python 22 1 Updated Oct 11, 2025
Next