Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music/song recognition, language detection and timestamp prediction.

Python 1,216 91 Updated Jan 30, 2026

Chinese NER problem that needs to capture 18 types of entities in medical conversation text. The process is divided into 4 parts that are encapsulated in high-level abstract classes. We control the…

Python 6 2 Updated Oct 1, 2021

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 159,711 24,876 Updated Feb 4, 2026

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 6,786 815 Updated Feb 4, 2026

Ke-Omni-R is an advanced audio reasoning model that achieved SOTA on MMAU

Python 60 1 Updated Jun 11, 2025

sharing current agents in use

11,972 2,522 Updated Jul 28, 2025

Browser automation CLI for AI agents

TypeScript 12,496 707 Updated Feb 3, 2026

This is a database of 300,000+ symbols containing Equities, ETFs, Funds, Indices, Currencies, Cryptocurrencies and Money Markets.

Python 6,820 714 Updated Feb 1, 2026

FLM-Audio is an audio-language subversion of RoboEgo/FLM-Ego -- an omnimodal model with native full duplexity.

Python 59 8 Updated Dec 9, 2025

CodMate is a macOS SwiftUI app for managing CLI AI sessions: browse, search, organize, resume, and review work produced by Codex, Claude Code, and Gemini CLI. It focuses on speed, a compact three-c…

Swift 589 35 Updated Jan 31, 2026

A high-performance, 100% client-side tool for removing Gemini AI watermarks. Built with pure JavaScript, it leverages a mathematically precise Reverse Alpha Blending algorithm rather than unpredict…

JavaScript 2,620 302 Updated Jan 16, 2026

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a ca…

Python 1,876 314 Updated Mar 14, 2023

Pixio: a capable vision encoder dedicated to dense prediction, simply by pixel reconstruction

Python 345 9 Updated Jan 22, 2026

DACVAE

Python 190 15 Updated Dec 22, 2025

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,267 275 Updated Jan 5, 2026

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 2,137 141 Updated Jan 22, 2026

Production ready toolkit to run AI locally

Kotlin 5,563 182 Updated Feb 4, 2026

🦄️ 🎃 👻 Clash Premium rule sets (RULE-SET), compatible with ClashX Pro, Clash for Windows, and other clients based on the Clash Premium core.

24,251 2,010 Updated Feb 3, 2026

Roo Code gives you a whole dev team of AI agents in your code editor.

TypeScript 22,093 2,850 Updated Feb 4, 2026
Python 578 40 Updated Jan 15, 2026

Hong Kong Location Knowledge Base

1 Updated Nov 20, 2025

Mustango: Toward Controllable Text-to-Music Generation

Python 387 34 Updated Jun 2, 2025

Ultralytics YOLO 🚀

Python 52,888 10,127 Updated Feb 4, 2026

A Cloudflare Worker that integrates with a Telegram Bot to filter spam and manage silence consensus polls.

TypeScript 6 1 Updated May 6, 2025

[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥

Python 4,824 562 Updated Dec 3, 2025
Jupyter Notebook 59 5 Updated Oct 22, 2025

Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages

Python 2,624 229 Updated Dec 30, 2025

This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Lan…

Python 75 3 Updated Jan 25, 2026

FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates

Python 40 3 Updated Nov 4, 2025

Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation

Python 428 27 Updated Nov 27, 2025