voidful

🎯

Focusing

Eric Lam voidful

🎯

Focusing

👩‍🎓PhD@NTU Speech Lab. Formerly, Microsoft Research Intern.

411 followers · 327 following

Highlights

Developer Program Member
Pro

Lists (1)

Sort

instruction dataset

8 repositories

Stars

Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music/song recognition, language detection and timestamp prediction.

Python 1,216 91 Updated Jan 30, 2026

windsuzu / AICUP-Deidentification-of-Medical-Data

Chinese NER problem that needs to capture 18 types of entities in medical conversation text. The process is divided into 4 parts that are encapsulated in high-level abstract classes. We control the…

Python 6 2 Updated Oct 1, 2021

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 159,711 24,876 Updated Feb 4, 2026

QwenLM / Qwen3-TTS

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 6,786 815 Updated Feb 4, 2026

shuaijiang / Ke-Omni-R

Ke-Omni-R is an advanced audio reasoning model and achieved SOTA on MMAU

Python 60 1 Updated Jun 11, 2025

contains-studio / agents

sharing current agents in use

11,972 2,522 Updated Jul 28, 2025

vercel-labs / agent-browser

Browser automation CLI for AI agents

TypeScript 12,496 707 Updated Feb 3, 2026

JerBouma / FinanceDatabase

This is a database of 300.000+ symbols containing Equities, ETFs, Funds, Indices, Currencies, Cryptocurrencies and Money Markets.

Python 6,820 714 Updated Feb 1, 2026

cofe-ai / flm-audio

FLM-Audio is a audio-language subversion of RoboEgo/FLM-Ego -- an omnimodal model with native full duplexity.

Python 59 8 Updated Dec 9, 2025

loocor / codmate

CodMate is a macOS SwiftUI app for managing CLI AI sessions: browse, search, organize, resume, and review work produced by Codex, Claude Code, and Gemini CLI. It focuses on speed, a compact three-c…

Swift 589 35 Updated Jan 31, 2026

journey-ad / gemini-watermark-remover

A high-performance, 100% client-side tool for removing Gemini AI watermarks. Built with pure JavaScript, it leverages a mathematically precise Reverse Alpha Blending algorithm rather than unpredict…

JavaScript 2,620 302 Updated Jan 16, 2026

facebookresearch / denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…

Python 1,876 314 Updated Mar 14, 2023

facebookresearch / pixio

Pixio: a capable vision encoder dedicated to dense prediction, simply by pixel reconstruction

Python 345 9 Updated Jan 22, 2026

facebookresearch / dacvae

DACVAE

Python 190 15 Updated Dec 22, 2025

facebookresearch / sam-audio

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,267 275 Updated Jan 5, 2026

facebookresearch / perception_models

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 2,137 141 Updated Jan 22, 2026

RunanywhereAI / runanywhere-sdks

Production ready toolkit to run AI locally

Kotlin 5,563 182 Updated Feb 4, 2026

Loyalsoldier / clash-rules

🦄️ 🎃 👻 Clash Premium 规则集(RULE-SET)，兼容 ClashX Pro、Clash for Windows 等基于 Clash Premium 内核的客户端。

24,251 2,010 Updated Feb 3, 2026

RooCodeInc / Roo-Code

Roo Code gives you a whole dev team of AI agents in your code editor.

TypeScript 22,093 2,850 Updated Feb 4, 2026

stepfun-ai / Step-Audio-R1

Python 578 40 Updated Jan 15, 2026

hon9kon9ize / hk-location-kb

Hong Kong Location Knowledge Base

1 Updated Nov 20, 2025

AMAAI-Lab / mustango

Mustango: Toward Controllable Text-to-Music Generation

Python 387 34 Updated Jun 2, 2025

ultralytics / ultralytics

Ultralytics YOLO 🚀

Python 52,888 10,127 Updated Feb 4, 2026

gaplo917 / cf-glt-admin-bot

A Cloudflare Worker that integrates with a Telegram Bot to filter spam and manage silence consensus polls.

TypeScript 6 1 Updated May 6, 2025

lyuwenyu / RT-DETR

[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥

Python 4,824 562 Updated Dec 3, 2025

3loi / NaturalVoices

Jupyter Notebook 59 5 Updated Oct 22, 2025

facebookresearch / omnilingual-asr

Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages

Python 2,624 229 Updated Dec 30, 2025

AmphionTeam / TaDiCodec

This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Lan…

Python 75 3 Updated Jan 25, 2026

amphionspace / FlexiCodec

FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates

Python 40 3 Updated Nov 4, 2025

inclusionAI / Ming-UniAudio

Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation

Python 428 27 Updated Nov 27, 2025

Eric Lam voidful

Sponsors

Highlights

Lists (1)

instruction dataset

Stars