Skip to content
View nanless's full-sized avatar

Block or report nanless

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates

Python 25 2 Updated Nov 4, 2025

MindSpider:专为舆情分析设计的AI爬虫

Python 52 20 Updated Oct 15, 2025

从0实现一个简洁清晰的Deep Search Agent

Python 299 71 Updated Aug 19, 2025

微舆:人人可用的多Agent舆情分析助手,打破信息茧房,还原舆情原貌,预测未来走向,辅助决策!从0实现,不依赖任何框架。

Python 14,269 2,451 Updated Nov 5, 2025

open-vocabulary sound event detection

Python 30 2 Updated Nov 4, 2025

This is the official repo for the paper "LongCat-Flash-Omni Technical Report"

Python 339 16 Updated Nov 4, 2025

SoulX Podcast TTS Metal Test

Python 44 5 Updated Oct 30, 2025

[ACM MM 2025] AlignDiT: Multimodal Aligned Diffusion Transformer for Synchronized Speech Generation

Python 22 2 Updated Oct 28, 2025

A Python Library for Full Reference Binaural Fidelity Testing, Visualization & Feature Generation

Python 12 3 Updated Oct 30, 2025

Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"

Python 29 3 Updated Oct 30, 2025

The code of QaSNet

Python 3 1 Updated Sep 23, 2025

Official implementation of Decoupled MeanFlow

Python 20 1 Updated Oct 28, 2025

SAM 2++: Tracking Anything at Any Granularity

Python 23 2 Updated Nov 4, 2025

Transcription, forced alignment, and audio indexing with OpenAI's Whisper

Python 2,059 218 Updated Oct 29, 2025

[INTERSPEECH 2025 Oral]Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"

Python 63 3 Updated Jun 16, 2025

"AI-Trader: Can AI Beat the Market?" Live Trading Bench: https://ai4trade.ai

Python 8,855 1,258 Updated Nov 5, 2025

Zotero MCP: Connects your Zotero research library with Claude and other AI assistants via the Model Context Protocol to discuss papers, get summaries, analyze citations, and more.

Python 670 60 Updated Oct 24, 2025

Official Repository of UltraVoice

JavaScript 44 1 Updated Oct 28, 2025

SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.

Python 1,606 166 Updated Nov 4, 2025

Code for the blog "Neural audio codecs: how to get audio into LLMs"

Python 122 3 Updated Oct 20, 2025

Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark

Python 33 4 Updated May 7, 2025

An interface library for RL post training with environments.

Python 610 83 Updated Nov 4, 2025

An easy-to-use, fast, and easily integrable tool for evaluating audio LLM

Python 160 8 Updated Oct 31, 2025

PyTorch-native post-training at scale

Python 494 50 Updated Nov 5, 2025

中国市场分析脚本是一个功能强大的Python工具,旨在为用户提供对中国A股市场的深入分析。该脚本利用Akshare库从多种数据源获取实时和历史股票数据,并计算关键财务指标,以帮助投资者做出明智的决策。

Python 19 1 Updated Oct 10, 2025

OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT

Python 421 80 Updated Sep 26, 2025

Vogent Turn: fast, open-source turn-detection for Voice AI applications

Python 33 2 Updated Oct 28, 2025
Python 16 2 Updated Oct 16, 2025
Next