-
Southern University of Science and Technology
- 1088 Xueyuan Avenue, Shenzhen 518055, P.R. China
Highlights
- Pro
Stars
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…
Lightweight coding agent that runs in your terminal
Baseline Recipe for VoicePrivacy Challenge 2026: anonymization systems and evaluation software
A simple package for Guided source separation (GSS)
The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…
GitHub Copilot & Copilot Chat plugin for Typora on both Windows, macOS and Linux.
Simple example of using a CSI-Camera (like the Raspberry Pi Version 2 camera) with the NVIDIA Jetson Developer Kit
Jetson Nano with Ubuntu 20.04 image
⭐AI-driven public opinion & trend monitor with multi-platform aggregation, RSS, and smart alerts.🎯 告别信息过载,你的 AI 舆情监控助手与热点筛选工具!聚合多平台热点 + RSS 订阅,支持关键词精准筛选。AI 智能筛选新闻 + AI 翻译 + AI 分析简报直推手机,也支持接入 MCP 架构…
Asterinas aims to be a production-grade Linux alternative—memory safe, high-performance, and more.
Adaptive Flow-Matching for Target Speaker Extraction
[ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation
Official code for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
Official implementation of 'MeanSE: Efficient Generative Speech Enhancement with Mean Flows'
(ICASSP 2025, official code)FlowSE: Flow Matching-based Speech Enhancement
Multilingual Voice Understanding Model
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
A special academic or professional LaTeX resume template
A fluid simulator using Lattice-Boltzmann Method with simple and convenient GUI for educational purpose. 一个拥有漂亮易用的GUI的、使用格子玻尔兹曼法的、教育用途的流体力学数值计算和动画展示程序。
Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.
Official baseline, dataset and evaluation scripts for the ICASSP 2026 URGENT challenge.
Unified automatic quality assessment for speech, music, and sound.
Official baseline for ICASSP 2026 URGENT Challenge Track 2 (Speech Quality Assessment)