Audio & Speech Processing
Brno University of Technology
Yifan Peng
pyf98
Senior Research Scientist at NVIDIA | Multimodal LLMs, Speech-to-Speech
NVIDIA Santa Clara, CA
Dominik Klement
domklement
Junior researcher in ASR, Neural Audio Compression, Speaker Diarization,
Research Intern at CLSP Johns Hopkins,
Junior Researcher at BUT Speech
Ruoyu Wang
rywang99
Ph.D. @SPRATeam-USTC. Long-term intern @iflytek.
University of Science and Technology of China
Hervé BREDIN
hbredin
The "pyannote" guy. Co-Founder and CSO of @pyannoteai. Currently on leave from CNRS
@pyannoteai and CNRS France
liangdi
DiLiangWU
Joint PhD student at Zhejiang University & Westlake University.
Research interests: speaker diarization, speaker recognition, speech and language processing.
Westlake University Hangzhou
Xu Tan (谭旭)
tan-xu
ex Principal Researcher and Research Manager at Microsoft Research Asia, working on LLMs, multimodality, and generative AI for video and audio.
Beijing, China
Audio-WestlakeU
Audio-WestlakeU
Audio Signal and Information Processing Lab at Westlake University
Hangzhou
JunyiPeng
JunyiPeng00
Speaker verification, multichannel processing.
Brno University of Technology Czech Republic
Jonathan Le Roux
Jonathan-LeRoux
Speech & Audio Senior Team Leader at MERL
@merlresearch Cambridge, MA
Changsheng Quan
quancs
PhD, Zhejiang University & Westlake University. Main interests: Reinforcement Learning & AI Infra.
WestlakeU Hangzhou, Zhejiang, PRC
Ziyang Ma
ddlBoJack
Ph.D.ing | Research Intern @QwenLM | Prev: @ByteDance-Seed @alibaba-damo-academy @microsoft @megvii-research
maum.ai
maum-ai
maum.ai provides an AI platform and various AI engines based on deep machine learning.
Pangyo Seongnam-si Gyeonggi-do Korea