Skip to content
View zengchang233's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report zengchang233

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
zengchang233/README.md

Chang Zeng | 曾畅

Senior Research Scientist at Shanda AI Research Tokyo.

I work on generative audio, voice LLMs, and multimodal foundation models.

With 7+ years of experience from research to production, I build product-ready speech and audio AI systems across expressive TTS, full-duplex speech systems, codec and tokenization design, large-scale multi-GPU training, and deployment.

Current Focus

  • Generative audio and voice LLMs
  • Multimodal foundation models
  • Speech and singing voice generation
  • Speaker recognition and anti-spoofing
  • Audio separation and enhancement

Experience

  • 2025.09 - Present: Senior Research Scientist, Shanda AI Research Tokyo
  • 2024.04 - 2025.08: Multimodal Generative AI Researcher, Li Auto
  • 2023.09 - 2024.03: Speech ML Researcher (Intern), RevComm Inc.

Recent News

Selected Publications

Pinned Loading

  1. nii-yamagishilab/Attention_Backend_for_ASV nii-yamagishilab/Attention_Backend_for_ASV Public

    Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances

    Python 50 7

  2. xiaoicesing2 xiaoicesing2 Public

    The source code for the paper XiaoiceSing2 (interspeech2023)

    Python 49 3

  3. CrossSinger CrossSinger Public

    The source code for the paper CrossSinger (asru2023)

    18 2