default search action

combined dblp search
author search
venue search
publication search

ask others

Yuzhe Liang

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2026
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2601-13802
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2601-13802
Yushen Chen, Junzhe Liu, Yujie Tu, Zhikang Niu, Yuzhe Liang, Kai Yu, Chunyu Qiang, Chen Zhang, Xie Chen:
Habibi: Laying the Open-Source Foundation of Unified-Dialectal Arabic Speech Synthesis. CoRR abs/2601.13802 (2026)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2603-11089
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2603-11089
Nolan Chan, Timmy Gang, Yongqian Wang, Yuzhe Liang, Dingdong Wang:
V2A-DPO: Omni-Preference Optimization for Video-to-Audio Generation. CoRR abs/2603.11089 (2026)
2025
[c6]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/Chen0YLLXN00L0025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/Chen0YLLXN00L0025
Wenxi Chen, Ziyang Ma, Ruiqi Yan, Yuzhe Liang, Xiquan Li, Ruiyang Xu, Zhikang Niu, Yanqiao Zhu, Yifan Yang, Zhanxun Liu, Kai Yu, Yuxuan Hu, Jinyu Li, Yan Lu, Shujie Liu, Xie Chen:
SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training. ACL (Findings) 2025: 2262-2282
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Chen0LXLZ0025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Chen0LXLZ0025
Wenxi Chen, Ziyang Ma, Xiquan Li, Xuenan Xu, Yuzhe Liang, Zhisheng Zheng, Kai Yu, Xie Chen:
SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs. ICASSP 2025: 1-5
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiC0XLZK025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiC0XLZK025
Xiquan Li, Wenxi Chen, Ziyang Ma, Xuenan Xu, Yuzhe Liang, Zhisheng Zheng, Qiuqiang Kong, Xie Chen:
DRCap: Decoding CLAP Latents with Retrieval-Augmented Generation for Zero-shot Audio Captioning. ICASSP 2025: 1-5
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-13032
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-13032
Ziyang Ma, Yinghao Ma, Yanqiao Zhu, Chen Yang, Yi-Wen Chao, Ruiyang Xu, Wenxi Chen, Yuanzhe Chen, Zhuo Chen, Jian Cong, Kai Li, Keliang Li, Siyou Li, Xinfeng Li, Xiquan Li, Zheng Lian, Yuzhe Liang, Minghao Liu, Zhikang Niu, Tianrui Wang, Yuping Wang, Yuxuan Wang, Yihao Wu, Guanrou Yang, Jianwei Yu, Ruibin Yuan, Zhisheng Zheng, Ziya Zhou, Haina Zhu, Wei Xue, Emmanouil Benetos, Kai Yu, Chng Eng Siong, Xie Chen:
MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix. CoRR abs/2505.13032 (2025)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-19774
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-19774
Jun Wang, Xijuan Zeng, Chunyu Qiang, Ruilong Chen, Shiyao Wang, Le Wang, Wangjing Zhou, Pengfei Cai, Jiahui Zhao, Nan Li, Zihan Li, Yuzhe Liang, Xiaopeng Wang, Haorui Zheng, Ming Wen, Kang Yin, Yiran Wang, Nan Li, Feng Deng, Liang Dong, Chen Zhang, Di Zhang, Kun Gai:
Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation. CoRR abs/2506.19774 (2025)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2508-06098
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-06098
Xiquan Li, Junxi Liu, Yuzhe Liang, Zhikang Niu, Wenxi Chen, Xie Chen:
MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows. CoRR abs/2508.06098 (2025)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-16841
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-16841
Wenxi Chen, Xinsheng Wang, Ruiqi Yan, Yushen Chen, Zhikang Niu, Ziyang Ma, Xiquan Li, Yuzhe Liang, Hanlin Wen, Shunshun Yin, Ming Tao, Xie Chen:
SAC: Neural Speech Codec with Semantic-Acoustic Dual-Stream Quantization. CoRR abs/2510.16841 (2025)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2511-18487
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-18487
Chunyu Qiang, Kang Yin, Xiaopeng Wang, Yuzhe Liang, Jiahui Zhao, Ruibo Fu, Tianrui Wang, Cheng Gong, Chen Zhang, Longbiao Wang, Jianwu Dang:
InstructAudio: Unified speech and music generation with natural language instruction. CoRR abs/2511.18487 (2025)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2512-04720
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-04720
Xiaopeng Wang, Chunyu Qiang, Ruibo Fu, Zhengqi Wen, Xuefei Liu, Yukun Liu, Yuzhe Liang, Kang Yin, Yuankun Xie, Heng Xie, Chenxing Li, Chen Zhang, Changsheng Li:
M3-TTS: Multi-modal DiT Alignment & Mel-latent for Zero-shot High-fidelity Speech Synthesis. CoRR abs/2512.04720 (2025)
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2512-09504
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-09504
Kang Yin, Chunyu Qiang, Sirui Zhao, Xiaopeng Wang, Yuzhe Liang, Pengfei Cai, Tong Xu, Chen Zhang, Enhong Chen:
DMP-TTS: Disentangled multi-modal Prompting for Controllable Text-to-Speech with Chained Guidance. CoRR abs/2512.09504 (2025)
2024
[j1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/bmcbi/Liang24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/bmcbi/Liang24
Yuzhe Liang:
Application of multi-scale dynamic enhancement based on deep neural network and CT urinary tract secretory phase image fusion in the diagnosis of urinary system diseases. BMC Bioinform. 25(1) (2024)
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/HuangJHZQCLFZLCLQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/HuangJHZQCLFZLCLQ24
Wen Huang, Anbai Jiang, Bing Han, Xinhu Zheng, Yihong Qiu, Wenxi Chen, Yuzhe Liang, Pingyi Fan, Wei-Qiang Zhang, Cheng Lu, Xie Chen, Jia Liu, Yanmin Qian:
Semi-Supervised Acoustic Scene Classification with Test-Time Adaptation. ICME Workshops 2024: 1-5
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/LiangCJQZHHQFZCLC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/LiangCJQZHHQFZCLC24
Yuzhe Liang, Wenxi Chen, Anbai Jiang, Yihong Qiu, Xinhu Zheng, Wen Huang, Bing Han, Yanmin Qian, Pingyi Fan, Wei-Qiang Zhang, L. Cheng, Jia Liu, Xie Chen:
Improving Acoustic Scene Classification via Self-Supervised and Semi-Supervised Learning with Efficient Audio Transformer. ICME Workshops 2024: 1-6
[c1]
- view
  - electronic edition @ ijcai.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ijcai/ChenLMZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/ChenLMZ024
Wenxi Chen, Yuzhe Liang, Ziyang Ma, Zhisheng Zheng, Xie Chen:
EAT: Self-Supervised Pre-Training with Efficient Audio Transformer. IJCAI 2024: 3807-3815
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-03497
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-03497
Wenxi Chen, Yuzhe Liang, Ziyang Ma, Zhisheng Zheng, Xie Chen:
EAT: Self-Supervised Pre-Training with Efficient Audio Transformer. CoRR abs/2401.03497 (2024)
[i2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-09472
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-09472
Xiquan Li, Wenxi Chen, Ziyang Ma, Xuenan Xu, Yuzhe Liang, Zhisheng Zheng, Qiuqiang Kong, Xie Chen:
DRCap: Decoding CLAP Latents with Retrieval-augmented Generation for Zero-shot Audio Captioning. CoRR abs/2410.09472 (2024)
[i1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-09503
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-09503
Wenxi Chen, Ziyang Ma, Xiquan Li, Xuenan Xu, Yuzhe Liang, Zhisheng Zheng, Kai Yu, Xie Chen:
SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs. CoRR abs/2410.09503 (2024)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.