default search action

combined dblp search
author search
venue search
publication search

ask others

Zehan Wang 0001

> Home > Persons

Person information

affiliation: Zhejiang University, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/ChengHLWJYCDHZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ChengHLWJYCDHZ24
Xize Cheng, Rongjie Huang, Linjun Li, Zehan Wang, Tao Jin, Aoxiong Yin, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao:
TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation. ACL (Findings) 2024: 9973-9986
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/HuangZWYTYLW0CS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/HuangZWYTYLW0CS24
Rongjie Huang, Chunlei Zhang, Yongqi Wang, Dongchao Yang, Jinchuan Tian, Zhenhui Ye, Luping Liu, Zehan Wang, Ziyue Jiang, Xuankai Chang, Jiatong Shi, Chao Weng, Zhou Zhao, Dong Yu:
Make-A-Voice: Revisiting Voice Large Language Models as Scalable Multilingual and Multitask Learners. ACL (1) 2024: 10929-10942
[c8]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/0001ZCHLYHZ0GZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/0001ZCHLYHZ0GZ24
Zehan Wang, Ziang Zhang, Xize Cheng, Rongjie Huang, Luping Liu, Zhenhui Ye, Haifeng Huang, Yang Zhao, Tao Jin, Peng Gao, Zhou Zhao:
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion. ICML 2024
[c7]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/HuangHW0C0YYLGZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/HuangHW0C0YYLGZ24
Rongjie Huang, Ruofan Hu, Yongqi Wang, Zehan Wang, Xize Cheng, Ziyue Jiang, Zhenhui Ye, Dongchao Yang, Luping Liu, Peng Gao, Zhou Zhao:
InstructSpeech: Following Speech Editing Instructions via Large Language Models. ICML 2024
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangWHXHYC00YL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangWHXHYC00YL24
Rongjie Huang, Yongqi Wang, Ruofan Hu, Xiaoshan Xu, Zhiqing Hong, Dongchao Yang, Xize Cheng, Zehan Wang, Ziyue Jiang, Zhenhui Ye, Luping Liu, Siqi Zheng, Zhou Zhao:
VoiceTuner: Self-Supervised Pre-training and Efficient Fine-tuning For Voice Generation. ACM Multimedia 2024: 10630-10639
[i15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-04883
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-04883
Zehan Wang, Ziang Zhang, Xize Cheng, Rongjie Huang, Luping Liu, Zhenhui Ye, Haifeng Huang, Yang Zhao, Tao Jin, Peng Gao, Zhou Zhao:
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion. CoRR abs/2405.04883 (2024)
[i14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-00320
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-00320
Yongqi Wang, Wenxiang Guo, Rongjie Huang, Jiawei Huang, Zehan Wang, Fuming You, Ruiqi Li, Zhou Zhao:
Frieren: Efficient Video-to-Audio Generation with Rectified Flow Matching. CoRR abs/2406.00320 (2024)
[i13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-18583
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-18583
Le Zhuo, Ruoyi Du, Han Xiao, Yangguang Li, Dongyang Liu, Rongjie Huang, Wenze Liu, Lirui Zhao, Fu-Yun Wang, Zhanyu Ma, Xu Luo, Zehan Wang, Kaipeng Zhang, Xiangyang Zhu, Si Liu, Xiangyu Yue, Dingning Liu, Wanli Ouyang, Ziwei Liu, Yu Qiao, Hongsheng Li, Peng Gao:
Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT. CoRR abs/2406.18583 (2024)
[i12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-11895
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-11895
Zehan Wang, Ziang Zhang, Hang Zhang, Luping Liu, Rongjie Huang, Xize Cheng, Hengshuang Zhao, Zhou Zhao:
OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces. CoRR abs/2407.11895 (2024)
[i11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-06734
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-06734
Zhenhui Ye, Tianyun Zhong, Yi Ren, Ziyue Jiang, Jiawei Huang, Rongjie Huang, Jinglin Liu, Jinzheng He, Chen Zhang, Zehan Wang, Xize Chen, Xiang Yin, Zhou Zhao:
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes. CoRR abs/2410.06734 (2024)
[i10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-21269
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-21269
Xize Cheng, Siqi Zheng, Zehan Wang, Minghui Fang, Ziang Zhang, Rongjie Huang, Ziyang Ma, Shengpeng Ji, Jialong Zuo, Tao Jin, Zhou Zhao:
OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup. CoRR abs/2410.21269 (2024)
2023
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/WangZHXZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/WangZHXZ23
Zehan Wang, Yang Zhao, Haifeng Huang, Yan Xia, Zhou Zhao:
Scene-robust Natural Language Video Localization via Learning Domain-invariant Representations. ACL (Findings) 2023: 144-160
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/WangHZLCZYZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/WangHZLCZYZ23
Zehan Wang, Haifeng Huang, Yang Zhao, Linjun Li, Xize Cheng, Yichen Zhu, Aoxiong Yin, Zhou Zhao:
3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding. EMNLP 2023: 10612-10625
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/WangHZLCZYZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/WangHZLCZYZ23
Zehan Wang, Haifeng Huang, Yang Zhao, Linjun Li, Xize Cheng, Yichen Zhu, Aoxiong Yin, Zhou Zhao:
Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding. ICCV 2023: 2662-2671
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/ChengJHLLWWLYZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/ChengJHLLWWLYZ23
Xize Cheng, Tao Jin, Rongjie Huang, Linjun Li, Wang Lin, Zehan Wang, Ye Wang, Huadai Liu, Aoxiong Yin, Zhou Zhao:
MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition. ICCV 2023: 15689-15699
[c1]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/WangZCHLYTLWZZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/WangZCHLYTLWZZ23
Zehan Wang, Yang Zhao, Xize Cheng, Haifeng Huang, Jiageng Liu, Aoxiong Yin, Li Tang, Linjun Li, Yongqi Wang, Ziang Zhang, Zhou Zhao:
Connecting Multi-modal Contrastive Representations. NeurIPS 2023
[i9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-05309
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-05309
Xize Cheng, Linjun Li, Tao Jin, Rongjie Huang, Wang Lin, Zehan Wang, Huangdai Liu, Ye Wang, Aoxiong Yin, Zhou Zhao:
MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition. CoRR abs/2303.05309 (2023)
[i8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-14381
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-14381
Zehan Wang, Yang Zhao, Xize Cheng, Haifeng Huang, Jiageng Liu, Li Tang, Linjun Li, Yongqi Wang, Aoxiong Yin, Ziang Zhang, Zhou Zhao:
Connecting Multi-modal Contrastive Representations. CoRR abs/2305.14381 (2023)
[i7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-09267
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-09267
Zehan Wang, Haifeng Huang, Yang Zhao, Linjun Li, Xize Cheng, Yichen Zhu, Aoxiong Yin, Zhou Zhao:
Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding. CoRR abs/2307.09267 (2023)
[i6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-13363
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-13363
Zehan Wang, Haifeng Huang, Yang Zhao, Linjun Li, Xize Cheng, Yichen Zhu, Aoxiong Yin, Zhou Zhao:
3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding. CoRR abs/2307.13363 (2023)
[i5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-08769
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-08769
Zehan Wang, Haifeng Huang, Yang Zhao, Ziang Zhang, Zhou Zhao:
Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes. CoRR abs/2308.08769 (2023)
[i4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-08884
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-08884
Zehan Wang, Ziang Zhang, Luping Liu, Yang Zhao, Haifeng Huang, Tao Jin, Zhou Zhao:
Extending Multi-modal Contrastive Representations. CoRR abs/2310.08884 (2023)
[i3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-08168
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-08168
Haifeng Huang, Zehan Wang, Rongjie Huang, Luping Liu, Xize Cheng, Yang Zhao, Tao Jin, Zhou Zhao:
Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers. CoRR abs/2312.08168 (2023)
[i2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-13633
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-13633
Haifeng Huang, Yang Zhao, Zehan Wang, Yan Xia, Zhou Zhao:
Multi-Modal Domain Adaptation Across Video Scenes for Temporal Video Grounding. CoRR abs/2312.13633 (2023)
[i1]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-15197
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-15197
Xize Cheng, Rongjie Huang, Linjun Li, Tao Jin, Zehan Wang, Aoxiong Yin, Minglei Li, Xinyu Duan, Changpeng Yang, Zhou Zhao:
TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation. CoRR abs/2312.15197 (2023)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.