Pan Zhang 0001

Person information

affiliation: Shanghai Artificial Intelligence Laboratory, Shanghai, China
affiliation: PIESAT Information Technology Co, Ltd., Beijing, China

2020 – today

2025
[j2]
  - https://dblp.org/rec/journals/tgrs/LuZHXNZYZWL25
Kaixuan Lu, Ruiqian Zhang, Xiao Huang, Yuxing Xie, Xiaogang Ning, Hanchao Zhang, Mengke Yuan, Pan Zhang, Tao Wang, Tongkui Liao:
Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing. IEEE Trans. Geosci. Remote. Sens. 63: 1-13 (2025)
[i48]
  - https://dblp.org/rec/journals/corr/abs-2501-03218
Rui Qian, Shuangrui Ding, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang:
Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction. CoRR abs/2501.03218 (2025)
[i47]
  - https://dblp.org/rec/journals/corr/abs-2501-03226
Beichen Zhang, Yuhong Liu, Xiaoyi Dong, Yuhang Zang, Pan Zhang, Haodong Duan, Yuhang Cao, Dahua Lin, Jiaqi Wang:
BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning. CoRR abs/2501.03226 (2025)
[i46]
  - https://dblp.org/rec/journals/corr/abs-2501-05510
Yifei Li, Junbo Niu, Ziyang Miao, Chunjiang Ge, Yuanhang Zhou, Qihao He, Xiaoyi Dong, Haodong Duan, Shuangrui Ding, Rui Qian, Pan Zhang, Yuhang Zang, Yuhang Cao, Conghui He, Jiaqi Wang:
OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? CoRR abs/2501.05510 (2025)
[i45]
  - https://dblp.org/rec/journals/corr/abs-2501-12368
Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Ziyu Liu, Shengyuan Ding, Shenxi Wu, Yubo Ma, Haodong Duan, Wenwei Zhang, Kai Chen, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model. CoRR abs/2501.12368 (2025)
[i44]
  - https://dblp.org/rec/journals/corr/abs-2501-16330
Ye Fang, Zeyi Sun, Shangzhan Zhang, Tong Wu, Yinghao Xu, Pan Zhang, Jiaqi Wang, Gordon Wetzstein, Dahua Lin:
RelightVid: Temporal-Consistent Diffusion Model for Video Relighting. CoRR abs/2501.16330 (2025)
[i43]
  - https://dblp.org/rec/journals/corr/abs-2502-05173
Xilin Wei, Xiaoran Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Jian Tong, Haodong Duan, Qipeng Guo, Jiaqi Wang, Xipeng Qiu, Dahua Lin:
VideoRoPE: What Makes for Good Video Rotary Position Embedding? CoRR abs/2502.05173 (2025)
[i42]
  - https://dblp.org/rec/journals/corr/abs-2502-08590
Yujie Zhou, Jiazi Bu, Pengyang Ling, Pan Zhang, Tong Wu, Qidong Huang, Jinsong Li, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Anyi Rao, Jiaqi Wang, Li Niu:
Light-A-Video: Training-free Video Relighting via Progressive Light Fusion. CoRR abs/2502.08590 (2025)
[i41]
  - https://dblp.org/rec/journals/corr/abs-2502-13128
Zihan Liu, Shuangrui Ding, Zhixiong Zhang, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang:
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation. CoRR abs/2502.13128 (2025)
[i40]
  - https://dblp.org/rec/journals/corr/abs-2504-06232
Jiazi Bu, Pengyang Ling, Yujie Zhou, Pan Zhang, Tong Wu, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang:
HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance. CoRR abs/2504.06232 (2025)
[i39]
  - https://dblp.org/rec/journals/corr/abs-2504-07957
Shengyuan Ding, Shenxi Wu, Xiangyu Zhao, Yuhang Zang, Haodong Duan, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Dahua Lin, Jiaqi Wang:
MM-IFEngine: Towards Multimodal Instruction Following. CoRR abs/2504.07957 (2025)
2024
[j1]
  - https://dblp.org/rec/journals/staeors/LiuPLZYLCZHWLJLLLLYCYTHSSVPH24
Guozhang Liu, Baochai Peng, Ting Liu, Pan Zhang, Mengke Yuan, Chaoran Lu, Ningning Cao, Sen Zhang, Simin Huang, Tao Wang, Xiaoqiang Lu, Licheng Jiao, Qiong Liu, Lingling Li, Fang Liu, Xu Liu, Yuting Yang, Kaiqiang Chen, Zhiyuan Yan, Deke Tang, Hai Huang, Michael Schmitt, Xian Sun, Gemine Vivone, Claudio Persello, Ronny Hänsch:
Large-Scale Fine-Grained Building Classification and Height Estimation for Semantic Urban Reconstruction: Outcome of the 2023 IEEE GRSS Data Fusion Contest. IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens. 17: 11194-11207 (2024)
[c18]
  - https://dblp.org/rec/conf/aaai/WangWHPZZDLLWH24
Bin Wang, Fan Wu, Xiao Han, Jiahui Peng, Huaping Zhong, Pan Zhang, Xiaoyi Dong, Weijia Li, Wei Li, Jiaqi Wang, Conghui He:
VIGC: Visual Instruction Generation and Correction. AAAI 2024: 5309-5317
[c17]
  - https://dblp.org/rec/conf/cvpr/LingCZC0Z24
Pengyang Ling, Lin Chen, Pan Zhang, Huaian Chen, Yi Jin, Jinjin Zheng:
FreeDrag: Feature Dragging for Reliable Point-Based Image Editing. CVPR 2024: 6860-6870
[c16]
  - https://dblp.org/rec/conf/cvpr/0002FWZZKXL024
Zeyi Sun, Ye Fang, Tong Wu, Pan Zhang, Yuhang Zang, Shu Kong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
Alpha-CLIP: A CLIP Model Focusing on Wherever you Want. CVPR 2024: 13019-13029
[c15]
  - https://dblp.org/rec/conf/cvpr/HuangDZ0H0L0Y24
Qidong Huang, Xiaoyi Dong, Pan Zhang, Bin Wang, Conghui He, Jiaqi Wang, Dahua Lin, Weiming Zhang, Nenghai Yu:
OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation. CVPR 2024: 13418-13427
[c14]
  - https://dblp.org/rec/conf/eccv/ZhangZDZW24
Beichen Zhang, Pan Zhang, Xiaoyi Dong, Yuhang Zang, Jiaqi Wang:
Long-CLIP: Unlocking the Long-Text Capability of CLIP. ECCV (51) 2024: 310-325
[c13]
  - https://dblp.org/rec/conf/eccv/ChenLDZHWZL24
Lin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang, Conghui He, Jiaqi Wang, Feng Zhao, Dahua Lin:
ShareGPT4V: Improving Large Multi-modal Models with Better Captions. ECCV (17) 2024: 370-387
[c12]
  - https://dblp.org/rec/conf/igarss/LiuYLLPDLZWL24
Ting Liu, Mengke Yuan, Chaoran Lu, Kaixuan Lu, Baochai Peng, Heyang Duan, Mengya Li, Pan Zhang, Tao Wang, Tongkui Liao:
Water Body Extraction from SAR and Multi-Source Data Using Siamese Network-Based Segmentation. IGARSS 2024: 772-775
[c11]
  - https://dblp.org/rec/conf/mm/DuanYQFCLDZZWL024
Haodong Duan, Junming Yang, Yuxuan Qiao, Xinyu Fang, Lin Chen, Yuan Liu, Xiaoyi Dong, Yuhang Zang, Pan Zhang, Jiaqi Wang, Dahua Lin, Kai Chen:
VLMEvalKit: An Open-Source ToolKit for Evaluating Large Multi-Modality Models. ACM Multimedia 2024: 11198-11201
[c10]
  - https://dblp.org/rec/conf/nips/0016WLD0ZCDB00024
Lin Chen, Xilin Wei, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Lin Bin, Zhenyu Tang, Li Yuan, Yu Qiao, Dahua Lin, Feng Zhao, Jiaqi Wang:
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions. NeurIPS 2024
[c9]
  - https://dblp.org/rec/conf/nips/ChenLDZZCDWQLZ24
Lin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Jiaqi Wang, Yu Qiao, Dahua Lin, Feng Zhao:
Are We on the Right Way for Evaluating Large Vision-Language Models? NeurIPS 2024
[c8]
  - https://dblp.org/rec/conf/nips/DongZZCWOZDZLYG24
Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD. NeurIPS 2024
[c7]
  - https://dblp.org/rec/conf/nips/LiuCZWDZLX0L024
Ziyu Liu, Tao Chu, Yuhang Zang, Xilin Wei, Xiaoyi Dong, Pan Zhang, Zijian Liang, Yuanjun Xiong, Yu Qiao, Dahua Lin, Jiaqi Wang:
MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs. NeurIPS 2024
[c6]
  - https://dblp.org/rec/conf/nips/MaZC0JLLLMDZP0W24
Yubo Ma, Yuhang Zang, Liangyu Chen, Meiqi Chen, Yizhu Jiao, Xinze Li, Xinyuan Lu, Ziyu Liu, Yan Ma, Xiaoyi Dong, Pan Zhang, Liangming Pan, Yu-Gang Jiang, Jiaqi Wang, Yixin Cao, Aixin Sun:
MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations. NeurIPS 2024
[c5]
  - https://dblp.org/rec/conf/nips/QianDZZDLW24
Rui Qian, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Shuangrui Ding, Dahua Lin, Jiaqi Wang:
Streaming Long Video Understanding with Large Language Models. NeurIPS 2024
[i38]
  - https://dblp.org/rec/journals/corr/abs-2401-16420
Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Xilin Wei, Songyang Zhang, Haodong Duan, Maosong Cao, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model. CoRR abs/2401.16420 (2024)
[i37]
  - https://dblp.org/rec/journals/corr/abs-2402-14767
Yuhang Cao, Pan Zhang, Xiaoyi Dong, Dahua Lin, Jiaqi Wang:
DualFocus: Integrating Macro and Micro Perspectives in Multi-modal Large Language Models. CoRR abs/2402.14767 (2024)
[i36]
  - https://dblp.org/rec/journals/corr/abs-2402-17645
Shuangrui Ding, Zihan Liu, Xiaoyi Dong, Pan Zhang, Rui Qian, Conghui He, Dahua Lin, Jiaqi Wang:
SongComposer: A Large Language Model for Lyric and Melody Composition in Song Generation. CoRR abs/2402.17645 (2024)
[i35]
  - https://dblp.org/rec/journals/corr/abs-2403-13805
Ziyu Liu, Zeyi Sun, Yuhang Zang, Wei Li, Pan Zhang, Xiaoyi Dong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition. CoRR abs/2403.13805 (2024)
[i34]
  - https://dblp.org/rec/journals/corr/abs-2403-15378
Beichen Zhang, Pan Zhang, Xiaoyi Dong, Yuhang Zang, Jiaqi Wang:
Long-CLIP: Unlocking the Long-Text Capability of CLIP. CoRR abs/2403.15378 (2024)
[i33]
  - https://dblp.org/rec/journals/corr/abs-2403-20330
Lin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Jiaqi Wang, Yu Qiao, Dahua Lin, Feng Zhao:
Are We on the Right Way for Evaluating Large Vision-Language Models? CoRR abs/2403.20330 (2024)
[i32]
  - https://dblp.org/rec/journals/corr/abs-2404-06512
Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD. CoRR abs/2404.06512 (2024)
[i31]
  - https://dblp.org/rec/journals/corr/abs-2404-13044
Tao Chu, Pan Zhang, Xiaoyi Dong, Yuhang Zang, Qiong Liu, Jiaqi Wang:
Unified Scene Representation and Reconstruction for 3D Large Language Models. CoRR abs/2404.13044 (2024)
[i30]
  - https://dblp.org/rec/journals/corr/abs-2405-11190
Ying Jin, Pengyang Ling, Xiaoyi Dong, Pan Zhang, Jiaqi Wang, Dahua Lin:
ReasonPix2Pix: Instruction Reasoning Dataset for Advanced Image Editing. CoRR abs/2405.11190 (2024)
[i29]
  - https://dblp.org/rec/journals/corr/abs-2405-16009
Rui Qian, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Shuangrui Ding, Dahua Lin, Jiaqi Wang:
Streaming Long Video Understanding with Large Language Models. CoRR abs/2405.16009 (2024)
[i28]
  - https://dblp.org/rec/journals/corr/abs-2406-00093
Zeyi Sun, Tong Wu, Pan Zhang, Yuhang Zang, Xiaoyi Dong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
Bootstrap3D: Improving 3D Content Creation with Synthetic Data. CoRR abs/2406.00093 (2024)
[i27]
  - https://dblp.org/rec/journals/corr/abs-2406-04325
Lin Chen, Xilin Wei, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Bin Lin, Zhenyu Tang, Li Yuan, Yu Qiao, Dahua Lin, Feng Zhao, Jiaqi Wang:
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions. CoRR abs/2406.04325 (2024)
[i26]
  - https://dblp.org/rec/journals/corr/abs-2406-05338
Pengyang Ling, Jiazi Bu, Pan Zhang, Xiaoyi Dong, Yuhang Zang, Tong Wu, Huaian Chen, Jiaqi Wang, Yi Jin:
MotionClone: Training-Free Motion Cloning for Controllable Video Generation. CoRR abs/2406.05338 (2024)
[i25]
  - https://dblp.org/rec/journals/corr/abs-2406-11739
Jiaqi Wang, Yuhang Zang, Pan Zhang, Tao Chu, Yuhang Cao, Zeyi Sun, Ziyu Liu, Xiaoyi Dong, Tong Wu, Dahua Lin, Zeming Chen, Zhi Wang, Lingchen Meng, Wenhao Yao, Jianwei Yang, Sihong Wu, Zhineng Chen, Zuxuan Wu, Yu-Gang Jiang, Peixi Wu, Bosong Chai, Xuan Nie, Longquan Yan, Zeyu Wang, Qifan Zhou, Boning Wang, Jiaqi Huang, Zunnan Xu, Xiu Li, Kehong Yuan, Yanyan Zu, Jiayao Ha, Qiong Gao, Licheng Jiao:
V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results. CoRR abs/2406.11739 (2024)
[i24]
  - https://dblp.org/rec/journals/corr/abs-2406-11833
Ziyu Liu, Tao Chu, Yuhang Zang, Xilin Wei, Xiaoyi Dong, Pan Zhang, Zijian Liang, Yuanjun Xiong, Yu Qiao, Dahua Lin, Jiaqi Wang:
MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs. CoRR abs/2406.11833 (2024)
[i23]
  - https://dblp.org/rec/journals/corr/abs-2407-01523
Yubo Ma, Yuhang Zang, Liangyu Chen, Meiqi Chen, Yizhu Jiao, Xinze Li, Xinyuan Lu, Ziyu Liu, Yan Ma, Xiaoyi Dong, Pan Zhang, Liangming Pan, Yu-Gang Jiang, Jiaqi Wang, Yixin Cao, Aixin Sun:
MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations. CoRR abs/2407.01523 (2024)
[i22]
  - https://dblp.org/rec/journals/corr/abs-2407-03320
Pan Zhang, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Rui Qian, Lin Chen, Qipeng Guo, Haodong Duan, Bin Wang, Linke Ouyang, Songyang Zhang, Wenwei Zhang, Yining Li, Yang Gao, Peng Sun, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Hang Yan, Conghui He, Xingcheng Zhang, Kai Chen, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output. CoRR abs/2407.03320 (2024)
[i21]
  - https://dblp.org/rec/journals/corr/abs-2407-11691
Haodong Duan, Junming Yang, Yuxuan Qiao, Xinyu Fang, Lin Chen, Yuan Liu, Xiaoyi Dong, Yuhang Zang, Pan Zhang, Jiaqi Wang, Dahua Lin, Kai Chen:
VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models. CoRR abs/2407.11691 (2024)
[i20]
  - https://dblp.org/rec/journals/corr/abs-2410-06241
Jiazi Bu, Pengyang Ling, Pan Zhang, Tong Wu, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang:
BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way. CoRR abs/2410.06241 (2024)
[i19]
  - https://dblp.org/rec/journals/corr/abs-2410-07167
Qidong Huang, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Jiaqi Wang, Dahua Lin, Weiming Zhang, Nenghai Yu:
Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate. CoRR abs/2410.07167 (2024)
[i18]
  - https://dblp.org/rec/journals/corr/abs-2410-16268
Shuangrui Ding, Rui Qian, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Yuwei Guo, Dahua Lin, Jiaqi Wang:
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree. CoRR abs/2410.16268 (2024)
[i17]
  - https://dblp.org/rec/journals/corr/abs-2410-17247
Long Xing, Qidong Huang, Xiaoyi Dong, Jiajie Lu, Pan Zhang, Yuhang Zang, Yuhang Cao, Conghui He, Jiaqi Wang, Feng Wu, Dahua Lin:
PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction. CoRR abs/2410.17247 (2024)
[i16]
  - https://dblp.org/rec/journals/corr/abs-2410-17637
Ziyu Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Haodong Duan, Conghui He, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models. CoRR abs/2410.17637 (2024)
[i15]
  - https://dblp.org/rec/journals/corr/abs-2411-06091
Kaixuan Lu, Ruiqian Zhang, Xiao Huang, Yuxing Xie, Xiaogang Ning, Hanchao Zhang, Mengke Yuan, Pan Zhang, Tao Wang, Tongkui Liao:
Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing. CoRR abs/2411.06091 (2024)
[i14]
  - https://dblp.org/rec/journals/corr/abs-2412-01824
Zeyi Sun, Ziyang Chu, Pan Zhang, Tong Wu, Xiaoyi Dong, Yuhang Zang, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models. CoRR abs/2412.01824 (2024)
[i13]
  - https://dblp.org/rec/journals/corr/abs-2412-02044
Pan Zhang, Baochai Peng, Chaoran Lu, Quanjin Huang:
ASANet: Asymmetric Semantic Aligning Network for RGB and SAR image land cover classification. CoRR abs/2412.02044 (2024)
[i12]
  - https://dblp.org/rec/journals/corr/abs-2412-09596
Pan Zhang, Xiaoyi Dong, Yuhang Cao, Yuhang Zang, Rui Qian, Xilin Wei, Lin Chen, Yifei Li, Junbo Niu, Shuangrui Ding, Qipeng Guo, Haodong Duan, Xin Chen, Han Lv, Zheng Nie, Min Zhang, Bin Wang, Wenwei Zhang, Xinyue Zhang, Jiaye Ge, Wei Li, Jingwen Li, Zhongying Tu, Conghui He, Xingcheng Zhang, Kai Chen, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions. CoRR abs/2412.09596 (2024)
2023
[c4]
  - https://dblp.org/rec/conf/iccv/WangZCCZWWHL23
Jiaqi Wang, Pan Zhang, Tao Chu, Yuhang Cao, Yujie Zhou, Tong Wu, Bin Wang, Conghui He, Dahua Lin:
V3Det: Vast Vocabulary Visual Detection Dataset. ICCV 2023: 19787-19797
[c3]
  - https://dblp.org/rec/conf/igarss/LiuPLZYLCZHW23
Guozhang Liu, Baochai Peng, Ting Liu, Pan Zhang, Mengke Yuan, Chaoran Lu, Ningning Cao, Sen Zhang, Simin Huang, Tao Wang:
Fine-Grained Building Roof Instance Segmentation Based on Domain Adapted Pretraining and Composite Dual-Backbone. IGARSS 2023: 670-673
[c2]
  - https://dblp.org/rec/conf/igarss/LuCZLPLYZHW23
Chaoran Lu, Ningning Cao, Pan Zhang, Ting Liu, Baochai Peng, Guozhang Liu, Mengke Yuan, Sen Zhang, Simin Huang, Tao Wang:
Hgdnet: A Height-Hierarchy Guided Dual-Decoder Network for Single View Building Extraction and Height Estimation. IGARSS 2023: 758-761
[c1]
  - https://dblp.org/rec/conf/siggrapha/WuLYZPWL023
Tong Wu, Zhibing Li, Shuai Yang, Pan Zhang, Xingang Pan, Jiaqi Wang, Dahua Lin, Ziwei Liu:
HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image. SIGGRAPH Asia 2023: 53:1-53:10
[i11]
  - https://dblp.org/rec/journals/corr/abs-2304-03752
Jiaqi Wang, Pan Zhang, Tao Chu, Yuhang Cao, Yujie Zhou, Tong Wu, Bin Wang, Conghui He, Dahua Lin:
V3Det: Vast Vocabulary Visual Detection Dataset. CoRR abs/2304.03752 (2023)
[i10]
  - https://dblp.org/rec/journals/corr/abs-2307-04684
Pengyang Ling, Lin Chen, Pan Zhang, Huaian Chen, Yi Jin:
FreeDrag: Point Tracking is Not What You Need for Interactive Point-based Image Editing. CoRR abs/2307.04684 (2023)
[i9]
  - https://dblp.org/rec/journals/corr/abs-2308-05358
Guozhang Liu, Baochai Peng, Ting Liu, Pan Zhang, Mengke Yuan, Chaoran Lu, Ningning Cao, Sen Zhang, Simin Huang, Tao Wang:
Fine-grained building roof instance segmentation based on domain adapted pretraining and composite dual-backbone. CoRR abs/2308.05358 (2023)
[i8]
  - https://dblp.org/rec/journals/corr/abs-2308-05387
Chaoran Lu, Ningning Cao, Pan Zhang, Ting Liu, Baochai Peng, Guozhang Liu, Mengke Yuan, Sen Zhang, Simin Huang, Tao Wang:
HGDNet: A Height-Hierarchy Guided Dual-Decoder Network for Single View Building Extraction and Height Estimation. CoRR abs/2308.05387 (2023)
[i7]
  - https://dblp.org/rec/journals/corr/abs-2308-12714
Bin Wang, Fan Wu, Xiao Han, Jiahui Peng, Huaping Zhong, Pan Zhang, Xiaoyi Dong, Weijia Li, Wei Li, Jiaqi Wang, Conghui He:
VIGC: Visual Instruction Generation and Correction. CoRR abs/2308.12714 (2023)
[i6]
  - https://dblp.org/rec/journals/corr/abs-2308-13566
Zhiyuan Zhao, Linke Ouyang, Bin Wang, Siyuan Huang, Pan Zhang, Xiaoyi Dong, Jiaqi Wang, Conghui He:
MLLM-DataEngine: An Iterative Refinement Approach for MLLM. CoRR abs/2308.13566 (2023)
[i5]
  - https://dblp.org/rec/journals/corr/abs-2309-15112
Pan Zhang, Xiaoyi Dong, Bin Wang, Yuhang Cao, Chao Xu, Linke Ouyang, Zhiyuan Zhao, Shuangrui Ding, Songyang Zhang, Haodong Duan, Wenwei Zhang, Hang Yan, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition. CoRR abs/2309.15112 (2023)
[i4]
  - https://dblp.org/rec/journals/corr/abs-2311-12793
Lin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang, Conghui He, Jiaqi Wang, Feng Zhao, Dahua Lin:
ShareGPT4V: Improving Large Multi-Modal Models with Better Captions. CoRR abs/2311.12793 (2023)
[i3]
  - https://dblp.org/rec/journals/corr/abs-2311-17911
Qidong Huang, Xiaoyi Dong, Pan Zhang, Bin Wang, Conghui He, Jiaqi Wang, Dahua Lin, Weiming Zhang, Nenghai Yu:
OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation. CoRR abs/2311.17911 (2023)
[i2]
  - https://dblp.org/rec/journals/corr/abs-2312-03818
Zeyi Sun, Ye Fang, Tong Wu, Pan Zhang, Yuhang Zang, Shu Kong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want. CoRR abs/2312.03818 (2023)
[i1]
  - https://dblp.org/rec/journals/corr/abs-2312-04543
Tong Wu, Zhibing Li, Shuai Yang, Pan Zhang, Xingang Pan, Jiaqi Wang, Dahua Lin, Ziwei Liu:
HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image. CoRR abs/2312.04543 (2023)
