Bohan Zeng
2020 – today
- 2026
- [c12]Chengyu Shen, Zhen Hao Wong, Runming He, Hao Liang, Meiyi Qiang, Zimo Meng, Zhengyang Zhao, Bohan Zeng, Zhengzhou Zhu, Bin Cui, Wentao Zhang:
Let's Verify Math Questions Step by Step. KDD (1) 2026: 2770-2781 - [i41]Chengzhuo Tong, Mingkun Chang, Shenglong Zhang, Yuran Wang, Cheng Liang, Zhizheng Zhao, Ruichuan An, Bohan Zeng, Yang Shi, Yifan Dai, Ziming Zhao, Guanbin Li, Pengfei Wan, Yuanxing Zhang, Wentao Zhang:
CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation. CoRR abs/2601.10061 (2026) - [i40]Xinlong Chen, Weihong Lin, Jingyun Hua, Linli Yao, Yue Ding, Bozhou Li, Bohan Zeng, Yang Shi, Qiang Liu, Yuanxing Zhang, Pengfei Wan, Liang Wang, Tieniu Tan:
DiaDem: Advancing Dialogue Descriptions in Audiovisual Video Captioning for Multimodal Large Language Models. CoRR abs/2601.19267 (2026) - [i39]Bohan Zeng, Kaixin Zhu, Daili Hua, Bozhou Li, Chengzhuo Tong, Yuran Wang, Xinyi Huang, Yifan Dai, Zixiang Zhang, Yifan Yang, Zhou Liu, Hao Liang, Xiaochen Ma, Ruichuan An, Tianyi Bai, Hongcheng Gao, Junbo Niu, Yang Shi, Xinlong Chen, Yue Ding, Minglei Shi, Kai Zeng, Yiwen Tang, Yuanxing Zhang, Pengfei Wan, Xintao Wang, Wentao Zhang:
Research on World Models Is Not Merely Injecting World Knowledge into Specific Tasks. CoRR abs/2602.01630 (2026) - [i38]Bozhou Li, Yushuo Guan, Haolin Li, Bohan Zeng, Yiyan Ji, Yue Ding, Pengfei Wan, Kun Gai, Yuanxing Zhang, Wentao Zhang:
Semantic Routing: Exploring Multi-Layer LLM Feature Weighting for Diffusion Transformers. CoRR abs/2602.03510 (2026) - [i37]Yue Ding, Yiyan Ji, Jungang Li, Xuyang Liu, Xinlong Chen, Junfei Wu, Bozhou Li, Bohan Zeng, Yang Shi, Yushuo Guan, Yuanxing Zhang, Jiaheng Liu, Qiang Liu, Pengfei Wan, Liang Wang:
OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models. CoRR abs/2602.04804 (2026) - [i36]Junyu Feng, Binxiao Xu, Jiayi Chen, Mengyu Dai, Cenyang Wu, Haodong Li, Bohan Zeng, Yunliu Xie, Hao Liang, Ming Lu, Wentao Zhang:
M2A: Multimodal Memory Agent with Dual-Layer Hybrid Memory for Long-Term Personalized Interactions. CoRR abs/2602.07624 (2026) - [i35]Binxiao Xu, Junyu Feng, Xiaopeng Lin, Haodong Li, Zhiyuan Feng, Bohan Zeng, Shaolin Lu, Ming Lu, Qi She, Wentao Zhang:
AD-MIR: Bridging the Gap from Perception to Persuasion in Advertising Video Understanding via Structured Reasoning. CoRR abs/2602.07625 (2026) - [i34]Haobo Lin, Tianyi Bai, Chen Chen, Jiajun Zhang, Bohan Zeng, Wentao Zhang, Binhang Yuan:
Synthesizing Multimodal Geometry Datasets from Scratch and Enabling Visual Alignment via Plotting Code. CoRR abs/2602.18745 (2026) - [i33]Hao Liang, Zhengyang Zhao, Zhaoyang Han, Meiyi Qiang, Xiaochen Ma, Bohan Zeng, Qifeng Cai, Zhiyu Li, Linpeng Tang, Weinan E, Wentao Zhang:
Towards Next-Generation LLM Training: From the Data-Centric Perspective. CoRR abs/2603.14712 (2026) - [i32]Hao Liang, Zhengyang Zhao, Meiyi Qiang, Mingrui Chen, Lu Ma, Rongyi Yu, Hengyi Feng, Shixuan Sun, Zimo Meng, Xiaochen Ma, Xuanlin Yang, Qifeng Cai, Ruichuan An, Bohan Zeng, Zhen Hao Wong, Chengyu Shen, Runming He, Zhaoyang Han, Yaowei Zheng, Fangcheng Fu, Conghui He, Bin Cui, Zhiyu Li, Weinan E, Wentao Zhang:
DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models. CoRR abs/2603.26164 (2026) - [i31]DataFlow Team, Bohan Zeng, Daili Hua, Kaixin Zhu, Yifan Dai, Bozhou Li, Yuran Wang, Chengzhuo Tong, Yifan Yang, Mingkun Chang, Jianbin Zhao, Zhou Liu, Hao Liang, Xiaochen Ma, Ruichuan An, Junbo Niu, Zimo Meng, Tianyi Bai, Meiyi Qiang, Huanyao Zhang, Zhiyou Xiao, Tianyu Guo, Qinhan Yu, Runhao Zhao, Zhengpin Li, Xinyi Huang, Yisheng Pan, Yiwen Tang, Yang Shi, Yue Ding, Xinlong Chen, Hongcheng Gao, Minglei Shi, Jialong Wu, Zekun Wang, Yuanxing Zhang, Xintao Wang, Pengfei Wan, Yiren Song, Mike Zheng Shou, Wentao Zhang:
OpenWorldLib: A Unified Codebase and Definition of Advanced World Models. CoRR abs/2604.04707 (2026) - 2025
- [j3]Xuhui Liu, Sicheng Gao, Bohan Zeng, Luping Zhang, Tian Wang, Jianzhuang Liu, Baochang Zhang:
Implicit Diffusion Models for Continuous Super-Resolution. Int. J. Comput. Vis. 133(9): 6535-6557 (2025) - [j2]Ling Yang, Yikai Zhao, Zhaochen Yu, Bohan Zeng, Minkai Xu, Shenda Hong, Bin Cui:
Spatio-Temporal Energy-Guided Diffusion Model for Zero-Shot Video Synthesis and Editing. IEEE Trans. Circuits Syst. Video Technol. 35(6): 6034-6046 (2025) - [c11]Bohan Zeng, Shanglin Li, Yutang Feng, Ling Yang, Juan Zhang, Hong Li, Jiaming Liu, Conghui He, Wentao Zhang, Jianzhuang Liu, Baochang Zhang, Shuicheng Yan:
IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image Prompts. ICLR 2025 - [c10]Yang Shi, Jiaheng Liu, Yushuo Guan, Zhenhua Wu, Yuanxing Zhang, Zihao Wang, Weihong Lin, Jingyun Hua, Zekun Wang, Xinlong Chen, Bohan Zeng, Wentao Zhang, Fuzheng Zhang, Wenjing Yang, Di Zhang:
Mavors: Multi-granularity Video Representation for Multimodal Large Language Model. ACM Multimedia 2025: 10994-11003 - [c9]Bohan Zeng, Ling Yang, Jiaming Liu, Minghao Xu, Yuanxing Zhang, Pengfei Wan, Wentao Zhang, Shuicheng Yan:
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing. ACM Multimedia 2025: 12674-12681 - [i30]Hailong Guo, Bohan Zeng, Yiren Song, Wentao Zhang, Chuang Zhang, Jiaming Liu:
Any2AnyTryon: Leveraging Adaptive Position Embeddings for Versatile Virtual Clothing Tasks. CoRR abs/2501.15891 (2025) - [i29]Ling Yang, Kaixin Zhu, Juanxi Tian, Bohan Zeng, Mingbao Lin, Hongjuan Pei, Wentao Zhang, Shuicheng Yan:
WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes. CoRR abs/2503.13435 (2025) - [i28]Yang Shi, Jiaheng Liu, Yushuo Guan, Zhenhua Wu, Yuanxing Zhang, Zihao Wang, Weihong Lin, Jingyun Hua, Zekun Wang, Xinlong Chen, Bohan Zeng, Wentao Zhang, Fuzheng Zhang, Wenjing Yang, Di Zhang:
Mavors: Multi-granularity Video Representation for Multimodal Large Language Model. CoRR abs/2504.10068 (2025) - [i27]Chengyu Shen, Zhen Hao Wong, Runming He, Hao Liang, Meiyi Qiang, Zimo Meng, Zhengyang Zhao, Bohan Zeng, Zhengzhou Zhu, Bin Cui, Wentao Zhang:
Let's Verify Math Questions Step by Step. CoRR abs/2505.13903 (2025) - [i26]Yang Shi, Huanqian Wang, Wulin Xie, Huanyao Zhang, Lijie Zhao, Yifan Zhang, Xinfeng Li, Chaoyou Fu, Zhuoer Wen, Wenting Liu, Zhuoran Zhang, Xinlong Chen, Bohan Zeng, Sihan Yang, Yuanxing Zhang, Pengfei Wan, Haotian Wang, Wenjing Yang:
MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios. CoRR abs/2505.21333 (2025) - [i25]Tianyi Bai, Zengjie Hu, Fupeng Sun, Jiantao Qiu, Yizhen Jiang, Guangxin He, Bohan Zeng, Conghui He, Binhang Yuan, Wentao Zhang:
Multi-Step Visual Reasoning with Visual Tokens Scaling and Verification. CoRR abs/2506.07235 (2025) - [i24]Xinlong Chen, Yuanxing Zhang, Yushuo Guan, Bohan Zeng, Yang Shi, Sihan Yang, Pengfei Wan, Qiang Liu, Liang Wang, Tieniu Tan:
VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks. CoRR abs/2506.09079 (2025) - [i23]Junbo Niu, Yuanhong Zheng, Ziyang Miao, Hejun Dong, Chunjiang Ge, Hao Liang, Ma Lu, Bohan Zeng, Qiahao Zheng, Conghui He, Wentao Zhang:
Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models. CoRR abs/2506.12776 (2025) - [i22]Hao Liang, Ruitao Wu, Bohan Zeng, Junbo Niu, Wentao Zhang, Bin Dong:
Multimodal Reasoning for Science: Technical Report and 1st Place Solution to the ICML 2025 SeePhys Challenge. CoRR abs/2509.06079 (2025) - [i21]Yang Shi, Yuhao Dong, Yue Ding, Yuran Wang, Xuanyu Zhu, Sheng Zhou, Wenting Liu, Haochen Tian, Rundong Wang, Huanqian Wang, Zuyan Liu, Bohan Zeng, Ruizhe Chen, Qixun Wang, Zhuoran Zhang, Xinlong Chen, Chengzhuo Tong, Bozhou Li, Chaoyou Fu, Qiang Liu, Haotian Wang, Wenjing Yang, Yuanxing Zhang, Pengfei Wan, Yifan Zhang, Ziwei Liu:
RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark. CoRR abs/2509.24897 (2025) - [i20]Xukai Wang, Xuanbo Liu, Mingrui Chen, Haitian Zhong, Xuanlin Yang, Bohan Zeng, Jinbo Hu, Hao Liang, Junbo Niu, Xuchen Li, Ruitao Wu, Ruichuan An, Yang Shi, Liu Liu, Xu-Yao Zhang, Qiang Liu, Zhouchen Lin, Wentao Zhang, Bin Dong:
MorphoBench: A Benchmark with Difficulty Adaptive to Model Reasoning. CoRR abs/2510.14265 (2025) - [i19]Kai Zeng, Zhanqian Wu, Kaixin Xiong, Xiaobao Wei, Xiangyu Guo, Zhenxin Zhu, Kalok Ho, Lijun Zhou, Bohan Zeng, Ming Lu, Haiyang Sun, Bing Wang, Guang Chen, Hangjun Ye, Wentao Zhang:
Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks. CoRR abs/2510.19195 (2025) - [i18]Zhou Liu, Zhaoyang Han, Guochen Yan, Hao Liang, Bohan Zeng, Xing Chen, Yuanfeng Song, Wentao Zhang:
DataGovBench: Benchmarking LLM Agents for Real-World Data Governance Workflows. CoRR abs/2512.04416 (2025) - [i17]Daili Hua, Xizhi Wang, Bohan Zeng, Xinyi Huang, Hao Liang, Junbo Niu, Xinlong Chen, Quanqing Xu, Wentao Zhang:
VABench: A Comprehensive Benchmark for Audio-Video Generation. CoRR abs/2512.09299 (2025) - [i16]Tianyu Guo, Hongyu Chen, Hao Liang, Meiyi Qiang, Bohan Zeng, Linzhuang Sun, Bin Cui, Wentao Zhang:
BRACE: A Benchmark for Robust Audio Caption Quality Evaluation. CoRR abs/2512.10403 (2025) - [i15]Yiwen Tang, Zoey Guo, Kaixin Zhu, Ray Zhang, Qizhi Chen, Dongzhi Jiang, Junli Liu, Bohan Zeng, Haoming Song, Delin Qu, Tianyi Bai, Dan Xu, Wentao Zhang, Bin Zhao:
Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation. CoRR abs/2512.10949 (2025) - [i14]Minglei Shi, Haolin Wang, Borui Zhang, Wenzhao Zheng, Bohan Zeng, Ziyang Yuan, Xiaoshi Wu, Yuanxing Zhang, Huan Yang, Xintao Wang, Pengfei Wan, Kun Gai, Jie Zhou, Jiwen Lu:
SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder. CoRR abs/2512.11749 (2025) - [i13]Yuran Wang, Bohan Zeng, Chengzhuo Tong, Wenxuan Liu, Yang Shi, Xiaochen Ma, Hao Liang, Yuanxing Zhang, Wentao Zhang:
Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling. CoRR abs/2512.12675 (2025) - [i12]Hao Liang, Xiaochen Ma, Zhou Liu, Zhen Hao Wong, Zhengyang Zhao, Zimo Meng, Runming He, Chengyu Shen, Qifeng Cai, Zhaoyang Han, Meiyi Qiang, Yalin Feng, Tianyi Bai, Zewei Pan, Ziyi Guo, Yizhen Jiang, Jingwen Deng, Qijie You, Peichao Lai, Tianyu Guo, Chi Hsu Tsai, Hengyi Feng, Rui Hu, Wenkai Yu, Junbo Niu, Bohan Zeng, Ruichuan An, Lu Ma, Jihao Huang, Yaowei Zheng, Conghui He, Linpeng Tang, Bin Cui, Weinan E, Wentao Zhang:
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI. CoRR abs/2512.16676 (2025) - 2024
- [j1]Bohan Zeng, Shan Gao, Yuelei Xu, Zhaoxiang Zhang, Fan Li, Chenghang Wang:
Detection of Military Targets on Ground and Sea by UAVs with Low-Altitude Oblique Perspective. Remote. Sens. 16(7): 1288 (2024) - [c8]Bohan Zeng, Shanglin Li, Xuhui Liu, Sicheng Gao, Xiaolong Jiang, Xu Tang, Yao Hu, Jianzhuang Liu, Baochang Zhang:
Controllable Mind Visual Diffusion Model. AAAI 2024: 6935-6943 - [c7]Xuhui Liu, Bohan Zeng, Sicheng Gao, Shanglin Li, Yutang Feng, Hong Li, Boyu Liu, Jianzhuang Liu, Baochang Zhang:
LaDiffGAN: Training GANs with Diffusion Supervision in Latent Spaces. CVPR Workshops 2024: 1115-1125 - [c6]Shanglin Li, Bohan Zeng, Yutang Feng, Sicheng Gao, Xiuhui Liu, Jiaming Liu, Lin Li, Xu Tang, Yao Hu, Jianzhuang Liu, Baochang Zhang:
ZONE: Zero-Shot Instruction-Guided Local Editing. CVPR 2024: 6254-6263 - [c5]Hong Li, Yutang Feng, Song Xue, Xuhui Liu, Bohan Zeng, Shanglin Li, Boyu Liu, Jianzhuang Liu, Shumin Han, Baochang Zhang:
UV-IDM: Identity-Conditioned Latent Diffusion Model for Face UV-Texture Generation. CVPR 2024: 10585-10595 - [i11]Ling Yang, Bohan Zeng, Jiaming Liu, Hong Li, Minghao Xu, Wentao Zhang, Shuicheng Yan:
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing. CoRR abs/2405.14785 (2024) - [i10]Bohan Zeng, Ling Yang, Siyu Li, Jiaming Liu, Zixiang Zhang, Juanxi Tian, Kaixin Zhu, Yongzhen Guo, Fu-Yun Wang, Minkai Xu, Stefano Ermon, Wentao Zhang:
Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis. CoRR abs/2410.07155 (2024) - [i9]Ling Yang, Zixiang Zhang, Junlin Han, Bohan Zeng, Runjia Li, Philip Torr, Wentao Zhang:
Semantic Score Distillation Sampling for Compositional Text-to-3D Generation. CoRR abs/2410.09009 (2024) - 2023
- [c4]Bohan Zeng, Xuhui Liu, Sicheng Gao, Boyu Liu, Hong Li, Jianzhuang Liu, Baochang Zhang:
Face Animation with an Attribute-Guided Diffusion Model. CVPR Workshops 2023: 628-637 - [c3]Sicheng Gao, Xuhui Liu, Bohan Zeng, Sheng Xu, Yanjing Li, Xiaoyan Luo, Jianzhuang Liu, Xiantong Zhen, Baochang Zhang:
Implicit Diffusion Models for Continuous Super-Resolution. CVPR 2023: 10021-10030 - [i8]Sicheng Gao, Xuhui Liu, Bohan Zeng, Sheng Xu, Yanjing Li, Xiaoyan Luo, Jianzhuang Liu, Xiantong Zhen, Baochang Zhang:
Implicit Diffusion Models for Continuous Super-Resolution. CoRR abs/2303.16491 (2023) - [i7]Bohan Zeng, Xuhui Liu, Sicheng Gao, Boyu Liu, Hong Li, Jianzhuang Liu, Baochang Zhang:
Face Animation with an Attribute-Guided Diffusion Model. CoRR abs/2304.03199 (2023) - [i6]Bohan Zeng, Shanglin Li, Xuhui Liu, Sicheng Gao, Xiaolong Jiang, Xu Tang, Yao Hu, Jianzhuang Liu, Baochang Zhang:
Controllable Mind Visual Diffusion Model. CoRR abs/2305.10135 (2023) - [i5]Bohan Zeng, Shanglin Li, Yutang Feng, Hong Li, Sicheng Gao, Jiaming Liu, Huaxia Li, Xu Tang, Jianzhuang Liu, Baochang Zhang:
IPDreamer: Appearance-Controllable 3D Object Generation with Image Prompts. CoRR abs/2310.05375 (2023) - [i4]Shanglin Li, Bohan Zeng, Yutang Feng, Sicheng Gao, Xuhui Liu, Jiaming Liu, Li Lin, Xu Tang, Yao Hu, Jianzhuang Liu, Baochang Zhang:
ZONE: Zero-Shot Instruction-Guided Local Editing. CoRR abs/2312.16794 (2023) - 2022
- [c2]Sheng Xu, Yanjing Li, Bohan Zeng, Teli Ma, Baochang Zhang, Xianbin Cao, Peng Gao, Jinhu Lü:
IDa-Det: An Information Discrepancy-Aware Distillation for 1-Bit Detectors. ECCV (11) 2022: 346-361 - [c1]Bohan Zeng, Boyu Liu, Hong Li, Xuhui Liu, Jianzhuang Liu, Dapeng Chen, Wei Peng, Baochang Zhang:
FNeVR: Neural Volume Rendering for Face Animation. NeurIPS 2022 - [i3]Sheng Xu, Yanjing Li, Teli Ma, Bohan Zeng, Baochang Zhang, Peng Gao, Jinhu Lv:
TerViT: An Efficient Ternary Vision Transformer. CoRR abs/2201.08050 (2022) - [i2]Bohan Zeng, Boyu Liu, Hong Li, Xuhui Liu, Jianzhuang Liu, Dapeng Chen, Wei Peng, Baochang Zhang:
FNeVR: Neural Volume Rendering for Face Animation. CoRR abs/2209.10340 (2022) - [i1]Sheng Xu, Yanjing Li, Bohan Zeng, Teli Ma, Baochang Zhang, Xianbin Cao, Peng Gao, Jinhu Lv:
IDa-Det: An Information Discrepancy-aware Distillation for 1-bit Detectors. CoRR abs/2210.03477 (2022)
last updated on 2026-05-10 00:58 CEST by the dblp team
all metadata released as open data under CC0 1.0 license