Juncheng Li 0006
Person information
- affiliation: Zhejiang University, Hangzhou, China
Other persons with the same name
- Juncheng Li: disambiguation page
- Juncheng Li 0001 (aka: Juncheng B. Li, Juncheng Billy Li, Billy Li): Carnegie Mellon University, PA, USA (and 2 more)
- Juncheng Li 0002: Nanyang Technological University, Singapore
- Juncheng Li 0003: East China Normal University, Shanghai, China
- Juncheng Li 0004: Hunan University of Humanities, Science and Technology, Loudi, China
- Juncheng Li 0005: Lanzhou Jiaotong University, China
- Juncheng Li 0007: Guangdong University of Technology, School of Automation, China
- Juncheng Li 0008: Chinese University of Hong Kong, Department of Mathematics, Hong Kong
- Juncheng Li 0009: Lancaster University, Department of Management Science, UK
- Juncheng Li 0010: Purdue University, School of Mechanical Engineering, West Lafayette, IN, USA
- Juncheng Li 0011: South China University of Technology, School of Intelligent Engineering, Guangzhou, China
- Juncheng Li 0012: Tsinghua University, IIIS, Beijing, China
- Juncheng Li 0013: Shanghai University, School of Communication and Information Engineering, China
- Juncheng Li 0014: National University of Singapore, Singapore
- Juncheng Li 0015: University of Edinburgh, Institute for Digital Communications, School of Engineering, UK
- Juncheng Li 0016: Huazhong University of Science and Technology, School of Civil and Hydraulic Engineering, Wuhan, China
- Juncheng Li 0017: Shenzhen Geological Construction Engineering Company, China
- Juncheng Li 0018: Fudan University, Shanghai, China
2020 – today
- 2026
- [j6] Haoyu Zheng, Qifan Yu, Wenqiao Zhang, Hongyang He, Juncheng Li, Zheqi Lv, Dongping Zhang, Siliang Tang, Yueting Zhuang:
MAKIMA: Tuning-free multi-attribute open-domain video editing via mask-guided attention modulation. Expert Syst. Appl. 320: 132107 (2026)
- [j5] Juncheng Li, Minghe Gao, Siliang Tang, Longhui Wei, Jun Xiao, Fei Wu, Richang Hong, Meng Wang, Qi Tian:
Structure-Induced Gradient Regulation for Generalizable Vision-Language Models. IEEE Trans. Pattern Anal. Mach. Intell. 48(1): 219-235 (2026)
- [c49] Zhenkui Zhang, Wendong Bu, Kaihang Pan, Bingchen Miao, Wenqiao Zhang, Guoming Wang, Wei Ji, Rui Tang, Juncheng Li, Siliang Tang:
Evolving Generalist Virtual Agents with Generative and Associative Memory. AAAI 2026: 13006-13014
- [c48] Zhaoyu Fan, Kaihang Pan, Mingze Zhou, Bosheng Qin, Juncheng Li, Shengyu Zhang, Wenqiao Zhang, Siliang Tang, Fei Wu, Yueting Zhuang:
Towards Meta-Cognitive Knowledge Editing for Multimodal LLMs. WWW 2026: 4137-4148
- [i70] Keyu Wang, Bingchen Miao, Wendong Bu, Yu Wu, Juncheng Li, Shengyu Zhang, Wenqiao Zhang, Siliang Tang, Jun Xiao, Yueting Zhuang:
CORE: Code-based Inverse Self-Training Framework with Graph Expansion for Virtual Agents. CoRR abs/2601.02201 (2026)
- [i69] Zhouzhou Shen, Xueyu Hu, Xiyun Li, Tianqing Fang, Juncheng Li, Shengyu Zhang:
World-Model-Augmented Web Agents with Action Correction. CoRR abs/2602.15384 (2026)
- 2025
- [j4] Bosheng Qin, Juncheng Li, Siliang Tang, Yueting Zhuang:
DBA: Efficient Transformer With Dynamic Bilinear Low-Rank Attention. IEEE Trans. Neural Networks Learn. Syst. 36(8): 14493-14507 (2025)
- [j3] Wei Ji, Li Li, Hao Fei, Xiangyan Liu, Xun Yang, Juncheng Li, Roger Zimmermann:
Toward Complex-query Referring Image Segmentation: A Novel Benchmark. ACM Trans. Multim. Comput. Commun. Appl. 21(1): 40:1-40:18 (2025)
- [c47] Tianwei Lin, Jiang Liu, Wenqiao Zhang, Yang Dai, Haoyuan Li, Zhelun Yu, Wanggui He, Juncheng Li, Jiannan Guo, Hao Jiang, Siliang Tang, Yueting Zhuang:
TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition. ACL (1) 2025: 13622-13637
- [c46] Chenhan Fu, Guoming Wang, Juncheng Li, Rongxing Lu, Siliang Tang:
Choice is what matters after Attention. AISTATS 2025: 262-270
- [c45] Chenhan Fu, Guoming Wang, Juncheng Li, Wenqiao Zhang, Rongxing Lu, Siliang Tang:
ITERATE: Image-Text Enhancement, Retrieval, and Alignment for Transmodal Evolution with LLMs. COLING 2025: 1365-1376
- [c44] Haiyi Qiu, Minghe Gao, Long Qian, Kaihang Pan, Qifan Yu, Juncheng Li, Wenjie Wang, Siliang Tang, Yueting Zhuang, Tat-Seng Chua:
STEP: Enhancing Video-LLMs' Compositional Reasoning by Spatio-Temporal Graph-guided Self-Training. CVPR 2025: 3284-3294
- [c43] Leigang Qu, Haochuan Li, Wenjie Wang, Xiang Liu, Juncheng Li, Liqiang Nie, Tat-Seng Chua:
SILMM: Self-Improving Large Multimodal Models for Compositional Text-to-Image Generation. CVPR 2025: 18497-18508
- [c42] Shengqiong Wu, Hao Fei, Jingkang Yang, Xiangtai Li, Juncheng Li, Hanwang Zhang, Tat-Seng Chua:
Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene. CVPR 2025: 24539-24549
- [c41] Qifan Yu, Wei Chow, Zhongqi Yue, Kaihang Pan, Yang Wu, Xiaoyang Wan, Juncheng Li, Siliang Tang, Hanwang Zhang, Yueting Zhuang:
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea. CVPR 2025: 26125-26135
- [c40] Kaihang Pan, Wang Lin, Zhongqi Yue, Tenglong Ao, Liyu Jia, Wei Zhao, Juncheng Li, Siliang Tang, Hanwang Zhang:
Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens. CVPR 2025: 26136-26146
- [c39] Hao Fei, Yuan Zhou, Juncheng Li, Xiangtai Li, Qingshan Xu, Bobo Li, Shengqiong Wu, Yaoting Wang, Junbao Zhou, Jiahao Meng, Qingyu Shi, Zhiyuan Zhou, Liangtao Shi, Minghe Gao, Daoan Zhang, Zhiqi Ge, Siliang Tang, Kaihang Pan, Yaobo Ye, Haobo Yuan, Tao Zhang, Weiming Wu, Tianjie Ju, Zixiang Meng, Shilin Xu, Liyu Jia, Wentao Hu, Meng Luo, Jiebo Luo, Tat-Seng Chua, Shuicheng Yan, Hanwang Zhang:
On Path to Multimodal Generalist: General-Level and General-Bench. ICML 2025
- [c38] Wendong Bu, Yang Wu, Qifan Yu, Minghe Gao, Bingchen Miao, Zhenkui Zhang, Kaihang Pan, Liyunfei, Mengze Li, Wei Ji, Juncheng Li, Siliang Tang, Yueting Zhuang:
What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities. ICML 2025
- [c37] Bingchen Miao, Yang Wu, Minghe Gao, Qifan Yu, Wendong Bu, Wenqiao Zhang, Yunfei Li, Siliang Tang, Tat-Seng Chua, Juncheng Li:
Boosting Virtual Agent Learning and Reasoning: A Step-Wise, Multi-Dimensional, and Generalist Reward Model with Benchmark. ICML 2025
- [c36] Hanghui Guo, Weijie Shi, Mengze Li, Juncheng Li, Hao Chen, Yue Cui, Jiajie Xu, Jia Zhu, Jiawei Shen, Zhangze Chen, Sirui Han:
Consistent and Invariant Generalization Learning for Short-video Misinformation Detection. ACM Multimedia 2025: 2254-2263
- [c35] Sijing Li, Tianwei Lin, Lingshuai Lin, Wenqiao Zhang, Jiang Liu, Xiaoda Yang, Juncheng Li, Yucheng He, Xiaohui Song, Jun Xiao, Yueting Zhuang, Beng Chin Ooi:
EyecareGPT: Boosting Comprehensive Ophthalmology Understanding with Tailored Dataset, Benchmark and Model. ACM Multimedia 2025: 3893-3902
- [c34] Bingchen Miao, Wenqiao Zhang, Juncheng Li, Wangyu Wu, Siliang Tang, Zhaocheng Li, Haochen Shi, Jun Xiao, Yueting Zhuang:
Robust Modality-Incomplete Anomaly Detection: A Modality-Instructive Framework with Benchmark. ACM Multimedia 2025: 7317-7326
- [c33] Yurun Chen, Xueyu Hu, Keting Yin, Juncheng Li, Shengyu Zhang:
Evaluating the Robustness of Multimodal Agents Against Active Environmental Injection Attacks. ACM Multimedia 2025: 11648-11656
- [c32] Bobo Li, Yuheng Wang, Hao Fei, Juncheng Li, Wei Ji, Mong-Li Lee, Wynne Hsu:
FormFactory: An Interactive Benchmarking Suite for Multimodal Form-Filling Agents. ACM Multimedia 2025: 13273-13280
- [c31] Xiangnan Chen, Yuancheng Fang, Juncheng Li, Qian Xiao, Jun Lin, Siliang Tang, Yueting Zhuang:
Chart-HQA: A Benchmark for Hypothetical Question Answering in Charts. ACM Multimedia 2025: 13297-13303
- [i68] Yurun Chen, Xueyu Hu, Keting Yin, Juncheng Li, Shengyu Zhang:
AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks. CoRR abs/2502.13053 (2025)
- [i67] Xiangnan Chen, Yuancheng Fang, Qian Xiao, Juncheng Li, Jun Lin, Siliang Tang, Yi Yang, Yueting Zhuang:
Chart-HQA: A Benchmark for Hypothetical Question Answering in Charts. CoRR abs/2503.04095 (2025)
- [i66] Aoxiong Yin, Kai Shen, Yichong Leng, Xu Tan, Xinyu Zhou, Juncheng Li, Siliang Tang:
The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation. CoRR abs/2503.04606 (2025)
- [i65] Haoyu Zheng, Qifan Yu, Binghe Yu, Yang Dai, Wenqiao Zhang, Juncheng Li, Siliang Tang, Yueting Zhuang:
SOYO: A Tuning-Free Approach for Video Style Morphing via Style-Adaptive Interpolation in Diffusion Models. CoRR abs/2503.06998 (2025)
- [i64] Shengqiong Wu, Hao Fei, Jingkang Yang, Xiangtai Li, Juncheng Li, Hanwang Zhang, Tat-Seng Chua:
Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene. CoRR abs/2503.15019 (2025)
- [i63] Bingchen Miao, Yang Wu, Minghe Gao, Qifan Yu, Wendong Bu, Wenqiao Zhang, Yunfei Li, Siliang Tang, Tat-Seng Chua, Juncheng Li:
Boosting Virtual Agent Learning and Reasoning: A Step-wise, Multi-dimensional, and Generalist Reward Model with Benchmark. CoRR abs/2503.18665 (2025)
- [i62] Minghe Gao, Xuqi Liu, Zhongqi Yue, Yang Wu, Shuang Chen, Juncheng Li, Siliang Tang, Fei Wu, Tat-Seng Chua, Yueting Zhuang:
Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program. CoRR abs/2504.06606 (2025)
- [i61] Sijing Li, Tianwei Lin, Lingshuai Lin, Wenqiao Zhang, Jiang Liu, Xiaoda Yang, Juncheng Li, Yucheng He, Xiaohui Song, Jun Xiao, Yueting Zhuang, Beng Chin Ooi:
EyecareGPT: Boosting Comprehensive Ophthalmology Understanding with Tailored Dataset, Benchmark and Model. CoRR abs/2504.13650 (2025)
- [i60] Kaihang Pan, Wang Lin, Zhongqi Yue, Tenglong Ao, Liyu Jia, Wei Zhao, Juncheng Li, Siliang Tang, Hanwang Zhang:
Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens. CoRR abs/2504.14666 (2025)
- [i59] Hao Fei, Yuan Zhou, Juncheng Li, Xiangtai Li, Qingshan Xu, Bobo Li, Shengqiong Wu, Yaoting Wang, Junbao Zhou, Jiahao Meng, Qingyu Shi, Zhiyuan Zhou, Liangtao Shi, Minghe Gao, Daoan Zhang, Zhiqi Ge, Weiming Wu, Siliang Tang, Kaihang Pan, Yaobo Ye, Haobo Yuan, Tao Zhang, Tianjie Ju, Zixiang Meng, Shilin Xu, Liyu Jia, Wentao Hu, Meng Luo, Jiebo Luo, Tat-Seng Chua, Shuicheng Yan, Hanwang Zhang:
On Path to Multimodal Generalist: General-Level and General-Bench. CoRR abs/2505.04620 (2025)
- [i58] Kaihang Pan, Yang Wu, Wendong Bu, Kai Shen, Juncheng Li, Yingting Wang, Yunfei Li, Siliang Tang, Jun Xiao, Fei Wu, Hang Zhao, Yueting Zhuang:
Unlocking Aha Moments via Reinforcement Learning: Advancing Collaborative Visual Comprehension and Generation. CoRR abs/2506.01480 (2025)
- [i57] Bobo Li, Yuheng Wang, Hao Fei, Juncheng Li, Wei Ji, Mong-Li Lee, Wynne Hsu:
FormFactory: An Interactive Benchmarking Suite for Multimodal Form-Filling Agents. CoRR abs/2506.01520 (2025)
- [i56] Wei Chow, Yuan Gao, Linfeng Li, Xian Wang, Qi Xu, Hang Song, Lingdong Kong, Ran Zhou, Yi Zeng, Yidong Cai, Botian Jiang, Shilin Xu, Jiajun Zhang, Minghui Qiu, Xiangtai Li, Tianshu Yang, Siliang Tang, Juncheng Li:
MERIT: Multilingual Semantic Retrieval with Interleaved Multi-Condition Query. CoRR abs/2506.03144 (2025)
- [i55] Kaihang Pan, Wendong Bu, Yuruo Wu, Yang Wu, Kai Shen, Yunfei Li, Hang Zhao, Juncheng Li, Siliang Tang, Yueting Zhuang:
FocusDiff: Advancing Fine-Grained Text-Image Alignment for Autoregressive Visual Generation through RL. CoRR abs/2506.05501 (2025)
- [i54] Jie Cao, Tianwei Lin, Hongyang He, Rolan Yan, Wenqiao Zhang, Juncheng Li, Dongping Zhang, Siliang Tang, Yueting Zhuang:
MoA: Heterogeneous Mixture of Adapters for Parameter-Efficient Fine-Tuning of Large Language Models. CoRR abs/2506.05928 (2025)
- [i53] Wendong Bu, Yang Wu, Qifan Yu, Minghe Gao, Bingchen Miao, Zhenkui Zhang, Kaihang Pan, Yunfei Li, Mengze Li, Wei Ji, Juncheng Li, Siliang Tang, Yueting Zhuang:
What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities. CoRR abs/2506.08933 (2025)
- [i52] Jisheng Dang, Wu Xudong, Bimei Wang, Lv Ning, Chen Jiayu, Jingwen Zhao, Yichu Liu, Jizhao Liu, Juncheng Li, Teng Wang:
Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and Grounder. CoRR abs/2506.22880 (2025)
- [i51] Hanghui Guo, Weijie Shi, Mengze Li, Juncheng Li, Hao Chen, Yue Cui, Jiajie Xu, Jia Zhu, Jiawei Shen, Zhangze Chen, Sirui Han:
Consistent and Invariant Generalization Learning for Short-video Misinformation Detection. CoRR abs/2507.04061 (2025)
- [i50] Yurun Chen, Xavier Hu, Yuhan Liu, Keting Yin, Juncheng Li, Zhuosheng Zhang, Shengyu Zhang:
HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Optimization. CoRR abs/2508.04010 (2025)
- [i49] Zhaoyu Fan, Kaihang Pan, Mingze Zhou, Bosheng Qin, Juncheng Li, Shengyu Zhang, Wenqiao Zhang, Siliang Tang, Fei Wu, Yueting Zhuang:
Towards Meta-Cognitive Knowledge Editing for Multimodal LLMs. CoRR abs/2509.05714 (2025)
- [i48] Haoyu Zheng, Zhuonan Wang, Yuqian Yuan, Tianwei Lin, Wenqiao Zhang, Zheqi Lv, Juncheng Li, Siliang Tang, Yueting Zhuang, Hongyang He:
Fast Thinking for Large Language Models. CoRR abs/2509.23633 (2025)
- [i47] Bingchen Miao, Rong Wei, Zhiqi Ge, Xiaoquan sun, Shiqi Gao, Jingzhe Zhu, Renhan Wang, Siliang Tang, Jun Xiao, Rui Tang, Juncheng Li:
Towards Physically Executable 3D Gaussian for Embodied Navigation. CoRR abs/2510.21307 (2025)
- [i46] Wei Chow, Jiachun Pan, Yongyuan Liang, Mingze Zhou, Xue Song, Liyu Jia, Saining Zhang, Siliang Tang, Juncheng Li, Fengda Zhang, Weijia Wu, Hanwang Zhang, Tat-Seng Chua:
WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation. CoRR abs/2511.11434 (2025)
- [i45] Minghe Gao, Juncheng Li, Yuze Lin, Xuqi Liu, Jiaming Ji, Xiaoran Pan, Zihan Xu, Xian Li, Mingjie Li, Wei Ji, Rong Wei, Rui Tang, Qizhou Wang, Kai Shen, Jun Xiao, Qi Wu, Siliang Tang, Yueting Zhuang:
Arcadia: Toward a Full-Lifecycle Framework for Embodied Lifelong Learning. CoRR abs/2512.00076 (2025)
- [i44] Kaihang Pan, Weile Chen, Haiyi Qiu, Qifan Yu, Wendong Bu, Zehan Wang, Yun Zhu, Juncheng Li, Siliang Tang:
WiseEdit: Benchmarking Cognition- and Creativity-Informed Image Editing. CoRR abs/2512.00387 (2025)
- [i43] Wendong Bu, Kaihang Pan, Yuze Lin, Jiacheng Li, Kai Shen, Wenqiao Zhang, Juncheng Li, Jun Xiao, Siliang Tang:
OmniMoGen: Unifying Human Motion Generation via Learning from Interleaved Text-Motion Instructions. CoRR abs/2512.19159 (2025)
- [i42] Binhe Yu, Zhen Wang, Kexin Li, Yuqian Yuan, Wenqiao Zhang, Long Chen, Juncheng Li, Jun Xiao, Yueting Zhuang:
AnyMS: Bottom-up Attention Decoupling for Layout-guided and Training-free Multi-subject Customization. CoRR abs/2512.23537 (2025)
- 2024
- [j2] Jianhao Guo, Siliang Tang, Juncheng Li, Kaihang Pan, Lingfei Wu:
RustGraph: Robust Anomaly Detection in Dynamic Graphs by Jointly Learning Structural-Temporal Dependency. IEEE Trans. Knowl. Data Eng. 36(7): 3472-3485 (2024)
- [c30] Qifan Yu, Juncheng Li, Longhui Wei, Liang Pang, Wentao Ye, Bosheng Qin, Siliang Tang, Qi Tian, Yueting Zhuang:
HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data. CVPR 2024: 12944-12953
- [c29] Xinyi Jiang, Guoming Wang, Junhao Guo, Juncheng Li, Wenqiao Zhang, Rongxing Lu, Siliang Tang:
DIEM: Decomposition-Integration Enhancing Multimodal Insights. CVPR 2024: 27294-27303
- [c28] Juncheng Li, Kaihang Pan, Zhiqi Ge, Minghe Gao, Wei Ji, Wenqiao Zhang, Tat-Seng Chua, Siliang Tang, Hanwang Zhang, Yueting Zhuang:
Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions. ICLR 2024
- [c27] Bosheng Qin, Juncheng Li, Siliang Tang, Tat-Seng Chua, Yueting Zhuang:
InstructVid2Vid: Controllable Video Editing with Natural Language Instructions. ICME 2024: 1-6
- [c26] Kaihang Pan, Siliang Tang, Juncheng Li, Zhaoyu Fan, Wei Chow, Shuicheng Yan, Tat-Seng Chua, Yueting Zhuang, Hanwang Zhang:
Auto-Encoding Morph-Tokens for Multimodal LLM. ICML 2024: 39308-39323
- [c25] Long Qian, Juncheng Li, Yu Wu, Yaobo Ye, Hao Fei, Tat-Seng Chua, Yueting Zhuang, Siliang Tang:
Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning. ICML 2024: 41340-41356
- [c24] Minghe Gao, Shuang Chen, Liang Pang, Yuan Yao, Jisheng Dang, Wenqiao Zhang, Juncheng Li, Siliang Tang, Yueting Zhuang, Tat-Seng Chua:
Fact: Teaching MLLMs with Faithful, Concise and Transferable Rationales. ACM Multimedia 2024: 846-855
- [c23] Zhiqi Ge, Hongzhe Huang, Mingze Zhou, Juncheng Li, Guoming Wang, Siliang Tang, Yueting Zhuang:
WorldGPT: Empowering LLM as Multimodal World Model. ACM Multimedia 2024: 7346-7355
- [c22] Minghe Gao, Juncheng Li, Hao Fei, Liang Pang, Wei Ji, Guoming Wang, Zheqi Lv, Wenqiao Zhang, Siliang Tang, Yueting Zhuang:
De-fine: Decomposing and Refining Visual Programs with Auto-Feedback. ACM Multimedia 2024: 7649-7657
- [c21] Zhiqi Ge, Juncheng Li, Qifan Yu, Wei Zhou, Siliang Tang, Yueting Zhuang:
DEMON24: ACM MM24 Demonstrative Instruction Following Challenge. ACM Multimedia 2024: 11426-11428
- [c20] Wei Ji, Hao Fei, Yinwei Wei, Zhedong Zheng, Juncheng Li, Long Chen, Lizi Liao, Yueting Zhuang, Roger Zimmermann:
The 2nd International Workshop on Deep Multi-modal Generation and Retrieval. MMGR@MM 2024: 1-6
- [c19] Wei Chow, Juncheng Li, Qifan Yu, Kaihang Pan, Hao Fei, Zhiqi Ge, Shuai Yang, Siliang Tang, Hanwang Zhang, Qianru Sun:
Unified Generative and Discriminative Training for Multi-modal Large Language Models. NeurIPS 2024
- [c18] Kaihang Pan, Zhaoyu Fan, Juncheng Li, Qifan Yu, Hao Fei, Siliang Tang, Richang Hong, Hanwang Zhang, Qianru Sun:
Towards Unified Multimodal Editing with Enhanced Knowledge Collaboration. NeurIPS 2024
- [c17] Kaihang Pan, Juncheng Li, Wenjie Wang, Hao Fei, Hongye Song, Wei Ji, Jun Lin, Xiaozhong Liu, Tat-Seng Chua, Siliang Tang:
I3: Intent-Introspective Retrieval Conditioned on Instructions. SIGIR 2024: 1839-1849
- [i41] Long Qian, Juncheng Li, Yu Wu, Yaobo Ye, Hao Fei, Tat-Seng Chua, Yueting Zhuang, Siliang Tang:
Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning. CoRR abs/2402.11435 (2024)
- [i40] Wenqiao Zhang, Tianwei Lin, Jiang Liu, Fangxun Shu, Haoyuan Li, Lei Zhang, Wanggui He, Hao Zhou, Zheqi Lv, Hao Jiang, Juncheng Li, Siliang Tang, Yueting Zhuang:
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models. CoRR abs/2403.13447 (2024)
- [i39] Minghe Gao, Shuang Chen, Liang Pang, Yuan Yao, Jisheng Dang, Wenqiao Zhang, Juncheng Li, Siliang Tang, Yueting Zhuang, Tat-Seng Chua:
Fact: Teaching MLLMs with Faithful, Concise and Transferable Rationales. CoRR abs/2404.11129 (2024)
- [i38] Haoyu Zheng, Wenqiao Zhang, Yaoke Wang, Hao Zhou, Jiang Liu, Juncheng Li, Zheqi Lv, Siliang Tang, Yueting Zhuang:
LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation. CoRR abs/2404.13558 (2024)
- [i37] Kaihang Pan, Siliang Tang, Juncheng Li, Zhaoyu Fan, Wei Chow, Shuicheng Yan, Tat-Seng Chua, Yueting Zhuang, Hanwang Zhang:
Auto-Encoding Morph-Tokens for Multimodal LLM. CoRR abs/2405.01926 (2024)
- [i36] Tianwei Lin, Jiang Liu, Wenqiao Zhang, Zhaocheng Li, Yang Dai, Haoyuan Li, Zhelun Yu, Wanggui He, Juncheng Li, Hao Jiang, Siliang Tang, Yueting Zhuang:
TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition. CoRR abs/2408.09856 (2024)
- [i35] Hongzhe Huang, Zhewen Yu, Jiang Liu, Li Cai, Dian Jiao, Wenqiao Zhang, Siliang Tang, Juncheng Li, Hao Jiang, Haoyuan Li, Yueting Zhuang:
Align2LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation. CoRR abs/2409.18541 (2024)
- [i34] Kaihang Pan, Zhaoyu Fan, Juncheng Li, Qifan Yu, Hao Fei, Siliang Tang, Richang Hong, Hanwang Zhang, Qianru Sun:
Towards Unified Multimodal Editing with Enhanced Knowledge Collaboration. CoRR abs/2409.19872 (2024)
- [i33] Bingchen Miao, Wenqiao Zhang, Juncheng Li, Siliang Tang, Zhaocheng Li, Haochen Shi, Jun Xiao, Yueting Zhuang:
RADAR: Robust Two-stage Modality-incomplete Industrial Anomaly Detection. CoRR abs/2410.01737 (2024)
- [i32] Wei Chow, Juncheng Li, Qifan Yu, Kaihang Pan, Hao Fei, Zhiqi Ge, Shuai Yang, Siliang Tang, Hanwang Zhang, Qianru Sun:
Unified Generative and Discriminative Training for Multi-modal Large Language Models. CoRR abs/2411.00304 (2024)
- [i31] Minghe Gao, Wendong Bu, Bingchen Miao, Yang Wu, Yunfei Li, Juncheng Li, Siliang Tang, Qi Wu, Yueting Zhuang, Meng Wang:
Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms. CoRR abs/2411.10943 (2024)
- [i30] Qifan Yu, Wei Chow, Zhongqi Yue, Kaihang Pan, Yang Wu, Xiaoyang Wan, Juncheng Li, Siliang Tang, Hanwang Zhang, Yueting Zhuang:
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea. CoRR abs/2411.15738 (2024)
- [i29] Haiyi Qiu, Minghe Gao, Long Qian, Kaihang Pan, Qifan Yu, Juncheng Li, Wenjie Wang, Siliang Tang, Yueting Zhuang, Tat-Seng Chua:
STEP: Enhancing Video-LLMs' Compositional Reasoning by Spatio-Temporal Graph-guided Self-Training. CoRR abs/2412.00161 (2024)
- [i28] Jinbin Bai, Wei Chow, Ling Yang, Xiangtai Li, Juncheng Li, Hanwang Zhang, Shuicheng Yan:
HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing. CoRR abs/2412.04280 (2024)
- [i27] Leigang Qu, Haochuan Li, Wenjie Wang, Xiang Liu, Juncheng Li, Liqiang Nie, Tat-Seng Chua:
SILMM: Self-Improving Large Multimodal Models for Compositional Text-to-Image Generation. CoRR abs/2412.05818 (2024)
- [i26] Qifan Yu, Zhebei Shen, Zhongqi Yue, Yang Wu, Wenqiao Zhang, Yunfei Li, Juncheng Li, Siliang Tang, Yueting Zhuang:
Mastering Collaborative Multi-modal Data Selection: A Focus on Informativeness, Uniqueness, and Representativeness. CoRR abs/2412.06293 (2024)
- [i25] Zhiqi Ge, Juncheng Li, Xinglei Pang, Minghe Gao, Kaihang Pan, Wang Lin, Hao Fei, Wenqiao Zhang, Siliang Tang, Yueting Zhuang:
Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining. CoRR abs/2412.10342 (2024)
- [i24] Jiang Liu, Bolin Li, Haoyuan Li, Tianwei Lin, Wenqiao Zhang, Tao Zhong, Zhelun Yu, Jinghao Wei, Hao Cheng, Hao Jiang, Zheqi Lv, Juncheng Li, Siliang Tang, Yueting Zhuang:
Boosting Private Domain Understanding of Efficient MLLMs: A Tuning-free, Adaptive, Universal Prompt Optimization Framework. CoRR abs/2412.19684 (2024)
- [i23] Haoyu Zheng, Wenqiao Zhang, Zheqi Lv, Yu Zhong, Yang Dai, Jianxiang An, Yongliang Shen, Juncheng Li, Dongping Zhang, Siliang Tang, Yueting Zhuang:
MAKIMA: Tuning-free Multi-Attribute Open-domain Video Editing via Mask-Guided Attention Modulation. CoRR abs/2412.19978 (2024)
- 2023
- [j1] Juncheng Li, Siliang Tang, Linchao Zhu, Wenqiao Zhang, Yi Yang, Tat-Seng Chua, Fei Wu, Yueting Zhuang:
Variational Cross-Graph Reasoning and Adaptive Structured Semantics Learning for Compositional Temporal Grounding. IEEE Trans. Pattern Anal. Mach. Intell. 45(10): 12601-12617 (2023)
- [c16] Wei Ji, Renjie Liang, Zhedong Zheng, Wenqiao Zhang, Shengyu Zhang, Juncheng Li, Mengze Li, Tat-Seng Chua:
Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learning. CVPR 2023: 23013-23022
- [c15] Kaihang Pan, Juncheng Li, Hongye Song, Jun Lin, Xiaozhong Liu, Siliang Tang:
Self-supervised Meta-Prompt Learning with Meta-Gradient Regularization for Few-shot Generalization. EMNLP (Findings) 2023: 1059-1077
- [c14] Xiangnan Chen, Qian Xiao, Juncheng Li, Duo Dong, Jun Lin, Xiaozhong Liu, Siliang Tang:
Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document. EMNLP (Findings) 2023: 1587-1598
- [c13] Yilin Lu, Juncheng Li, Xiaoqiang Wang, Haochen Shi, Tao Chen, Siliang Tang:
Reasoning Makes Good Annotators: An Automatic Task-specific Rules Distilling Framework for Low-resource Relation Extraction. EMNLP (Findings) 2023: 7447-7457
- [c12] Juncheng Li, Minghe Gao, Longhui Wei, Siliang Tang, Wenqiao Zhang, Mengze Li, Wei Ji, Qi Tian, Tat-Seng Chua, Yueting Zhuang:
Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models. ICCV 2023: 2551-2562
- [c11] Qifan Yu, Juncheng Li, Yu Wu, Siliang Tang, Wei Ji, Yueting Zhuang:
Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World. ICCV 2023: 21503-21514
- [c10] Mengze Li, Haoyu Zhang, Juncheng Li, Zhou Zhao, Wenqiao Zhang, Shengyu Zhang, Shiliang Pu, Yueting Zhuang, Fei Wu:
Unsupervised Domain Adaptation for Video Object Grounding with Cascaded Debiasing Learning. ACM Multimedia 2023: 3807-3816
- [i22] Juncheng Li, Siliang Tang, Linchao Zhu, Wenqiao Zhang, Yi Yang, Tat-Seng Chua, Fei Wu, Yueting Zhuang:
Variational Cross-Graph Reasoning and Adaptive Structured Semantics Learning for Compositional Temporal Grounding. CoRR abs/2301.09071 (2023)
- [i21] Juncheng Li, Minghe Gao, Longhui Wei, Siliang Tang, Wenqiao Zhang, Mengze Li, Wei Ji, Qi Tian, Tat-Seng Chua, Yueting Zhuang:
Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models. CoRR abs/2303.06571 (2023)
- [i20] Kaihang Pan, Juncheng Li, Hongye Song, Jun Lin, Xiaozhong Liu, Siliang Tang:
Meta-augmented Prompt Tuning for Better Few-shot Learning. CoRR abs/2303.12314 (2023)
- [i19] Qifan Yu, Juncheng Li, Yu Wu, Siliang Tang, Wei Ji, Yueting Zhuang:
Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World. CoRR abs/2303.13233 (2023)
- [i18] Bosheng Qin, Juncheng Li, Siliang Tang, Tat-Seng Chua, Yueting Zhuang:
InstructVid2Vid: Controllable Video Editing with Natural Language Instructions. CoRR abs/2305.12328 (2023)
- [i17] Qifan Yu, Juncheng Li, Wentao Ye, Siliang Tang, Yueting Zhuang:
Interactive Data Synthesis for Systematic Vision Adaptation via LLMs-AIGCs Collaboration. CoRR abs/2305.12799 (2023)
- [i16] Xiangnan Chen, Juncheng Li, Duo Dong, Qian Xiao, Jun Lin, Xiaozhong Liu, Siliang Tang:
Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document. CoRR abs/2305.13850 (2023)
- [i15] Juncheng Li, Kaihang Pan, Zhiqi Ge, Minghe Gao, Hanwang Zhang, Wei Ji, Wenqiao Zhang, Tat-Seng Chua, Siliang Tang, Yueting Zhuang:
Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions. CoRR abs/2308.04152 (2023)
- [i14] Kaihang Pan, Juncheng Li, Hongye Song, Hao Fei, Wei Ji, Shuo Zhang, Jun Lin, Xiaozhong Liu, Siliang Tang:
ControlRetriever: Harnessing the Power of Instructions for Controllable Retrieval. CoRR abs/2308.10025 (2023)
- [i13] Minghe Gao, Juncheng Li, Hao Fei, Liang Pang, Wei Ji, Guoming Wang, Wenqiao Zhang, Siliang Tang, Yueting Zhuang:
De-fine: Decomposing and Refining Visual Programs with Auto-Feedback. CoRR abs/2311.12890 (2023)
- [i12] Wenqiao Zhang, Zheqi Lv, Hao Zhou, Jia-Wei Liu, Juncheng Li, Mengze Li, Siliang Tang, Yueting Zhuang:
Revisiting the Domain Shift and Sample Uncertainty in Multi-source Active Domain Transfer. CoRR abs/2311.12905 (2023)
- [i11] Qifan Yu, Juncheng Li, Longhui Wei, Liang Pang, Wentao Ye, Bosheng Qin, Siliang Tang, Qi Tian, Yueting Zhuang:
HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data. CoRR abs/2311.13614 (2023)
- 2022
- [c9] Wenqiao Zhang, Haochen Shi, Jiannan Guo, Shengyu Zhang, Qingpeng Cai, Juncheng Li, Sihui Luo, Yueting Zhuang:
MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-Based Image Captioning. AAAI 2022: 3335-3343
- [c8] Juncheng Li, Junlin Xie, Long Qian, Linchao Zhu, Siliang Tang, Fei Wu, Yi Yang, Yueting Zhuang, Xin Eric Wang:
Compositional Temporal Grounding with Structured Variational Cross-Graph Correspondence Learning. CVPR 2022: 3022-3031
- [c7] Juncheng Li, Junlin Xie, Linchao Zhu, Long Qian, Siliang Tang, Wenqiao Zhang, Haochen Shi, Shengyu Zhang, Longhui Wei, Qi Tian, Yueting Zhuang:
Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos. ACM Multimedia 2022: 5083-5092
- [c6] Ziqi Jiang, Shengyu Zhang, Siyuan Yao, Wenqiao Zhang, Sihan Zhang, Juncheng Li, Zhou Zhao, Fei Wu:
Weakly-supervised Disentanglement Network for Video Fingerspelling Detection. ACM Multimedia 2022: 5446-5455
- [c5] Juncheng Li, Xin He, Longhui Wei, Long Qian, Linchao Zhu, Lingxi Xie, Yueting Zhuang, Qi Tian, Siliang Tang:
Fine-Grained Semantically Aligned Vision-Language Pre-Training. NeurIPS 2022
- [i10] Juncheng Li, Junlin Xie, Long Qian, Linchao Zhu, Siliang Tang, Fei Wu, Yi Yang, Yueting Zhuang, Xin Eric Wang:
Compositional Temporal Grounding with Structured Variational Cross-Graph Correspondence Learning. CoRR abs/2203.13049 (2022)
- [i9] Wenqiao Zhang, Jiannan Guo, Mengze Li, Haochen Shi, Shengyu Zhang, Juncheng Li, Siliang Tang, Yueting Zhuang:
BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid Counterfactual Training for Robust Content-based Image Retrieval. CoRR abs/2207.04211 (2022)
- [i8] Juncheng Li, Junlin Xie, Linchao Zhu, Long Qian, Siliang Tang, Wenqiao Zhang, Haochen Shi, Shengyu Zhang, Longhui Wei, Qi Tian, Yueting Zhuang:
Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos. CoRR abs/2208.01954 (2022)
- [i7] Juncheng Li, Xin He, Longhui Wei, Long Qian, Linchao Zhu, Lingxi Xie, Yueting Zhuang, Qi Tian, Siliang Tang:
Fine-Grained Semantically Aligned Vision-Language Pre-Training. CoRR abs/2208.02515 (2022)
- [i6] Bosheng Qin, Juncheng Li, Siliang Tang, Yueting Zhuang:
DBA: Efficient Transformer with Dynamic Bilinear Low-Rank Attention. CoRR abs/2211.16368 (2022)
- 2021
- [c4] Juncheng Li, Siliang Tang, Linchao Zhu, Haochen Shi, Xuanwen Huang, Fei Wu, Yi Yang, Yueting Zhuang:
Adaptive Hierarchical Graph Reasoning with Semantic Coherence for Video-and-Language Inference. ICCV 2021: 1847-1857
- [i5] Juncheng Li, Siliang Tang, Linchao Zhu, Haochen Shi, Xuanwen Huang, Fei Wu, Yi Yang, Yueting Zhuang:
Adaptive Hierarchical Graph Reasoning with Semantic Coherence for Video-and-Language Inference. CoRR abs/2107.12270 (2021)
- [i4] Wenqiao Zhang, Haochen Shi, Jiannan Guo, Shengyu Zhang, Qingpeng Cai, Juncheng Li, Sihui Luo, Yueting Zhuang:
MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-based Image Captioning. CoRR abs/2112.06558 (2021)
- 2020
- [c3] Juncheng Li, Xin Wang, Siliang Tang, Haizhou Shi, Fei Wu, Yueting Zhuang, William Yang Wang:
Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation. CVPR 2020: 12120-12129
- [c2] Jiacheng Li, Siliang Tang, Juncheng Li, Jun Xiao, Fei Wu, Shiliang Pu, Yueting Zhuang:
Topic Adaptation and Prototype Encoding for Few-Shot Visual Storytelling. ACM Multimedia 2020: 4208-4216
- [i3] Jiacheng Li, Siliang Tang, Juncheng Li, Jun Xiao, Fei Wu, Shiliang Pu, Yueting Zhuang:
Topic Adaptation and Prototype Encoding for Few-Shot Visual Storytelling. CoRR abs/2008.04504 (2020)
2010 – 2019
- 2019
- [c1] Juncheng Li, Siliang Tang, Fei Wu, Yueting Zhuang:
Walking with MIND: Mental Imagery eNhanceD Embodied QA. ACM Multimedia 2019: 1211-1219
- [i2] Juncheng Li, Siliang Tang, Fei Wu, Yueting Zhuang:
Walking with MIND: Mental Imagery eNhanceD Embodied QA. CoRR abs/1908.01482 (2019)
- [i1] Juncheng Li, Xin Wang, Siliang Tang, Haizhou Shi, Fei Wu, Yueting Zhuang, William Yang Wang:
Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation. CoRR abs/1911.07450 (2019)
last updated on 2026-05-07 02:54 CEST by the dblp team
all metadata released as open data under CC0 1.0 license