default search action
Wenxuan Huang 0001
Person information
- affiliation: East China Normal University, School of Computer Science and Technology, Shanghai, China
Other persons with the same name
- Wenxuan Huang — disambiguation page
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2026
- [i26]Wenxuan Huang, Yu Zeng, Qiuchen Wang, Zhen Fang, Shaosheng Cao, Zheng Chu, Qingyu Yin, Shuang Chen, Zhenfei Yin, Lin Chen, Zehui Chen, Yao Hu, Philip Torr, Feng Zhao, Wanli Ouyang:
Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models. CoRR abs/2601.22060 (2026) - [i25]Yu Zeng, Wenxuan Huang, Zhen Fang, Shuang Chen, Yufan Shen, Yishuo Cai, Xiaoman Wang, Zhenfei Yin, Lin Chen, Zehui Chen, Shiting Huang, Yiming Zhao, Xu Tang, Yao Hu, Philip Torr, Wanli Ouyang, Shaosheng Cao:
Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models. CoRR abs/2602.02185 (2026) - [i24]Kangjie Zhang, Wenxuan Huang, Xin Zhou, Boxiang Zhou, Dejia Song, Yuan Xie, Baochang Zhang, Lizhuang Ma, Nemo Chen, Xu Tang, Yao Hu, Shaohui Lin:
CLIP-Map: Structured Matrix Mapping for Parameter-Efficient CLIP Compression. CoRR abs/2602.05909 (2026) - [i23]Yuntian Tang, Bohan Jia, Wenxuan Huang, Lianyue Zhang, Jiao Xie, Wenxi Li, Rongrong Ji, Shaohui Lin:
Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression. CoRR abs/2602.08324 (2026) - [i22]Qiuchen Wang, Shihang Wang, Yu Zeng, Qiang Zhang, Fanrui Zhang, Zhuoning Guo, Bosi Zhang, Wenxuan Huang, Lin Chen, Zehui Chen, Pengjun Xie, Ruixue Ding:
VimRAG: Navigating Massive Visual Context in Retrieval-Augmented Generation via Multimodal Memory Graph. CoRR abs/2602.12735 (2026) - [i21]Zhuokang Shen, Yifan Wang, Hanyu Chen, Wenxuan Huang, Yunhang Shen, Shaohui Lin:
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant. CoRR abs/2603.01059 (2026) - [i20]Wenxuan Huang, Mingyu Tsoi, Yanhao Huang, Xinjie Mao, Xue Xia, Hao Wu, Jiaqi Wei, Yuejin Yang, Lang Yu, Cheng Tan, Xiang Zhang, Zhangyang Gao, Siqi Sun:
HarmonyCell: Automating Single-Cell Perturbation Modeling under Semantic and Distribution Shifts. CoRR abs/2603.01396 (2026) - [i19]Shuizhou Chen, Lang Yu, Kedu Jin, Songming Zhang, Hao Wu, Wenxuan Huang, Sheng Xu, Quan Qian, Qin Chen, Lei Bai, Siqi Sun, Zhangyang Gao:
SCALE:Scalable Conditional Atlas-Level Endpoint transport for virtual cell perturbation prediction. CoRR abs/2603.17380 (2026) - 2025
- [j1]Wenxuan Huang
, Guanqun Sheng
, Xingong Tang, Kai Ma, Jingyi Lu, Hang Sun
:
An Intelligent First-Arrival Picking Method of Microseismic Signals Based on the Small Sample Expansion. IEEE Trans. Geosci. Remote. Sens. 63: 1-19 (2025) - [c4]Xiangfeng Xu, Pinyi Zhang, Wenxuan Huang, Yunhang Shen, Haosheng Chen, Jingzhong Lin, Wei Li, Gaoqi He, Jiao Xie, Shaohui Lin:
Weakly Supervised Semantic Segmentation via Progressive Confidence Region Expansion. CVPR 2025: 9829-9838 - [c3]Wenxuan Huang, Zijie Zhai, Yunhang Shen, Shaosheng Cao, Fei Zhao, Xiangfeng Xu, Zheyu Ye, Shaohui Lin:
Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification. ICLR 2025 - [c2]Ling You
, Wenxuan Huang
, Xinni Xie
, Xiangyi Wei
, Bangyan Li
, Shaohui Lin
, Yang Li
, Changbo Wang
:
TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation. ACM Multimedia 2025: 3418-3427 - [i18]Wenxuan Huang, Bohan Jia, Zijie Zhai, Shaosheng Cao, Zheyu Ye, Fei Zhao, Zhe Xu, Yao Hu, Shaohui Lin:
Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models. CoRR abs/2503.06749 (2025) - [i17]Bangyan Li, Wenxuan Huang, Yunhang Shen, Yeqiang Wang, Shaohui Lin, Jingzhong Lin, Ling You, Yinqi Zhang, Ke Li, Xing Sun, Yuling Sun:
LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition? CoRR abs/2503.07487 (2025) - [i16]Yukun Qi, Yiming Zhao, Yu Zeng, Xikun Bao, Wenxuan Huang, Lin Chen, Zehui Chen, Jie Zhao, Zhongang Qi, Feng Zhao:
VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning. CoRR abs/2504.07956 (2025) - [i15]Ling You, Wenxuan Huang, Xinni Xie, Xiangyi Wei, Bangyan Li, Shaohui Lin, Yang Li, Changbo Wang:
TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation. CoRR abs/2504.17365 (2025) - [i14]Jingzhong Lin, Yuanyuan Qi, Xinru Li, Wenxuan Huang, Xiangfeng Xu, Bangyan Li, Xuejiao Wang, Gaoqi He:
ReactDance: Progressive-Granular Representation for Long-Term Coherent Reactive Dance Generation. CoRR abs/2505.05589 (2025) - [i13]Bohan Jia, Wenxuan Huang, Yuntian Tang, Junbo Qiao, Jincheng Liao, Shaosheng Cao, Fei Zhao, Zhaopeng Feng, Zhouhong Gu, Zhenfei Yin, Lei Bai, Wanli Ouyang, Lin Chen, Fei Zhao, Zihan Wang, Yuan Xie, Shaohui Lin:
CompBench: Benchmarking Complex Instruction-guided Image Editing. CoRR abs/2505.12200 (2025) - [i12]Zhaopeng Feng, Yupu Liang, Shaosheng Cao, Jiayuan Su, Jiahan Ren, Zhe Xu, Yao Hu, Wenxuan Huang, Jian Wu, Zuozhu Liu:
MT3: Scaling MLLM-based Text Image Machine Translation via Multi-Task Reinforcement Learning. CoRR abs/2505.19714 (2025) - [i11]Yuhao Zhou, Yiheng Wang, Xuming He, Ruoyao Xiao, Zhiwei Li, Qiantai Feng, Zijie Guo, Yuejin Yang, Hao Wu, Wenxuan Huang, Jiaqi Wei, Dan Si, Xiuqi Yao, Jia Bu, Haiwen Huang, Tianfan Fu, Shixiang Tang, Ben Fei, Dongzhan Zhou, Fenghua Ling, Yan Lu, Siqi Sun, Chenhui Li, Guanjie Zheng, Jiancheng Lv, Wenlong Zhang, Lei Bai:
Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning. CoRR abs/2506.10521 (2025) - [i10]Zhouhong Gu, Xiaoxuan Zhu, Yin Cai, Hao Shen, Xingzhou Chen, Qingyi Wang, Jialin Li, Xiaoran Shi, Haoran Guo, Wenxuan Huang, Hongwei Feng, Yanghua Xiao, Zheyu Ye, Yao Hu, Shaosheng Cao:
AgentGroupChat-V2: Divide-and-Conquer Is What LLM-Based Multi-Agent System Need. CoRR abs/2506.15451 (2025) - [i9]Wenxuan Huang, Shuang Chen, Zheyong Xie, Shaosheng Cao, Shixiang Tang, Yufan Shen, Qingyu Yin, Wenbo Hu, Xiaoman Wang, Yuntian Tang, Junbo Qiao, Yue Guo, Yao Hu, Zhenfei Yin, Philip Torr, Yu Cheng, Wanli Ouyang, Shaohui Lin:
Interleaving Reasoning for Better Text-to-Image Generation. CoRR abs/2509.06945 (2025) - [i8]Yu Zeng, Wenxuan Huang, Shiting Huang, Xikun Bao, Yukun Qi, Yiming Zhao, Qiuchen Wang, Lin Chen, Zehui Chen, Huaian Chen, Wanli Ouyang, Feng Zhao:
Agentic Jigsaw Interaction Learning for Enhancing Visual Perception and Reasoning in Vision-Language Models. CoRR abs/2510.01304 (2025) - [i7]Qin Dong, Yuntian Tang, Heming Jia, Yunhang Shen, Bohan Jia, Wenxuan Huang, Lianyue Zhang, Jiao Xie, Shaohui Lin:
MASA: Rethinking the Representational Bottleneck in LoRA with Multi-A Shared Adaptation. CoRR abs/2510.06005 (2025) - [i6]Qingyu Yin, Chak Tou Leong, Linyi Yang, Wenxuan Huang, Wenjie Li, Xiting Wang, Jaehong Yoon, YunXing, XingYu, Jinjin Gu:
Refusal Falls off a Cliff: How Safety Alignment Fails in Reasoning? CoRR abs/2510.06036 (2025) - [i5]Jiaqi Wei, Xiang Zhang, Yuejin Yang, Wenxuan Huang, Juntai Cao, Sheng Xu, Xiang Zhuang, Zhangyang Gao, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan, Chenyu You, Wanli Ouyang, Siqi Sun:
Unifying Tree Search Algorithm and Reward Design for LLM Reasoning: A Survey. CoRR abs/2510.09988 (2025) - [i4]Xiaoyu Zhan, Wenxuan Huang, Hao Sun, Xinyu Fu, Changfeng Ma, Shaosheng Cao, Bohan Jia, Shaohui Lin, Zhenfei Yin, Lei Bai, Wanli Ouyang, Yuanqi Li, Jie Guo, Yanwen Guo:
Actial: Activate Spatial Reasoning Ability of Multimodal Large Language Models. CoRR abs/2511.01618 (2025) - 2024
- [c1]Wenxuan Huang, Yunhang Shen, Jiao Xie, Baochang Zhang, Gaoqi He, Ke Li, Xing Sun
, Shaohui Lin:
A General and Efficient Training for Transformer via Token Expansion. CVPR 2024: 15783-15792 - [i3]Wenxuan Huang, Yunhang Shen, Jiao Xie, Baochang Zhang, Gaoqi He, Ke Li, Xing Sun
, Shaohui Lin:
A General and Efficient Training for Transformer via Token Expansion. CoRR abs/2404.00672 (2024) - [i2]Wenxuan Huang, Zijie Zhai, Yunhang Shen, Shaosheng Cao, Fei Zhao, Xiangfeng Xu, Zheyu Ye, Shaohui Lin:
Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification. CoRR abs/2412.00876 (2024) - 2023
- [i1]Shaohui Lin, Wenxuan Huang, Jiao Xie, Baochang Zhang, Yunhang Shen, Zhou Yu, Jungong Han, David S. Doermann
:
Filter Pruning for Efficient CNNs via Knowledge-driven Differential Filter Sampler. CoRR abs/2307.00198 (2023)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-04-25 00:50 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint