default search action
Sirui Zhao
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2026
- [j14]Biao Zhu, Jun Zhang, Sirui Zhao, Zhengye Zhang, Enhong Chen:
Unsupervised lightweight 3D convolutional network for enhanced infrared imaging in wearable devices. Frontiers Comput. Sci. 20(1): 2001306 (2026) - [j13]Yifan Xu
, Sirui Zhao
, Shifeng Liu
, Tong Xu
, Enhong Chen
:
Emotionally Controllable Audio-driven Talking Face Generation. ACM Trans. Multim. Comput. Commun. Appl. 22(2): 40:1-40:22 (2026) - [i25]Feng-Qi Cui, Jinyang Huang, Sirui Zhao, Jinglong Guo, Qifan Cai, Xin Yan, Zhi Liu:
ReMA: A Training-Free Plug-and-Play Mixing Augmentation for Video Behavior Recognition. CoRR abs/2601.00311 (2026) - [i24]Ji-Xuan He, Jia-Cheng Zhao, Feng-Qi Cui, Jinyang Huang, Yang Liu, Sirui Zhao, Meng Li, Zhi Liu:
Dual-Path Learning based on Frequency Structural Decoupling and Regional-Aware Fusion for Low-Light Image Super-Resolution. CoRR abs/2603.27301 (2026) - [i23]Wenli Zhang, Xianglong Shi, Sirui Zhao, Xinqi Chen, Guo Cheng, Yifan Xu, Tong Xu, Yong Liao:
SyncBreaker:Stage-Aware Multimodal Adversarial Attacks on Audio-Driven Talking Head Generation. CoRR abs/2604.08405 (2026) - [i22]Shifeng Liu, Zhengye Zhang, Sirui Zhao, Xinglong Mao, Zhehan Kan, Zhixiang Wei, Shiwei Wu, Chaoyou Fu, Tong Xu, Enhong Chen:
ActFER: Agentic Facial Expression Recognition via Active Tool-Augmented Visual Reasoning. CoRR abs/2604.08990 (2026) - [i21]Shukang Yin, Sirui Zhao, Hanchao Wang, Baozhi Jia, Xianquan Wang, Chaoyou Fu, Enhong Chen:
Tango: Taming Visual Signals for Efficient Video Large Language Models. CoRR abs/2604.09547 (2026) - 2025
- [j12]Shifeng Liu
, Xinglong Mao
, Sirui Zhao
, Peiming Li, Tong Xu
, Enhong Chen
:
MER-CLIP: AU-Guided Vision-Language Alignment for Micro-Expression Recognition. IEEE Trans. Affect. Comput. 16(4): 3028-3042 (2025) - [j11]Hao Wang
, Mingjia Yin
, Luankang Zhang
, Sirui Zhao
, Enhong Chen
:
MF-GSLAE: A Multi-Factor User Representation Pre-Training Framework for Dual-Target Cross-Domain Recommendation. ACM Trans. Inf. Syst. 43(2): 30:1-30:28 (2025) - [c18]Chaoyou Fu
, Yuhan Dai, Yongdong Luo
, Lei Li, Shuhuai Ren, Renrui Zhang, Zihan Wang, Chenyu Zhou, Yunhang Shen, Mengdan Zhang, Peixian Chen, Yanwei Li
, Shaohui Lin, Sirui Zhao, Ke Li, Tong Xu
, Xiawu Zheng, Enhong Chen, Caifeng Shan
, Ran He, Xing Sun:
Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis. CVPR 2025: 24108-24118 - [c17]Siyuan Jin, Sirui Zhao
, Yifan Xu
, Shifeng Liu
, Mengduo Wu, Tong Xu
:
JoyLive: Efficient Audio-Driven Portrait Animation by 3D Implict Keypoints. ICIC (15) 2025: 499-510 - [c16]Xiaobai Li, Xinglong Mao, Hao Zou, Jingjing Chen, Sirui Zhao:
Cross-Cultural Nuances of Micro-Expressions and Action Units: A Comparative Study. ICMEW 2025: 1-6 - [c15]Fangyuan Liu, Sirui Zhao, Tong Xu
, Yu Sun, Hao Wang, Suojuan Zhang, Enhong Chen:
PhysFFTFormer: A Frequency Domain-based Vision Transformer for Efficient Remote Physiological Measurement. ICME 2025: 1-6 - [c14]Fangyuan Liu
, Sirui Zhao
, Kang Yin
, Tong Xu
, Enhong Chen
:
DepFormer: A Unified Framework with Bimodal Collaborative Transformer for Depression Detection. ACM Multimedia 2025: 13930-13936 - [i20]Shifeng Liu, Xinglong Mao, Sirui Zhao, Peiming Li, Tong Xu
, Enhong Chen:
MER-CLIP: AU-Guided Vision-Language Alignment for Micro-Expression Recognition. CoRR abs/2505.05937 (2025) - [i19]Zhengye Zhang, Sirui Zhao, Shifeng Liu, Shukang Yin, Xinglong Mao, Tong Xu
, Enhong Chen:
MELLM: Exploring LLM-Powered Micro-Expression Understanding Enhanced by Subtle Motion Perception. CoRR abs/2505.07007 (2025) - [i18]Xinglong Mao, Shifeng Liu, Sirui Zhao, Tong Xu
, Enhong Chen:
MERba: Multi-Receptive Field MambaVision for Micro-Expression Recognition. CoRR abs/2506.14468 (2025) - [i17]Yubo Huang, Weiqiang Wang, Sirui Zhao, Tong Xu, Lin Liu, Enhong Chen:
Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-based Embedding Router. CoRR abs/2506.19833 (2025) - [i16]Xianglong Shi, Silin Cheng, Sirui Zhao, Yunhan Jiang, Enhong Chen, Yang Liu, Sébastien Ourselin:
LIHE: Linguistic Instance-Split Hyperbolic-Euclidean Framework for Generalized Weakly-Supervised Referring Expression Comprehension. CoRR abs/2511.12020 (2025) - [i15]Yubo Huang, Hailong Guo, Fangtai Wu, Shifeng Zhang, Shijie Huang, Qijun Gan, Lin Liu, Sirui Zhao, Enhong Chen, Jiaming Liu, Steven Hoi:
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length. CoRR abs/2512.04677 (2025) - [i14]Kang Yin, Chunyu Qiang, Sirui Zhao, Xiaopeng Wang, Yuzhe Liang, Pengfei Cai, Tong Xu
, Chen Zhang, Enhong Chen:
DMP-TTS: Disentangled multi-modal Prompting for Controllable Text-to-Speech with Chained Guidance. CoRR abs/2512.09504 (2025) - 2024
- [j10]Shukang Yin, Chaoyou Fu
, Sirui Zhao, Tong Xu
, Hao Wang, Dianbo Sui, Yunhang Shen, Ke Li, Xing Sun
, Enhong Chen:
Woodpecker: hallucination correction for multimodal large language models. Sci. China Inf. Sci. 67(12) (2024) - [j9]Sirui Zhao
, Huaying Tang
, Xinglong Mao
, Shifeng Liu
, Yiming Zhang
, Hao Wang
, Tong Xu
, Enhong Chen
:
DFME: A New Benchmark for Dynamic Facial Micro-Expression Recognition. IEEE Trans. Affect. Comput. 15(3): 1371-1386 (2024) - [j8]Shukang Yin
, Sirui Zhao
, Hao Wang
, Tong Xu
, Enhong Chen
:
Exploiting Instance-level Relationships in Weakly Supervised Text-to-Video Retrieval. ACM Trans. Multim. Comput. Commun. Appl. 20(10): 316:1-316:21 (2024) - [c13]Shifeng Liu, Xinglong Mao, Sirui Zhao, Chaoyou Fu
, Ying Yu, Tong Xu
, Enhong Chen:
TGMAE: Self-supervised Micro-Expression Recognition with Temporal Gaussian Masked Autoencoder. ICME 2024: 1-6 - [c12]Mingjia Yin
, Hao Wang
, Wei Guo
, Yong Liu
, Suojuan Zhang
, Sirui Zhao
, Defu Lian
, Enhong Chen
:
Dataset Regeneration for Sequential Recommendation. KDD 2024: 3954-3965 - [c11]Chenxiao Liu
, Zheyong Xie
, Sirui Zhao
, Jin Zhou
, Tong Xu
, Minglei Li
, Enhong Chen
:
Speak From Heart: An Emotion-Guided LLM-Based Multimodal Method for Emotional Dialogue Generation. ICMR 2024: 533-542 - [c10]Zhengye Zhang
, Sirui Zhao
, Xinglong Mao
, Shifeng Liu
, Hao Wang
, Tong Xu
, Enhong Chen
:
A Multi-scale Feature Learning Network with Optical Flow Correction for Micro- and Macro-expression Spotting. ACM Multimedia 2024: 11497-11502 - [c9]Xinglong Mao, Shifeng Liu, Sirui Zhao, Yiming Zhang, Hao Wang, Tong Xu
, Enhong Chen:
H2LMER: A Cross Frame-Rate Representation Alignment Framework for Micro-expression Recognition. PRCV (11) 2024: 459-472 - [i13]Mingjia Yin, Hao Wang, Wei Guo, Yong Liu, Zhi Li, Sirui Zhao, Defu Lian, Enhong Chen:
Learning Partially Aligned Item Representation for Cross-Domain Sequential Recommendation. CoRR abs/2405.12473 (2024) - [i12]Mingjia Yin, Hao Wang, Wei Guo, Yong Liu, Suojuan Zhang, Sirui Zhao, Defu Lian, Enhong Chen:
Dataset Regeneration for Sequential Recommendation. CoRR abs/2405.17795 (2024) - [i11]Chaoyou Fu
, Yuhan Dai, Yondong Luo, Lei Li, Shuhuai Ren, Renrui Zhang, Zihan Wang, Chenyu Zhou, Yunhang Shen, Mengdan Zhang, Peixian Chen, Yanwei Li
, Shaohui Lin, Sirui Zhao, Ke Li, Tong Xu
, Xiawu Zheng, Enhong Chen, Rongrong Ji, Xing Sun
:
Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis. CoRR abs/2405.21075 (2024) - [i10]Tingjia Shen, Hao Wang, Jiaqing Zhang, Sirui Zhao, Liangyue Li, Zulong Chen, Defu Lian, Enhong Chen:
Exploring User Retrieval Integration towards Large Language Models for Cross-Domain Sequential Recommendation. CoRR abs/2406.03085 (2024) - [i9]Chaoyou Fu
, Yifan Zhang, Shukang Yin, Bo Li, Xinyu Fang, Sirui Zhao, Haodong Duan, Xing Sun
, Ziwei Liu, Liang Wang, Caifeng Shan
, Ran He:
MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs. CoRR abs/2411.15296 (2024) - [i8]Shukang Yin, Chaoyou Fu
, Sirui Zhao, Yunhang Shen, Chunjiang Ge, Yan Yang, Zuwei Long, Yuhan Dai, Tong Xu
, Xing Sun
, Ran He, Caifeng Shan
, Enhong Chen:
T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs. CoRR abs/2411.19951 (2024) - 2023
- [j7]Mingdi Hu, Long Bai, Jiulun Fan, Sirui Zhao, Enhong Chen:
Vehicle color recognition based on smooth modulation neural network with multi-scale feature fusion. Frontiers Comput. Sci. 17(3): 173321 (2023) - [j6]Sirui Zhao, Hongyu Jiang
, Hanqing Tao
, Rui Zha, Kun Zhang
, Tong Xu
, Enhong Chen
:
PEDM: A Multi-task Learning Model for Persona-aware Emoji-embedded Dialogue Generation. ACM Trans. Multim. Comput. Commun. Appl. 19(3s): 132:1-132:21 (2023) - [c8]Mingjia Yin
, Hao Wang
, Xiang Xu
, Likang Wu
, Sirui Zhao
, Wei Guo
, Yong Liu
, Ruiming Tang
, Defu Lian
, Enhong Chen
:
APGL4SR: A Generic Framework with Adaptive and Personalized Global Collaborative Information in Sequential Recommendation. CIKM 2023: 3009-3019 - [c7]Shukang Yin, Shiwei Wu, Tong Xu
, Shifeng Liu, Sirui Zhao, Enhong Chen:
AU-aware graph convolutional network for Macroand Micro-expression spotting. ICME 2023: 228-233 - [c6]Yiming Zhang, Hao Wang, Yifan Xu, Xinglong Mao, Tong Xu
, Sirui Zhao, Enhong Chen:
Adaptive Graph Attention Network with Temporal Fusion for Micro-Expressions Recognition. ICME 2023: 1391-1396 - [c5]Minghao Liu
, Haiyi Liu
, Sirui Zhao
, Fei Ma
, Minglei Li
, Zonghong Dai
, Hao Wang
, Tong Xu
, Enhong Chen
:
STAN: Spatial-Temporal Awareness Network for Temporal Action Detection. MMSports@MM 2023: 161-165 - [i7]Sirui Zhao, Huaying Tang, Xinglong Mao, Shifeng Liu, Hanqing Tao, Hao Wang, Tong Xu
, Enhong Chen:
More is Better: A Database for Spontaneous Micro-Expression with High Frame Rates. CoRR abs/2301.00985 (2023) - [i6]Shukang Yin, Shiwei Wu, Tong Xu
, Shifeng Liu, Sirui Zhao, Enhong Chen:
AU-aware graph convolutional network for Macro- and Micro-expression spotting. CoRR abs/2303.09114 (2023) - [i5]Shukang Yin, Chaoyou Fu
, Sirui Zhao, Ke Li, Xing Sun
, Tong Xu
, Enhong Chen:
A Survey on Multimodal Large Language Models. CoRR abs/2306.13549 (2023) - [i4]Chao Zhang, Shiwei Wu, Sirui Zhao, Tong Xu, Enhong Chen:
A Solution to CVPR'2023 AQTC Challenge: Video Alignment for Multi-Step Inference. CoRR abs/2306.14412 (2023) - [i3]Shukang Yin, Chaoyou Fu
, Sirui Zhao, Tong Xu
, Hao Wang, Dianbo Sui, Yunhang Shen, Ke Li, Xing Sun
, Enhong Chen:
Woodpecker: Hallucination Correction for Multimodal Large Language Models. CoRR abs/2310.16045 (2023) - [i2]Mingjia Yin, Hao Wang, Xiang Xu, Likang Wu, Sirui Zhao, Wei Guo, Yong Liu, Ruiming Tang
, Defu Lian
, Enhong Chen:
APGL4SR: A Generic Framework with Adaptive and Personalized Global Collaborative Information in Sequential Recommendation. CoRR abs/2311.02816 (2023) - [i1]Chaoyou Fu
, Renrui Zhang, Zihan Wang, Yubo Huang, Zhengye Zhang, Longtian Qiu, Gaoxiang Ye, Yunhang Shen, Mengdan Zhang, Peixian Chen, Sirui Zhao, Shaohui Lin, Deqiang Jiang, Di Yin, Peng Gao, Ke Li, Hongsheng Li, Xing Sun
:
A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise. CoRR abs/2312.12436 (2023) - 2022
- [j5]Sirui Zhao
, Huaying Tang
, Shifeng Liu, Yangsong Zhang, Hao Wang, Tong Xu
, Enhong Chen, Cuntai Guan:
ME-PLAN: A deep prototypical learning with local attention network for dynamic micro-expression recognition. Neural Networks 153: 427-443 (2022) - [j4]Wei Cao
, Kun Zhang
, Shulan Ruan
, Hanqing Tao
, Sirui Zhao, Hao Wang, Qi Liu
, Enhong Chen
:
Causal Narrative Comprehension: A New Perspective for Emotion Cause Extraction. IEEE Trans. Affect. Comput. 13(4): 1743-1758 (2022) - [c4]Rijin Jin, Sirui Zhao, Zhongkai Hao, Yifan Xu, Tong Xu
, Enhong Chen:
AVT: Au-Assisted Visual Transformer for Facial Expression Recognition. ICIP 2022: 2661-2665 - [c3]Sirui Zhao, Shukang Yin, Huaying Tang, Rijin Jin, Yifan Xu, Tong Xu
, Enhong Chen:
Fine-grained Micro-Expression Generation based on Thin-Plate Spline and Relative AU Constraint. ACM Multimedia 2022: 7150-7154 - [c2]Wenhao Leng, Sirui Zhao, Yiming Zhang, Shifeng Liu, Xinglong Mao, Hao Wang, Tong Xu
, Enhong Chen:
ABPN: Apex and Boundary Perception Network for Micro- and Macro-Expression Spotting. ACM Multimedia 2022: 7160-7164 - 2021
- [j3]Sirui Zhao, Hanqing Tao, Yangsong Zhang, Tong Xu
, Kun Zhang, Zhongkai Hao, Enhong Chen
:
A two-stage 3D CNN based learning method for spontaneous micro-expression recognition. Neurocomputing 448: 276-289 (2021) - [j2]Liang Fan, Cheng Chen, Sirui Zhao, Xiaorong Zhang, Yadong Wu, Fang Wang:
Multi-threaded parallel projection tetrahedral algorithm for unstructured volume rendering. J. Vis. 24(2): 261-274 (2021) - [j1]Yangsong Zhang, Huan Cai
, Li Nie
, Peng Xu, Sirui Zhao, Cuntai Guan
:
An end-to-end 3D convolutional neural network for decoding attentive mental state. Neural Networks 144: 129-137 (2021) - [c1]Yifan Xu, Sirui Zhao, Huaying Tang, Xinglong Mao, Tong Xu
, Enhong Chen
:
FAMGAN: Fine-grained AUs Modulation based Generative Adversarial Network for Micro-Expression Generation. ACM Multimedia 2021: 4813-4817
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-05-11 00:43 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint