ICME 2023: Brisbane, Australia
- IEEE International Conference on Multimedia and Expo, ICME 2023, Brisbane, Australia, July 10-14, 2023. IEEE 2023, ISBN 978-1-6654-6891-6
- Prashant Pandey, Mustafa Chasmai, Monish Natarajan, Brejesh Lall:
Weakly Supervised Few-Shot and Zero-Shot Semantic Segmentation with Mean Instance Aware Prompt Learning. 1-6 - Qianwen Cao, Heyan Huang, Minpeng Liao, Xianling Mao:
Ada-SwinBERT: Adaptive Token Selection for Efficient Video Captioning with Online Self-Distillation. 7-12 - Jiuxiang You, Zhenguo Yang, Qing Li, Wenyin Liu:
A Retriever-Reader Framework with Visual Entity Linking for Knowledge-Based Visual Question Answering. 13-18 - Pufen Zhang, Peng Shi, Song Zhang:
2S-DFN: Dual-semantic Decoding Fusion Networks for Fine-grained Image Recognition. 19-24 - Yongzhu Miao, Shasha Li, Jintao Tang, Ting Wang:
MuDPT: Multi-modal Deep-symphysis Prompt Tuning for Large Pre-trained Vision-Language Models. 25-30 - Sai Shashank Kalakonda, Shubh Maheshwari, Ravi Kiran Sarvadevabhatla:
Action-GPT: Leveraging Large-scale Language Models for Improved and Generalized Action Generation. 31-36 - Tianhua Xu, Sheng-hua Zhong, Zhijiao Xiao:
Protecting Intellectual Property of EEG-based Model with Watermarking. 37-42 - Hanxiu Zhang, Guitao Cao, Xinyue Zhang, Jing Xiang, Chunwei Wu:
Making Adversarial Attack Imperceptible in Frequency Domain: A Watermark-based Framework. 43-48 - Jie Luo, Peisong He, Jiayong Liu, Hongxia Wang, Chunwang Wu, Yijing Chen, Wanjie Li, Jiangchuan Li:
Content-adaptive Adversarial Embedding for Image Steganography Using Deep Reinforcement Learning. 49-54 - Youqiang Sun, Jianyi Liu, Ru Zhang:
A Robust Generative Image Steganography Method based on Guidance Features in Image Synthesis. 55-60 - Shiqiang Wu, Jie Liu, Ying Huang, Hu Guan, Shuwu Zhang:
Adversarial Audio Watermarking: Embedding Watermark into Deep Feature. 61-66 - Tengjun Liu, Ying Chen, Wanxuan Gu:
Deniable Diffusion Generative Steganography. 67-71 - Songbin Li, Xiangzhi Yang, Jingang Wang:
Sea Surface Object Detection Based on Background Dynamic Perception and Cross-Layer Semantic Interaction. 72-77 - Guikun Chen, Lin Li, Yawei Luo, Jun Xiao:
Addressing Predicate Overlap in Scene Graph Generation with Semantic Granularity Controller. 78-83 - Shiqi Ren, Chao Zhu, Mengyin Liu, Xu-Cheng Yin:
Towards Discriminative Semantic Relationship for Fine-grained Crowd Counting. 84-89 - Jun Xie, Yixuan Zhou, Xing Xu, Guoqing Wang, Fumin Shen, Yang Yang:
Region-Aware Semantic Consistency for Unsupervised Domain-Adaptive Semantic Segmentation. 90-95 - Chuang Zhao, Hefei Ling, Yuxuan Shi, Chengxin Zhao, Jiazhong Chen, Qiang Cao:
Deep Unsupervised Hashing with Selective Semantic Mining. 96-101 - Qiaoqiao Wei, Hui Zhang, Jun-Hai Yong:
Boosting Interactive Image Segmentation by Exploiting Semantic Clues. 102-107 - Dafeng Li, Yingying Zhu:
Visual-Linguistic Alignment and Composition for Image Retrieval with Text Feedback. 108-113 - Xinyu Zhou, Anna Zhu, Huen Chen, Wei Pan:
Scene Text Involved "Text"-to-Image Retrieval through Logically Hierarchical Matching. 114-119 - Yi Li, Meihua Yu, Xin Xie, Haiyan Fu, Hao He, Yanqing Guo:
Federating Hashing Networks Adaptively for Privacy-Preserving Retrieval. 120-125 - Kangkang Lu, Yanhua Yu, Meiyu Liang, Min Zhang, Xiaowen Cao, Zehua Zhao, Mengran Yin, Zhe Xue:
Deep Unsupervised Momentum Contrastive Hashing for Cross-modal Retrieval. 126-131 - Yiyang Cai, Jiaming Lu, Jiewen Wang, Shuang Liang:
Uncertainty-Aware Cross-Modal Transfer Network for Sketch-Based 3D Shape Retrieval. 132-137 - Guoliang Wang, Yanlei Shang, Yong Chen, Chaoqi Zhen, Dequan Cheng:
Scene Graph based Fusion Network for Image-Text Retrieval. 138-143 - Yuchao Feng, Honghui Xu, Jiawei Jiang, Jianwei Zheng:
Compact Intertemporal Coupling Network for Remote Sensing Change Detection. 144-149 - Jueyu Chen, Guanyu Xing, Jingwei Liao, Housheng Wei, Yanli Liu:
Boundary-aware Shadow Detection via Mask Decoupling and Feature Correction. 150-155 - Yuzhong Zhao, Yuanqiang Cai, Weijia Wu, Weiqiang Wang:
Explore Faster Localization Learning For Scene Text Detection. 156-161 - Xiaofeng Ji, Jin Chen, Xinxiao Wu:
Counterfactual Inference for Visual Relationship Detection in Videos. 162-167 - Huayi Zhou, Fei Jiang, Hongtao Lu:
Body-Part Joint Detection and Association via Extended Object Representation. 168-173 - Jian Cui, Lin Li, Xiaohui Tao:
Be-or-Not Prompt Enhanced Hard Negatives Generating For Memes Category Detection. 174-179 - Yanni Wang, Gang Yang, Dayong Ding, Jianchun Zhao:
Automatic Retinal Nerve Fiber Trajectory Simulation and Quasi-polar Transformation for Detecting Retinal Nerve Fiber Layer Defect in Fundus Images. 180-185 - Jiawei Jiang, Jiacheng Chen, Honghui Xu, Yuchao Feng, Jianwei Zheng:
GA-HQS: MRI reconstruction via a generically accelerated unfolding approach. 186-191 - Yi Li, Baoyao Yang, Dan Pan, An Zeng, Long Wu, Yang Yang:
Early Diagnosis of Alzheimer's Disease Based on Multimodal Hypergraph Attention Network. 192-197 - Shanshan Huang, Qingsong Li, Lei Wang, Yuanhao Wang, Li Liu:
Score-based causal feature selection for cancer risk prediction. 198-203 - Wentian Cai, Yulin Cheng, Ying Gao, Weixiao Liu, Xinyan Xie, Xiongwen Luo, Weixian Yang, Zaiyi Liu, Changhong Liang:
A Dual-Path Supplemental Information Learning Architecture for Breast Cancer Ki-67 Status Prediction in T2w MRI. 210-215 - Hui Zhang, Shiqi Shen, Jinhua Xu:
Expression-Guided Attention GAN for Fine-Grained Facial Expression Editing. 216-221 - Yini Fang, Didan Deng, Liang Wu, Frederic Jumelle, Bertram E. Shi:
RMES: Real-Time Micro-Expression Spotting Using Phase From Riesz Pyramid. 222-227 - Shukang Yin, Shiwei Wu, Tong Xu, Shifeng Liu, Sirui Zhao, Enhong Chen:
AU-aware graph convolutional network for Macro- and Micro-expression spotting. 228-233 - Hao Sun, Chenchen Pi, Wei Xie:
Semi-Supervised Facial Expression Recognition by Exploring False Pseudo-Labels. 234-239 - Jingning Xu, Benlai Tang, Mingjie Wang, Minghao Li, Meirong Ma:
CPNet: Exploiting CLIP-based Attention Condenser and Probability Map Guidance for High-fidelity Talking Face Generation. 240-245 - David Anghelone, Sarah Lannes, Antitza Dantcheva:
ANYRES: Generating High-Resolution visible-face images from Low-Resolution thermal-face images. 246-251 - Yutong Li, Zhenyu Liu, Gang Li, Qiongqiong Chen, Zhijie Ding, Xiping Hu, Bin Hu:
A Visually Interpretable Convolutional-Transformer Model for Assessing Depression from Facial Images. 252-257 - Zhaowen Li, Xu Zhao, Peigeng Ding, Zongxing Gao, Yuting Yang, Ming Tang, Jinqiao Wang:
FreConv: Frequency Branch-and-Integration Convolutional Networks. 258-263 - Ruofan Wang, Jiayu Guo, Rui-Wei Zhao, Ling Su, Yingzi Ye, Xiaobo Zhang, Yuejie Zhang, Rui Feng:
Class-aware Variational Auto-encoder for Open Set Recognition. 264-269 - Mingyang Zhang, Xinyi Yu, Jingtao Rong, Linlin Ou:
Repnas: Searching for Efficient Re-Parameterizing Blocks. 270-275 - Bowen Zhao, Weidong Chen, Bo Hu, Hongtao Xie, Zhendong Mao:
Difference-Aware Iterative Reasoning Network for Key Relation Detection. 276-281 - Luying Li, Lizhuang Ma:
Injecting-Diffusion: Inject Domain-Independent Contents into Diffusion Models for Unpaired Image-to-Image Translation. 282-287 - Lei Xu, Rong Wang, Feiping Nie, Jun Wu, Xuelong Li:
Semi-Supervised Top-k Feature Selection with a General Optimization Framework. 288-293 - Yukun Zhang, Shengming Yuan, Jingkuan Song, Yixuan Zhou, Lin Zhang, Yulan He:
Towards Boosting Black-Box Attack Via Sharpness-Aware. 294-299 - Xiaolin Zhai, Zhengxi Hu, Dingye Yang, Shichao Wu, Jingtai Liu:
Learning Group Residual Representation for Group Activity Prediction. 300-305 - Xuesong Guo, Shuo Wang, Jiahao Chang, Zehui Chen, Feng Zhao:
SAFE: Simultaneous Alignment of Features and Predictions for Dense Object Detectors. 306-311 - Xiaohong Xiang, Fuyuan Zhang, Xin Deng, Ke Hu:
MSG-CAM: Multi-scale inputs make a better visual interpretation of CNN networks. 312-317 - Peng Yan, Guodong Long:
Personalization Disentanglement for Federated Learning. 318-323 - Yuxin Shi, Zelei Liu, Zhuan Shi, Han Yu:
Fairness-Aware Client Selection for Federated Learning. 324-329 - Xiaoli Tang, Han Yu:
Utility-Maximizing Bidding Strategy for Data Consumers in Auction-Based Federated Learning. 330-335 - Zhiwei Xiong, Han Yu, Zhiqi Shen:
Federated Learning for Personalized Image Aesthetics Assessment. 336-341 - Yue Huang, Lanju Kong, Qingzhong Li, Baochen Zhang:
Decentralized Federated Learning Via Mutual Knowledge Distillation. 342-347 - Zekai Chen, Fuyi Wang, Zhiwei Zheng, Ximeng Liu, Yujie Lin:
Fedward: Flexible Federated Backdoor Defense Framework with Non-IID Data. 348-353 - Jialing He, Zhen Qin, Hangcheng Liu, Shangwei Guo, Biwen Chen, Ning Wang, Tao Xiang:
Contrastive Fusion Representation: Mitigating Adversarial Attacks on VQA Models. 354-359 - Zhengyu Wang, Yujie Zhang, Qi Yang, Yiling Xu, Yifei Zhou, Jun Sun, Shan Liu:
Improving Point Cloud Quality Metrics with Noticeable Possibility Maps. 360-365 - Haoning Wu, Liang Liao, Jingwen Hou, Chaofeng Chen, Erli Zhang, Annan Wang, Wenxiu Sun, Qiong Yan, Weisi Lin:
Exploring Opinion-Unaware Video Quality Assessment with Semantic Affinity Criterion. 366-371 - Lirong Huang, Rong Zhang, Miaohui Wang:
Just Noticeable Difference Estimation for Screen Content Images: A Content Uncertainty-guided Approach. 372-377 - Hui Wang, Xiguang Zheng, Yong Qin:
Intermediate-Task Learning with Pretrained Model for Synthesized Speech MOS Prediction. 378-383 - Zenan Xu, Wanjun Zhong, Qinliang Su, Fuwei Zhang:
Cross-Modal-Aware Representation Learning with Syntactic Hypergraph Convolutional Network for VideoQA. 384-389 - Hui Su, Yue Ye, Wei Hua, Lechao Cheng, Mingli Song:
SASFormer: Transformers for Sparsely Annotated Semantic Segmentation. 390-395 - Wujie Sun, Defang Chen, Can Wang, Deshi Ye, Yan Feng, Chun Chen:
Holistic Weighted Distillation for Semantic Segmentation. 396-401 - Feng Jiang, Heng Gao, Shoumeng Qiu, Haiqiang Zhang, Ru Wan, Jian Pu:
Knowledge Distillation from 3D to Bird's-Eye-View for LiDAR Semantic Segmentation. 402-407 - Huazheng Hao, Hui Xiao, Li Dong, Diqun Yan, Dongtai Liang, Jiayan Zhuang, Chengbin Peng:
A Pseudo-Dual Self-Rectification Framework for Semantic Segmentation. 408-413 - Feifei Ding, Jianjun Li, Wanyong Tian:
Dual-level Consistency Learning for Unsupervised Domain Adaptive Night-time Semantic Segmentation. 420-425 - Wenrui Li, Zhengyu Ma, Liang-Jian Deng, Hengyu Man, Xiaopeng Fan:
Modality-Fusion Spiking Transformer Network for Audio-Visual Zero-Shot Learning. 426-431 - Rui Gao, Fan Wan, Daniel Organisciak, Jiyao Pu, Haoran Duan, Peng Zhang, Xingsong Hou, Yang Long:
Privacy-Enhanced Zero-Shot Learning via Data-Free Knowledge Transfer. 432-437 - Ting Guo, Jiye Liang, Guo-Sen Xie:
Swap-Reconstruction Autoencoder for Compositional Zero-Shot Learning. 438-443 - Xinmiao Dai, Chong Wang, Haohe Li, Sunqi Lin, Li Dong, Jiafei Wu, Jun Wang:
Synthetic Feature Assessment for Zero-Shot Object Detection. 444-449 - Yapeng Li, Yong Luo, Bo Du:
Audio-Visual Generalized Zero-Shot Learning Based on Variational Information Bottleneck. 450-455 - Han Jiang, Xiaoshan Yang, Chaofan Chen, Changsheng Xu:
Fine-grained Primitive Representation Learning for Compositional Zero-shot Classification. 456-461 - Jingwei Wang, Peng Zhou, Xianjun Han, Yanming Chen:
Medical Image Super-Resolution via Diagnosis-Guided Attention. 462-467 - Hong Zhang, Shenglun Chen, Zhihui Wang, Haojie Li, Wanli Ouyang:
Denser is Better: cost distribution super-resolution network for more accurate sub-pixel disparity. 468-473 - Lin Sun, Chao Yang, Bin Jiang:
DSP-Net: Diverse Structure Prior Network for Image Inpainting. 474-479 - Zekun Ai, Xiaotong Luo, Yanyun Qu:
Joint Feature Aggregation for Stereo Image Super-resolution. 480-485 - Zijian Yuan, Kan Chang, Zhiquan Liu, Xinjie Wei, Boning Chen:
Joint Super-Resolution and Classification Based on Bidirectional Mapping and Multiple Constraints. 486-491 - Qichen Wei, Zijie Zuo, Jie Nie, Jiahao Du, Yaning Diao, Min Ye, Xinyue Liang:
Inpainting of Remote Sensing Sea Surface Temperature image with Multi-scale Physical Constraints. 492-497 - Lei Chen, Huhe Dai, Yuan Zheng:
ICANet: A Lightweight Increasing Context Aided Network for Real-Time Image Semantic Segmentation. 492-497 - Zhijie Huang, Tianyi Sun, Xiaopeng Guo, Yanze Wang, Jun Sun:
Generalized Compressed Video Restoration by Multi-Scale Temporal Fusion and Hierarchical Quality Score Estimation. 498-503 - Yuan Zou, Yinyao Ma:
Edgeformer: Edge-Enhanced Transformer for High-Quality Image Deblurring. 504-509 - Yubo Huang, Jia Wang, Peipei Li, Liuyu Xiang, Peigang Li, Zhaofeng He:
Generative Iris Prior Embedded Transformer for Iris Restoration. 510-515 - Zhongbao Yang, Jinshan Pan:
MBDFNet: Multi-scale Bidirectional Dynamic Feature Fusion Network for Efficient Image Deblurring. 522-527 - Minhua Liu, Yuanman Li, Rongqin Liang, Jiaxiang You, Xia Li:
Multiple degraded image restoration via degradation history estimation. 528-533 - Jintao Zhang, Guangyi Xiao:
Gradual Migration and Style Consistency for Unsupervised Domain Adaptation. 534-539 - Han Xie, Zhifeng Shen, Shicai Yang, Weijie Chen, Luojun Lin:
Adapt then Generalize: A Simple Two-Stage Framework for Semi-Supervised Domain Generalization. 540-545 - Hongjian Song, Jie Tang, Hongzhao Xiao, Juncheng Hu:
Rethinking Overfitting of Multiple Instance Learning for Whole Slide Image Classification. 546-551 - Qiang Chen, Dong Zhang, Shoushan Li, Guodong Zhou:
A Unified MRC Framework with Multi-Query for Multi-modal Relation Triplets Extraction. 552-557 - Jiaxin Yang, Xiaofei Li, Jun Zhang, Shuohao Li:
Feature Bias Correction: A Feature Augmentation Method for Long-tailed Recognition. 558-563 - Yuling Jiang, Yingyuan Zhao, Bing-Kun Bao:
Recombination Samples Training for Robust Natural Language Visual Reasoning. 564-569 - Yansong Qu, Yuze Wang, Yue Qi:
SG-NeRF: Semantic-guided Point-based Neural Radiance Fields. 570-575 - Hai Zhou, Zhe Xue, Ying Liu, Boang Li, Junping Du, Meiyu Liang:
RTMC: A Rubost Trusted Multi-View Classification Framework. 576-581 - Xinjiao Zhou, Bin Jiang, Chao Yang, Haotian Hu, Xiaofei Huo:
DF-CLIP: Towards Disentangled and Fine-grained Image Editing from Text. 582-587 - Changshuo Wang, Lei Wu, Xu Chen, Xiang Li, Lei Meng, Xiangxu Meng:
Letter Embedding Guidance Diffusion Model for Scene Text Editing. 588-593 - Rongyu Zhang, Yun Chen, Chenrui Wu, Fangxin Wang:
Cluster-driven GNN-based Federated Recommendation with Biased Message Dropout. 594-599 - Tianyu Huai, Shuwen Yang, Junhang Zhang, Guoan Wang, Xinru Yu, Tianlong Ma, Liang He:
SQT: Debiased Visual Question Answering via Shuffling Question Types. 600-605 - Shizhuo Deng, Chuangui Yang, Zhubao Guo, Boqian Lin, Dongyue Chen, Tong Jia, Botao Wang:
Fast Personalized Human Activity Recognition on Heuristic Parameter Estimation. 606-611 - Yaolong Ju, Chunyang Xu, Yichen Guo, Jinhu Li, Simon Lui:
Improving Automatic Singing Skill Evaluation with Timbral Features, Attention, and Singing Voice Separation. 612-617 - Han Guo, Yuanlong Yu, Yujie Wang, Xuelin Chen, Yixin Zhuang:
Learning High Frequency Surface Functions In Shells. 618-623 - Eli Lei, Jia Shao, Youfa Liu, Bo Du:
Multi-template Tracker Driven by Cache Manager Algorithm, Towards Multi-distractor Scenarios. 624-629 - Aoran Liu, Kun Hu, Wenxi Yue, Qiuxia Wu, Zhiyong Wang:
Material-Aware Self-Supervised Network for Dynamic 3D Garment Simulation. 630-635 - Yulin Wu, Ruimin Hu, Xiaochen Wang:
Multi-speaker Direction of Arrival Estimation Using Audio and Visual Modalities with Convolutional Neural Network. 636-641 - Jinxin Wang, Zhongwen Guo, Chao Yang, Xiaomei Li, Ziyuan Cui:
Multi-Scale Hybrid Fusion Network for Mandarin Audio-Visual Speech Recognition. 642-647 - Tianhan Liu, Zhuang Qi, Zitan Chen, Xiangxu Meng, Lei Meng:
Cross-Training with Prototypical Distillation for improving the generalization of Federated Learning. 648-653 - Mehdi Setayesh, Vincent W. S. Wong:
A Content-based Viewport Prediction Framework for 360° Video Using Personalized Federated Learning and Fusion Techniques. 654-659 - Chenrui Wu, Zexi Li, Fangxin Wang, Chao Wu:
Learning Cautiously in Federated Learning with Noisy and Heterogeneous Clients. 660-665 - Yulan Gao, Yansong Zhao, Han Yu:
Multi-Tier Client Selection for Mobile Federated Learning Networks. 666-671 - Chengyi Yang, Zhaoxiang Hou, Sheng Guo, Hui Chen, Zengxiang Li:
SWATM: Contribution-Aware Adaptive Federated Learning Framework Based on Augmented Shapley Values. 672-677 - Yiqiang Chen, Xiaodong Yang, Yuting He, Chunyan Miao, Piu Chan:
FedDBM: Federated Digital Biomarker for Detecting Parkinson's Disease Progress. 678-683 - Haihang Ruan, Feng Wang, Tongda Xu, Zhiyong Tan, Yan Wang:
MIXLIC: Mixing Global and Local Context Model for learned Image Compression. 684-689 - Ruoke Yan, Qian Yin, Xinfeng Zhang, Siwei Ma:
Model-Driven Compression for Digital Human Using Multi-Granularity Representations. 690-695 - Hengyu Man, Xingtao Wang, Riyu Lu, Xiaopeng Fan:
Meta-ILF: In-Loop Filter with Customized Weights For VVC Intra Coding. 696-701 - Yunhui Shi, Pengquan Wang, Jin Wang, Baocai Yin, Nam Ling:
Variable-Rate Neural Image Compression with Joint Content-Channel Features and Accurate R-λ Model. 702-707 - Wenyi Wang, Yingzhan Xu, Kai Zhang, Li Zhang:
Peer Upsampled Transform Domain Prediction for G-PCC. 708-713 - Qiuyue Fang, Tao Xu, Lai Jiang, Shengxi Li, Mai Xu, Yunjin Chen, Leonid Sigal:
Optimizing DNN based quality assessment metric for image compression: A novel rate control method. 714-719 - Junhang Zhang, Zisong Zhuang, Luwei Xiao, Xingjiao Wu, Tianlong Ma, Liang He:
Dual-Expert Distillation Network for Few-Shot Segmentation. 720-725 - Linglan Zhao, Jing Lu, Zhanzhan Cheng, Duo Liu, Xiangzhong Fang:
Rethinking Self-Supervision for Few-Shot Class-Incremental Learning. 726-731 - Yongliang Su, Xu Chen, Lei Wu, Xiangxu Meng:
Learning Component-Level and Inter-Class Glyph Representation for few-shot Font Generation. 738-743 - Wenbo Xu, Huaxi Huang, Ming Cheng, Litao Yu, Qiang Wu, Jian Zhang:
Masked Cross-image Encoding for Few-shot Segmentation. 744-749 - Xueyang Zhang, Shuxian Wang, Jun Du, Genwei Yan, Jigang Tang, Tian Gao, Xin Fang, Jia Pan, Jianqing Gao:
Frame-Level Embedding Learning for Few-shot Bioacoustic Event Detection. 750-755 - Xiaojia Chen, Xuanhan Wang, Beitao Chen, Lianli Gao:
End-To-End Part-Level Action Parsing With Transformer. 756-761 - Kaixiang Yang, Junyu Gao, Yangbo Feng, Changsheng Xu:
Leveraging Attribute Knowledge for Open-set Action Recognition. 762-767 - Hailun Zhang, Ziyun Zeng, Qijun Zhao, Zhen Zhai:
ConCAP: Contrastive Context-Aware Prompt for Resource-hungry Action Recognition. 768-773 - Wentian Xin, Hongkai Lin, Ruyi Liu, Yi Liu, Qiguang Miao:
Is Really Correlation Information Represented Well in Self-Attention for Skeleton-based Action Recognition? 780-785 - Chang Li, Qian Huang, Yingchi Mao:
DD-GCN: Directed Diffusion Graph Convolutional Network for Skeleton-based Human Action Recognition. 786-791 - Shilian Wu, Yongrui Li, Zengfu Wang:
Improving CTC-based Handwritten Chinese Text Recognition with Cross-Modality Knowledge Distillation and Feature Aggregation. 792-797 - Gao-Dong Liu, Wan-Lei Zhao, Jie Zhao:
Decoupled Mutual Distillation for Incremental Object Detection. 798-803 - Wujie Sun, Defang Chen, Can Wang, Deshi Ye, Yan Feng, Chun Chen:
Accelerating Diffusion Sampling with Classifier-based Feature Distillation. 810-815 - Dongqin Liu, Wentao Li, Wei Zhou, Zhaoxing Li, Jiao Dai, Jizhong Han, Ruixuan Li, Songlin Hu:
Semantic Stage-Wise Learning for Knowledge Distillation. 816-821 - Hao Zhang, Yanxu Hu, Jiawen Peng, Andy J. Ma:
Discriminative Gradient Adjustment with Coupled Knowledge Distillation for Class Incremental Learning. 822-827 - Xiaowen Ma, Rui Che, Tingfeng Hong, Mengting Ma, Ziyan Zhao, Tian Feng, Wei Zhang:
SACANet: scene-aware class attention network for semantic segmentation of remote sensing images. 828-833 - Hongyu Gu, Yunzhi Zhuge, Lu Zhang, Jinqing Qi, Huchuan Lu:
Few-shot Semantic Segmentation by Exploiting Dynamic and Regional Contexts. 834-839 - Guoxing Yang, Feifei Fu, Nanyi Fei, Haoran Wu, Ruitao Ma, Zhiwu Lu:
DiST-GAN: Distillation-based Semantic Transfer for Text-Guided Face Generation. 840-845 - Guoying Sun, Meng Yang:
Self-Attention Prediction Correction with Channel Suppression for Weakly-Supervised Semantic Segmentation. 846-851 - Rui Chen, Tao Chen, Qiong Wang, Yazhou Yao:
Semi-Supervised Semantic Segmentation With Region Relevance. 852-857 - Jiahao Guo, Chao Liang, Zhongyuan Wang:
Who, What and Where: Composite-semantic Instance Search for Story Videos. 858-863 - Yan Wang, Yu-Ting Su, Wenhui Li, Chenggang Yan, Bolun Zheng, Xuanya Li, An-An Liu:
Semantic Embedding Uncertainty Learning for Image and Text Matching. 864-869 - Yifan Shang, Xiucai Ye, Tetsuya Sakurai:
Multi-view Network Embedding with Structure and Semantic Contrastive Learning. 870-875 - Guoqing Yang, Chuang Zhu, Yu Zhang:
A Self-Training Framework Based on Multi-Scale Attention Fusion for Weakly Supervised Semantic Segmentation. 876-881 - Yuxin Jin, Ming Qian, Jincheng Xiong, Nan Xue, Gui-Song Xia:
Depth and DOF Cues Make A Better Defocus Blur Detector. 882-887 - Jialong Zhang, Lijun Zhao, Jinjing Zhang, Ke Wang, Anhong Wang:
Explainable Unfolding Network For Joint Edge-Preserving Depth Map Super-Resolution. 888-893 - Xianhe Jiao, Junli Zhao, Chenlei Lv, Fuqing Duan, Zhenkuan Pan, Xin Li:
Robust 3D Craniofacial Landmarks Localization by An End-to-End Regression Network. 900-905 - Xueyang Li, Minyang Xu, Xiangdong Zhou:
Twins-Mix: Self Mixing in Latent Space for Reasonable Data Augmentation of 3D Computer-Aided Design Generative Modeling. 906-911 - Shaoxu Li, Ye Pan:
Rendering and Reconstruction Based 3D Portrait Stylization. 912-917 - Zhenjiang Du, Yi Lu, Guan Wang, Ning Xie, Yang Yang:
GT-Net: Variational Autoencoder Networks based on Graph Transformer for 3D Shape Learning. 918-923 - Jing Hu, Xincheng Wang, Ziheng Liao, Tingsong Xiao:
M-GCN: Multi-scale Graph Convolutional Network for 3D Point Cloud Classification. 924-929 - Boyang Zhang, Suping Wu, Leyang Yang, Bin Wang, Wenlong Lu:
A Lightweight Grouped Low-rank Tensor Approximation Network for 3D Mesh Reconstruction From Videos. 930-935 - Xin Zou, Chang Tang, Wei Zhang, Kun Sun, Liangxiao Jiang:
Hierarchical Attention Learning for Multimodal Classification. 936-941 - Zeman Shao, Gautham Vinod, Jiangpeng He, Fengqing Zhu:
An End-to-End Food Portion Estimation Framework Based on Shape Reconstruction from Monocular Image. 942-947 - Yuxiang An, Dongnan Liu, Weidong Cai:
Unsupervised Domain Adaptation for Neuron Membrane Segmentation based on Structural Features. 948-953 - Nan Wang, Chengwei Chen, Lizhuang Ma, Shaohui Lin:
Latent Feature Regularization based Adversarial Network for Brain Tumor Anomaly Detection. 954-959 - Zhenda Xu, Jiahao Hu, Qiang Gao, Donghua Hang, Qihua Zhou, Song Guo, Aiqian Gan:
Development of Deep Learning Algorithms for Automated Scoliosis and Abnormal Posture Screening Using 2D Back Image. 960-965 - Yu Tang, Gang Yang, Jianchun Zhao, Dayong Ding, Jun Wu:
LACL: Lesion-Aware Contrastive Learning Framework for Medical Image Classification. 966-971 - Yinan Mao, Bowei He, Shiji Zhou, Chen Ma, Zhi Wang:
Collaborative Edge Caching: a Meta Reinforcement Learning Approach with Edge Sampling. 972-977 - Feng Peng, Bingcong Lu, Li Song, Rong Xie, Yanmei Liu, Ying Chen:
PACC: Perception Aware Congestion Control for Real-time Communication. 978-983 - Xueting Jiang, Xin Liu, Yiu-Ming Cheung, Xing Xu, Shu-Kai Zheng, Taihao Li:
Label-Semantic-Enhanced Online Hashing for Efficient Cross-modal Retrieval. 984-989 - Cheng Zhan, Huan Yan, Han Hu, Liyue Zhu, Shubin Xu:
QoE Maximization for Aerial Video Streaming with Multiple Cellular Connected UAVs. 990-995 - Dieli Hu, Wen Ji, Zhi Wang:
Multi-stream Adaptive Offloading of Joint Compressed Video Streams, Feature Streams, and Semantic Streams in Edge Computing Systems. 996-1001 - Jangwoo Son, Yago Sanchez, Christian Hampe, Dominik Schnieders, Thomas Schierl, Cornelius Hellge:
L4S Congestion Control Algorithm for Interactive Low Latency Applications over 5G. 1002-1007 - Hao Ren, Wu Ran, Xingson Liu, Haoran Ren, Hong Lu, Rui Zhang, Cheng Jin:
Weakly-supervised Temporal Action Localization with Adaptive Clustering and Refining Network. 1008-1013 - Dazhao Du, Bing Su, Yu Li, Zhongang Qi, Lingyu Si, Ying Shan:
Do We Really Need Temporal Convolutions in Action Segmentation? 1014-1019 - Guo Chen, Yin-Dong Zheng, Zhe Chen, Jiahao Wang, Tong Lu:
ELAN: Enhancing Temporal Action Detection with Location Awareness. 1020-1025 - Yin-Dong Zheng, Guo Chen, Minglei Yuan, Tong Lu:
MRSN: Multi-Relation Support Network for Video Action Detection. 1026-1031 - Qinying Liu, Zilei Wang, Ruoxi Chen, Zhilin Li:
Unleashing the Potential of Adjacent Snippets for Weakly-supervised Temporal Action Localization. 1032-1037 - Zikun Zhuang, Ruihao Qian, Chi Xie, Shuang Liang:
Compositional Learning in Transformer-Based Human-Object Interaction Detection. 1038-1043 - Junkai Yan, Lingxiao Yang, Yipeng Gao, Wei-Shi Zheng:
Self-supervised Cross-stage Regional Contrastive Learning for Object Detection. 1044-1049 - Bingchao Wu, Yangyuxuan Kang, Daoguang Zan, Bei Guan, Yongji Wang:
Hierarchical and Contrastive Representation Learning for Knowledge-Aware Recommendation. 1050-1055 - Qingzhong Chen, Shilun Cai, Crystal Cai, Zefang Yu, Dahong Qian, Suncheng Xiang:
Colo-SCRL: Self-Supervised Contrastive Representation Learning for Colonoscopic Video Retrieval. 1056-1061 - Wenye Lin, Yifeng Ding, Zhixiong Cao, Hai-Tao Zheng:
Establishing a Stronger Baseline for Lightweight Contrastive Models. 1062-1067 - Jinyong Wen, Yuhu Wang, Chunxia Zhang, Shiming Xiang, Chunhong Pan:
Graph Information Interaction on Feature and Structure via Cross-modal Contrastive Learning. 1068-1073 - Yidan Fan, Wenhuan Lu, Yahong Han:
Discriminative and Contrastive Consistency for Semi-supervised Domain Adaptive Image Classification. 1074-1079 - Feng Liu, Deyi Tuo, Yinan Xu, Xintong Han:
CoverHunter: Cover Song Identification with Refined Attention and Alignments. 1080-1085 - Iacopo Ghinassi, Matthew Purver, Huy Phan, Chris Newell:
Exploring Pre-Trained Neural Audio Representations for Audio Topic Segmentation. 1086-1091 - Xun Zhou, Wujin Sun, Xiaodong Shi:
A High-Quality Melody-Aware Peking Opera Synthesizer Using Data Augmentation. 1092-1097 - Xinlu Liu, Jiale Qian, Qiqi He, Yi Yu, Wei Li:
LC-Beating: An Online System for Beat and Downbeat Tracking using Latency-Controlled Mechanism. 1098-1103 - Honglin Mu, Wentian Xia, Wanxiang Che:
Improving Domain Generalization for Sound Classification with Sparse Frequency-Regularized Transformer. 1104-1108 - Yulun Wu, Jiahao Zhao, Yi Yu, Wei Li:
MFAE: Masked frame-level autoencoder with hybrid-supervision for low-resource music transcription. 1109-1114 - Hongji Yang, Jiao Liu, Shao-Ping Lu, Bo Ren:
Self-Supervised Implicit 3D Reconstruction via RGB-D Scans. 1115-1120 - Yang Wu, Lingyan Liang, Yaqian Zhao, Kaihua Zhang:
Object-Aware Calibrated Depth-Guided Transformer for RGB-D Co-Salient Object Detection. 1121-1126 - Yufan Deng, Xin Deng, Mai Xu:
A Two-stage hybrid CNN-Transformer Network for RGB Guided Indoor Depth Completion. 1127-1132 - Peiyuan Zhi, Kaiyue Zhou, Yali Li, Shengjin Wang:
Feature Decoupling and Uncertainty Estimation for 3D Object Detection. 1133-1138 - Lianggangxu Chen, Jiale Lu, Changbo Wang, Gaoqi He:
Scene Graph Generation using Depth-based Multimodal Network. 1139-1144 - Linlong Fan, Yanqi Ge, Wen Li, Lixin Duan:
Multi-View Token Clustering and Fusion for 3D Object Recognition and Retrieval. 1145-1150 - Gang Wang, Yufei Chen:
Local Consensus Transformer for Correspondence Learning. 1151-1156 - Bowen Zheng, Da-Wei Zhou, Han-Jia Ye, De-Chuan Zhan:
Preserving Locality in Vision Transformers for Class Incremental Learning. 1157-1162 - Ruichao Hou, Boyue Xu, Tongwei Ren, Gangshan Wu:
MTNet: Learning Modality-aware Representation with Transformer for RGBT Tracking. 1163-1168 - Zixuan Su, Jingjing Chen, Lei Pang, Chong-Wah Ngo, Yu-Gang Jiang:
Adaptive Split-Fusion Transformer. 1169-1174 - Yijun Long, Zhaoyu Chen, Hong Lu, Wenqiang Zhang:
GSFormer: Geometric-Spatial Transformer on Point Cloud Completion. 1175-1180 - Chaohao Wen, Xun Gong:
SDGFormer: An Efficient Convolution Network Structurally Similar to Transformer. 1181-1186 - Huaming Wang, Jianwei Fei, Yunshu Dai, Lingyun Leng, Zhihua Xia:
General GAN-generated Image Detection by Data Augmentation in Fingerprint Domain. 1187-1192 - Qichao Ying, Hang Zhou, Xiaoxiao Hu, Zhenxing Qian, Sheng Li, Xinpeng Zhang:
Image Protection for Robust Cropping Localization and Recovery. 1193-1198 - Pei-Kai Huang, Jun-Xiong Chong, Hui-Yu Ni, Tzu-Hsien Chen, Chiou-Ting Hsu:
Towards Diverse Liveness Feature Representation and Domain Expansion for Cross-Domain Face Anti-Spoofing. 1199-1204 - Xin Dong, Tao Wang, Zhendong Li, Hao Liu:
Joint Statistical and Causal Feature Modulated Face Anti-Spoofing. 1205-1210 - Yuwei Zeng, Jingxuan Tan, Zhengxin You, Zhenxing Qian, Xinpeng Zhang:
Watermarks for Generative Adversarial Network Based on Steganographic Invisible Backdoor. 1211-1216 - Yan Fang, Zhongyuan Wang, Jikang Cheng, Ruoxi Wang, Chao Liang:
Promoting adversarial transferability with enhanced loss flatness. 1217-1222 - Yuezun Li, Jiaran Zhou, Siwei Lyu:
Face Poison: Obstructing DeepFakes by Disrupting Face Detection. 1223-1228 - Wen Liu, Degang Sun, Yan Wang, Zhongyuan Chen, Xinbo Han, Haitian Yang:
ABTD-Net: Autonomous Baggage Threat Detection Networks for X-ray Images. 1229-1234 - Zhi Zeng, Mingmin Wu, Guodong Li, Xiang Li, Zhongqiang Huang, Ying Sha:
An Explainable Multi-view Semantic Fusion Model for Multimodal Fake News Detection. 1235-1240 - Hao Li, Xiangyang Luo, Yi Zhang:
Improving CoatNet for Spatial and JPEG Domain Steganalysis. 1241-1246 - Shuai Hao, Jialin Yang, Xu Jia, You He, Huchuan Lu:
Image Super-Resolution with Implicit Texture Pattern Modulation. 1247-1252 - Feihong Qin, Liyan Zhang:
Towards Efficient Large Mask Inpainting via Knowledge Transfer. 1253-1258 - Shuyi Qu, Zhenxing Niu, Jianke Zhu, Bin Dong, Kaizhu Huang:
Structure First Detail Next: Image Inpainting with Pyramid Generator. 1265-1270 - Deyang Liu, Yifan Mao, Xiaofei Zhou, Ping An, Yuming Fang:
Learning a Multilevel Cooperative View Reconstruction Network for Light Field Angular Super-Resolution. 1271-1276 - Jiancong Feng, Yuan-Gen Wang, Fengchuang Xing:
NLCUnet: Single-Image Super-Resolution Network with Hairline Details. 1277-1282 - Xin Jin, Wu Zhou, Jinyu Wang, Duo Xu, Yiqing Rong, Shuai Cui:
An Order-Complexity Model for Aesthetic Quality Assessment of Symbolic Homophony Music Scores. 1289-1294 - Zehong Zhou, Fei Zhou, Guoping Qiu:
Collaborative Auto-encoding for Blind Image Quality Assessment. 1295-1300 - Jiaming Xie, Yu Luo, Jie Ling, Guanghui Yue:
No Reference Image Quality Assessment Via Quality Difference Learning. 1301-1306 - Yi Huang, Xiaoguang Tu, Gui Fu, Tingting Liu, Bokai Liu, Ming Yang, Ziliang Feng:
Low-Light Image Enhancement by Learning Contrastive Representations in Spatial and Frequency Domains. 1307-1312 - Lanxin Zhao, Dengshi Li, Jing Xiao, Chenyi Zhu:
Noise adaptive speech intelligibility enhancement based on improved StarGAN. 1313-1318 - Bo Li, Lin Yuanbo Wu, Deyin Liu, Hongyang Chen, Yuanxin Ye, Xianghua Xie:
Image Template Matching via Dense and Consistent Contrastive Learning. 1319-1324 - Andreas Sochopoulos, Ioannis Mademlis, Evangelos Charalampakis, Sotirios Papadopoulos, Ioannis Pitas:
Deep Reinforcement Learning with semi-expert distillation for autonomous UAV cinematography. 1325-1330 - Xucheng Wang, Xiangyang Yang, Hengzhou Ye, Shuiwang Li:
Learning Disentangled Representation with Mutual Information Maximization for Real-Time UAV Tracking. 1331-1336 - Pan Mu, Jing Fang, Haotian Qian, Cong Bai:
Transmission and Color-guided Network for Underwater Image Enhancement. 1337-1342 - Dan Zeng, Mingliang Zou, Xucheng Wang, Shuiwang Li:
Towards Discriminative Representations with Contrastive Instances for Real-Time UAV Tracking. 1349-1354 - Rizwan Khan, Atif Mehmood, Saeed Akbar, Zhonglong Zheng:
Underwater Image Enhancement with an Adaptive Self Supervised Network. 1355-1360 - Cong Liang, Shangfei Wang, Xiaoping Chen:
Privacy-Protected Facial Expression Recognition Augmented by High-Resolution Facial Images. 1361-1366 - Feipeng Ma, Yueyi Zhang, Xiaoyan Sun:
Multimodal Sentiment Analysis with Preferential Fusion and Distance-aware Contrastive Learning. 1367-1372 - Wenxiu Geng, Yulong Bian, Xiangxian Li:
A Multi-View Co-Learning Method for Multimodal Sentiment Analysis. 1373-1378 - Zenan Xu, Qinliang Su, Junxi Xiao:
Multimodal Aspect-Based Sentiment Classification with Knowledge-Injected Transformer. 1379-1384 - Chuang Chen, Xiao Sun:
STA-GCN: Spatial Temporal Adaptive Graph Convolutional Network for Gait Emotion Recognition. 1385-1390 - Yiming Zhang, Hao Wang, Yifan Xu, Xinglong Mao, Tong Xu, Sirui Zhao, Enhong Chen:
Adaptive Graph Attention Network with Temporal Fusion for Micro-Expressions Recognition. 1391-1396 - Haoyu Zhou, Wei Hu, Ying Li, Chu He, Xi Chen:
Deep Homography Estimation With Feature Correlation Transformer. 1397-1402 - Zepeng Huang, Qi Wan, Junliang Chen, Xiaodong Zhao, Kai Ye, Linlin Shen:
ADATS: Adaptive RoI-Align based Transformer for End-to-End Text Spotting. 1403-1408 - Zao Zhang, Dong Yuan, Yu Zhang, Wei Bao:
Trajectory Alignment based Multi-Scaled Temporal Attention for Efficient Video Transformer. 1409-1414 - Qunchao Jin, Hongyu Hou, Guixu Zhang, Haoan Wang, Zhi Li:
Swin-ASNet: An Adaptive RGB-selection Network with Swin Transformer for Retinal Vessel Segmentation. 1415-1420 - Xin Yang, Hengliang Zhu, Guojun Mao, Shuli Xing:
OAFormer: Occlusion Aware Transformer for Camouflaged Object Detection. 1421-1426 - Zhuojun Zou, Xuexin Liu, Yuanpei Zhang, Lin Shu, Jie Hao:
Know Who You Are: Learning Target-Aware Transformer for Object Tracking. 1427-1432 - Wei Lu, Yang Jiang, Peiguang Jing, Jinghui Chu, Fugui Fan:
A Novel Channel Pruning Approach based on Local Attention and Global Ranking for CNN Model Compression. 1433-1438 - Yiding Liu, Yinglei Teng, Tao Niu:
Splittable Pattern-Specific Weight Pruning for Deep Neural Networks. 1439-1444 - Minyu Sun, Bin Jiang, Chao Yang:
Dynamic Dense-Sparse Representations for Real-Time Question Answering. 1445-1446 - Da Shi, Jingsheng Gao, Ting Liu, Yuzhuo Fu:
DynaSlim: Dynamic Slimming for Vision Transformers. 1451-1456 - Kai Feng, Zhuo Chen, Fei Gao, Zhe Wang, Long Xu, Weisi Lin:
Post-Training Quantization for Vision Transformer in Transformed Domain. 1457-1462 - Chaoran Chen, Mai Xu, Shengxi Li, Tie Liu, Minglang Qiao, Zhuoyi Lv:
Residual based hierarchical feature compression for multi-task machine vision. 1463-1468 - Shangchao Su, Bin Li, Chengzhi Zhang, Mingzhao Yang, Xiangyang Xue:
Cross-domain Federated Object Detection. 1469-1474 - Mei Ma, Ling Lin, Heng Wang, Zhendong Li, Hao Liu:
Cross-Modality Fourier Feature for Medical Image Synthesis. 1475-1480 - Ziwei Wang, Reza Arablouei, Jiajun Liu, Paulo Borges, Greg Bishop-Hurley, Nicholas Heaney:
Point-Syn2Real: Semi-Supervised Synthetic-to-Real Cross-Domain Learning for Object Classification in 3D Point Clouds. 1481-1486 - Zezhong Lv, Bing Su:
Temporal-enhanced Cross-modality Fusion Network for Video Sentence Grounding. 1487-1492 - Sujuan Hou, Xingzhuo Li, Weiqing Min, Jiacheng Li, Jing Wang, Yuanjie Zheng, Shuqiang Jiang:
A Cross-direction Task Decoupling Network for Small Logo Detection. 1493-1498 - Wen Wang, Ling Zhong, Guang Gao, Minhong Wan, Jason Gu:
CHAN: Cross-Modal Hybrid Attention Network for Temporal Language Grounding in Videos. 1499-1504 - Zihan Fang, Shide Du, Yaqing Chen, Shiping Wang:
DMRL-Net: Differentiable Multi-view Representation Learning Network. 1505-1510 - Jueqi Wei, Yuanwu Xu, Mohan Chen, Yuejie Zhang, Rui Feng, Shang Gao:
Conditional Video-Text Reconstruction Network with Cauchy Mask for Weakly Supervised Temporal Sentence Grounding. 1511-1516 - Yuzhong Zhao, Weijia Wu, Zhuang Li, Jiahong Li, Weiqiang Wang:
FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation. 1517-1522 - Hongzhou Wu, Yifan Lyu, Xingyu Shen, Xuechen Zhao, Mengzhu Wang, Xiang Zhang, Zhigang Luo:
Atomic-action-based Contrastive Network for Weakly Supervised Temporal Language Grounding. 1523-1528 - Xiaoqian Liu, Xiuyun Li, Yuan Cao, Fan Zhang, Xiongnan Jin, Jinpeng Chen:
Mandari: Multi-Modal Temporal Knowledge Graph-aware Sub-graph Embedding for Next-POI Recommendation. 1529-1534 - Qin Chao, Eunsoo Kim, Boyang Li:
Movie Box Office Prediction With Self-Supervised and Visually Grounded Pretraining. 1535-1540 - Wenbin Zou, Guoguang Hua, Guangxu Chen, Zaiyue He, Guangli Liu, Pengfei Chen, Yuyang Li, Huakun Li, Lei Zheng, Shishun Tian:
Need a dog for seeing eye? A Walk Viewpoint Dataset for Freespace Detection in Unstructured Environments. 1541-1546 - Linfan Zha, Yanming Chen, Peng Zhou, Yiwen Zhang:
Intensifying The Consistency of Pseudo Label Refinement for Unsupervised Domain Adaptation Person Re-Identification. 1547-1552 - Zihao Bu, Xiaoxiao Wang, Chengjian Qiu, Zhixuan Wang, Kai Han, Xiuhong Shan, Zhe Liu:
Noisy-to-Clean Label Learning for Medical Image Segmentation. 1553-1558 - Wenhao Hu, Yingying Liu, Jiazhen Xu, Xuanyu Chen, Gaoang Wang:
Learning Discrimination from Contaminated Data: Multi-Instance Learning for Unsupervised Anomaly Detection. 1559-1564 - Bin Zheng, Miaohui Wang:
Rethinking Video Error Concealment: A Benchmark Dataset. 1565-1570 - Zemian Guo, Yingying Zhu:
Visual Place Recognition Datasets for Indoor Spaces. 1571-1576 - Dan You, Pengcheng Xia, Qiuzhu Chen, Minghui Wu, Suncheng Xiang, Jun Wang:
AutoKary2022: A Large-Scale Densely Annotated Dataset for Chromosome Instance Segmentation. 1577-1582 - Salman Siddique Khan, Vivek Boominathan, Ashok Veeraraghavan, Kaushik Mitra:
Designing Optics and Algorithm for Ultra-Thin, High-Speed Lensless Cameras. 1583-1588 - Yangke Ying, Jin Wang, Yunhui Shi, Baocai Yin:
Dual-Domain Feature Learning and Memory-Enhanced Unfolding Network for Spectral Compressive Imaging. 1589-1594 - Shumian Yang, Xinxin Xiang, Fenghua Tong, Dawei Zhao, Xin Li:
Image Compressed Sensing Using Multi-Scale Characteristic Residual Learning. 1595-1600 - Pinjun Luo, Guoqiang Xiao, Xinbo Gao, Song Wu:
LKD-Net: Large Kernel Convolution Network for Single Image Dehazing. 1601-1606 - Haoran Huang, Yuhui Quan, Zhenghua Lei, Jinlong Hu, Yan Huang:
Video Noise Removal Using Progressive Decomposition With Conditional Invertibility. 1607-1612 - Shaokai Liu, Hao Feng, Wengang Zhou, Houqiang Li, Cong Liu, Feng Wu:
DocMAE: Document Image Rectification via Self-supervised Representation Learning. 1613-1618 - He Zhu, Yang Chen, Guyue Hu, Shan Yu:
Information-density Masking Strategy for Masked Image Modeling. 1619-1624 - Zheyuan Liu, Pan Mu, Hanning Xu, Cong Bai:
Histogram-guided Video Colorization Structure with Spatial-Temporal Connection. 1625-1630 - Xinye Yang, Dongbao Yang, Yu Zhou, Youhui Guo, Weiping Wang:
Mask-Guided Stamp Erasure for Real Document Image. 1631-1636 - Yu Cao, Hao Tian, P. Y. Mok:
Attention-Aware Anime Line Drawing Colorization. 1637-1642 - Xinghui Li, Yikang Ding, Jia Guo, Xiansong Lai, Shihao Ren, Wensen Feng, Long Zeng:
Edge-aware Neural Implicit Surface Reconstruction. 1643-1648 - Yinhe Lin, Fei Chen, Hang Cheng, Meiqing Wang:
Handwriting Curve Interpolation Using Gradient Graph Laplacian Regularizer. 1649-1654 - Vibhoothi, François Pitié, Angeliki Katsenou, Yeping Su, Balu Adsumilli, Anil C. Kokaram:
Comparison of HDR quality metrics in Per-Clip Lagrangian multiplier optimisation with AV1. 1655-1660 - Chunyi Li, May Lim, Abdelhak Bentaleb, Roger Zimmermann:
A Real-Time Blind Quality-of-Experience Assessment Metric for HTTP Adaptive Streaming. 1661-1666 - Andréas Pastor, Patrick Le Callet:
Towards Guidelines for Subjective Haptic Quality Assessment: A Case Study on Quality Assessment of Compressed Haptic Signals. 1667-1672 - Vignesh V. Menon, Jingwen Zhu, Prajit T. Rajendran, Hadi Amirpour, Patrick Le Callet, Christian Timmerer:
Just Noticeable Difference-Aware Per-Scene Bitrate-Laddering for Adaptive Video Streaming. 1673-1678 - Hadi Amirpour, Vignesh V. Menon, Samira Afzal, Radu Prodan, Christian Timmerer:
Optimizing Video Streaming for Sustainability and Quality: The Role of Preset Selection in Per-Title Encoding. 1679-1684 - Zicheng Zhang, Hao Chen, Xun Cao, Zhan Ma:
Anableps: Adapting Bitrate for Real-Time Communication Using VBR-encoded Video. 1685-1690 - Xintao Zhao, Shuai Wang, Yang Chao, Zhiyong Wu, Helen Meng:
Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation-based Voice Conversion. 1691-1696 - Hegen Yan, Zhihua Lu:
A Disentangled Recurrent Variational Autoencoder for Speech Enhancement. 1697-1702 - Sipan Li, Songxiang Liu, Luwen Zhang, Xiang Li, Yanyao Bian, Chao Weng, Zhiyong Wu, Helen Meng:
SnakeGAN: A Universal Vocoder Leveraging DDSP Prior Knowledge and Periodic Inductive Bias. 1703-1708 - Zhibin Qiu, Yachao Guo, Mengfan Fu, Hao Huang, Ying Hu, Liang He, Fuchun Sun:
CRA-DIFFUSE: Improved Cross-Domain Speech Enhancement Based on Diffusion Model with T-F Domain Pre-Denoising. 1709-1714 - Ying Hu, Shijing Hou, Huamin Yang, Hao Huang, Liang He:
A Joint Network Based on Interactive Attention for Speech Emotion Recognition. 1715-1720 - Fangjing Niu, Tengfei Cao, Ying Hu, Hao Huang, Liang He:
Speech Topic Classification Based on Pre-trained and Graph Networks. 1721-1726 - Zhuoming Dong, Huajun Zhou, Jianhuang Lai:
Unsupervised 3D Face Reconstruction with Reprogramming Skip Connections. 1727-1732 - Pengfei Hu, Yingfan Tao, Qiqi Bao, Guijin Wang, Wenming Yang:
EvenFace: Deep Face Recognition with Uniform Distribution of Identities. 1733-1738 - Xiaomeng Fu, Xi Wang, Jin Liu, Jiao Dai, Jizhong Han:
Large Pose Friendly Face Reenactment using subtle motions. 1739-1744 - Wei Xu, Kangkang Wang, Ziliang Chen, Bin He, Bi Li, Haocheng Feng, Gang Zhang, Jingtuo Liu, Junyu Han, Errui Ding:
MSAbox: A spatially stable face detector. 1745-1750 - Xianliang Huang, Yining Lang, Ying Guo, Yuan He, Hui Xue, Li Zhao, Shuigeng Zhou:
DR-Net: A Multi-View Face Synthesis Network Driven by Dual Representation. 1751-1756 - Weichen Zhang, Xiang Zhou, YuKang Cao, WenSen Feng, Chun Yuan:
MA-NeRF: Motion-Assisted Neural Radiance Fields for Face Synthesis from Sparse Images. 1757-1762 - Yaoru Luo, Ge Yang:
Enhancing Robustness of Deep Networks Against Noisy Labels Based on A Two-Phase Formulation of Their Learning Behavior. 1763-1768 - Yadang Chen, Dingwei Zhang, Zhi-Xin Yang, Enhua Wu:
Robust and Efficient Memory Network for Video Object Segmentation. 1769-1774 - Hao Yang, Min Wang, Zhengfei Yu, Yun Zhou:
Weight-based Regularization for Improving Robustness in Image Classification. 1775-1780 - Wenyi Feng, Wei Guo, Ting Xiao, Zhe Wang:
Robust Structured Sparse Subspace Clustering with Neighborhood Preserving Projection. 1781-1786 - Jiawei Lin, Shuoyao Wang:
Improving robustness of learning-based adaptive video streaming in wildly fluctuating networks. 1787-1792 - Dong Xi, Wengang Zhou, Houqiang Li:
Robust Person Re-Identification with Wireless Signals. 1793-1798 - Tao Hong, Ya Wang, Xingwu Sun, Fengzong Lian, Zhanhui Kang, Jinwen Ma:
GradSalMix: Gradient Saliency-Based Mix for Image Data Augmentation. 1799-1804 - Hui Zhu, Yongchun Lü, Qin Ma, Xunyi Zhou, Fen Xia, Guoqing Zhao, Ning Jiang, Xiaofang Zhao:
Get a Head Start: Targeted Labeling at Source with Limited Annotation Overhead for Semi-Supervised Learning. 1805-1810 - Yan Hu, Xiaozhao Fang, Weijun Lv, Peipei Kang:
Partial multi-label learning: exploration of binary ground-truth labels. 1811-1816 - Shiya Luo, Defang Chen, Can Wang:
Customizing Synthetic Data for Data-Free Student Learning. 1817-1822 - Zhen Liang, Changyuan Zhao, Wanwei Liu, Bai Xue, Wenjing Yang:
A Geometrical Characterization on Feature Density of Image Datasets. 1823-1828 - Gang Li, Qifei Zhang, Peizheng Wang, Jie Zhang, Chao Wu:
Federated Domain Adaptation via Pseudo-label Refinement. 1829-1834 - Xinchen Gao, Yawei Li, Wen Li, Lixin Duan, Luc Van Gool, Luca Benini, Michele Magno:
Learning continuous piecewise non-linear activation functions for deep neural networks. 1835-1840 - Qiaoqiao Wei, Hui Zhang, Jun-Hai Yong:
Discriminative Spatiotemporal Alignment for Self-Supervised Video Correspondence Learning. 1841-1846 - Jia Chen, Haidongqing Yuan, Fei Fang, Tao Peng, Xinrong Hu:
Unsupervised Fashion Style Learning by Solving Fashion Jigsaw Puzzles. 1847-1852 - Selen Pehlivan, Jorma Laaksonen:
Anchor-Free Action Proposal Network with Uncertainty Estimation. 1853-1858 - Shalayiding Sirejiding, Yuxiang Lu, Hongtao Lu, Yue Ding:
Scale-Aware Task Message Transferring for Multi-Task Learning. 1859-1864 - Yuhu Wang, Shiming Xiang, Chunhong Pan:
Improving the Homophily of Heterophilic Graphs for Semi-Supervised Node Classification. 1865-1870 - Kai Leng, Cong Yang, Wei Sui, Jie Liu, Zhijun Li:
Sitpose: A Siamese Convolutional Transformer for Relative Camera Pose Estimation. 1871-1876 - Xiaocong Wang, Chaoyue Wu, Haiyang Yu, Bin Li, Xiangyang Xue:
TextFormer: Component-aware Text Segmentation with Transformer. 1877-1882 - Hui Lu, Ronald Poppe, Albert Ali Salah:
SCFormer: Integrating hybrid Features in Vision Transformers. 1883-1888 - Tianyu Song, Pengpeng Li, Guiyue Jin, Jiyu Jin, Shumin Fan, Xiang Chen:
Image Deraining Transformer with Sparsity and Frequency Guidance. 1889-1894 - Beiying Yang, Guibo Zhu, Guojing Ge, Jinzhao Luo, Jinqiao Wang:
ShiftFormer: Spatial-Temporal Shift Operation in Video Transformer. 1895-1900 - Tianxiang Chen, Qi Chu, Zhentao Tan, Bin Liu, Nenghai Yu:
ABMNet: Coupling Transformer with CNN Based on Adams-Bashforth-Moulton Method for Infrared Small Target Detection. 1901-1906 - Yue He, Yufan Wang, Linlong He, Guangyao Pan, He Ma:
ART: An Efficient Transformer with Atrous Residual Learning for Medical Images. 1907-1912 - Shiao Xie, Huimin Huang, Ziwei Niu, Lanfen Lin, Yen-Wei Chen:
MedFCT: A Frequency Domain Joint CNN-Transformer Network for Semi-supervised Medical Image Segmentation. 1913-1918 - Jia Chen, Zhenpeng Fu, Fei Fang, Mingfu Xiong, Xinrong Hu, Tao Peng:
Cross-cycle Transformer-based Stitching Method for Low-resolution Borehole Images. 1919-1924 - Jiquan Peng, Chaozhuo Li, Yi Zhao, Yuting Lin, Xiaohan Fang, Jibing Gong:
Improving Vision Transformers with Nested Multi-head Attentions. 1925-1930 - Yuzhang Hu, Minghao Liu, Wenhan Yang, Jiaying Liu, Zongming Guo:
Collaborative Spatial-Temporal Distillation for Efficient Video Deraining. 1937-1942 - Hailin Zhang, Defang Chen, Can Wang:
Adaptive Multi-Teacher Knowledge Distillation with Meta-Learning. 1943-1948 - Defang Cai, Pan Mu, Sixian Chan, Zhanpeng Shao, Cong Bai:
Towards General and Fast Video Derain via Knowledge Distillation. 1949-1954 - Jian Zhu, Xiaohu Ruan, Yongli Cheng, Zhangmin Huang, Yu Cui, Lingfang Zeng:
Deep Metric Multi-View Hashing for Multimedia Retrieval. 1955-1960 - Jinyu Li, Fuwei Zhang, Shujin Lin, Fan Zhou, Ruomei Wang:
MIM: Lightweight Multi-Modal Interaction Model for Joint Video Moment Retrieval and Highlight Detection. 1961-1966 - Xu Zhang, Xinzheng Niu, Philippe Fournier-Viger, Xudong Dai:
Image-text Retrieval via Preserving Main Semantics of Vision. 1967-1972 - Xun Jiang, Zhiguo Chen, Xing Xu, Fumin Shen, Zuo Cao, Xunliang Cai:
Progressive Event Alignment Network for Partial Relevant Video Retrieval. 1973-1978 - Mengyu Yang, Di Wu, Zelong Wang, Miao Hu, Yipeng Zhou:
Understanding and Improving Perceptual Quality of Volumetric Video Streaming. 1979-1984 - Lei Wei, Shuai Wan, Xiaobin Ding, FuZheng Yang, Zhecheng Wang:
Adaptive Geometry Reconstruction for Geometry-based Point Cloud Compression. 1985-1990 - Chen Chen, Hui Yuan, Hao Liu, Junhui Hou, Raouf Hamzaoui:
CAS-Net: Cascade Attention-Based Sampling Neural Network for Point Cloud Simplification. 1991-1996 - Lei Liu, Zhihao Hu, Jing Zhang:
PCHM-Net: A New Point Cloud Compression Framework for Both Human Vision and Machine Vision. 1997-2002 - Rui Song, Chunyang Fu, Shan Liu, Ge Li:
Large-Scale Spatio-Temporal Attention Based Entropy Model for Point Cloud Compression. 2003-2008 - Haipeng Zhang, Jie Zhang, Weimiao Feng, Kaigui Bian, Hu Tuo:
Edge-FVV: Free Viewpoint Video Streaming by Learning at the Edge. 2009-2014 - Weijia Wang, Xuequan Lu, Di Shao, Xiao Liu, Richard Dazeley, Antonio Robles-Kelly, Wei Pan:
Weighted Point Cloud Normal Estimation. 2015-2020 - Yiheng Li, Canhui Tang, Runzhao Yao, Aixue Ye, Feng Wen, Shaoyi Du:
HybridPoint: Point Cloud Registration Based on Hybrid Point Sampling and Matching. 2021-2026 - Yakun Ju, Cong Zhang, Songsong Huang, Yuan Rao, Kin-Man Lam:
Learning Deep Photometric Stereo Network with Reflectance Priors. 2027-2032 - Chenyangguang Zhang, Zhiqiang Lou, Yan Di, Federico Tombari, Xiangyang Ji:
SST: Real-time End-to-end Monocular 3D Reconstruction via Sparse Spatial-Temporal Guidance. 2033-2038 - Ke Liu, Ning Ma, Zhihua Wang, Jingjun Gu, Jiajun Bu, Haishuai Wang:
Implicit Neural Distance Optimization for Mesh Neural Subdivision. 2039-2044 - Ke Ren, Zhenjiang Du, Qifeng He, Ning Xie, Guan Wang:
MRRA-GAN: Multi-Resolution Relation-Aware GAN for Point Cloud Completion. 2045-2050 - Shanshan Zhong, Wushao Wen, Jinghui Qin, Qiangpu Chen, Zhongzhan Huang:
LSAS: Lightweight Sub-attention Strategy for Alleviating Attention Bias Problem. 2051-2056 - Hui Lu, Ronald Poppe, Albert Ali Salah:
LA-layer: General local attention layer for full attention networks. 2057-2062 - Qiangxi Zhu, Zhixin Li:
A Progressive Gated Attention Model for Fine-Grained Visual Classification. 2063-2068 - Yubo Wu, Yurui Ren, Yuanqi Chen, Ge Li:
Flow-Guided Attention Deformation for Person Image Generation. 2069-2074 - Jinyi Fang, Bingke Zhu, Yingying Chen, Jinqiao Wang, Ming Tang:
Explicit Attention Modeling for Pedestrian Attribute Recognition. 2075-2080 - Yaxi Chen, Ruimin Hu, Danni Xu, Zheng Wang, Linbo Luo, Dengshi Li:
Hidden Follower Detection via Refined Gaze and Walking State Estimation. 2081-2086 - Zhenbei Wu, Haoge Deng, Qiang Wang, Di Kong, Jie Yang, Yonggang Qi:
SketchScene: Scene Sketch To Image Generation With Diffusion Models. 2087-2092 - Yanjie Pan, Yaru Du, Shandong Wang, Yun Ye, Yong Jiang, Zhen Zhou, Li Xu, Ming Lu, Yunbiao Lin, Jiehui Lu:
DanceU: motion-and-music-based automatic effect generation for dance videos. 2093-2098 - Jin Liu, Xi Wang, Xiaomeng Fu, Yesheng Chai, Cai Yu, Jiao Dai, Jizhong Han:
FONT: Flow-guided One-shot Talking Head Generation with Natural Head Motions. 2099-2104 - Wanqing Wu, Aihua Mao, Wenwei Yan, Qing Liu:
UFS-Net: Unsupervised Network For Fashion Style Editing And Generation. 2105-2110 - Yuxin Hou, Hongxun Yao, Haoran Li:
Graph Convolutional GRU for Music-Oriented Dance Choreography Generation. 2111-2116 - Zhongqi Wang, Jie Zhang, Zhilong Ji, Jinfeng Bai, Shiguang Shan:
CCLAP: Controllable Chinese Landscape Painting Generation Via Latent Diffusion Model. 2117-2122 - Zhongan Wang, Shuai Shi, Yingna Wu, Rui Yang:
Prototype calibration for long tailed recognition. 2123-2128 - Son Duy Dao, Dat Huynh, He Zhao, Dinh Phung, Jianfei Cai:
Open-Vocabulary Multi-label Image Classification with Pretrained Vision-Language Model. 2135-2140 - Ruotong Hu, Xianzhi Wang, Xiaojun Chang, Yeqi Hu, Xiaowei Xin, Xiangqian Ding, Baoqi Guo:
RASNet: A Reinforcement Assistant Network for Frame Selection in Video-based Posture Recognition. 2141-2146 - Shengqin Wang, Yongji Zhang, Hong Qi, Minghao Zhao, Yu Jiang:
Dynamic Spatial-temporal Hypergraph Convolutional Network for Skeleton-based Action Recognition. 2147-2152 - Zhengxuan Zhang, Weixing Mai, Haoliang Xiong, Chuhan Wu, Yun Xue:
A Token-wise Graph-based Framework for Multimodal Named Entity Recognition. 2153-2158 - Zhao Duan, Xiaoliu Luo, Taiping Zhang:
Multi-focus image fusion via gradient guidance progressive network. 2159-2164 - Chao-Liang Yu, I-Chen Lin:
Efficient Video Matting on Human Video Clips for Real-Time Application. 2165-2170 - Shen Yan, Xiaoya Cheng, Yuxiang Liu, Juelin Zhu, Rouwan Wu, Yu Liu, Maojun Zhang:
Render-and-Compare: Cross-view 6-DoF Localization from Noisy Prior. 2171-2176 - Zan Chen, Ran Li, Yongqiang Li, Yuanjing Feng:
Video Snapshot Compressive Imaging via Optical Flow. 2177-2182 - Wenpeng Xing, Jie Chen:
CasTensoRF: Cascaded Tensorial Radiance Fields for Novel View Synthesis. 2183-2188 - Lingzhi Li, Zhongshu Wang, Zhen Shen, Li Shen, Ping Tan:
Compact Real-Time Radiance Fields with Neural Codebook. 2189-2194 - Xiaowen Ma, Jiawei Yang, Tingfeng Hong, Mengting Ma, Ziyan Zhao, Tian Feng, Wei Zhang:
STNet: Spatial and Temporal feature fusion network for change detection in remote sensing images. 2195-2200 - Boyu Qiao, Kun Li, Wei Zhou, Zhou Yan, Shilong Li, Songlin Hu:
Social Bot Detection Based on Window Strategy. 2201-2206 - Wei Ma, Shiyong Lan, Weikang Huang, Wenwu Wang, Hongyu Yang, Yitong Ma, Yongjie Ma:
A Semantics-Aware Normalizing Flow Model for Anomaly Detection. 2207-2212 - Haitao Leng, Xiaoming Shi, Wei Zhou, Kuncai Zhang, Qiankun Shi, Pengcheng Zhu:
Online Action Detection with Learning Future Representations by Contrastive Learning. 2213-2218 - Hantao Zhang, Shouhong Wan, Weidong Guo, Peiquan Jin, Mingguang Zheng:
HOD: Human-Object Decoupling Network for HOI Detection. 2219-2224 - Yuzhe Mao, Weike You, Linna Zhou, Zhigao Lu:
Fixing Domain Bias for Generalized Deepfake Detection. 2225-2230 - Jiangming Chen, Wanxia Deng, Bo Peng, Tianpeng Liu, Yingmei Wei, Li Liu:
Variational Information Bottleneck for Cross Domain Object Detection. 2231-2236 - Peiwen Li, Lijun Zhang, Xiang-Dong Zhou, Yu Shi, Xiaohu Shao:
Attention Based Network with DA-Loss for X-ray Contraband Automatic Detection. 2237-2242 - Haiyan Zhang, Sumei Li:
Cross-Level Attention Based Adaptive Feature Alignment Network for Arbitrary-Shaped Text Detection. 2243-2248 - Yang Wu, Zhibin Liu, Hefeng Wu, Liang Lin:
Multi-object Video Generation from Single Frame Layouts. 2249-2254 - Hongshuo Tian, Ning Xu, Yanhui Wang, Chenggang Yan, Bolun Zheng, Xuanya Li, An-An Liu:
Towards Confidence-Aware Commonsense Knowledge Integration for Scene Graph Generation. 2255-2260 - Tianlong Ma, Xingjiao Wu, Xiangcheng Du, Yanlong Wang, Cheng Jin:
Image Layer Modeling for Complex Document Layout Generation. 2261-2266 - Jieting Chen, Junkai Ding, Wenping Chen, Qin Jin:
Knowledge Enhanced Model for Live Video Comment Generation. 2267-2272 - Yun Guo, Wei Feng, Zheng Zhang, Xiancong Ren, Yaoyu Li, Jingjing Lv, Xin Zhu, Zhangang Lin, Jingping Shao:
Mutual Query Network for Multi-Modal Product Image Segmentation. 2273-2278 - Xiaogang Du, Yinghao Wu, Tao Lei, Dongxin Gu, Yinyin Nie, Asoke K. Nandi:
ATENet: Adaptive Tiny-Object Enhanced Network for Polyp Segmentation. 2279-2284 - Gang Xu, Shengxin Wang, Thomas Lukasiewicz, Zhenghua Xu:
Adaptive-Masking Policy with Deep Reinforcement Learning for Self-Supervised Medical Image Segmentation. 2285-2290 - Hao Zeng, Xinxin Shan, Yu Feng, Ying Wen:
MSAANet: Multi-scale Axial Attention Network for medical image segmentation. 2291-2296 - Hao Yang, Min Wang, Zhengfei Yu, Yun Zhou:
A Simple Stochastic Neural Network for Improving Adversarial Robustness. 2297-2302 - Bo Zou, Chao Yang, Jiazhi Guan, Chengbin Quan, Youjian Zhao:
DFCP: Few-Shot DeepFake Detection via Contrastive Pretraining. 2303-2308 - Jiucui Lu, Yuezun Li, Jiaran Zhou, Bin Li, Siwei Lyu:
Forensics Forest: Multi-scale Hierarchical Cascade Forest for Detecting GAN-generated Faces. 2309-2314 - Bingyuan Huang, Sanshuai Cui, Xiangui Kang, Enping Li:
Transferable Waveform-level Adversarial Attack against Speech Anti-spoofing Models. 2315-2320 - Jian Zhang, Jiangqun Ni:
Domain-Invariant Feature Learning for General Face Forgery Detection. 2321-2326 - Yingjie He, Yuanman Li, Changsheng Chen, Xia Li:
Image Copy-Move Forgery Detection via Deep Cross-Scale PatchMatch. 2327-2332 - Yuxuan Zhang, Wei Yang, Rong Hu:
BAProto: Boundary-Aware Prototype for High-quality Instance Segmentation. 2333-2338 - Weiwei Li, Yuanyuan Ren, Junzhuo Liu, Chenyang Wang, Yuchen Zheng:
PMDA: Domain Alignment with Prototype Matching for Cross-Domain Adaptive Segmentation. 2339-2344 - Yongchao Wang, Bin Xiao, Xiuli Bi, Weisheng Li, Xinbo Gao:
Cross-slice Context Consistency for Semi-supervised 3D Left Atrium Segmentation. 2343-2350 - Chenbin Zhang, Qingyuan He, Kun Yan, Meng Ma, Defeng Liu, Ping Wang:
CTSSeg: Consistent Teacher-Student model for magnetic resonance image Segmentation. 2351-2356 - Xin Lv, Zhenming Su, Taiyi Zhang, Wenxiang Cheng, Xiaoqiong Qi:
Adaptive Non-local Affinity Graph for Unsupervised Image Segmentation. 2357-2362 - Yongtuo Liu, Dan Xu, Sucheng Ren, Hanjie Wu, Hongmin Cai, Shengfeng He:
Fine-grained Domain Adaptive Crowd Counting via Point-derived Segmentation. 2363-2368 - Zhengyi Liu, Xiaoshen Huang, Guanghui Zhang, Xianyong Fang, Linbo Wang, Bin Tang:
Scribble-Supervised RGB-T Salient Object Detection. 2369-2374 - Yibin Wang, Yuchao Feng, Jie Wu, Honghui Xu, Jianwei Zheng:
CA-GAN: Object Placement via Coalescing Attention based Generative Adversarial Network. 2375-2380 - Peiwen Pan, Huan Wang, Chenyi Wang, Chang Nie:
ABC: Attention with Bilinear Correlation for Infrared Small Target Detection. 2381-2386 - Bo Yuan, Yao Jiang, Keren Fu, Qijun Zhao:
Guided Focal Stack Refinement Network for Light Field Salient Object Detection. 2387-2392 - Zhenshan Tan, Cheng Chen, Xiaodong Gu:
Triplet Spatiotemporal Aggregation Network for Video Saliency Detection. 2393-2398 - Daosong Hu, Kai Huang:
GFNet: Gaze Focus Network using Attention for Gaze Estimation. 2399-2404 - Zepeng Wang, Ke Xu, Yuting Mou, Xinghao Jiang:
Feature Mixing and Disentangling for Occluded Person Re-Identification. 2405-2410 - Kaixiang Chen, Tiantian Gong, Liyan Zhang:
Multi-Scale Query-Adaptive Convolution for Generalizable Person Re-Identification. 2411-2416 - Mengzan Qi, Sixian Chan, Chen Hang, Guixu Zhang, Zhi Li:
Fine-grained Learning for Visible-Infrared Person Re-identification. 2417-2422 - Yimin Liu, Meibin Qi, Qiang Wu, Yanfang Yang, Xiaohong Li, Jian Zhang:
Camera Proxy based Contrastive Learning with Hard Sampling for Unsupervised Person Re-identification. 2423-2428 - Guoqing Zhang, Zhiyuan Luo, Weisi Lin, Xuan Jing:
Inter-Intra Camera Identity Learning for Person Re-Identification with Training in Single Camera. 2429-2434 - Tiantian Gong, Kaixiang Chen, Junsheng Wang, Liyan Zhang:
Dynamically Adaptive Instance Normalization and Attention-Aware Incremental Meta-Learning for Generalizable Person Re-identification. 2435-2440 - Qing Zhang, Weiqi Yan:
CFANet: A Cross-layer Feature Aggregation Network for Camouflaged Object Detection. 2441-2446 - Jiaxiang Dong, Li Zhang:
Multibox Sample Selection for Active Object Detection. 2447-2452 - Luojun Lin, Zhifeng Yang, Qipeng Liu, Yuanlong Yu, Qifeng Lin:
Run and Chase: Towards Accurate Source-Free Domain Adaptive Object Detection. 2453-2458 - Yuxuan Song, Xinyue Li, Lin Qi:
Camouflaged Object Detection with Feature Grafting and Distractor Aware. 2459-2464 - Dongyue Sun, Shiyao Jiang, Lin Qi:
Edge-Aware Mirror Network for Camouflaged Object Detection. 2465-2470 - Zhibin Zhang, Wanli Xue, Kaihua Zhang, Shengyong Chen:
'Skimming-Perusal' Detection: A Simple Object Detection Baseline in GigaPixel-level Images. 2471-2476 - Tong Zhu, Leida Li, Pengfei Chen, Jinjian Wu, Yuzhe Yang, Yaqian Li, Yandong Guo:
Attribute-assisted Multimodal Network for Image Aesthetics Assessment. 2477-2482 - Zicheng Zhang, Wei Sun, Yingjie Zhou, Wei Lu, Yucheng Zhu, Xiongkuo Min, Guangtao Zhai:
EEP-3DQA: Efficient and Effective Projection-Based 3D Model Quality Assessment. 2483-2488 - Kaifa Yang, Qi Yang, Joel Jung, Yiling Xu, Xiaozhong Xu, Shan Liu:
Exploring the Influence of View and Camera Path Selection for Dynamic Mesh Quality Assessment. 2489-2494 - Shuaibing Wang, Shunli Wang, Dingkang Yang, Mingcheng Li, Ziyun Qian, Liuzhen Su, Lihua Zhang:
HandGCAT: Occlusion-Robust 3D Hand Mesh Reconstruction from Monocular Images. 2495-2500 - Wei Lu, Wei Sun, Zicheng Zhang, Danyang Tu, Xiongkuo Min, Guangtao Zhai:
BH-VQA: Blind High Frame Rate Video Quality Assessment. 2501-2506 - Yuan Chen, Sumei Li:
Multi-Level Feature-Guided Stereoscopic Video Quality Assessment Based on Transformer and Convolutional Neural Network. 2513-2518 - Zicheng Zhang, Yingjie Zhou, Wei Sun, Wei Lu, Xiongkuo Min, Yu Wang, Guangtao Zhai:
DDH-QA: A Dynamic Digital Humans Quality Assessment Database. 2519-2524 - Litian Li, Zheng Yang, Yongqi Zhai, Jiayu Yang, Ronggang Wang:
Improving Multi-generation Robustness of Learned Image Compression. 2525-2530 - Yinqi Chen, Zhiyi Lu, Ya Lu, Yangting Zheng, Peiwen Li, Shuo Kang:
Code Verification Hashing for Image Retrieval. 2531-2536 - Xinjie Zhang, Jiawei Shao, Jun Zhang:
Low-complexity Deep Video Compression with A Distributed Coding Architecture. 2537-2542 - Yulin Wu, Ruimin Hu, Xiaochen Wang:
Perceptual Audio Object Coding Using Adaptive Subband Grouping with CNN and Residual Block. 2543-2548 - Kai Wang, Yuanchao Bai, Deming Zhai, Daxin Li, Junjun Jiang, Xianming Liu:
Learning Lossless Compression for High Bit-Depth Medical Imaging. 2549-2554 - Pengpeng Yu, Dian Zuo, Yueer Huang, Ruishan Huang, Hanyun Wang, Yulan Guo, Fan Liang:
Sparse Representation based Deep Residual Geometry Compression Network for Large-scale Point Clouds. 2555-2560 - Shaokang Wang, Xiaofeng Huang, Guoqing Xiang, Xizhong Zhu, Jiaojiao Yang, Peng Zhang, Huizhu Jia, Xiaodong Xie:
An Efficient Real-Time Hardware Architecture for Deblocking Filter in AVS3. 2561-2566 - Yuqing Yang, Xin Jin, Kedeng Tong, Chen Wang, Haitian Huang:
Microimage-based Two-step Search For Plenoptic 2.0 Video Coding. 2567-2572 - Xi Xie, Kai Zhang, Li Zhang, Meng Wang, Junru Li, Shiqi Wang:
Low Complexity Transcoding from HEVC to VVC. 2573-2578 - Sixian Chan, Jiaao Cui, Yonggan Wu, Hongqiang Wang, Cong Bai:
Visible-Xray Cross-Modality Package Re-Identification. 2579-2584 - Huy Nguyen, Kien Nguyen, Sridha Sridharan, Clinton Fookes:
Aerial-Ground Person Re-ID. 2585-2590 - Astha Verma, A. Venkata Subramanyam, Mohammad Ali Jauhar, Divij Gera, Rajiv Ratn Shah:
Meta Perturbed Re-Id Defense. 2597-2602 - Guangyu Chen, Deyuan Zhang, Tao Liu, Xiaoyong Du:
EFT: Expert Fusion Transformer for Voice-Face Association Learning. 2603-2608 - Wenjun Peng, Weidong He, Derong Xu, Tong Xu, Chen Zhu, Enhong Chen:
Social Context-aware GCN for Video Character Search via Scene-prior Enhancement. 2609-2614 - Wei Chen, Jianwei Niu, Xuefeng Liu:
MRCap: Multi-modal and Multi-level Relationship-based Dense Video Captioning. 2615-2620 - Yibo Cui, Ruqiang Huang, Yakun Zhang, Yingjie Cen, Liang Xie, Ye Yan, Erwei Yin:
Auxiliary Fine-grained Alignment Constraints for Vision-and-Language Navigation. 2621-2626 - Yusheng Dai, Hang Chen, Jun Du, Xiaofei Ding, Ning Ding, Feijun Jiang, Chin-Hui Lee:
Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder. 2627-2632 - Wei Song, Bin Wu, Chunping Zheng, Huayang Zhang:
Detection Of Public Speaking Anxiety: A New Dataset And Algorithm. 2633-2638 - Beitao Chen, Xuanhan Wang, Xiaojia Chen, Yulan He, Jingkuan Song:
EANet: Towards Lightweight Human Pose Estimation With Effective Aggregation Network. 2639-2644 - Zhihao Li, Huaxiang Zhang, Lei Zhu, Jiande Sun, Li Liu:
Effective Occlusion Suppression Network via Grouped Pose Estimation for Occluded Person Re-Identification. 2645-2650 - Yaoxing Wang, Heng Zhou, Zhendong Li, Xian Mo, Hao Liu:
Structural Equivariance Self-Supervised Learning for Facial Pose Estimation. 2651-2656 - Hongwei Zheng, Han Li, Bowen Shi, Wenrui Dai, Botao Wang, Yu Sun, Min Guo, Hongkai Xiong:
ActionPrompt: Action-Guided 3D Human Pose Estimation With Text and Pose Prompting. 2657-2662 - Guanghua Zheng, Zhongqiu Zhao, Zhao Zhang, Yi Yang:
Hierarchical Graph Neural Network for Human Pose Estimation. 2663-2668 - Chunyang Xie, Dongheng Zhang, Zhi Wu, Cong Yu, Yang Hu, Qibin Sun, Yan Chen:
RF-based Multi-view Pose Machine for Multi-Person 3D Pose Estimation. 2669-2674 - Jing Li, Liu Yang, Qilong Wang, Qinghua Hu:
Coarse Helps Fine: A Multi-Granularity Discriminative Adversarial Network for Fine-Grained Open-Set Domain Adaptation. 2675-2680 - Yao Xiao, Pengxu Wei, Cong Liu, Liang Lin:
Adversarially Robust Source-free Domain Adaptation with Relaxed Adversarial Training. 2681-2686 - Yi Li, Xin Xie, Haiyan Fu, Xiangyang Luo, Yanqing Guo:
A Compact Transformer for Adaptive Style Transfer. 2687-2692 - Jianglin Wei, Guangyi Xiao, Shun Peng, Hao Chen, Jingzhi Guo, Zhiguo Gong:
Fine-Grained Alignment for Boundary Samples under Open Set Domain Adaptation. 2693-2698 - Kai Wang, Xing Xu, Jialin Tian, Zuo Cao, Gong Zhang:
Information Selection-based Domain Adaptation from Black-box Predictors. 2699-2704 - Meng Shen, Andy J. Ma, Pong C. Yuen:
E2: Entropy Discrimination and Energy Optimization for Source-free Universal Domain Adaptation. 2705-2710 - Shengyang Sun, Xiaojin Gong:
Long-Short Temporal Co-Teaching for Weakly Supervised Video Anomaly Detection. 2711-2716 - Xiangyu Huang, Caidan Zhao, Jinhui Yu, Chenxing Gao, Zhiqiang Wu:
Multi-Level Memory-Augmented Appearance-Motion Correspondence Framework for Video Anomaly Detection. 2717-2722 - Congqi Cao, Xin Zhang, Shizhou Zhang, Peng Wang, Yanning Zhang:
Weakly Supervised Video Anomaly Detection Based on Cross-Batch Clustering Guidance. 2723-2728 - Weilin Wan, Weizhong Zhang, Cheng Jin:
Pose-Motion Video Anomaly Detection via Memory-Augmented Reconstruction and Conditional Variational Prediction. 2729-2734 - Junyi Yan, Enguang Zuo, Chen Chen, Cheng Chen, Jie Zhong, Tianle Li, Xiaoyi Lv:
Rethinking graph anomaly detection: A self-supervised Group Discrimination paradigm with Structure-Aware. 2735-2740 - Jie Zhong, Enguang Zuo, Chen Chen, Cheng Chen, Junyi Yan, Tianle Li, Xiaoyi Lv:
A Masked Attention Network with Query Sparsity Measurement for Time Series Anomaly Detection. 2741-2746 - Qiong Wang, Kui Jiang, Jinyi Lai, Zheng Wang, Jianhui Zhang:
HPCNet: A Hybrid Progressive Coupled Network for Image Deraining. 2747-2752 - Fengchao Xiong, Jun Zhou, Zhuang Zhao, Yuntao Qian:
Iterative Refinement Network for Hyperspectral Image Denoising. 2753-2758 - Yuqi Jiang, Chune Zhang, Jiao Liu:
CS-PCN: Context-Space Progressive Collaborative Network for Image Denoising. 2759-2764 - Kangliang Liu, Xiangcheng Du, Sijie Liu, Yingbin Zheng, Xingjiao Wu, Cheng Jin:
DDT: Dual-branch Deformable Transformer for Image Denoising. 2765-2770 - Fengyi Zhang, Lin Zhang, Tianjun Zhang, Dongqing Wang:
Adaptively Hashing 3DLUTs for Lightweight Real-time Image Enhancement. 2771-2776 - Yingxue Pang, Shijie Zhao, Haiqiang Wang, Gen Zhan, Junlin Li, Li Zhang:
Frequency-Assisted Adaptive Sharpening Scheme Considering Bitrate and Quality Tradeoff. 2777-2782 - Zerun Liu, Fan Zhang, Jingxuan He, Jin Wang, Zhangye Wang, Lechao Cheng:
Text-Guided Mask-Free Local Image Retouching. 2783-2788 - Guoliang You, Xiaomeng Chu, Yifan Duan, Jie Peng, Jianmin Ji, Yu Zhang, Yanyong Zhang:
P3O: Transferring Visual Representations for Reinforcement Learning via Prompting. 2789-2794 - Yehuan Wang, Jian Hu, Lin Shang:
Accurate and Complete Captions for Question-controlled Text-aware Image Captioning. 2795-2800 - Yuhao Chen, Guoqing Zhang, Hongwei Zhang, Yuhui Zheng, Weisi Lin:
Multi-level Part-aware Feature Disentangling for Text-based Person Search. 2801-2806 - Yiren Zhang, Yuanwu Xu, Mohan Chen, Yuejie Zhang, Rui Feng, Shang Gao:
SPTNET: Span-based Prompt Tuning for Video Grounding. 2807-2812 - Xingyu Zhu, Feifei Dai, Xiaoyan Gu, Haihui Fan, Bo Li, Weiping Wang:
ERPG: Enhancing Entity Representations with Prompt Guidance for Complex Named Entity Recognition. 2813-2818 - Peizhuo Lv, Hualong Ma, Jiachen Zhou, Ruigang Liang, Kai Chen, Shengzhi Zhang, Yunfei Yang:
DBIA: Data-Free Backdoor Attack Against Transformer Networks. 2819-2824 - Yangming Zhou, Yuzhou Yang, Qichao Ying, Zhenxing Qian, Xinpeng Zhang:
Multimodal Fake News Detection via CLIP-Guided Learning. 2825-2830 - Yiqiang Lv, Jingjing Chen, Zhipeng Wei, Kai Chen, Zuxuan Wu, Yu-Gang Jiang:
Downstream Task-agnostic Transferable Attacks on Language-Image Pre-training Models. 2831-2836 - Zhongqiang Huang, Yuxue Hu, Zhi Zeng, Xiang Li, Ying Sha:
Multimodal Stacked Cross Attention Network for Fine-Grained Fake News Detection. 2837-2842 - Jinghong Xia, Hongxia Wang, Sani M. Abdullahi, Heng Wang, Fei Zhang, Bingling Luo:
Adaptive and Robust Fourier-Mellin-Based Image Watermarking for Social Networking Platforms. 2843-2848 - Pengcheng Su, Rongxin Tu, Hongmei Liu, Yue Qing, Xiangui Kang:
Adversarial Attacks on Generated Text Detectors. 2849-2854 - Qianjin Du, Wei Kun, Xiaohui Kuang, Xiang Li, Gang Zhao:
Automated Software Vulnerability Detection via Curriculum Learning. 2855-2860 - Zhi Zeng, Mingmin Wu, Guodong Li, Xiang Li, Zhongqiang Huang, Ying Sha:
Correcting the Bias: Mitigating Multimodal Inconsistency Contrastive Learning for Multimodal Fake News Detection. 2861-2866 - Shiwei Jing, Jianjun Li, Wanyong Tian:
Meaningful ciphertext image encryption based on histogram shift and ND-ICM hyperchaos. 2867-2872 - Shansong Wang, Qingtian Zeng, Weijian Ni, Xue Zhang, Cheng Cheng:
Hierarchical Class Level Attribute Guided Generative Meta Learning for Pest Image Zero-shot Learning. 2873-2878 - Wenhao Qiu, Sichao Fu, Jingyi Zhang, Chengxiang Lei, Qinmu Peng:
Semantic-visual Guided Transformer for Few-shot Class-incremental Learning. 2885-2890 - Jiaxin Chen, Yanxu Hu, Meng Shen, Andy J. Ma:
Dual Episodic Sampling and Momentum Consistency Regularization for Unsupervised Few-shot Learning. 2891-2896 - Yaqian Zhou, Yu Liu, Dan Song, Jiayu Li, Xuanya Li, An-An Liu:
Cross-domain Prototype Contrastive loss for Few-shot 2D Image-Based 3D Model Retrieval. 2897-2902 - Dianlong You, Peng Wang, Yi Zhang, Ling Wang, Shunfu Jin:
Few-Shot Object Detection via Back Propagation and Dynamic Learning. 2903-2908 - Yunkai Dang, Meijun Sun, Min Zhang, Zhengyu Chen, Xinliang Zhang, Zheng Wang, Donglin Wang:
Multi-Level Correlation Network For Few-Shot Image Classification. 2909-2914 - Xixiang Lin, Zhenghao Li, Liangchen Liu, Jun Wu, Lijun Zhang, Xiang-Dong Zhou:
Irecut+MM: Data Generalization and Metric Improvement for Few-shot Learning. 2915-2920 - Yiwen Zhang, Hailun Zhang, Qijun Zhao:
Counting and Locating Anything: Class-agnostic Few-shot Object Counting and Localization. 2921-2926 - Yanhui Wang, Ning Xu, Hongshuo Tian, Bo Lv, Yulong Duan, Xuanya Li, An-An Liu:
Knowledge Prompt Makes Composed Pre-Trained Models Zero-Shot News Captioner. 2879-2884