default search action
IEEE Transactions on Multimedia, Volume 25
Volume 25, 2023
- Zan-Xia Jin, Heran Wu, Chun Yang, Fang Zhou, Jingyan Qin, Lei Xiao, Xu-Cheng Yin:
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering. 1-12 - Yu Wang, Shiwei Chen:
Multi-Agent Trajectory Prediction With Spatio-Temporal Sequence Fusion. 13-23 - Jiayi Xie, Yaochen Zhu, Zhenzhong Chen:
Micro-Video Popularity Prediction Via Multimodal Variational Information Bottleneck. 24-37 - Zhicheng Guo, Jiaxuan Zhao, Licheng Jiao, Xu Liu, Fang Liu:
A Universal Quaternion Hypergraph Network for Multimodal Video Question Answering. 38-49 - Xiao Lin, Shuzhou Sun, Wei Huang, Bin Sheng, Ping Li, David Dagan Feng:
EAPT: Efficient Attention Pyramid Transformer for Image Processing. 50-61 - Zhi Li, Haoliang Li, Xin Luo, Yongjian Hu, Kwok-Yan Lam, Alex C. Kot:
Asymmetric Modality Translation for Face Presentation Attack Detection. 62-76 - Wei Lu, Desheng Li, Liqiang Nie, Peiguang Jing, Yuting Su:
Learning Dual Low-Rank Representation for Multi-Label Micro-Video Classification. 77-89 - Yun Wang, Tong Zhang, Chuanwei Zhou, Zhen Cui, Jian Yang:
Instance-Aware Deep Graph Learning for Multi-Label Classification. 90-99 - Jae Young Choi, Bumshik Lee:
Combining Deep Convolutional Neural Networks With Stochastic Ensemble Weight Optimization for Facial Expression Recognition in the Wild. 100-111 - Zerui Shao, Yifei Pu, Jiliu Zhou, Bihan Wen, Yi Zhang:
Hyper RPCA: Joint Maximum Correntropy Criterion and Laplacian Scale Mixture Modeling on-the-Fly for Moving Object Detection. 112-125 - Yajing Liu, Zhiwei Xiong, Ya Li, Xinmei Tian, Zheng-Jun Zha:
Domain Generalization Via Encoding and Resampling in a Unified Latent Space. 126-139 - Hangwei Chen, Xiongli Chai, Feng Shao, Xuejin Wang, Qiuping Jiang, Xiangchao Meng, Yo-Sung Ho:
Perceptual Quality Assessment of Cartoon Images. 140-153 - Yang Li, Shengbin Meng, Xinfeng Zhang, Meng Wang, Shiqi Wang, Yue Wang, Siwei Ma:
User-Generated Video Quality Assessment: A Subjective and Objective Study. 154-166 - Yan Yang, Jun Yu, Jian Zhang, Weidong Han, Hanliang Jiang, Qingming Huang:
Joint Embedding of Deep Visual and Semantic Features for Medical Image Report Generation. 167-178 - Hancheng Zhu, Yong Zhou, Leida Li, Yaqian Li, Yandong Guo:
Learning Personalized Image Aesthetics From Subjective and Objective Attributes. 179-190 - Jun Cheng, Fusheng Hao, Fengxiang He, Liu Liu, Qieshi Zhang:
Mixer-Based Semantic Spread for Few-Shot Learning. 191-202 - Haojie Yuan, Qi Chu, Feng Zhu, Rui Zhao, Bin Liu, Nenghai Yu:
AutoMA: Towards Automatic Model Augmentation for Transferable Adversarial Attacks. 203-213 - Zefan Li, Bingbing Ni, Xiaokang Yang, Wenjun Zhang, Wen Gao:
Residual Quantization for Low Bit-Width Neural Networks. 214-227 - Zhaoliang Chen, Jie Yao, Guobao Xiao, Shiping Wang:
Efficient and Differentiable Low-Rank Matrix Completion With Back Propagation. 228-242 - Tong Xue, Abdallah El Ali, Tianyi Zhang, Gangyi Ding, Pablo César:
CEAP-360VR: A Continuous Physiological and Behavioral Emotion Annotation Dataset for 360$^\circ$ VR Videos. 243-255 - Gaosheng Liu, Huanjing Yue, Jiamin Wu, Jing-Yu Yang:
Intra-Inter View Interaction Network for Light Field Image Super-Resolution. 256-266 - Zhihao Wu, Jie Wen, Yong Xu, Jian Yang, David Zhang:
Multiple Instance Detection Networks With Adaptive Instance Refinement. 267-279 - Yanhua Yang, Xiaozhe Zhang, Muli Yang, Cheng Deng:
Adaptive Bias-Aware Feature Generation for Generalized Zero-Shot Learning. 280-290 - Tung-I Chen, Yueh-Cheng Liu, Hung-Ting Su, Yu-Cheng Chang, Yu-Hsiang Lin, Jia-Fong Yeh, Wen-Chin Chen, Winston H. Hsu:
Dual-Awareness Attention for Few-Shot Object Detection. 291-301 - Laizhong Cui, Erchao Ni, Yipeng Zhou, Zhi Wang, Lei Zhang, Jiangchuan Liu, Yuedong Xu:
Towards Real-Time Video Caching at Edge Servers: A Cost-Aware Deep Q-Learning Solution. 302-314 - Sutong Wang, Jiacheng Zhu, Yunqiang Yin, Dujuan Wang, T. C. Edwin Cheng, Yanzhang Wang:
Interpretable Multi-Modal Stacking-Based Ensemble Learning Method for Real Estate Appraisal. 315-328 - Zhihao Zhang, Xianqiang Yang, Chao Xu:
Natural Image Stitching With Layered Warping Constraint. 329-338 - Hao Tang, Guoshuai Zhao, Yuxia Wu, Xueming Qian:
Multisample-Based Contrastive Loss for Top-K Recommendation. 339-351 - Ke Zhang, Chun Yuan, Yiming Zhu, Yong Jiang, Lishu Luo:
Weakly Supervised Instance Segmentation by Exploring Entire Object Regions. 352-363 - Astha Verma, A. Venkata Subramanyam, Zheng Wang, Shin'ichi Satoh, Rajiv Ratn Shah:
Unsupervised Domain Adaptation for Person Re-Identification Via Individual-Preserving and Environmental-Switching Cyclic Generation. 364-377 - Carlos M. Lentisco, Luis Bellido, Andrés Cárdenas, Ricardo Flores Moyano, David Fernández:
Design of a 5G Multimedia Broadcast Application Function Supporting Adaptive Error Recovery. 378-388 - Huicong Wu, Liang Xiao, Le Sun, Byeungwoo Jeon:
A Novel Video Stabilization Model With Motion Morphological Component Priors. 389-404 - Xuehao Gao, Yang Yang, Yimeng Zhang, Maosen Li, Jin-Gang Yu, Shaoyi Du:
Efficient Spatio-Temporal Contrastive Learning for Skeleton-Based 3-D Action Recognition. 405-417 - Cheng Xue, Xionghu Zhong, Minjie Cai, Hao Chen, Wenwu Wang:
Audio-Visual Event Localization by Learning Spatial and Semantic Co-Attention. 418-429 - Guang Han, Jinpeng Su, Yaoming Liu, Yuqiu Zhao, Sam Kwong:
Multi-Stage Visual Tracking With Siamese Anchor-Free Proposal Network. 430-442 - Lei Yu, Bishan Wang, Jingwei He, Gui-Song Xia, Wen Yang:
Single Image Deraining With Continuous Rain Density Estimation. 443-456 - Jianjun Xiang, Gangyi Jiang, Mei Yu, Zhidi Jiang, Yo-Sung Ho:
No-Reference Light Field Image Quality Assessment Using Four-Dimensional Sparse Transform. 457-472 - Mehdi Rahmati, Zhuoran Qi, Dario Pompili:
Underwater Adaptive Video Transmissions Using MIMO-Based Software-Defined Acoustic Modems. 473-485 - Nan Jiang, Kuiran Wang, Xiaoke Peng, Xuehui Yu, Qiang Wang, Junliang Xing, Guorong Li, Guodong Guo, Qixiang Ye, Jianbin Jiao, Jian Zhao, Zhenjun Han:
Anti-UAV: A Large-Scale Benchmark for Vision-Based UAV Tracking. 486-500 - Yujie Huang, Ming-e Jing, Jinjia Zhou, Yuhao Liu, Yibo Fan:
LCCStyle: Arbitrary Style Transfer With Low Computational Complexity. 501-514 - Jing Yi, Yaochen Zhu, Jiayi Xie, Zhenzhong Chen:
Cross-Modal Variational Auto-Encoder for Content-Based Micro-Video Background Music Recommendation. 515-528 - Luntian Mou, Chao Zhou, Pengtao Xie, Pengfei Zhao, Ramesh C. Jain, Wen Gao, Baocai Yin:
Isotropic Self-Supervised Learning for Driver Drowsiness Detection With Attention-Based Multimodal Fusion. 529-542 - Wenhui Li, Yan Wang, Yuting Su, Xuanya Li, An-An Liu, Yongdong Zhang:
Multi-Scale Fine-Grained Alignments for Image and Sentence Matching. 543-556 - Yongqiang Kong, Yunhong Wang, Annan Li, Qiuyu Huang:
Self-Sufficient Feature Enhancing Networks for Video Salient Object Detection. 557-571 - Qinchuan Zhang, Yi Jiang, Qin Zhou, Yiru Zhao, Yao Liu, Hongtao Lu, Xian-Sheng Hua:
Single Person Dense Pose Estimation via Geometric Equivariance Consistency. 572-583 - Kailun Zhou, Liping Zhao, Zigao Ye, Huihui Wang, Tao Lin, Sheng Feng, Yufen Yang:
Equal Value String and Copy Above String Based String Prediction for SCC in AVS3. 584-592 - Maja Krivokuca, Ehsan Miandji, Christine Guillemot, Philip A. Chou:
Compression of Plenoptic Point Cloud Attributes Using 6-D Point Clouds and 6-D Transforms. 593-607 - Xiaoqing Luo, Yuanhao Gao, Anqi Wang, Zhancheng Zhang, Xiaojun Wu:
IFSepR: A General Framework for Image Fusion Based on Separate Representation Learning. 608-623 - Shihao Xu, Haocong Rao, Xiping Hu, Jun Cheng, Bin Hu:
Prototypical Contrast and Reverse Prediction: Unsupervised Skeleton Based Action Recognition. 624-634 - Huabing Zhou, Wei Wu, Yanduo Zhang, Jiayi Ma, Haibin Ling:
Semantic-Supervised Infrared and Visible Image Fusion Via a Dual-Discriminator Generative Adversarial Network. 635-648 - Ming Li, Bin Fu, Zhengfu Zhang, Yu Qiao:
Character-Aware Sampling and Rectification for Scene Text Recognition. 649-661 - Mingyue Su, Guanghua Gu, Xianlong Ren, Hao Fu, Yao Zhao:
Semi-Supervised Knowledge Distillation for Cross-Modal Hashing. 662-675 - Lei Zhu, Xiaoqiang Wang, Ping Li, Xin Yang, Qing Zhang, Weiming Wang, Carola-Bibiane Schönlieb, C. L. Philip Chen:
S $^3$ Net: Self-Supervised Self-Ensembling Network for Semi-Supervised RGB-D Salient Object Detection. 676-689 - Xinjue Hu, Yuxuan Pan, Yumei Wang, Lin Zhang, Shervin Shirmohammadi:
Multiple Description Coding for Best-Effort Delivery of Light Field Video Using GNN-Based Compression. 690-705 - Le Wang, Qing Li, Sanping Zhou, Nanning Zheng:
Multi-Panda Tracking. 706-720 - Changsheng Gao, Dong Liu, Li Li, Feng Wu:
Towards Task-Generic Image Compression: A Study of Semantics-Oriented Metrics. 721-735 - Pei Lv, Jianqi Fan, Xixi Nie, Weiming Dong, Xiaoheng Jiang, Bing Zhou, Mingliang Xu, Changsheng Xu:
User-Guided Personalized Image Aesthetic Assessment Based on Deep Reinforcement Learning. 736-749 - Xiao Tan, Huaian Chen, Kai Xu, Yi Jin, Changan Zhu:
Deep SR-HDR: Joint Learning of Super-Resolution and High Dynamic Range Imaging for Dynamic Scenes. 750-763 - Zhen Bai, Zhi Liu, Gongyang Li, Yang Wang:
Adaptive Group-Wise Consistency Network for Co-Saliency Detection. 764-776 - Chenghu Du, Feng Yu, Minghua Jiang, Ailing Hua, Xiong Wei, Tao Peng, Xinrong Hu:
VTON-SCFA: A Virtual Try-On Network Based on the Semantic Constraints and Flow Alignment. 777-791 - Shiji Zhou, Zhi Wang, Chenghao Hu, Yinan Mao, Haopeng Yan, Shanghang Zhang, Chuan Wu, Wenwu Zhu:
Caching in Dynamic Environments: A Near-Optimal Online Learning Approach. 792-804 - Shuyi Li, Bob Zhang, Lunke Fei, Shuping Zhao, Yicong Zhou:
Learning Sparse and Discriminative Multimodal Feature Codes for Finger Recognition. 805-815 - Wenxue Cui, Shaohui Liu, Feng Jiang, Debin Zhao:
Image Compressed Sensing Using Non-Local Neural Network. 816-830 - Nastaran Nourbakhsh Kaashki, Pengpeng Hu, Adrian Munteanu:
Anet: A Deep Neural Network for Automatic 3D Anthropometric Measurement Extraction. 831-844 - Xiaoyan Cai, Sen Liu, Junwei Han, Libin Yang, Zhenguo Liu, Tianming Liu:
ChestXRayBERT: A Pretrained Language Model for Chest Radiology Report Summarization. 845-855 - Xuemeng Song, Shi-Ting Fang, Xiaolin Chen, Yinwei Wei, Zhongzhou Zhao, Liqiang Nie:
Modality-Oriented Graph Learning Toward Outfit Compatibility Modeling. 856-867 - Jie Nie, Zian Zhao, Lei Huang, Weizhi Nie, Zhiqiang Wei:
Cross-Domain Recommendation Via User-Clustering and Multidimensional Information Fusion. 868-880 - Haimin Zhang, Min Xu:
Recognition of Emotions in User-Generated Videos through Frame-Level Adaptation and Emotion Intensity Learning. 881-891 - Fei Peng, Bo Long, Min Long:
A Semi-Fragile Reversible Watermarking for Authenticating 3D Models Based on Virtual Polygon Projection and Double Modulation Strategy. 892-906 - Karam Park, Jae Woong Soh, Nam Ik Cho:
A Dynamic Residual Self-Attention Network for Lightweight Single Image Super-Resolution. 907-918 - Ming Li, Jun Liu, Ce Zheng, Xinming Huang, Ziming Zhang:
Exploiting Multi-View Part-Wise Correlation via an Efficient Transformer for Vehicle Re-Identification. 919-929 - Liyuan Ma, Kejie Huang, Dongxu Wei, Zhaoyan Ming, Haibin Shen:
FDA-GAN: Flow-Based Dual Attention GAN for Human Pose Transfer. 930-941 - Chongyang Bai, Haipeng Chen, Srijan Kumar, Jure Leskovec, V. S. Subrahmanian:
M2P2: Multimodal Persuasion Prediction Using Adaptive Fusion. 942-952 - Prasen Kumar Sharma, Arun Abraham, Vikram Nelvoy Rajendiran:
A Generalized Zero-Shot Quantization of Deep Convolutional Neural Networks Via Learned Weights Statistics. 953-965 - Fan Zhao, Wenda Zhao, Huimin Lu, Yong Liu, Libo Yao, Yu Liu:
Depth-Distilled Multi-Focus Image Fusion. 966-978 - Xuanhan Wang, Yuyu Guo, Jingkuan Song, Lianli Gao, Heng Tao Shen:
AMANet: Adaptive Multi-Path Aggregation for Learning Human 2D-3D Correspondences. 979-992 - Tiejian Zhang, Xinwang Liu, Lei Gong, Siwei Wang, Xin Niu, Li Shen:
Late Fusion Multiple Kernel Clustering With Local Kernel Alignment Maximization. 993-1007 - Yiming Wang, Dongxia Chang, Zhiqiang Fu, Yao Zhao:
Consistent Multiple Graph Embedding for Multi-View Clustering. 1008-1018 - Jingjing Xiong, Lai-Man Po, Wing Yin Yu, Yuzhi Zhao, Kwok-Wai Cheung:
Distortion Map-Guided Feature Rectification for Efficient Video Semantic Segmentation. 1019-1032 - Wei Qin, Hanwang Zhang, Richang Hong, Ee-Peng Lim, Qianru Sun:
Causal Interventional Training for Image Recognition. 1033-1044 - Shikun Li, Tongliang Liu, Jiyong Tan, Dan Zeng, Shiming Ge:
Trustable Co-Label Learning From Multiple Noisy Annotators. 1045-1057 - Jiebo Luo:
Editorial. 1058-1059 - Yonggang Wen:
Editorial. 1060 - Wenqian Wang, Faliang Chang, Chunsheng Liu, Guangxin Li, Bin Wang:
GA-Net: A Guidance Aware Network for Skeleton-Based Early Activity Recognition. 1061-1073 - Qifan Wang, Yinwei Wei, Jianhua Yin, Jianlong Wu, Xuemeng Song, Liqiang Nie:
DualGNN: Dual Graph Neural Network for Multimedia Recommendation. 1074-1084 - Xiaoping Liang, Zhenjun Tang, Jingli Wu, Zhixin Li, Xinpeng Zhang:
Robust Image Hashing With Isomap and Saliency Map for Copy Detection. 1085-1097 - Shuping Zhao, Lunke Fei, Jie Wen, Jigang Wu, Bob Zhang:
Intrinsic and Complete Structure Learning Based Incomplete Multiview Clustering. 1098-1110 - Shixiang Wu, Chao Dong, Yu Qiao:
Blind Image Restoration Based on Cycle-Consistent Network. 1111-1124 - Jose Jaena Mari Ople, Tai-Ming Huang, Ming-Chih Chiu, Yi-Ling Chen, Kai-Lung Hua:
Adjustable Model Compression Using Multiple Genetic Algorithm. 1125-1132 - Le Wang, Mo Zhou, Zhenxing Niu, Qilin Zhang, Nanning Zheng:
Adaptive Ladder Loss for Learning Coherent Visual-Semantic Embedding. 1133-1147 - Weide Liu, Xiangfei Kong, Tzu-Yi Hung, Guosheng Lin:
Cross-Image Region Mining With Region Prototypical Network for Weakly Supervised Segmentation. 1148-1160 - Ziqiang Wang, Zhi Liu, Gongyang Li, Yang Wang, Tianhong Zhang, Lihua Xu, Jijun Wang:
Spatio-Temporal Self-Attention Network for Video Saliency Prediction. 1161-1174 - Rui Wang, Jun Liu, Qiuhong Ke, Duo Peng, Yinjie Lei:
Dear-Net: Learning Diversities for Skeleton-Based Early Action Recognition. 1175-1189 - Cheng Wang, Bingpeng Ma, Hong Chang, Shiguang Shan, Xilin Chen:
Person Search by a Bi-Directional Task-Consistent Learning Model. 1190-1203 - Jipeng Wu, Rongrong Ji, Qiang Wang, Shengchuan Zhang, Xiaoshuai Sun, Yan Wang, Mingliang Xu, Feiyue Huang:
Fast Monocular Depth Estimation via Side Prediction Aggregation with Continuous Spatial Refinement. 1204-1216 - Di Wang, Caiping Zhang, Quan Wang, Yumin Tian, Lihuo He, Lin Zhao:
Hierarchical Semantic Structure Preserving Hashing for Cross-Modal Retrieval. 1217-1229 - Min Cao, Cong Ding, Chen Chen, Hao Dou, Xiyuan Hu, Junchi Yan:
Progressive Context-Aware Graph Feature Learning for Target Re-Identification. 1230-1242 - Yuting Su, Wei Zhao, Peiguang Jing, Liqiang Nie:
Exploiting Low-Rank Latent Gaussian Graphical Model Estimation for Visual Sentiment Distributions. 1243-1255 - Gaoang Wang, Yizhou Wang, Renshu Gu, Weijie Hu, Jenq-Neng Hwang:
Split and Connect: A Universal Tracklet Booster for Multi-Object Tracking. 1256-1268 - Qiao Liu, Di Yuan, Nana Fan, Peng Gao, Xin Li, Zhenyu He:
Learning Dual-Level Deep Representation for Thermal Infrared Tracking. 1269-1281 - Wenhao Li, Hong Liu, Runwei Ding, Mengyuan Liu, Pichao Wang, Wenming Yang:
Exploiting Temporal Contexts With Strided Transformer for 3D Human Pose Estimation. 1282-1293 - Mengxi Jia, Xinhua Cheng, Shijian Lu, Jian Zhang:
Learning Disentangled Representation Implicitly Via Transformer for Occluded Person Re-Identification. 1294-1305 - Zhe Tang, Yi Yang, Wen Li, Defu Lian, Lixin Duan:
Deep Cross-Attention Network for Crowdfunding Success Prediction. 1306-1319 - Kun Zhang, Zhendong Mao, An-An Liu, Yongdong Zhang:
Unified Adaptive Relevance Distinguishable Attention Network for Image-Text Matching. 1320-1332 - Dongnan Liu, Chaoyi Zhang, Yang Song, Heng Huang, Chenyu Wang, Michael Barnett, Tom Weidong Cai:
Decompose to Adapt: Cross-Domain Object Detection Via Feature Disentanglement. 1333-1344 - Bin Chen, Kunhong Liu, Yong Xu, Qingqiang Wu, Junfeng Yao:
Block Division Convolutional Network With Implicit Deep Features Augmentation for Micro-Expression Recognition. 1345-1358 - Yingjian Li, Zheng Zhang, Bingzhi Chen, Guangming Lu, David Zhang:
Deep Margin-Sensitive Representation Learning for Cross-Domain Facial Expression Recognition. 1359-1373 - Jianjun Sun, Yan Zhao, Shigang Wang, Jian Wei:
3D Holoscopic Image Compression Based on Gaussian Mixture Model. 1374-1389 - Huan Liu, Wentao Liu, Zhixiang Chi, Yang Wang, Yuanhao Yu, Jun Chen, Jin Tang:
Fast Human Pose Estimation in Compressed Videos. 1390-1400 - Yujian Feng, Yimu Ji, Fei Wu, Guangwei Gao, Yang Gao, Tianliang Liu, Shangdong Liu, Xiao-Yuan Jing, Jiebo Luo:
Occluded Visible-Infrared Person Re-Identification. 1401-1413 - Haoyu Zhao, Qi Wang, Guowei Zhan, Weidong Min, Yi Zou, Shimiao Cui:
Need Only One More Point (NOOMP): Perspective Adaptation Crowd Counting in Complex Scenes. 1414-1426 - Jianjun Qian, Shumin Zhu, Chaoyu Zhao, Jian Yang, Wai Keung Wong:
OTFace: Hard Samples Guided Optimal Transport Loss for Deep Face Representation. 1427-1438 - Tianyu Shen, Deqi Li, Fei-Yue Wang, Hua Huang:
Depth-Aware Multi-Person 3D Pose Estimation With Multi-Scale Waterfall Representations. 1439-1451 - Qianqian Yu, Keqi Fan, Yuhui Zheng:
Domain Adaptive Transformer Tracking Under Occlusions. 1452-1461 - Zhihao Liu, Yuanyuan Shang, Timing Li, Guanlin Chen, Yu Wang, Qinghua Hu, Pengfei Zhu:
Robust Multi-Drone Multi-Target Tracking to Resolve Target Occlusion: A Benchmark. 1462-1476 - Zhijing Yang, Junyang Chen, Yukai Shi, Hao Li, Tianshui Chen, Liang Lin:
OccluMix: Towards De-Occlusion Virtual Try-on by Semantically-Guided Mixup. 1477-1488 - Kunyu Peng, Alina Roitberg, Kailun Yang, Jiaming Zhang, Rainer Stiefelhagen:
Delving Deep Into One-Shot Skeleton-Based Action Recognition With Diverse Occlusions. 1489-1504 - Guangwei Gao, Lei Tang, Fei Wu, Huimin Lu, Jian Yang:
JDSR-GAN: Constructing an Efficient Joint Learning Network for Masked Face Super-Resolution. 1505-1512 - Puning Zhang, Fengyi Huang, Dapeng Wu, Boran Yang, Zhigang Yang, Lei Tan:
Device-Edge-Cloud Collaborative Acceleration Method Towards Occluded Face Recognition in High-Traffic Areas. 1513-1520 - Qun Li, Ziyi Zhang, Feng Zhang, Fu Xiao:
HRNeXt: High-Resolution Context Network for Crowd Pose Estimation. 1521-1528 - Chunjie Ma, Li Zhuo, Jiafeng Li, Yutong Zhang, Jing Zhang:
Cascade Transformer Decoder Based Occluded Pedestrian Detection With Dynamic Deformable Convolution and Gaussian Projection Channel Attention Mechanism. 1529-1537 - Rui Wang, Yixue Hao, Long Hu, Jincai Chen, Min Chen, Di Wu:
Self-Supervised Learning With Data-Efficient Supervised Fine-Tuning for Crowd Counting. 1538-1546 - Yun Lan, Ruimin Hu, Xin Xu, Dengshi Li, Chao Wang, Xiaochen Wang:
From Collective Attribute Association of Groups to Precise Attribute Association of Individuals. 1547-1554 - Xingyu Yang, Mengya Han, Yong Luo, Han Hu, Yonggang Wen:
Two-Stream Prototype Learning Network for Few-Shot Face Recognition Under Occlusions. 1555-1563 - Qinyang Zeng, Chengju Liu, Ming Liu, Qijun Chen:
Contrastive 3D Human Skeleton Action Representation Learning via CrossMoCo With Spatiotemporal Occlusion Mask Data Augmentation. 1564-1574 - Jianping Gou, Xia Yuan, Baosheng Yu, Jiali Yu, Zhang Yi:
Intra- and Inter-Class Induced Discriminative Deep Dictionary Learning for Visual Recognition. 1575-1583 - Zheng Cao, Liming Xu, Danny Z. Chen, Honghao Gao, Jian Wu:
A Robust Shape-Aware Rib Fracture Detection and Segmentation Framework With Contrastive Learning. 1584-1591 - Junzhu Mao, Yazhou Yao, Zeren Sun, Xingguo Huang, Fumin Shen, Heng Tao Shen:
Attention Map Guided Transformer Pruning for Occluded Person Re-Identification on Edge Device. 1592-1599 - Yun Li, Zhe Liu, Lina Yao, Xiaojun Chang:
Attribute-Modulated Generative Meta Learning for Zero-Shot Learning. 1600-1610 - Mingjie Sun, Jimin Xiao, Eng Gee Lim, Yao Zhao:
Cycle-Free Weakly Referring Expression Grounding With Self-Paced Learning. 1611-1621 - Yang Chen, Lin Zhang, Ying Shen, Brian Nlong Zhao, Yicong Zhou:
Extrinsic Self-Calibration of the Surround-View System: A Weakly Supervised Approach. 1622-1635 - Rui Gao, Xingsong Hou, Jie Qin, Yuming Shen, Yang Long, Li Liu, Zhao Zhang, Ling Shao:
Visual-Semantic Aligned Bidirectional Network for Zero-Shot Learning. 1649-1664 - Rui Wang, Zuxuan Wu, Zejia Weng, Jingjing Chen, Guo-Jun Qi, Yu-Gang Jiang:
Cross-Domain Contrastive Learning for Unsupervised Domain Adaptation. 1665-1673 - Peng Wu, Xiaotao Liu, Jing Liu:
Weakly Supervised Audio-Visual Violence Detection. 1674-1685 - Jinlong Li, Zequn Jie, Xu Wang, Yu Zhou, Xiaolin Wei, Lin Ma:
Weakly Supervised Semantic Segmentation Via Progressive Patch Learning. 1686-1699 - Yucheng Shu, Hengbo Li, Bin Xiao, Xiuli Bi, Weisheng Li:
Cross-Mix Monitoring for Medical Image Segmentation With Limited Supervision. 1700-1712 - Bin Fan, Yuzhu Yang, Wensen Feng, Fuchao Wu, Jiwen Lu, Hongmin Liu:
Seeing Through Darkness: Visual Localization at Night via Weakly Supervised Learning of Domain Invariant Features. 1713-1726 - Tao Chen, Yazhou Yao, Lei Zhang, Qiong Wang, Guo-Sen Xie, Fumin Shen:
Saliency Guided Inter- and Intra-Class Relation Constraints for Weakly Supervised Semantic Segmentation. 1727-1737 - Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao:
Learning to Minimize the Remainder in Supervised Learning. 1738-1748 - Yuhang Zhang, Xiaopeng Zhang, Jie Li, Robert C. Qiu, Haohang Xu, Qi Tian:
Semi-Supervised Contrastive Learning With Similarity Co-Calibration. 1749-1759 - Jingwei Yan, Jingjing Wang, Qiang Li, Chunmao Wang, Shiliang Pu:
Weakly Supervised Regional and Temporal Learning for Facial Action Unit Recognition. 1760-1772 - Anran Zhang, Yandan Yang, Jun Xu, Xianbin Cao, Xiantong Zhen, Ling Shao:
Latent Domain Generation for Unsupervised Domain Adaptation Object Counting. 1773-1783 - Pedro H. T. Gama, Hugo N. Oliveira, José Marcato Junior, Jefersson A. dos Santos:
Weakly Supervised Few-Shot Segmentation via Meta-Learning. 1784-1797 - Xing Lan, Qinghao Hu, Jian Cheng:
ATF: An Alternating Training Framework for Weakly Supervised Face Alignment. 1798-1809 - Xiaoliang Qian, Yinfeng Zeng, Wei Wang, Qiuwen Zhang:
Co-Saliency Detection Guided by Group Weakly Supervised Learning. 1810-1818 - Zhigang Tu, Jiaxu Zhang, Hongyan Li, Yujin Chen, Junsong Yuan:
Joint-Bone Fusion Graph Convolutional Network for Semi-Supervised Skeleton Action Recognition. 1819-1831 - Guoliang Hua, Hong Liu, Wenhao Li, Qian Zhang, Runwei Ding, Xin Xu:
Weakly-Supervised 3D Human Pose Estimation With Cross-View U-Shaped Graph Convolutional Network. 1832-1843 - Zhuo Huang, Jian Yang, Chen Gong:
They are Not Completely Useless: Towards Recycling Transferable Unlabeled Data for Class-Mismatched Semi-Supervised Learning. 1844-1857 - Peipei Song, Dan Guo, Jun Cheng, Meng Wang:
Contextual Attention Network for Emotional Video Captioning. 1858-1867 - Huifang Li, Yidong Li, Yuanzhouhan Cao, Yushan Han, Yi Jin, Yunchao Wei:
Weakly Supervised Object Detection With Class Prototypical Network. 1868-1878 - Guangwei Gao, Yi Yu, Huimin Lu, Jian Yang, Dong Yue:
Context-Patch Representation Learning With Adaptive Neighbor Embedding for Robust Face Image Super-Resolution. 1879-1889 - Yufei Yin, Jiajun Deng, Wengang Zhou, Li Li, Houqiang Li:
FI-WSOD: Foreground Information Guided Weakly Supervised Object Detection. 1890-1902 - Jun Kong, Xuefeng Tao, Min Jiang, Tianshan Liu:
Weakly Supervised Distribution Discrepancy Minimization Learning With State Information for Person Re-Identification. 1903-1915 - Xiao Dong, Gengwei Zhang, Xunlin Zhan, Yi Ding, Yunchao Wei, Minlong Lu, Xiaodan Liang:
Caption-Aided Product Detection via Collaborative Pseudo-Label Harmonization. 1916-1927 - Guodong Ding, Angela Yao:
Temporal Action Segmentation With High-Level Complex Activity Labels. 1928-1939 - Cheng Qi, Zhiyong Feng, Meng Xing, Yong Su, Jinqing Zheng, Yiming Zhang:
Energy-Based Temporal Summarized Attentive Network for Zero-Shot Action Recognition. 1940-1953 - Yuke Li, Pin Wang, Ching-Yao Chan:
RESTEP Into the Future: Relational Spatio-Temporal Learning for Multi-Person Action Forecasting. 1954-1963 - Jialun Pei, Tianyang Cheng, He Tang, Chuanbo Chen:
Transformer-Based Efficient Salient Instance Segmentation Networks With Orientative Query. 1964-1978 - Xian Zhong, Cheng Gu, Mang Ye, Wenxin Huang, Chia-Wen Lin:
Graph Complemented Latent Representation for Few-Shot Image Classification. 1979-1990 - Yu Qiu, Yun Liu, Yanan Chen, Jianwen Zhang, Jinchao Zhu, Jing Xu:
A2SPPNet: Attentive Atrous Spatial Pyramid Pooling Network for Salient Object Detection. 1991-2006 - Li Li, Zhu Li, Shan Liu, Houqiang Li:
Plenoptic Point Cloud Compression Using Multiview Extension of High Efficiency Video Coding. 2007-2021 - Siwang Zhou, Xiaoning Deng, Chengqing Li, Yonghe Liu, Hongbo Jiang:
Recognition-Oriented Image Compressive Sensing With Deep Learning. 2022-2032 - Zipeng Ye, Mengfei Xia, Ran Yi, Juyong Zhang, Yu-Kun Lai, Xuwei Huang, Guo-Xin Zhang, Yong-Jin Liu:
Audio-Driven Talking Face Video Generation With Dynamic Convolution Kernels. 2033-2046 - Chen Li, Li Song, Shuai Chen, Rong Xie, Wenjun Zhang:
Deep Online Video Stabilization Using IMU Sensors. 2047-2060 - Yufan Hu, Junyu Gao, Changsheng Xu:
Learning Scene-Aware Spatio-Temporal GNNs for Few-Shot Early Action Prediction. 2061-2073 - Mingjie Wang, Hao Cai, Xian-Feng Han, Jun Zhou, Minglun Gong:
STNet: Scale Tree Network With Multi-Level Auxiliator for Crowd Counting. 2074-2084 - Ming Lu, Tong Chen, Zhenyu Dai, Dong Wang, Dandan Ding, Zhan Ma:
Decoder-Side Cross Resolution Synthesis for Video Compression Enhancement. 2097-2110 - Hongguang Zhang, Hongdong Li, Piotr Koniusz:
Multi-Level Second-Order Few-Shot Learning. 2111-2126 - Wenli Song, Lei Zhang, Xinbo Gao:
Compound Projection Learning for Bridging Seen and Unseen Objects. 2127-2139 - Yunxin Li, Qian Yang, Qingcai Chen, Baotian Hu, Xiaolong Wang, Yuxin Ding, Lin Ma:
Fast and Robust Online Handwritten Chinese Character Recognition With Deep Spatial and Contextual Information Fusion Network. 2140-2152 - Jin Xie, Yanwei Pang, Jing Nie, Jiale Cao, Jungong Han:
Latent Feature Pyramid Network for Object Detection. 2153-2163 - Min Wang, Wengang Zhou, Qi Tian, Houqiang Li:
Deep Graph Convolutional Quantization Networks for Image Retrieval. 2164-2175 - Zelong Zeng, Zheng Wang, Fan Yang, Shin'ichi Satoh:
Geo-Localization via Ground-to-Satellite Cross-View Image Retrieval. 2176-2188 - Yu Pang, Chengdong Wu, Hao Wu, Xiaosheng Yu:
Unsupervised Multi-Subclass Saliency Classification for Salient Object Detection. 2189-2202 - Haimin Zhang, Min Xu:
Multiscale Emotion Representation Learning for Affective Image Recognition. 2203-2212 - Jiahao Zheng, Sen Zhang, Zilu Wang, Xiaoping Wang, Zhigang Zeng:
Multi-Channel Weight-Sharing Autoencoder Based on Cascade Multi-Head Attention for Multimodal Emotion Recognition. 2213-2225 - Nan Jiang, Bin Sheng, Ping Li, Tong-Yee Lee:
PhotoHelper: Portrait Photographing Guidance Via Deep Feature Retrieval and Fusion. 2226-2238 - Xiao Li, Dong Zhang, Ming Li, Dah-Jye Lee:
Accurate Head Pose Estimation Using Image Rectification and a Lightweight Convolutional Neural Network. 2239-2251 - Yalan Ye, Tongjie Pan, Tonghoujun Luo, Jingjing Li, Heng Tao Shen:
Learning MLatent Representations for Generalized Zero-Shot Learning. 2252-2265 - Min Meng, Mengcheng Lan, Jun Yu, Jigang Wu, Ligang Liu:
Dual-Level Adaptive and Discriminative Knowledge Transfer for Cross-Domain Recognition. 2266-2279 - Debashri Roy, Yuanyuan Li, Tong Jian, Peng Tian, Kaushik Roy Chowdhury, Stratis Ioannidis:
Multi-Modality Sensing and Data Fusion for Multi-Vehicle Detection. 2280-2295 - Zhejing Hu, Yan Liu, Gong Chen, Yongxu Liu:
Can Machines Generate Personalized Music? A Hybrid Favorite-Aware Method for User Preference Music Transfer. 2296-2308 - Nayyer Aafaq, Ajmal Mian, Naveed Akhtar, Wei Liu, Mubarak Shah:
Dense Video Captioning With Early Linguistic Information Fusion. 2309-2322 - Han Yan, Haijun Zhang, Linlin Liu, Dongliang Zhou, Xiaofei Xu, Zhao Zhang, Shuicheng Yan:
Toward Intelligent Design: An AI-Based Fashion Designer Using Generative Adversarial Networks Aided by Sketch and Rendering Generators. 2323-2338 - Jiayao Shan, Sifan Zhou, Yubo Cui, Zheng Fang:
Real-Time 3D Single Object Tracking With Transformer. 2339-2353 - Zheng Chang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao:
STAM: A SpatioTemporal Attention Based Memory for Video Prediction. 2354-2367 - Dezhi Peng, Lianwen Jin, Weihong Ma, Canyu Xie, Hesuo Zhang, Shenggao Zhu, Jing Li:
Recognition of Handwritten Chinese Text by Segmentation: A Segment-Annotation-Free Approach. 2368-2381 - Junke Wang, Shaoxiang Chen, Zuxuan Wu, Yu-Gang Jiang:
FT-TDR: Frequency-Guided Transformer and Top-Down Refinement Network for Blind Face Inpainting. 2382-2392 - Yi-Xing Peng, Jile Jiao, Xuetao Feng, Wei-Shi Zheng:
Consistent Discrepancy Learning for Intra-Camera Supervised Person Re-Identification. 2393-2403 - Lintai Wu, Yong Xu, Junhui Hou, C. L. Philip Chen, Cheng-Lin Liu:
A Two-Level Rectification Attention Network for Scene Text Recognition. 2404-2414 - Hang Liu, Menghan Hu, Yuzhen Chen, Qingli Li, Guangtao Zhai, Simon X. Yang, Xiao-Ping Zhang, Xiaokang Yang:
Angel's Girl for Blind Painters: An Efficient Painting Navigation System Validated by Multimodal Evaluation Approach. 2415-2429 - Huakui Zhang, Yi Cai, Haopeng Ren, Qing Li:
Multimodal Topic Modeling by Exploring Characteristics of Short Text Social Media. 2430-2445 - Mengyang Sun, Wei Suo, Peng Wang, Yanning Zhang, Qi Wu:
A Proposal-Free One-Stage Framework for Referring Expression Comprehension and Generation via Dense Cross-Attention. 2446-2458 - Jinkun You, Yuan-Gen Wang, Guopu Zhu, Ligang Wu, Hongli Zhang, Sam Kwong:
Estimating the Secret Key of Spread Spectrum Watermarking Based on Equivalent Keys. 2459-2473 - Ziqiang Zheng, Yi Bin, Xiaoou Lv, Yang Wu, Yang Yang, Heng Tao Shen:
Asynchronous Generative Adversarial Network for Asymmetric Unpaired Image-to-Image Translation. 2474-2487 - Weihe Li, Jiawei Huang, Shiqi Wang, Chuliang Wu, Sen Liu, Jian-xin Wang:
An Apprenticeship Learning Approach for Adaptive Video Streaming Based on Chunk Quality and User Preference. 2488-2502 - Xiaoya Zhang, Shumin Zhang, Zhen Cui, Zechao Li, Jin Xie, Jian Yang:
Tube-Embedded Transformer for Pixel Prediction. 2503-2514 - Zhi Jin, Junjia Huang, Wenjin Wang, Aolin Xiong, Xiaojun Tan:
Estimating Human Weight From a Single Image. 2515-2527 - Chuan Qin, Jinchuan Hu, Fengyong Li, Zhenxing Qian, Xinpeng Zhang:
JPEG Image Encryption With Adaptive DC Coefficient Prediction and RS Pair Permutation. 2528-2542 - Lisha Wang, Chenglin Li, Wenrui Dai, Shaohui Li, Junni Zou, Hongkai Xiong:
QoE-Driven Adaptive Streaming for Point Clouds. 2543-2558 - Mengmeng Jing, Lichao Meng, Jingjing Li, Lei Zhu, Heng Tao Shen:
Adversarial Mixup Ratio Confusion for Unsupervised Domain Adaptation. 2559-2572 - Shaocan Liu, Xin Ma:
Attention-Driven Appearance-Motion Fusion Network for Action Recognition. 2573-2584 - Farzad Tashtarian, Abdelhak Bentaleb, Alireza R. Erfanian, Hermann Hellwagner, Christian Timmerer, Roger Zimmermann:
$\mathsf{HxL3}$: Optimized Delivery Architecture for HTTP Low-Latency Live Streaming. 2585-2600 - Xiaomei Zhang, Yingying Chen, Ming Tang, Jinqiao Wang, Xiangyu Zhu, Zhen Lei:
Human Parsing With Part-Aware Relation Modeling. 2601-2612 - Chengrun Qiu, Dongheng Zhang, Yang Hu, Houqiang Li, Qibin Sun, Yan Chen:
Radio-Assisted Human Detection. 2613-2623 - Pengfei Wang, Changxing Ding, Wentao Tan, Mingming Gong, Kui Jia, Dacheng Tao:
Uncertainty-Aware Clustering for Unsupervised Domain Adaptive Object Re-Identification. 2624-2635 - Liyang Sun, Yixiang Mao, Tongyu Zong, Yong Liu, Yao Wang:
Live 360 Degree Video Delivery Based on User Collaboration in a Streaming Flock. 2636-2647 - Han Fang, Zhaoyang Jia, Hang Zhou, Zehua Ma, Weiming Zhang:
Encoded Feature Enhancement in Watermarking Network for Distortion in Real Scenes. 2648-2660 - Wei Wang, Junyu Gao, Xiaoshan Yang, Changsheng Xu:
Many Hands Make Light Work: Transferring Knowledge From Auxiliary Tasks for Video-Text Retrieval. 2661-2674 - A. Sophia Koepke, Andreea-Maria Oncescu, João F. Henriques, Zeynep Akata, Samuel Albanie:
Audio Retrieval With Natural Language Queries: A Benchmark Study. 2675-2685 - En Yu, Zhuoling Li, Shoudong Han, Hongwei Wang:
RelationTrack: Relation-Aware Multiple Object Tracking With Decoupled Representation. 2686-2697 - Yi Dong, Xinghao Jiang, Zhaohong Li, Tanfeng Sun, Zhenzhen Zhang:
Multi-Channel HEVC Steganography by Minimizing IPM Steganographic Distortions. 2698-2709 - Liang Chen, Jun Liu, Weidong Chen, Bo Du:
A GLRT-Based Multi-Pixel Target Detector in Hyperspectral Imagery. 2710-2722 - Anyi Rao, Linning Xu, Zhizhong Li, Qingqiu Huang, Zhanghui Kuang, Wayne Zhang, Dahua Lin:
A Coarse-to-Fine Framework for Automatic Video Unscreen. 2723-2733 - Shuo Liu, Weize Quan, Chaoqun Wang, Yuan Liu, Bin Liu, Dong-Ming Yan:
Dense Modality Interaction Network for Audio-Visual Event Localization. 2734-2748 - Shaokun Wang, Tian Gan, Yuan Liu, Jianlong Wu, Yuan Cheng, Liqiang Nie:
Micro-Influencer Recommendation by Multi-Perspective Account Representation Learning. 2749-2760 - Desheng Cai, Shengsheng Qian, Quan Fang, Jun Hu, Wenkui Ding, Changsheng Xu:
Heterogeneous Graph Contrastive Learning Network for Personalized Micro-Video Recommendation. 2761-2773 - Lingfeng Ma, Hongtao Xie, Chuanbin Liu, Yongdong Zhang:
Learning Cross-Channel Representations for Semantic Segmentation. 2774-2787 - Zhenxiao Luo, Zelong Wang, Miao Hu, Yipeng Zhou, Di Wu:
LiveSR: Enabling Universal HD Live Video Streaming With Crowdsourced Online Learning. 2788-2798 - Qiyao Deng, Qi Li, Jie Cao, Yunfan Liu, Zhenan Sun:
Semantic-Aware Noise Driven Portrait Synthesis and Manipulation. 2799-2811 - Yuanjie Dang, Chong Huang, Peng Chen, Ronghua Liang, Xin Yang, Kwang-Ting Cheng:
Path-Analysis-Based Reinforcement Learning Algorithm for Imitation Filming. 2812-2824 - Feng Li, Yixuan Wu, Huihui Bai, Weisi Lin, Runmin Cong, Yao Zhao:
Learning Detail-Structure Alternative Optimization for Blind Super-Resolution. 2825-2838 - Zhangyu Chang, S.-H. Gary Chan:
Bi-Criteria Approximation for a Multi-Origin Multi-Channel Auto-Scaling Live Streaming Cloud. 2839-2850 - Yaxin Liu, Jianlong Wu, Leigang Qu, Tian Gan, Jianhua Yin, Liqiang Nie:
Self-Supervised Correlation Learning for Cross-Modal Retrieval. 2851-2863 - Fan Chen, Yaolin Yang, Hongjie He, Yuan Yuan:
Adaptive Coding and Ordered-Index Extended Scrambling Based RDH in Encrypted Images. 2864-2875 - Ce Wang, Dejia Xu, Renjie Wan, Bin He, Boxin Shi, Ling-Yu Duan:
Background Scene Recovery From an Image Looking Through Colored Glass. 2876-2887 - Yongri Piao, Wei Wu, Miao Zhang, Yongyao Jiang, Huchuan Lu:
Noise-Sensitive Adversarial Learning for Weakly Supervised Salient Object Detection. 2888-2897 - Laure Prétet, Gaël Richard, Clément Souchier, Geoffroy Peeters:
Video-to-Music Recommendation Using Temporal Alignment of Segments. 2898-2911 - Simeng Sun, Tao Yu, Jiahua Xu, Wei Zhou, Zhibo Chen:
GraphIQA: Learning Distortion Graph Representations for Blind Image Quality Assessment. 2912-2925 - Cong Yu, Zhi Wu, Dongheng Zhang, Zhi Lu, Yang Hu, Yan Chen:
RFGAN: RF-Based Human Synthesis. 2926-2938 - Jie Li, Cong Zhang, Zhi Liu, Richang Hong, Han Hu:
Optimal Volumetric Video Streaming With Hybrid Saliency Based Tiling. 2939-2953 - Dechao Meng, Liang Li, Xuejing Liu, Lin Gao, Qingming Huang:
Viewpoint Alignment and Discriminative Parts Enhancement in 3D Space for Vehicle ReID. 2954-2965 - Depeng Wang, Zhenzhen Hu, Yuanen Zhou, Richang Hong, Meng Wang:
A Text-Guided Generation and Refinement Model for Image Captioning. 2966-2977 - Jie Huang, Xueyang Fu, Zeyu Xiao, Feng Zhao, Zhiwei Xiong:
Low-Light Stereo Image Enhancement. 2978-2992 - Youguang Yu, Wei Zhang, Fuzheng Yang, Ge Li:
Rate-Distortion Optimized Geometry Compression for Spinning LiDAR Point Cloud. 2993-3005 - Kenan E. Ak, Ying Sun, Joo Hwee Lim:
Learning by Imagination: A Joint Framework for Text-Based Image Manipulation and Change Captioning. 3006-3016 - Yuzhi Zhao, Lai-Man Po, Wing Yin Yu, Yasar Abbas Ur Rehman, Mengyang Liu, Yujia Zhang, Weifeng Ou:
VCGAN: Video Colorization With Hybrid Generative Adversarial Network. 3017-3032 - Pengfei Zhu, Xinjie Yao, Yu Wang, Meng Cao, Binyuan Hui, Shuai Zhao, Qinghua Hu:
Latent Heterogeneous Graph Network for Incomplete Multi-View Learning. 3033-3045 - Na Li, Xinbo Zhao:
A Strong and Robust Skeleton-Based Gait Recognition Method with Gait Periodicity Priors. 3046-3058 - Yuchen Zhang, Wenrui Dai, Yong Li, Chenglin Li, Junhui Hou, Junni Zou, Hongkai Xiong:
Light Field Compression With Graph Learning and Dictionary-Guided Sparse Coding. 3059-3072 - Cheng-Hao Wu, Chih-Fan Hsu, Tzu-Kuan Hung, Carsten Griwodz, Wei Tsang Ooi, Cheng-Hsin Hsu:
Quantitative Comparison of Point Cloud Compression Algorithms With PCC Arena. 3073-3088 - Cunyi Lin, Xianwei Rong, Xiaoyan Yu:
MSAFF-Net: Multiscale Attention Feature Fusion Networks for Single Image Dehazing and Beyond. 3089-3100 - Xiaoming Zhao, Xingming Wu, Jinyu Miao, Weihai Chen, Peter C. Y. Chen, Zhengguo Li:
ALIKE: Accurate and Lightweight Keypoint Detection and Descriptor Extraction. 3101-3112 - Yuxia Wu, Lizi Liao, Gangyi Zhang, Wenqiang Lei, Guoshuai Zhao, Xueming Qian, Tat-Seng Chua:
State Graph Reasoning for Multimodal Conversational Recommendation. 3113-3124 - Xianxu Hou, Xiaokang Zhang, Hanbang Liang, Linlin Shen, Zhong Ming:
Lifelong Age Transformation With a Deep Generative Prior. 3125-3139 - Yiming Li, Xiaoshan Yang, Xuhui Huang, Zhe Ma, Changsheng Xu:
Zero-Shot Predicate Prediction for Scene Graph Parsing. 3140-3153 - Pengfei Wang, Changxing Ding, Zhiyin Shao, Zhibin Hong, Shengli Zhang, Dacheng Tao:
Quality-Aware Part Models for Occluded Person Re-Identification. 3154-3165 - Shu-Yu Chen, Yu-Kun Lai, Shihong Xia, Paul L. Rosin, Lin Gao:
3D Face Reconstruction and Gaze Tracking in the HMD for Virtual Interaction. 3166-3179 - Shaojie Li, Mingbao Lin, Yan Wang, Fei Chao, Ling Shao, Rongrong Ji:
Learning Efficient GANs for Image Translation via Differentiable Masks and Co-Attention Distillation. 3180-3189 - Yutong Gao, Liqian Liang, Congyan Lang, Songhe Feng, Yidong Li, Yunchao Wei:
Clicking Matters: Towards Interactive Human Parsing. 3190-3203 - Yangbo Feng, Junyu Gao, Changsheng Xu:
Learning Dual-Routing Capsule Graph Neural Network for Few-Shot Video Classification. 3204-3216 - Ni Zhang, Nian Liu, Junwei Han, Kaiyuan Wan, Ling Shao:
Face De-Occlusion With Deep Cascade Guidance Learning. 3217-3229 - Xiaoke Li, Zufan Zhang, Chenquan Gan, Yong Xiang:
Multi-Label Speech Emotion Recognition via Inter-Class Difference Loss Under Response Residual Network. 3230-3244 - Peter Szabó, Anderson Augusto Simiscuka, Stefano Masneri, Mikel Zorrilla, Gabriel-Miro Muntean:
A CNN-Based Framework for Enhancing 360° VR Experiences With Multisensorial Effects. 3245-3258 - Guangwei Gao, Guoan Xu, Juncheng Li, Yi Yu, Huimin Lu, Jian Yang:
FBSNet: A Fast Bilateral Symmetrical Network for Real-Time Semantic Segmentation. 3273-3283 - Zeren Sun, Yazhou Yao, Xiu-Shen Wei, Fumin Shen, Jian Zhang, Xian-Sheng Hua:
Boosting Robust Learning Via Leveraging Reusable Samples in Noisy Web Data. 3284-3295 - Nayu Liu, Xian Sun, Hongfeng Yu, Fanglong Yao, Guangluan Xu, Kun Fu:
Abstractive Summarization for Video: A Revisit in Multistage Fusion Network With Forget Gate. 3296-3310 - Mehwish Ghafoor, Arif Mahmood:
Quantification of Occlusion Handling Capability of a 3D Human Pose Estimation Framework. 3311-3318 - Lei Zhang, Yingjun Du, Jiayi Shen, Xiantong Zhen:
Learning to Learn With Variational Inference for Cross-Domain Image Classification. 3319-3328 - Jian Xiong, Hao Gao, Miaohui Wang, Hongliang Li, King Ngi Ngan, Weisi Lin:
Efficient Geometry Surface Coding in V-PCC. 3329-3342 - Yahui Liu, Yajing Chen, Linchao Bao, Nicu Sebe, Bruno Lepri, Marco De Nadai:
ISF-GAN: An Implicit Style Function for High-Resolution Image-to-Image Translation. 3343-3353 - Mingjie Sun, Jimin Xiao, Eng Gee Lim, Yao Zhao:
Starting Point Selection and Multiple-Standard Matching for Video Object Segmentation With Language Annotation. 3354-3363 - Lei Jin, Xiaojuan Wang, Xuecheng Nie, Luoqi Liu, Yandong Guo, Jian Zhao:
Grouping by Center: Predicting Centripetal Offsets for the Bottom-up Human Pose Estimation. 3364-3374 - Tong Zhu, Leida Li, Jufeng Yang, Sicheng Zhao, Hantao Liu, Jiansheng Qian:
Multimodal Sentiment Analysis With Image-Text Interaction Network. 3375-3385 - Kaiwen Yang, Xinmei Tian:
Domain-Class Correlation Decomposition for Generalizable Person Re-Identification. 3386-3396 - Weilun Wang, Wengang Zhou, Jianmin Bao, Houqiang Li:
Coherent Image Animation Using Spatial-Temporal Correspondence. 3397-3408 - Xianxu Hou, Xiaokang Zhang, Yudong Li, Linlin Shen:
TextFace: Text-to-Style Mapping Based Face Generation and Manipulation. 3409-3419 - Qing Li, Changqing Zhang, Qinghua Hu, Huazhu Fu, Pengfei Zhu:
Confidence-Aware Fusion Using Dempster-Shafer Theory for Multispectral Pedestrian Detection. 3420-3431 - Zhuangzi Li, Ge Li, Thomas H. Li, Shan Liu, Wei Gao:
Semantic Point Cloud Upsampling. 3432-3442 - Qi Liang, Qiang Li, Weizhi Nie, An-An Liu:
Unsupervised Cross-Media Graph Convolutional Network for 2D Image-Based 3D Model Retrieval. 3443-3455 - Yunhao Zhou, Yi Wang, Lap-Pui Chau:
Moving Towards Centers: Re-Ranking With Attention and Memory for Re-Identification. 3456-3468 - Lei Zhang, Hua Huang:
Image Stitching With Manifold Optimization. 3469-3482 - Wujie Zhou, Enquan Yang, Jingsheng Lei, Jian Wan, Lu Yu:
PGDENet: Progressive Guided Fusion and Depth Enhancement Network for RGB-D Indoor Scene Parsing. 3483-3494 - Yi-Jen Shih, Shih-Lun Wu, Frank Zalkow, Meinard Müller, Yi-Hsuan Yang:
Theme Transformer: Symbolic Music Generation With Theme-Conditioned Transformer. 3495-3508 - Jiayi Ma, Yang Wang, Aoxiang Fan, Guobao Xiao, Riqing Chen:
Correspondence Attention Transformer: A Context-Sensitive Network for Two-View Correspondence Learning. 3509-3524 - Fatemeh Nikoonezhad, Mohammed Ghanbari:
PRAM: Penalized Resource Allocation Method for Video Services. 3525-3533 - Di Hu, Zheng Wang, Feiping Nie, Rong Wang, Xuelong Li:
Self-Supervised Learning for Heterogeneous Audiovisual Scene Analysis. 3534-3545 - Songsong Wu, Hao Tang, Xiao-Yuan Jing, Haifeng Zhao, Jianjun Qian, Nicu Sebe, Yan Yan:
Cross-View Panorama Image Synthesis. 3546-3559 - Shihao Zou, Xinxin Zuo, Sen Wang, Yiming Qian, Chuan Guo, Li Cheng:
Human Pose and Shape Estimation From Single Polarization Images. 3560-3572 - Long Ma, Risheng Liu, Yiyang Wang, Xin Fan, Zhongxuan Luo:
Low-Light Image Enhancement via Self-Reinforced Retinex Projection Model. 3573-3586 - Chunhui Bao, Qianru Sun:
Generating Music With Emotions. 3602-3614 - Yunqing Li, Jun Du, Jianshu Zhang, Changjie Wu:
A Tree-Structure Analysis Network on Handwritten Chinese Character Error Correction. 3615-3627 - Zichen Zhao, Hai-Miao Hu, Hongda Zhang, Fei Chen, Qiang Guo:
Improving Color Constancy Using Chromaticity-Line Prior. 3642-3656 - Chang Liu, Xudong Jiang, Henghui Ding:
Instance-Specific Feature Propagation for Referring Segmentation. 3657-3667 - Liang Han, Zhaozheng Yin:
Global Memory and Local Continuity for Video Object Detection. 3681-3693 - Md Mofijul Islam, Mohammad Samin Yasar, Tariq Iqbal:
MAVEN: A Memory Augmented Recurrent Approach for Multimodal Fusion. 3694-3708 - Ercheng Pei, Yong Zhao, Meshia Cédric Oveneke, Dongmei Jiang, Hichem Sahli:
A Bayesian Filtering Framework for Continuous Affect Recognition From Facial Images. 3709-3722 - Yiwei Ma, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Yongjian Wu, Feiyue Huang, Rongrong Ji:
Knowing What it is: Semantic-Enhanced Dual Attention Transformer. 3723-3736 - Yuzhi Zhao, Lai-Man Po, Xuehui Wang, Qiong Yan, Wei Shen, Yujia Zhang, Wei Liu, Chun Kit Wong, Chiu-Sing Pang, Weifeng Ou, Wing Yin Yu, Buhua Liu:
ChildPredictor: A Child Face Prediction Framework With Disentangled Learning. 3737-3752 - Tongtong Feng, Qi Qi, Jingyu Wang, Jianxin Liao, Jiangchuan Liu:
Timely and Accurate Bitrate Switching in HTTP Adaptive Streaming With Date-Driven I-Frame Prediction. 3753-3762 - Tianyi Zhang, Abdallah El Ali, Alan Hanjalic, Pablo César:
Few-Shot Learning for Fine-Grained Emotion Recognition Using Physiological Signals. 3773-3787 - Hengyue Bi, Canhui Xu, Cao Shi, Guozhu Liu, Yuteng Li, Honghong Zhang, Jing Qu:
SRRV: A Novel Document Object Detector Based on Spatial-Related Relation and Vision. 3788-3798 - Minggang Gan, Yan Zhang:
Temporal Attention-Pyramid Pooling for Temporal Action Detection. 3799-3810 - Xin Liu, Jinhan Yi, Yiu-ming Cheung, Xing Xu, Zhen Cui:
OMGH: Online Manifold-Guided Hashing for Flexible Cross-Modal Retrieval. 3811-3824 - Wujiang Xu, Yifei Xu, Genan Sang, Li Li, Aichen Wang, Pingping Wei, Li Zhu:
Recursive Multi-Relational Graph Convolutional Network for Automatic Photo Selection. 3825-3840 - Guanglei Yang, Enrico Fini, Dan Xu, Paolo Rota, Mingli Ding, Hao Tang, Xavier Alameda-Pineda, Elisa Ricci:
Continual Attentive Fusion for Incremental Learning in Semantic Segmentation. 3841-3854 - Li Li, Zhu Li, Shan Liu, Houqiang Li:
Frame-Level Rate Control for Geometry-Based LiDAR Point Cloud Compression. 3855-3867 - Hongrun Zhang, Yanda Meng, Yitian Zhao, Xuesheng Qian, Yihong Qiao, Xiaoyun Yang, Yalin Zheng:
3D Human Pose and Shape Reconstruction From Videos via Confidence-Aware Temporal Feature Aggregation. 3868-3880 - Weiming Yang, Xianke Wang, Bowen Tian, Wei Xu, Wenqing Cheng:
A Multi-Stage Automatic Evaluation System for Sight-Singing. 3881-3893 - Kehua Guo, Changchun Shen, Bin Hu, Min Hu, Xiaoyan Kui:
RSNet: Relation Separation Network for Few-Shot Similar Class Recognition. 3894-3904 - Zhong Wang, Lin Zhang, Ying Shen, Yicong Zhou:
D-LIOM: Tightly-Coupled Direct LiDAR-Inertial Odometry and Mapping. 3905-3920 - Yunxiao Wang, Meng Liu, Yinwei Wei, Zhiyong Cheng, Yinglong Wang, Liqiang Nie:
Siamese Alignment Network for Weakly Supervised Video Moment Retrieval. 3921-3933 - Tuxin Guan, Chaofeng Li, Ke Gu, Hantao Liu, Yuhui Zheng, Xiaojun Wu:
Visibility and Distortion Measurement for No-Reference Dehazed Image Quality Assessment via Complex Contourlet Transform. 3934-3949 - Tianwen Qian, Jingjing Chen, Shaoxiang Chen, Bo Wu, Yu-Gang Jiang:
Scene Graph Refinement Network for Visual Question Answering. 3950-3961 - Kejun Wu, You Yang, Qiong Liu, Xiao-Ping Zhang:
Focal Stack Image Compression Based on Basis-Quadtree Representation. 3975-3988 - Changwei Wang, Rongtao Xu, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang:
CNDesc: Cross Normalization for Local Descriptors Learning. 3989-4001 - Yao Xue, Yu Cao, Xubin Feng, Meilin Xie, Ke Li, Xingjun Zhang, Xueming Qian:
Towards Handling Sudden Changes in Feature Maps During Depth Estimation. 4002-4012 - Xiang Deng, Songhe Feng, Gengyu Lyu, Tao Wang, Congyan Lang:
Beyond Word Embeddings: Heterogeneous Prior Knowledge Driven Multi-Label Image Classification. 4013-4025 - Chuntao Wang, Tianjian Zhang, Hao Chen, Qiong Huang, Jiangqun Ni, Xinpeng Zhang:
A Novel Encryption-Then-Lossy-Compression Scheme of Color Images Using Customized Residual Dense Spatial Network. 4026-4040 - Hengmin Zhang, Feng Qian, Bob Zhang, Wenli Du, Jianjun Qian, Jian Yang:
Incorporating Linear Regression Problems Into an Adaptive Framework With Feasible Optimizations. 4041-4051 - Jiafeng Li, Yaopeng Li, Li Zhuo, Lingyan Kuang, Tianjian Yu:
USID-Net: Unsupervised Single Image Dehazing Network via Disentangled Representations. 3587-3601 - Bairong Li, Biao Guo, Yuesheng Zhu, Jianfeng Yin, Xiangli Ji:
Superframe-Based Temporal Proposals for Weakly Supervised Temporal Action Detection. 3628-3641 - Jiaqi Zhao, Hanzheng Wang, Yong Zhou, Rui Yao, Silin Chen, Abdulmotaleb El-Saddik:
Spatial-Channel Enhanced Transformer for Visible-Infrared Person Re-Identification. 3668-3680 - Jiayi Ji, Xiaoyang Huang, Xiaoshuai Sun, Yiyi Zhou, Gen Luo, Liujuan Cao, Jianzhuang Liu, Ling Shao, Rongrong Ji:
Multi-Branch Distance-Sensitive Self-Attention Network for Image Captioning. 3962-3974 - Chengpei Xu, Wenjing Jia, Tingcheng Cui, Ruomei Wang, Yuan-fang Zhang, Xiangjian He:
Arbitrary-Shape Scene Text Detection via Visual-Relational Rectification and Contour Approximation. 4052-4066 - Wenlong Cheng, Wei Tang, Yan Huang, Yiwen Luo, Liang Wang:
A Reconstruction-Based Visual-Acoustic-Semantic Embedding Method for Speech-Image Retrieval. 4067-4080 - Ming Li, Bin Fu, Han Chen, Junjun He, Yu Qiao:
Dual Relation Network for Scene Text Recognition. 4094-4107 - Xin Deng, Hao Wang, Mai Xu, Li Li, Zulin Wang:
Omnidirectional Image Super-Resolution via Latitude Adaptive Network. 4108-4120 - Sijie Mai, Ying Zeng, Haifeng Hu:
Multimodal Information Bottleneck: Learning Minimal Sufficient Unimodal and Multimodal Representations. 4121-4134 - Xiang Wen, Shiwei Zhao, Haobo Wang, Runze Wu, Manhu Qu, Tianlei Hu, Gang Chen, Jianrong Tao, Changjie Fan:
Multi-Source Multi-Label Learning for User Profiling in Online Games. 4135-4147 - Yang Yang, Hao Zheng, Lanling Zeng, Xiangjun Shen, Yongzhao Zhan:
$L_{1}$-Regularized Reconstruction Model for Edge-Preserving Filtering. 4148-4162 - Zhengzheng Tu, Yan Ma, Zhun Li, Chenglong Li, Jieming Xu, Yongtao Liu:
RGBT Salient Object Detection: A Large-Scale Dataset and Benchmark. 4163-4176 - Yu Zhou, Weikang Gong, Yanjing Sun, Leida Li, Jinjian Wu, Xinbo Gao:
Pyramid Feature Aggregation for Hierarchical Quality Prediction of Stitched Panoramic Images. 4177-4186 - Lingxiang Yao, Worapan Kusakunniran, Peng Zhang, Qiang Wu, Jian Zhang:
Improving Disentangled Representation Learning for Gait Recognition Using Group Supervision. 4187-4198 - Chengpei Xu, Wenjing Jia, Ruomei Wang, Xiaonan Luo, Xiangjian He:
MorphText: Deep Morphology Regularized Accurate Arbitrary-Shape Scene Text Detection. 4199-4212 - Si Liu, Renda Bao, Defa Zhu, Shaofei Huang, Qiong Yan, Liang Lin, Chao Dong:
Fine-Grained Face Editing via Personalized Spatial-Aware Affine Modulation. 4213-4224 - Haodan Zhang, Yixuan Ban, Zongming Guo, Ken Chen, Xinggong Zhang:
RAM360: Robust Adaptive Multi-Layer 360$^\circ$ Video Streaming With Lyapunov Optimization. 4225-4239 - Hongyi Sun, Wanhua Li, Yueqi Duan, Jie Zhou, Jiwen Lu:
Learning Adaptive Patch Generators for Mask-Robust Image Inpainting. 4240-4252 - Huiyu Duan, Wei Shen, Xiongkuo Min, Yuan Tian, Jae-Hyun Jung, Xiaokang Yang, Guangtao Zhai:
Develop Then Rival: A Human Vision-Inspired Framework for Superimposed Image Decomposition. 4267-4281 - Bosheng Qin, Haoji Hu, Yueting Zhuang:
Deep Residual Weight-Sharing Attention Network With Low-Rank Attention for Visual Question Answering. 4282-4295 - Shihui Zhang, Dongxu Zuo, Yongliang Yang, Xiaowei Zhang:
A Transferable Adversarial Belief Attack With Salient Region Perturbation Restriction. 4296-4306 - Dong Wei, Xiaobo Shen, Quansen Sun, Xizhan Gao, Zhenwen Ren:
Sparse Representation Classifier Guided Grassmann Reconstruction Metric Learning With Applications to Image Set Analysis. 4307-4322 - Tongzhen Si, Fazhi He, Zhong Zhang, Yansong Duan:
Hybrid Contrastive Learning for Unsupervised Person Re-Identification. 4323-4334 - Xiao Wang, Xiujun Shu, Shiliang Zhang, Bo Jiang, Yaowei Wang, Yonghong Tian, Feng Wu:
MFGNet: Dynamic Modality-Aware Filter Generation for RGB-T Tracking. 4335-4348 - Zhengyan Chen, Hong Liu, Linlin Zhang, Xin Liao:
Multi-Dimensional Attention With Similarity Constraint for Weakly-Supervised Temporal Action Localization. 4349-4360 - Jiacheng Chen, Bin-Bin Gao, Zongqing Lu, Jing-Hao Xue, Chengjie Wang, Qingmin Liao:
APANet: Adaptive Prototypes Alignment Network for Few-Shot Semantic Segmentation. 4361-4373 - Bowen Ma, Tong Jia, Min Su, Xiaodong Jia, Dongyue Chen, Yichun Zhang:
Automated Segmentation of Prohibited Items in X-Ray Baggage Images Using Dense De-Overlap Attention Snake. 4374-4386 - Deyang Liu, Yan Huang, Yuming Fang, Yifan Zuo, Ping An:
Multi-Stream Dense View Reconstruction Network for Light Field Image Compression. 4400-4414 - Liang Xu, Cuiling Lan, Wenjun Zeng, Cewu Lu:
Skeleton-Based Mutually Assisted Interacted Object Localization and Human Action Recognition. 4415-4425 - Chaoqin Huang, Qinwei Xu, Yanfeng Wang, Yu Wang, Ya Zhang:
Self-Supervised Masking for Unsupervised Anomaly Detection and Localization. 4426-4438 - Xiaozhou Lei, Zixiang Fei, Wenju Zhou, Huiyu Zhou, Minrui Fei:
Low-Light Image Enhancement Using the Cell Vibration Model. 4439-4454 - Kaijun Liu, Shujing Lyu, Yue Lu:
Few-Shot Segmentation for Prohibited Items Inspection With Patch-Based Self-Supervised Learning and Prototype Reverse Validation. 4455-4463 - Aite Zhao, Yue Wang, Jianbo Li:
Transferable Self-Supervised Instance Learning for Sleep Recognition. 4464-4477 - Souradeep Chakraborty, Zijun Wei, Conor Kelton, Seoyoung Ahn, Aruna Balasubramanian, Gregory J. Zelinsky, Dimitris Samaras:
Predicting Visual Attention in Graphic Design Documents. 4478-4493 - Sheng Liu, Annan Li, Jiahao Wang, Yunhong Wang:
Bidirectional Maximum Entropy Training With Word Co-Occurrence for Video Captioning. 4494-4507 - Sanchita Ghose, John J. Prevost:
FoleyGAN: Visually Guided Generative Adversarial Network-Based Synchronous Sound Generation in Silent Videos. 4508-4519 - Wentao Tan, Lei Zhu, Jingjing Li, Huaxiang Zhang, Junwei Han:
Teacher-Student Learning: Efficient Hierarchical Message Aggregation Hashing for Cross-Modal Retrieval. 4520-4532 - Qi Liu, Honglei Su, Tianxin Chen, Hui Yuan, Raouf Hamzaoui:
No-Reference Bitstream-Layer Model for Perceptual Quality Assessment of V-PCC Encoded Point Clouds. 4533-4546 - Jin Li, Wanyun Li, Zichen Xu, Yuhao Wang, Qiegen Liu:
Wavelet Transform-Assisted Adaptive Generative Modeling for Colorization. 4547-4562 - Guangzhi Wang, Yangyang Guo, Ziwei Xu, Yongkang Wong, Mohan S. Kankanhalli:
Semantic-Aware Triplet Loss for Image Classification. 4563-4572 - Sangwook Park, David K. Han, Mounya Elhilali:
Cross-Referencing Self-Training Network for Sound Event Detection in Audio Mixtures. 4573-4585 - Yusheng Tao, Jian Zhang, Jiajing Hong, Yuesheng Zhu:
DREAMT: Diversity Enlarged Mutual Teaching for Unsupervised Domain Adaptive Person Re-Identification. 4586-4597 - Jingtao Xu, Yali Li, Shengjin Wang:
AdaZoom: Towards Scale-Aware Large Scene Object Detection. 4598-4609 - Xuesong Wang, Ke Jin, Yi Kong, C. L. Philip Chen, Yuhu Cheng:
Discriminator-Quality Evaluation GAN. 4081-4093 - Xiaolong Cheng, Xuan Zheng, Jialun Pei, He Tang, Zehua Lyu, Chuanbo Chen:
Depth-Induced Gap-Reducing Network for RGB-D Salient Object Detection: An Interaction, Guidance and Refinement Approach. 4253-4266 - Xuena Ren, Dongming Zhang, Xiuguo Bao, Yongdong Zhang:
S$^{2}$-Net:Semantic and Saliency Attention Network for Person Re-Identification. 4387-4399 - Pingyu Wang, Zhicheng Zhao, Fei Su, Hongying Meng:
LTReID: Factorizable Feature Generation With Independent Components for Long-Tailed Person Re-Identification. 4610-4622 - Wenbin Zou, Liang Chen, Yi Wu, Yunchen Zhang, Yuxiang Xu, Jun Shao:
Joint Wavelet Sub-Bands Guided Network for Single Image Super-Resolution. 4623-4637 - Yihao Liu, Jingwen He, Xiangyu Chen, Zhengwen Zhang, Hengyuan Zhao, Chao Dong, Yu Qiao:
Very Lightweight Photo Retouching Network With Conditional Sequential Modulation. 4638-4652 - Mingrui Zhang, Mading Li, Jiahao Yu, Li Chen:
Aesthetic Photo Collage With Deep Reinforcement Learning. 4653-4664 - Guanchen Ding, Daiqin Yang, Tao Wang, Sihan Wang, Yunfei Zhang:
Crowd Counting via Unsupervised Cross-Domain Feature Adaptation. 4665-4678 - Qingping Sun, Yi Xiao, Jie Zhang, Shizhe Zhou, Chi-Sing Leung, Xin Su:
A Local Correspondence-Aware Hybrid CNN-GCN Model for Single-Image Human Body Reconstruction. 4679-4690 - Chuanyi Zhang, Guosheng Lin, Qiong Wang, Fumin Shen, Yazhou Yao, Zhenmin Tang:
Guided by Meta-Set: A Data-Driven Method for Fine-Grained Visual Recognition. 4691-4703 - Yunan Li, Huizhou Chen, Qiguang Miao, Daohui Ge, Siyu Liang, Zhuoqi Ma, Bocheng Zhao:
Image Hazing and Dehazing: From the Viewpoint of Two-Way Image Translation With a Weakly Supervised Framework. 4704-4717 - Qitong Wang, Bin Fu, Ming Li, Junjun He, Xi Peng, Yu Qiao:
Region-Aware Arbitrary-Shaped Text Detection With Progressive Fusion. 4718-4729 - Zhi Wu, Dongheng Zhang, Chunyang Xie, Cong Yu, Jinbo Chen, Yang Hu, Yan Chen:
RFMask: A Simple Baseline for Human Silhouette Segmentation With Radio Signals. 4730-4741 - Hai Wang, Wenming Yang, Qingmin Liao, Jie Zhou:
Bi-RSTU: Bidirectional Recurrent Upsampling Network for Space-Time Video Super-Resolution. 4742-4751 - Hao Li, Jinghui Qin, Zhijing Yang, Pengxu Wei, Jinshan Pan, Liang Lin, Yukai Shi:
Real-World Image Super-Resolution by Exclusionary Dual-Learning. 4752-4763 - Li Zhang, Tong Qiao, Ming Xu, Ning Zheng, Shichuang Xie:
Unsupervised Learning-Based Framework for Deepfake Video Detection. 4785-4799 - Xulun Ye, Jieyu Zhao:
Graph Convolutional Network With Unknown Class Number. 4800-4813 - Xiangyu Hu, Liquan Shen, Mingxing Jiang, Ran Ma, Ping An:
LA-HDR: Light Adaptive HDR Reconstruction Framework for Single LDR Image Considering Varied Light Conditions. 4814-4829 - Zhuo Chen, Fei Yin, Qing Yang, Cheng-Lin Liu:
Cross-Lingual Text Image Recognition via Multi-Hierarchy Cross-Modal Mimic. 4830-4841 - Liping Nong, Jie Peng, Wenhui Zhang, Jiming Lin, Hongbing Qiu, Junyi Wang:
Adaptive Multi-Hypergraph Convolutional Networks for 3D Object Classification. 4842-4855 - Lei Qi, Lei Wang, Yinghuan Shi, Xin Geng:
A Novel Mix-Normalization Method for Generalizable Multi-Source Person Re-Identification. 4856-4867 - Linhui Dai, Xiang Song, Xiaohong Liu, Chengqi Li, Zhihao Shi, Jun Chen, Martin Brooks:
Enabling Trimap-Free Image Matting With a Frequency-Guided Saliency-Aware Network via Joint Learning. 4868-4879 - Junda Cheng, Xin Yang, Yuechuan Pu, Peng Guo:
Region Separable Stereo Matching. 4880-4893 - Jiehang Xie, Xuanbai Chen, Tianyi Zhang, Yixuan Zhang, Shao-Ping Lu, Pablo César, Yulu Yang:
Multimodal-Based and Aesthetic-Guided Narrative Video Summarization. 4894-4908 - Di Wang, Shuai Liu, Quan Wang, Yumin Tian, Lihuo He, Xinbo Gao:
Cross-Modal Enhancement Network for Multimodal Sentiment Analysis. 4909-4921 - Wenfeng Pang, Wei Xie, Qianhua He, Yanxiong Li, Jichen Yang:
Audiovisual Dependency Attention for Violence Detection in Videos. 4922-4932 - Chuangchuang Tan, Guanghua Gu, Tao Ruan, Shikui Wei, Yao Zhao:
Dual-Gradients Localization Framework With Skip-Layer Connections for Weakly Supervised Object Localization. 4933-4942 - Kangjian He, Xuejie Zhang, Dan Xu, Jian Gong, Lisiqi Xie:
Fidelity-driven Optimization Reconstruction and Details Preserving Guided Fusion for Multi-Modality Medical Image. 4943-4957 - Hamed RahmaniKhezri, Suhong Kim, Mohamed Hefeeda:
Unsupervised Single-Image Reflection Removal. 4958-4971 - Lele Fu, Zhaoliang Chen, Yongyong Chen, Shiping Wang:
Unified Low-Rank Tensor Learning and Spectral Embedding for Multi-View Subspace Clustering. 4972-4985 - Dongliang Zhou, Haijun Zhang, Qun Li, Jianghong Ma, Xiaofei Xu:
COutfitGAN: Learning to Synthesize Compatible Outfits Supervised by Silhouette Masks and Fashion Styles. 4986-5001 - Jingjing Jiang, Ziyi Liu, Nanning Zheng:
LiVLR: A Lightweight Visual-Linguistic Reasoning Framework for Video Question Answering. 5002-5013 - Yuchen Su, Zhiwen Shao, Yong Zhou, Fanrong Meng, Hancheng Zhu, Bing Liu, Rui Yao:
TextDCT: Arbitrary-Shaped Text Detection via Discrete Cosine Transform Mask. 5030-5042 - Tiesong Zhao, Ying Fang, Kai Wang, Qian Liu, Yuzhen Niu:
High Efficiency Vibrotactile Codec Based on Gate Recurrent Network. 5043-5052 - Zhi Lu, Yang Hu, Cong Yu, Yunchao Jiang, Yan Chen, Bing Zeng:
Personalized Fashion Recommendation With Discrete Content-Based Tensor Factorization. 5053-5064 - An-An Liu, Heyu Zhou, Xuanya Li, Lanjun Wang:
Vulnerability of Feature Extractors in 2D Image-Based 3D Object Retrieval. 5065-5076 - Sanaz Nami, Farhad Pakdaman, Mahmoud Reza Hashemi, Shervin Shirmohammadi:
BL-JUNIPER: A CNN-Assisted Framework for Perceptual Video Coding Leveraging Block-Level JND. 5077-5092 - Pengfei Guo, Hantao Liu, Delu Zeng, Tao Xiang, Leida Li, Ke Gu:
An Underwater Image Quality Assessment Metric. 5093-5106 - Zhulin Tao, Xiaohao Liu, Yewei Xia, Xiang Wang, Lifang Yang, Xianglin Huang, Tat-Seng Chua:
Self-Supervised Learning for Multimedia Recommendation. 5107-5116 - Devanshu Anand, Mohammed Amine Togou, Gabriel-Miro Muntean:
A Machine Learning Solution for Video Delivery to Mitigate Co-Tier Interference in 5G HetNets. 5117-5129 - Weide Liu, Chi Zhang, Henghui Ding, Tzu-Yi Hung, Guosheng Lin:
Few-Shot Segmentation With Optimal Transport Matching and Message Flow. 5130-5141 - Miao Zhang, Shunyu Yao, Beiqi Hu, Yongri Piao, Wei Ji:
C$^{2}$DFNet: Criss-Cross Dynamic Filter Network for RGB-D Salient Object Detection. 5142-5154 - Wei Zhai, Yang Cao, Haiyong Xie, Zheng-Jun Zha:
Deep Texton-Coherence Network for Camouflaged Object Detection. 5155-5165 - Jiande Sun, Fanfu Xue, Jing Li, Lei Zhu, Huaxiang Zhang, Jia Zhang:
TSINIT: A Two-Stage Inpainting Network for Incomplete Text. 5166-5177 - Haidong Qin, Jing Li, Yuqi Jiang, Yanran Dai, Shikuan Hong, Feng Zhou, Zhijun Wang, Tao Yang:
Bullet-Time Video Synthesis Based on Virtual Dynamic Target Axis. 5178-5191 - Jiaqi Zhou, Zehua Fu, Qiuyu Huang, Qingjie Liu, Yunhong Wang:
LgNet: A Local-Global Network for Action Recognition and Beyond. 5192-5205 - Xiaodi Guan, Fan Li, Yangfan Zhang, Pamela C. Cosman:
End-to-End Blind Video Quality Assessment Based on Visual and Memory Attention Modeling. 5206-5221 - Yiqing Cai, Zhenwei Ma, Changhong Lu, Changbo Wang, Gaoqi He:
Global Representation Guided Adaptive Fusion Network for Stable Video Crowd Counting. 5222-5233 - Junna Gao, Dehui Kong, Shaofan Wang, Jinghua Li, Baocai Yin:
DASI: Learning Domain Adaptive Shape Impression for 3D Object Reconstruction. 5248-5262 - Jingwen Hou, Weisi Lin, Guanghui Yue, Weide Liu, Baoquan Zhao:
Interaction-Matrix Based Personalized Image Aesthetics Assessment. 5263-5278 - Nam Joon Kim, Hyun Kim:
FP-AGL: Filter Pruning With Adaptive Gradient Learning for Accelerating Deep Convolutional Neural Networks. 5279-5290 - Hanqi Zhu, Jiajun Deng, Yu Zhang, Jianmin Ji, Qiuyu Mao, Houqiang Li, Yanyong Zhang:
VPFNet: Improving 3D Object Detection With Virtual Point Based LiDAR and Stereo Data Fusion. 5291-5304 - Lingyun Song, Xuequn Shang, Chen Yang, Mingxuan Sun:
Attribute-Guided Multiple Instance Hashing Network for Cross-Modal Zero-Shot Hashing. 5305-5318 - Xianjing Han, Xuemeng Song, Xingning Dong, Yinwei Wei, Meng Liu, Liqiang Nie:
DBiased-P: Dual-Biased Predicate Predictor for Unbiased Scene Graph Generation. 5319-5329 - Tianpeng Liu, Jing Li, Jia Wu, Jun Chang, Beihang Song, Bowen Yao:
Tracking With Mutual Attention Network. 5330-5343 - Yuhang Liu, Wei Wei, Daowan Peng, Xian-Ling Mao, Zhiyong He, Pan Zhou:
Depth-Aware and Semantic Guided Relational Attention Network for Visual Question Answering. 5344-5357 - Jianzhao Liu, Wei Zhou, Xin Li, Jiahua Xu, Zhibo Chen:
LIQA: Lifelong Blind Image Quality Assessment. 5358-5373 - Zhi Chen, Yadan Luo, Sen Wang, Jingjing Li, Zi Huang:
GSMFlow: Generation Shifts Mitigating Flow for Generalized Zero-Shot Learning. 5374-5385 - Qi Zhang, Jianchao Wei, Shanshe Wang, Siwei Ma, Wen Gao:
RealVR: Efficient, Economical, and Quality-of- Experience-Driven VR Video System Based on MPEG OMAF. 5386-5399 - Lanxiao Wang, Hongliang Li, Wenzhe Hu, Xiaoliang Zhang, Heqian Qiu, Fanman Meng, Qingbo Wu:
What Happens in Crowd Scenes: A New Dataset About Crowd Scenes for Image Captioning. 5400-5412 - Wei Tang, Fazhi He, Yu Liu:
YDTR: Infrared and Visible Image Fusion via Y-Shape Dynamic Transformer. 5413-5428 - Xianye Ben, Chen Gong, Tianhuan Huang, Chuanye Li, Rui Yan, Yujun Li:
Tackling Micro-Expression Data Shortage via Dataset Alignment and Active Learning. 5429-5443 - Ze Zhou, Quansen Sun, Hongjun Li, Chaobo Li, Zhenwen Ren:
Regression-Selective Feature-Adaptive Tracker for Visual Object Tracking. 5444-5457 - Naishan Zheng, Jie Huang, Feng Zhao, Xueyang Fu, Feng Wu:
Unsupervised Underexposed Image Enhancement via Self-Illuminated and Perceptual Guidance. 5469-5484 - Xiaochuang Shu, Xiangdong Zhang, Quanxue Gao, Ming Yang, Rong Wang, Xinbo Gao:
Self-Weighted Anchor Graph Learning for Multi-View Clustering. 5485-5499 - Maregu Assefa, Wei Jiang, Kumie Gedamu, Getinet Yilma, Bulbula Kumeda, Melese Ayalew:
Self-Supervised Scene-Debiasing for Video Representation Learning via Background Patching. 5500-5515 - Zhi Lu, Yang Hu, Cong Yu, Yan Chen, Bing Zeng:
Learning Fashion Compatibility With Context Conditioning Embedding. 5516-5526 - Xin Wei, Yuyuan Yao, Haoyu Wang, Liang Zhou:
Perception-Aware Cross-Modal Signal Reconstruction: From Audio-Haptic to Visual. 5527-5538 - Chengliang Liu, Zhihao Wu, Jie Wen, Yong Xu, Chao Huang:
Localized Sparse Incomplete Multi-View Clustering. 5539-5551 - Liming Zou, Jing Li, Wenbo Wan, Q. M. Jonathan Wu, Jiande Sun:
Robust Coverless Image Steganography Based on Neglected Coverless Image Dataset Construction. 5552-5564 - Xixi Nie, Bo Hu, Xinbo Gao:
MLNet: A Multi-Domain Lightweight Network for Multi-Focus Image Fusion. 5565-5579 - Qianting Ma, Yang Wang, Tieyong Zeng:
Retinex-Based Variational Framework for Low-Light Image Enhancement and Denoising. 5580-5588 - Huanjing Yue, Yijia Cheng, Yan Mao, Cong Cao, Jing-Yu Yang:
Recaptured Screen Image Demoiréing in Raw Domain. 5589-5600 - Jie Nie, Chenglong Wang, Shusong Yu, Jinjin Shi, Xiaowei Lv, Zhiqiang Wei:
MIGN: Multiscale Image Generation Network for Remote Sensing Image Semantic Segmentation. 5601-5613 - Lei Li, Kai Fan, Chun Yuan:
StrokeNet: Stroke Assisted and Hierarchical Graph Reasoning Networks. 5614-5625 - Rui Li, Danna Xue, Yu Zhu, Hao Wu, Jinqiu Sun, Yanning Zhang:
Self-Supervised Monocular Depth Estimation With Frequency-Based Recurrent Refinement. 5626-5637 - Xian-Feng Han, Yi-Fei Jin, Hui-Xian Cheng, Guoqiang Xiao:
Dual Transformer for Point Cloud Analysis. 5638-5648 - Jiaheng Liu, Jinyang Guo, Dong Xu:
GeometryMotion-Transformer: An End-to-End Framework for 3D Action Recognition. 5649-5661 - Weihe Li, Jiawei Huang, Wenjun Lyu, Baoshen Guo, Wanchun Jiang, Jianxin Wang:
RAV: Learning-Based Adaptive Streaming to Coordinate the Audio and Video Bitrate Selections. 5662-5675 - Kuiyuan Zhang, Zhongyun Hua, Yuanman Li, Yongyong Chen, Yicong Zhou:
AMS-Net: Adaptive Multi-Scale Network for Image Compressive Sensing. 5676-5689 - Kangle Wu, Jun Chen, Jiayi Ma:
DMEF: Multi-Exposure Image Fusion Based on a Novel Deep Decomposition Method. 5690-5703 - Huaian Chen, Jianfeng Wang, Minghui Duan, Yi Jin, Yan Kan, Changan Zhu:
Video Denoising for Scenes With Challenging Motion: A Comprehensive Analysis and a New Framework. 5704-5719 - Xiaofeng Ding, Tieyong Zeng, Jian Tang, Zhengping Che, Yaxin Peng:
SRRNet: A Semantic Representation Refinement Network for Image Segmentation. 5720-5732 - Kaihua Zhang, Yang Wu, Mingliang Dong, Bo Liu, Dong Liu, Qingshan Liu:
Deep Object Co-Segmentation and Co-Saliency Detection via High-Order Spatial-Semantic Network Modulation. 5733-5746 - Shaowei Weng, Ye Zhou, Tiancong Zhang, Mengyao Xiao, Yao Zhao:
General Framework to Reversible Data Hiding for JPEG Images With Multiple Two-Dimensional Histograms. 5747-5762 - Shule Deng, Jin-Gang Yu, Zihao Wu, Hongxia Gao, Yansheng Li, Yang Yang:
Learning Relative Feature Displacement for Few-Shot Open-Set Recognition. 5763-5774 - Jian Jin, Xingxing Zhang, Lili Meng, Weisi Lin, Jie Liang, Huaxiang Zhang, Yao Zhao:
Auto-Weighted Layer Representation Based View Synthesis Distortion Estimation for 3-D Video Coding. 5775-5788 - Ginam Kim, Hyunsung Kim, Kyeongbo Kong, Jou Won Song, Suk-Ju Kang:
Human Body-Aware Feature Extractor Using Attachable Feature Corrector for Human Pose Estimation. 5789-5799 - Junwen Xiong, Yu Zhou, Peng Zhang, Lei Xie, Wei Huang, Yufei Zha:
Look&listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement. 5800-5812 - Haoyu Chen, Minggui Teng, Boxin Shi, Yizhou Wang, Tie-Jun Huang:
A Residual Learning Approach to Deblur and Generate High Frame Rate Video With an Event Camera. 5826-5839 - Frank Po Wen Lo, Yao Guo, Yingnan Sun, Jianing Qiu, Benny Lo:
An Intelligent Vision-Based Nutritional Assessment Method for Handheld Food Items. 5840-5851 - Longrong Yang, Hongliang Li, Qingbo Wu, Fanman Meng, Heqian Qiu, Linfeng Xu:
Bias-Correction Feature Learner for Semi-Supervised Instance Segmentation. 5852-5863 - Yanli Ji, Shuo Ma, Xing Xu, Xuelong Li, Heng Tao Shen:
Self-Supervised Fine-Grained Cycle-Separation Network (FSCN) for Visual-Audio Separation. 5864-5876 - Yushu Zhang, Wentao Zhou, Ruoyu Zhao, Xinpeng Zhang, Xiaochun Cao:
F-TPE: Flexible Thumbnail-Preserving Encryption Based on Multi-Pixel Sum-Preserving Encryption. 5877-5891 - Muli Yang, Chenghao Xu, Aming Wu, Cheng Deng:
A Decomposable Causal View of Compositional Zero-Shot Learning. 5892-5902 - Tianxin Huang, Hao Zou, Jinhao Cui, Jiangning Zhang, Xuemeng Yang, Lin Li, Yong Liu:
Adaptive Recurrent Forward Network for Dense Point Cloud Completion. 5903-5915 - Lin Zhang, Mingxin Zhang, Ran Song, Ziying Zhao, Xiaolei Li:
Unsupervised Embedding Learning With Mutual-Information Graph Convolutional Networks. 5916-5926 - Jie Li, Yong Xiang, Hao Wu, Shaowen Yao, Dan Xu:
Optimal Transport-Based Patch Matching for Image Style Transfer. 5927-5940 - Jacob Chakareski, Xavier Corbillon, Gwendal Simon, Viswanathan (Vishy) Swaminathan:
User Navigation Modeling, Rate-Distortion Analysis, and End-to-End Optimization for Viewport-Driven 360$^\circ $ Video Streaming. 5941-5956 - Jiaqi Zhang, Yunrui Jian, Suhong Wang, Chuanmin Jia, Shanshe Wang, Siwei Ma, Wen Gao:
Textural and Directional Information Based Offset In-Loop Filtering in AVS3. 5957-5971 - Jun-Sang Yoo, Dong-Wook Kim, Yucheng Lu, Seung-Won Jung:
RZSR: Reference-Based Zero-Shot Super-Resolution With Depth Guided Self-Exemplars. 5972-5983 - Majjed Al-Qatf, Xingfu Wang, Ammar Hawbani, Amr Abdussalam, Saeed Hamood Alsamhi:
Image Captioning With Novel Topics Guidance and Retrieval-Based Topics Re-Weighting. 5984-5999 - Zhentan Zheng, Jianyi Liu, Nanning Zheng:
P$^{2}$-GAN: Efficient Stroke Style Transfer Using Single Style Image. 6000-6012 - Ali Ak, Abhishek Goswami, Wolf Hauser, Patrick Le Callet, Frédéric Dufaux:
RV-TMO: Large-Scale Dataset for Subjective Quality Assessment of Tone Mapped Images. 6013-6025 - Pei Wang, Yun Yang, Yuelong Xia, Kun Wang, Xingyi Zhang, Song Wang:
Information Maximizing Adaptation Network With Label Distribution Priors for Unsupervised Domain Adaptation. 6026-6039 - Dingkang Liang, Wei Xu, Yingying Zhu, Yu Zhou:
Focal Inverse Distance Transform Maps for Crowd Localization. 6040-6052 - Sheikh Tania, Gour C. Karmakar, Shyh Wei Teng, M. Manzur Murshed:
A Robust Local Texture Descriptor in the Parametric Space of the Weibull Distribution. 6053-6066 - Huihui Yue, Jichang Guo, Xiangjun Yin, Yi Zhang, Sida Zheng:
Deep Label Prior: Pre-Training-Free Salient Object Detection Network Based on Label Learning. 6067-6078 - Jakub Nawala, Lucjan Janowski, Bogdan Cmiel, Krzysztof Rusek, Pablo Pérez:
Generalized Score Distribution: A Two-Parameter Discrete Distribution Accurately Describing Responses From Quality of Experience Subjective Experiments. 6090-6104 - Congcong Li, Jing Li, Yuguang Xie, Jiayang Nie, Tao Yang, Zhaoyang Lu:
Calibration-Free Cross-Camera Target Association Using Interaction Spatiotemporal Consistency. 6105-6120 - Zhe Xu, Kun Wei, Xu Yang, Cheng Deng:
Point-Supervised Video Temporal Grounding. 6121-6131 - Wenfeng Song, Xia Hou, Shuai Li, Chenglizhao Chen, Danyang Gao, Xian'e Wang, Yuzhe Sun, Jianxia Hou, Aimin Hao:
An Intelligent Virtual Standard Patient for Medical Students Training Based on Oral Knowledge Graph. 6132-6145 - Xu Yin, Dongbo Min, Yuchi Huo, Sung-Eui Yoon:
Contour-Aware Equipotential Learning for Semantic Segmentation. 6146-6156 - Junbao Zhuo, Shuhui Wang, Qingming Huang:
Uncertainty Modeling for Robust Domain Adaptation Under Noisy Environments. 6157-6170 - Zhiqi Pang, Lingling Zhao, Qiuyang Liu, Chunyu Wang:
Camera Invariant Feature Learning for Unsupervised Person Re-Identification. 6171-6182 - Jiahao Nie, Zhiwei He, Yuxiang Yang, Mingyu Gao, Zhekang Dong:
Learning Localization-Aware Target Confidence for Siamese Visual Tracking. 6194-6206 - Chao Sun, Zhedong Zheng, Xiaohan Wang, Mingliang Xu, Yi Yang:
Self-Supervised Point Cloud Representation Learning via Separating Mixed Shapes. 6207-6218 - Fuxiang Wu, Liu Liu, Fusheng Hao, Fengxiang He, Jun Cheng:
Language-Based Image Manipulation Built on Language-Guided Ranking. 6219-6231 - Yamin Sepehri, Pedram Pad, Clément Kündig, Pascal Frossard, L. Andrea Dunbar:
Privacy-Preserving Image Acquisition for Neural Vision Systems. 6232-6244 - Sweta Anmulwar, Ning Wang, Vu San Ha Huynh, Stewart Bryant, Jinze Yang, Regius Rahim Tafazolli:
HoloSync: Frame Synchronisation for Multi-Source Holographic Teleportation Applications. 6245-6257 - Jingzhao Xu, Mengke Yuan, Dong-Ming Yan, Tieru Wu:
Illumination Guided Attentive Wavelet Network for Low-Light Image Enhancement. 6258-6271 - Xixia Xu, Qi Zou, Xue Lin:
Structure-Enriched Topology Learning For Cross-Domain Multi-Person Pose Estimation. 6272-6284 - Jun Chen, Hui Duan, Yuanxin Song, Zemin Cai, Guangguang Yang:
Optical Flow Computation for Video Under the Dynamic Illumination. 6285-6300 - Jiandian Zeng, Jiantao Zhou, Tianyi Liu:
Robust Multimodal Sentiment Analysis via Tag Encoding of Uncertain Missing Modalities. 6301-6314 - Wei Wang, Junyu Gao, Changsheng Xu:
Weakly-Supervised Video Object Grounding via Learning Uni-Modal Associations. 6329-6340 - Qing Li, Ying Chen, Aoyang Zhang, Yong Jiang, Longhao Zou, Zhimin Xu, Gabriel-Miro Muntean:
A Super-Resolution Flexible Video Coding Solution for Improving Live Streaming Quality. 6341-6355 - Hanyang Jin, Shenqi Lai, Qi Tang, Tianyu Zhu, Xueming Qian:
MPPM: A Mobile-Efficient Part Model for Object re-ID. 6356-6370 - Qiangqiang Shen, Shuangyan Yi, Yongsheng Liang, Yongyong Chen, Wei Liu:
Bilateral Fast Low-Rank Representation With Equivalent Transformation for Subspace Clustering. 6371-6383 - Qiaokang Xie, Zhenbo Lu, Wengang Zhou, Houqiang Li:
Improving Person Re-Identification With Multi-Cue Similarity Embedding and Propagation. 6384-6396 - Wenhong Duan, Zhenhua Liu, Chuanmin Jia, Shanshe Wang, Siwei Ma, Wen Gao:
Differential Weight Quantization for Multi-Model Compression. 6397-6410 - Tiesong Zhao, Yuhang Huang, Weize Feng, Yiwen Xu, Sam Kwong:
Efficient VVC Intra Prediction Based on Deep Feature Fusion and Probability Estimation. 6411-6421 - Yawen Cui, Wanxia Deng, Xin Xu, Zhen Liu, Zhong Liu, Matti Pietikäinen, Li Liu:
Uncertainty-Guided Semi-Supervised Few-Shot Class-Incremental Learning With Knowledge Distillation. 6422-6435 - Dongyan Nie, Jialin Liu, Hong Fei, Xiaoying Sun:
Neuromorphic Similarity Measurement of Tactile Stimuli in Human-Machine Interface. 6436-6445 - Huaxin Pang, Shikui Wei, Gangjian Zhang, Shiyin Zhang, Shuang Qiu, Yao Zhao:
Heterogeneous Feature Alignment and Fusion in Cross-Modal Augmented Space for Composed Image Retrieval. 6446-6457 - Chuang Yang, Mulin Chen, Yuan Yuan, Qi Wang:
Reinforcement Shrink-Mask for Text Detection. 6458-6470 - Junjie Wu, Changqun Xia, Tianshu Yu, Jia Li:
View-Aware Salient Object Detection for $360^{\circ }$ Omnidirectional Image. 6471-6484 - Wendong Mao, Shuai Yang, Huihong Shi, Jiaying Liu, Zhongfeng Wang:
Intelligent Typography: Artistic Text Style Transfer for Complex Texture and Structure. 6485-6498 - Guanghui Yue, Di Cheng, Leida Li, Tianwei Zhou, Hantao Liu, Tianfu Wang:
Semi-Supervised Authentically Distorted Image Quality Assessment With Consistency-Preserving Dual-Branch Convolutional Neural Network. 6499-6511 - Naiyu Fang, Lemiao Qiu, Shuyou Zhang, Zili Wang, Kerui Hu, Liangyu Dong:
A Novel Human Image Sequence Synthesis Method by Pose-Shape-Content Inference. 6512-6524 - Jinchao Zhu, Xiaoyu Zhang, Xian Fang, Yuxuan Wang, Panlong Tan, Junnan Liu:
Perception-and-Regulation Network for Salient Object Detection. 6525-6537 - Dehui Zhu, Bo Du, Yanni Dong, Liangpei Zhang:
Target Detection With Spatial-Spectral Adaptive Sample Generation and Deep Metric Learning for Hyperspectral Imagery. 6538-6550 - Yiming Wang, Dongxia Chang, Zhiqiang Fu, Jie Wen, Yao Zhao:
Graph Contrastive Partial Multi-View Clustering. 6551-6562 - Changchong Sheng, Li Liu, Wanxia Deng, Liang Bai, Zhong Liu, Songyang Lao, Gangyao Kuang, Matti Pietikäinen:
Importance-Aware Information Bottleneck Learning Paradigm for Lip Reading. 6563-6574 - Huan Deng, Zhenguo Yang, Tianyong Hao, Qing Li, Wenyin Liu:
Multimodal Affective Computing With Dense Fusion Transformer for Inter- and Intra-Modality Interactions. 6575-6587 - Gaosheng Liu, Huanjing Yue, Jiamin Wu, Jing-Yu Yang:
Efficient Light Field Angular Super-Resolution With Sub-Aperture Feature Learning and Macro-Pixel Upsampling. 6588-6600 - Yukun Qiu, Fa-Ting Hong, Wei-Hong Li, Wei-Shi Zheng:
Learning Relation Models to Detect Important People in Still Images. 6601-6615 - Congcong Zhu, Xiaoqiang Li, Jide Li, Songmin Dai, Weiqin Tong:
Multi-Sourced Knowledge Integration for Robust Self-Supervised Facial Landmark Tracking. 6616-6628 - Huibing Wang, Guangqi Jiang, Jinjia Peng, Ruoxi Deng, Xianping Fu:
Towards Adaptive Consensus Graph: Multi-View Clustering via Graph Collaboration. 6629-6641 - Yong Li, Qiang Hao, Jianguo Hu, Xinmiao Pan, Zechao Li, Zhen Cui:
3D3M: 3D Modulated Morphable Model for Monocular Face Reconstruction. 6642-6652 - Tingyu Weng, Jun Xiao, Feilong Yan, Haiyong Jiang:
Context-Aware 3D Point Cloud Semantic Segmentation With Plane Guidance. 6653-6664 - Wei Xia, Qianqian Wang, Quanxue Gao, Ming Yang, Xinbo Gao:
Self-Consistent Contrastive Attributed Graph Clustering With Pseudo-Label Prompt. 6665-6677 - Xin Yao, Min Wang, Wengang Zhou, Houqiang Li:
Hash Bit Selection With Reinforcement Learning for Image Retrieval. 6678-6687 - Chen Ju, Peisen Zhao, Siheng Chen, Ya Zhang, Xiaoyun Zhang, Yanfeng Wang, Qi Tian:
Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization. 6688-6701 - Peipei Zhu, Xiao Wang, Yong Luo, Zhenglong Sun, Wei-Shi Zheng, Yaowei Wang, Changwen Chen:
Unpaired Image Captioning by Image-Level Weakly-Supervised Visual Concept Recognition. 6702-6716 - Xinsheng Wang, Qicong Xie, Jihua Zhu, Lei Xie, Odette Scharenborg:
AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Persons. 6717-6728 - Taro Narahara, Toshihiko Yamasaki:
Subjective Functionality and Comfort Prediction for Apartment Floor Plans and Its Application to Intuitive Online Property Search. 6729-6742 - Zhenrong Zhang, Jiefeng Ma, Jun Du, Licheng Wang, Jianshu Zhang:
Multimodal Pre-Training Based on Graph Attention Network for Document Understanding. 6743-6755 - Chunjie Zhang, Huihui Bai, Yao Zhao:
Fine-Grained Image Classification by Class and Image-Specific Decomposition With Multiple Views. 6756-6766 - Zehua Sheng, Xiongwei Liu, Si-Yuan Cao, Hui-Liang Shen, Huaqi Zhang:
Frequency-Domain Deep Guided Image Denoising. 6767-6781 - Kunpeng Niu, Yanli Liu, Enhua Wu, Guanyu Xing:
A Boundary-Aware Network for Shadow Removal. 6782-6793 - Lirong Zheng, Yanshan Li, Kaihao Zhang, Wenhan Luo:
T-Net: Deep Stacked Scale-Iteration Network for Image Dehazing. 6794-6807 - Dengyan Luo, Mao Ye, Shuai Li, Ce Zhu, Xue Li:
Spatio-Temporal Detail Information Retrieval for Compressed Video Quality Enhancement. 6808-6820 - Jian Zhu, Qingwu Zhang, Lunke Fei, Ruichu Cai, Yuan Xie, Bin Sheng, Xiaokang Yang:
FFFN: Frame-By-Frame Feedback Fusion Network for Video Super-Resolution. 6821-6835 - Shijia Ni, Feng Shao, Xiongli Chai, Hangwei Chen, Yo-Sung Ho:
Composition-Guided Neural Network for Image Cropping Aesthetic Assessment. 6836-6851 - Yuanman Li, Jiaxiang You, Jiantao Zhou, Wei Wang, Xin Liao, Xia Li:
Image Operation Chain Detection with Machine Translation Framework. 6852-6867 - Tong Zhu, Leida Li, Jufeng Yang, Sicheng Zhao, Xiao Xiao:
Multimodal Emotion Classification With Multi-Level Semantic Reasoning Network. 6868-6880 - Pinzhuo Tian, Shaorong Xie:
An Adversarial Meta-Training Framework for Cross-Domain Few-Shot Learning. 6881-6891 - Minsoo Song, Gi-Mun Um, Heekyung Lee, Jeongil Seo, Wonjun Kim:
Dynamic Residual Filtering With Laplacian Pyramid for Instance Segmentation. 6892-6903 - Nannan Hu, Yue Ming, Chunxiao Fan, Fan Feng, Boyang Lyu:
TSFNet: Triple-Steam Image Captioning. 6904-6916 - Weimin Tan, Ganghui Ru, Yueming Jiang, Jichun Li, Bo Yan:
Rethinking and Improving Few-Shot Segmentation From a Contour-Aware Perspective. 6917-6929 - Yuchun Fang, Sirui Cai, Yiting Cao, Zhengchen Li, Zhaoxiang Zhang:
Adversarial Learning Guided Task Relatedness Refinement for Multi-Task Deep Learning. 6946-6957 - Haonan Zhang, Longjun Liu, Bingyao Kang, Nanning Zheng:
Hierarchical Model Compression via Shape-Edge Representation of Feature Maps - an Enlightenment From the Primate Visual System. 6958-6970 - Runmin Cong, Kepu Zhang, Chen Zhang, Feng Zheng, Yao Zhao, Qingming Huang, Sam Kwong:
Does Thermal Really Always Matter for RGB-T Salient Object Detection? 6971-6982 - Yadong Qu, Hongtao Xie, Shancheng Fang, Yuxin Wang, Yongdong Zhang:
ADNet: Rethinking the Shrunk Polygon-Based Approach in Scene Text Detection. 6983-6996 - Aihua Mao, Zhi Yang, Ken Lin, Jun Xuan, Yong-Jin Liu:
Positional Attention Guided Transformer-Like Architecture for Visual Question Answering. 6997-7009 - Haijin Zeng, Jize Xue, Hiep Luong, Wilfried Philips:
Multimodal Core Tensor Factorization and its Applications to Low-Rank Tensor Completion. 7010-7024 - Fengda Hao, Jiaojiao Li, Rui Song, Yunsong Li, Kailang Cao:
Structure-Aware Graph Convolution Network for Point Cloud Parsing. 7025-7036 - Wei Huang, Yintao Zhou, Yiu-ming Cheung, Peng Zhang, Yufei Zha, Meng Pang:
Facial Expression Guided Diagnosis of Parkinson's Disease via High-Quality Data Augmentation. 7037-7050 - Wuyang Li, Xinyu Liu, Yixuan Yuan:
SCAN++: Enhanced Semantic Conditioned Adaptation for Domain Adaptive Object Detection. 7051-7061 - Qingrong Cheng, Keyu Wen, Xiaodong Gu:
Vision-Language Matching for Text-to-Image Synthesis via Generative Adversarial Networks. 7062-7075 - Cankun Zhong, Wing W. Y. Ng:
A Robust Frequency-Domain-Based Graph Adaptive Network for Parkinson's Disease Detection From Gait Data. 7076-7088 - Zhonghong Ou, Zhongjie Chen, Shengyi Shen, Lina Fan, Siyuan Yao, Meina Song, Pan Hui:
Free$\rm ^{3}$Net: Gliding Free, Orientation Free, and Anchor Free Network for Oriented Object Detection. 7089-7100 - Yuchen Hong, Youwei Lyu, Si Li, Gang Cao, Boxin Shi:
Reflection Removal With NIR and RGB Image Feature Fusion. 7101-7112 - Lu Yang, Qing Song, Zhihui Wang, Zhiwei Liu, Songcen Xu, Zhihao Li:
Quality-Aware Network for Human Parsing. 7128-7138 - SangEun Lee, Chaeeun Ryu, Eunil Park:
OSANet: Object Semantic Attention Network for Visual Sentiment Analysis. 7139-7148 - Fan Liu, Huilin Chen, Zhiyong Cheng, Anan Liu, Liqiang Nie, Mohan S. Kankanhalli:
Disentangled Multimodal Representation Learning for Recommendation. 7149-7159 - Yuanwei Zhu, Yakun Huang, Xiuquan Qiao, Zhijie Tan, Boyuan Bai, Huadong Ma, Schahram Dustdar:
A Semantic-Aware Transmission With Adaptive Control Scheme for Volumetric Video Service. 7160-7172 - Yue Zhang, Chao Liang, Longxiang Jiang:
Confidence-Aware Active Feedback for Interactive Instance Search. 7173-7184 - Soushi Ueno, Takuya Fujihashi, Toshiaki Koike-Akino, Takashi Watanabe:
Point Cloud Soft Multicast for Untethered XR Users. 7185-7195 - Bianca Jansen Van Rensburg, William Puech, Jean-Pierre Pedeboy:
A Format Compliant Encryption Method for 3D Objects Allowing Hierarchical Decryption. 7196-7207 - Yuqi Bu, Liuwu Li, Jiayuan Xie, Qiong Liu, Yi Cai, Qingbao Huang, Qing Li:
Scene-Text Oriented Referring Expression Comprehension. 7208-7221 - Yan Wang, Tongtong Su, Yusen Li, Jiuwen Cao, Gang Wang, Xiaoguang Liu:
DDistill-SR: Reparameterized Dynamic Distillation Network for Lightweight Image Super-Resolution. 7222-7234 - Dawei Zhao, Qingwei Gao, Yixiang Lu, Dong Sun:
Non-Aligned Multi-View Multi-Label Classification via Learning View-Specific Labels. 7235-7247 - Lianli Gao, Qike Zhao, Junchen Zhu, Sitong Su, Lechao Cheng, Lei Zhao:
From External to Internal: Structuring Image for Text-to-Image Attributes Manipulation. 7248-7261 - Yangfan Sun, Li Li, Zhu Li, Shizheng Wang, Shan Liu, Ge Li:
Learning a Compact Spatial-Angular Representation for Light Field. 7262-7273 - Yiyun Chen, Yunmeng Liu, Mingliang Chen, Zirui Wang, Wenming Yang, Qingmin Liao:
Blind JPEG Compression Artifacts Removal by Integrating Channel Regulation With Exit Strategy. 7274-7286 - Yan Bai, Jile Jiao, Yihang Lou, Shengsen Wu, Jun Liu, Xuetao Feng, Ling-Yu Duan:
Dual-Tuning: Joint Prototype Transfer and Structure Regularization for Compatible Feature Learning. 7287-7298 - Ruisong Zhang, Weize Quan, Yong Zhang, Jue Wang, Dong-Ming Yan:
W-Net: Structure and Texture Interaction for Image Inpainting. 7299-7310 - Xihua Sheng, Jiahao Li, Bin Li, Li Li, Dong Liu, Yan Lu:
Temporal Context Mining for Learned Video Compression. 7311-7322 - Zhening Xing, Yuchen Wu, Si Liu, Shangzhe Di, Huimin Ma:
Virtual Try-On With Garment Self-Occlusion Conditions. 7323-7336 - Jiaxiang Chen, Jiayuan Fan, Hancheng Ye, Jie Li, Yongbin Liao, Tao Chen:
Exploring Kernel-Based Texture Transfer for Pose-Guided Person Image Generation. 7337-7349 - Tong Qiao, Jiasheng Wu, Ning Zheng, Ming Xu, Xiangyang Luo:
FGDNet: Fine-Grained Detection Network Towards Face Anti-Spoofing. 7350-7363 - Jun Jia, Zhongpai Gao, Dandan Zhu, Xiongkuo Min, Menghan Hu, Guangtao Zhai:
RIVIE: Robust Inherent Video Information Embedding. 7364-7377 - Biwei Cao, Jiuxin Cao, Jie Gui, Jiayun Shen, Bo Liu, Lei He, Yuan Yan Tang, James Tin-Yau Kwok:
AlignVE: Visual Entailment Recognition Based on Alignment Relations. 7378-7387 - Guoqiang Gong, Linchao Zhu, Yadong Mu:
Language-Guided Multi-Granularity Context Aggregation for Temporal Sentence Grounding. 7402-7414 - Fuxiang Huang, Lei Zhang, Yuhang Zhou, Xinbo Gao:
Adversarial and Isotropic Gradient Augmentation for Image Retrieval With Text Feedback. 7415-7427 - Chengyin Xu, Zenghao Chai, Zhengzhuo Xu, Hongjia Li, Qiruyi Zuo, Lingyu Yang, Chun Yuan:
HHF: Hashing-Guided Hinge Function for Deep Hashing Retrieval. 7428-7440 - Ziming Liu, Song Guo, Jingcai Guo, Yuanyuan Xu, Fushuo Huo:
Towards Unbiased Multi-Label Zero-Shot Learning With Pyramid and Semantic Attention. 7441-7455 - Pan Yang, Xiong Luo, Jiankun Sun:
A Simple but Effective Method for Balancing Detection and Re-Identification in Multi-Object Tracking. 7456-7468 - Xiang Li, Jinglu Wang, Xiao Li, Yan Lu:
Video Instance Segmentation by Instance Flow Assembly. 7469-7479 - Masum Shah Junayed, Md Baharul Islam:
Consistent Video Inpainting Using Axial Attention-Based Style Transformer. 7494-7504 - Jiaxiang Wang, Chenglong Li, Aihua Zheng, Jin Tang, Bin Luo:
Looking and Hearing Into Details: Dual-Enhanced Siamese Adversarial Network for Audio-Visual Matching. 7505-7516 - Xiang Fang, Daizong Liu, Pan Zhou, Yuchong Hu:
Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval. 7517-7532 - Kai Yang, Haijun Zhang, Feng Gao, Jianyang Shi, Yanfeng Zhang, Q. M. Jonathan Wu:
DETA: A Point-Based Tracker With Deformable Transformer and Task-Aligned Learning. 7545-7558 - Hezhen Hu, Junfu Pu, Wengang Zhou, Houqiang Li:
Collaborative Multilingual Continuous Sign Language Recognition: A Unified Framework. 7559-7570 - Han Fang, Zhaoyang Jia, Yupeng Qiu, Jiyi Zhang, Weiming Zhang, Ee-Chien Chang:
De-END: Decoder-Driven Watermarking Network. 7571-7581 - Yiran Yang, Xian Sun, Wenhui Diao, Xuee Rong, Shiyao Yan, Dongshuo Yin, Xinming Li:
Optimal Partition Assignment for Universal Object Detection. 7582-7593 - Zhao Xie, Jiansong Chen, Kewei Wu, Dan Guo, Richang Hong:
Global Temporal Difference Network for Action Recognition. 7594-7606 - Yucheng Zhu, Yunhao Li, Wei Sun, Xiongkuo Min, Guangtao Zhai, Xiaokang Yang:
Blind Image Quality Assessment via Cross-View Consistency. 7607-7620 - Wenhui Zhou, Hua Zhang, Zhengmao Yan, Weisheng Wang, Lili Lin:
DecoupledPoseNet: Cascade Decoupled Pose Learning for Unsupervised Camera Ego-Motion Estimation. 1636-1648 - Xiaobao Guo, Adams Wai-Kin Kong, Alex C. Kot:
Deep Multimodal Sequence Fusion by Regularized Expressive Representation Distillation. 2085-2096 - Sheng Huang, Yunhe Zhang, Lele Fu, Shiping Wang:
Learnable Multi-View Matrix Factorization With Graph Embedding and Flexible Loss. 3259-3272 - Xiao Fu, Hangyu Deng, Xin Yuan, Jinglu Hu:
Generating High Coherence Monophonic Music Using Monte-Carlo Tree Search. 3763-3772 - Jixiang Gao, Jingjing Chen, Huazhu Fu, Yu-Gang Jiang:
Dynamic Mixup for Multi-Label Long-Tailed Food Ingredient Recognition. 4764-4773 - Haichao Yao, Rongrong Ni, Hadi Amirpour, Christian Timmerer, Yao Zhao:
Detection and Localization of Video Transcoding From AVC to HEVC Based on Deep Representations of Decoded Frames and PU Maps. 5014-5029 - Yangyang Li, Wei Zhai, Yang Cao, Zheng-Jun Zha:
Location-Free Camouflage Generation Network. 5234-5247 - Bin Wang, Chunsheng Liu, Faliang Chang, Wenqian Wang, Nanjun Li:
AE-Net:Adjoint Enhancement Network for Efficient Action Recognition in Video Understanding. 5458-5468 - Xiaopeng Li, Xiaojie Guo:
SPN2D-GAN: Semantic Prior Based Night-to-Day Image-to-Image Translation. 7621-7634 - Xinhui Li, Mingjia Li, Xiaopeng Li, Xiaojie Guo:
Learning Generalized Knowledge From a Single Domain on Urban-Scene Segmentation. 7635-7646 - Yujian Feng, Jian Yu, Feng Chen, Yimu Ji, Fei Wu, Shangdong Liu, Xiao-Yuan Jing:
Visible-Infrared Person Re-Identification via Cross-Modality Interaction Transformer. 7647-7659 - Shuwei Shao, Ran Li, Zhongcai Pei, Zhong Liu, Weihai Chen, Wentao Zhu, Xingming Wu, Baochang Zhang:
Towards Comprehensive Monocular Depth Estimation: Multiple Heads are Better Than One. 7660-7671 - Yipo Huang, Leida Li, Yuzhe Yang, Yaqian Li, Yandong Guo:
Explainable and Generalizable Blind Image Quality Assessment via Semantic Attribute Reasoning. 7672-7685 - Ashutosh Kulkarni, Prashant W. Patil, Subrahmanyam Murala, Sunil Gupta:
Unified Multi-Weather Visibility Restoration. 7686-7698 - Zhong Ji, Junhua Hu, Deyin Liu, Lin Yuanbo Wu, Ye Zhao:
Asymmetric Cross-Scale Alignment for Text-Based Person Search. 7699-7709 - Jiahao Hong, Wei Zhang, Zhiwei Feng, Wenqiang Zhang:
Dual Cross-Attention for Video Object Segmentation via Uncertainty Refinement. 7710-7725 - Jian Wang, Xinyue Li, Zhichao Zhang, Wei Song, Weiqi Guo:
Ranked Similarity Weighting and Top-nk Sampling in Deep Metric Learning. 7726-7735 - Yiming Bao, Xu Zhao, Dahong Qian:
FusePose: IMU-Vision Sensor Fusion in Kinematic Space for Parametric Human Pose Estimation. 7736-7746 - Fan Luo, Shaoxiang Chen, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
Self-Supervised Learning for Semi-Supervised Temporal Language Grounding. 7747-7757 - Zheren Fu, Zhendong Mao, Bo Hu, An-An Liu, Yongdong Zhang:
Intra-Class Adaptive Augmentation With Neighbor Correction for Deep Metric Learning. 7758-7771 - Han Fang, Pengfei Xiong, Luhui Xu, Wenhan Luo:
Transferring Image-CLIP to Video-Text Retrieval via Temporal Relations. 7772-7785 - Yulan Guo, Yun Wang, Longguang Wang, Zi Wang, Chen Cheng:
CVCNet: Learning Cost Volume Compression for Efficient Stereo Matching. 7786-7799 - Zhishe Wang, Wenyu Shao, Yanlin Chen, Jiawei Xu, Xiaoqin Zhang:
Infrared and Visible Image Fusion via Interactive Compensatory Attention Adversarial Learning. 7800-7813 - Dong Chen, Yueting Zhuang, Zijin Shen, Carl Yang, Guoming Wang, Siliang Tang, Yi Yang:
Cross-Modal Data Augmentation for Tasks of Different Modalities. 7814-7824 - Rahul Sharma, Krishna Somandepalli, Shrikanth Narayanan:
Cross Modal Video Representations for Weakly Supervised Active Speaker Localization. 7825-7836 - Yonghao Xu, Fengxiang He, Bo Du, Dacheng Tao, Liangpei Zhang:
Self-Ensembling GAN for Cross-Domain Semantic Segmentation. 7837-7850 - Fengyong Li, Zhenjia Pei, Xinpeng Zhang, Chuan Qin:
Image Manipulation Localization Using Multi-Scale Feature Fusion and Adaptive Edge Supervision. 7851-7866 - Nan Xu, Junyan Wang, Yuan Tian, Ruike Zhang, Wenji Mao:
AnANet: Association and Alignment Network for Modeling Implicit Relevance in Cross-Modal Correlation Classification. 7867-7880 - Weiwei Xing, Jie Yao, Zixia Liu, Weibin Liu, Shunli Zhang, Liqiang Wang:
Contrastive JS: A Novel Scheme for Enhancing the Accuracy and Robustness of Deep Models. 7881-7893 - Zhu Liu, Teng Wang, Jinrui Zhang, Feng Zheng, Wenhao Jiang, Ke Lu:
Show, Tell and Rephrase: Diverse Video Captioning via Two-Stage Progressive Training. 7894-7905 - Lingwei Wei, Dou Hu, Wei Zhou, Songlin Hu:
Modeling Both Intra- and Inter-Modality Uncertainty for Multimodal Fake News Detection. 7906-7916 - Ziyi Tang, Ruimao Zhang, Zhanglin Peng, Jinrui Chen, Liang Lin:
Multi-Stage Spatio-Temporal Aggregation Transformer for Video Person Re-Identification. 7917-7929 - Jing Chen, Linlin Chen, Huanqiang Zeng, Chih-Hsien Hsia, Tianlei Wang, Kai-Kuang Ma:
3D-Gradient Guided Rate Control Model for Screen Content Video Coding. 7930-7942 - Fei Wu, Qingzhong Wang, Jiang Bian, Ning Ding, Feixiang Lu, Jun Cheng, Dejing Dou, Haoyi Xiong:
A Survey on Video Action Recognition in Sports: Datasets, Methods and Applications. 7943-7966 - Yang Zhang, Xian Zhang, Canghong Shi, Xi Wu, Xiaojie Li, Jing Peng, Kunlin Cao, Jiancheng Lv, Jiliu Zhou:
Pluralistic Face Inpainting With Transformation of Attribute Information. 7967-7979 - Man Zhang, Yong Zhou, Bing Liu, Jiaqi Zhao, Rui Yao, Zhiwen Shao, Hancheng Zhu:
Weakly Supervised Few-Shot Semantic Segmentation via Pseudo Mask Enhancement and Meta Learning. 7980-7991 - Xiaowei Chen, Guoliang Fan:
Indoor Camera Pose Estimation From Room Layouts and Image Outer Corners. 7992-8005 - Renjie Wan, Boxin Shi, Wenhan Yang, Bihan Wen, Ling-Yu Duan, Alex C. Kot:
Purifying Low-Light Images via Near-Infrared Enlightened Image. 8006-8019 - Jiarun Song, Xionghui Mao, Fuzheng Yang:
The Impact of Black Edge Artifact on QoE of the FOV-Based Cloud VR Services. 8020-8035 - Yongqing Zhu, Xiangyang Li, Mao Zheng, Jiahao Yang, Zihan Wang, Xiaoqian Guo, Zifeng Chai, Yuchen Yuan, Shuqiang Jiang:
Focus and Align: Learning Tube Tokens for Video-Language Pre-Training. 8036-8050 - Hui Zhang, Junkun Tang, Yihong Cao, Yurong Chen, Yaonan Wang, Q. M. Jonathan Wu:
Cycle Consistency Based Pseudo Label and Fine Alignment for Unsupervised Domain Adaptation. 8051-8063 - Xichuan Zhou, Rui Ding, Yuxiao Wang, Wenjia Wei, Haijun Liu:
Cellular Binary Neural Network for Accurate Image Classification and Semantic Segmentation. 8064-8075 - Jing Xiao, Kangmin Xu, Mengshun Hu, Liang Liao, Zheng Wang, Chia-Wen Lin, Mi Wang, Shin'ichi Satoh:
Progressive Motion Boosting for Video Frame Interpolation. 8076-8090 - Xiaokai Yi, Hanli Wang, Sam Kwong, C.-C. Jay Kuo:
Task-Driven Video Compression for Humans and Machines: Framework Design and Optimization. 8091-8102 - Kangle Wu, Jun Chen, Yang Yu, Jiayi Ma:
ACE-MEF: Adaptive Clarity Evaluation-Guided Network With Illumination Correction for Multi-Exposure Image Fusion. 8103-8118 - Chen Zhou, Min Jiang, Jun Kong:
BGTracker: Cross-Task Bidirectional Guidance Strategy for Multiple Object Tracking. 8132-8144 - Tianshu Song, Leida Li, Jinjian Wu, Yuzhe Yang, Yaqian Li, Yandong Guo, Guangming Shi:
Knowledge-Guided Blind Image Quality Assessment With Few Training Samples. 8145-8156 - Xuesong Wang, Ke Jin, Kun Yu, Yuhu Cheng:
Asymmetric Training in RealnessGAN. 8157-8169 - Bin Cui, Zhuang Shao, Wei Tao, Hui Zhao:
Hole Inpainting Algorithm for Half-Organized Point Cloud Obtained by Structured-Light Section System. 8170-8182 - Peng Li, Jing Gao, Jianing Zhang, Shan Jin, Zhikui Chen:
Deep Reinforcement Clustering. 8183-8193 - Qiaosong Qi, Aixi Zhang, Yue Liao, Wenyu Sun, Yongliang Wang, Xiaobo Li, Si Liu:
Simultaneously Training and Compressing Vision-and-Language Pre-Training Model. 8194-8203 - Hoda Roodaki, Mahdi Nazm Bojnordi:
Compressed Geometric Arrays for Point Cloud Processing. 8204-8211 - Ke Zhang, Miao Long, Jie Chen, Mingzhu Liu, Jingjing Li:
CFPNet: A Denoising Network for Complex Frequency Band Signal Processing. 8212-8224 - Hao Cheng, Joey Tianyi Zhou, Wee Peng Tay, Bihan Wen:
Graph Neural Networks With Triple Attention for Few-Shot Learning. 8225-8239 - Bo Seok Shim, Jae Hong Choe, Jong-Uk Hou:
Source Identification of 3D Printer Based on Layered Texture Encoders. 8240-8252 - Yibo Zhao, Hua Zhang, Zan Gao, Wen Gao, Meng Wang, Shengyong Chen:
A Novel Action Saliency and Context-Aware Network for Weakly-Supervised Temporal Action Localization. 8253-8266 - Xiaowei Zhao, Xianglong Liu, Yuqing Ma, Shihao Bai, Yifan Shen, Zeyu Hao, Aishan Liu:
Temporal Speciation Network for Few-Shot Object Detection. 8267-8278 - Jun Cheng, Fuxiang Wu, Liu Liu, Qieshi Zhang, Leszek Rutkowski, Dacheng Tao:
InDecGAN: Learning to Generate Complex Images From Captions via Independent Object-Level Decomposition and Enhancement. 8279-8293 - Jie Li, Qi Song, Xiaohu Yan, Yongquan Chen, Rui Huang:
From Front to Rear: 3D Semantic Scene Completion Through Planar Convolution and Attention-Based Network. 8294-8307 - Binglu Wang, Kang Yang, Yongqiang Zhao, Teng Long, Xuelong Li:
Prototype-Based Intent Perception. 8308-8319 - Wanjie Li, Hongxia Wang, Yijing Chen, Sani M. Abdullahi, Jie Luo:
Constructing Immunized Stego-Image for Secure Steganography via Artificial Immune System. 8320-8333 - Xiang-Jun Shen, Yanan Cai, Stanley Ebhohimhen Abhadiomhen, Zhifeng Liu, Yongzhao Zhan, Jianping Fan:
Deep Robust Low Rank Correlation With Unifying Clustering Structure for Cross Domain Adaptation. 8334-8345 - Yahui Xu, Yi Bin, Jiwei Wei, Yang Yang, Guoqing Wang, Heng Tao Shen:
Multi-Modal Transformer With Global-Local Alignment for Composed Query Image Retrieval. 8346-8357 - Xiufang Li, Qigong Sun, Licheng Jiao, Fang Liu, Xu Liu, Lingling Li, Puhua Chen, Yi Zuo:
$D^{3}K$: Dynastic Data-Free Knowledge Distillation. 8358-8371 - Yun Li, Zhe Liu, Xiaojun Chang, Julian J. McAuley, Lina Yao:
Diversity-Boosted Generalization-Specialization Balancing for Zero-Shot Learning. 8372-8382 - Jun Rao, Liang Ding, Shuhan Qi, Meng Fang, Yang Liu, Li Shen, Dacheng Tao:
Dynamic Contrastive Distillation for Image-Text Retrieval. 8383-8395 - Jianxin Lin, Lianying Yin, Yijun Wang:
Steformer: Efficient Stereo Image Super-Resolution With Transformer. 8396-8407 - Xiaofeng Yang, Fayao Liu, Guosheng Lin:
Effective End-to-End Vision Language Pretraining With Semantic Visual Loss. 8408-8417 - Pierre R. Lebreton, Kazuhisa Yamagishi:
Quitting Ratio-Based Bitrate Ladder Selection Mechanism for Adaptive Bitrate Video Streaming. 8418-8431 - Tengfei Liang, Yi Jin, Wu Liu, Yidong Li:
Cross-Modality Transformer With Modality Mining for Visible-Infrared Person Re-Identification. 8432-8444 - Guoqing Ma, Yalong Bai, Wei Zhang, Ting Yao, Basem Shihada, Tao Mei:
Boosting Generic Visual-Linguistic Representation With Dynamic Contexts. 8445-8457 - Zhiqing Guo, Gaobo Yang, Jiyou Chen, Xingming Sun:
Exposing Deepfake Face Forgeries With Guided Residuals. 8458-8470 - Xingyan Chen, Mu Wang, Changqiao Xu, Yu Zhao, Shujie Yang, Ke Jiang, Qing Li, Lujie Zhong, Gabriel-Miro Muntean:
FedLive: A Federated Transmission Framework for Panoramic Livecast With Reinforced Variational Inference. 8471-8486 - Yang Yu, Xiaohui Zhao, Rongrong Ni, Siyuan Yang, Yao Zhao, Alex C. Kot:
Augmented Multi-Scale Spatiotemporal Inconsistency Magnifier for Generalized DeepFake Detection. 8487-8498 - Wentao Tan, Lei Zhu, Jingjing Li, Zheng Zhang, Huaxiang Zhang:
Partial Multi-Modal Hashing via Neighbor-Aware Completion Learning. 8499-8510 - Qi Mao, Siwei Ma:
Enhancing Style-Guided Image-to-Image Translation via Self-Supervised Metric Learning. 8511-8526 - Haoyu Tian, Xin Ma, Xiang Li, Yibin Li:
Skeleton-Based Action Recognition With Select-Assemble-Normalize Graph Convolutional Networks. 8527-8538 - Daizong Liu, Xiang Fang, Wei Hu, Pan Zhou:
Exploring Optical-Flow-Guided Motion and Detection-Based Appearance for Temporal Sentence Grounding. 8539-8553 - Qiqi Bao, Yunmeng Liu, Bowen Gang, Wenming Yang, Qingmin Liao:
SCTANet: A Spatial Attention-Guided CNN-Transformer Aggregation Network for Deep Face Image Super-Resolution. 8554-8565 - Zhaoyang Wang, Guanghua Liu, Dongchen Zhang, Xinhai Hua, Lingmin Xu, Peng Gao, Tao Jiang:
Edge-Assisted Massive Video Delivery Over Cell-Free Massive MIMO. 8566-8579 - Zhihao Wu, Xincan Lin, Zhenghong Lin, Zhaoliang Chen, Yang Bai, Shiping Wang:
Interpretable Graph Convolutional Network for Multi-View Semi-Supervised Learning. 8593-8606 - Yaolin Yang, Hongjie He, Fan Chen, Yuan Yuan, Ningxiong Mao:
Reversible Data Hiding in Encrypted Images Based on Time-Varying Huffman Coding Table. 8607-8619 - Hongchen Tan, Baocai Yin, Kun Wei, Xiuping Liu, Xin Li:
ALR-GAN: Adaptive Layout Refinement for Text-to-Image Synthesis. 8620-8631 - Huaiwen Zhang, Yang Yang, Fan Qi, Shengsheng Qian, Changsheng Xu:
Robust Video-Text Retrieval Via Noisy Pair Calibration. 8632-8645 - Zheng Wang, Xing Xu, Guoqing Wang, Yang Yang, Heng Tao Shen:
Quaternion Relation Embedding for Scene Graph Generation. 8646-8656 - Binxin Yang, Xuejin Chen, Chaoqun Wang, Chi Zhang, Zihan Chen, Xiaoyan Sun:
Semantics-Preserving Sketch Embedding for Face Generation. 8657-8671 - Mingde Yao, Dongliang He, Xin Li, Zhihong Pan, Zhiwei Xiong:
Bidirectional Translation Between UHD-HDR and HD-SDR Videos. 8672-8686 - Chen Pang, Xuequan Lu, Lei Lyu:
Skeleton-Based Action Recognition Through Contrasting Two-Stream Spatial-Temporal Networks. 8699-8711 - Zhenhua Tang, Jia Li, Yanbin Hao, Richang Hong:
MLP-JCG: Multi-Layer Perceptron With Joint-Coordinate Gating for Efficient 3D Human Pose Estimation. 8712-8724 - Yunhao Du, Zhicheng Zhao, Yang Song, Yanyun Zhao, Fei Su, Tao Gong, Hongying Meng:
StrongSORT: Make DeepSORT Great Again. 8725-8737 - Shaowei Weng, Ye Zhou, Tiancong Zhang, Mengyao Xiao, Yao Zhao:
Reversible Data Hiding for JPEG Images With Adaptive Multiple Two-Dimensional Histogram and Mapping Generation. 8738-8752 - Zhuang Shao, Jungong Han, Kurt Debattista, Yanwei Pang:
Textual Context-Aware Dense Captioning With Diverse Words. 8753-8766 - Huibin Lin, Chun-Yang Zhang, Shiping Wang, Wenzhong Guo:
A Probabilistic Contrastive Framework for Semi-Supervised Learning. 8767-8779 - Aswathy Madhu, Suresh Kumaraswamy:
RQNet: Residual Quaternion CNN for Performance Enhancement in Low Complexity and Device Robust Acoustic Scene Classification. 8780-8792 - Hao Liu, Yanni Ma, Qingyong Hu, Yulan Guo:
CenterTube: Tracking Multiple 3D Objects With 4D Tubelets in Dynamic Point Clouds. 8793-8804 - Guoguang Hua, Muxin Liao, Shishun Tian, Yuhang Zhang, Wenbin Zou:
Multiple Relational Learning Network for Joint Referring Expression Comprehension and Segmentation. 8805-8816 - Rui Ma, Qingbo Wu, King Ngi Ngan, Hongliang Li, Fanman Meng, Linfeng Xu:
Forgetting to Remember: A Scalable Incremental Learning Framework for Cross-Task Blind Image Quality Assessment. 8817-8827 - Shengbin Yue, Yunbin Tu, Liang Li, Ying Yang, Shengxiang Gao, Zhengtao Yu:
I3N: Intra- and Inter-Representation Interaction Network for Change Captioning. 8828-8841 - Baptiste Chopin, Hao Tang, Naima Otberdout, Mohamed Daoudi, Nicu Sebe:
Interaction Transformer for Human Reaction Generation. 8842-8854 - Yongli Chang, Sumei Li, Anqi Liu, Jie Jin, Wei Xiang:
Coarse-to-Fine Feedback Guidance Based Stereo Image Quality Assessment Considering Dominant Eye Fusion. 8855-8867 - Depu Meng, Changqian Yu, Jiajun Deng, Deheng Qian, Houqiang Li, Dongchun Ren:
Hybrid Motion Representation Learning for Prediction From Raw Sensor Data. 8868-8879 - Yuxuan Liu, Jianxin Yang, Xiao Gu, Yijun Chen, Yao Guo, Guang-Zhong Yang:
EgoFish3D: Egocentric 3D Pose Estimation From a Fisheye Camera via Self-Supervised Learning. 8880-8891 - Weicheng Xie, Wenya Lu, Zhibin Peng, Linlin Shen:
Consistency Preservation and Feature Entropy Regularization for GAN Based Face Editing. 8892-8905 - Jiayu Jiao, Yu-Ming Tang, Kun-Yu Lin, Yipeng Gao, Andy J. Ma, Yaowei Wang, Wei-Shi Zheng:
DilateFormer: Multi-Scale Dilated Transformer for Visual Recognition. 8906-8919 - Shuo Wang, Zhihao Wu, Xiaobo Hu, Youfang Lin, Kai Lv:
Skill-Based Hierarchical Reinforcement Learning for Target Visual Navigation. 8920-8932 - Chen Chen, Dan Wang, Bin Song, Hao Tan:
Inter-Intra Modal Representation Augmentation With DCT-Transformer Adversarial Network for Image-Text Matching. 8933-8945 - Xiaoqi Wang, Jian Xiong, Weisi Lin:
Visual Interaction Perceptual Network for Blind Image Quality Assessment. 8958-8971 - Jun Zhang, Licheng Jiao, Wenping Ma, Fang Liu, Xu Liu, Lingling Li, Puhua Chen, Shuyuan Yang:
Transformer Based Conditional GAN for Multimodal Image Fusion. 8988-9001 - Zhiwu Qing, Ziyuan Huang, Shiwei Zhang, Mingqian Tang, Changxin Gao, Rong Jin, Marcelo H. Ang, Nong Sang:
ParamCrop: Parametric Cubic Cropping for Video Contrastive Learning. 9002-9014 - Qin Xu, Jiahui Wang, Bo Jiang, Bin Luo:
Fine-Grained Visual Classification via Internal Ensemble Learning Transformer. 9015-9028 - Chuang Yang, Mulin Chen, Yuan Yuan, Qi Wang:
Text Growing on Leaf. 9029-9043 - Zhihua Wang, Qiuping Jiang, Shanshan Zhao, Wensen Feng, Weisi Lin:
Deep Blind Image Quality Assessment Powered by Online Hard Example Mining. 4774-4784 - Qiang Zhai, Fan Yang, Xin Li, Guo-Sen Xie, Hong Cheng, Zicheng Liu:
Co-Communication Graph Convolutional Network for Multi-View Crowd Counting. 5813-5825 - Xiaohan Wang, Linchao Zhu, Zhedong Zheng, Mingliang Xu, Yi Yang:
Align and Tell: Boosting Text-Video Retrieval With Local Alignment and Fine-Grained Supervision. 6079-6089 - Sadbhawna Thakur, Vinit Jakhetiya, Badri N. Subudhi, Sunil Prasad Jaiswal, Leida Li, Weisi Lin:
Context Region Identification Based Quality Assessment of 3D Synthesized Views. 6183-6193 - Ran Yi, Zipeng Ye, Zhiyao Sun, Juyong Zhang, Guo-Xin Zhang, Pengfei Wan, Hujun Bao, Yong-Jin Liu:
Predicting Personalized Head Movement From Short Video and Speech Signal. 6315-6328 - Abdelhak Bentaleb, Mehmet N. Akcay, May Lim, Ali C. Begen, Roger Zimmermann:
BoB: Bandwidth Prediction for Real-Time Communications Using Heuristic and Reinforcement Learning. 6930-6945 - Jun Chen, Meng Yang, Wenping Gong, Yang Yu:
Multi-Neighborhood Guided Kendall Rank Correlation Coefficient for Feature Matching. 7113-7127 - Chen Du, Sarah Graham, Colin Depp, Truong Q. Nguyen:
View-Invariant Center-of-Pressure Metrics Estimation With Monocular RGB Camera. 7388-7401 - Xunquan Chen, Xuexin Xu, Jinhui Chen, Zhihong Zhang, Tetsuya Takiguchi, Edwin R. Hancock:
Speaker-Independent Emotional Voice Conversion via Disentangled Representations. 7480-7493 - Wei Chen, Haoyang Xu, Nan Pu, Yu Liu, Mingrui Lao, Weiping Wang, Li Liu, Michael S. Lew:
Lifelong Fine-Grained Image Retrieval. 7533-7544 - Ying Fu, Zichun Wang, Tao Zhang, Jun Zhang:
Low-Light Raw Video Denoising With a High-Quality Realistic Motion Dataset. 8119-8131 - Huafeng Liu, Pai Peng, Tao Chen, Qiong Wang, Yazhou Yao, Xian-Sheng Hua:
FECANet: Boosting Few-Shot Semantic Segmentation With Feature-Enhanced Context-Aware Network. 8580-8592 - Zehua Ma, Xi Yang, Han Fang, Weiming Zhang, Nenghai Yu:
OAcode: Overall Aesthetic 2D Barcode on Screen. 8687-8698 - Rong-Cheng Tu, Xian-Ling Mao, Qinghong Lin, Wenjin Ji, Weize Qin, Wei Wei, Heyan Huang:
Unsupervised Cross-Modal Hashing via Semantic Text Mining. 8946-8957 - Jun Xiao, Xinyang Jiang, Ningxin Zheng, Huan Yang, Yifan Yang, Yuqing Yang, Dongsheng Li, Kin-Man Lam:
Online Video Super-Resolution With Convolutional Kernel Bypass Grafts. 8972-8987 - Quanling Meng, Shengping Zhang, Zonglin Li, Chenyang Wang, Weigang Zhang, Qingming Huang:
Automatic Shadow Generation via Exposure Fusion. 9044-9056 - Yukun Zuo, Hantao Yao, Liansheng Zhuang, Changsheng Xu:
Dual Structural Knowledge Interaction for Domain Adaptation. 9057-9070 - Xue-Ying Ding, Xiao-Qian Liu, Xin Luo, Xin-Shun Xu:
DOC: Text Recognition via Dual Adaptation and Clustering. 9071-9081 - Kaiyi Luo, Chao Zhang, Huaxiong Li, Xiuyi Jia, Chunlin Chen:
Adaptive Marginalized Semantic Hashing for Unpaired Cross-Modal Retrieval. 9082-9095 - Yuxiang Yang, Xing Tian, Wing W. Y. Ng, Ying Gao:
Knowledge Distillation Hashing for Occluded Face Retrieval. 9096-9107 - Dongyun Lin, Yiqun Li, Yi Cheng, Shitala Prasad, Aiyuan Guo, Yanpeng Cao:
Multi-Range View Aggregation Network With Vision Transformer Feature Fusion for 3D Object Retrieval. 9108-9119 - Peiguang Jing, Kai Cui, Weili Guan, Liqiang Nie, Yuting Su:
Category-Aware Multimodal Attention Network for Fashion Compatibility Modeling. 9120-9131 - Yuan Zhang, Lingjun Pu, Tao Lin, Jinyao Yan:
QoE-Oriented Mobile Virtual Reality Game in Distributed Edge Networks. 9132-9146 - Hao Chen, Xiu-Shen Wei, Liang Xiao:
Prototype Learning for Automatic Check-Out. 9147-9160 - Yanzhao Xie, Rukai Wei, Jingkuan Song, Yu Liu, Yangtao Wang, Ke Zhou:
Label-Affinity Self-Adaptive Central Similarity Hashing for Image Retrieval. 9161-9174 - Guanyu Zhu, Yong Zhou, Rui Yao, Hancheng Zhu:
Cross-Class Bias Rectification for Point Cloud Few-Shot Segmentation. 9175-9188 - Jie Guo, Meiting Wang, Yan Zhou, Bin Song, Yuhao Chi, Wei Fan, Jianglong Chang:
HGAN: Hierarchical Graph Alignment Network for Image-Text Retrieval. 9189-9202 - Shunxin Xiao, Shide Du, Zhaoliang Chen, Yunhe Zhang, Shiping Wang:
Dual Fusion-Propagation Graph Neural Network for Multi-View Clustering. 9203-9215 - Tiankai Hang, Huan Yang, Bei Liu, Jianlong Fu, Xin Geng, Baining Guo:
Language-Guided Face Animation by Recurrent StyleGAN-Based Generator. 9216-9227 - Wenda Zhao, Fei Wei, Haipeng Wang, You He, Huchuan Lu:
Full-Scene Defocus Blur Detection With DeFBD+ via Multi-Level Distillation Learning. 9228-9240 - Yanxiong Li, Hao Chen, Wenchang Cao, Qisheng Huang, Qianhua He:
Few-Shot Speaker Identification Using Lightweight Prototypical Network With Feature Grouping and Interaction. 9241-9253 - Qiang Zhou, Chaohui Yu:
Object Detection Made Simpler by Eliminating Heuristic NMS. 9254-9262 - Yingjie Song, Zhi Liu, Gongyang Li, Dan Zeng, Tianhong Zhang, Lihua Xu, Jijun Wang:
RINet: Relative Importance-Aware Network for Fixation Prediction. 9263-9277 - Fuyun Wang, Xingyu Gao, Zhenyu Chen, Lei Lyu:
Contrastive Multi-Level Graph Neural Networks for Session-Based Recommendation. 9278-9289 - Shuwei Huo, Yuan Zhou, Ruolin Wang, Wei Xiang, Sun-Yuan Kung:
Semantic Relevance Learning for Video-Query Based Video Moment Retrieval. 9290-9301 - Pu Li, Marie A. Roch, Holger Klinck, Erica Fleishman, Douglas Gillespie, Eva-Marie Nosal, Yu Shiu, Xiaobai Liu:
Learning Stage-Wise GANs for Whistle Extraction in Time-Frequency Spectrograms. 9302-9314 - Ziqiang Wu, Bingpeng Ma, Hong Chang, Shiguang Shan:
Refined Knowledge Transfer for Language-Based Person Search. 9315-9329 - Wenbin Wang, Maurice Pagnucco, Chengpei Xu, Yang Song:
InterREC: An Interpretable Method for Referring Expression Comprehension. 9330-9342 - Kang Liu, Feng Xue, Dan Guo, Peijie Sun, Shengsheng Qian, Richang Hong:
Multimodal Graph Contrastive Learning for Multimedia-Based Recommendation. 9343-9355 - Jieyan Liu, Hongcai He, Mingzhu Liu, Jingjing Li, Ke Lu:
Manifold Regularized Joint Transfer for Open Set Domain Adaptation. 9356-9369 - Jingyuan Zhu, Huimin Ma, Jiansheng Chen, Jian Yuan:
MotionVideoGAN: A Novel Video Generator Based on the Motion Space Learned From Image Pairs. 9370-9382 - Weizhi Nie, Chuanqi Jiao, Rihao Chang, Lei Qu, An-An Liu:
CPG3D: Cross-Modal Priors Guided 3D Object Reconstruction. 9383-9396 - Zikang Yuan, Junda Cheng, Xin Yang:
CR-LDSO: Direct Sparse LiDAR-Assisted Visual Odometry With Cloud Reusing. 9397-9409 - Xiao Lv, Tao Xiang, Ying Yang, Hantao Liu:
Blind Dehazed Image Quality Assessment: A Deep CNN-Based Approach. 9410-9424 - Kun Xia, Le Wang, Yichao Shen, Sanping Zhou, Gang Hua, Wei Tang:
Exploring Action Centers for Temporal Action Localization. 9425-9436 - Jiawei Liu, Qiang Wang, Huijie Fan, Wentao Li, Liangqiong Qu, Yandong Tang:
A Decoupled Multi-Task Network for Shadow Removal. 9449-9463 - Xiaobao Guo, A. C. Kot, Adams Wai-Kin Kong:
Pace-Adaptive and Noise-Resistant Contrastive Learning for Multimodal Feature Fusion. 9437-9448 - Yangbo Feng, Junyu Gao, Shicai Yang, Changsheng Xu:
Spatial-Temporal Exclusive Capsule Network for Open Set Action Recognition. 9464-9478 - Yongchun Chen, Min Liu, Xueping Wang, Fei Wang, An-An Liu, Yaonan Wang:
Refining Noisy Labels With Label Reliability Perception for Person Re-Identification. 9479-9490 - Sebastiano Verde, Cecilia Pasquini, Federica Lago, Alessandro Goller, Francesco G. B. De Natale, Alessandro Piva, Giulia Boato:
Multi-Clue Reconstruction of Sharing Chains for Social Media Images. 9491-9505 - Shideng Lin, Fan Tang, Weiming Dong, Xingjia Pan, Changsheng Xu:
SMNet: Synchronous Multi-Scale Low Light Enhancement Network With Local and Global Concern. 9506-9517 - Yunbin Tu, Liang Li, Li Su, Ke Lu, Qingming Huang:
Neighborhood Contrastive Transformer for Change Captioning. 9518-9529 - Xiaoqing Liu, Huanqiang Zeng, Yifan Shi, Jianqing Zhu, Chih-Hsien Hsia, Kai-Kuang Ma:
Deep Cross-Modal Hashing Based on Semantic Consistent Ranking. 9530-9542 - Zhou Yu, Zitian Jin, Jun Yu, Mingliang Xu, Hongbo Wang, Jianping Fan:
Bilaterally Slimmable Transformer for Elastic and Efficient Visual Question Answering. 9543-9556 - Nian Hu, Xiangdong Huang, Wenhui Li, Xuanya Li, An-An Liu:
Cross-Domain Image-Object Retrieval Based on Weighted Optimal Transport. 9557-9571 - Chen Wan, Fangjun Huang, Xianfeng Zhao:
Average Gradient-Based Adversarial Attack. 9572-9585 - Hao Liu, Mei Ma, Zixian Gao, Zongyong Deng, Fengjun Li, Zhendong Li:
Siamese Graph Learning for Semi-Supervised Age Estimation. 9586-9596 - Xiaodong Wang, Zhedong Zheng, Yang He, Fei Yan, Zhiqiang Zeng, Yi Yang:
Progressive Local Filter Pruning for Image Retrieval Acceleration. 9597-9607
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.