IEEE Transactions on Multimedia, Volume 24
Volume 24, 2022
- Pan Gao, Pengwei Zhang, Aljosa Smolic:
Quality Assessment for Omnidirectional Video: A Spatio-Temporal Distortion Modeling Approach. 1-16 - Haonan Su, Long Yu, Cheolkon Jung:
Joint Contrast Enhancement and Noise Reduction of Low Light Images Via JND Transform. 17-32 - Yongqiang Gui, Hancheng Lu, Feng Wu, Chang Wen Chen:
LensCast: Robust Wireless Video Transmission Over MmWave MIMO With Lens Antenna Array. 33-48 - Haonan Fan, Hai-Miao Hu, Shuailing Liu, Weiqing Lu, Shiliang Pu:
Correlation Graph Convolutional Network for Pedestrian Attribute Recognition. 49-60 - Chih-Hung Liang, Yu-An Chen, Yueh-Cheng Liu, Winston H. Hsu:
Raw Image Deblurring. 61-72 - Zhenyu Wu, Shuai Li, Chenglizhao Chen, Aimin Hao, Hong Qin:
Deeper Look at Image Salient Object Detection: Bi-Stream Network With a Small Training Dataset. 73-86 - Lei Cao, Huijun Zhang, Ling Feng:
Building and Using Personal Knowledge Graph to Improve Suicidal Ideation Detection on Social Media. 87-102 - Chunxiao Liu, Zhendong Mao, Tianzhu Zhang, An-An Liu, Bin Wang, Yongdong Zhang:
Focus Your Attention: A Focal Attention for Multimodal Learning. 103-115 - Fei Ye, Chaoqin Huang, Jinkun Cao, Maosen Li, Ya Zhang, Cewu Lu:
Attribute Restoration Framework for Anomaly Detection. 116-127 - Shaoyue Song, Zhenjiang Miao, Hongkai Yu, Jianwu Fang, Kang Zheng, Cong Ma, Song Wang:
Deep Domain Adaptation Based Multi-Spectral Salient Object Detection. 128-140 - Haitao Zeng, Xinhang Song, Gongwei Chen, Shuqiang Jiang:
Amorphous Region Context Modeling for Scene Recognition. 141-151 - Xinpeng Huang, Ping An, Yilei Chen, Deyang Liu, Liquan Shen:
Low Bitrate Light Field Compression With Geometry and Content Consistency. 152-165 - Xingyuan Zhang, Fuhai Zhang:
Differentiable Spatial Regression: A Novel Method for 3D Hand Pose Estimation. 166-176 - Xingyu Chen, Jin Li, Xuguang Lan, Nanning Zheng:
Generalized Zero-Shot Learning Via Multi-Modal Aggregated Posterior Aligning Neural Network. 177-187 - Jingjia Huang, Wei Yan, Thomas H. Li, Shan Liu, Ge Li:
Learning the Global Descriptor for 3-D Object Recognition Based on Multiple Views Decomposition. 188-201 - Canqiang Chen, Chunmei Qing, Xiangmin Xu, Patrick Dickinson:
Cross Parallax Attention Network for Stereo Image Super-Resolution. 202-216 - Xun Gong, Zu Yao, Xin Li, Yueqiao Fan, Bin Luo, Jianfeng Fan, Boji Lao:
LAG-Net: Multi-Granularity Network for Person Re-Identification via Local Attention System. 217-229 - Jinwei Wang, Junjie Zhao, Qilin Yin, Xiangyang Luo, Yuhui Zheng, Yun-Qing Shi, Sunil Kr. Jha:
SmsNet: A New Deep Convolutional Neural Network Model for Adversarial Example Detection. 230-244 - Joongchol Shin, Hasil Park, Joonki Paik:
Region-Based Dehazing via Dual-Supervised Triple-Convolutional Network. 245-260 - Yu-Jen Ma, Hong-Han Shuai, Wen-Huang Cheng:
Spatiotemporal Dilated Convolution With Uncertain Matching for Video-Based Crowd Estimation. 261-273 - Che Sun, Hao Song, Xinxiao Wu, Yunde Jia, Jiebo Luo:
Exploiting Informative Video Segments for Temporal Action Localization. 274-287 - Xu Chen, Chenqiang Gao, Chaoyu Li, Yi Yang, Deyu Meng:
Infrared Action Detection in the Dark via Cross-Stream Attention Mechanism. 288-300 - Xuefeng Zhu, Xiaojun Wu, Tianyang Xu, Zhen-Hua Feng, Josef Kittler:
Robust Visual Object Tracking Via Adaptive Attribute-Aware Discriminative Correlation Filters. 301-312 - Xing Zhang, Zuxuan Wu, Yu-Gang Jiang:
SAM: Modeling Scene, Object and Action With Semantics Attention Modules for Video Recognition. 313-322 - Bo Wang, Mingwei Xu, Fengyuan Ren, Jianping Wu:
Improving Robustness of DASH Against Unpredictable Network Variations. 323-337 - Aihua Zheng, Menglan Hu, Bo Jiang, Yan Huang, Yan Yan, Bin Luo:
Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching. 338-351 - Mengyang Zhang, Guohui Tian, Ying Zhang, Peng Duan:
Reinforcement Learning for Logic Recipe Generation: Bridging Gaps From Images to Plans. 352-365 - Mauricio Perez, Jun Liu, Alex C. Kot:
Interaction Relational Network for Mutual Action Recognition. 366-376 - Lê Minh Ngô, Sezer Karaoglu, Theo Gevers:
Self-Supervised Face Image Manipulation by Conditioning GAN on Face Decomposition. 377-385 - Pantelis Maniotis, Nikolaos Thomos:
Viewport-Aware Deep Reinforcement Learning Approach for 360$^\circ$ Video Caching. 386-399 - Xinchao Dong, Liquan Shen, Mei Yu, Hao Yang:
Fast Intra Mode Decision Algorithm for Versatile Video Coding. 400-414 - Yaoyu Li, Hantao Yao, Changsheng Xu:
Intra-Domain Consistency Enhancement for Unsupervised Person Re-Identification. 415-425 - Zhihao Shi, Xiaohong Liu, Kangdi Shi, Linhui Dai, Jun Chen:
Video Frame Interpolation via Generalized Deformable Convolution. 426-439 - Yang Zhang, Moyun Liu, Jingwu He, Fei Pan, Yanwen Guo:
Affinity Fusion Graph-Based Framework for Natural Image Segmentation. 440-450 - Zhuoman Liu, Wei Jia, Ming Yang, Peiyao Luo, Yong Guo, Mingkui Tan:
Deep View Synthesis via Self-Consistent Generative Network. 451-465 - Peng-Fei Zhang, Yang Li, Zi Huang, Xin-Shun Xu:
Aggregation-Based Graph Convolutional Hashing for Unsupervised Cross-Modal Retrieval. 466-479 - Ziqiang Zheng, Zhibin Yu, Haiyong Zheng, Yang Yang, Heng Tao Shen:
One-Shot Image-to-Image Translation via Part-Global Learning With a Multi-Adversarial Framework. 480-491 - Tengpeng Li, Kaihua Zhang, Shiwen Shen, Bo Liu, Qingshan Liu, Zhu Li:
Image Co-Saliency Detection and Instance Co-Segmentation Using Attention Graph Clustering Based Graph Convolutional Network. 492-505 - Xusong Chen, Chenyi Lei, Dong Liu, Guoxin Wang, Haihong Tang, Zheng-Jun Zha, Houqiang Li:
E-Commerce Storytelling Recommendation Using Attentional Domain-Transfer Network and Adversarial Pre-Training. 506-518 - Zhaoqing Pan, Feng Yuan, Jianjun Lei, Wanqing Li, Nam Ling, Sam Kwong:
MIEGAN: Mobile Image Enhancement via a Multi-Module Cascade Neural Network. 519-533 - Liming Xu, Xianhua Zeng, Weisheng Li, Ling Bai:
IDHashGAN: Deep Hashing With Generative Adversarial Nets for Incomplete Data Retrieval. 534-545 - Huafeng Liu, Chuanyi Zhang, Yazhou Yao, Xiu-Shen Wei, Fumin Shen, Zhenmin Tang, Jian Zhang:
Exploiting Web Images for Fine-Grained Visual Recognition by Eliminating Open-Set Noise and Utilizing Hard Examples. 546-557 - Jianjie Lu, Weidong Zhang, Haibing Yin:
Generate and Purify: Efficient Person Data Generation for Re-Identification. 558-566 - Qin Xu, Yiming Mei, Jinpei Liu, Chenglong Li:
Multimodal Cross-Layer Bilinear Pooling for RGBT Tracking. 567-580 - Lijuan Sun, Songhe Feng, Jun Liu, Gengyu Lyu, Congyan Lang:
Global-Local Label Correlation for Partial Multi-Label Learning. 581-593 - Huan Li, Ping Wei, Ping Hu:
AVN: An Adversarial Variation Network Model for Handwritten Signature Verification. 594-608 - Zhongze Chen, Jing Li, Jia Wu, Jun Chang, Yafu Xiao, Xiaoting Wang:
Drift-Proof Tracking With Deep Reinforcement Learning. 609-624 - Rizard Renanda Adhi Pramono, Yie-Tarng Chen, Wen-Hsien Fang:
Spatial-Temporal Action Localization With Hierarchical Self-Attention. 625-639 - Heng Yao, Mian Zou, Chuan Qin, Xinpeng Zhang:
Signal-Dependent Noise Estimation for a Real-Camera Model via Weight and Shape Constraints. 640-654 - Jun Chen, Xuejiao Li, Linbo Luo, Jiayi Ma:
Multi-Focus Image Fusion Based on Multi-Scale Gradients and Image Matting. 655-667 - Linchao Zhu, Hehe Fan, Yawei Luo, Mingliang Xu, Yi Yang:
Temporal Cross-Layer Correlation Mining for Action Recognition. 668-676 - Zhaoyu Zhang, Mengyan Li, Haonian Xie, Jun Yu, Tongliang Liu, Chang Wen Chen:
TWGAN: Twin Discriminator Generative Adversarial Networks. 677-688 - Md. Moniruzzaman, Zhaozheng Yin, Zhihai He, Ruwen Qin, Ming C. Leu:
Human Action Recognition by Discriminative Feature Pooling and Video Segment Attention Model. 689-701 - Meng Chang, Huajun Feng, Zhihai Xu, Qi Li:
Low-Light Image Restoration With Short- and Long-Exposure Raw Pairs. 702-714 - Hanli Wang, Pengjie Tang, Qinyu Li, Meng Cheng:
Emotion Expression With Fact Transfer for Video Description. 715-727 - Sihao Lin, Wenhao Wu, Si Wu, Yong Xu, Hau-San Wong:
Unreliable-to-Reliable Instance Translation for Semi-Supervised Pedestrian Detection. 728-739 - Zhi Zeng, Ting Wang, Fulei Ma, Liang Zhang, Peiyi Shen, Syed Afaq Ali Shah, Mohammed Bennamoun:
Probability-Based Framework to Fuse Temporal Consistency and Semantic Information for Background Segmentation. 740-754 - Yuan-fang Zhang, Jiangbin Zheng, Wenjing Jia, Wenfeng Huang, Long Li, Nian Liu, Fei Li, Xiangjian He:
Deep RGB-D Saliency Detection Without Depth. 755-767 - Hao Zhou, Wengang Zhou, Yun Zhou, Houqiang Li:
Spatial-Temporal Multi-Cue Network for Sign Language Recognition and Translation. 768-779 - Amir Shirian, Subarna Tripathi, Tanaya Guha:
Dynamic Emotion Modeling With Learnable Graphs and Graph Inception Network. 780-790 - Jingkuan Song, Jingqiu Zhang, Lianli Gao, Zhou Zhao, Heng Tao Shen:
AgeGAN++: Face Aging and Rejuvenation With Dual Conditional GANs. 791-804 - Desheng Cai, Shengsheng Qian, Quan Fang, Changsheng Xu:
Heterogeneous Hierarchical Feature Aggregation Network for Personalized Micro-Video Recommendation. 805-818 - Huijing Zhan, Jie Lin, Kenan Emir Ak, Boxin Shi, Ling-Yu Duan, Alex C. Kot:
$A^3$-FKG: Attentive Attribute-Aware Fashion Knowledge Graph for Outfit Preference Prediction. 819-831 - Hongchen Tan, Xiuping Liu, Baocai Yin, Xin Li:
Cross-Modal Semantic Matching Generative Adversarial Networks for Text-to-Image Synthesis. 832-845 - Aite Zhao, Junyu Dong, Jianbo Li, Lin Qi, Huiyu Zhou:
Associated Spatio-Temporal Capsule Network for Gait Recognition. 846-860 - Jiaxu Leng, Ying Liu, Zhihui Wang, Haibo Hu, Xinbo Gao:
CrossNet: Detecting Objects as Crosses. 861-875 - Yangyang Shu, Qian Li, Chang Xu, Shaowu Liu, Guandong Xu:
V-SVR+: Support Vector Regression With Variational Privileged Information. 876-889 - Yifang Yin, Ying Zhang, Zhenguang Liu, Sheng Wang, Rajiv Ratn Shah, Roger Zimmermann:
GPS2Vec: Pre-Trained Semantic Embeddings for Worldwide GPS Coordinates. 890-903 - Huixia Ben, Yingwei Pan, Yehao Li, Ting Yao, Richang Hong, Meng Wang, Tao Mei:
Unpaired Image Captioning With Semantic-Constrained Self-Learning. 904-916 - Yanhao Tan, Mohammad Muntasir Rahman, Yanfu Yan, Jian Xue, Ling Shao, Ke Lu:
Fine-Grained Categorization From RGB-D Images. 917-928 - Xiao Luan, Yuanyuan Zhao, Weihua Ou, Linghui Liu, Weisheng Li, Yucheng Shu, Hongmin Geng:
Collaborative Learning With a Multi-Branch Framework for Feature Enhancement. 929-941 - Xinyuan Qian, Alessio Brutti, Oswald Lanz, Maurizio Omologo, Andrea Cavallaro:
Audio-Visual Tracking of Concurrent Speakers. 942-954 - Han Fang, Dongdong Chen, Feng Wang, Zehua Ma, Honggu Liu, Wenbo Zhou, Weiming Zhang, Nenghai Yu:
TERA: Screen-to-Camera Image Code With Transparency, Efficiency, Robustness and Adaptability. 955-967 - Tao Chen, Guo-Sen Xie, Yazhou Yao, Qiong Wang, Fumin Shen, Zhenmin Tang, Jian Zhang:
Semantically Meaningful Class Prototype Learning for One-Shot Image Segmentation. 968-980 - Pandeng Li, Hongtao Xie, Shaobo Min, Zheng-Jun Zha, Yongdong Zhang:
Online Residual Quantization Via Streaming Data Correlation Preserving. 981-994 - Tianze Gao, Huihui Pan, Zidong Wang, Huijun Gao:
A CRF-Based Framework for Tracklet Inactivation in Online Multi-Object Tracking. 995-1007 - Mahesh Kumar Krishna Reddy, Mrigank Rochan, Yiwei Lu, Yang Wang:
AdaCrowd: Unlabeled Scene Adaptation for Crowd Counting. 1008-1019 - Yukun Zuo, Hantao Yao, Liansheng Zhuang, Changsheng Xu:
Seek Common Ground While Reserving Differences: A Model-Agnostic Module for Noisy Domain Adaptation. 1020-1030 - Qi Wang, Weidong Min, Qing Han, Qian Liu, Cheng Zha, Haoyu Zhao, Zitai Wei:
Inter-Domain Adaptation Label for Data Augmentation in Vehicle Re-Identification. 1031-1041 - Tao Chen, Shui-Hua Wang, Qiong Wang, Zheng Zhang, Guo-Sen Xie, Zhenmin Tang:
Enhanced Feature Alignment for Unsupervised Domain Adaptation of Semantic Segmentation. 1042-1054 - Xiuwen Gong, Jiahui Yang, Dong Yuan, Wei Bao:
Generalized Large Margin $k$NN for Partial Label Learning. 1055-1066 - Jing Yi, Zhenzhong Chen:
Multi-Modal Variational Graph Auto-Encoder for Recommendation Systems. 1067-1079 - Zhengning Wu, Xiaobo Xia, Ruxin Wang, Jiatong Li, Jun Yu, Yinian Mao, Tongliang Liu:
LR-SVM+: Learning Using Privileged Information with Noisy Labels. 1080-1092 - Zeren Sun, Huafeng Liu, Qiong Wang, Tianfei Zhou, Qi Wu, Zhenmin Tang:
Co-LDL: A Co-Training-Based Label Distribution Learning Method for Tackling Label Noise. 1093-1104 - Huafeng Liu, Haofeng Zhang, Jianfeng Lu, Zhenmin Tang:
Exploiting Web Images for Fine-Grained Visual Recognition via Dynamic Loss Correction and Global Sample Selection. 1105-1115 - Xiaobo Shen, Guohua Dong, Yuhui Zheng, Long Lan, Ivor W. Tsang, Quan-Sen Sun:
Deep Co-Image-Label Hashing for Multi-Label Image Retrieval. 1116-1126 - Hao-Chiang Shao, Hsin-Chieh Wang, Weng-Tai Su, Chia-Wen Lin:
Ensemble Learning With Manifold-Based Data Splitting for Noisy Label Correction. 1127-1140 - Junya Teng, Xiankai Lu, Yongshun Gong, Xinfang Liu, Xiushan Nie, Yilong Yin:
Regularized Two Granularity Loss Function for Weakly Supervised Video Moment Retrieval. 1141-1151 - Sijie Song, Jiaying Liu, Lilang Lin, Zongming Guo:
Learning to Recognize Human Actions From Noisy Skeleton Data Via Noise Adaptation. 1152-1163 - Jingyu Hao, Chengjia Wang, Guang Yang, Zhifan Gao, Jinglin Zhang, Heye Zhang:
Annealing Genetic GAN for Imbalanced Web Data Learning. 1164-1174 - Bin Zhu, Chong-Wah Ngo, Wing Kwong Chan:
Learning From Web Recipe-Image Pairs for Food Recognition: Problem, Baselines and Performance. 1175-1185 - Yaoyao Zhong, Weihong Deng, Han Fang, Jiani Hu, Dongyue Zhao, Xian Li, Dongchao Wen:
Dynamic Training Data Dropout for Robust Deep Face Recognition. 1186-1197 - Chuanyi Zhang, Qiong Wang, Guo-Sen Xie, Qi Wu, Fumin Shen, Zhenmin Tang:
Robust Learning From Noisy Web Images Via Data Purification for Fine-Grained Recognition. 1198-1209 - Shiji Zhou, Lianzhe Wang, Shanghang Zhang, Zhi Wang, Wenwu Zhu:
Active Gradual Domain Adaptation: Dataset and Approach. 1210-1220 - Gongmian Wang, Xing Xu, Fumin Shen, Huimin Lu, Yanli Ji, Heng Tao Shen:
Cross-Modal Dynamic Networks for Video Moment Retrieval With Text Query. 1221-1232 - Bingwen Hu, Ping Liu, Zhedong Zheng, Mingwu Ren:
SPG-VTON: Semantic Prediction Guidance for Multi-Pose Virtual Try-on. 1233-1246 - Zhenfeng Xue, Weijie Mao, Liang Zheng:
Learning to Simulate Complex Scenes for Street Scene Segmentation. 1253-1265 - Mitra Tajrobehkar, Kaihua Tang, Hanwang Zhang, Joo-Hwee Lim:
Align R-CNN: A Pairwise Head Network for Visual Relationship Detection. 1266-1276 - Peiguang Jing, Jing Zhang, Liqiang Nie, Shu Ye, Jing Liu, Yuting Su:
Tripartite Graph Regularized Latent Low-Rank Representation for Fashion Compatibility Prediction. 1277-1287 - Yaomin Wang, Wenguang He:
High Capacity Reversible Data Hiding in Encrypted Image Based on Adaptive MSB Prediction. 1288-1298 - Shiguang Liu, Ting Zhu:
Structure-Guided Arbitrary Style Transfer for Artistic Image and Video. 1299-1312 - Dung Nguyen, Duc Thanh Nguyen, Rui Zeng, Thanh Thi Nguyen, Son N. Tran, Thin Nguyen, Sridha Sridharan, Clinton Fookes:
Deep Auto-Encoders With Sequential Learning for Multimodal Dimensional Emotion Recognition. 1313-1324 - Yalan Ye, Yukun He, Tongjie Pan, Jingjing Li, Heng Tao Shen:
Alleviating Domain Shift via Discriminative Learning for Generalized Zero-Shot Learning. 1325-1337 - Haoyu Tang, Jihua Zhu, Meng Liu, Zan Gao, Zhiyong Cheng:
Frame-Wise Cross-Modal Matching for Video Moment Retrieval. 1338-1349 - Tianchi Huang, Rui-Xiao Zhang, Lifeng Sun:
Zwei: A Self-Play Reinforcement Learning Framework for Video Transmission Services. 1350-1365 - Chong Mou, Jian Zhang, Xiaopeng Fan, Hangfan Liu, Ronggang Wang:
COLA-Net: Collaborative Attention Network for Image Restoration. 1366-1377 - Kaihao Zhang, Wenhan Luo, Lin Ma, Wenqi Ren, Hongdong Li:
Disentangled Feature Networks for Facial Portrait and Caricature Generation. 1378-1388 - Kyohoon Sim, Jiachen Yang, Wen Lu, Xinbo Gao:
Blind Stereoscopic Image Quality Evaluator Based on Binocular Semantic and Quality Channels. 1389-1398 - Giulia Slavic, Mohamad Baydoun, Damian Campo, Lucio Marcenaro, Carlo S. Regazzoni:
Multilevel Anomaly Detection Through Variational Autoencoders and Bayesian Models for Self-Aware Embodied Agents. 1399-1414 - Lu Zhang, Jingsong Xu, Yongshun Gong, Litao Yu, Jian Zhang, Jialie Shen:
Unsupervised Image and Text Fusion for Travel Information Enhancement. 1415-1425 - Amin Parvaneh, Ehsan Abbasnejad, Qi Wu, Qinfeng (Javen) Shi, Anton van den Hengel:
Show, Price and Negotiate: A Negotiator With Online Value Look-Ahead. 1426-1434 - Jialu Huang, Jing Liao, Sam Kwong:
Unsupervised Image-to-Image Translation via Pre-Trained StyleGAN2 Network. 1435-1448 - Huaiwen Zhang, Shengsheng Qian, Quan Fang, Changsheng Xu:
Multi-Modal Meta Multi-Task Learning for Social Media Rumor Detection. 1449-1459 - Rencan Nie, Chaozhen Ma, Jinde Cao, Hongwei Ding, Dongming Zhou:
A Total Variation With Joint Norms For Infrared and Visible Image Fusion. 1460-1472 - Yitong Yan, Chuangchuang Liu, Changyou Chen, Xianfang Sun, Longcun Jin, Xinyi Peng, Xiang Zhou:
Fine-Grained Attention and Feature-Sharing Generative Adversarial Networks for Single Image Super-Resolution. 1473-1487 - Lei Gao, Ling Guan:
A Discriminative Vectorial Framework for Multi-Modal Feature Representation. 1503-1514 - Yongqiang Kong, Yunhong Wang, Annan Li:
Spatiotemporal Saliency Representation Learning for Video Action Recognition. 1515-1528 - Qi Cheng, Hangguan Shan, Weihua Zhuang, Lu Yu, Zhaoyang Zhang, Tony Q. S. Quek:
Design and Analysis of MEC- and Proactive Caching-Based 360° Mobile VR Video Streaming. 1529-1544 - Qing Zhang, Yongwei Nie, Lei Zhu, Chunxia Xiao, Wei-Shi Zheng:
A Blind Color Separation Model for Faithful Palette-Based Image Recoloring. 1545-1557 - Xiaoyu Chen, Hongliang Li, Qingbo Wu, Fanman Meng, Heqian Qiu:
Bal-R$^2$CNN: High Quality Recurrent Object Detection With Balance Optimization. 1558-1569 - Yan Huang, Qiang Wu, Jingsong Xu, Yi Zhong, Peng Zhang, Zhaoxiang Zhang:
Alleviating Modality Bias Training for Infrared-Visible Person Re-Identification. 1570-1582 - Rui Guo, Xiangxin Shao, Chencheng Zhang, Xiaohua Qian:
Multi-Scale Sparse Graph Convolutional Network For the Assessment of Parkinsonian Gait. 1583-1594 - Xuejin Wang, Feng Shao, Qiuping Jiang, Xiongli Chai, Xiangchao Meng, Yo-Sung Ho:
List-Wise Rank Learning for Stereoscopic Image Retargeting Quality Assessment. 1595-1608 - Jinxiu Liang, Jingwen Wang, Yuhui Quan, Tianyi Chen, Jiaying Liu, Haibin Ling, Yong Xu:
Recurrent Exposure Generation for Low-Light Face Detection. 1609-1621 - Yong Yang, Juwei Guan, Shuying Huang, Weiguo Wan, Yating Xu, Jiaxiang Liu:
End-to-End Rain Removal Network Based on Progressive Residual Detail Supplement. 1622-1636 - Zhenyu Shu, Sipeng Yang, Shiqing Xin, Chaoyi Pang, Xiaogang Jin, Ladislav Kavan, Ligang Liu:
Detecting 3D Points of Interest Using Projective Neural Networks. 1637-1650 - Nianchang Huang, Yang Yang, Dingwen Zhang, Qiang Zhang, Jungong Han:
Employing Bilinear Fusion and Saliency Prior Information for RGB-D Salient Object Detection. 1651-1664 - Cheng Yan, Guansong Pang, Xiao Bai, Changhong Liu, Xin Ning, Lin Gu, Jun Zhou:
Beyond Triplet Loss: Person Re-Identification With Fine-Grained Difference-Aware Pairwise Loss. 1665-1677 - Wing W. Y. Ng, Mingyang Zhang, Ting Wang:
Multi-Localized Sensitive Autoencoder-Attention-LSTM For Skeleton-based Action Recognition. 1678-1690 - Salma Emara, Silas L. Fong, Baochun Li, Ashish Khisti, Wai-Tian Tan, Xiaoqing Zhu, John G. Apostolopoulos:
Low-Latency Network-Adaptive Error Control for Interactive Streaming. 1691-1706 - Jie-Ru Lin, Mei-Juan Chen, Chia-Hung Yeh, Yong-Ci Chen, Lih-Jen Kau, Chuan-Yu Chang, Min-Hui Lin:
Visual Perception Based Algorithm for Fast Depth Intra Coding of 3D-HEVC. 1707-1720 - Zhenyang Zhu, Masahiro Toyoura, Kentaro Go, Kenji Kashiwagi, Issei Fujishiro, Tien-Tsin Wong, Xiaoyang Mao:
Personalized Image Recoloring for Color Vision Deficiency Compensation. 1721-1734 - Min Li, Zhenjiang Miao, Xiao-Ping Zhang, Wanru Xu, Cong Ma, Ningwei Xie:
Rhythm-Aware Sequence-to-Sequence Learning for Labanotation Generation With Gesture-Sensitive Graph Convolutional Encoding. 1488-1502 - Ruiheng Zhang, Lixin Xu, Zhengyu Yu, Ye Shi, Chengpo Mu, Min Xu:
Deep-IRTarget: An Automatic Target Detector in Infrared Imagery Using Dual-Domain Feature Extraction and Allocation. 1735-1749 - Nikolas Adaloglou, Theocharis Chatzis, Ilias Papastratis, Andreas Stergioulas, Georgios Th. Papadopoulos, Vassia Zacharopoulou, George J. Xydopoulos, Klimnis Atzakas, Dimitris Papazachariou, Petros Daras:
A Comprehensive Study on Deep Learning-Based Methods for Sign Language Recognition. 1750-1762 - Hailong Ning, Xiangtao Zheng, Xiaoqiang Lu, Yuan Yuan:
Disentangled Representation Learning for Cross-Modal Biometric Matching. 1763-1774 - Litao Yu, Jian Zhang, Qiang Wu:
Dual Attention on Pyramid Feature Maps for Image Captioning. 1775-1786 - Xiyan Liu, Gaofeng Meng, Jianlong Chang, Ruiguang Hu, Shiming Xiang, Chunhong Pan:
Decoupled Representation Learning for Character Glyph Synthesis. 1787-1799 - Feifei Zhang, Mingliang Xu, Changsheng Xu:
Weakly-Supervised Facial Expression Recognition in the Wild With Noisy Data. 1800-1814 - Chenlei Lv, Weisi Lin, Baoquan Zhao:
Voxel Structure-Based Mesh Reconstruction From a 3D Point Cloud. 1815-1829 - Lu Zhang, Jialie Shen, Jian Zhang, Jingsong Xu, Zhibin Li, Yazhou Yao, Litao Yu:
Multimodal Marketing Intent Analysis for Effective Targeted Advertising. 1830-1843 - Wei Chen, Yu Liu, Nan Pu, Weiping Wang, Li Liu, Michael S. Lew:
Feature Estimations Based Correlation Distillation for Incremental Image Retrieval. 1844-1856 - Yuanhao Zhai, Le Wang, Wei Tang, Qilin Zhang, Nanning Zheng, Gang Hua:
Action Coherence Network for Weakly-Supervised Temporal Action Localization. 1857-1870 - Yuwu Lu, Desheng Li, Wenjing Wang, Zhihui Lai, Jie Zhou, Xuelong Li:
Discriminative Invariant Alignment for Unsupervised Domain Adaptation. 1871-1882 - Pengwen Dai, Yang Li, Hua Zhang, Jingzhi Li, Xiaochun Cao:
Accurate Scene Text Detection Via Scale-Aware Data Augmentation and Shape Similarity Constraint. 1883-1895 - Yang Jin, Wenhao Jiang, Yi Yang, Yadong Mu:
Zero-Shot Video Event Detection With High-Order Semantic Concept Discovery and Matching. 1896-1908 - Wan-Lei Zhao, Hui Wang, Chong-Wah Ngo:
Approximate k-NN Graph Construction: A Generic Online Approach. 1909-1921 - Liang Lin, Pengxiang Yan, Xiaoqian Xu, Sibei Yang, Kun Zeng, Guanbin Li:
Structured Attention Network for Referring Image Segmentation. 1922-1932 - Hantao Yao, Shaobo Min, Yongdong Zhang, Changsheng Xu:
Attribute-Induced Bias Eliminating for Transductive Zero-Shot Learning. 1933-1942 - Shi Qiu, Saeed Anwar, Nick Barnes:
Geometric Back-Projection Network for Point Cloud Classification. 1943-1955 - Kai Yang, Zhenyu He, Wenjie Pei, Zikun Zhou, Xin Li, Di Yuan, Haijun Zhang:
SiamCorners: Siamese Corner Networks for Visual Tracking. 1956-1967 - Chunfang Deng, Mengmeng Wang, Liang Liu, Yong Liu, Yunliang Jiang:
Extended Feature Pyramid Network for Small Object Detection. 1968-1979 - Pengfei Guo, Lang He, Shuangyin Liu, Delu Zeng, Hantao Liu:
Underwater Image Quality Assessment: Subjective and Objective Methods. 1980-1989 - Xiaokang Zhang, Yuanlue Zhu, Wenting Chen, Wenshuang Liu, Linlin Shen:
Gated SwitchGAN for Multi-Domain Facial Image Translation. 1990-2003 - Qingbao Huang, Yu Liang, Jielong Wei, Yi Cai, Hanyu Liang, Ho-fung Leung, Qing Li:
Image Difference Captioning With Instance-Level Fine-Grained Feature Representation. 2004-2017 - Chen Du, Sarah Graham, Colin Depp, Truong Nguyen:
Multi-Task Center-of-Pressure Metrics Estimation With Graph Convolutional Network. 2018-2033 - Lixi Deng, Jingjing Chen, Chong-Wah Ngo, Qianru Sun, Sheng Tang, Yongdong Zhang, Tat-Seng Chua:
Mixed Dish Recognition With Contextual Relation and Domain Alignment. 2034-2045 - Fu-Sheng Tsai, Wei-Wen Chang, Chi-Chun Lee:
A Social Condition-Enhanced Network for Recognizing Power Distance Using Expressive Prosody and Intrinsic Brain Connectivity. 2046-2057 - Shuai Wu, Yong Xu, Bob Zhang, Jian Yang, David Zhang:
Deformable Template Network (DTN) for Object Detection. 2058-2068 - Zhenfeng Shao, Gui Cheng, Jiayi Ma, Zhongyuan Wang, Jiaming Wang, Deren Li:
Real-Time and Accurate UAV Pedestrian Detection for Social Distancing Monitoring in COVID-19 Pandemic. 2069-2083 - Jindou Liu, Zhaohong Li, Xinghao Jiang, Zhenzhen Zhang:
A High-Performance CNN-Applied HEVC Steganography Based on Diamond-Coded PU Partition Modes. 2084-2097 - Hu Zhu, Hao Peng, Guoxia Xu, Lizhen Deng, Yueying Cheng, Aiguo Song:
Bilateral Weighted Regression Ranking Model With Spatial-Temporal Correlation Filter for Visual Tracking. 2098-2111 - Fangxiang Feng, Tianrui Niu, Ruifan Li, Xiaojie Wang:
Modality Disentangled Discriminator for Text-to-Image Synthesis. 2112-2124 - Aliaksei Mikhailiuk, María Pérez-Ortiz, Dingcheng Yue, Wilson Suen, Rafal K. Mantiuk:
Consolidated Dataset and Metrics for High-Dynamic-Range Image Quality. 2125-2138 - Pengpeng Hu, Edmond S. L. Ho, Adrian Munteanu:
3DBodyNet: Fast Reconstruction of 3D Animatable Human Body Shape From a Single Commodity Depth Camera. 2139-2149 - Yingying Zhao, Mingzhi Dong, Yujiang Wang, Da Feng, Qin Lv, Robert P. Dick, Dongsheng Li, Tun Lu, Ning Gu, Li Shang:
A Reinforcement-Learning-Based Energy-Efficient Framework for Multi-Task Video Analytics Pipeline. 2150-2163 - Huaian Chen, Yi Jin, Kai Xu, Yuxuan Chen, Changan Zhu:
Multiframe-to-Multiframe Network for Video Denoising. 2164-2178 - Takuya Fujihashi, Toshiaki Koike-Akino, Takashi Watanabe, Philip V. Orlik:
HoloCast+: Hybrid Digital-Analog Transmission for Graceful Point Cloud Delivery With Graph Fourier Transform. 2179-2191 - Wujie Zhou, Yun Zhu, Jingsheng Lei, Jian Wan, Lu Yu:
CCAFNet: Crossflow and Cross-Scale Adaptive Fusion Network for Detecting Salient Objects in RGB-D Images. 2192-2204 - Aniwat Phaphuangwittayakul, Yi Guo, Fangli Ying:
Fast Adaptive Meta-Learning for Few-Shot Image Generation. 2205-2217 - Jinpeng Chen, Pinguang Ying, Xiangling Fu, Xiaopeng Luo, Hao Guan, Kaimin Wei:
Automatic Tagging by Leveraging Visual and Annotated Features in Social Media. 2218-2229 - Gerasimos Arvanitis, Evangelia I. Zacharaki, Libor Vása, Konstantinos Moustakas:
Broad-to-Narrow Registration and Identification of 3D Objects in Partially Scanned and Cluttered Point Clouds. 2230-2245 - Chong Zhang, Zongxian Li, Jingjing Liu, Peixi Peng, Qixiang Ye, Shijian Lu, Tie-Jun Huang, Yonghong Tian:
Self-Guided Adaptation: Progressive Representation Alignment for Domain Adaptive Object Detection. 2246-2258 - Yuqing Liu, Shiqi Wang, Jian Zhang, Shanshe Wang, Siwei Ma, Wen Gao:
Iterative Network for Image Super-Resolution. 2259-2272 - Yi Huang, Xiaoshan Yang, Junyun Gao, Changsheng Xu:
Holographic Feature Learning of Egocentric-Exocentric Videos for Multi-Domain Action Recognition. 2273-2286 - Yujuan Ding, Yunshan Ma, Lizi Liao, Wai Keung Wong, Tat-Seng Chua:
Leveraging Multiple Relations for Fashion Trend Forecasting Based on Social Media. 2287-2299 - Abdelhak Bentaleb, Mehmet N. Akcay, May Lim, Ali C. Begen, Roger Zimmermann:
Catching the Moment With LoL$^+$ in Twitch-Like Low-Latency Live Streaming Platforms. 2300-2314 - Josiah W. Smith, Orges Furxhi, Murat Torlak:
An FCNN-Based Super-Resolution Mmwave Radar Framework for Contactless Musical Instrument Interface. 2315-2328 - Lianli Gao, Zijie Huang, Jingkuan Song, Yang Yang, Heng Tao Shen:
Push & Pull: Transferable Adversarial Examples With Attentive Attack. 2329-2338 - Seungjun Lee, Haesang Yang, Hwiyong Choi, Woojae Seong:
Zero-Shot Single-Microphone Sound Classification and Localization in a Building Via the Synthesis of Unseen Features. 2339-2351 - Wei Jia, Li Li, Anique Akhtar, Zhu Li, Shan Liu:
Convolutional Neural Network-Based Occupancy Map Accuracy Improvement for Video-Based Point Cloud Compression. 2352-2365 - Ruijun Ma, Shuyi Li, Bob Zhang, Zhengming Li:
Towards Fast and Robust Real Image Denoising With Attentive Neural Network and PID Controller. 2366-2377 - Pablo Carballeira, Carlos Carmona, César Díaz, Daniel Berjón, Daniel Corregidor, Julián Cabrera, Francisco Morán, Carmen Doblado, Sergio Arnaldo, María del Mar Martín, Narciso García:
FVV Live: A Real-Time Free-Viewpoint Video System With Consumer Electronics Hardware. 2378-2391 - Shaopeng Liu, Guohui Tian, Ying Zhang, Peng Duan:
Scene Recognition Mechanism for Service Robot Adapting Various Families: A CNN-Based Approach Using Multi-Type Cameras. 2392-2406 - Wanxia Deng, Lingjun Zhao, Qing Liao, Deke Guo, Gangyao Kuang, Dewen Hu, Matti Pietikäinen, Li Liu:
Informative Feature Disentanglement for Unsupervised Domain Adaptation. 2407-2421 - Xuejin Wang, Feng Shao, Qiuping Jiang, Zhenqi Fu, Xiangchao Meng, Ke Gu, Yo-Sung Ho:
Combining Retargeting Quality and Depth Perception Measures for Quality Evaluation of Retargeted Stereopairs. 2422-2434 - Yudong Mao, Qiuping Jiang, Runmin Cong, Wei Gao, Feng Shao, Sam Kwong:
Cross-Modality Fusion and Progressive Integration Network for Saliency Prediction on Stereoscopic 3D Images. 2435-2448 - Hai Liu, Shuai Fang, Zhaoli Zhang, Duantengchuan Li, Ke Lin, Jiazhang Wang:
MFDNet: Collaborative Poses Perception and Matrix Fisher Distribution for Head Pose Estimation. 2449-2460 - Zhenglai Li, Chang Tang, Xinwang Liu, Xiao Zheng, Wei Zhang, En Zhu:
Consensus Graph Learning for Multi-View Clustering. 2461-2472 - Yang Hu, Guihua Wen, Adriane Chapman, Pei Yang, Mingnan Luo, Yingxue Xu, Dan Dai, Wendy Hall:
Graph-Based Visual-Semantic Entanglement Network for Zero-Shot Image Recognition. 2473-2487 - Sijie Mai, Haifeng Hu, Songlong Xing:
A Unimodal Representation Learning and Recurrent Decomposition Fusion Structure for Utterance-Level Multimodal Embedding Learning. 2488-2501 - Dayan Guan, Jiaxing Huang, Aoran Xiao, Shijian Lu, Yanpeng Cao:
Uncertainty-Aware Unsupervised Domain Adaptation in Object Detection. 2502-2514 - Hao Wang, Doyen Sahoo, Chenghao Liu, Ke Shu, Palakorn Achananuparp, Ee-Peng Lim, Steven C. H. Hoi:
Cross-Modal Food Retrieval: Learning a Joint Embedding of Food Images and Recipes With Semantic Consistency and Attention Mechanism. 2515-2525 - Wujie Zhou, Xinyang Lin, Jingsheng Lei, Lu Yu, Jenq-Neng Hwang:
MFFENet: Multiscale Feature Fusion and Enhancement Network For RGB-Thermal Urban Road Scene Parsing. 2526-2538 - Danny Websdale, Sarah Taylor, Ben Milner:
Speaker-Independent Speech Animation Using Perceptual Loss Functions and Synthetic Data. 2539-2552 - Jinpeng Wang, Yiqi Lin, Manlin Zhang, Yuan Gao, Andy J. Ma:
Multi-Level Temporal Dilated Dense Prediction for Action Recognition. 2553-2566 - Mengjing Sun, Siwei Wang, Pei Zhang, Xinwang Liu, Xifeng Guo, Sihang Zhou, En Zhu:
Projective Multiple Kernel Subspace Clustering. 2567-2579 - Xiao-Long Yun, Yan-Ming Zhang, Fei Yin, Cheng-Lin Liu:
Instance GNN: A Learning Framework for Joint Symbol Segmentation and Recognition in Online Handwritten Diagrams. 2580-2594 - Shaokun Wang, Tian Gan, Yuan Liu, Li Zhang, Jianlong Wu, Liqiang Nie:
Discover Micro-Influencers for Brands via Better Understanding. 2595-2605 - Zhaoyu Guo, Zhou Zhao, Weike Jin, Dazhou Wang, Ruitao Liu, Jun Yu:
TaoHighlight: Commodity-Aware Multi-Modal Video Highlight Detection in E-Commerce. 2606-2616 - Xihua Sheng, Li Li, Dong Liu, Zhiwei Xiong, Zhu Li, Feng Wu:
Deep-PCAC: An End-to-End Deep Lossy Compression Framework for Point Cloud Attributes. 2617-2632 - Zhaoyi Yan, Ruimao Zhang, Hongzhi Zhang, Qingfu Zhang, Wangmeng Zuo:
Crowd Counting Via Perspective-Guided Fractional-Dilation Convolution. 2633-2647 - Rongjie Xia, Yanshan Li, Wenhan Luo:
LAGA-Net: Local-and-Global Attention Network for Skeleton Based Action Recognition. 2648-2661 - Jian Zhao, Weizhen Qi, Wengang Zhou, Nan Duan, Ming Zhou, Houqiang Li:
Conditional Sentence Generation and Cross-Modal Reranking for Sign Language Translation. 2662-2672 - Zheng Gu, Chuanqi Dong, Jing Huo, Wenbin Li, Yang Gao:
CariMe: Unpaired Caricature Generation With Multiple Exaggerations. 2673-2686 - Yujuan Ding, Yunshan Ma, Wai Keung Wong, Tat-Seng Chua:
Modeling Instant User Intent and Content-Level Transition for Sequential Fashion Recommendation. 2687-2700 - Jian Zhang, Alan Hanjalic, Ramesh C. Jain, Xian-Sheng Hua, Shin'ichi Satoh, Yazhou Yao, Dan Zeng:
Guest Editorial: Learning From Noisy Multimedia Data. 1247-1252 - Yinwei Wei, Xiang Wang, Xiangnan He, Liqiang Nie, Yong Rui, Tat-Seng Chua:
Hierarchical User Intent Graph Network for Multimedia Recommendation. 2701-2712 - So Yeon Jo, Siyeong Lee, Namhyun Ahn, Suk-Ju Kang:
Deep Arbitrary HDRI: Inverse Tone Mapping With Controllable Exposure Changes. 2713-2726 - Yuhao Liu, Jiake Xie, Yu Qiao, Yong Tang, Xin Yang:
Prior-Induced Information Alignment for Image Matting. 2727-2738 - Yong Deng, Jimin Xiao, Steven Zhiying Zhou:
ToF and Stereo Data Fusion Using Dynamic Search Range Stereo Matching. 2739-2751 - Xinzhou Xu, Jun Deng, Nicholas Cummins, Zixing Zhang, Li Zhao, Björn W. Schuller:
Exploring Zero-Shot Emotion Recognition in Speech Using Semantic-Embedding Prototypes. 2752-2765 - Baojie Fan, Jiandong Tian, Yan Peng, Yandong Tang:
Discriminative Siamese Complementary Tracker With Flexible Update. 2766-2778 - Min Wang, Wengang Zhou, Qi Tian, Houqiang Li:
Deep Enhanced Weakly-Supervised Hashing With Iterative Tag Refinement. 2779-2790 - Mao Xi, Wengang Zhou, Ning Wang, Houqiang Li:
Learning Temporal-Correlated and Channel-Decorrelated Siamese Networks for Visual Tracking. 2791-2803 - Xuan Wang, Shenqi Lai, Zhenhua Chai, Xingjun Zhang, Xueming Qian:
SPGNet: Serial and Parallel Group Network. 2804-2814 - Ning Zhang, Yang Zhao, Chao Wang, Ronggang Wang:
A Real-Time Semi-Supervised Deep Tone Mapping Network. 2815-2827 - Zihan Ye, Fuyuan Hu, Fan Lyu, Linyan Li, Kaizhu Huang:
Disentangling Semantic-to-Visual Confusion for Zero-Shot Learning. 2828-2840 - Jia-Li Yin, Bo-Hao Chen, Yan-Tsung Peng:
Two Exposure Fusion Using Prior-Aware Generative Adversarial Network. 2841-2851 - Xin Li, Fan Yang, Ao Luo, Zhicheng Jiao, Hong Cheng, Zicheng Liu:
EFRNet: Efficient Feature Reconstructing Network for Real-Time Scene Parsing. 2852-2865 - Anique Akhtar, Wen Gao, Li Li, Zhu Li, Wei Jia, Shan Liu:
Video-Based Point Cloud Compression Artifact Removal. 2866-2876 - Zongyao He, Zhi Jin, Yao Zhao:
SRDRL: A Blind Super-Resolution Framework With Degradation Reconstruction Loss. 2877-2889 - Yang Liu, Faming Fang, Tingting Wang, Juncheng Li, Yun Sheng, Guixu Zhang:
Multi-Scale Grid Network for Image Deblurring With High-Frequency Guidance. 2890-2901 - Huabin Liu, Jianguo Li, Dian Li, John See, Weiyao Lin:
Learning Scale-Consistent Attention Part Network for Fine-Grained Image Recognition. 2902-2913 - Xue Song, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
Spatial-Temporal Graphs for Cross-Modal Text2Video Retrieval. 2914-2923 - Lingfeng Qu, Fan Chen, Shanjun Zhang, Hongjie He:
Cryptanalysis of Reversible Data Hiding in Encrypted Images by Block Permutation and Co-Modulation. 2924-2937 - Chuanwu Ling, Xiaogang Zhang, Hua Chen:
Unsupervised Monocular Depth Estimation Using Attention and Multi-Warp Reconstruction. 2938-2949 - Lingyun Yu, Hongtao Xie, Yongdong Zhang:
Multimodal Learning for Temporally Coherent Talking Face Generation With Articulator Synergy. 2950-2962 - Hao Tang, Nicu Sebe:
Total Generate: Cycle in Cycle Generative Adversarial Networks for Generating Human Faces, Hands, Bodies, and Natural Scenes. 2963-2974 - Tianyi Chen, Si Wu, Xuhui Yang, Yong Xu, Hau-San Wong:
Semantic Regularized Class-Conditional GANs for Semi-Supervised Fine-Grained Image Synthesis. 2975-2985 - Xi Zhang, Feifei Zhang, Changsheng Xu:
Explicit Cross-Modal Representation Learning for Visual Commonsense Reasoning. 2986-2997 - Xinhong Ma, Xiaoshan Yang, Junyu Gao, Changsheng Xu:
The Model May Fit You: User-Generalized Cross-Modal Retrieval. 2998-3012 - Yuqing Song, Shizhe Chen, Qin Jin, Wei Luo, Jun Xie, Fei Huang:
Enhancing Neural Machine Translation With Dual-Side Multimodal Awareness. 3013-3024 - Wei-Lun Huang, Chun-Yi Hung, I-Chen Lin:
Confidence-Based 6D Object Pose Estimation. 3025-3035 - Tun Zhu, Daoxin Zhang, Yao Hu, Tianran Wang, Xiaolong Jiang, Jianke Zhu, Jiawei Li:
Horizontal-to-Vertical Video Conversion. 3036-3048 - Xuekun Jiang, Libiao Jin, Anyi Rao, Linning Xu, Dahua Lin:
Jointly Learning the Attributes and Composition of Shots for Boundary Detection in Videos. 3049-3059 - Yiheng Liu, Wengang Zhou, Mao Xi, Sanjing Shen, Houqiang Li:
Multi-Modal Context Propagation for Person Re-Identification With Wireless Positioning. 3060-3073 - Xiangyuan Zhu, Kehua Guo, Hui Fang, Liang Chen, Sheng Ren, Bin Hu:
Cross View Capture for Stereo Image Super-Resolution. 3074-3086 - Jesús Gutiérrez, Pablo Pérez, Marta Orduna, Ashutosh Singla, Carlos Cortés, Pramit Mazumdar, Irene Viola, Kjell Brunnström, Federica Battisti, Natalia Cieplinska, Dawid Juszka, Lucjan Janowski, Mikolaj Leszczuk, Anthony Adeyemi-Ejeye, Yaosi Hu, Zhenzhong Chen, Glenn Van Wallendael, Peter Lambert, César Díaz, John Hedlund, Omar Hamsis, Stephan Fremerey, Frank Hofmeyer, Alexander Raake, Pablo César, Marco Carli, Narciso García:
Subjective Evaluation of Visual Quality and Simulator Sickness of Short 360° Videos: ITU-T Rec. P.919. 3087-3100 - Zongjian Zhang, Qiang Wu, Yang Wang, Fang Chen:
Exploring Pairwise Relationships Adaptively From Linguistic Context in Image Captioning. 3101-3113 - Qiaosi Yi, Juncheng Li, Faming Fang, Aiwen Jiang, Guixu Zhang:
Efficient and Accurate Multi-Scale Topological Network for Single Image Dehazing. 3114-3128 - Mengting Xing, Hongtao Xie, Qingfeng Tan, Shancheng Fang, Yuxin Wang, Zhengjun Zha, Yongdong Zhang:
Boundary-Aware Arbitrary-Shaped Scene Text Detector With Learnable Embedding Network. 3129-3143 - Hui Zhang, Xiangwei Wang, Xiaochuan Yin, Mingxiao Du, Chengju Liu, Qijun Chen:
Geometry-Constrained Scale Estimation for Monocular Visual Odometry. 3144-3156 - Jiayi Ma, Chengli Peng, Xin Tian, Junjun Jiang:
DBDnet: A Deep Boosting Strategy for Image Denoising. 3157-3168 - Shurun Wang, Shiqi Wang, Wenhan Yang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao:
Towards Analysis-Friendly Face Representation With Scalable Feature and Texture Compression. 3169-3181 - Wei Xia, Qianqian Wang, Quanxue Gao, Xiangdong Zhang, Xinbo Gao:
Self-Supervised Graph Convolutional Network for Multi-View Clustering. 3182-3192 - Chunli Meng, Ping An, Xinpeng Huang, Chao Yang, Liquan Shen, Bin Wang:
Objective Quality Assessment of Lenslet Light Field Image Based on Focus Stack. 3193-3207 - Tianyi Wu, Sheng Tang, Rui Zhang, Guodong Guo:
Consensus Feature Network for Scene Parsing. 3208-3217 - Bo Jiang, Xixi Wang, Aihua Zheng, Jin Tang, Bin Luo:
PH-GCN: Person Retrieval With Part-Based Hierarchical Graph Convolutional Network. 3218-3228 - Xulin Song, Zhong Jin:
Robust Label Rectifying With Consistent Contrastive-Learning for Domain Adaptive Person Re-Identification. 3229-3239 - Xiaoyan Zhang, Yukai Song, Zhuopeng Li, Jianmin Jiang:
PR-RL: Portrait Relighting Via Deep Reinforcement Learning. 3240-3255 - Hadi Amirpour, António M. G. Pinheiro, Elsa Susana Reis Fonseca, Mohammed Ghanbari, Manuela Pereira:
Quality Evaluation of Holographic Images Coded With Standard Codecs. 3256-3264 - Ke Xu, Xinghao Jiang, Tanfeng Sun:
Gait Recognition Based on Local Graphical Skeleton Descriptor With Pairwise Similarity Network. 3265-3275 - Yuechen Wang, Jiajun Deng, Wengang Zhou, Houqiang Li:
Weakly Supervised Temporal Adjacent Network for Language Grounding. 3276-3286 - Dan Li, Changde Du, Haibao Wang, Qiongyi Zhou, Huiguang He:
Deep Modality Assistance Co-Training Network for Semi-Supervised Multi-Label Semantic Decoding. 3287-3299 - Dong Huang, Xiaoyi Feng, Haixi Zhang, Zitong Yu, Jinye Peng, Guoying Zhao, Zhaoqiang Xia:
Spatio-Temporal Pain Estimation Network With Measuring Pseudo Heart Rate Gain. 3300-3313 - Lihua Jian, Rakiba Rayhana, Ling Ma, Shaowu Wu, Zheng Liu, Huiqin Jiang:
Infrared and Visible Image Fusion Based on Deep Decomposition Network and Saliency Analysis. 3314-3326 - Wei Huang, Siyuan Zhang, Peng Zhang, Yufei Zha, Yuming Fang, Yanning Zhang:
Identity-Aware Facial Expression Recognition Via Deep Metric Learning Based on Synthesized Images. 3327-3339 - Congxuan Zhang, Zhongkai Zhou, Zhen Chen, Weiming Hu, Ming Li, Shaofeng Jiang:
Self-Attention-Based Multiscale Feature Learning Optical Flow With Occlusion Feature Map Prediction. 3340-3354 - Krishna Somandepalli, Rajat Hebbar, Shrikanth Narayanan:
Robust Character Labeling in Movie Videos: Data Resources and Self-Supervised Feature Adaptation. 3355-3368 - Jianyu Wang, Bing-Kun Bao, Changsheng Xu:
DualVGR: A Dual-Visual Graph Reasoning Unit for Video Question Answering. 3369-3380 - Junmei Hao, Yujie Dun, Guoshuai Zhao, Yuxia Wu, Xueming Qian:
Annular-Graph Attention Model for Personalized Sequential Recommendation. 3381-3391 - Zheng Zhang, Xunguang Wang, Guangming Lu, Fumin Shen, Lei Zhu:
Targeted Attack of Deep Hashing Via Prototype-Supervised Adversarial Networks. 3392-3404 - Ninglin Ouyang, Qingbao Huang, Pijian Li, Yi Cai, Bin Liu, Ho-fung Leung, Qing Li:
Suppressing Biased Samples for Robust VQA. 3405-3415 - Chenhui Li, Peiying Zhang, Changbo Wang:
Harmonious Textual Layout Generation Over Natural Images via Deep Aesthetics Learning. 3416-3428 - Feng Ding, Guopu Zhu, Yingcan Li, Xinpeng Zhang, Pradeep K. Atrey, Siwei Lyu:
Anti-Forensics for Face Swapping Videos via Adversarial Training. 3429-3441 - Pablo Pérez, Lucjan Janowski, Narciso García, Margaret H. Pinson:
Subjective Assessment Experiments That Recruit Few Observers With Repetitions (FOWR). 3442-3454 - Peiguang Li, Xian Sun, Hongfeng Yu, Yu Tian, Fanglong Yao, Guangluan Xu:
Entity-Oriented Multi-Modal Alignment and Fusion Network for Fake News Detection. 3455-3468 - Dawei Zhou, Nannan Wang, Chunlei Peng, Yi Yu, Xi Yang, Xinbo Gao:
Towards Multi-Domain Face Synthesis Via Domain-Invariant Representations and Multi-Level Feature Parts. 3469-3479 - Sefik Emre Eskimez, You Zhang, Zhiyao Duan:
Speech Driven Talking Face Generation From a Single Image and an Emotion Condition. 3480-3490 - Xiaobin Tan, Lei Xu, Jiawei Ni, Simin Li, Xiaofeng Jiang, Quan Zheng:
Game Theory Based Dynamic Adaptive Video Streaming for Multi-Client Over NDN. 3491-3505 - Yifan Zuo, Hao Wang, Yuming Fang, Xiaoshui Huang, Xiwu Shang, Qiang Wu:
MIG-Net: Multi-Scale Network Alternatively Guided by Intensity and Gradient Features for Depth Map Super-Resolution. 3506-3519 - Shengsheng Qian, Dizhan Xue, Quan Fang, Changsheng Xu:
Adaptive Label-Aware Graph Convolutional Networks for Cross-Modal Retrieval. 3520-3532 - Dae Ha Kim, Byung Cheol Song:
Deep Metric Learning With Manifold Class Variability Analysis. 3533-3544 - Changchong Sheng, Xinzhong Zhu, Huiying Xu, Matti Pietikäinen, Li Liu:
Adaptive Semantic-Spatio-Temporal Graph Convolutional Network for Lip Reading. 3545-3557 - Rentao Wan, Jinjia Zhou, Bowen Huang, Hui Zeng, Yibo Fan:
APMC: Adjacent Pixels Based Measurement Coding System for Compressively Sensed Images. 3558-3569 - Tiesong Zhao, Yuting Lin, Yiwen Xu, Weiling Chen, Zhou Wang:
Learning-Based Quality Assessment for Image Super-Resolution. 3570-3581 - Joseph P. Robinson, Zaid Khan, Yu Yin, Ming Shao, Yun Fu:
Families in Wild Multimedia: A Multimodal Database for Recognizing Kinship. 3582-3594 - Heinz Hofbauer, Florent Autrusseau, Andreas Uhl:
Low Quality and Recognition of Image Content. 3595-3610 - Shu Yang, Yaowei Wang, Ke Chen, Wei Zeng, Zesong Fei:
Attribute-Aware Feature Encoding for Object Recognition and Segmentation. 3611-3623 - Weining Wang, Tianwei Lin, Dongliang He, Fu Li, Shilei Wen, Liang Wang, Jing Liu:
Semi-Supervised Temporal Action Proposal Generation via Exploiting 2-D Proposal Map. 3624-3635 - Xin Jin, Cuiling Lan, Wenjun Zeng, Zhibo Chen:
Style Normalization and Restitution for Domain Generalization and Adaptation. 3636-3651 - Ziwei Xu, Guangzhi Wang, Yongkang Wong, Mohan S. Kankanhalli:
Relation-Aware Compositional Zero-Shot Learning for Attribute-Object Pair Recognition. 3652-3664 - Lu Wang, Masoumeh Zareapoor, Jie Yang, Zhonglong Zheng:
Asymmetric Correlation Quantization Hashing for Cross-Modal Retrieval. 3665-3678 - Cong Chen, Shouyang Dong, Ye Tian, Kunlin Cao, Li Liu, Yuanhao Guo:
Temporal Self-Ensembling Teacher for Semi-Supervised Object Detection. 3679-3692 - Jialian Wu, Liangchen Song, Qian Zhang, Ming Yang, Junsong Yuan:
ForestDet: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation. 3693-3705 - Weisi Lin, Gheorghita Ghinea:
Progress and Opportunities in Modelling Just-Noticeable Difference (JND) for Multimedia. 3706-3721 - Yeyao Chen, Gangyi Jiang, Zhidi Jiang, Mei Yu, Yo-Sung Ho:
Deep Light Field Super-Resolution Using Frequency Domain Analysis and Semantic Prior. 3722-3737 - Haiyong Xu, Gangyi Jiang, Mei Yu, Zhongjie Zhu, Yongqiang Bai, Yang Song, Huifang Sun:
Tensor Product and Tensor-Singular Value Decomposition Based Multi-Exposure Fusion of Images. 3738-3753 - Yongxu Liu, Jinjian Wu, Aobo Li, Leida Li, Weisheng Dong, Guangming Shi, Weisi Lin:
Video Quality Assessment With Serial Dependence Modeling. 3754-3768 - Chaoyan Huang, Michael K. Ng, Tingting Wu, Tieyong Zeng:
Quaternion-Based Dictionary Learning and Saturation-Value Total Variation Regularization for Color Image Restoration. 3769-3781 - Zhongqi Wu, Chuanqing Zhuang, Jian Shi, Jianwei Guo, Jun Xiao, Xiaopeng Zhang, Dong-Ming Yan:
Single-Image Specular Highlight Removal via Real-World Dataset Construction. 3782-3793 - T. Janani, M. Brindha:
SEcure Similar Image Matching (SESIM): An Improved Privacy Preserving Image Retrieval Protocol over Encrypted Cloud Database. 3794-3806 - Yupeng Cheng, Qing Guo, Felix Juefei-Xu, Shang-Wei Lin, Wei Feng, Weisi Lin, Yang Liu:
Pasadena: Perceptually Aware and Stealthy Adversarial Denoise Attack. 3807-3822 - Hwanbok Mun, Gang-Joon Yoon, Jinjoo Song, Sang Min Yoon:
Texture Preserving Photo Style Transfer Network. 3823-3834 - Haoyuan Zhang, Lap-Pui Chau, Danwei Wang:
Soft Warping Based Unsupervised Domain Adaptation for Stereo Matching. 3835-3846 - Di Ma, Fan Zhang, David R. Bull:
BVI-DVC: A Training Database for Deep Video Compression. 3847-3858 - Yingxue Pang, Jianxin Lin, Tao Qin, Zhibo Chen:
Image-to-Image Translation: Methods and Applications. 3859-3881 - Xiaoqian Zhang, Zhen Tan, Huaijiang Sun, Zungang Wang, Mingwei Qin:
Orthogonal Low-Rank Projection Learning for Robust Image Feature Extraction. 3882-3895 - Pan Xie, Mengyi Zhao, Xiaohui Hu:
PiSLTRc: Position-Informed Sign Language Transformer With Content-Aware Convolution. 3908-3919 - Yongqiang Tang, Yuan Xie, Chenyang Zhang, Wensheng Zhang:
Constrained Tensor Representation Learning for Multi-View Semi-Supervised Subspace Clustering. 3920-3933 - Xiaoning Liu, Hui Li, Ce Zhu:
Joint Contrast Enhancement and Exposure Fusion for Real-World Image Dehazing. 3934-3946 - Bohong Yang, Wu Ran, Lin Wang, Hong Lu, Yi-Ping Phoebe Chen:
Multi-Classes and Motion Properties for Concurrent Visual SLAM in Dynamic Environments. 3947-3960 - Yujie Huang, Yuhao Liu, Ming-e Jing, Xiaoyang Zeng, Yibo Fan:
Tear the Image Into Strips for Style Transfer. 3978-3988 - Xueqi Ma, Weifeng Liu, Qi Tian, Yue Gao:
Learning Representation on Optimized High-Order Manifold for Visual Classification. 3989-4001 - Jialu Huang, Jing Liao, Zhifeng Tan, Sam Kwong:
Multi-Density Sketch-to-Image Translation Network. 4002-4015 - Haiwei Wu, Jiantao Zhou, Yuanman Li:
Deep Generative Model for Image Inpainting With Local Binary Pattern Learning and Spatial Attention. 4016-4027 - Xueying Wang, Yudong Guo, Zhongqi Yang, Juyong Zhang:
Prior-Guided Multi-View 3D Head Reconstruction. 4028-4040 - Ming Zeng, Yinglin Zheng, Jinpeng Lin, Xuan Cheng, Jing Liao, Zizhao Wu, Wenjin Deng:
Controllable Facial Caricaturization With Localized Deformation and Personalized Semantic Attentions. 4041-4053 - Yongyong Chen, Shuqin Wang, Xiaolin Xiao, Youfa Liu, Zhongyun Hua, Yicong Zhou:
Self-Paced Enhanced Low-Rank Tensor Kernelized Multi-View Subspace Clustering. 4054-4066 - Shuning Chang, Yanchao Li, Shengmei Shen, Jiashi Feng, Steven Zhiying Zhou:
Contrastive Attention for Video Anomaly Detection. 4067-4076 - Bing Li, Yuanlue Zhu, Yitong Wang, Chia-Wen Lin, Bernard Ghanem, Linlin Shen:
AniGAN: Style-Guided Generative Adversarial Networks for Unsupervised Anime Face Generation. 4077-4091 - Zikang Yuan, Ken Cheng, Jinhui Tang, Xin Yang:
RGB-D DSO: Direct Sparse Odometry With RGB-D Cameras for Indoor Scenes. 4092-4101 - Bo Zhang, Tao Chen, Bin Wang, Ruoyao Li:
Joint Distribution Alignment via Adversarial Learning for Domain Adaptive Object Detection. 4102-4112 - Xibin Song, Dingfu Zhou, Wei Li, Yuchao Dai, Liu Liu, Hongdong Li, Ruigang Yang, Liangjun Zhang:
WAFP-Net: Weighted Attention Fusion Based Progressive Residual Learning for Depth Map Super-Resolution. 4113-4127 - Yanxiang Chen, Pengcheng Zhao, Meibin Qi, Yang Zhao, Wei Jia, Ronggang Wang:
Audio Matters in Video Super-Resolution by Implicit Semantic Guidance. 4128-4142 - Yixin Mei, Li Li, Zhu Li, Fan Li:
Learning-Based Scalable Image Compression With Latent-Feature Reuse and Prediction. 4143-4157 - Zhizheng Zhang, Cuiling Lan, Wenjun Zeng, Zhibo Chen, Shih-Fu Chang:
Beyond Triplet Loss: Meta Prototypical N-Tuple Loss for Person Re-identification. 4158-4169 - Dongjing Wang, Xin Zhang, Yao Wan, Dongjin Yu, Guandong Xu, Shuiguang Deng:
Modeling Sequential Listening Behaviors With Attentive Temporal Point Process for Next and Next New Music Recommendation. 4170-4182 - Chengxiang Yin, Jian Tang, Tongtong Yuan, Zhiyuan Xu, Yanzhi Wang:
Bridging the Gap Between Semantic Segmentation and Instance Segmentation. 4183-4196 - Fu-Zhao Ou, Yuan-Gen Wang, Jin Li, Guopu Zhu, Sam Kwong:
A Novel Rank Learning Based No-Reference Image Quality Assessment Method. 4197-4211 - Jing Liu, Ziwen Yang, Yuting Su, Xiaokang Yang:
TANet: Target Attention Network for Video Bit-Depth Enhancement. 4212-4223 - Xinyue Huo, Lingxi Xie, Longhui Wei, Xiaopeng Zhang, Xin Chen, Hao Li, Zijie Yang, Wengang Zhou, Houqiang Li, Qi Tian:
Heterogeneous Contrastive Learning: Encoding Spatial Information for Compact Visual Representations. 4224-4235 - Zhaojian Yao, Luping Wang:
Boundary Information Progressive Guidance Network for Salient Object Detection. 4236-4249 - Pengyu Xie, Xin Xu, Zheng Wang, Toshihiko Yamasaki:
Sampling and Re-Weighting: Towards Diverse Frame Aware Unsupervised Video Person Re-Identification. 4250-4261 - Zhiwei Hao, Yong Luo, Zhi Wang, Han Hu, Jianping An:
CDFKD-MFS: Collaborative Data-Free Knowledge Distillation via Multi-Level Feature Sharing. 4262-4274 - Ling Lo, Hong-Xia Xie, Hong-Han Shuai, Wen-Huang Cheng:
Facial Chirality: From Visual Self-Reflection to Robust Facial Feature Learning. 4275-4284 - Jiaxing Chen, Wei-Shi Zheng, Qize Yang, Jingke Meng, Richang Hong, Qi Tian:
Deep Shape-Aware Person Re-Identification for Overcoming Moderate Clothing Changes. 4285-4300 - Nanfeng Jiang, Weiling Chen, Yuting Lin, Tiesong Zhao, Chia-Wen Lin:
Underwater Image Enhancement With Lightweight Cascaded Network. 4301-4313 - Jinxiang Liu, Yangheng Zhao, Siheng Chen, Ya Zhang:
A 3D Mesh-Based Lifting-and-Projection Network for Human Pose Transfer. 4314-4327 - Rumeng Yi, Yaping Huang:
TC-Net: Detecting Noisy Labels Via Transform Consistency. 4328-4341 - Minsu Kim, Joanna Hong, Se Jin Park, Yong Man Ro:
CroMM-VSR: Cross-Modal Memory Augmented Visual Speech Recognition. 4342-4355 - Jun Peng, Yiyi Zhou, Xiaoshuai Sun, Liujuan Cao, Yongjian Wu, Feiyue Huang, Rongrong Ji:
Knowledge-Driven Generative Adversarial Network for Text-to-Image Synthesis. 4356-4366 - Selvarajah Thuseethan, Sutharshan Rajasegarar, John Yearwood:
Deep Continual Learning for Emerging Emotion Recognition. 4367-4380 - Xia Du, Chi-Man Pun:
Robust Audio Patch Attacks Using Physical Sample Simulation and Adversarial Patch Noise Generation. 4381-4393 - Lianbo Zhang, Shaoli Huang, Wei Liu:
Enhancing Mixture-of-Experts by Leveraging Attention for Fine-Grained Recognition. 4409-4421 - Minjie Ren, Xiangdong Huang, Wenhui Li, Dan Song, Weizhi Nie:
LR-GCN: Latent Relation-Aware Graph Convolutional Network for Conversational Emotion Recognition. 4422-4432 - Shengeng Tang, Dan Guo, Richang Hong, Meng Wang:
Graph-Based Multimodal Sequential Embedding for Sign Language Translation. 4433-4445 - Ashek Ahmmed, M. Manzur Murshed, Manoranjan Paul, David Taubman:
A Commonality Modeling Framework for Enhanced Video Coding Leveraging on the Cuboidal Partitioning Based Representation of Frames. 4446-4457 - Deming Wang, Guangliang Zhou, Yi Yan, Huiyi Chen, Qijun Chen:
GeoPose: Dense Reconstruction Guided 6D Object Pose Estimation With Geometric Consistency. 4394-4408 - Qianwen Cao, Heyan Huang:
Attention Guided Relation Detection Approach for Video Visual Relation Detection. 3896-3907 - Wei Hu, Jiahao Pang, Xianming Liu, Dong Tian, Chia-Wen Lin, Anthony Vetro:
Graph Signal Processing for Geometric Data and Beyond: Theory and Applications. 3961-3977 - Xiaoming Huang, Yu-Jin Zhang:
Fast Video Saliency Detection via Maximally Stable Region Motion and Object Repeatability. 4458-4470 - Weizhi Nie, Rihao Chang, Minjie Ren, Yuting Su, Anan Liu:
I-GCN: Incremental Graph Convolution Network for Conversation Emotion Detection. 4471-4481 - Zhengxu Yu, Yilun Zhao, Bin Hong, Zhongming Jin, Jianqiang Huang, Deng Cai, Xian-Sheng Hua:
Apparel-Invariant Feature Learning for Person Re-Identification. 4482-4492 - Lingling Gao, Yanli Ji, Kumie Gedamu, Xiaofeng Zhu, Xing Xu, Heng Tao Shen:
View-Invariant Human Action Recognition Via View Transformation Network (VTN). 4493-4503 - Li Li, Zhu Li, Shan Liu, Houqiang Li:
Motion Estimation and Coding Structure for Inter-Prediction of LiDAR Point Cloud Geometry. 4504-4513 - Xin Wei, Yingying Shi, Liang Zhou:
Haptic Signal Reconstruction for Cross-Modal Communications. 4514-4525 - Siyang Deng, Gang Xiang, Quanxue Gao, Wei Xia, Xinbo Gao:
Zero-Shot Learning Based on Quality-Verifying Adversarial Network. 4526-4537 - Jiansong Zhang, Kejiang Chen, Chuan Qin, Weiming Zhang, Nenghai Yu:
Distribution-Preserving-Based Automatic Data Augmentation for Deep Image Steganalysis. 4538-4550