ACM Transactions on Multimedia Computing, Communications, and Applications, Volume 19
Volume 19, Number 1, January 2023
- Xuan Shao, Ying Shen, Lin Zhang, Shengjie Zhao, Dandan Zhu, Yicong Zhou:
SLAM for Indoor Parking: A Comprehensive Benchmark Dataset and a Tightly Coupled Semantic Framework. 1:1-1:23
- Prasen Kumar Sharma, Ira Bisht, Arijit Sur:
Wavelength-based Attributed Deep Neural Network for Underwater Image Restoration. 2:1-2:23
- Jie Li, Ling Han, Chong Zhang, Qiyue Li, Zhi Liu:
Spherical Convolution Empowered Viewport Prediction in 360 Video Multicast with Limited FoV Feedback. 3:1-3:23
- Thi Ngoc Hanh Le, Chih-Kuo Yeh, Ying-Chi Lin, Tong-Yee Lee:
Animating Still Natural Images Using Warping. 4:1-4:24
- Lizhi Xiong, Xiao Han, Ching-Nung Yang, Zhihua Xia:
RDH-DES: Reversible Data Hiding over Distributed Encrypted-Image Servers Based on Secret Sharing. 5:1-5:19
- Peining Zhen, Shuqi Wang, Suming Zhang, Xiaotao Yan, Wei Wang, Zhigang Ji, Hai-Bao Chen:
Towards Accurate Oriented Object Detection in Aerial Images with Adaptive Multi-level Feature Fusion. 6:1-6:22
- Yue Song, Hao Tang, Nicu Sebe, Wei Wang:
Disentangle Saliency Detection into Cascaded Detail Modeling and Body Filling. 7:1-7:15
- Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, Chang Wen Chen:
Boosting Scene Graph Generation with Visual Relation Saliency. 8:1-8:17
- Jingwen Chen, Jianjie Luo, Yingwei Pan, Yehao Li, Ting Yao, Hongyang Chao, Tao Mei:
Boosting Vision-and-Language Navigation with Direction Guiding and Backtracing. 9:1-9:16
- Yunbo Rao, Ziqiang Yang, Shaoning Zeng, Qifeng Wang, Jiansu Pu:
Dual Projective Zero-Shot Learning Using Text Descriptions. 10:1-10:17
- Hang Yu, Chilam Cheang, Yanwei Fu, Xiangyang Xue:
Multi-view Shape Generation for a 3D Human-like Body. 11:1-11:22
- Weidong Chen, Guorong Li, Xinfeng Zhang, Shuhui Wang, Liang Li, Qingming Huang:
Weakly Supervised Text-based Actor-Action Video Segmentation by Clip-level Multi-instance Learning. 12:1-12:22
- Feihong Shen, Jun Liu:
Quantum Fourier Convolutional Network. 13:1-13:14
- Xiaotian Wu, Peng Yao:
Boolean-based Two-in-One Secret Image Sharing by Adaptive Pixel Grouping. 14:1-14:23
- Ashima Yadav, Dinesh Kumar Vishwakarma:
A Deep Multi-level Attentive Network for Multimodal Sentiment Analysis. 15:1-15:19
- Honghao Gao, Baobin Dai, Huaikou Miao, Xiaoxian Yang, Ramón J. Durán Barroso, Walayat Hussain:
A Novel GAPG Approach to Automatic Property Generation for Formal Verification: The GAN Perspective. 16:1-16:22
- Pengyi Zhang, Huanzhang Dou, Wenhu Zhang, Yuhan Zhao, Zequn Qin, Dongping Hu, Yi Fang, Xi Li:
A Large-Scale Synthetic Gait Dataset Towards in-the-Wild Simulation and Comparison Study. 17:1-17:23
- Wei Zhou, Zhiwu Xia, Peng Dou, Tao Su, Haifeng Hu:
Double Attention Based on Graph Attention Network for Image Multi-Label Classification. 18:1-18:23
- Xianlin Zhang, Mengling Shen, Xueming Li, Xiaojie Wang:
AABLSTM: A Novel Multi-task Based CNN-RNN Deep Model for Fashion Analysis. 19:1-19:18
- Deyin Liu, Lin Wu, Richang Hong, Zongyuan Ge, Jialie Shen, Farid Boussaïd, Mohammed Bennamoun:
Generative Metric Learning for Adversarially Robust Open-world Person Re-Identification. 20:1-20:19
- Shuo Wang, Huixia Ben, Yanbin Hao, Xiangnan He, Meng Wang:
Boosting Hyperspectral Image Classification with Dual Hierarchical Learning. 21:1-21:19
- Dayan Wu, Qi Dai, Bo Li, Weiping Wang:
Deep Uncoupled Discrete Hashing via Similarity Matrix Decomposition. 22:1-22:22
- Ming Cheung, Weiwei Sun, James She, Jiantao Zhou:
Social Network Analytic-Based Online Counterfeit Seller Detection using User Shared Images. 23:1-23:18
- Feihong Lu, Hang Chen, Kang Li, Qiliang Deng, Jian Zhao, Kaipeng Zhang, Hong Han:
Toward High-quality Face-Mask Occluded Restoration. 24:1-24:23
- Yajing Liu, Zhiwei Xiong, Ya Li, Yuning Lu, Xinmei Tian, Zheng-Jun Zha:
Category-Stitch Learning for Union Domain Generalization. 25:1-25:19
Volume 19, Number 1s, February 2023
- Claudio Ferrari, Federico Becattini, Leonardo Galteri, Alberto Del Bimbo:
(Compress and Restore)^N: A Robust Defense Against Adversarial Attacks on Image Classification. 26:1-26:16
- Yaguang Song, Xiaoshan Yang, Changsheng Xu:
Self-supervised Calorie-aware Heterogeneous Graph Networks for Food Recommendation. 27:1-27:23
- Feng Xue, Tian Yang, Kang Liu, Zikun Hong, Mingwei Cao, Dan Guo, Richang Hong:
LCSNet: End-to-end Lipreading with Channel-aware Feature Selection. 28:1-28:21
- Zilong Fu, Hongtao Xie, Shancheng Fang, Yuxin Wang, Mengting Xing, Yongdong Zhang:
Learning Pixel Affinity Pyramid for Arbitrary-Shaped Text Detection. 29:1-29:24
- João Baptista Cardia Neto, Claudio Ferrari, Aparecido Nilceu Marana, Stefano Berretti, Alberto Del Bimbo:
Learning Streamed Attention Network from Descriptor Images for Cross-Resolution 3D Face Recognition. 30:1-30:20
- Xin Huang:
On Teaching Mode of MTI Translation Workshop Based on IPT Corpus for Tibetan Areas of China. 31:1-31:16
- Liming Xu, Xianhua Zeng, Weisheng Li, Bochuan Zheng:
MFGAN: Multi-modal Feature-fusion for CT Metal Artifact Reduction Using GANs. 32:1-32:17
- Yuzhang Hu, Wenhan Yang, Jiaying Liu, Zongming Guo:
Deep Inter Prediction with Error-Corrected Auto-Regressive Network for Video Coding. 33:1-33:22
- Yue Li, Li Zhang, Kai Zhang:
iDAM: Iteratively Trained Deep In-loop Filter with Adaptive Model Selection. 34:1-34:22
- Rahul Kumar Jaiswal, Rajesh Kumar Dubey:
CAQoE: A Novel No-Reference Context-aware Speech Quality Prediction Metric. 35:1-35:23
- Tao Xiang, Honghong Zeng, Biwen Chen, Shangwei Guo:
BMIF: Privacy-preserving Blockchain-based Medical Image Fusion. 36:1-36:23
- Xiaoke Zhu, Changlong Li, Xiaopan Chen, Xinyu Zhang, Xiao-Yuan Jing:
Distance and Direction Based Deep Discriminant Metric Learning for Kinship Verification. 37:1-37:19
- Weiming Zhuang, Xin Gan, Yonggang Wen, Shuai Zhang:
Optimizing Performance of Federated Person Re-identification: Benchmarking and Analysis. 38:1-38:18
- Lavinia De Divitiis, Federico Becattini, Claudio Baecchi, Alberto Del Bimbo:
Disentangling Features for Fashion Recommendation. 39:1-39:21
- Ka-Hou Chan, Sio Kei Im:
Using Four Hypothesis Probability Estimators for CABAC in Versatile Video Coding. 40:1-40:17
- Mengqi Yuan, Bing-Kun Bao, Zhiyi Tan, Changsheng Xu:
Adaptive Text Denoising Network for Image Caption Editing. 41:1-41:18
- Xiaoyu Zhang, Wei Gao, Ge Li, Qiuping Jiang, Runmin Cong:
Image Quality Assessment-driven Reinforcement Learning for Mixed Distorted Image Restoration. 42:1-42:23
- Chongyang Bai, Maksim Bolonkin, Viney Regunath, V. S. Subrahmanian:
DIPS: A Dyadic Impression Prediction System for Group Interaction Videos. 43:1-43:24
- Yuqing Liu, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao:
Sequential Hierarchical Learning with Distribution Transformation for Image Super-Resolution. 44:1-44:21
- Haidong Wang, Xuan He, Zhiyong Li, Jin Yuan, Shutao Li:
JDAN: Joint Detection and Association Network for Real-Time Online Multi-Object Tracking. 45:1-45:17
- Mengyao Xiao, Xiaolong Li, Yao Zhao, Bin Ma, Guodong Guo:
A Novel Reversible Data Hiding Scheme Based on Pixel-Residual Histogram. 46:1-46:19
- Jiazhi Liu, Feng Liu:
Modified 2D-Ghost-Free Stereoscopic Display with Depth-of-Field Effects. 47:1-47:16
- Jingwen Chen, Yingwei Pan, Yehao Li, Ting Yao, Hongyang Chao, Tao Mei:
Retrieval Augmented Convolutional Encoder-decoder Networks for Video Captioning. 48:1-48:24
- Guanyu Zhu, Yong Zhou, Rui Yao, Hancheng Zhu, Jiaqi Zhao:
Cyclic Self-attention for Point Cloud Recognition. 49:1-49:19
- Dinghao Yang, Wei Gao, Ge Li, Hui Yuan, Junhui Hou, Sam Kwong:
Exploiting Manifold Feature Representation for Efficient Classification of 3D Point Clouds. 50:1-50:21
Volume 19, Number 2, March 2023
- Xiaohan Lan, Yitian Yuan, Xin Wang, Zhi Wang, Wenwu Zhu:
A Survey on Temporal Sentence Grounding in Videos. 51:1-51:33
- Yu Qiao, Yuhao Liu, Ziqi Wei, Yuxin Wang, Qiang Cai, Guofeng Zhang, Xin Yang:
Hierarchical and Progressive Image Matting. 52:1-52:23
- Fei Peng, Wenyan Jiang, Min Long:
A Low Distortion and Steganalysis-resistant Reversible Data Hiding for 2D Engineering Graphics. 53:1-53:20
- Sijie Mai, Songlong Xing, Jiaxuan He, Ying Zeng, Haifeng Hu:
Multimodal Graph for Unaligned Multimodal Sequence Analysis via Graph Convolution and Graph Pooling. 54:1-54:24
- Qi Zheng, Jianfeng Dong, Xiaoye Qu, Xun Yang, Yabing Wang, Pan Zhou, Baolong Liu, Xun Wang:
Progressive Localization Networks for Language-Based Moment Localization. 55:1-55:21
- Yue Zhang, Fanghui Zhang, Yi Jin, Yigang Cen, Viacheslav V. Voronin, Shaohua Wan:
Local Correlation Ensemble with GCN Based on Attention Features for Cross-domain Person Re-ID. 56:1-56:22
- Jacob Chakareski, Mahmudur Khan, Tanguy Ropitault, Steve Blandino:
Millimeter Wave and Free-space-optics for Future Dual-connectivity 6DOF Mobile Multi-user VR Streaming. 57:1-57:25
- Yun-Shao Lin, Yi-Ching Liu, Chi-Chun Lee:
An Interaction-process-guided Framework for Small-group Performance Prediction. 58:1-58:25
- Na Zheng, Xuemeng Song, Tianyu Su, Weifeng Liu, Yan Yan, Liqiang Nie:
Egocentric Early Action Prediction via Adversarial Knowledge Distillation. 59:1-59:21
- Li Wang, Ke Li, Jingjing Tang, Yuying Liang:
Image Super-Resolution via Lightweight Attention-Directed Feature Aggregation Network. 60:1-60:23
- Jiaying Lin, Xin Tan, Ke Xu, Lizhuang Ma, Rynson W. H. Lau:
Frequency-aware Camouflaged Object Detection. 61:1-61:16
- Shuang Liang, Anjie Zhu, Jiasheng Zhang, Jie Shao:
Hyper-node Relational Graph Attention Network for Multi-modal Knowledge Graph Completion. 62:1-62:21
- Yaya Shi, Haiyang Xu, Chunfeng Yuan, Bing Li, Weiming Hu, Zheng-Jun Zha:
Learning Video-Text Aligned Representations for Video Captioning. 63:1-63:21
- Yang Yang, Yingqiu Ding, Ming Cheng, Weiming Zhang:
No-reference Quality Assessment for Contrast-distorted Images Based on Gray and Color-gray-difference Space. 64:1-64:20
- Jia Wang, Jingcheng Ke, Hong-Han Shuai, Yung-Hui Li, Wen-Huang Cheng:
Referring Expression Comprehension Via Enhanced Cross-modal Graph Attention Networks. 65:1-65:21
- Dengyong Zhang, Pu Huang, Xiangling Ding, Feng Li, Wenjie Zhu, Yun Song, Gaobo Yang:
L2BEC2: Local Lightweight Bidirectional Encoding and Channel Attention Cascade for Video Frame Interpolation. 66:1-66:19
- Yushu Zhang, Qing Tan, Shuren Qi, Mingfu Xue:
PRNU-based Image Forgery Localization with Deep Multi-scale Fusion. 67:1-67:20
- Shanshan Dong, Tian-Zi Niu, Xin Luo, Wu Liu, Xinshun Xu:
Semantic Embedding Guided Attention with Explicit Visual Feature Fusion for Video Captioning. 68:1-68:18
- Shunxin Xu, Ke Sun, Dong Liu, Zhiwei Xiong, Zheng-Jun Zha:
Synergy between Semantic Segmentation and Image Denoising via Alternate Boosting. 69:1-69:23
- Dan Song, Chu-Meng Zhang, Xiao-Qian Zhao, Teng Wang, Wei-Zhi Nie, Xuanya Li, An-An Liu:
Self-supervised Image-based 3D Model Retrieval. 70:1-70:18
- Stavros Nousias, Gerasimos Arvanitis, Aris S. Lalos, Konstantinos Moustakas:
Deep Saliency Mapping for 3D Meshes and Applications. 71:1-71:22
- Yun Liu, Xiaohua Yin, Zuliang Wan, Guanghui Yue, Zhi Zheng:
Toward A No-reference Omnidirectional Image Quality Evaluation by Using Multi-perceptual Features. 72:1-72:19
- Hua Wu, Xin Li, Gang Wang, Guang Cheng, Xiaoyan Hu:
Resolution Identification of Encrypted Video Streaming Based on HTTP/2 Features. 73:1-73:23
- Qipu Qin, Cheolkon Jung:
Quality Enhancement of Compressed 360-Degree Videos Using Viewport-based Deep Neural Networks. 74:1-74:19
- Wei Zhou, Zhiwu Xia, Peng Dou, Tao Su, Haifeng Hu:
Aligning Image Semantics and Label Concepts for Image Multi-Label Classification. 75:1-75:23
Volume 19, Number 3, May 2023
- Yi Zhang, Fang-Yi Chao, Wassim Hamidouche, Olivier Déforges:
PAV-SOD: A New Task towards Panoramic Audiovisual Saliency Detection. 101:1-101:26
- Chi Xie, Zikun Zhuang, Shengjie Zhao, Shuang Liang:
Temporal Dropout for Weakly Supervised Action Localization. 102:1-102:24
- Yangyang Guo, Liqiang Nie, Harry Cheng, Zhiyong Cheng, Mohan S. Kankanhalli, Alberto Del Bimbo:
On Modality Bias Recognition and Reduction. 103:1-103:22
- Kang Xu, Weixin Li, Xia Wang, Xiaoyan Hu, Ke Yan, Xiaojie Wang, Xuan Dong:
CUR Transformer: A Convolutional Unbiased Regional Transformer for Image Denoising. 104:1-104:22
- Wenxin Huang, Xuemei Jia, Xian Zhong, Xiao Wang, Kui Jiang, Zheng Wang:
Beyond the Parts: Learning Coarse-to-Fine Adaptive Alignment Representation for Person Search. 105:1-105:19
- Hongchuan Yu, Mengqing Huang, Jian-Jun Zhang:
Domain Adaptation Problem in Sketch Based Image Retrieval. 106:1-106:17
- Han Yan, Haijun Zhang, Jianyang Shi, Jianghong Ma, Xiaofei Xu:
Toward Intelligent Fashion Design: A Texture and Shape Disentangled Generative Adversarial Network. 107:1-107:23
- Peng Dou, Ying Zeng, Zhuoqun Wang, Haifeng Hu:
Multiple Temporal Pooling Mechanisms for Weakly Supervised Temporal Action Localization. 108:1-108:19
- Lei Li, Zhiyuan Zhou, Suping Wu, Yongrong Cao:
Multi-scale Edge-guided Learning for 3D Reconstruction. 109:1-109:24
- Zhengxue Wang, Guangwei Gao, Juncheng Li, Hui Yan, Hao Zheng, Huimin Lu:
Lightweight Feature De-redundancy and Self-calibration Network for Efficient Image Super-resolution. 110:1-110:15
- Zhijie Huang, Jun Sun, Xiaopeng Guo:
FastCNN: Towards Fast and Accurate Spatiotemporal Network for HEVC Compressed Video Enhancement. 111:1-111:22
- Xiaohan Wang, Linchao Zhu, Fei Wu, Yi Yang:
A Differentiable Parallel Sampler for Efficient Video Classification. 112:1-112:18
- Junjie Li, Jin Yuan, Zhiyong Li:
TP-FER: An Effective Three-phase Noise-tolerant Recognizer for Facial Expression Recognition. 113:1-113:17
- Baojin Huang, Zhongyuan Wang, Guangcheng Wang, Zhen Han, Kui Jiang:
Local Eyebrow Feature Attention Network for Masked Face Recognition. 114:1-114:19
- Bincheng Yang, Gangshan Wu:
Efficient Single-image Super-resolution Using Dual path Connections with Multiple scale Learning. 115:1-115:21
- Wei Zhou, Yanke Hou, Dihu Chen, Haifeng Hu, Tao Su:
Attention-Augmented Memory Network for Image Multi-Label Classification. 116:1-116:24
- Shuaixiong Hui, Qiang Guo, Xiaoyu Geng, Caiming Zhang:
Multi-Guidance CNNs for Salient Object Detection. 117:1-117:19
- Kai Xing, Tao Li, Xuanhan Wang:
ProposalVLAD with Proposal-Intra Exploring for Temporal Action Proposal Generation. 118:1-118:18
- Hao Tang, Lei Ding, Songsong Wu, Bin Ren, Nicu Sebe, Paolo Rota:
Deep Unsupervised Key Frame Extraction for Efficient Video Classification. 119:1-119:17
- Ling Zhang, Chengjiang Long, Xiaolong Zhang, Chunxia Xiao:
Exploiting Residual and Illumination with GANs for Shadow Detection and Shadow Removal. 120:1-120:22
- Yushu Zhang, Nuo Chen, Shuren Qi, Mingfu Xue, Zhongyun Hua:
Detection of Recolored Image by Texture Features in Chrominance Components. 121:1-121:23
- Han Xue, Jun Ling, Anni Tang, Li Song, Rong Xie, Wenjun Zhang:
High-Fidelity Face Reenactment Via Identity-Matched Correspondence Learning. 122:1-122:23
- Haozhe Chen, Hang Zhou, Jie Zhang, Dongdong Chen, Weiming Zhang, Kejiang Chen, Gang Hua, Nenghai Yu:
Perceptual Hashing of Deep Convolutional Neural Networks for Model Copy Detection. 123:1-123:20
- Wei Duan, Yi Yu, Xulong Zhang, Suhua Tang, Wei Li, Keizo Oyama:
Melody Generation from Lyrics with Local Interpretability. 124:1-124:21
- Shiguang Liu, Huixin Wang:
Talking Face Generation via Facial Anatomy. 125:1-125:19
Volume 19, Number 4, July 2023
- Xuehu Yan, Longlong Li, Lei Sun, Jia Chen, Shudong Wang:
Fake and Dishonest Participant Immune Secret Image Sharing. 139:1-139:26
- Song Yang, Qiang Li, Wenhui Li, Xuanya Li, Ran Jin, Bo Lv, Rui Wang, Anan Liu:
Semantic Completion and Filtration for Image-Text Retrieval. 140:1-140:20
- Xuan Ma, Xiaoshan Yang, Changsheng Xu:
Multi-Source Knowledge Reasoning Graph Network for Multi-Modal Commonsense Inference. 141:1-141:17
- Shangxi Wu, Jitao Sang, Kaiyuan Xu, Jiaming Zhang, Jian Yu:
Attention, Please! Adversarial Defense via Activation Rectification and Preservation. 142:1-142:18
- Kan Wang, Changxing Ding, Jianxin Pang, Xiangmin Xu:
Context Sensing Attention Network for Video-based Person Re-identification. 143:1-143:20
- Wenjing Wang, Lilang Lin, Zejia Fan, Jiaying Liu:
Semi-supervised Learning for Mars Imagery Classification and Segmentation. 144:1-144:23
- Hui Liu, Shanshan Li, Jicheng Zhu, Kai Deng, Meng Liu, Liqiang Nie:
DDIFN: A Dual-discriminator Multi-modal Medical Image Fusion Network. 145:1-145:17
- Xintian Wu, Huanyu Wang, Yiming Wu, Xi Li:
D3T-GAN: Data-Dependent Domain Transfer GANs for Image Generation with Limited Data. 146:1-146:20
- Dandan Zhu, Xuan Shao, Qiangqiang Zhou, Xiongkuo Min, Guangtao Zhai, Xiaokang Yang:
A Novel Lightweight Audio-visual Saliency Model for Videos. 147:1-147:22
- Amr Abdussalam, Zhongfu Ye, Ammar Hawbani, Majjed Al-Qatf, Rashid Khan:
NumCap: A Number-controlled Multi-caption Image Captioning Network. 148:1-148:24
- Hao Liu, Zhaoyu Yan, Bing Liu, Jiaqi Zhao, Yong Zhou, Abdulmotaleb El-Saddik:
Distilled Meta-learning for Multi-Class Incremental Learning. 149:1-149:16
- Jin Yuan, Shikai Chen, Yao Zhang, Zhongchao Shi, Xin Geng, Jianping Fan, Yong Rui:
Graph Attention Transformer Network for Multi-label Image Classification. 150:1-150:16
- Guojia Hou, Yuxuan Li, Huan Yang, Kunqian Li, Zhenkuan Pan:
UID2021: An Underwater Image Dataset for Evaluation of No-Reference Quality Assessment Metrics. 151:1-151:24
Volume 19, Number 5, September 2023
- Niklas Carlsson, Derek L. Eager:
Cross-User Similarities in Viewing Behavior for 360° Video and Caching Implications. 152:1-152:24
- Ziqiang Li, Pengfei Xia, Xue Rui, Bin Li:
Exploring the Effect of High-frequency Components in GANs Training. 153:1-153:22
- Haibing Yin, Hongkui Wang, Li Yu, Junhui Liang, Guangtao Zhai:
Feedforward and Feedback Modulations Based Foveated JND Estimation for Images. 154:1-154:23
- Taocun Yang, Yaping Huang, Yanlin Xie, Junbo Liu, Shengchun Wang:
MixOOD: Improving Out-of-distribution Detection with Enhanced Data Mixup. 155:1-155:18
- Hao Wei, Rui Chen:
A Multi-Level Consistency Network for High-Fidelity Virtual Try-On. 156:1-156:18
- Jiachang Hao, Haifeng Sun, Pengfei Ren, Yiming Zhong, Jingyu Wang, Qi Qi, Jianxin Liao:
Fine-Grained Text-to-Video Temporal Grounding from Coarse Boundary. 157:1-157:21
- Weixin Li, Tiantian Cao, Chang Liu, Xue Tian, Ya Li, Xiaojie Wang, Xuan Dong:
Dual-Lens HDR using Guided 3D Exposure CNN and Guided Denoising Transformer. 158:1-158:20
- Xin Yang, Hengrui Li, Xiaochuan Li, Tao Li:
HIFGAN: A High-Frequency Information-Based Generative Adversarial Network for Image Super-Resolution. 159:1-159:19
- Yang Li:
Detection of Moving Object Using Superpixel Fusion Network. 160:1-160:15
- Yingwei Pan, Yehao Li, Ting Yao, Tao Mei:
Bottom-up and Top-down Object Inference Networks for Image Captioning. 161:1-161:18
- Duoduo Feng, Xiangteng He, Yuxin Peng:
MKVSE: Multimodal Knowledge Enhanced Visual-semantic Embedding for Image-text Retrieval. 162:1-162:21
- Mengyi Zhao, Hao Tang, Pan Xie, Shuling Dai, Nicu Sebe, Wei Wang:
Bidirectional Transformer GAN for Long-term Human Motion Prediction. 163:1-163:19
- Jian Wang, Qiang Ling, Peiyan Li:
Robust Video Stabilization based on Motion Decomposition. 164:1-164:24
Volume 19, Number 2s, April 2023
- Summaira Jabeen, Xi Li, Amin Muhammad Shoib, Bourahla Omar, Songyuan Li, Abdul Jabbar:
A Review on Methods and Applications in Multimodal Deep Learning. 76:1-76:41
- Sophie C. C. Sun, Yongkang Zhao, Fang-Wei Fu, YaWei Ren:
Improved Random Grid-based Cheating Prevention Visual Cryptography Using Latin Square. 77:1-77:21
- Jiong Dong, Kaoru Ota, Mianxiong Dong:
Video Frame Interpolation: A Comprehensive Survey. 78:1-78:31
- Gaofeng Cao, Fei Zhou, Kanglin Liu, Anjie Wang, Leidong Fan:
A Decoupled Kernel Prediction Network Guided by Soft Mask for Single Image HDR Reconstruction. 79:1-79:23
- Yipeng Liu, Qi Yang, Yiling Xu, Le Yang:
Point Cloud Quality Assessment: Dataset Construction and Learning-based No-reference Metric. 80:1-80:26
- Cheng Xu, Zejun Chen, Jiajie Mai, Xuemiao Xu, Shengfeng He:
Pose- and Attribute-consistent Person Image Synthesis. 81:1-81:21
- Jae Hyun Park, Sanghoon Kim, Joo Chan Lee, Jong Hwan Ko:
Scalable Color Quantization for Task-centric Image Compression. 82:1-82:18
- Joan Manuel Marquès Puig, Helena Rifà-Pous, Samia Oukemeni:
From False-Free to Privacy-Oriented Communitarian Microblogging Social Networks. 83:1-83:23
- Yiming Tang, Yi Yu:
Query-Guided Prototype Learning with Decoder Alignment and Dynamic Fusion in Few-Shot Segmentation. 84:1-84:20
- Zhiming Liu, Kai Niu, Zhiqiang He:
ML-CookGAN: Multi-Label Generative Adversarial Network for Food Image Generation. 85:1-85:21
- Basheer Alwaely, Charith Abhayaratne:
GHOSM: Graph-based Hybrid Outline and Skeleton Modelling for Shape Recognition. 86:1-86:23
- Sankaraganesh Jonna, Moushumi Medhi, Rajiv Ranjan Sahay:
Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection. 87:1-87:26
- Xuewei Ding, Yingwei Pan, Yehao Li, Ting Yao, Dan Zeng, Tao Mei:
Boosting Relationship Detection in Images with Multi-Granular Self-Supervised Learning. 88:1-88:18
- Binfei Chu, Yiting Lin, Bineng Zhong, Zhenjun Tang, Xianxian Li, Jing Wang:
Robust Long-Term Tracking via Localizing Occluders. 89:1-89:15
- Huisi Wu, Zhaoze Wang, Zhuoying Li, Zhenkun Wen, Jing Qin:
Context Prior Guided Semantic Modeling for Biomedical Image Segmentation. 90:1-90:19
- Jun Wu, Tianliang Zhu, Jiahui Zhu, Tianyi Li, Chunzhi Wang:
A Optimized BERT for Multimodal Sentiment Analysis. 91:1-91:12
- Yongzong Xu, Zhijing Yang, Tianshui Chen, Kai Li, Chunmei Qing:
Progressive Transformer Machine for Natural Character Reenactment. 92:1-92:22
- Chong Hong Tan, KokSheik Wong, Vishnu Monn Baskaran, Kiki Adhinugraha, David Taniar:
Is it Violin or Viola? Classifying the Instruments' Music Pieces using Descriptive Statistics. 93:1-93:22
- Kedar Nath Singh, Om Prakash Singh, Amit Kumar Singh, Amrit Kumar Agrawal:
EiMOL: A Secure Medical Image Encryption Algorithm based on Optimization and the Lorenz System. 94:1-94:19
- Ziteng Qiao, Dianxi Shi, Xiaodong Yi, Yanyan Shi, Yuhui Zhang, Yangyang Liu:
UEFPN: Unified and Enhanced Feature Pyramid Networks for Small Object Detection. 95:1-95:21
- Linwei Zhu, Yun Zhang, Na Li, Gangyi Jiang, Sam Kwong:
Deep Learning-Based Intra Mode Derivation for Versatile Video Coding. 96:1-96:20
- Donghuo Zeng, Jianming Wu, Gen Hattori, Rong Xu, Yi Yu:
Learning Explicit and Implicit Dual Common Subspaces for Audio-visual Cross-modal Retrieval. 97:1-97:23
- Qiqi Gao, Jie Li, Tiejun Zhao, Yadong Wang:
Real-time Image Enhancement with Attention Aggregation. 98:1-98:19
- Yucheng Zhu, Xiongkuo Min, Dandan Zhu, Guangtao Zhai, Xiaokang Yang, Wenjun Zhang, Ke Gu, Jiantao Zhou:
Toward Visual Behavior and Attention Understanding for Augmented 360 Degree Videos. 99:1-99:24
- Haiyang Mei, Letian Yu, Ke Xu, Yang Wang, Xin Yang, Xiaopeng Wei, Rynson W. H. Lau:
Mirror Segmentation via Semantic-aware Contextual Contrasted Feature Learning. 100:1-100:22
Volume 19, Number 3s, June 2023
- ZengRi Zeng, Baokang Zhao, Han-Chieh Chao, Ilsun You, Kuo-Hui Yeh, Weizhi Meng:
Towards Intelligent Attack Detection Using DNA Computing. 126:1-126:27
- Jinxia Wang, Rui Chen, Zhihan Lv:
DNA Computing-Based Multi-Source Data Storage Model in Digital Twins. 127:1-127:16
- Fawad Ahmed, Muneeb Ur Rehman, Jawad Ahmad, Muhammad Shahbaz Khan, Wadii Boulila, Gautam Srivastava, Jerry Chun-Wei Lin, William J. Buchanan:
A DNA Based Colour Image Encryption Scheme Using A Convolutional Autoencoder. 128:1-128:21
- Vignesh V. Menon, Hadi Amirpour, Mohammad Ghanbari, Christian Timmerer:
EMES: Efficient Multi-encoding Schemes for HEVC-based Adaptive Bitrate Streaming. 129:1-129:20
- Jiwei Zhang, Yi Yu, Suhua Tang, Jianming Wu, Wei Li:
Variational Autoencoder with CCA for Audio-Visual Cross-modal Retrieval. 130:1-130:21
- Thi Ngoc Hanh Le, Ya-Hsuan Chen, Tong-Yee Lee:
Structure-aware Video Style Transfer with Map Art. 131:1-131:25
- Sirui Zhao, Hongyu Jiang, Hanqing Tao, Rui Zha, Kun Zhang, Tong Xu, Enhong Chen:
PEDM: A Multi-task Learning Model for Persona-aware Emoji-embedded Dialogue Generation. 132:1-132:21
- Heyu Hung, Runmin Cong, Lianhe Yang, Ling Du, Cong Wang, Sam Kwong:
Feedback Chain Network for Hippocampus Segmentation. 133:1-133:18
- Xuanrong Yao, Xin Wang, Yue Liu, Wenwu Zhu:
Continual Recognition with Adaptive Memory Update. 134:1-134:15
- Jingyao Wang, Luntian Mou, Lei Ma, Tiejun Huang, Wen Gao:
AMSA: Adaptive Multimodal Learning for Sentiment Analysis. 135:1-135:21
- Shaoning Zeng, Yunbo Rao, Bob Zhang, Yong Xu:
Joint Augmented and Compressed Dictionaries for Robust Image Classification. 136:1-136:24
- Yuyang Wanyan, Xiaoshan Yang, Xuan Ma, Changsheng Xu:
Dual Scene Graph Convolutional Network for Motivation Prediction. 137:1-137:23
- Fei Lei, Zhongqi Cao, Yuning Yang, Yibo Ding, Cong Zhang:
Learning the User's Deeper Preferences for Multi-modal Recommendation Systems. 138:1-138:18
Volume 19, Number 5s, October 2023
- Pasi Fränti, Nancy Fazal:
Design Principles for Content Creation in Location-Based Games. 165:1-165:30
- Chenchi Zhang, Wenbo Ma, Jun Xiao, Hanwang Zhang, Jian Shao, Yueting Zhuang, Long Chen:
VL-NMS: Breaking Proposal Bottlenecks in Two-stage Visual-language Matching. 166:1-166:24
- Michal Mackowski, Piotr Brzoza, Mateusz Kawulok, Rafal Meisel, Dominik Spinczyk:
Multimodal Presentation of Interactive Audio-Tactile Graphics Supporting the Perception of Visual Information by Blind People. 167:1-167:22
- Xin Man, Jie Shao, Feiyu Chen, Mingxing Zhang, Heng Tao Shen:
TEVL: Trilinear Encoder for Video-language Representation Learning. 168:1-168:20
- Simone Ricci, Tiberio Uricchio, Alberto Del Bimbo:
Meta-learning Advisor Networks for Long-tail and Noisy Labels in Social Image Classification. 169:1-169:23
- Chen Li, Li Song, Rong Xie, Wenjun Zhang:
Local Bidirection Recurrent Network for Efficient Video Deblurring with the Fused Temporal Merge Module. 170:1-170:18
- Tian-Zi Niu, Zhen-Duo Chen, Xin Luo, Peng-Fei Zhang, Zi Huang, Xin-Shun Xu:
Video Captioning by Learning from Global Sentence and Looking Ahead. 171:1-171:20
- Yang Wang, Bo Dong, Ke Xu, Haiyin Piao, Yufei Ding, Baocai Yin, Xin Yang:
A Geometrical Approach to Evaluate the Adversarial Robustness of Deep Neural Networks. 172:1-172:17
- Suncheng Xiang, Dahong Qian, Mengyuan Guan, Binjie Yan, Ting Liu, Yuzhuo Fu, Guanjie You:
Less Is More: Learning from Synthetic Data with Fine-Grained Attributes for Person Re-Identification. 173:1-173:20
- Matti Siekkinen, Teemu Kämäräinen:
Neural Network Assisted Depth Map Packing for Compression Using Standard Hardware Video Codecs. 174:1-174:20
- Bianca Jansen Van Rensburg, Pauline Puteaux, William Puech, Jean-Pierre Pedeboy:
3D Object Watermarking from Data Hiding in the Homomorphic Encrypted Domain. 175:1-175:20
- Hao Liu, Xiaoshan Yang, Changsheng Xu:
Counterfactual Scenario-relevant Knowledge-enriched Multi-modal Emotion Reasoning. 176:1-176:25
- Melika Ayoughi, Pascal Mettes, Paul Groth:
Self-contained Entity Discovery from Captioned Videos. 177:1-177:21
Volume 19, Number 6, November 2023
- Wu Liu, Hailin Shi, Yunchao Wei, Dan Zeng, Nicu Sebe, Jiebo Luo:
Introduction to the Special Issue on Trustworthy Multimedia Computing and Applications in Urban Scenes. 211:1-211:4
- Zhuming Wang, Yaowen Xu, Lifang Wu, Hu Han, Yukun Ma, Zun Li:
Improving Face Anti-spoofing via Advanced Multi-perspective Feature Learning. 212:1-212:18
- Xiaolong Liu, Yang Yu, Xiaolong Li, Yao Zhao, Guodong Guo:
TCSD: Triple Complementary Streams Detector for Comprehensive Deepfake Detection. 213:1-213:22
- Hao Li, Jinwei Wang, Neal Xiong, Yi Zhang, Athanasios V. Vasilakos, Xiangyang Luo:
A Siamese Inverted Residuals Network Image Steganalysis Scheme based on Deep Learning. 214:1-214:23
- Jie Nie, Lei Huang, Chengyu Zheng, Xiaowei Lv, Rui Wang:
Cross-scale Graph Interaction Network for Semantic Segmentation of Remote Sensing Images. 185:1-185:18
- Zheming Xu, Lili Wei, Congyan Lang, Songhe Feng, Tao Wang, Adrian G. Bors, Hongzhe Liu:
SSR-Net: A Spatial Structural Relation Network for Vehicle Re-identification. 216:1-216:22
- Xingyu Gao, Jinyang Xie, Zhenyu Chen, An-An Liu, Zhenan Sun, Lei Lyu:
Dilated Convolution-based Feature Refinement Network for Crowd Localization. 217:1-217:16
- Xiaohan Lan, Yitian Yuan, Xin Wang, Long Chen, Zhi Wang, Lin Ma, Wenwu Zhu:
A Closer Look at Debiased Temporal Sentence Grounding in Videos: Dataset, Metric, and Approach. 218:1-218:23
- Weigang Zhang, Zhaobo Qi, Shuhui Wang, Chi Su, Li Su, Qingming Huang:
Temporal Dynamic Concept Modeling Network for Explainable Video Event Recognition. 219:1-219:22
- Ruoyu Chen, Jingzhi Li, Hua Zhang, Changchong Sheng, Li Liu, Xiaochun Cao:
Sim2Word: Explaining Similarity with Representative Attribute Words via Counterfactual Explanations. 220:1-220:22