default search action
Multimedia Systems, Volume 29
Volume 29, Number 1, February 2023
- Menghao Hu, Mingxuan Luo, Menghua Huang, Wenhua Meng, Baochen Xiong, Xiaoshan Yang, Jitao Sang:
Towards a multimodal human activity dataset for healthcare. 1-13 - Santosh Kumar Tripathy, Harsh Kostha, Rajeev Srivastava:
TS-MDA: two-stream multiscale deep architecture for crowd behavior prediction. 15-31 - Zijie Yang, Lingxi Xie, Wei Zhou, Xinyue Huo, Longhui Wei, Jian Lu, Qi Tian, Sheng Tang:
VoxSeP: semi-positive voxels assist self-supervised 3D medical segmentation. 33-48 - Hengyou Wang, Yanfei Song, Lianzhi Huo, Linlin Chen, Qiang He:
Multiscale object detection based on channel and data enhancement at construction sites. 49-58 - Weijia Liu, Jiuxin Cao, Yilin Zhu, Bo Liu, Xuelin Zhu:
Real-time anomaly detection on surveillance video with two-stream spatio-temporal generative model. 59-71 - R. Rashmi Adyapady, B. Annappa:
A comprehensive review of facial expression recognition techniques. 73-103 - Zhexin Zhang, Jiajun Ding, Jun Yu, Yiming Yuan, Jianping Fan:
Import vertical characteristic of rain streak for single image deraining. 105-115 - Kunhong Wu, Liang Li, Yahong Han:
Weighted progressive alignment for multi-source domain adaptation. 117-128 - Zhongyue Chen, Jiangqi Chen, Guangliu Ding, He Huang:
A lightweight CNN-based algorithm and implementation on embedded system for real-time face recognition. 129-138 - Zehao Lin, Jiahui She, Qiu Shen:
Real emotion seeker: recalibrating annotation for facial expression recognition. 139-151 - Letian Wang, Quan Zhou, Yuling Ma, Jie Guo, Xiushan Nie, Yilong Yin:
Deep regional detail-aware hashing. 153-166 - Shradha Dubey, Manish Dixit:
A comprehensive survey on human pose estimation approaches. 167-195 - Shengjie Liu, Ning He, Cheng Wang, Haigang Yu, Wenjing Han:
Lightweight human pose estimation algorithm based on polarized self-attention. 197-210 - Ye Li, Kangning Yin, Jie Liang, Zhuofu Tan, Xinzhong Wang, Guangqiang Yin, Zhiguo Wang:
A multitask joint framework for real-time person search. 211-222 - Weidong Zhu, Jun Sun, Simin Wang, Kaifeng Yang, Jifeng Shen, Xin Zhou:
Segmentation and recognition of filed sweet pepper based on improved self-attention convolutional neural networks. 223-234 - Pengyi Hao, Yali Li, Cong Bai:
Meta-relationship for course recommendation in MOOCs. 235-246 - Dicong Wang, Qinghua Hu, Kaijun Wu:
Dual-branch network with memory for video anomaly detection. 247-259 - Zhiling Cai, Ruijia Li, Hong Wu:
Learning unified anchor graph based on affinity relationships with strong consensus for multi-view spectral clustering. 261-273 - Lu Zhao, Liming Yuan, Kun Hao, Xianbin Wen:
Generalized attention-based deep multi-instance learning. 275-287 - Xiang Gao, Lijuan Xu, Fan Wang, Xiaopeng Hu:
Multi-branch aware module with channel shuffle pixel-wise attention for lightweight image super-resolution. 289-303 - Xikun Liang, Limin Tao, Bin Hu:
Image bit planes approximate reconstruction and encryption based on Gaussian function and multiple parameters chaos. 305-321 - Hanguang Xiao, Yuewei Li, Yu Xiu, Qingling Xia:
Development of outdoor swimmers detection system with small object detection method based on deep learning. 323-332 - Wen Guo, Dong Li, Bowen Liang, Bin Shan:
Multi-view region proposal network predictive learning for tracking. 333-346 - Hufei Wang, Kaiqiang Zhao, Dexin Zhao:
A triple fusion model for cross-modal deep hashing retrieval. 347-359 - Nesrine Tarhouni, Masmoudi Salma, Maha Charfeddine, Chokri Ben Amar:
Fake COVID-19 videos detector based on frames and audio watermarking. 361-375 - Wanjun Liu, Junkai Wang, Haicheng Qu, Lei Shen:
Hierarchical MVSNet with cost volume separation and fusion based on U-shape feature extraction. 377-387 - Qiming Yan, Yubao Sun, Shaojing Fan, Liling Zhao:
Polarity-aware attention network for image sentiment analysis. 389-399 - Srishti Yadav, Shahram Payandeh:
DATaR: Depth Augmented Target Redetection using Kernelized Correlation Filter. 401-420 - Sahar Dammak, Hazar Mliki, Emna Fendri:
Gender estimation based on deep learned and handcrafted features in an uncontrolled environment. 421-433 - Yang Yang, Yiwen Xiong, Yanqing Cao, Lanling Zeng, Yan Zhao, Yongzhao Zhan:
Fast bilateral filter with spatial subsampling. 435-446 - Honggui Li, Dimitri Galayko:
Correction to: Deep reconstruction of 1D ISOMAP representations. 449 - Muhammad Pervez Akhter, Jiangbin Zheng, Irfan Raza Naqvi, Mohammed Abdelmajeed, Tehseen Zia:
Correction to: Abusive language detection from social media comments using conventional machine learning and deep learning approaches. 451 - Alok Singh, Thoudam Doren Singh, Sivaji Bandyopadhyay:
Correction to: Attention based video captioning framework for Hindi. 453 - Hanyun Zhang, Dongliang Guo, Wei Liu, Junlan Nie, Shuo Li:
Correction to: An improved algorithm of video quality assessment by danmaku analysis. 455
Volume 29, Number 2, April 2023
- Zhenguang Liu, Roger Zimmermann, Li Cheng:
Special issue on human-centric intelligent multimedia understanding. 457-458 - Xiena Dong, Jun Yu, Jian Zhang:
Position constrained network for 3D human pose estimation. 459-468 - Xiaofeng Qu, Li Liu, Lei Zhu, Huaxiang Zhang:
Attribute-aware style adaptation for person re-identification. 469-485 - Aihua Zhou, Yujun Ma, Wanting Ji, Ming Zong, Pei Yang, Min Wu, Mingzhe Liu:
Multi-head attention-based two-stream EfficientNet for action recognition. 487-498 - Liqiang Peng, Qiang Li, Fei Wang:
Context-aware and ethics-first crowd mobility portraits over massive smart card data. 499-510 - Yulin Wu, Chang Liu, Lei Chen, Dong Zhao, Qinghe Zheng, Hongchao Zhou:
Perturbation consistency and mutual information regularization for semi-supervised semantic segmentation. 511-523 - Haipeng Chen, Yunjie Liu, Zenan Shi:
FPF-Net: feature propagation and fusion based on attention mechanism for pancreas segmentation. 525-538 - Fan Liu, Junfeng Wang, Delong Chen, Chunmei Shen, Feng Xu:
Asymmetric exponential loss function for crack segmentation. 539-552 - Tao Liu, Mingjun Li, Haibin Zheng, Zhaoyan Ming, Jinyin Chen:
Evil vs evil: using adversarial examples to against backdoor attack in federated learning. 553-568 - Chumeng Zhang, Yue Yang, Junbo Guo, Guoqing Jin, Dan Song, Anan Liu:
Improving text-image cross-modal retrieval with contrastive loss. 569-575 - An-An Liu, Xiaowen Wang, Ning Xu, Jing Liu, Yuting Su, Quan Zhang, Shenyuan Zhang, Yejun Tang, Junbo Guo, Guoqing Jin, Xuanya Li:
SMPC: boosting social media popularity prediction with caption. 577-586 - Xiao Li, Shexiang Ma, Liqing Shan, Xiao Li:
Multi-window Transformer parallel fusion feature pyramid network for pedestrian orientation detection. 587-603 - Yifan Jiao, Sisi You:
Rescue decision via Earthquake Disaster Knowledge Graph reasoning. 605-614 - Xiaoyan Tian, Ye Jin, Xianglong Tang:
Local-Global Transformer Neural Network for temporal action segmentation. 615-626 - Zupeng Ai, Chengwei Peng, Jun Jiang, Zekun Li, Bing Li:
Face swapping detection based on identity spatial constraints with weighted frequency division. 627-640 - Zhong Qu, Lili Wang:
Gating attention convolutional networks with dense connection for pixel-level crack detection. 641-652 - Yutong Shi, Xiujuan Wang, Kangfeng Zheng, Siwei Cao:
User authentication method based on keystroke dynamics and mouse dynamics using HDA. 653-668 - Mohammad Javad Parseh, Mohammad Rahmanimanesh, Parviz Keshavarzi, Zohreh Azimifar:
Semantic embedding: scene image classification using scene-specific objects. 669-691 - Ping Feng, Zhenjun Tang:
A survey of visual neural networks: current trends, challenges and opportunities. 693-724 - Lei Li, Fan Tang, Juan Cao, Xirong Li, Danding Wang:
Bias oriented unbiased data augmentation for cross-bias representation learning. 725-738 - Sree Ganesh T. N, Rishi Satish, Rajeswari Sridhar:
Learning effective embedding for automated COVID-19 prediction from chest X-ray images. 739-751 - Jianhui He, Chunlong Hu, Lijuan Wang:
Facial age estimation based on asymmetrical label distribution. 753-762 - Jin Che, Yuxia Zhang, Qi Yang, Yuting He:
Research on person re-identification based on posture guidance and feature alignment. 763-770 - Rudrika Kalsotra, Sakshi Arora:
Performance analysis of U-Net with hybrid loss for foreground detection. 771-786 - Gan Hu, Yanli Ji, Xingzhu Liang, Yuexing Han:
Layer-fusion for online mutual knowledge distillation. 787-796 - Xuyang Lu, Yang Gao:
Guide and interact: scene-graph based generation and control of video captions. 797-809 - Zhenhua Tang, Jiemei Yao, Qian Zhang, Yuanting Luo:
Multi-operator image retargeting with visual quality preservation of salient regions. 811-829 - Wenying Wen, Yunpeng Jian, Yuming Fang, Yushu Zhang, Baolin Qiu:
Authenticable medical image-sharing scheme based on embedded small shadow QR code and blockchain framework. 831-845 - Luis Rei, Dunja Mladenic, Mareike Dorozynski, Franz Rottensteiner, Thomas Schleider, Raphaël Troncy, Jorge Sebastián Lozano, Mar Gaitán Salvatella:
Multimodal metadata assignment for cultural heritage artifacts. 847-869 - Cheng-Jian Qiu, Yuqing Song, Zhe Liu, Jing Yin, Kai Han, Yi Liu:
CMFCUNet: cascaded multi-scale feature calibration UNet for pancreas segmentation. 871-886
Volume 29, Number 3, June 2023
- Shanqing Zhang, Yujie Chen, Yiheng Meng, Jianfeng Lu, Li Li, Rui Bai:
A multi-level feature weight fusion model for salient object detection. 887-895 - Sara Akan, Songül Varli:
Use of deep learning in soccer videos analysis: survey. 897-915 - Anjali Gautam:
Recent advancements of deep learning in detecting breast cancer: a survey. 917-943 - Linfeng Liu, Tong Chen, Haojie Liu, Shiliang Pu, Li Wang, Qiu Shen:
2C-Net: integrate image compression and classification via deep neural network. 945-959 - M. Kavitha:
MDP-HML: an efficient detection method for multiple human disease using retinal fundus images based on hybrid learning techniques. 961-979 - Shuying Zhang, Jing Zhang, Yizhou Wang, Li Zhuo:
Short video fingerprint extraction: from audio-visual fingerprint fusion to multi-index hashing. 981-1000 - Qingtian Zeng, Liangwei Niu, Shansong Wang, Weijian Ni:
SEViT: a large-scale and fine-grained plant disease classification model based on transformer and attention convolution. 1001-1010 - Yanxue Wang, Shansong Wang, Weijian Ni, Qingtian Zeng:
PAST-net: a swin transformer and path aggregation model for anthracnose instance segmentation. 1011-1023 - Deepak Dhillon, Rajlaxmi Chouhan:
Edge-preserving image denoising using noise-enhanced patch-based non-local means. 1025-1041 - Jingdan Li, Yi Wang, Dexin Zhao:
Layer-wise enhanced transformer with multi-modal fusion for image caption. 1043-1056 - Hao Sun, Xiaolin Qin, Xiaojing Liu:
Image-text matching using multi-subspace joint representation. 1057-1071 - Wenying Wen, Rongxin Tu, Yushu Zhang, Yuming Fang, Yong Yang:
A multi-level approach with visual information for encrypted H.265/HEVC videos. 1073-1087 - Heling Cao, Lei Li, Yonghe Chu, Miaolei Deng, Panpan Wang, Chenyang Zhao:
A coincidental correctness test case identification framework with fuzzy C-means clustering. 1089-1101 - Jasvinder Pal Singh, Uday Pratap Singh, Sanjeev Jain:
Model-based person identification in multi-gait scenario using hybrid classifier. 1103-1116 - Thae Song Kim, Su Hyon Kim:
An improved contrast enhancement for dark images with non-uniform illumination based on edge preservation. 1117-1130 - Fuming Sun, Tingting Zhao, Bing Zhu, Xu Jia, Fasheng Wang:
Deblurring transformer tracking with conditional cross-attention. 1131-1144 - Xingjian Gu, Yongjie Zhu, Shougang Ren, Xiangbo Shu:
BCMask: a finer leaf instance segmentation with bilayer convolution mask. 1145-1159 - Chenquan Gan, Xiaopeng Cao, Qingyi Zhu:
Microblog sentiment analysis via user representative relationship under multi-interaction hybrid neural networks. 1161-1172 - Neetu Singla, Sushama Nagpal, Jyotsna Singh:
A two-stage forgery detection and localization framework based on feature classification and similarity metric. 1173-1185 - Dong Xie, Bin Wu, Fulong Chen, Taochun Wang, Zebang Hu, Yibo Zhang:
A low-overhead compressed sensing-driven multi-party secret image sharing scheme. 1187-1202 - Anusha Chhabra, Dinesh Kumar Vishwakarma:
A literature survey on multimodal and multilingual automatic hate speech identification. 1203-1230 - Youyu Liu, Yi Li, Dezhang Xu, Qingyan Yang, Wanbao Tao:
Adaptive Kalman Filter with power transformation for online multi-object tracking. 1231-1244 - Min-Jen Tsai, Hung-Yu Wu, Di-Ting Lin:
Auto ROI & mask R-CNN model for QR code beautification (ARM-QR). 1245-1276 - Adithya Sineesh, Mahesh Raveendranatha Panicker:
Edge preserved universal pooling: novel strategies for pooling in convolutional neural networks. 1277-1290 - Fangmei Chen, Yuying Wang, Sheng Xu, Fasheng Wang, Fuming Sun, Xu Jia:
Style transfer network for complex multi-stroke text. 1291-1300 - Xin Chao, Zhenjie Hou, Yujian Mo, Haiyong Shi, Wenjing Yao:
Structural feature representation and fusion of human spatial cooperative motion for action recognition. 1301-1314 - Joel Dickson, Arul Linsely, R. J. Alice Nineta:
An integrated 3D-sparse deep belief network with enriched seagull optimization algorithm for liver segmentation. 1315-1334 - Sicheng Zhang, Jin Liu, Bo Hu, Zhendong Mao:
GH-DDM: the generalized hybrid denoising diffusion model for medical image generation. 1335-1345 - Huanjie Tao, Minghao Lu, Zhenwu Hu, Jianfeng An:
A gated multi-hierarchical feature fusion network for recognizing steel plate surface defects. 1347-1360 - Aashania Antil, Chhavi Dhiman:
A two stream face anti-spoofing framework using multi-level deep features and ELBP features. 1361-1376 - N. Venugopal:
SCMACDnet: multilevel fusion-based deep twin capsule network for change detection. 1377-1389 - Jiajun Ding, Beili Liu, Jun Yu, Huanlei Guo, Ming Shen, Kenong Shen:
An efficient multi-path structure with staged connection and multi-scale mechanism for text-to-image synthesis. 1391-1403 - Wei Li, Xiwei Yang, Zhixin Li:
MLCB-Net: a multi-level class balancing network for domain adaptive semantic segmentation. 1405-1416 - Yuzhe He, Ning He, Haigang Yu, Ren Zhang, Kang Yan:
From macro to micro: rethinking multi-scale pedestrian detection. 1417-1429 - Ehsan Jafari, Ardeshir Dolati, Kamran Layeghi:
Object tracking using fuzzy-based improved graph, interesting patches and multi-label MRF optimization. 1431-1451 - ZhenFeng Zhang, ChuHua Huang, RenJing Huang, YaNan Li, YiFan Chen:
Illu-NASNet: unsupervised illumination estimation based on dense spatio-temporal smoothness. 1453-1462 - Shisong Huang, Danyang Li, Zhuhong Zhang, Yating Wu, Yumei Tang, Xing Chen, Yiqing Wu:
CSLSEP: an ensemble pruning algorithm based on clustering soft label and sorting for facial expression recognition. 1463-1479 - Pengqing Li, Hongjuan Zhang, Yansong Chen:
Structural local sparse and low-rank tracker using deep features. 1481-1498 - Lei Li, Tingting Liu, Chengyu Wang, Minghui Qiu, Cen Chen, Ming Gao, Aoying Zhou:
Resizing codebook of vector quantization without retraining. 1499-1512 - Seyma Derdiyok, Fatma Patlar Akbulut:
Biosignal based emotion-oriented video summarization. 1513-1526 - Deepika Sharma, Arvind Selwal:
A survey on face presentation attack detection mechanisms: hitherto and future perspectives. 1527-1577 - Leyuan Liu, Yunqi Gao, Jianchi Sun, Jingying Chen:
Single-image clothed 3D human reconstruction guided by a well-aligned parametric body model. 1579-1592 - Xin Shu, Jia Li, Liang Shi, Shucheng Huang:
RES-CapsNet: an improved capsule network for micro-expression recognition. 1593-1601 - Ercan Gürsoy, Yasin Kaya:
An overview of deep learning techniques for COVID-19 detection: methods, challenges, and future works. 1603-1627 - A. Mary Dayana, W. R. Sam Emmanuel, C. Harriet Linda:
Feature fusion and optimization integrated refined deep residual network for diabetic retinopathy severity classification using fundus image. 1629-1650 - Birkan Buyukarikan, Erkan Ülker:
Convolutional neural network-based apple images classification and image quality measurement by light colors using the color-balancing approach. 1651-1661 - Nawab Muhammad Faseeh Qureshi, Varun G. Menon, Ali Kashif Bashir, Shahid Mumtaz, Irfan Mehmood:
Role of deep learning models and analytics in industrial multimedia environment. 1663-1664 - Tiago do Carmo Nogueira, Cássio Dener Noronha Vinhal, Gélson da Cruz Júnior, Matheus Rudolfo Diedrich Ullmann, Thyago Carvalho Marques:
A reference-based model using deep learning for image captioning. 1665-1681 - Ahmed Barnawi, Prateek Chhikara, Rajkumar Tekchandani, Neeraj Kumar, Mehrez Boulares:
A CNN-based scheme for COVID-19 detection with emergency services provisions using an optimal path planning. 1683-1697 - Faria Nazir, Muhammad Nadeem Majeed, Mustansar Ali Ghazanfar, Muazzam Maqsood:
A computer-aided speech analytics approach for pronunciation feedback using deep feature clustering. 1699-1715 - Linbo Wang, Li Tan, Xianyong Fang, Yanwen Guo, Shaohua Wan:
Adaptively feature matching via joint transformational-spatial clustering. 1717-1727 - Loveleen Gaur, Ujwal Bhatia, N. Z. Jhanjhi, Ghulam Muhammad, Mehedi Masud:
Medical image-based detection of COVID-19 using Deep Convolution Neural Networks. 1729-1738 - Asma Kausar, Imran Razzak, Mohd Ibrahim Shapiai, Amin Beheshti:
3D shallow deep neural network for fast and precise segmentation of left atrium. 1739-1749 - Jimmy Ming-Tai Wu, Zhongcui Li, Norbert Herencsar, Bay Vo, Jerry Chun-Wei Lin:
A graph-based CNN-LSTM stock price prediction algorithm with leading indicators. 1751-1770 - Gengsheng Xie, Xianbin Wen, Liming Yuan, Jianchen Wang, Changlun Guo, Yansong Jia, Minghao Li:
Pose-guided feature region-based fusion network for occluded person re-identification. 1771-1783 - Sumit Pundir, Mohammad S. Obaidat, Mohammad Wazid, Ashok Kumar Das, Devesh Pratap Singh, Joel J. P. C. Rodrigues:
MADP-IIME: malware attack detection protocol in IoT-enabled industrial multimedia environment using machine learning approach. 1785-1797 - Akshi Kumar:
Leveraging crowd knowledge to curate documentation for agile software industry using deep learning and expert ranking. 1799-1813 - Ranran Lou, Zhihan Lv, Shuping Dang, Tianyun Su, Xinfang Li:
Application of machine learning in ocean data. 1815-1824 - Mohib Ullah Khan, Abdul Rehman Javed, Mansoor Ihsan, Usman Tariq:
A novel category detection of social media reviews in the restaurant industry. 1825-1838 - Celestine Iwendi, Gautam Srivastava, Suleman Khan, Praveen Kumar Reddy Maddikunta:
Cyberbullying detection solutions based on deep learning architectures. 1839-1852
Volume 29, Number 4, August 2023
- Saifullah Tumrani, Wazir Ali, Rajesh Kumar, Abdullah Aman Khan, Fayaz Ali Dharejo:
View-aware attribute-guided network for vehicle re-identification. 1853-1863 - Palash Ray, Asish Bera, Debasis Giri, Debotosh Bhattacharjee:
Style matching CAPTCHA: match neural transferred styles to thwart intelligent attacks. 1865-1895 - He Zhang, Lu Yin, Hanling Zhang:
A review of micro-expression spotting: methods and challenges. 1897-1915 - Carlos Vilchis, Carmina Pérez-Guerrero, Mauricio Mendez-Ruiz, Miguel González-Mendoza:
A survey on the pipeline evolution of facial capture and tracking for digital humans. 1917-1940 - Kai Hu, Junlan Jin, Chaowen Shen, Min Xia, Liguo Weng:
Attentional weighting strategy-based dynamic GCN for skeleton-based action recognition. 1941-1954 - Anqi Zheng, Shiqi Zheng, Cong Bai, Deng Chen:
Triple-level relationship enhanced transformer for image captioning. 1955-1966 - Gang Wang, Shucheng Huang, Zhe Tao:
Shallow multi-branch attention convolutional neural network for micro-expression recognition. 1967-1980 - Lei Yang, Yong Feng, Mingliang Zhou, Xiancai Xiong, Yongheng Wang, Baohua Qiang:
Multi-level network based on transformer encoder for fine-grained image-text matching. 1981-1994 - An-An Liu, Yuwei Zhang, Chenyu Zhang, Wenhui Li, Bo Lv, Lei Lei, Xuanya Li:
Prototype-based semantic consistency learning for unsupervised 2D image-based 3D shape retrieval. 1995-2007 - B. Bhaskar Reddy, M. Venkata Sudhakar, P. Rahul Reddy, P. Raghava Reddy:
Ensemble deep honey architecture for COVID-19 prediction using CT scan and chest X-ray images. 2009-2035 - Yizhong Yang, Ce Hou, Haixia Huang, Zhang Zhang, Guangjun Xie:
Cascaded deep residual learning network for single image dehazing. 2037-2048 - Elena Battini Sönmez, Sefer Memis, Berker Arslan, Okan Zafer Batur:
The segmented UEC Food-100 dataset with benchmark experiment on food detection. 2049-2057 - Furong Ma, Guiyu Xia, Qingshan Liu:
Human pose transfer via shape-aware partial flow prediction network. 2059-2072 - Xin Xu, Gang Lv, Yining Sun, Yuxia Hu, Fudong Nian:
Hierarchical cross-modal contextual attention network for visual grounding. 2073-2083 - Honghong Yang, Hongxi Liu, Yumei Zhang, Xiaojun Wu:
HSGNet: hierarchically stacked graph network with attention mechanism for 3D human pose estimation. 2085-2097 - Awais Ahmed, She Kun, Junaid Ahmed, Shaukat Hayat, Abdullah Aman Khan:
Multimodal image enhancement using convolutional sparse coding. 2099-2110 - Tarun Agrawal, Prakash Choudhary:
COVID-SegNet: encoder-decoder-based architecture for COVID-19 lesion segmentation in chest X-ray. 2111-2124 - Kangkang Wei, Weiqi Luo, Minglin Liu, Miaoxin Ye:
Residual guided coordinate attention for selection channel aware image steganalysis. 2125-2135 - Jian Shi, Geng Sun, Jinyu Zhang, Zhihui Wang, Haojie Li:
Face attribute recognition via end-to-end weakly supervised regional location. 2137-2152 - Mengting Liu, Xinrui Li, Yongge Liu, Yahong Han:
Weakly supervised anomaly detection with multi-level contextual modeling. 2153-2164 - Hafsa Ilyas, Ali Javed, Khalid Mahmood Malik, Aun Irtaza:
E-Cap Net: an efficient-capsule network for shallow and deepfakes forgery detection. 2165-2180 - Yingyuan Zhao, Zhiyi Tan, Bing-Kun Bao, Zhengzheng Tu:
Centralized sub-critic based hierarchical-structured reinforcement learning for temporal sentence grounding. 2181-2191 - Zepeng Li, Wenchuan Cheng, Jiawei Zhou, Zhengyi An, Bin Hu:
Deep learning model with multi-feature fusion and label association for suicide detection. 2193-2203 - Jing Sun, Rui Yan, Bing Zhang, Bing Zhu, Fuming Sun:
A cross-view geo-localization method guided by relation-aware global attention. 2205-2216 - Mei-Ting Su, Mei-Ling Chiang, Chia-Hsuan Tsai, Chi-Wei Lin, Rong-Xuan Liu, Yong-Ting Juang, Hsin-Hao Chen:
An acupoint health care system with real-time acupoint localization and visualization in augmented reality. 2217-2238 - Tobias Mühling, Isabelle Späth, Joy Backhaus, Nathalie Milke, Sebastian Oberdörfer, Alexander Meining, Marc Erich Latoschik, Sarah Koenig:
Virtual reality in medical emergencies training: benefits, perceived stress, and learning success. 2239-2252 - Shulin Cheng, Huimin Jiang, Wanyan Wang, Wei Jiang:
Research on multi-context aware recommendation methods based on tensor factorization. 2253-2262 - Yulin Deng, Liju Yin, Xiaoning Gao, Hui Zhou, Zhenzhou Wang, Guofeng Zou:
EA-EDNet: encapsulated attention encoder-decoder network for 3D reconstruction in low-light-level environment. 2263-2279 - Fangzheng Xu, Yu Bao, Bingye Li, Zhining Hou, Lekang Wang:
Entropy minimization and domain adversarial training guided by label distribution similarity for domain adaptation. 2281-2292 - Khouloud Salameh, Farah El Akoum, Joe Tekli:
Unsupervised knowledge representation of panoramic dental X-ray images using SVG image-and-object clustering. 2293-2322 - Dailiang Wei, Juanli Li, Bo Li, Xin Wang, Siyuan Chen, Xuewen Wang, Luyao Wang:
A fast recognition method for coal gangue image processing. 2323-2335 - Suchi Jain, Geeta Sikka, Renu Dhir:
An automatic cascaded approach for pancreas segmentation via an unsupervised localization using 3D CT volumes. 2337-2349 - Changshui Yang, Yan Liu, Qiang Liu, Riaz Ullah Khan, Bin Chen, Wenyong Wang:
Dual semantic-aligned clustering for cross-domain person re-identification. 2351-2362 - Bolin Wang, Yuanyuan Sun, Yonghe Chu, Changrong Min, Zhihao Yang, Hongfei Lin:
Local discriminative graph convolutional networks for text classification. 2363-2373 - Israr Ur Rehman, Muhammad Shehzad Hanif, Zulfiqar Ali, Zahoor Jan, Cobbinah Bernard Mawuli, Waqar Ali:
Empowering neural collaborative filtering with contextual features for multimedia recommendation. 2375-2388 - Zekun Yang, Yuta Nakashima, Haruo Takemura:
Multi-modal humor segment prediction in video. 2389-2398 - Gaoming Yang, Anxing Wei, Xianjin Fang, Ji Zhang:
FDS_2D: rethinking magnitude-phase features for DeepFake detection. 2399-2413 - Hong Lin, Xi Wang, Chun Liu, Dewei Peng:
HRCutBlur Augment: effectively enhancing data diversity for image super-resolution. 2415-2427 - Hongbo Xing, Guanqun Zhou, Shusen Yuan, Youjun Jiang, Pinyong Geng, Yewen Cao, Yujun Li, Lei Chen:
Micro-expression spotting network based on attention and one-dimensional convolutional sliding window. 2429-2437 - Hitesh D. Panchal, Hitesh B. Shah:
Multiple forgery detection in digital video based on inconsistency in video quality assessment attributes. 2439-2454
Volume 29, Number 5, October 2023
- Ajay Sharma, Bhavana P. Shrivastava, Aayushi Priya:
Multilevel progressive recursive dilated networks with correlation filter (MPRDNCF) for image super-resolution. 2455-2467 - Maosheng Zhong, Youde Chen, Hao Zhang, Hao Xiong, Zhixiang Wang:
Multimodal-enhanced hierarchical attention network for video captioning. 2469-2482 - Yongzhen Ke, Yin Wang, Kai Wang, Fan Qin, Jing Guo, Shuai Yang:
Image aesthetics assessment using composite features from transformer and CNN. 2483-2494 - Susmi Jacob, P. Vinod, Arjun Subramanian, Varun G. Menon:
Affect sensing from smartphones through touch and motion contexts. 2495-2509 - Yuqiang Li, Xinyi Shangguan, Chun Liu, Haochen Meng:
I2I translation model based on CondConv and spectral domain realness measurement: BCS-StarGAN. 2511-2526 - Chuan Liu, Ying-Ying Tan, Tian-Tian Xia, Jiajing Zhang, Ming Zhu:
Co-attention graph convolutional network for visual question answering. 2527-2543 - Zhenying Fang, Jianping Fan, Jun Yu:
LPR: learning point-level temporal action localization through re-training. 2545-2562 - Aiping Yang, Yan Liu, Simeng Cheng, Jiale Cao, Zhong Ji, Yanwei Pang:
Spatial attention-guided deformable fusion network for salient object detection. 2563-2573 - Xin Yang, Xiangchen Wang, Xiaohui Ye, Tao Li:
VMSG: a video caption network based on multimodal semantic grouping and semantic attention. 2575-2589 - Weihao Gao, Yongjun Zhang, Wei Long, Zhongwei Cui:
A deraining with detail-recovery network via context aggregation. 2591-2601 - Asha Rani, Pankaj Yadav, Yashaswi Verma:
Early-stage autism diagnosis using action videos and contrastive feature learning. 2603-2614 - Yunfei Zheng, Meng Sun, Xiaobing Wang, Tieyong Cao, Xiongwei Zhang, Lixing Xing, Zheng Fang:
Self-distillation object segmentation via pyramid knowledge representation and transfer. 2615-2631 - Jian-Wei Zhang, Yifan Sun, Wei Chen:
Pull and concentrate: improving unsupervised semantic segmentation adaptation with cross- and intra-domain consistencies. 2633-2650 - Longfeng Shen, Fenglan Qin, Hongying Zhu, Dengdi Sun, Hai Min:
EGARNet: adjacent residual lightweight super-resolution network based on extended group-enhanced convolution. 2651-2668 - Mahsa Soleimani, Ali Nazari, Mohsen Ebrahimi Moghaddam:
Deepfake detection of occluded images using a patch-based approach. 2669-2687 - Chaithanyadas K. V., G. R. Gnana King:
Computer-aided diagnosis for early detection and staging of human pancreatic tumors using an optimized 3D CNN on computed tomography. 2689-2703 - Xiuxia Cai, Pin Zhang, Shuaibin Du:
Imitation camouflage synthesis based on shallow neural network. 2705-2714 - Yan Li, Min Xia, Dongmei Jiang:
Cross-view adaptive graph attention network for dynamic facial expression recognition. 2715-2728 - Hongwei Zhao, Siquan Wu, Zhen Tian, Yidong Li, Yi Jin, Shengchun Wang:
Context-guided coarse-to-fine detection model for bird nest detection on high-speed railway catenary. 2729-2746 - Weiyi Wei, Jian Wang, Mengyu Xu, Futong Zhang:
Multimodal heterogeneous graph convolutional network for image recommendation. 2747-2760 - Jiachang Li, Haitao Zhang, Huadong Ma:
DRL-based transmission control for QoE guaranteed transmission efficiency optimization in tile-based panoramic video streaming. 2761-2777 - Si Chen, Bolun Xu, Miaohui Zhang, Yan Yan, Xia Du, Weiwei Zhuang, Yun Wu:
HC-GCN: hierarchical contrastive graph convolutional network for unsupervised domain adaptation on person re-identification. 2779-2790 - Zhangyu Liu, Zhi Li, Guomei Wang, Youliang Tian, Long Zheng:
Robust zero-watermarking algorithm for diffusion-weighted images based on multiscale feature fusion. 2791-2807 - Xianhua Duan, Chaoqiang Jin, Xin Shu:
HCPSNet: heterogeneous cross-pseudo-supervision network with confidence evaluation for semi-supervised medical image segmentation. 2809-2823 - Guangtao Wang, Jun Li, Zhijian Wu, Jianhua Xu, Jifeng Shen, Wankou Yang:
EfficientFace: an efficient deep network with feature enhancement for accurate face detection. 2825-2839 - Editorial note for few-shot learning for intelligent multimedia systems. 2841
- Xuewei Chao, Lixin Zhang:
Few-shot imbalanced classification based on data augmentation. 2843-2851 - Shan Liu, Yichao Tang, Ying Tian, Hansong Su:
Visual driving assistance system based on few-shot learning. 2853-2863 - Yue Yang, Zhuo Zhang, Wei Mao, Yang Li, Chengang Lv:
Radar target recognition based on few-shot learning. 2865-2875 - You Zhou, Changlin Chen, Shukun Ma:
Few-shot ship classification based on metric learning. 2877-2886 - Changlin Chen, Xuewei Chao:
Conversion of infrared ocean target images to visible images driven by energy information. 2887-2898 - Rajdeep Chatterjee, Ankita Chatterjee, SK Hafizul Islam, Muhammad Khurram Khan:
An object detection-based few-shot learning approach for multimedia quality assessment. 2899-2912 - Xiaolei Li:
Few-shot wind turbine blade damage early warning system based on sound signal fusion. 2913-2922 - Wei Ren, Li Zhou, Jie Chen:
Unsupervised single image dehazing with generative adversarial network. 2923-2933 - Abdelkader Tayeb Herouala, Benameur Ziani, Chaker Abdelaziz Kerrache, Abdou El Karim Tahari, Nasreddine Lagraa, Spyridon Mastorakis:
CaDaCa: a new caching strategy in NDN using data categorization. 2935-2950 - M. Poongodi, Mounir Hamdi, Huihui Wang:
Image and audio caps: automated captioning of background sounds and images using deep learning. 2951-2959 - Neha Sharma, Chinmay Chakraborty, Rajeev Kumar:
Optimized multimedia data through computationally intelligent algorithms. 2961-2977 - Jiandong Lv, Xingang Wang, Cuiling Shao:
TMIF: transformer-based multi-modal interactive fusion for automatic rumor detection. 2979-2989 - Wei Chen, Jing Nie:
A MADDPG-based multi-agent antagonistic algorithm for sea battlefield confrontation. 2991-3000 - Zhengjian Li, Jingyi He, Tianlei Ni, Jiaming Huo:
Numerical computation based few-shot learning for intelligent sea surface temperature prediction. 3001-3013 - Editorial note for trustworthy multimedia big data computing. 3015
- Zijie Song, Zhenzhen Hu, Richang Hong:
Efficient and self-adaptive rationale knowledge base for visual commonsense reasoning. 3017-3026 - Wenzhe Zhai, Qilei Li, Ying Zhou, Xuesong Li, Jinfeng Pan, Guofeng Zou, Mingliang Gao:
$\hbox {DA}^2$Net: a dual attention-aware network for robust crowd counting. 3027-3040 - Na Ta, Haipeng Chen, Yingda Lyu, Taosuo Wu:
BLE-Net: boundary learning and enhancement network for polyp segmentation. 3041-3054 - Dengyun Xu, Xuanjing Shen, Yongping Huang, Zenan Shi:
RB-Net: integrating region and boundary features for image manipulation localization. 3055-3067 - Chunxiao Fan, Zhenxing Wang, Jia Li, Shanshan Wang, Xiao Sun:
Robust facial expression recognition with global-local joint representation learning. 3069-3079 - Jing Ge, Qianxiang Wang, Guangyu Gao:
Hardest and semi-hard negative pairs mining for text-based person search with visual-textual attention. 3081-3093 - Yi Wang, Shixin Zheng, Xiao Sun, Dan Guo, Junjie Lang:
Micro-expression recognition with attention mechanism and region enhancement. 3095-3103 - Wenyi Hu, Xiao Wang, Zheng Wang, Xin Xu, Ruimin Hu:
Dual-focus: person search from Coarse-Grained Focus to Fine-Grained Focus. 3105-3114 - Haoming Chen, Runyang Feng, Sifan Wu, Hao Xu, Fengcheng Zhou, Zhenguang Liu:
2D Human pose estimation: a survey. 3115-3138 - Jian Wang, Xiaoyu Du, Yu Cheng, Yunlian Sun, Jinhui Tang:
SI-Net: spatial interaction network for deepfake detection. 3139-3150
Volume 29, Number 6, December 2023
- Chhavi Dixit, Shashank Mouli Satapathy:
A customizable framework for multimodal emotion recognition using ensemble of deep neural network models. 3151-3168 - Ce Zhang, Xiao Yao, Changfeng Shi, Min Gu:
Kronecker-factored Approximate Curvature with adaptive learning rate for optimizing model-agnostic meta-learning. 3169-3177 - Humaira Shafiq, Ghulam Gilanie, Muhammad Sajid, Muhammad Ahsan:
Dental radiology: a convolutional neural network-based approach to detect dental disorders from dental images in a real-time environment. 3179-3191 - Yuhan Huang, Jiacheng Lu, Nianzhe Chen, Hui Ding, Yuanyuan Shang:
A deep learning image inpainting method based on stationary wavelet transform. 3193-3207 - Chuanwang Wen, Shucheng Huang:
A LiDAR point cloud registration method combining linear feature extraction and TrICP algorithm. 3209-3221 - Baoying Zheng, Fang Liu, Mohan Zhang, Tongqing Zhou, Shenglan Cui, Yunfan Ye, Yeting Guo:
Image captioning for cultural artworks: a case study on ceramics. 3223-3243 - Huimin Qian, Wenyu Shen, Zhengqi Wang, Shuwei Xu:
Hotspot defect detection for photovoltaic modules under complex backgrounds. 3245-3258 - Liyan Xiong, Zhida Li, Xiaohui Huang, Yijuan Zeng, Peng Huang:
TFA-CNN: an efficient method for dealing with crowding and noise problems in crowd counting. 3259-3276 - Xiaohui Guan, Qiqi Shao, Yaguan Qian, Tengteng Yao, Bin Wang:
Adversarial training in logit space against tiny perturbations. 3277-3290 - Zekang Wang, Li Liu, Huaxiang Zhang, Dongmei Liu, Yu Song:
Generative adversarial text-to-image generation with style image constraint. 3291-3303 - Mamta Gehlot, Rakesh Kumar Saxena, Geeta Chhabra Gandhi:
"Tomato-Village": a dataset for end-to-end tomato disease detection in a real-world environment. 3305-3328 - Xin Wang, Ning He, Chen Hong, Fengxi Sun, Wenjing Han, Qi Wang:
YOLO-ERF: lightweight object detector for UAV aerial images. 3329-3339 - Yongwei Gai, Jinglei Liu:
Clustering by sparse orthogonal NMF and interpretable neural network. 3341-3356 - Mingju Shao, Guodong Wang:
Class-agnostic counting with feature augmentation and similarity comparison. 3357-3367 - Ugur Berk Sahin, Fatih Kamisli:
Image compression with learned lifting-based DWT and learned tree-based entropy models. 3369-3384 - Amal Bouatrous, Abdelkrim Meziane, Nadia Zenati, Chafiaâ Hamitouche:
A new adaptive VR-based exergame for hand rehabilitation after stroke. 3385-3402 - V. Praveena, L. R. Sujithra, S. Karthik, Muthu Subash Kavitha:
Bio-Inspired ensemble feature selection and deep auto-encoder approach for rapid diagnosis of breast cancer. 3403-3419 - Dayong Tian, Yiqin Cao, Yiwen Wei, Deyun Zhou:
Narrowing the variance of variational cross-encoder for cross-modal hashing. 3421-3430 - Xin Zhang, Xiaotian Cao, Jun Wang, Lei Wan:
G-UNeXt: a lightweight MLP-based network for reducing semantic gap in medical image segmentation. 3431-3446 - Jennil Thiyam, Sanasam Ranbir Singh, Prabin Kumar Bora:
Integrated document segmentation and region identification: textual, equation and graphical. 3447-3466 - Tuo Li, Yahong Han:
Improving transferable adversarial attack for vision transformers via global attention and local drop. 3467-3480 - Jakub Lokoc, Stelios Andreadis, Werner Bailer, Aaron Duane, Cathal Gurrin, Zhixin Ma, Nicola Messina, Thao-Nhu Nguyen, Ladislav Peska, Luca Rossetto, Loris Sauter, Konstantin Schall, Klaus Schoeffmann, Omar Shahbaz Khan, Florian Spiess, Lucia Vadicamo, Stefanos Vrochidis:
Interactive video retrieval in the age of effective joint embedding deep models: lessons from the 11th VBS. 3481-3504 - Shijie Jia, Yan Cui, Xiaoyan Su, Zongzheng Liang:
A social-aware video sharing solution using demand prediction of epidemic-based propagation in wireless networks. 3505-3520 - Quan-Lin Gu, Sai Yang, Tianxing Yu:
Lite general network and MagFace CNN for micro-expression spotting in long videos. 3521-3530 - Li Han, Jinhai He, Feng Dou, Huiwen Ma, Xinyang Xie, Wanwen Yang:
A viewpoint-guided prototype network for 3D shape classification. 3531-3547 - Zhiwei Ma, Guilin Yao:
Deep portrait matting via double-grained segmentation. 3549-3557 - Nan Xie, Zhaojie Liu, Zhengxu Li, Wei Pang, Beier Lu:
Student engagement detection in online environment using computer vision and multi-dimensional feature fusion. 3559-3577 - Fanqiang Kong, Jiahui Tang, Yunsong Li, Dan Li, Kedi Hu:
Dual-branch spectral-spatial feature extraction network for multispectral image compression. 3579-3597 - Jun Wu, Tianliang Zhu, Jiahui Zhu, Tianyi Li, Chunzhi Wang:
Hierarchical multiples self-attention mechanism for multi-modal analysis. 3599-3608 - Yizhong Yang, Tingting Xia, Dajin Li, Zhang Zhang, Guangjun Xie:
A multi-scale feature fusion spatial-channel attention model for background subtraction. 3609-3623 - Tao Hu, Xuyu Xiang, Jiaohua Qin, Yun Tan:
Audio-text retrieval based on contrastive learning and collaborative attention mechanism. 3625-3638 - Kaisi Yang, Lianyu Zhao, Chenglin Wang:
Workpiece tracking based on improved SiamFC++ and virtual dataset. 3639-3653 - Xinglin Pan, Mingxin Gan:
Multi-behavior recommendation based on intent learning. 3655-3668 - Bin Liu, Siyan Fang:
Multi-aggregation network based on non-separable lifting wavelet for single image deraining. 3669-3684 - Deeksha Gupta, Akashdeep Sharma:
A two-stage attention augmented fully convolutional network-based dynamic video summarization. 3685-3701 - Honglin Li, Qinghua Huang:
MAF-Net: multidimensional attention fusion network for multichannel speech separation. 3703-3720 - Reza Khodadadi, Gholamreza Ardeshir, Hadi Grailu:
Compression of face images using meta-heuristic algorithms based on curvelet transform with variable bit allocation. 3721-3744 - Haiyan Zhang, Quan Wang, Guorui Feng:
Artistic image adversarial attack via style perturbation. 3745-3755 - Xin Zheng, Xin He, Yimo Ren, Jinfa Wang, Junyang Yu:
Owner named entity recognition in website based on multidimensional text guidance and space alignment co-attention. 3757-3770 - Jiangpeng Zheng, Fan Shi, Meng Zhao, Chen Jia, Congcong Wang:
Learning intra-inter-modality complementary for brain tumor segmentation. 3771-3780 - Bowen Xin, Ning Xu, Yingchen Zhai, Tingting Zhang, Zimu Lu, Jing Liu, Weizhi Nie, Xuanya Li, An-An Liu:
A comprehensive survey on deep-learning-based visual captioning. 3781-3804 - Wei Xiong, Haoliang Liu, Siya Mi, Yu Zhang:
Asymmetric bi-encoder for image-text retrieval. 3805-3818 - Fengjun Xiao, Zhuxi Zhang, Ye Yao:
CTNet: hybrid architecture based on CNN and transformer for image inpainting detection. 3819-3832 - Emrah Dönmez, Serhat Kiliçarslan, Cemil Közkurt, Aykut Diker, Fahrettin Burak Demir, Abdullah Elen:
Identification of haploid and diploid maize seeds using hybrid transformer model. 3833-3845 - Na Ta, Haipeng Chen, Xianzhu Liu, Nuo Jin:
LET-Net: locally enhanced transformer network for medical image segmentation. 3847-3861 - Haoliang Zhou, Shucheng Huang, Yuqiao Xu:
Inceptr: micro-expression recognition integrating inception-CBAM and vision transformer. 3863-3876 - Noor Ahmed, Rozina, Ahmad Ali, Abdul Raziq:
Images denoising for COVID-19 chest X-ray based on multi-scale parallel convolutional neural network. 3877-3890 - Jiacheng Chang, Lanyong Zhang, Zhuang Shao:
View-target relation-guided unsupervised 2D image-based 3D model retrieval via transformer. 3891-3901 - Reza Khodadadi, Gholamreza Ardeshir, Hadi Grailu:
Variable bit allocation method based on meta-heuristic algorithms for facial image compression. 3903-3930 - Hüseyin Yasar, Murat Ceylan:
A novel study for automatic two-class COVID-19 diagnosis (between COVID-19 and Healthy, Pneumonia) on X-ray images using texture analysis and 2-D/3-D convolutional neural networks. 3931-3949 - Alison Reboud, Ismail Harrando, Pasquale Lisena, Raphaël Troncy:
Stories of love and violence: zero-shot interesting events' classification for unsupervised TV series summarization. 3951-3969 - Akash Tayal, Jivansha Gupta, Arun Solanki, Khyati Bisht, Anand Nayyar, Mehedi Masud:
Correction to: DL‑CNN‑based approach with image processing techniques for diagnosis of retinal diseases. 3971 - Hwei Teeng Chong, Chen Kim Lim, Ahmad Rafi, Kian Lam Tan, Mazlin Mokhtar:
Correction: Comprehensive systematic review on virtual reality for cultural heritage practices: coherent taxonomy and motivations. 3973
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.