default search action
18th ICCV Workshops 2021: Montreal, QC, Canada
- IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2021, Montreal, QC, Canada, October 11-17, 2021. IEEE 2021, ISBN 978-1-6654-0191-3
Adversarial Robustness in the Real World (AROW)
- Artur Jordão, Hélio Pedrini:
On the Effect of Pruning on Adversarial Robustness. 1-11 - Greg Fields, Mohammad Samragh, Mojan Javaheripi, Farinaz Koushanfar, Tara Javidi:
Trojan Signatures in DNN Weights. 12-20 - Kanjar De, Marius Pedersen:
Impact of Colour on Robustness of Deep Neural Networks. 21-30 - Salah Ghamizi, Maxime Cordy, Mike Papadakis, Yves Le Traon:
Evasion Attack STeganography: Turning Vulnerability Of Machine Learning To Adversarial Attacks Into A Real-world Application. 31-40 - Nathan Inkawhich, Kevin J. Liang, Jingyang Zhang, Huanrui Yang, Hai Li, Yiran Chen:
Can Targeted Adversarial Examples Transfer When the Source and Target Models Have No Label Space Overlap? 41-50 - Thomas Duboudin, Emmanuel Dellandréa, Corentin Abgrall, Gilles Hénaff, Liming Chen:
Encouraging Intra-Class Diversity Through a Reverse Contrastive Loss for Single-Source Domain Generalization. 51-60 - Guillaume Jeanneret, Juan C. Pérez, Pablo Arbeláez:
A Hierarchical Assessment of Adversarial Severity. 61-70 - Xiangyu Qu, Stanley H. Chan:
Detecting and Segmenting Adversarial Graphics Patterns from Images. 71-80 - Juan C. Pérez, Motasem Alfarra, Guillaume Jeanneret, Laura Rueda, Ali K. Thabet, Bernard Ghanem, Pablo Arbeláez:
Enhancing Adversarial Robustness via Test-time Transformation Ensembling. 81-91 - Abhiram Gnanasambandam, Alex M. Sherman, Stanley H. Chan:
Optical Adversarial Attack. 92-101 - Cheng Zhang, Pan Gao:
Countering Adversarial Examples: Combining Input Transformation and Noisy Training. 102-111 - Max Lennon, Nathan Drenkow, Philippe Burlina:
Patch Attack Invariance: How Sensitive are Patch Attacks to 3D Pose? 112-121 - Adith Boloor, Tong Wu, Patrick Naughton, Ayan Chakrabarti, Xuan Zhang, Yevgeniy Vorobeychik:
Can Optical Trojans Assist Adversarial Perturbations? 122-131 - Yuan Wu, Diana Inkpen, Ahmed El-Roby:
Towards Category and Domain Alignment: Category-Invariant Feature Enhancement for Adversarial Domain Adaptation. 132-141 - Yuzhen Ding, Nupur Thakur, Baoxin Li:
AdvFoolGen: Creating Persistent Troubles for Deep Classifiers. 142-151 - Chaitanya Devaguptapu, Devansh Agarwal, Gaurav Mittal, Pulkit Gopalani, Vineeth N. Balasubramanian:
On Adversarial Robustness: A Neural Architecture Search perspective. 152-161
Robust Subspace Learning and Applications in Computer Vision (RSLCV)
- Carl Olsson, Daniele Gerosa, Marcus Carlsson:
Relaxations for Non-Separable Cardinality/Rank Penalties. 162-171 - Zhengqin Xu, Huasong Xing, Shun Fang, Shiqian Wu, Shoulie Xie:
Double-Weighted Low-Rank Matrix Recovery Based on Rank Estimation. 172-180 - Maryam Sultana, Arif Mahmood, Thierry Bouwmans, Muhammad Haris Khan, Soon Ki Jung:
Background/Foreground Separation: Guided Attention based Adversarial Modeling (GAAM) versus Robust Subspace Learning Methods. 181-188 - HanQin Cai, Zehan Chao, Longxiu Huang, Deanna Needell:
Fast Robust Tensor Principal Component Analysis via Fiber CUR Decomposition *. 189-197 - Manish Sharma, Panos P. Markopoulos, Eli Saber, M. Salman Asif, Ashley Prater-Bennette:
Convolutional Auto-Encoder with Tensor-Train Factorization. 198-206 - Marcella Astrid, Muhammad Zaigham Zaheer, Seung-Ik Lee:
Synthetic Temporal Anomaly Guided End-to-End Video Anomaly Detection. 207-214 - Islam I. Osman, Mohamed H. Abdelpakey, Mohamed S. Shehata:
TransBlast: Self-Supervised Learning Using Augmented Subspace with Transformer for Background/Foreground Separation. 215-224 - Jhony H. Giraldo, Sajid Javed, Naoufel Werghi, Thierry Bouwmans:
Graph CNN for Moving Object Detection in Complex Environments from Unseen Videos. 225-233
Distributed Smart Cameras (DSC)
- Nicola Garau, Giulia Martinelli, Piotr Bródka, Niccoló Bisagno, Nicola Conci:
PanopTOP: a framework for generating viewpoint-invariant human pose estimation datasets. 234-242 - Mateusz Majcher, Bogdan Kwolek:
Deep Quaternion Pose Proposals for 6D Object Pose Tracking. 243-251 - Yanting Zhang, Qingxiang Wang:
Pedestrian Tracking through Coordinated Mining of Multiple Moving Cameras. 252-261 - Abbas Omidi, Amirhossein Heydarian, Aida Mohammadshahi, Behnam Asghari Beirami, Farzan Haddadi:
An Embedded Deep Learning-based Package for Traffic Law Enforcement. 262-271 - Rita Pucci, Christian Micheloni, Niki Martinel:
Self-Attention Agreement Among Capsules. 272-280 - Asad Munir, Chengjin Lyu, Bart Goossens, Wilfried Philips, Christian Micheloni:
Resolution based Feature Distillation for Cross Resolution Person Re-Identification. 281-289 - Leonardo Taccari:
Domain-based semi-supervised learning: exploiting label invariance in unlabeled data from distributed cameras. 290-297 - Vaibhav Bansal, Gian Luca Foresti, Niki Martinel:
Where Did I See It? Object Instance Re-Identification with Attention. 298-306 - Alessandro Avi, Matteo Zuccatti, Matteo Nardello, Nicola Conci, Davide Brunelli:
Infrared dataset generation for people detection through superimposition of different camera sensors. 307-316
Neural Architectures: Past, Present and Future (NeurArch)
- Xiangxiang Chu, Bo Zhang, Qingyuan Li, Ruijun Xu, Xudong Li:
SCARLET-NAS: Bridging the Gap between Stability and Scalability in Weight-sharing Neural Architecture Search. 317-325 - Mahdi S. Hosseini, Jia Shu Zhang, Zhe Liu, Andre Fu, Jingxuan Su, Mathieu Tuli, Konstantinos N. Plataniotis:
CONet: Channel Optimization for Convolutional Neural Networks. 326-335 - Borui Jiang, Yadong Mu:
Russian Doll Network: Learning Nested Networks for Sample-Adaptive Dynamic Inference. 336-344 - Niv Vosco, Alon Shenkler, Mark Grobman:
Tiled Squeeze-and-Excite: Channel Attention With Local Spatial Context. 345-357 - Fan Jia, Wing Hong Wong, Tieyong Zeng:
DDUNet: Dense Dense U-Net with Applications in Image Denoising. 354-364 - Biluo Shen, Anqi Xiao, Jie Tian, Zhenhua Hu:
PP-NAS: Searching for Plug-and-Play Blocks on Convolutional Neural Network. 365-372 - Pengfei Hou, Ying Jin, Yukang Chen:
Single-DARTS: Towards Stable Architecture Search. 373-382 - Julio Zamora-Esquivel, Jesus Adan Cruz Vargas, Anthony D. Rhodes, Lama Nachman, Narayan Sundararajan:
Convolutional Filter Approximation Using Fractional Calculus. 383-392 - Michail Chatzianastasis, George Dasoulas, Georgios Siolas, Michalis Vazirgiannis:
Graph-based Neural Architecture Search with Operation Embeddings. 393-402 - Ionut Cosmin Duta, Mariana-Iuliana Georgescu, Radu Tudor Ionescu:
Contextual Convolutional Neural Networks. 403-412 - Zhuliang Yao, Yue Cao, Yutong Lin, Ze Liu, Zheng Zhang, Han Hu:
Leveraging Batch Normalization for Vision Transformers. 413-422
AI-Enabled Medical Image Analysis and COVID-19 Diagnosis (MIACOV19D)
- Francesco Rundo, Angelo Genovese, Roberto Leotta, Fabio Scotti, Vincenzo Piuri, Sebastiano Battiato:
Advanced 3D Deep Non-Local Embedded System for Self-Augmented X-Ray-based COVID-19 Assessment. 423-432 - Adrit Rao, Jongchan Park, Oliver O. Aalami:
The Value of Visual Attention for COVID-19 Classification in CT Scans. 433-438 - Weijun Tan, Jingfeng Liu:
A 3D CNN Network with BERT For Automatic COVID-19 Diagnosis From CT-Scan Images. 439-445 - Debora Gil, Sonia Baeza, Carles Sánchez, Guillermo Torres, Ignasi García-Olivé, Gloria Moragas, Jordi Deportós, Maite Salcedo, Antoni Rosell:
Intelligent Radiomic Analysis of Q-SPECT/CT images to optimize pulmonary embolism diagnosis in COVID-19 patients. 446-453 - Junlin Hou, Jilan Xu, Rui Feng, Yuejie Zhang, Fei Shan, Weiya Shi:
CMC-COV19D: Contrastive Mixup Classification for COVID-19 Diagnosis. 454-461 - Alyaa Amer, Xujiong Ye, Faraz Janan:
Residual Dilated U-net For The Segmentation Of COVID-19 Infection From CT Images. 462-470 - Guan-Lin Chen, Chih-Chung Hsu, Mei-Hsuan Wu:
Adaptive Distribution Learning with Statistical Hypothesis Testing for COVID-19 CT Scan Classification. 471-479 - George Ioannou, Tasos Papagiannis, Thanos Tagaris, Georgios Alexandridis, Andreas Stafylopatis:
Visual interpretability analysis of Deep CNNs using an Adaptive Threshold method on Diabetic Retinopathy images. 480-486 - Nguyen P. Nguyen, Youngjin Yoo, Andrei Chekkoury, Eva Eibenberger, Thomas J. Re, Jyotipriya Das, Abishek Balachandran, Yvonne W. Lui, Pina C. Sanelli, Thomas J. Schroeppel, Uttam Bodanapally, Savvas Nicolaou, Tommi A. White, Filiz Bunyak, Dorin Comaniciu, Eli Gibson:
Brain midline shift detection and quantification by a cascaded deep network pipeline on non-contrast computed tomography scans. 487-495 - Mohammad Nayeem Teli:
TeliNet: Classifying CT scan images for COVID-19 diagnosis. 496-502 - Talha Anwar:
COVID19 Diagnosis using AutoML from 3D CT scans. 503-507 - Shuang Liang, Weicun Zhang, Yu Gu:
A hybrid and fast deep learning framework for Covid-19 detection via 3D Chest CT Images. 508-512 - Lei Zhang, Yan Wen:
A transformer-based framework for automatic COVID19 diagnosis in chest CTs. 513-518 - Meghna P. Ayyar, Jenny Benois-Pineau, Akka Zemmari:
A Hierarchical Classification System for the Detection of Covid-19 from Chest X-Ray Images. 519-528 - Radu Miron, Cosmin Moisii, Sergiu Dinu, Mihaela Elena Breaban:
Evaluating volumetric and slice-based approaches for COVID-19 detection in chest CTs. 529-536 - Dimitrios Kollias, Anastasios Arsenos, Levon Soukissian, Stefanos D. Kollias:
MIA-COV19D: COVID-19 Detection through 3-D Chest CT Image Analysis. 537-544
Computational Challenges in Digital Pathology (CDPath)
- Philipp Gräbel, Martina Crysandt, Barbara Mara Klinkhammer, Peter Boor, Tim H. Brümmendorf, Dorit Merhof:
Guided Representation Learning for the Classification of Hematopoietic Cells. 545-551 - Adam J. Shephard, Simon Graham, Raja Muhammad Saad Bashir, Mostafa Jahanifar, Hanya Mahmood, Syed Ali Khurram, Nasir M. Rajpoot:
Simultaneous Nuclear Instance and Layer Segmentation in Oral Epithelial Dysplasia. 552-561 - Chetan L. Srinidhi, Anne L. Martel:
Improving Self-supervised Learning with Hardness-aware Dynamic Curriculum Learning: An Application to Digital Pathology. 562-571 - Sheyang Tang, Mahdi S. Hosseini, Lina Chen, Sonal Varma, Corwyn Rowsell, Savvas Damaskinos, Konstantinos N. Plataniotis, Zhou Wang:
Probeable DARTS with Application to Computational Pathology. 572-581 - Luisa Theelke, Frauke Wilm, Christian Marzahl, Christof A. Bertram, Robert Klopfleisch, Andreas Maier, Marc Aubreville, Katharina Breininger:
Iterative Cross-Scanner Registration for Whole Slide Images. 582-590 - Zhengfeng Lai, Chao Wang, Luca Cerny Oliveira, Brittany N. Dugger, Sen-Ching Samson Cheung, Chen-Nee Chuah:
Joint Semi-supervised and Active Learning for Segmentation of Gigapixel Pathology Images with Cost-Effective Labeling. 591-600 - Niccolò Marini, Manfredo Atzori, Sebastian Otálora, Stéphane Marchand-Maillet, Henning Müller:
H&E-adversarial network: a convolutional neural network to learn stain-invariant features through Hematoxylin & Eosin regression. 601-610 - Philippe Weitz, Yinxi Wang, Johan Hartman, Mattias Rantalainen:
An investigation of attention mechanisms in histopathology whole-slide-image analysis for regression objectives. 611-619 - Jessica Deuschel, Daniel Firmbach, Carol I. Geppert, Markus Eckstein, Arndt Hartmann, Volker Bruns, Petr Kuritcyn, Jakob Dexl, David Hartmann, Dominik Perrin, Thomas Wittenberg, Michaela Benz:
Multi-Prototype Few-shot Learning in Histopathology. 620-628 - Sivaramakrishnan Sankarapandian, Saul Kohn, Vaughn Spurrier, Sean Grullon, Rajath E. Soans, Kameswari D. Ayyagari, Ramachandra Vikas Chamarthi, Kiran Motaparthi, Jason B. Lee, Wonwoo Shon, Michael J. Bonham, Julianna D. Ianni:
A Pathology Deep Learning System Capable of Triage of Melanoma Specimens Utilizing Dermatopathologist Consensus as Ground Truth *. 629-638 - Joseph Boyd, Mykola Liashuha, Eric Deutsch, Nikos Paragios, Stergios Christodoulidis, Maria Vakalopoulou:
Self-Supervised Representation Learning using Visual Field Expansion on Digital Pathology. 639-647 - Robert Jewsbury, Abhir Bhalerao, Nasir M. Rajpoot:
A QuadTree Image Representation for Computational Pathology. 648-656 - Tomé Albuquerque, Ana Moreira, Jaime S. Cardoso:
Deep Ordinal Focus Assessment for Whole Slide Images. 657-663 - Muhammad Dawood, Kim Branson, Nasir M. Rajpoot, Fayyaz ul Amir Afsar Minhas:
ALBRT: Cellular Composition Prediction in Routine Histology Images. 664-673 - Mostafa Jahanifar, Neda Zamani Tajeddin, Navid Alemi Koohbanani, Nasir M. Rajpoot:
Robust Interactive Semantic Segmentation of Pathology Images with Minimal User Input. 674-683 - Simon Graham, Mostafa Jahanifar, Ayesha Azam, Mohammed Nimir, Yee-Wah Tsang, Katherine Dodd, Emily Hero, Harvir Sahota, Atisha Tank, Ksenija Benes, Noorul Wahab, Fayyaz A. Minhas, Shan-E-Ahmed Raza, Hesham Eldaly, Kishore Gopalakrishnan, David R. J. Snead, Nasir M. Rajpoot:
Lizard: A Large-Scale Dataset for Colonic Nuclear Instance Segmentation and Classification. 684-693 - Yuang Zhu, Zhao Chen, Yuxin Zheng, Qinghua Zhang, Xuan Wang:
Real-Time Cell Counting in Unlabeled Microscopy Images. 694-703
Learning To Understand Aerial Images (LUAI)
- Nicholas Kashani Motlagh, Aswathnarayan Radhakrishnan, Jim Davis, Roman Ilin:
A Framework for Semi-automatic Collection of Temporal Satellite Imagery for Analysis of Dynamic Regions. 704-712 - Huiming Sun, Yuewei Lin, Qin Zou, Shaoyue Song, Jianwu Fang, Hongkai Yu:
Convolutional Neural Networks Based Remote Sensing Scene Classification under Clear and Cloudy Environments. 713-720 - Stefan Wolf, Jonas Meier, Lars Sommer, Jürgen Beyerer:
Double Head Predictor based Few-Shot Object Detection for Aerial Imagery. 721-731 - Xiaochen Zheng, Benjamin Kellenberger, Rui Gong, Irena Hajnsek, Devis Tuia:
Self-Supervised Pretraining and Controlled Augmentation Improve Rare Wildlife Recognition in UAV Images. 732-741 - Weitao Chen, Zhibin Wang, Hao Li:
Get better 1 pixel PCK: ladder scales correspondence flow networks for remote sensing image matching in higher resolution. 742-751 - Nouman Ahmed, Sudipan Saha, Muhammad Shahzad, Muhammad Moazam Fraz, Xiao Xiang Zhu:
Progressive Unsupervised Deep Transfer Learning for Forest Mapping in Satellite Image. 752-761 - Gui-Song Xia, Jian Ding, Ming Qian, Nan Xue, Jiaming Han, Xiang Bai, Michael Ying Yang, Shengyang Li, Serge J. Belongie, Jiebo Luo, Mihai Datcu, Marcello Pelillo, Liangpei Zhang, Qiang Zhou, Chao-hui Yu, Kaixuan Hu, Yingjia Bu, Wenming Tan, Zhe Yang, Wei Li, Shang Liu, Jiaxuan Zhao, Tianzhi Ma, Zi-han Gao, Lingqi Wang, Yi Zuo, Licheng Jiao, Chang Meng, Hao Wang, Jiahao Wang, Yiming Hui, Zhuojun Dong, Jie Zhang, Qianyue Bao, Zixiao Zhang, Fang Liu:
LUAI Challenge 2021 on Learning to Understand Aerial Images. 762-768
Low-Power Computer Vision (LPCV)
- Amin Banitalebi-Dehkordi:
Knowledge Distillation for Low-Power Object Detection: A Simple Technique and Its Extensions for Training Compact Models Using Unlabeled Data. 769-778 - Chien-Yao Wang, Hong-Yuan Mark Liao, I-Hau Yeh, Yung-Yu Chuang, Youn-Long Lin:
Exploring the power of lightweight YOLOv4. 779-788 - Chia-Hsiang Liu, Yu-Shin Han, Yuan-Yao Sung, Yi Lee, Hung-Yueh Chiang, Kai-Chiang Wu:
FOX-NAS: Fast, On-device and Explainable Neural Architecture Search. 789-797 - Ivan Lazarevich, Alexander Kozlov, Nikita Malinin:
Post-training deep neural network pruning via layer-wise calibration. 798-805
Multi-Modal Video Reasoning and Analyzing (MMVRA)
- Haoran Peng, He Huang, Li Xu, Tianjiao Li, Jun Liu, Hossein Rahmani, Qiuhong Ke, Zhicheng Guo, Cong Wu, Rongchang Li, Mang Ye, Jiahao Wang, Jiaxu Zhang, Yuanzhong Liu, Tao He, Fuwei Zhang, Xianbin Liu, Tao Lin:
The Multi-Modal Video Reasoning and Analyzing Competition. 806-813
Chalearn Face Anti-Spoofing (ChaLearn_FAS)
- Ajian Liu, Chenxu Zhao, Zitong Yu, Anyang Su, Xing Liu, Zijian Kong, Jun Wan, Sergio Escalera, Hugo Jair Escalante, Zhen Lei, Guodong Guo:
3D High-Fidelity Mask Face Presentation Attack Detection Challenge. 814-823 - Xiang Xu, Yuanjun Xiong, Wei Xia:
On Improving Temporal Consistency for Online Face Liveness Detection System. 824-833 - Shen Chen, Taiping Yao, Ke-Yue Zhang, Yang Chen, Ke Sun, Shouhong Ding, Jilin Li, Feiyue Huang, Rongrong Ji:
A Dual-stream Framework for 3D Mask Face Presentation Attack Detection. 834-841 - Samuel Huang, Wen-Huang Cheng, Robert Cheng:
Single Patch Based 3D High-Fidelity Mask Face Anti-Spoofing. 842-845 - Oleg Grinchuk, Aleksandr Parkin, Evgenija Glazistova:
3D mask presentation attack detection via high resolution face parts. 846-853
When Graph Signal Processing Meets Computer Vision (GSP-CV)
- Maosen Li, Siheng Chen, Zihui Liu, Zijing Zhang, Lingxi Xie, Qi Tian, Ya Zhang:
Skeleton Graph Scattering Networks for 3D Skeleton-based Human Motion Prediction. 854-864 - Naina Dhingra, George Chogovadze, Andreas M. Kunz:
Border-SegGCN: Improving Semantic Segmentation by Refining the Border Outline using Graph Convolutional Network. 865-875 - Anindya Mondal, Shashant R, Jhony H. Giraldo, Thierry Bouwmans, Ananda S. Chowdhury:
Moving Object Detection for Event-based Vision using Graph Spectral Clustering. 876-884 - Jin Wang, Bo Jiang:
Zero-Shot Learning via Contrastive Learning on Dual Knowledge Graphs. 885-892 - Haolan Chen, Shitong Luo, Xiang Gao, Wei Hu:
Unsupervised Learning of Geometric Sampling Invariant Representations for 3D Point Clouds. 893-903 - Kevin Potter, Steven Sleder, Matthew Smith, Shehan Perera, Alper Yilmaz, John Tencer:
Parameterized Pseudo-Differential Operators for Graph Convolutional Neural Networks. 904-912
3D Object Detection From Images (3DODI)
- Tai Wang, Xinge Zhu, Jiangmiao Pang, Dahua Lin:
FCOS3D: Fully Convolutional One-Stage Monocular 3D Object Detection. 913-922 - Jonas Heylen, Mark De Wolf, Bruno Dawagne, Marc Proesmans, Luc Van Gool, Wim Abbeloos, Hazem Abdelkawy, Daniel Olmeda Reino:
MonoCInIS: Camera Independent Monocular 3D Object Detection using Instance Segmentation. 923-934 - Frederik Hagelskjær, Anders Glent Buch:
Bridging the Reality Gap for Pose Estimation Networks using Sensor-Based Domain Randomization. 935-944
Embedded and Real-World Computer Vision in Autonomous Driving (ERCVAD)
- Florentin Poucin, Andrea Kraus, Martin Simon:
Boosting Instance Segmentation with Synthetic Data: A study to overcome the limits of real world data sets. 945-953 - Julia Hornauer, Lazaros Nalpantidis, Vasileios Belagiannis:
Visual Domain Adaptation for Monocular Depth Estimation on Resource-Constrained Hardware. 954-962 - Aditya Rajagopal, Christos-Savvas Bouganis:
perf4sight: A toolflow to model CNN training performance on Edge GPUs. 963-971 - Sven Mantowsky, Falk Heuer, Syed Saqib Bukhari, Michael Keckeisen, Georg Schneider:
ProAI: An Efficient Embedded AI Hardware for Automotive Applications - a Benchmark Study. 972-978 - Matthias Reuse, Martin Simon, Bernhard Sick:
About the Ambiguity of Data Augmentation for 3D Object Detection in Autonomous Driving. 979-987 - Maria Lyssenko, Christoph Gladisch, Christian Heinzemann, Matthias Woehrle, Rudolph Triebel:
Instance Segmentation in CARLA: Methodology and Analysis for Pedestrian-oriented Synthetic Data Generation in Crowded Scenes. 988-996 - Falk Heuer, Sven Mantowsky, Syed Saqib Bukhari, Georg Schneider:
MultiTask-CenterNet (MCN): Efficient and Diverse Multitask Learning using an Anchor Free Approach. 997-1005 - Sujan Sai Gannamaneni, Sebastian Houben, Maram Akila:
Semantic Concept Testing in Autonomous Driving by Extraction of Object-Level Annotations from CARLA. 1006-1014 - Lukas Stäcker, Juncong Fei, Philipp Heidenreich, Frank Bonarens, Jason R. Rambach, Didier Stricker, Christoph Stiller:
Deployment of Deep Neural Networks for Object Detection on Edge AI Devices with Runtime Optimization. 1015-1022 - Daniel Bogdoll, Jasmin Breitenstein, Florian Heidecker, Maarten Bieshaar, Bernhard Sick, Tim Fingscheidt, J. Marius Zöllner:
Description of Corner Cases in Automated Driving: Goals and Challenges. 1023-1028 - Deepthi Sreenivasaiah, Johannes S. Otterbach, Thomas Wollmann:
MEAL: Manifold Embedding-based Active Learning. 1029-1037
Visual Inductive Priors for Data-Efficient Deep Learning (VIPriors)
- Sylvestre-Alvise Rebuffi, Sébastien Ehrhardt, Kai Han, Andrea Vedaldi, Andrew Zisserman:
LSD-C: Linearly Separable Deep Clusters. 1038-1046 - António Farinhas, André F. T. Martins, Pedro M. Q. Aguiar:
Multimodal Continuous Visual Attention Mechanisms. 1047-1056 - Donghyun Kim, Kuniaki Saito, Samarth Mishra, Stan Sclaroff, Kate Saenko, Bryan A. Plummer:
Self-supervised Visual Attribute Learning for Fashion Compatibility. 1057-1066 - Sihan Liu, Yue Wang:
Few-shot Learning with Online Self-Distillation. 1067-1070 - Lorenzo Brigato, Björn Barz, Luca Iocchi, Joachim Denzler:
Tune It or Don't Use It: Benchmarking Data-Efficient Image Classification. 1071-1080 - Artem Moskalev, Ivan Sosnovik, Arnold W. M. Smeulders:
Relational Prior for Multi-Object Tracking. 1081-1085 - T. Anderson Keller, Max Welling:
Predictive Coding with Topographic Variational Autoencoders. 1086-1091 - Ivan Sosnovik, Artem Moskalev, Arnold W. M. Smeulders:
How to Transform Kernels for Scale-Convolutions. 1092-1097 - Vitaliy Kinakh, Olga Taran, Svyatoslav Voloshynovskiy:
ScatSimCLR: self-supervised contrastive learning with pretext task regularization for small-scale datasets. 1098-1106 - Matheus Gadelha, Rui Wang, Subhransu Maji:
Deep Manifold Prior. 1107-1116
Physics Based Vision Meets Deep Learning (PBDL)
- Yuxing Huang, Qiu Shen, Ying Fu, Shaodi You:
Weakly-supervised Semantic Segmentation in Cityscape via Hyperspectral Image. 1117-1126 - Ruth Wijma, Shaodi You, Yu Li:
Multi-Level Adaptive Separable Convolution for Large-Motion Video Frame Interpolation. 1127-1135 - Leron Julian, Aswin C. Sankaranarayanan:
Precise Forecasting of Sky Images Using Spatial Warping. 1136-1144 - Siyuan Li, Yue Luo, Ye Zhu, Xun Zhao, Yu Li, Ying Shan:
Enforcing Temporal Consistency in Video Depth Estimation. 1145-1154 - Chu Zhou, Minggui Teng, Jin Han, Chao Xu, Boxin Shi:
DeLiEve-Net: Deblurring Low-light Images with Light Streaks and Local Events. 1155-1164 - Takafumi Iwaguchi, Hiroshi Kawasaki:
Efficient light transport acquisition by coded illumination and robust photometric stereo by dual photography using deep neural network. 1165-1173 - Nobuhiko Wakai, Takayoshi Yamashita:
Deep Single Fisheye Image Camera Calibration for Over 180-degree Projection of Field of View. 1174-1183 - Yorimoto Kohei, Xian-Hua Han:
HyperMixNet: Hyperspectral Image Reconstruction with Deep Mixed Network from a Snapshot Measurement. 1184-1193 - Partha Das, Yang Liu, Sezer Karaoglu, Theo Gevers:
Generative Models for Multi-Illumination Color Constancy. 1194-1203
Catch UAVs That Want To Watch You: Detection and Tracking of Unmanned Aerial Vehicle in the Wild and Anti-UAV Challenge (AntiUAV)
- Bo Huang, Junjie Chen, Tingfa Xu, Ying Wang, Shenwang Jiang, Yuncheng Wang, Lei Wang, Jianan Li:
SiamSTA: Spatio-Temporal Attention based Siamese Tracker for Tracking UAVs. 1204-1212 - Jinjian Zhao, Xiaohan Zhang, Pengyu Zhang:
A Unified Approach for Tracking UAVs in Infrared. 1213-1222 - Brian K. S. Isaac-Medina, Matt Poyser, Daniel Organisciak, Chris G. Willcocks, Toby P. Breckon, Hubert P. H. Shum:
Unmanned Aerial Vehicle Visual Detection and Tracking using Deep Neural Networks: A Performance Benchmark. 1223-1232 - Kutalmis Gokalp Ince, Aybora Koksal, Arda Fazla, A. Aydin Alatan:
Semi-Automatic Annotation For Visual Object Tracking. 1233-1239 - Houzhang Fang, Xiaolin Wang, Zikai Liao, Yi Chang, Luxin Yan:
A Real-time Anti-distractor Infrared UAV Tracker with Channel Feature Refinement Module. 1240
Computer Vision in Plant Phenotyping and Agriculture (CVPPA)
- Ruohao Guo, Liao Qu, Dantong Niu, Zhenbo Li, Jun Yue:
LeafMask: Towards Greater Accuracy on Leaf Segmentation. 1249-1258 - Riccardo Gozzovelli, Benjamin Franchetti, Malik Bekmurat, Fiora Pirri:
Tip-burn stress detection of lettuce canopy grown in Plant Factories. 1259-1268 - Zhenghao Fei, Alex Olenskyj, Brian N. Bailey, Mason Earles:
Enlisting 3D Crop Models and GANs for More Data Efficient and Generalizable Fruit Detection. 1269-1277 - Chengxin Liu, Kewei Wang, Hao Lu, Zhiguo Cao:
Dynamic Color Transform for Wheat Head Detection. 1278-1283 - Paul Albert, Mohamed Saadeldin, Badri Narayanan, Brian Mac Namee, Deirdre Hennessy, Aisling O'Connor, Noel E. O'Connor, Kevin McGuinness:
Semi-supervised dry herbage mass estimation using automatic data and synthetic images. 1284-1293 - Birgit Möller, Berit Schreck, Stefan Posch:
Analysis of Arabidopsis Root Images - Studies on CNNs and Skeleton-Based Root Topology. 1294-1302 - Geoffroy Couasnet, Mouad Zine El Abidine, François Laurens, Helin Dutagaci, David Rousseau:
Machine learning meets distinctness in variety testing. 1303-1311 - Masoomeh Aslahishahri, Kevin G. Stanley, Hema Sudhakar Duddu, Steve Shirtliffe, Sally Vail, Kirstin Bett, Curtis Pozniak, Ian Stavness:
From RGB to NIR: Predicting of near infrared reflectance from visible spectrum aerial images of crops. 1312-1322 - Alexander Gillert, Bo Peters, Uwe Freiherr von Lukas, Jürgen Kreyling:
Identification and Measurement of Individual Roots in Minirhizotron Images of Dense Root Systems. 1323-1331 - Sandesh Bhagat, Manesh Kokare, Vineet Haswani, Praful Hambarde, Ravi Kamble:
WheatNet-Lite: A Novel Light Weight Network for Wheat Head Detection. 1332-1341 - Keyhan Najafian, Alireza Ghanbari, Ian Stavness, Lingling Jin, Gholam Hassan Shirdel, Farhad Maleki:
A Semi-self-supervised Learning Approach for Wheat Head Detection using Extremely Small Number of Labeled Samples. 1342-1351 - Abby Stylianou, Robert Pless, Nadia Shakoor, Todd C. Mockler:
Classification and Visualization of Genotype × Phenotype Interactions in Biomass Sorghum. 1352-1361 - Sakib Mostafa, Debajyoti Mondal, Michael A. Beck, Christopher P. Bidinosti, Christopher J. Henry, Ian Stavness:
Visualizing Feature Maps for Model Selection in Convolutional Neural Networks. 1362-1371 - Ole-Christian Galbo Engstrøm, Erik Schou Dreier, Kim Steenstrup Pedersen:
Predicting Protein Content in Grain Using Hyperspectral Deep Learning. 1372-1380 - Takeshi Masuda:
Leaf Area Estimation by Semantic Segmentation of Point Cloud of Tomato Plants. 1381-1389 - Changye Yang, Sriram Baireddy, Enyu Cai, Melba M. Crawford, Edward J. Delp:
Field-Based Plot Extraction Using UAV RGB Images. 1390-1398 - Sai Vidyaranya Nuthalapati, Anirudh Tunga:
Multi-Domain Few-Shot Learning and Dataset for Agricultural Applications. 1399-1408 - David S. LeBauer, Maxwell Burnette, Noah Fahlgren, Rob Kooper, Kenton McHenry, Abby Stylianou:
What Does TERRA-REF's High Resolution, Multi Sensor Plant Sensing Public Domain Data Offer the Computer Vision Community? 1409-1415
Differentiable 3D Vision and Graphics (Diff3D)
- Lokender Tiwari, Brojeshwar Bhowmick:
DeepDraper: Fast and Accurate 3D Garment Draping over a 3D Human Body. 1416-1426 - Issam H. Laradji, Pau Rodríguez, David Vázquez, Derek Nowrouzezahrai:
SSR: Semi-supervised Soft Rasterizer for single-view 2D to 3D Reconstruction. 1427-1436
Face Bio-Metrics Under COVID?Masked Face Recognition (MFR)
- Jiankang Deng, Jia Guo, Xiang An, Zheng Zhu, Stefanos Zafeiriou:
Masked Face Recognition Challenge: The InsightFace Track Report. 1437-1444 - Xiang An, Xuhan Zhu, Yuan Gao, Yang Xiao, Yongle Zhao, Ziyong Feng, Lan Wu, Bin Qin, Ming Zhang, Debing Zhang, Ying Fu:
Partial FC: Training 10 Million Identities on a Single Machine. 1445-1449 - Weiqiu Wang, Zhicheng Zhao, Hongyuan Zhang, Zhaohui Wang, Fei Su:
MaskOut: A Data Augmentation Method for Masked Face Recognition. 1450-1455 - Kai Wang, Shuo Wang, Jianfei Yang, Xiaobo Wang, Baigui Sun, Hao Li, Yang You:
Mask Aware Network for Masked Face Recognition in the Wild. 1456-1461 - Hanjie Qian, Panpan Zhang, Sijie Ji, Shuxin Cao, Yuecong Xu:
Improving Representation Consistency with Pairwise Loss for Masked Face Recognition. 1462-1467 - Wei-Yi Chang, Ming-Ying Tsai, Shih-Chieh Lo:
ResSaNet: A Hybrid Backbone of Residual Block and Self-Attention Module for Masked Face Recognition. 1468-1476 - Boxiao Liu, Shenghan Zhang, Guanglu Song, Haihang You, Yu Liu:
Rectifying the Data Bias in Knowledge Distillation. 1477-1486 - Baojin Huang, Zhongyuan Wang, Guangcheng Wang, Kui Jiang, Zheng He, Hua Zou, Qin Zou:
Masked Face Recognition Datasets and Validation. 1487-1491 - Tao Feng, Liangpeng Xu, Hangjie Yuan, Yongfei Zhao, Mingqian Tang, Mang Wang:
Towards Mask-robust Face Recognition. 1492-1496 - Delong Qi, Kangli Hu, Weijun Tan, Qi Yao, Jingfeng Liu:
Balanced Masked and Standard Face Recognition. 1497-1502 - Haoran Jiang, Dan Zeng:
Explainable Face Recognition based on Accurate Facial Compositions. 1503-1512 - Feng Yu, He Li, Sige Bian, Yongming Tang:
An Efficient Network Design for Face Video Super-resolution. 1513-1520 - Xing Lan, Qinghao Hu, Jian Cheng:
Revisting Quantization Error in Face Alignment. 1521-1530 - Jun Yu, Xinlong Hao, Zeyu Cui, Peng He, Tongliang Liu:
Boosting Fairness for Masked Face Recognition. 1531-1540
Interactive Labeling and Data Augmentation for Vision (ILDAV)
- Xinyue Wei, Weichao Qiu, Yi Zhang, Zihao Xiao, Alan L. Yuille:
Nuisance-Label Supervision: Robustness Improvement by Free Labels. 1541-1550 - Yuying Hao, Yi Liu, Zewu Wu, Lin Han, Yizhou Chen, Guowei Chen, Lutao Chu, Shiyu Tang, Zhiliang Yu, Zeyu Chen, Baohua Lai:
EdgeFlow: Achieving Practical Interactive Segmentation with Edge-Guided Flow. 1551-1560 - Rowel Atienza:
Data Augmentation for Scene Text Recognition. 1561-1570 - Vishal Vinod, K. Ram Prabhakar, R. Venkatesh Babu, Anirban Chakraborty:
Multi-Domain Conditional Image Translation: Translating Driving Datasets from Clear-Weather to Adverse Conditions. 1571-1582 - Pranav Acharya, Daniel Lohn, Vivian Ross, Maya Ha, Alexander Rich, Ehsan Sayyad, Tobias Höllerer:
Using Synthetic Data Generation to Probe Multi-View Stereo Networks. 1583-1591 - Shuhao Qiu, Chuang Zhu, Wenli Zhou:
Meta Self-Learning for Multi-Source Domain Adaptation: A Benchmark. 1592-1601 - Soonchan Park, Jinah Park:
Localizing Human Keypoints beyond the Bounding Box. 1602-1611 - Feng Chen, Michael P. Pound, Andrew P. French:
Learning to Localise and Count with Incomplete Dot-annotations. 1612-1620 - Angira Sharma, Naeemullah Khan, Muhammad Mubashar, Ganesh Sundaramoorthi, Philip H. S. Torr:
Class-Agnostic Segmentation Loss and Its Application to Salient Object Detection and Segmentation. 1621-1630 - Javad Zolfaghari Bengar, Joost van de Weijer, Bartlomiej Twardowski, Bogdan Raducanu:
Reducing Label Effort: Self-Supervised meets Active Learning. 1631-1639 - Matteo Pennisi, Simone Palazzo, Concetto Spampinato:
Self-improving classification performance through GAN distillation. 1640-1648 - Mickael Cormier, Fabian Röpke, Thomas Golda, Jürgen Beyerer:
Interactive Labeling for Human Pose Estimation in Surveillance Videos. 1649-1658 - Svetlana Illarionova, Sergey Nesteruk, Dmitrii Shadrin, Vladimir Ignatiev, Mariia Pukalchik, Ivan V. Oseledets:
Object-Based Augmentation for Building Semantic Segmentation: Ventura and Santa Rosa Case Study. 1659-1668 - Marten Franke, Vaishnavi Gopinath, Chaitra Reddy, Danijela Ristic-Durrant, Kai Michels:
Bounding Box Dataset Augmentation for Long-range Object Distance Estimation. 1669-1677 - Robby Neven, Davy Neven, Bert De Brabandere, Marc Proesmans, Toon Goedemé:
Weakly-Supervised Semantic Segmentation by Learning Label Uncertainty. 1678-1686 - Gyungin Shin, Weidi Xie, Samuel Albanie:
All you need are a few pixels: semantic segmentation with PixelPick. 1687-1697 - Moab Arar, Ariel Shamir, Amit Bermano:
InAugment: Improving Classifiers via Internal Augmentation. 1698-1707
Assistive Computer Vision and Robotics (ACVR)
- Xixuan Julie Liu, Yi Fang:
Virtual Touch: Computer Vision Augmented Touch-Free Scene Exploration for the Blind or Visually Impaired. 1708-1717 - Daohan Lu, Yi Fang:
Audi-Exchange: AI-Guided Hand-based Actions to Assist Human-Human Interactions for the Blind and the Visually Impaired. 1718-1726 - Semih Orhan, Yalin Bastanlar:
Efficient Search in a Panoramic Image Database for Long-term Visual Localization. 1727-1734 - Dario Allegra, Mattia Litrico, Maria Ausilia Napoli Spatafora, Filippo Stanco, Giovanni Maria Farinella:
Exploiting Egocentric Vision on Shopping Cart for Out-Of-Stock Detection in Retail Environments. 1735-1740 - Ilya G. Ovodov:
Optical Braille Recognition Using Object Detection Neural Network. 1741-1748 - Yu Rong, Takaaki Shiratori, Hanbyul Joo:
FrankMocap: A Monocular 3D Whole-Body Pose Estimation System via Regression and Integration. 1749-1759 - Jiaming Zhang, Kailun Yang, Angela Constantinescu, Kunyu Peng, Karin Müller, Rainer Stiefelhagen:
Trans4Trans: Efficient Transformer for Transparent Object Segmentation to Help Visually Impaired People Navigate in the Real World. 1760-1770 - Piotr Wozniak, Bogdan Kwolek:
Deep Embeddings-based Place Recognition Robust to Motion Blur. 1771-1779 - Huayao Liu, Ruiping Liu, Kailun Yang, Jiaming Zhang, Kunyu Peng, Rainer Stiefelhagen:
HIDA: Towards Holistic Indoor Understanding for the Visually Impaired via Semantic Instance Segmentation with a Wearable Solid-State LiDAR Sensor. 1780-1790 - Szilárd Molnár, Benjamin Kelényi, Levente Tamás:
ToFNest: Efficient normal estimation for time-of-flight depth cameras. 1791-1798 - Antonio Buemi, Arcangelo Bruna, Sylvain Petinot, Nicolas Roux:
ORB-SLAM with Near-infrared images and Optical Flow data. 1799-1804
Advances in Image Manipulation (AIM)
- Guy Ohayon, Theo Adrai, Gregory Vaksman, Michael Elad, Peyman Milanfar:
High Perceptual Quality Image Denoising with a Posterior Sampling CGAN. 1805-1813 - Xuguang Lai, Xiuxiu Bai, Yongqiang Hao:
Unsupervised Generative Adversarial Networks with Cross-model Weight Transfer Mechanism for Image-to-image Translation. 1814-1822 - Xuanchi Ren, Tao Yang, Yuwang Wang, Wenjun Zeng:
Rethinking Content and Style: Exploring Bias for Unsupervised Disentanglement. 1823-1832 - Jingyun Liang, Jiezhang Cao, Guolei Sun, Kai Zhang, Luc Van Gool, Radu Timofte:
SwinIR: Image Restoration Using Swin Transformer. 1833-1844 - Mohammad Saeed Rad, Thomas Yu, Behzad Bozorgtabar, Jean-Philippe Thiran:
Test-Time Adaptation for Super-Resolution: You Only Need to Overfit on a Few More Images. 1845-1854 - Angela Castillo, María Escobar, Juan C. Pérez, Andrés Romero, Radu Timofte, Luc Van Gool, Pablo Arbeláez:
Generalized Real-World Super-Resolution through Adversarial Robustness. 1855-1865 - Bahjat Kawar, Gregory Vaksman, Michael Elad:
Stochastic Image Denoising by Sampling from the Posterior Distribution. 1866-1875 - Jianfeng He, Bei Xiao, Xuchao Zhang, Shuo Lei, Shuhui Wang, Chang-Tien Lu:
Reducing Noise Pixels and Metric Bias in Semantic Inpainting on Segmentation Map. 1876-1885 - Quanlong Zheng, Xiaotian Qiao, Ying Cao, Shi Guo, Lei Zhang, Rynson W. H. Lau:
Distilling Reflection Dynamics for Single-Image Reflection Removal. 1886-1894 - Wenbin Zou, Mingchao Jiang, Yunchen Zhang, Liang Chen, Zhiyong Lu, Yi Wu:
SDWNet: A Straight Dilated Network with Wavelet Transformation for image Deblurring. 1895-1904 - Xintao Wang, Liangbin Xie, Chao Dong, Ying Shan:
Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic Data. 1905-1914 - Qiudan Wang:
Manipulating Image Style Transformation via Latent-Space SVM. 1915-1923 - Andrés Romero, Luc Van Gool, Radu Timofte:
SMILE: Semantically-guided Multi-attribute Image and Layout Editing. 1924-1933 - Alex Andonian, Taesung Park, Bryan Russell, Phillip Isola, Jun-Yan Zhu, Richard Zhang:
Contrastive Feature Loss for Image Prediction. 1934-1943 - Fushuo Huo, Bingheng Li, Xuegui Zhu:
Efficient Wavelet Boost Learning-Based Multi-stage Progressive Refinement Network for Underwater Image Enhancement. 1944-1952 - Mengmeng Zhu, Guanqun Hou, Xinjia Chen, Jiaxing Xie, Haixian Lu, Jun Che:
Saliency-Guided Transformer Network combined with Local Embedding for No-Reference Image Quality Assessment. 1953-1962 - Victor-Andrei Ivan, Ionut Mistreanu, Andrei Leica, Sung-Jun Yoon, Manri Cheon, Junwoo Lee, Jinsoo Oh:
Improving Key Human Features for Pose Transfer. 1963-1972 - Jiajun Huang, Xueyu Wang, Bo Du, Pei Du, Chang Xu:
DeepFake MNIST+: A DeepFake Facial Animation Dataset. 1973-1982 - Kwangjin Yoon:
Simple and Efficient Unpaired Real-world Super-Resolution using Image Statistics. 1983-1990 - Ruiqi Zhao, Tianyi Wu, Guodong Guo:
Sparse to Dense Motion Transfer for Face Image Animation. 1991-2000 - Dilara Gokay, Enis Simsar, Efehan Atici, Alper Ahmetoglu, Atif Emre Yüksel, Pinar Yanardag:
Graph2Pix: A Graph-Based Image to Image Translation Framework. 2001-2010 - Arpit Pipara, Urvi Oza, Srimanta Mandal:
Underwater Image Color Correction Using Ensemble Colorization Network. 2011-2020 - Kim C. Ng, Jinglin Shen, Chiu Man Ho:
A System for Fusing Color and Near-Infrared Images in Radiance Domain. 2021-2030
Structural and Compositional Learning on 3D Data (StruCo3D)
- Pinak Paliwal, Vikas Paliwal:
3D Scene Angles using UL Decomposition of Planar Homography. 2031-2038 - Rinon Gal, Amit Bermano, Hao Zhang, Daniel Cohen-Or:
MRGAN: Multi-Rooted 3D Shape Representation Learning with Unsupervised Part Disentanglement. 2039-2048 - Siddharth Katageri, Shashidhar Veerappa Kudari, Akshaykumar Gunari, Ramesh Ashok Tabib, Uma Mudenagudi:
ABD-Net: Attention Based Decomposition Network for 3D Point Cloud Decomposition. 2049-2057
Simulation Technology for Embodied AI (SEAI)
- Jiafei Duan, Samson Yu Bai Jian, Cheston Tan:
SPACE: A Simulator for Physical Interactions and Causal Learning in 3D Environments. 2058-2063
Deep Learning for Geometric Computing (DLGC)
- Hichem Sahbi:
Learning Laplacians in Chebyshev Graph Convolutional Networks. 2064-2075 - Andrea Alfieri, Yancong Lin, Jan C. van Gemert:
Investigating transformers in the decomposition of polygonal shapes as point collections. 2076-2085 - Sharjeel Ali, Oliver van Kaick:
Evaluation of Latent Space Learning with Procedurally-Generated Datasets of Shapes. 2086-2094 - Shyam A. Tailor, René de Jong, Tiago Azevedo, Matthew Mattina, Partha Maji:
Towards Efficient Point Cloud Graph Neural Networks Through Architectural Simplification. 2095-2104 - Nam Hoang Nguyen:
U-Net based skeletonization and bag of tricks. 2105-2109 - Shun Yao, Fei Yang, Yongmei Cheng, Mikhail G. Mozerov:
3D Shapes Local Geometry Codes Learning with SDF. 2110-2117 - Soonyong Song, Heechul Bae, Junhee Park:
DISCO - U-Net based Autoencoder Architecture with Dual Input Streams for Skeleton Image Drawing. 2128-2135 - Xiaojun Tang, Rui Zheng, Yinghao Wang:
Distance and Edge Transform for Skeleton Extraction. 2136-2141 - Sabari Nathan, Priya Kansal:
SkeletonNetV2: A Dense Channel Attention Blocks for Skeleton Extraction. 2142-2149
Understanding Social Behavior in Dyadic and Small Group Interactions (DYAD)
- Neelu Madan, Arya Farkhondeh, Kamal Nasrollahi, Sergio Escalera, Thomas B. Moeslund:
Temporal Cues from Socially Unacceptable Trajectories for Anomaly Detection. 2150-2158 - Sovan Biswas, Juergen Gall:
Multiple Instance Triplet Loss for Weakly Supervised Multi-Label Action Localisation of Interacting Persons. 2159-2167 - Claudia Greco, Carmela Buono, Pau Buch-Cardona, Gennaro Cordasco, Sergio Escalera, Anna Esposito, Anaïs Fernández, Daria Kyslitska, Maria Stylianou Korsnes, Cristina Palmero, Jofre Tenorio-Laranga, Anna Torp Johansen, María Inés Torres:
Emotional Features of Interactions with Empathic Agents. 2168-2176 - David Curto, Albert Clapés, Javier Selva, Sorina Smeureanu, Júlio C. S. Jacques Júnior, David Gallardo-Pujol, Georgina Guilera, David Leiva, Thomas B. Moeslund, Sergio Escalera, Cristina Palmero:
Dyadformer: A Multi-modal Transformer for Long-Range Modeling of Dyadic Interactions. 2177-2188
Deep Multi-Task Learning in Computer Vision (DeepMTL)
- Anjan Dutta, Massimiliano Mancini, Zeynep Akata:
Concurrent Discrimination and Alignment for Self-Supervised Feature Learning. 2189-2198 - Andrea Ferreri, Silvia Bucci, Tatiana Tommasi:
Multi-Modal RGB-D Scene Recognition Across Domains. 2199-2208 - Guy Oren, Lior Wolf:
In Defense of the Learning Without Forgetting for Task Incremental Learning. 2209-2218 - Donghyun Kim, Tian Lan, Chuhang Zou, Ning Xu, Bryan A. Plummer, Stan Sclaroff, Jayan Eledath, Gérard G. Medioni:
MILA: Multi-Task Learning from Videos via Efficient Inter-Frame Attention. 2219-2229 - Hong-Yu Zhou, Chixiang Lu, Sibei Yang, Yizhou Yu:
ConvNets vs. Transformers: Whose Visual Representations are More Transferable? 2230-2238 - NareshKumar Gurulingan, Elahe Arani, Bahram Zonooz:
UniNet: A Unified Scene Understanding Network and Exploring Multi-Task Relationships through the Lens of Adversarial Attacks. 2239-2248 - Usman Sajid, Xiangyu Chen, Hasan Sajid, Taejoon Kim, Guanghui Wang:
Audio-Visual Transformer Based Crowd Counting. 2249-2259
Human Trajectory and Pose Dynamics Forecasting in the Wild (SoMoF)
- Chenxi Wang, Yunfeng Wang, Zixuan Huang, Zhiwen Chen:
Simple Baseline for Single Human Motion Forecasting. 2260-2265 - Daiheng Gao, Bang Zhang, Qi Wang, Xindi Zhang, Pan Pan, Yinghui Xu:
SCAT: Stride Consistency with Auto-regressive regressor and Transformer for hand pose estimation. 2266-2275 - Ángel Martínez-González, Michael Villamizar, Jean-Marc Odobez:
Pose Transformers (POTR): Human Motion Prediction with Non-Autoregressive Transformers. 2276-2284 - Yusheng Peng, Gaofeng Zhang, Xiangyu Li, Liping Zheng:
STIRNet: A Spatial-temporal Interaction-aware Recursive Network for Human Trajectory Prediction. 2285-2293 - Behnam Parsaeifard, Saeed Saadatnejad, Yuejiang Liu, Taylor Mordan, Alexandre Alahi:
Learning Decoupled Representations for Human Pose Forecasting. 2294-2303 - Ankur Singh, Upendra Suddamalla:
Multi-Input Fusion for Practical Pedestrian Intention Prediction. 2304-2311
Video Retrieval Methods and Their Limits (ViRaL)
- Damianos Galanopoulos, Vasileios Mezaris:
Hard-Negatives or Non-Negatives? A Hard-Negative Selection Strategy for Cross-Modal Retrieval Using the Improved Marginal Ranking Loss. 2312-2316 - Aozhu Chen, Fan Hu, Zihan Wang, Fangming Zhou, Xirong Li:
What Matters for Ad-hoc Video Search? A Large-scale Evaluation on TRECVID. 2317-2322 - Wenhao Yang, Yinan Song, Zhicheng Zhao, Fei Su:
Instance Search via Fusing Hierarchical Multi-level Retrieval and Human-object Interaction Detection. 2323-2327
Large-Scale Fine-Grained Food AnalysIs (LFFAI)
- Jeremy Klotz, Vijay Rengarajan, Aswin C. Sankaranarayanan:
Fine-Grain Prediction of Strawberry Freshness using Subsurface Scattering. 2328-2336 - Jiangpeng He, Fengqing Zhu:
Online Continual Learning For Visual Food Classification. 2337-2346
More Exploration, Less Exploitation (MELEX)
- Duhyeon Bang, Hyunjung Shim:
MGGAN: Solving Mode Collapse Using Manifold-Guided Training. 2347-2356 - Max Ehrlich, Larry Davis, Ser-Nam Lim, Abhinav Shrivastava:
Analyzing and Mitigating JPEG Compression Defects in Deep Learning. 2357-2367 - Saneem A. Chemmengath, Soumava Paul, Samarth Bharadwaj, Suranjana Samanta, Karthik Sankaranarayanan:
Addressing Target Shift in Zero-shot Learning using Grouped Adversarial Learning. 2368-2377
Remote Physiological Signal Sensing (RePSS)
- Chengyang Hu, Ke-Yue Zhang, Taiping Yao, Shouhong Ding, Jilin Li, Feiyue Huang, Lizhuang Ma:
An End-to-end Efficient Framework for Remote Physiological Signal Sensing. 2378-2384 - Xuenan Liu, Xuezhi Yang, Ziyan Meng, Ye Wang, Jie Zhang, Alexander Wong:
MANet: a Motion-Driven Attention Network for Detecting the Pulse from a Facial Video with Drastic Motions. 2385-2390 - Jingda Du, Si-Qi Liu, Bochao Zhang, Pong C. Yuen:
Weakly Supervised rPPG Estimation for Respiratory Rate Estimation. 2391-2397 - Yuhang Dong, Gongping Yang, Yilong Yin:
Time Lab's approach to the Challenge on Computer Vision for Remote Physiological Measurement. 2398-2403 - Xiaobai Li, Haomiao Sun, Zhaodong Sun, Hu Han, Antitza Dantcheva, Shiguang Shan, Guoying Zhao:
The 2nd Challenge on Remote Physiological Signal Sensing (RePSS). 2404-2413
Sketching for Human Expressivity (SHE)
- Gianluca Berardi, Samuele Salti, Luigi Di Stefano:
SketchyDepth: from Scene Sketches to RGB-D Images. 2414-2423 - Leo Sampaio Ferraz Ribeiro, Tu Bui, John P. Collomosse, Moacir Ponti:
Scene Designer: a Unified Model for Scene Search and Synthesis from Sketch. 2424-2433 - Josh Holinaty, Alec Jacobson, Fanny Chevalier:
Supporting Reference Imagery for Digital Drawing. 2434-2442 - Shaozu Yuan, Aijun Dai, Zhiling Yan, Zehua Guo, Ruixue Liu, Meng Chen:
SketchBird: Learning to Generate Bird Sketches from Text. 2443-2452
Traditional Computer Vision in the Age of Deep Learning (TradiCV)
- Boran Han, Jeremy Vila:
A Robust End-to-end Method for Parametric Curve Tracing via Soft Cosine-similarity-based Objective Function. 2453-2463 - Yiming Zhao, Xiao Zhang, Xinming Huang:
A Technical Survey and Evaluation of Traditional Point Cloud Clustering Methods for LiDAR Panoptic Segmentation. 2464-2473 - Matthew Bailey, Adrian Hilton, Jean-Yves Guillemaut:
Finite Aperture Stereo: 3D Reconstruction of Macro-Scale Scenes. 2474-2484 - Zhiqi Kang, Radu Horaud, Mostafa Sadeghi:
Robust Face Frontalization For Visual Speech Recognition*. 2485-2495 - Viktor Seib, Dietrich Paulus:
Object Detection in Cluttered Environments with Sparse Keypoint Selection. 2496-2505 - Ufuk Efe, Kutalmis Gokalp Ince, A. Aydin Alatan:
Effect of Parameter Optimization on Classical and Learning-based Image Matching Methods. 2506-2513 - Skylar Sutherland, Bernhard Egger, Josh Tenenbaum:
Building 3D Morphable Models from a Single Scan. 1-11 - Vikash Kumar, Sarthak Srivastava, Rohit Lal, Anirban Chakraborty:
CAFT: Class Aware Frequency Transform for Reducing Domain Gap. 2525-2534 - Vedant Shah, Anmol Agarwal, Tanmay Tulsidas Verlekar, Raghavendra Singh:
Adapting Deep Neural Networks for Pedestrian-Detection to Low-Light Conditions without Re-training. 2535-2541 - Taras Rumezhak, Oles Dobosevych, Rostyslav Hryniv, Vladyslav Selotkin, Volodymyr Karpiv, Mykola Maksymenko:
Towards realistic symmetry-based completion of previously unseen point clouds. 2542-2550 - Carlo Colombo, Marco Fanfani:
A closed form solution for viewing graph construction in uncalibrated vision. 2551-2558 - Jason Rebello, Chunshang Li, Steven L. Waslander:
DC-VINS: Dynamic Camera Visual Inertial Navigation System with Online Calibration. 2559-2568 - Xiao Hu, François Lauze, Kim Steenstrup Pedersen, Jean Mélou:
Absolute and Relative Pose Estimation in Refractive Multi View. 2569-2578
Computer Vision in Human-Robot Collaborative Factories of the Future (CVinHRC)
- Manolis I. A. Lourakis, Maria Pateraki:
Markerless Visual Tracking of a Container Crane Spreader. 2579-2586 - Panagiotis Mouzenidis, Antonios Louros, Dimitrios Konstantinidis, Kosmas Dimitropoulos, Petros Daras, Theofilos Mastos:
Multi-modal Variational Faster R-CNN for Improved Visual Object Detection in Manufacturing. 2587-2594 - Muhammad Zaigham Zaheer, Arif Mahmood, Muhammad Haris Khan, Marcella Astrid, Seung-Ik Lee:
An Anomaly Detection System via Moving Surveillance Robots with Human Collaboration. 2595-2601 - Dávid Rozenberszki, Gábor Sörös, Szilvia Szeier, András Lorincz:
3D Semantic Label Transfer in Human-Robot Collaboration. 2602 - N. E. Anatoliotakis, P. Koustoumpardis, Konstantinos Moustakas:
Cloth mechanical parameter estimation and simulation for optimized robotic manipulation. 2612-2620 - Lee Aing, Wen-Nung Lie, Jui-Chiu Chiang, Guo-Shiang Lin:
InstancePose: Fast 6DoF Pose Estimation for Multiple Objects from a Single RGB Image. 2621-2630
Crossmodal Social Animation (XSAnim)
- Xiaopeng Lu, Zhen Fan, Yansen Wang, Jean Oh, Carolyn P. Rosé:
Localize, Group, and Select: Boosting Text-VQA by Scene Text Modeling. 2631-2639 - Shyam Krishna, Vijay Vignesh P, Dinesh Babu J.:
SignPose: Sign Language Animation Through 3D Pose Lifting. 2640-2649
Video Scene Parsing in the Wild (VSPW)
- Abhinav Sagar, Rajkumar Soundrapandiyan:
Semantic Segmentation With Multi Scale Spatial Attention For Self Driving Cars. 2650-2656 - Hao Wang, Hasan Mohamed, Zuowen Wang, Bodo Rueckauer, Shih-Chii Liu:
LiteEdge: Lightweight Semantic Edge Detection Network. 2657-2666 - Fangrui Zhu, Yi Zhu, Li Zhang, Chongruo Wu, Yanwei Fu, Mu Li:
A Unified Efficient Pyramid Transformer for Semantic Segmentation. 2667-2677
Visual Object Tracking (VOT)
- Fei Xie, Wankou Yang, Kaihua Zhang, Bo Liu, Guangting Wang, Wangmeng Zuo:
Learning Spatio-Appearance Memory Network for High-Performance Visual Tracking. 2678-2687 - Fei Xie, Chunyu Wang, Guangting Wang, Wankou Yang, Wenjun Zeng:
Learning Tracking Representations via Dual-Branch Fully Transformer Networks. 2688-2697 - Matteo Dunnhofer, Antonino Furnari, Giovanni Maria Farinella, Christian Micheloni:
Is First Person Vision Challenging for Object Tracking? 2698-2710 - Matej Kristan, Jirí Matas, Ales Leonardis, Michael Felsberg, Roman P. Pflugfelder, Joni-Kristian Kämäräinen, Hyung Jin Chang, Martin Danelljan, Luka Cehovin Zajc, Alan Lukezic, Ondrej Drbohlav, Jani Käpylä, Gustav Häger, Song Yan, Jinyu Yang, Zhongqun Zhang, Gustavo Fernández, Mohamed H. Abdelpakey, Goutam Bhat, Llukman Cerkezi, Hakan Cevikalp, Shengyong Chen, Xin Chen, Miao Cheng, Ziyi Cheng, Yu-Chen Chiu, Ozgun Cirakman, Yutao Cui, Kenan Dai, Mohana Murali Dasari, Qili Deng, Xingping Dong, Daniel K. Du, Matteo Dunnhofer, Zhenhua Feng, Zhiyong Feng, Zhihong Fu, Shiming Ge, Rama Krishna Gorthi, Yuzhang Gu, Bilge Günsel, Qing Guo, Filiz Gurkan, Wencheng Han, Yanyan Huang, Felix Järemo Lawin, Shang-Jhih Jhang, Rongrong Ji, Cheng Jiang, Yingjie Jiang, Felix Juefei-Xu, J. Yin, Xiao Ke, Fahad Shahbaz Khan, Byeong Hak Kim, Josef Kittler, Xiangyuan Lan, Jun Ha Lee, Bastian Leibe, Hui Li, Jianhua Li, Xianxian Li, Yuezhou Li, Bo Liu, Chang Liu, Jingen Liu, Li Liu, Qingjie Liu, Huchuan Lu, Wei Lu, Jonathon Luiten, Jie Ma, Ziang Ma, Niki Martinel, Christoph Mayer, Alireza Memarmoghadam, Christian Micheloni, Yuzhen Niu, Danda Pani Paudel, Houwen Peng, Shoumeng Qiu, Aravindh Rajiv, Muhammad Rana, Andreas Robinson, Hasan Saribas, Ling Shao, Mohamed S. Shehata, Furao Shen, Jianbing Shen, Kristian Simonato, Xiaoning Song, Zhangyong Tang, Radu Timofte, Philip H. S. Torr, Chi-Yi Tsai, Bedirhan Uzun, Luc Van Gool, Paul Voigtlaender, Dong Wang, Guangting Wang, Liangliang Wang, Lijun Wang, Limin Wang, Linyuan Wang, Yong Wang, Yunhong Wang, Chenyan Wu, Gangshan Wu, Xiaojun Wu, Fei Xie, Tianyang Xu, Xiang Xu, Wanli Xue, Bin Yan, Wankou Yang, Xiaoyun Yang, Yu Ye, Jun Yin, Chengwei Zhang, Chunhui Zhang, Haitao Zhang, Kaihua Zhang, Kangkai Zhang, Xiaohan Zhang, Xiaolin Zhang, Xinyu Zhang, Zhibin Zhang, Shao-Chuan Zhao, Ming Zhen, Bineng Zhong, Jiawen Zhu, Xuefeng Zhu:
The Ninth Visual Object Tracking VOT2021 Challenge Results. 2711-2738
Vision for Vitals (V4V)
- Brian L. Hill, Xin Liu, Daniel McDuff:
Beat-to-Beat Cardiac Pulse Rate Measurement From Video. 2739-2742 - John Gideon, Simon Stent:
Estimating Heart Rate from Unlabelled Video. 2743-2749 - Yassine Ouzar, Djamaleddine Djeldjli, Frédéric Bousefsaf, Choubeila Maaoui:
LCOMS Lab's approach to the Vision For Vitals (V4V) Challenge. 2750-2754 - Benjamin Kossack, Eric L. Wisotzky, Anna Hilsmann, Peter Eisert:
Automatic region-based heart rate measurement using remote photoplethysmography. 2755-2759 - Ambareesh Revanur, Zhihua Li, Umur A. Ciftci, Lijun Yin, László A. Jeni:
The First Vision For Vitals (V4V) Challenge for Non-Contact Video-Based Physiological Estimation. 2760-2767
Vision Meets Drones: A Challenge (VisDrone)
- Leon Amadeus Varga, Andreas Zell:
Tackling the Background Bias in Sparse Object Detection via Cropped Windows. 2768-2777 - Xingkui Zhu, Shuchang Lyu, Xu Wang, Qi Zhao:
TPH-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-captured Scenarios. 2778-2788 - Chengzhen Duan, Zhiwei Wei, Chi Zhang, Siying Qu, Hongpeng Wang:
Coarse-grained Density Map Guided Object Detection in Aerial Images. 2789-2798 - Zixiao Zhang, Xiaoqiang Lu, Guojin Cao, Yuting Yang, Licheng Jiao, Fang Liu:
ViT-YOLO: Transformer-Based YOLO for Object Detection. 2799-2808 - Yunhao Du, Junfeng Wan, Yanyun Zhao, Binyu Zhang, Zhihang Tong, Junhao Dong:
GIAOTracker: A comprehensive framework for MCMOT with global information and optimizing strategies in VisDrone 2021. 2809-2819 - Junfeng Wan, Binyu Zhang, Yanyun Zhao, Yunhao Du, Zhihang Tong:
VistrongerDet: Stronger Visual Information for Object Detection in VisDrone Images. 2820-2829 - Zhihao Liu, Zhijian He, Lujia Wang, Wenguan Wang, Yixuan Yuan, Dingwen Zhang, Jinglin Zhang, Pengfei Zhu, Luc Van Gool, Junwei Han, Steven C. H. Hoi, Qinghua Hu, Ming Liu, Junwen Pan, Baoqun Yin, Binyu Zhang, Chengxin Liu, Ding Ding, Dingkang Liang, Guanchen Ding, Hao Lu, Hui Lin, Jingyuan Chen, Jiong Li, Liang Liu, Lin Zhou, Min Shi, Qianqian Yang, Qing He, Sifan Peng, Wei Xu, Wenwei Han, Xiang Bai, Xiwu Chen, Yabin Wang, Yinfeng Xia, Yiran Tao, Zhenzhong Chen, Zhiguo Cao:
VisDrone-CC2021: The Vision Meets Drone Crowd Counting Challenge Results. 2830-2838 - Guanlin Chen, Wenguan Wang, Zhijian He, Lujia Wang, Yixuan Yuan, Dingwen Zhang, Jinglin Zhang, Pengfei Zhu, Luc Van Gool, Junwei Han, Steven Chu-Hong Hoi, Qinghua Hu, Ming Liu, Andrea Sciarrone, Chao Sun, Chiara Garibotto, Duong Nguyen-Ngoc Tran, Fabio Lavagetto, Halar Haleem, Hakki Motorcu, Hasan F. Ates, Huy-Hung Nguyen, Hyung-Joon Jeon, Igor Bisio, Jae Wook Jeon, Jiahao Li, Long Hoang Pham, Moongu Jeon, Qianyu Feng, Shengwen Li, Tai Huu-Phuong Tran, Xiao Pan, Young-Min Song, Yuehan Yao, Yunhao Du, Zhenyu Xu, Zhipeng Luo:
VisDrone-MOT2021: The Vision Meets Drone Multiple Object Tracking Challenge Results. 2839-2846 - Yaru Cao, Zhijian He, Lujia Wang, Wenguan Wang, Yixuan Yuan, Dingwen Zhang, Jinglin Zhang, Pengfei Zhu, Luc Van Gool, Junwei Han, Steven C. H. Hoi, Qinghua Hu, Ming Liu, Chong Cheng, Fanfan Liu, Guojin Cao, Guozhen Li, Hongkai Wang, Jianye He, Junfeng Wan, Qi Wan, Qi Zhao, Shuchang Lyu, Wenzhe Zhao, Xiaoqiang Lu, Xingkui Zhu, Yingjie Liu, Yixuan Lv, Yujing Ma, Yuting Yang, Zhe Wang, Zhenyu Xu, Zhipeng Luo, Zhimin Zhang, Zhiguang Zhang, Zihao Li, Zixiao Zhang:
VisDrone-DET2021: The Vision Meets Drone Object detection Challenge Results. 2847-2854
Autonomous Vehicle Vision (AVVision)
- Haotian Zhang, Haorui Ji, Aotian Zheng, Jenq-Neng Hwang, Ren-Hung Hwang:
Monocular 3D Localization of Vehicles in Road Scenes. 2855-2864 - Romain Guesdon, Carlos Fernando Crispim Junior, Laure Tougne:
DriPE: A Dataset for Human Pose Estimation in Real-World Driving Settings. 2865-2874 - Thomas Roddick, Benjamin Biggs, Daniel Olmeda Reino, Roberto Cipolla:
On the Road to Large-Scale 3D Monocular Scene Reconstruction using Deep Implicit Functions. 2875-2884 - Pranjay Shyam, Kuk-Jin Yoon, Kyung-Soo Kim:
Weakly Supervised Approach for Joint Object and Lane Marking Detection. 2885-2895 - Shreya Ghosh, Abhinav Dhall, Garima Sharma, Sarthak Gupta, Nicu Sebe:
Speak2Label: Using Domain Knowledge for Creating a Large Scale Driver Gaze Zone Estimation Dataset. 2896-2905 - Valentina Musat, Ivan Fursa, Paul Newman, Fabio Cuzzolin, Andrew Bradley:
Multi-weather city: Adverse weather stacking for autonomous driving. 2906-2915 - Annika Meyer, Philipp Skudlik, Jan-Hendrik Pauls, Christoph Stiller:
YOLinO: Generic Single Shot Polyline Detection in Real Time. 2916-2925 - Anshul Paigwar, David Sierra González, Özgür Erkent, Christian Laugier:
Frustum-PointPillars: A Multi-Stage Approach for 3D Object Detection using RGB Camera and LiDAR. 2926-2933 - Alice Plebe, Julian F. P. Kooij, Gastone Pietro Rosati Papini, Mauro Da Lio:
Occupancy Grid Mapping with Cognitive Plausibility for Autonomous Driving Applications. 2934-2941 - Jordan B. Chipka, Shuqing Zeng, Thanura R. Elvitigala, Priyantha Mudalige:
A Computer Vision-Based Attention Generator using DQN. 2942-2950 - Jiongchao Jin, Arezou Fatemi, Wallace M. P. Lira, Fenggen Yu, Biao Leng, Rui Ma, Ali Mahdavi-Amiri, Hao (Richard) Zhang:
RaidaR: A Rich Annotated Image Dataset of Rainy Street Scenes. 2951-2961 - Qi Xu, Yinan Ma, Jing Wu, Chengnian Long, Xiaolin Huang:
CDAda: A Curriculum Domain Adaptation for Nighttime Semantic Segmentation. 2962-2971 - Cinjon Resnick, Or Litany, Amlan Kar, Karsten Kreis, James Lucas, Kyunghyun Cho, Sanja Fidler:
Causal BERT: Improving object detection by searching for challenging groups. 2972-2981 - Hughes Perreault, Guillaume-Alexandre Bilodeau, Nicolas Saunier, Maguelonne Héritier:
CenterPoly: real-time instance segmentation using bounding polygons. 2982-2991 - Meytal Rapoport-Lavie, Dan Raviv:
It's All Around You: Range-Guided Cylindrical Network for 3D Object Detection. 2992-3001 - Xiaofeng Ding, Chaomin Shen, Zhengping Che, Tieyong Zeng, Yaxin Peng:
SCARF: A Semantic Constrained Attention Refinement Network for Semantic Segmentation. 3002-3011 - Shivam Gautam, Gregory P. Meyer, Carlos Vallespi-Gonzalez, Brian C. Becker:
DVTracker: Real-Time Multi-Sensor Association and Tracking for Self-Driving Vehicles. 3012-3021 - Prarthana Bhattacharyya, Chengjie Huang, Krzysztof Czarnecki:
SA-Det3D: Self-Attention Based Context-Aware 3D Object Detection. 3022-3031 - Tiago Cortinhal, Fatih Kurnaz, Eren Erdal Aksoy:
Semantics-aware Multi-modal Domain Translation: From LiDAR Point Clouds to Panoramic Color Images. 3032-3041 - Divya Kothandaraman, Rohan Chandra, Dinesh Manocha:
SS-SFDA : Self-Supervised Source-Free Domain Adaptation for Road Segmentation in Hazardous Environments. 3042-3052 - Michael Meyer, Georg Kuschk, Sven Tomforde:
Graph Convolutional Networks for 3D Object Detection on Radar Data. 3053-3062 - Anuj Tambwekar, Kshitij Agrawal, Anay Majee, Anbumani Subramanian:
Few-Shot Batch Incremental Road Object Detection via Detector Fusion. 3063-3070 - Aman Kishore, Tae Eun Choe, Junghyun Kwon, Minwoo Park, Pengfei Hao, Akshita Mittel:
Synthetic Data Generation using Imitation Training. 3071-3079 - Christopher J. Holder, Muhammad Shafique:
Efficient Uncertainty Estimation in Semantic Segmentation via Distillation. 3080-3087 - Rui Fan, Nemanja Djuric, Fisher Yu, Rowan McAllister, Ioannis Pitas:
Autonomous Vehicle Vision 2021: ICCV Workshop Summary. 3088-3095 - Tina Chen, Renran Tian, Zhengming Ding:
Visual Reasoning using Graph Convolutional Networks for Predicting Pedestrian Crossing Intention. 3096-3102 - Yiqiang Chen, Feng Liu, Ke Pei:
Cross-modal Matching CNN for Autonomous Driving Sensor Data Monitoring. 3103-3112 - Zejie Wang, Zhen Zhao, Zhao Jin, Zhengping Che, Jian Tang, Chaomin Shen, Yaxin Peng:
Multi-Stage Fusion for Multi-Class 3D Lidar Detection. 3113-3121
Closing the Loop Between Vision and Language (CLVL)
- Taichi Nishimura, Kojiro Sakoda, Atsushi Hashimoto, Yoshitaka Ushiku, Natsuko Tanaka, Fumihito Ono, Hirotaka Kameko, Shinsuke Mori:
Egocentric Biochemical Video-and-Language Dataset. 3122-3126 - Xiaopeng Lu, Lynnette Hui Xian Ng, Jared Fernandez, Hao Zhu:
CIGLI: Conditional Image Generation from Language & Image. 3127-3131 - Yuanen Zhou, Yong Zhang, Zhenzhen Hu, Meng Wang:
Semi-Autoregressive Transformer for Image Captioning. 3132-3136 - Zixu Wang, Yishu Miao, Lucia Specia:
Latent Variable Models for Visual Question Answering. 3137-3141 - Alison Reboud, Raphaël Troncy:
What You Say Is Not What You Do: Studying Visio-Linguistic Models for TV Series Summarization. 3142-3146 - Yusuke Hirota, Noa Garcia, Mayu Otani, Chenhui Chu, Yuta Nakashima, Ittetsu Taniguchi, Takao Onoye:
Visual Question Answering with Textual Representations for Images. 3147-3150 - Jenhao Hsiao, Yikang Li, Chiuman Ho:
Language-guided Multi-Modal Fusion for Video Action Recognition. 3151-3155
AI for Creative Video Editing and Understanding (CVEU)
- Daniel Neimark, Omri Bar, Maya Zohar, Dotan Asselmann:
Video Transformer Network. 3156-3165 - Humam Alwassel, Silvio Giancola, Bernard Ghanem:
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks. 3166-3176 - Andrew Brown, Vicky Kalogeiton, Andrew Zisserman:
Face, Body, Voice: Video Person-Clustering with Multiple Modalities. 3177-3187 - Haofei Kuang, Yi Zhu, Zhi Zhang, Xinyu Li, Joseph Tighe, Sören Schwertfeger, Cyrill Stachniss, Mu Li:
Video Contrastive Learning with Global Context. 3188 - Bhagyashree Gaikwad, Ankita Sontakke, Manasi Patwardhan, Niranjan Pedanekar, Shirish Karande:
Plots to Previews: Towards Automatic Movie Preview Retrieval using Publicly Available Meta-data. 3198-3207 - Yuzhong Huang, Xue Bai, Oliver Wang, Fabian Caba, Aseem Agarwala:
Learning Where to Cut from Edited Videos. 3208-3216 - Mattia Soldan, Mengmeng Xu, Sally Sisi Qu, Jesper Tegnér, Bernard Ghanem:
VLG-Net: Video-Language Graph Matching Network for Video Grounding. 3217-3227
Computer Vision for Automated Medical Diagnosis (CVAMD)
- Sharif Amit Kamran, Khondker Fariha Hossain, Alireza Tavakkoli, Stewart Lee Zuckerbrod, Salah A. Baker:
VTGAN: Semi-supervised Retinal Image Synthesis and Disease Prediction using Vision Transformers. 3228-3238 - Yuan Gao, Lok Hin Lee, Richard Droste, Rachel Craik, Sridevi Beriwal, Aris T. Papageorghiou, J. Alison Noble:
A Dual Adversarial Calibration Framework for Automatic Fetal Brain Biometry. 3239-3247 - Uddeshya Upadhyay, Viswanath P. Sudarshan, Suyash P. Awate:
Uncertainty-aware GAN with Adaptive Loss for Robust MRI Image Enhancement. 3248-3257 - Ashia Lewis, Evanjelin Mahmoodi, Yuyue Zhou, Megan Coffee, Elena Sizikova:
Improving Tuberculosis (TB) Prediction using Synthetically Generated Computed Tomography (CT) Images. 3258-3266 - Reza Azad, Afshin Bozorgpour, Maryam Asadi-Aghbolaghi, Dorit Merhof, Sergio Escalera:
Deep Frequency Re-calibration U-Net for Medical Image Segmentation. 3267-3276 - Mahbaneh Eshaghzadeh Torbati, Dana L. Tudorascu, Davneet S. Minhas, Pauline Maillard, Charles DeCarli, Seong Jae Hwang:
Multi-scanner Harmonization of Paired Neuroimaging Data via Structure Preserving Embedding Learning. 3277-3286 - Gregory Holste, Savannah C. Partridge, Habib Rahbar, Debosmita Biswas, Christoph I. Lee, Adam M. Alessio:
End-to-End Learning of Fused Image and Non-Image Features for Improved Breast Cancer Classification from MRI. 3287-3296 - Zhou Zheng, Masahiro Oda, Kensaku Mori:
Graph Cuts Loss to Boost Model Accuracy and Generalizability for Medical Image Segmentation. 3297-3306 - Thanh T. Tran, Hieu H. Pham, Thang V. Nguyen, Tung T. Le, Hieu T. Nguyen, Ha Q. Nguyen:
Learning to Automatically Diagnose Multiple Diseases in Pediatric Chest Radiographs Using Deep Convolutional Neural Networks. 3307-3316 - Behzad Bozorgtabar, Guillaume Vray, Dwarikanath Mahapatra, Jean-Philippe Thiran:
SOoD: Self-Supervised Out-of-Distribution Detection Under Domain Shift for Multi-Class Colorectal Cancer Tissue Types. 3317-3326 - Masoud Monajatipoor, Mozhdeh Rouhsedaghat, Liunian Harold Li, Aichi Chien, C.-C. Jay Kuo, Fabien Scalzo, Kai-Wei Chang:
BERTHop: An Effective Vision-and-Language Model for Chest X-ray Disease Diagnosis. 3327-3336 - Dwarikanath Mahapatra, Behzad Bozorgtabar, Zongyuan Ge:
Medical Image Classification Using Generalized Zero Shot Learning. 3337-3346 - Miguel Nehmad Alche, Daniel G. Acevedo, Marta Mejail:
EfficientARL: improving skin cancer diagnoses by combining lightweight attention on EfficientNet. 3347-3353 - Rina Bao, Noor M. Al-Shakarji, Filiz Bunyak, Kannappan Palaniappan:
DMNet: Dual-Stream Marker Guided Deep Network for Dense Cell Segmentation and Lineage Tracking. 3354-3363 - Yochai Blau, Daniel Freedman, Valentin Dashinsky, Roman Goldenberg, Ehud Rivlin:
Unsupervised 3D Shape Coverage Estimation with Applications to Colonoscopy. 3364-3374 - Sherry Chao, David Belanger:
Generalizing Few-Shot Classification of Whole-Genome Doubling Across Cancer Types. 3375-3385 - Supriti Mulay, Keerthi Ram, Balamurali Murugesan, Mohanasankar Sivaprakasam:
Style Transfer based Coronary Artery Segmentation in X-ray Angiogram. 3386-3394 - Zhuotun Zhu, Yongyi Lu, Wei Shen, Elliot K. Fishman, Alan L. Yuille:
Segmentation for Classification of Screening Pancreatic Neuroendocrine Tumors. 3395-3401 - Esha Pahwa, Dwij Mehta, Sanjeet Kapadia, Devansh Jain, Achleshwar Luthra:
MedSkip: Medical Report Generation Using Skip Connections and Integrated Attention. 3402-3408 - Adrit Rao, Jongchan Park, Sanghyun Woo, Joon-Young Lee, Oliver O. Aalami:
Studying the Effects of Self-Attention for Medical Image Analysis. 3409-3418
Egocentric Perception, Interaction and Computing (EPIC)
- Deepak E. Gopinath, Guy Rosman, Simon Stent, Katsuya Terahata, Luke Fletcher, Brenna Argall, John J. Leonard:
MAAD: A Model and Dataset for "Attended Awareness" in Driving. 3419-3429 - Nada Osman, Guglielmo Camporese, Pasquale Coscia, Lamberto Ballan:
SlowFast Rolling-Unrolling LSTMs for Action Anticipation in Egocentric Videos. 3430-3438 - Yangming Wen, Krishna Kumar Singh, Markham Anderson, Wei-Pang Jan, Yong Jae Lee:
Seeing the Unseen: Predicting the First-Person Camera Wearer's Location and Pose in Third-Person Scenes. 3439-3448 - Xiaowei Chen, Guoliang Fan:
Egocentric Indoor Localization from Room Layouts and Image Outer Corners. 3449-3458 - Wolfgang Fuhl, Johannes Schneider, Enkelejda Kasneci:
1000 Pupil Segmentations in a Second using Haar Like Features and Statistical Learning. 3459-3469
Real-World Computer Vision From Inputs With Limited Quality (RLQ)
- Lichuan Xiang, Royson Lee, Mohamed S. Abdelfattah, Nicholas D. Lane, Hongkai Wen:
Temporal Kernel Consistency for Blind Video Super-Resolution. 3470-3479 - Biplob Debnath, Giuseppe Coviello, Yi Yang, Srimat Chakradhar:
UAC: An Uncertainty-Aware Face Clustering Algorithm. 3480-3488 - Xinyu Jia, Chuang Zhu, Minzhen Li, Wenqi Tang, Wenli Zhou:
LLVIP: A Visible-infrared Paired Dataset for Low-light Vision. 3489-3497 - Markus D. Solbach, John K. Tsotsos:
Blocks World Revisited: The Effect of Self-Occlusion on Classification by Convolutional Neural Networks. 3498-3507 - Taewon Kang:
Multiple GAN Inversion for Exemplar-based Image-to-Image Translation. 3508-3515 - Jun Yu, Xinlong Hao, Peng He:
Single-stage Face Detection under Extremely Low-light Conditions. 3516-3525
Affective Behavior Analysis In-the-Wild (ABAW)
- Bonaventure F. P. Dossou, Yeno K. S. Gbenou:
FSER: Deep Convolutional Neural Networks for Speech Emotion Recognition. 3526-3531 - Wei Zhang, Zunhu Guo, Keyu Chen, Lincheng Li, Zhimeng Zhang, Yu Ding, Runze Wu, Tangjie Lv, Changjie Fan:
Prior Aided Streaming Network for Multi-task Affective Analysis. 3532-3542 - Geesung Oh, Euiseok Jeong, Sejoon Lim:
Causal affect prediction model using a past facial image sequence. 3543-3549 - Didan Deng, Liang Wu, Bertram E. Shi:
Iterative Distillation for Better Uncertainty Estimates in Multitask Emotion Recognition. 3550-3559 - Su Zhang, Yi Ding, Ziquan Wei, Cuntai Guan:
Continuous Emotion Recognition with Audio-visual Leader-follower Attentive Fusion. 3560-3567 - Aboli Marathe, Rahee Walambe, Ketan Kotecha:
Evaluating the Performance of Ensemble Methods and Voting Strategies for Dense 2D Pedestrian Detection in the Wild. 3568-3577 - Darshan Gera, S. Balasubramanian:
Noisy Annotations Robust Consensual Collaborative Affect Expression Recognition. 3578-3585 - Phan Tran Dac Thinh, Hoang Manh Hung, Hyung-Jeong Yang, Soo-Hyung Kim, Guee-Sang Lee:
Emotion Recognition With Sequential Multi-task Learning Technique. 3586-3589 - Yue Jin, Tianqing Zheng, Chao Gao, Guoqiang Xu:
MTMSN: Multi-Task and Multi-Modal Sequence Network for Facial Action Unit and Expression Recognition. 3590-3595 - Lingfeng Wang, Shisen Wang, Jin Qi, Kenji Suzuki:
A Multi-task Mean Teacher for Semi-supervised Facial Affective Behavior Analysis. 3596-3601 - Yibo Huang, Hongqian Wen, Linbo Qing, Rulong Jin, Leiming Xiao:
Emotion Recognition Based on Body and Context Fusion in the Wild. 3602-3610 - Linbo Qing, Lindong Li, Shengyu Xu, Yibo Huang, Mei Liu, Rulong Jin, Bo Liu, Tong Niu, Hongqian Wen, Yuchen Wang, Xue Jiang, Yonghong Peng:
Public Life in Public Space (PLPS): A multi-task, multi-group video dataset for public life research. 3611-3620 - Kevin Delgado, Juan Manuel Origgi, Tania Hasanpoor, Hao Yu, Danielle Allessio, Ivon Arroyo, William Lee, Margrit Betke, Beverly P. Woolf, Sarah Adel Bargal:
Student Engagement Dataset. 3621-3629 - Manh-Tu Vu, Marie Beurton-Aimar, Serge Marchand:
Multitask Multi-database Emotion Recognition. 3630-3637 - Panagiotis Antoniadis, Ioannis Pikoulis, Panagiotis Paraskevas Filntisis, Petros Maragos:
An audiovisual and contextual approach for categorical and continuous emotion recognition in-the-wild. 3638-3644 - Dimitrios Kollias, Stefanos Zafeiriou:
Analysing Affective Behavior in the second ABAW2 Competition. 3645-3653
Computer Vision in the Ocean (OceanVision)
- Yuchun Pu, Zhenghui Feng, Zhonglei Wang, Zhenyu Yang, Jianping Li:
Anomaly Detection for In situ Marine Plankton Images. 3654-3664 - Joseph L. Walker, Eric C. Orenstein:
Improving Rare-Class Recognition of Marine Plankton with Hard Negative Mining. 3665-3675 - Wenqi Ma, Tao Chen, Zhengwen Zhang, Zhenyu Yang, Chao Dong, Jianping Qiao, Jianping Li:
Super-resolution for in situ Plankton Images. 3676-3685 - Qimin Chen, Oscar Beijbom, Stephen Chan, Jessica Bouwmeester, David J. Kriegman:
A New Deep Learning Engine for CoralNet. 3686-3695 - Maxime Ferrera, Aurélien Arnaubec, Klemen Istenic, Nuno Gracias, Touria Bajjouk:
Hyperspectral 3D Mapping of Underwater Environments. 3696-3705 - Petter Risholm, Peter Ørnulf Ivarsen, Karl-Henrik Haugholt, Ahmed Mohammed:
Underwater marker-based pose-estimation with associated uncertainty. 3706-3714 - Peder Georg Olofsson Zwilgmeyer, Mauhing Yip, Andreas Langeland Teigen, Rudolf Mester, Annette Stahl:
The VAROS Synthetic Underwater Data Set: Towards realistic multi-sensor underwater data with ground truth. 3715-3723 - David Nakath, Mengkun She, Yifan Song, Kevin Köser:
In-Situ Joint Light and Medium Estimation for Underwater Color Restoration. 3724-3733 - Deepak Singh, Matias Valdenegro-Toro:
The Marine Debris Dataset for Forward-Looking Sonar Semantic Segmentation. 3734-3742
Eye Tracking for AR/VR: Sensors and Applications (OpenEDS)
- Yasser Abdelaziz Dahou Djilali, Kevin McGuinness, Noel E. O'Connor:
Simple baselines can fool 360° saliency metrics. 3743-3749
Responsible Pattern Recognition and Machine Intelligence (RPRMI)
- Pawel Drozdowski, Christian Rathgeb, Christoph Busch:
The Watchlist Imbalance Effect in Biometric Face Identification: Comparing Theoretical Estimates and Empiric Measurements. 3750-3758 - Sebastian Palacio, Adriano Lucieri, Mohsin Munir, Sheraz Ahmed, Jörn Hees, Andreas Dengel:
XAI Handbook: Towards a Unified Framework for Explainable AI. 3759-3768 - Sowmen Das, Selim S. Seferbekov, Arup Datta, Md. Saiful Islam, Md. Ruhul Amin:
Towards Solving the DeepFake Problem : An Analysis on Improving DeepFake Detection using Dynamic Face Augmentation. 3769-3778 - Puspita Majumdar, Surbhi Mittal, Richa Singh, Mayank Vatsa:
Unravelling the Effect of Image Distortions for Biased Prediction of Pre-trained Face Recognition Models. 3779-3788 - Luke Guerdan, Alex Raymond, Hatice Gunes:
Toward Affective XAI: Facial Affect Analysis for Understanding Explainable Human-AI Interactions. 3789-3798 - Carlo Alberto Barbano, Enzo Tartaglione, Marco Grangetto:
Bridging the gap between debiasing and privacy for deep learning. 3799-3808
Airborne Object Tracking (AOTW)
- Mohamed Adel Musallam, Miguel Ortiz del Castillo, Kassem Al Ismaeil, Marcos Damian Perez, Djamila Aouada:
Leveraging Temporal Information for 3D Trajectory Estimation of Space Objects. 3809-3815 - Daniel Steininger, Verena Widhalm, Julia Simon, Andreas Kriegler, Christoph Sulzbacher:
The Aircraft Context Dataset: Understanding and Optimizing Data Variability in Aerial Domains. 3816-3825
Occluded Video Instance Segmentation (OVIS)
- Shane Gilroy, Martin Glavin, Edward Jones, Darragh Mullins:
Pedestrian Occlusion Level Classification using Keypoint Detection and 2D Body Surface Area Estimation. 3826-3832 - Khalid J. Almalki, Baek-Young Choi, Yu Chen, Sejun Song:
Characterizing Scattered Occlusions for Effective Dense-Mode Crowd Counting. 3833-3842 - Heechul Bae, Soonyong Song, Junhee Park:
Occluded Video Instance Segmentation with Set Prediction Approach. 3843-3846 - Zhuang Li, Leilei Cao, Hongbin Wang:
Limited Sampling Reference Frame for MaskTrack R-CNN. 3847-3850 - Ali Athar, Sabarinath Mahadevan, Aljosa Osep, Laura Leal-Taixé, Bastian Leibe:
A Single-Stage, Bottom-up Approach for Occluded VIS using Spatio-temporal Embeddings. 3851-3855 - Wenbo Li, Xuesheng Li, Qiwei Xu, Chen Li:
From VIS To OVIS: A Technical Report To Promote The Development Of The Field. 3856-3860
Analysis of Aerial Motion Imagery (WAAMI)
- Christian Lusardi, Abu Md Niamul Taufique, Andreas E. Savakis:
Robust Multi-Object Tracking Using Re-Identification Features and Graph Convolutional Networks. 3861-3870 - Lars Sommer, Wolfgang Krüger, Michael Teutsch:
Appearance and Motion Based Persistent Multiple Object Tracking in Wide Area Motion Imagery. 3871-3881 - Brendan Alvey, Derek T. Anderson, Andrew R. Buck, Matthew Deardorff, Grant J. Scott, James M. Keller:
Simulated Photorealistic Deep Learning Framework and Workflows to Accelerate Computer Vision and Unmanned Aerial Vehicle Research. 3882-3891 - Yuxiang Zhao, Khurram Shafique, Zeeshan Rasheed, Maoxu Li:
JanusNet: Detection of Moving Objects from UAV Platforms. 3892-3901 - Matthew Plaudis, Muhammad Azam, Derek Jacoby, Marc-Antoine Drouin, Yvonne Coady:
An Algorithmic Approach to Quantifying GPS Trajectory Error. 3902-3909 - Tristan Brodeur, Hadi Aliakbarpour, Steve Suddarth:
Point Cloud Object Segmentation Using Multi Elevation-Layer 2D Bounding-Boxes. 3910-3918 - Deniz Kavzak Ufuktepe, Jaired Collins, Ekincan Ufuktepe, Joshua Fraser, Timothy Krock, Kannappan Palaniappan:
Learning-Based Shadow Detection in Aerial Imagery Using Automatic Training Supervision from 3D Point Clouds. 3919-3928 - Md. Shahid, Sumohana S. Channappayya:
Aerial Cross-platform Path Planning Dataset. 3929-3938
Multi-Agent Interaction and Relational Reasoning (MAIR2)
- Hongyu Chen, Ruifang Liu, Bo Peng:
Cross-modal Relational Reasoning Network for Visual Question Answering. 3939-3948 - Divya Kothandaraman, Rohan Chandra, Dinesh Manocha:
BoMuDANet: Unsupervised Adaptation for Visual Scene Understanding in Unstructured Driving Environments. 3949-3958
Learning for Computational Imaging (LCI)
- Chengxi Li, Xiangyu Qu, Abhiram Gnanasambandam, Omar A. Elgendy, Jiaju Ma, Stanley H. Chan:
Photon-Limited Object Detection using Non-local Feature Matching and Knowledge Distillation. 3959-3970 - Elizabeth K. Cole, Frank Ong, Shreyas S. Vasanawala, John M. Pauly:
Fast Unsupervised MRI Reconstruction Without Fully-Sampled Ground Truth Data Using Generative Adversarial Networks. 3971-3980 - Gabriel Eilertsen, Saghi Hajisharif, Param Hanji, Apostolia Tsirikoglou, Rafal K. Mantiuk, Jonas Unger:
How to cheat with metrics in single-image HDR reconstruction. 3981-3990 - Kanghyun Ryu, Cagan Alkan, Chanyeol Choi, Ikbeom Jang, Shreyas Vasanawala:
K-space refinement in deep learning MR reconstruction via regularizing scan specific SPIRiT-based self consistency. 3991-4000 - Canberk Ekmekci, Müjdat Çetin:
What Does Your Computational Imaging Algorithm Not Know?: A Plug-and-Play Model Quantifying Model Uncertainty. 4001-4010 - Mingyang Xie, Jiaming Liu, Yu Sun, Weijie Gan, Brendt Wohlberg, Ulugbek S. Kamilov:
Joint Reconstruction and Calibration Using Regularization by Denoising with Application to Computed Tomography. 4011-4020 - Robiulhossain Mdrafi, Ali Cafer Gürbüz:
Compressed Classification from Learned Measurements. 4021-4030 - Weijie Gan, Yuyang Hu, Cihat Eldeniz, Jiaming Liu, Yasheng Chen, Hongyu An, Ulugbek S. Kamilov:
SS-JIRCS: Self-Supervised Joint Image Reconstruction and Coil Sensitivity Calibration in Parallel MRI without Ground Truth. 4031-4039 - Vishwanath Saragadam, Akshat Dave, Ashok Veeraraghavan, Richard G. Baraniuk:
Thermal Image Processing via Physics-Inspired Deep Networks. 4040-4048 - Youssef S. G. Nashed, Frédéric Poitevin, Harshit Gupta, Geoffrey Woollard, Michael Kagan, Chun Hong Yoon, Daniel Ratner:
CryoPoseNet: End-to-End Simultaneous Learning of Single-particle Orientation and 3D Map Reconstruction from Cryo-electron Microscopy Data. 4049-4059
Human-Centric Trustworthy Computer Vision: From Research to Applications (HTCV)
- Zhenzhu Zheng, Christopher Rasmussen, Xi Peng:
Student-Teacher Oneness: A Storage-efficient approach that improves facial expression recognition. 4060-4069 - Xudong Liu, Ruizhe Wang, Hao Peng, Minglei Yin, Chih-Fan Chen, Xin Li:
Sparse Feature Representation Learning for Deep Face Gender Transfer. 4070-4080 - Hirokatsu Kataoka, Asato Matsumoto, Ryosuke Yamada, Yutaka Satoh, Eisuke Yamagata, Nakamasa Inoue:
Formula-driven Supervised Learning with Recursive Tiling Patterns. 4081-4088 - Xiang Li, Yasushi Makihara, Chi Xu, Yasushi Yagi:
End-to-end Model-based Gait Recognition using Synchronized Multi-view Pose Constraint. 4089-4098 - Zhuming Wang, Yaowen Xu, Lifang Wu, Hu Han, Yukun Ma, Guozhang Ma:
Multi-Perspective Features Learning for Face Anti-Spoofing. 4099-4105 - Matthew Gwilliam, Srinidhi Hegde, Lade Tinubu, Alex Hanson:
Rethinking Common Assumptions to Mitigate Racial Bias in Face Recognition Datasets. 4106-4115 - Puspita Majumdar, Richa Singh, Mayank Vatsa:
Attention Aware Debiasing for Unbiased Model Prediction. 4116-4124 - Xingyang Ni, Heikki Huttunen, Esa Rahtu:
On the Importance of Encrypting Deep Features. 4125-4132 - Shenqi Lai, Zhenhua Chai, Xiaolin Wei:
Transformer Meets Part Model: Adaptive Part Division for Person Re-Identification. 4133-4140 - Sam Sattarzadeh, Mahesh Sudhakar, Konstantinos N. Plataniotis:
SVEA: A Small-scale Benchmark for Validating the Usability of Post-hoc Explainable AI Solutions in Image and Signal Recognition. 4141-4150 - Debaditya Shome, Tejaswini Kar:
FedAffect: Few-shot federated learning for facial expression recognition. 4151-4158
Topology, Algebra, and Geometry in Computer Vision (TAG-CV)
- Chuan-Shen Hu, Austin Lawson, Yu-Min Chung, Kaitlin Keegan:
Two-parameter Persistence for Images via Distance Transform. 4159-4167 - Xiaofeng Ma, Michael Kirby, Chris Peterson:
The Flag Manifold as a Tool for Analyzing and Comparing Sets of Data Sets. 4168-4177 - Stephen Y. Zhang:
A unified framework for non-negative matrix and tensor factorisations with a smoothed Wasserstein loss. 4178-4186 - Amit Efraim, Joseph M. Francos:
Dual Transformation and Manifold Distances Voting for Outlier Rejection in Point Cloud Registration. 4187-4195 - Yuval Haitman, Joseph M. Francos, Louis L. Scharf:
Grassmannian Dimensionality Reduction for Optimized Universal Manifold Embedding Representation of 3D Point Clouds. 4196-4204 - Henry Kvinge, Brett A. Jefferson, Cliff A. Joslyn, Emilie Purvine:
Sheaves as a Framework for Understanding and Interpreting Model Fit. 4205-4213 - Yuliang Cai, Sumit Mohan, Adithya Niranjan, Nilesh Jain, Alex Cloninger, Srinjoy Das:
A Manifold Learning based Video Prediction approach for Deep Motion Transfer. 4214-4221 - Mark Blumstein, Henry Kvinge:
Multi-Dimensional Scaling on Groups. 4222-4227
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.