CVPR 2023: Vancouver, BC, Canada - Workshops
- IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023 - Workshops, Vancouver, BC, Canada, June 17-24, 2023. IEEE 2023, ISBN 979-8-3503-0249-3
- Ruggero Ragonesi, Pietro Morerio, Vittorio Murino:
Learning unbiased classifiers from biased data with meta-learning. 1-9 - Teng-Yok Lee, Yusuke Nagai, Akira Minezawa:
Memory-efficient and GPU-oriented visual anomaly detection with incremental dimension reduction. 1-9 - Juewen Peng, Zhiyu Pan, Chengxin Liu, Xianrui Luo, Huiqiang Sun, Liao Shen, Ke Xian, Zhiguo Cao:
Selective Bokeh Effect Transformation. 1-9 - Bilal Porgali, Vítor Albiero, Jordan Ryda, Cristian Canton-Ferrer, Caner Hazirbas:
The Casual Conversations v2 Dataset: A diverse, large benchmark for measuring fairness and robustness in audio/vision/speech models. 10-17 - Hannah Kirkland, Sanjeev J. Koppal:
Schrödinger's Camera: First Steps Towards a Quantum-Based Privacy Preserving Camera. 18-27 - Akshay Agarwal, Nalini K. Ratha, Richa Singh, Mayank Vatsa:
Robustness Against Gradient based Attacks through Cost Effective Network Fine-Tuning. 28-37 - Linzhi Huang, Mei Wang, Jiahao Liang, Weihong Deng, Hongzhi Shi, Dongchao Wen, Yingjie Zhang, Jian Zhao:
Gradient Attention Balance Network: Mitigating Face Recognition Racial Bias via Gradient Attention. 38-47 - Aman Shrivastava, Yanjun Qi, Vicente Ordonez:
Estimating and Maximizing Mutual Information for Knowledge Distillation. 48-57 - Shreyank N. Gowda:
Synthetic Sample Selection for Generalized Zero-Shot Learning. 58-67 - Yuhao Chen, Hayden Gunraj, E. Zhixuan Zeng, Robbie Meyer, Maximilian Gilles, Alexander Wong:
MMRNet: Improving Reliability for Multimodal Object Detection and Segmentation for Bin Picking via Multimodal Redundancy. 68-77 - Yang Zheng, Oles Andrienko, Yonglei Zhao, Minwoo Park, Trung Pham:
DPPD: Deformable Polar Polygon Object Detection. 78-87 - Oliver Zendel, Johannes Huemer, Markus Murschitz, Gustavo Fernández Domínguez, Amadeus Lobe:
Joint Camera and LiDAR Risk Analysis. 88-97 - Adriano Cardace, Pierluigi Zama Ramirez, Samuele Salti, Luigi Di Stefano:
Exploiting the Complementarity of 2D and 3D Networks to Address Domain-Shift in 3D Semantic Segmentation. 98-109 - Apoorv Singh:
Training Strategies for Vision Transformers for Object Detection. 110-118 - Yunxiao Shi, Hong Cai, Amin Ansari, Fatih Porikli:
EGA-Depth: Efficient Guided Attention for Self-Supervised Multi-Camera Depth Estimation. 119-129 - Vickram Rajendran, Chuck Tang, Frits van Paasschen:
Improving Rare Classes on nuScenes LiDAR segmentation Through Targeted Domain Adaptation. 130-139 - Håkon Hukkelås, Frank Lindseth:
Does Image Anonymization Impact Computer Vision Training? 140-150 - Ce Zhang, Chengjie Zhang, Yiluan Guo, Lingji Chen, Michael Happold:
MotionTrack: End-to-End Transformer-based Multi-Object Tracking with LiDAR-Camera Fusion. 151-160 - Tae Eun Choe, Jane Wu, Xiaolin Lin, Karen Kwon, Minwoo Park:
HazardNet: Road Debris Detection by Augmentation of Synthetic Models. 161-171 - Xuanyao Chen, Tianyuan Zhang, Yue Wang, Yilun Wang, Hang Zhao:
FUTR3D: A Unified Sensor Fusion Framework for 3D Detection. 172-181 - Felix Fent, Philipp Bauerschmidt, Markus Lienkamp:
RadarGNN: Transformation Invariant Graph Neural Network for Radar-based Perception. 182-191 - Ruphan Swaminathan, Pradyot V. N. Korupolu:
MobileDeRainGAN: An Efficient Semi-Supervised Approach to Single Image Rain Removal for Task-Driven Applications. 192-201 - Haotian Tang, Shang Yang, Zhijian Liu, Ke Hong, Zhongming Yu, Xiuyu Li, Guohao Dai, Yu Wang, Song Han:
TorchSparse++: Efficient Point Cloud Engine. 202-209 - Tommaso Nesti, Santhosh Boddana, Burhaneddin Yaman:
Ultra-Sonic Sensor based Object Detection for Autonomous Vehicles. 210-218 - Andreas Bär, Daniel Kusuma, Tim Fingscheidt:
Improvements to Image Reconstruction-Based Performance Prediction for Semantic Segmentation in Highly Automated Driving. 219-229 - Sheng-Cheng Lee, Victor Lu, Chieh-Chih Wang, Wen-Chieh Lin:
LiDAR-Based Localization on Highways Using Raw Data and Pole-Like Object Features. 230-237 - Matías Molina:
Zero-shot Classification at Different Levels of Granularity. 238-244 - Octavio Arriaga, Sebastian Palacio, Matias Valdenegro-Toro:
Difficulty Estimation with Action Scores for Computer Vision Tasks. 245-253 - Juan Luis Gonzalez Bello, Jaeho Moon, Munchurl Kim:
Detail-Preserving Self-Supervised Monocular Depth with Self-Supervised Structural Sharpening. 254-264 - Emmanuel Martinez, Roman Jacome, Alejandra Hernandez-Rojas, Henry Arguello:
LD-GAN: Low-Dimensional Generative Adversarial Network for Spectral Image Generation with Variance Regularization. 265-275 - David Laines, Miguel González-Mendoza, Gilberto Ochoa-Ruiz, Gissella Bejarano:
Isolated Sign Language Recognition based on Tree Structure Skeleton Images. 276-284 - Rafael Martinez Garcia Peña, Mansoor Ali Teevno, Gilberto Ochoa-Ruiz, Sharib Ali:
SUPRA: Superpixel Guided Loss for Improved Multi-modal Segmentation in Endoscopy. 285-294 - Daniel Flores-Araiza, Francisco Javier Lopez-Tiro, Jonathan El Beze, Jacques Hubert, Miguel González-Mendoza, Gilberto Ochoa-Ruiz, Christian Daul:
Deep Prototypical-Parts Ease Morphological Kidney Stone Identification and are Competitively Robust to Photometric Perturbations. 295-304 - Yoshio Rubio, Marco A. Contreras-Cruz:
Wildlife Image Generation from Scene Graphs. 305-314 - Juan C. Pérez, Motasem Alfarra, Ali K. Thabet, Pablo Arbeláez, Bernard Ghanem:
Towards Characterizing the Semantic Robustness of Face Recognition. 315-325 - Willams de Lima Costa, Estefania Talavera Martínez, Lucas Silva Figueiredo, Veronica Teichrieb:
High-level context representation for emotion recognition in images. 326-334 - Kshitij Nikhal, Nkiruka Uzuegbunam, Bridget Kennedy, Benjamin S. Riggan:
Mitigating Catastrophic Interference using Unsupervised Multi-Part Attention for RGB-IR Face Recognition. 335-344 - Alicja Kwasniewska, Anastacia MacAllister, Rey Nicolas, Javier Garzás:
Multi-sensor Ensemble-guided Attention Network for Aerial Vehicle Perception Beyond Visible Spectrum. 345-353 - Abel A. Reyes, Sidike Paheding, A. Rajaneesh, K. S. Sajinkumar, Thomas Oommen:
C-PLES: Contextual Progressive Layer Expansion with Self-attention for Multi-class Landslide Segmentation on Mars using Multimodal Satellite Imagery. 354-364 - Wassim A. El Ahmar, Yahya Massoud, Dhanvin Kolhatkar, Hamzah Alghamdi, Mohammad Al Ja'afreh, Robert Laganière, Riad I. Hammoud:
Enhanced Thermal-RGB Fusion for Robust Object Detection. 365-374 - Rhythm Vohra, Femina Senjaliya, Melissa Cote, Amanda Dash, Alexandra Branzan Albu, Julek Chawarski, Steve Pearce, Kaan Ersahin:
Detecting Underwater Discrete Scatterers in Echograms with Deep Learning-Based Semantic Segmentation. 375-384 - Eleni Kamenou, Jesús Martínez del Rincón, Paul Miller, Patricia Devlin-Hill:
A Meta-learning Approach for Domain Generalisation across Visual Modalities in Vehicle Re-identification. 385-393 - Noreen Anwar, Philippe Duplessis-Guindon, Guillaume-Alexandre Bilodeau, Wassim Bouachir:
VisiTherS: Visible-thermal infrared stereo disparity estimation of human silhouette. 394-402 - Yue Cao, Junchi Bin, Jozsef Hamari, Erik Blasch, Zheng Liu:
Multimodal Object Detection by Channel Switching and Spatial Attention. 403-411 - Spencer Low, Oliver Nina, Angel Domingo Sappa, Erik Blasch, Nathan Inkawhich:
Multi-modal Aerial View Object Classification Challenge Results - PBVS 2023. 412-421 - Meryem Mine Gündogan, Tolga Aksoy, Alptekin Temizel, Ugur Halici:
IR Reasoner: Real-time Infrared Object Detection by Visual Reasoning. 422-430 - Jincheng Zhang, Andrew R. Willis, Kevin M. Brink:
Photometric Correction for Infrared Sensors. 431-439 - Jasmine Bayrooti, Noah D. Goodman, Alex Tamkin:
Multispectral Contrastive Learning with Viewmaker Networks. 440-448 - Berkcan Ustun, Ahmet Kagan Kaya, Ezgi Cakir Ayerden, Fazil Altinel:
Spectral Transfer Guided Active Domain Adaptation For Thermal Imagery. 449-458 - Fabian Erlenbusch, Constanze Merkt, Bernardo de Oliveira, Alexander Gatter, Friedhelm Schwenker, Ulrich Klauck, Michael Teutsch:
Thermal Infrared Single Image Dehazing and Blind Image Quality Assessment. 459-469 - Rafael E. Rivadeneira, Angel Domingo Sappa, Boris Xavier Vintimilla, Chenyang Wang, Junjun Jiang, Xianming Liu, Zhiwei Zhong, Dai Bin, Li Ruodi, Shengye Li:
Thermal Image Super-Resolution Challenge Results - PBVS 2023. 470-478 - Feng Cai, Keyu Wu, Haipeng Wang, Feng Wang:
A Three-Stage Framework with Reliable Sample Pool for Long-Tailed Classification. 479-486 - Aniruddh Sikdar, Sumanth Udupa, Prajwal Gurunath, Suresh Sundaram:
DeepMAO: Deep Multi-scale Aware Overcomplete Network for Building Segmentation in Satellite Imagery. 487-496 - Ahmed Zgaren, Wassim Bouachir, Nizar Bouguila, Riad I. Hammoud:
MoundCount: A detection-based approach for automatic counting of planting microsites on UAV images. 497-506 - Aditya Kasliwal, Pratinav Seth, Sriya Rallabandi, Sanchit Singhal:
CoReFusion: Contrastive Regularized Fusion for Guided Thermal Super-Resolution. 507-514 - Spencer Low, Oliver Nina, Angel Domingo Sappa, Erik Blasch, Nathan Inkawhich:
Multi-modal Aerial View Image Challenge: Translation from Synthetic Aperture Radar to Electro-Optical Domain Results - PBVS 2023. 515-523 - Brian K. S. Isaac-Medina, Seyma Yucer, Neelanjan Bhowmik, Toby P. Breckon:
Seeing Through the Data: A Statistical Evaluation of Prohibited Item Detection Benchmark Datasets for X-ray Security Screening. 524-533 - Raghunath Sai Puttagunta, Zhu Li, Shuvra S. Bhattacharyya, George York:
Appearance Label Balanced Triplet Loss for Multi-modal Aerial View Object Classification. 534-542 - Ainkaran Santhirasekaram, Mathias Winkler, Andrea G. Rockall, Ben Glocker:
Topology Preserving Compositionality for Robust Medical Image Segmentation. 543-552 - Yi Tang Chen, Sebastian Kurtek:
Shape and Intensity Analysis of Glioblastoma Multiforme Tumors. 553-560 - Ainkaran Santhirasekaram, Avinash Kori, Mathias Winkler, Andrea G. Rockall, Francesca Toni, Ben Glocker:
Robust Hierarchical Symbolic Explanations in Hyperbolic Space for Image Classification. 561-570 - Kalyan Varma Nadimpalli, Amit Chattopadhyay, Bastian Rieck:
Euler Characteristic Transform Based Topological Loss for Reconstructing 3D Images from Single 2D Slices. 571-579 - Andac Demir, Elie Massaad, Bulent Kiziltan:
Topology-Aware Focal Loss for 3D Image Segmentation. 580-589 - Huma Jamil, Yajing Liu, Turgay Caglar, Christina M. Cole, Nathaniel Blanchard, Christopher Peterson, Michael Kirby:
Hamming Similarity and Graph Laplacians for Class Partitioning and Adversarial Image Detection. 590-599 - Audun Myers, Henry Kvinge, Tegan Emerson:
TopFusion: Using Topological Feature Space for Fusion and Imputation in Multi-Modal Data. 600-609 - Francisco Acosta, Sophia Sanborn, Khanh Dao Duc, Manu S. Madhav, Nina Miolane:
Quantifying Extrinsic Curvature in Neural Manifolds. 610-619 - Davis Brown, Henry Kvinge:
Making Corgis Important for Honeycomb Classification: Adversarial Attacks on Concept-based Explainability Tools. 620-627 - Bohan Zeng, Xuhui Liu, Sicheng Gao, Boyu Liu, Hong Li, Jianzhuang Liu, Baochang Zhang:
Face Animation with an Attribute-Guided Diffusion Model. 628-637 - Shaobo Lin, Kun Wang, Xingyu Zeng, Rui Zhao:
Explore the Power of Synthetic Data on Few-shot Object Detection. 638-647 - Noa Alkobi, Tamar Rott Shaham, Tomer Michaeli:
Internal Diverse Image Completion. 648-658 - Hazrat Ali, Christer Grönlund, Zubair Shah:
Leveraging GANs for data scarcity of COVID-19: Beyond the hype. 659-667 - Kaiwen Cui, Rongliang Wu, Fangneng Zhan, Shijian Lu:
Face Transformer: Towards High Fidelity and Accurate Face Swapping. 668-677 - René Haas, Stella Graßhof, Sami S. Brandt:
Controllable GAN Synthesis Using Non-Rigid Structure-from-Motion. 678-687 - Edgar Schönfeld, Julio Borges, Vadim Sushko, Bernt Schiele, Anna Khoreva:
Discovering Class-Specific GAN Controls for Semantic Image Synthesis. 688-697 - Yasser Benigmim, Subhankar Roy, Slim Essid, Vicky Kalogeiton, Stéphane Lathuilière:
One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models. 698-708 - Shiyao Xu, Lingzhi Li, Li Shen, Zhouhui Lian:
DeSRF: Deformable Stylized Radiance Field. 709-718 - Heng Yu, Zoltan A. Milacski, László A. Jeni:
Unsupervised Style-based Explicit 3D Face Reconstruction from Single Image. 719-729 - Nitish Shukla, Sudipta Banerjee:
Generating Adversarial Attacks in the Latent Space. 730-739 - Kangmin Bae, Hyung-Il Kim, Yongjin Kwon, Jinyoung Moon:
Unsupervised Bidirectional Style Transfer Network using Local Feature Transform Module. 740-749 - Samy Chali, Inna Kucher, Marc Duranton, Jacques-Olivier Klein:
Improving Normalizing Flows with the Approximate Mass for Out-of-Distribution Detection. 750-758 - Tripti Shukla, Paridhi Maheshwari, Rajhans Singh, Ankita Shukla, Kuldeep Kulkarni, Pavan K. Turaga:
Scene Graph Driven Text-Prompt Generation for Image Inpainting. 759-768 - Jordan Shipard, Arnold Wiliem, Kien Nguyen Thanh, Wei Xiang, Clinton Fookes:
Diversity is Definitely Needed: Improving Model-Agnostic Zero-shot Classification via Stable Diffusion. 769-778 - Mohammadreza Mofayezi, Yasamin Medghalchi:
Benchmarking Robustness to Text-Guided Corruptions. 779-786 - Edgardo Solano-Carrillo, Ángel Bueno Rodríguez, Borja Carrillo-Perez, Yannik Steiniger, Jannis Stoppe:
Look ATME: The Discriminator Mean Entropy Needs Attention. 787-796 - Mark Hamazaspyan, Shant Navasardyan:
Diffusion-Enhanced PatchMatch: A Framework for Arbitrary Style Transfer with Diffusion Models. 797-805 - Jan Niklas Kolf, Tim Rieber, Jurek Elliesen, Fadi Boutros, Arjan Kuijper, Naser Damer:
Identity-driven Three-Player Generative Adversarial Network for Synthetic-based Face Recognition. 806-816 - Mohamed Amine Marnissi, Abir Fathallah:
GAN-based Vision Transformer for High-Quality Thermal Image Enhancement. 817-825 - Yutong Zhou, Nobutaka Shimada:
Vision + Language Applications: A Survey. 826-842 - Arpit Bansal, Hong-Min Chu, Avi Schwarzschild, Soumyadip Sengupta, Micah Goldblum, Jonas Geiping, Tom Goldstein:
Universal Guidance for Diffusion Models. 843-852 - Changhao Shi, Haomiao Ni, Kai Li, Shaobo Han, Mingfu Liang, Martin Renqiang Min:
Exploring Compositional Visual Generation with Latent Classifier Guidance. 853-862 - Matyás Bohácek, Hany Farid:
A Geometric and Photometric Exploration of GAN and Diffusion Synthesized Faces. 874-883 - Shivansh Mundra, Gonzalo J. Aniano Porcile, Smit Marvaniya, James R. Verbus, Hany Farid:
Exposing GAN-Generated Profile Photos from Compact Embeddings. 884-892 - Shan Jia, Mingzhen Huang, Zhou Zhou, Yan Ju, Jialing Cai, Siwei Lyu:
AutoSplice: A Text-prompt Manipulated Image Dataset for Media Forensics. 893-903 - Chengzhe Sun, Shan Jia, Shuwei Hou, Siwei Lyu:
AI-Synthesized Voice Detection Using Neural Vocoder Artifacts. 904-912 - Kar Balan, Shruti Agarwal, Simon Jenni, Andy Parsons, Andrew Gilbert, John P. Collomosse:
EKILA: Synthetic Media Provenance and Attribution for Generative Art. 913-922 - Hao Chen, Peng Zheng, Xin Wang, Shu Hu, Bin Zhu, Jinrong Hu, Xi Wu, Siwei Lyu:
Harnessing the Power of Text-image Contrastive Models for Automatic Detection of Online Misinformation. 923-932 - Tu Bui, Shruti Agarwal, Ning Yu, John P. Collomosse:
RoSteALS: Robust Steganography using Autoencoder Latent Space. 933-942 - Davide Cozzolino, Alessandro Pianese, Matthias Nießner, Luisa Verdoliva:
Audio-Visual Person-of-Interest DeepFake Detection. 943-952 - Jun Wang, Omran Alamayreh, Benedetta Tondi, Mauro Barni:
Open Set Classification of GAN-based Image Manipulations via a ViT-based Hybrid Architecture. 953-962 - Ziyue Xiang, Amit Kumar Singh Yadav, Paolo Bestagini, Stefano Tubaro, Edward J. Delp:
MTN: Forensic Analysis of MP4 Video Files Using Graph Neural Networks. 963-972 - Riccardo Corvi, Davide Cozzolino, Giovanni Poggi, Koki Nagano, Luisa Verdoliva:
Intriguing properties of synthetic images: from generative adversarial networks to diffusion models. 973-982 - Danial Samadi Vahdati, Tai D. Nguyen, Matthew C. Stamm:
Defending Low-Bandwidth Talking Head Videoconferencing Systems From Real-Time Puppeteering Attacks. 983-992 - Muhammad Anas Raza, Khalid Mahmood Malik:
Multimodaltrace: Deepfake Detection using Audiovisual Representation Learning. 993-1000 - Songlin Yang, Wei Wang, Chenye Xu, Ziwen He, Bo Peng, Jing Dong:
Exposing Fine-Grained Adversarial Vulnerability of Face Anti-Spoofing Models. 1001-1010 - Yufei Zhang, Rui Zhao, Ziyi Zhao, Naveen Ramakrishnan, Manoj Aggarwal, Gerard Medioni, Qiang Ji:
Robust Partial Fingerprint Recognition. 1011-1020 - Pedro C. Neto, Ana Filipa Sequeira, Jaime S. Cardoso, Philipp Terhörst:
PIC-Score: Probabilistic Interpretable Comparison Score for Optimal Matching Confidence in Single- and Multi-Biometric Face Recognition. 1021-1029 - Chi Xu, Yasushi Makihara, Xiang Li, Yasushi Yagi:
Gait Recognition from Fisheye Images. 1030-1040 - Haiyu Wu, Vítor Albiero, K. S. Krishnapriya, Michael C. King, Kevin W. Bowyer:
Face Recognition Accuracy Across Demographics: Shining a Light Into the Problem. 1041-1050 - Daniel DeAlcala, Aythami Morales, Ruben Tolosana, Alejandro Acien, Julian Fiérrez, Santiago Hernandez, Miguel A. Ferrer, Moisés Díaz:
BeCAPTCHA-Type: Biometric Keystroke Data Generation for Improved Bot Detection. 1051-1060 - Meiling Fang, Marco Huber, Naser Damer:
SynthASpoof: Developing Face Presentation Attack Detection Based on Privacy-friendly Synthetic Data. 1061-1070 - Sandipan Banerjee, Ajjen Joshi, Jay Turcot:
The Universal Face Encoder: Learning Disentangled Representations Across Different Attributes. 1071-1080 - Chih-Jung Chang, Yaw-Chern Lee, Shih-Hsuan Yao, Min-Hung Chen, Chien-Yi Wang, Shang-Hong Lai, Trista Pei-Chun Chen:
A Closer Look at Geometric Temporal Dynamics for Face Anti-Spoofing. 1081-1091 - Chongyi Li, Chunle Guo, Shangchen Zhou, Qiming Ai, Ruicheng Feng, Chen Change Loy:
FlexiCurve: Flexible Piecewise Curves Estimation for Photo Retouching. 1092-1101 - Qixin Yan, Chunle Guo, Jixin Zhao, Yuekun Dai, Chen Change Loy, Chongyi Li:
BeautyREC: Robust, Efficient, and Component-Specific Makeup Transfer. 1102-1110 - Iman Abbasnejad, Fabio Zambetta, Flora D. Salim, Timothy Wiley, Jeffrey Chan, Russell Gallagher, Ehsan Abbasnejad:
SCONE-GAN: Semantic Contrastive learning-based Generative Adversarial Network for an end-to-end image translation. 1111-1120 - Wei Jiang, Hyomin Choi, Fabien Racapé:
Adaptive Human-Centric Video Compression for Humans and Machines. 1121-1129 - Ali Hojjat, Janek Haberer, Olaf Landsiedel:
ProgDTD: Progressive Learned Image Compression with Double-Tail-Drop Training. 1130-1139 - Peter Buckel, Timo Oksanen, Thomas Dietmueller:
RB-Dust - A Reference-based Dataset for Vision-based Dust Removal. 1140-1149 - Han Yao Choong, Suryansh Kumar, Luc Van Gool:
Quantum Annealing for Single Image Super-Resolution. 1150-1159 - Yinhuai Wang, Jiwen Yu, Runyi Yu, Jian Zhang:
Unlimited-Size Diffusion Restoration. 1160-1167 - Ruohao Wang, Xiaohui Liu, Zhilu Zhang, Xiaohe Wu, Chun-Mei Feng, Lei Zhang, Wangmeng Zuo:
Benchmark Dataset and Effective Inter-Frame Alignment for Real-World Video Super-Resolution. 1168-1177 - Masud An Nur Islam Fahim, Jani Boutellier:
SS-TTA: Test-Time Adaption for Self-Supervised Denoising Methods. 1178-1187 - Aakash Rajpal, Noshaba Cheema, Klaus Illgner-Fehns, Philipp Slusallek, Sunil Prasad Jaiswal:
High-Resolution Synthetic RGB-D Datasets for Monocular Depth Estimation. 1188-1198 - Mehran Jeelani, Sadbhawna, Noshaba Cheema, Klaus Illgner-Fehns, Philipp Slusallek, Sunil Prasad Jaiswal:
Expanding Synthetic Real-World Degradations for Blind Video Super Resolution. 1199-1208 - Guisik Kim, Jinhee Park, Junseok Kwon:
Deep Dehazing Powered by Image Processing Network. 1209-1218 - Yuanzhi Zhu, Kai Zhang, Jingyun Liang, Jiezhang Cao, Bihan Wen, Radu Timofte, Luc Van Gool:
Denoising Diffusion Models for Plug-and-Play Image Restoration. 1219-1229 - Hassan Imani, Md Baharul Islam, Lai-Kuan Wong:
Saliency-aware Stereoscopic Video Retargeting. 1230-1239 - Samira Pouyanfar, Sunando Sengupta, Mahmoud Mohammadi, Ebey Abraham, Brett Bloomquist, Lukas Dauterman, Anjali Parikh, Steve Lim, Eric Sommerlade:
FRR-Net: A Real-Time Blind Face Restoration and Relighting Network. 1240-1250 - Shruti S. Phutke, Ashutosh Kulkarni, Santosh Kumar Vipparthi, Subrahmanyam Murala:
Blind Image Inpainting via Omni-dimensional Gated Attention and Wavelet Queries. 1251-1260 - Andrei Dumitriu, Florin Tatui, Florin Miron, Radu Tudor Ionescu, Radu Timofte:
Rip Current Segmentation: A Novel Benchmark and YOLOv8 Baseline Results. 1261-1271 - Sean Man, Guy Ohayon, Theo Adrai, Michael Elad:
High-Perceptual Quality JPEG Decoding via Posterior Sampling. 1272-1282 - Chengxing Xie, Xiaoming Zhang, Linze Li, Haiteng Meng, Tianlin Zhang, Tianrui Li, Xiaole Zhao:
Large Kernel Distillation Network for Efficient Single Image Super-Resolution. 1283-1292 - Xiaopeng Sun, Weiqi Li, Zhenyu Zhang, Qiufang Ma, Xuhan Sheng, Ming Cheng, Haoyu Ma, Shijie Zhao, Jian Zhang, Junlin Li, Li Zhang:
OPDN: Omnidirectional Position-aware Deformable Network for Omnidirectional Image Super-Resolution. 1293-1301 - Kai Zhao, Kun Yuan, Ming Sun, Xing Wen:
Zoom-VQA: Patches, Frames and Clips Integration for Video Quality Assessment. 1302-1310 - Shuhao Cui, Junshi Huang, Shuman Tian, Mingyuan Fan, Jiaqi Zhang, Li Zhu, Xiaoming Wei, Xiaolin Wei:
Pyramid Ensemble Structure for High Resolution Image Shadow Removal. 1311-1319 - Yingqian Wang, Longguang Wang, Zhengyu Liang, Jungang Yang, Radu Timofte, Yulan Guo, Kai Jin, Zeqiang Wei, Angulia Yang, Sha Guo, Mingzhi Gao, Xiuzhuang Zhou, Vinh Van Duong, Thuc Nguyen Huu, Jonghoon Yim, Byeungwoo Jeon, Yutong Liu, Zhen Cheng, Zeyu Xiao, Ruikang Xu, Zhiwei Xiong, Gaosheng Liu, Manchang Jin, Huanjing Yue, Jingyu Yang, Chen Gao, Shuo Zhang, Song Chang, Youfang Lin, Wentao Chao, Xuechun Wang, Guanghui Wang, Fuqing Duan, Wang Xia, Yan Wang, Peiqi Xia, Shunzhou Wang, Yao Lu, Ruixuan Cong, Hao Sheng, Da Yang, Rongshan Chen, Sizhe Wang, Zhenglong Cui, Yilei Chen, Yongjie Lu, Dongjun Cai, Ping An, Ahmed Salem, Hatem Ibrahem, Bilel Yagoub, Hyun Soo Kang, Zekai Zeng, Heng Wu:
NTIRE 2023 Challenge on Light Field Image Super-Resolution: Dataset, Methods and Results. 1320-1335 - Ahmed Salem, Hatem Ibrahem, Hyun Soo Kang:
Learning Epipolar-Spatial Relationship for Light Field Image Super-Resolution. 1336-1345 - Longguang Wang, Yulan Guo, Yingqian Wang, Juncheng Li, Shuhang Gu, Radu Timofte, Ming Cheng, Haoyu Ma, Qiufang Ma, Xiaopeng Sun, Shijie Zhao, Xuhan Sheng, Yukang Ding, Ming Sun, Xing Wen, Dafeng Zhang, Jia Li, Fan Wang, Zheng Xie, Zongyao He, Zidian Qiu, Zilin Pan, Zhihao Zhan, Xingyuan Xian, Zhi Jin, Yuanbo Zhou, Wei Deng, Ruofeng Nie, Jiajun Zhang, Qinquan Gao, Tong Tong, Kexin Zhang, Junpei Zhang, Rui Peng, Yanbiao Ma, Licheng Jiao, Haoran Bai, Lingshun Kong, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Pu Cao, Tianrui Huang, Lu Yang, Qing Song, Bingxin Chen, Chunhua He, Meiyun Chen, Zijie Guo, Shaojuan Luo, Chengzhi Cao, Kunyu Wang, Fanrui Zhang, Qiang Zhang, Nancy Mehta, Subrahmanyam Murala, Akshay Dudhane, Yujin Wang, Lingen Li, Garas Gendy, Nabil Sabor, Jingchao Hou, Guanghui He, Junyang Chen, Hao Li, Yukai Shi, Zhijing Yang, Wenbin Zou, Yunchen Zhang, Mingchao Jiang, Zhongxin Yu, Ming Tan, Hongxia Gao, Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön, Jingxiang Chen, Bo Yang, XiSheryl Zhang, Chenghua Li, Weijun Yuan, Zhan Li, Ruting Deng, Jintao Zeng, Pulkit Mahajan, Sahaj Mistry, Shreyas Chatterjee, Vinit Jakhetiya, Badri N. Subudhi, Sunil Prasad Jaiswal, Zhao Zhang, Huan Zheng, Suiyi Zhao, Yangcheng Gao, Yanyan Wei, Bo Wang, Gen Li, Aijin Li, Lei Sun, Ke Chen, Congling Tang, Yunzhe Li, Jun Chen, Yuan-Chun Chiang, Yi-Chung Chen, Zhi-Kai Huang, Hao-Hsiang Yang, I-Hsiang Chen, Sy-Yen Kuo, Yiheng Wang, Gang Zhu, Xingyi Yang, Songhua Liu, Yongcheng Jing, Xingyu Hu, Jianwen Song, Changming Sun, Arcot Sowmya, Seung Ho Park, Xiaoyan Lei, Jingchao Wang, Chenbo Zhai, Yufei Zhang, Weifeng Cao, Wenlong Zhang:
NTIRE 2023 Challenge on Stereo Image Super-Resolution: Methods and Results. 1346-1372 - Kai Jin, Angulia Yang, Zeqiang Wei, Sha Guo, Mingzhi Gao, Xiuzhuang Zhou:
DistgEPIT: Enhanced Disparity Learning for Light Field Image Super-Resolution. 1373-1383 - Pierluigi Zama Ramirez, Fabio Tosi, Luigi Di Stefano, Radu Timofte, Alex Costanzino, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Jun Shi, Dafeng Zhang, Yong A, Yixiang Jin, Dingzhe Li, Chao Li, Zhiwen Liu, Qi Zhang, Yixing Wang, Shi Yin:
NTIRE 2023 Challenge on HR Depth from Images of Specular and Transparent Surfaces. 1384-1395 - Wenbin Zou, Hongxia Gao, Liang Chen, Yunchen Zhang, Mingchao Jiang, Zhongxin Yu, Ming Tan:
Cross-View Hierarchy Network for Stereo Image Super-Resolution. 1396-1405 - Yangyi Liu, Huan Liu, Liangyan Li, Zijun Wu, Jun Chen:
A Data-Centric Solution to NonHomogeneous Dehazing via Vision Transformer. 1406-1415 - Yuanbo Zhou, Yuyang Xue, Wei Deng, Ruofeng Nie, Jiajun Zhang, Jiaqi Pu, Qinquan Gao, Junlin Lan, Tong Tong:
Stereo Cross Global Learnable Attention Module for Stereo Image Super-Resolution. 1416-1425 - Zidian Qiu, Zongyao He, Zhihao Zhan, Zilin Pan, Xingyuan Xian, Zhi Jin:
SC-NAFSSR: Perceptual-Oriented Stereo Image Super-Resolution Using Stereo Consistency Guided NAFSSR. 1426-1435 - Hua-En Chang, Chia-Hsuan Hsieh, Hao-Hsiang Yang, I-Hsiang Chen, Yi-Chung Chen, Yu-Chiang Frank Wang, Zhi-Kai Huang, Wei-Ting Chen, Sy-Yen Kuo:
TSRFormer: Transformer Based Two-stage Refinement for Single Image Shadow Removal. 1436-1446 - Hao-Hsiang Yang, I-Hsiang Chen, Chia-Hsuan Hsieh, Hua-En Chang, Yuan-Chun Chiang, Yi-Chung Chen, Zhi-Kai Huang, Wei-Ting Chen, Sy-Yen Kuo:
Semantic Guidance Learning for High-Resolution Non-homogeneous Dehazing. 1447-1455 - Simone Zini, Claudio Rota, Marco Buzzelli, Simone Bianco, Raimondo Schettini:
Back to the future: a night photography rendering ISP without deep learning. 1465-1473 - Yixuan Gao, Yuqin Cao, Tengchuan Kou, Wei Sun, Yunlong Dong, Xiaohong Liu, Xiongkuo Min, Guangtao Zhai:
VDPVE: VQA Dataset for Perceptual Video Enhancement. 1474-1483 - Garas Gendy, Nabil Sabor, Jingchao Hou, Guanghui He:
A Simple Transformer-style Network for Lightweight Image Super-resolution. 1484-1494 - Marcos V. Conde, Eduard Zamfir, Radu Timofte, Daniel Motilla, Cen Liu, Zexin Zhang, Yunbo Peng, Yue Lin, Jiaming Guo, Xueyi Zou, Yuyi Chen, Yi Liu, Jia Hao, Youliang Yan, Yuanfan Zhang, Gen Li, Lei Sun, Lingshun Kong, Haoran Bai, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Mustafa Ayazoglu, Bahri Batuhan Bilecen, Mingxi Li, Yuhang Zhang, Xianjun Fan, Yankai Sheng, Long Sun, Zibin Liu, Weiran Gou, Shaoqing Li, Ziyao Yi, Yan Xiang, Dehui Kong, Ke Xu, Ganzorig Gankhuyag, Kihwan Yoon, Jin Zhang, Gaocheng Yu, Feng Zhang, Hongbin Wang, Zhou Zhou, Jiahao Chao, Hongfan Gao, Jiali Gong, Zhengfeng Yang, Zhenbing Zeng, Chengpeng Chen, Zichao Guo, Anjin Park, Yuqing Liu, Qi Jia, Hongyuan Yu, Xuanwu Yin, Dongyang Zhang, Ting Fu, Zhengxue Cheng, Shiai Zhu, Dajiang Zhou, Weichen Yu, Lin Ge, Jiahua Dong, Yajun Zou, Zhuoyuan Wu, Binnan Han, Xiaolin Zhang, Heng Zhang, Ben Shao, Shaolong Zheng, Daheng Yin, Baijun Chen, Mengyang Liu, Marian-Sergiu Nistor, Yi-Chung Chen, Zhi-Kai Huang, Yuan-Chun Chiang, Wei-Ting Chen, Hao-Hsiang Yang, Hua-En Chang, I-Hsiang Chen, Chia-Hsuan Hsieh, Sy-Yen Kuo, Tu Vo, Qingsen Yan, Yun Zhu, Jinqiu Su, Yanning Zhang, Cheng Zhang, Jiaying Luo, Youngsun Cho, Nakyung Lee, Kunlong Zuo:
Efficient Deep Models for Real-Time 4K Image Super-Resolution. NTIRE 2023 Benchmark and Report. 1495-1521 - Eduard Zamfir, Marcos V. Conde, Radu Timofte:
Towards Real-Time 4K Image Super-Resolution. 1522-1532 - Mirko Agarla, Luigi Celona, Claudio Rota, Raimondo Schettini:
Quality assessment of enhanced videos guided by aesthetics and technical quality attributes. 1533-1541 - Zhihao Yang, Wenyi Lian, Siyuan Lai:
BokehOrNot: Transforming Bokeh Effect with Image Transformer and Lens Metadata Embedding. 1542-1550 - Xiaohong Liu, Xiongkuo Min, Wei Sun, Yulun Zhang, Kai Zhang, Radu Timofte, Guangtao Zhai, Yixuan Gao, Yuqin Cao, Tengchuan Kou, Yunlong Dong, Ziheng Jia, Yilin Li, Kai Zhao, Heng Cong, Hang Shi, Zhiliang Ma, Mirko Agarla, Zhiwei Huang, Hongye Liu, Ironhead Chuang, Haotian Fan, Shiqi Zhou, Yu Lai, Wenqi Wang, Haoning Wu, Chunzheng Zhu, Shiling Zhao, Hanene Brachemi Meftah, Tengfei Shi, Azadeh Mansouri:
NTIRE 2023 Quality Assessment of Video Enhancement Challenge. 1551-1569 - Xiaoyang Kang, Xianhui Lin, Kai Zhang, Zheng Hui, Wangmeng Xiang, Jun-Yan He, Xiaoming Li, Peiran Ren, Xuansong Xie, Radu Timofte, Yixin Yang, Jinshan Pan, Zhong Zheng, Peng Qiyan, Jiangxin Zhang, Jinhui Dong, Jinjing Tan, Chi-Chen Lin, Lin Qipei Li, Qirong Liang, Ruipeng Gang, Xiaofeng Liu, Shuang Feng, Shuai Liu, Hao Wang, Chaoyu Feng, Furui Bai, Yuqian Zhang, Guangqi Shao, Xiaotao Wang, Lei Lei, Siqi Chen, Yu Zhang, Hanning Xu, Zheyuan Liu, Zhao Zhang, Yan Luo, Zhichao Zuo:
NTIRE 2023 Video Colorization Challenge. 1570-1581 - Jiaming Guo, Xueyi Zou, Yuyi Chen, Yi Liu, Jia Hao, Jianzhuang Liu, Youliang Yan:
AsConvSR: Fast and Lightweight Super-Resolution Network with Assembled Convolutions. 1582-1592 - Garas Gendy, Nabil Sabor, Jingchao Hou, Guanghui He:
Mixer-based Local Residual Network for Lightweight Image Super-resolution. 1593-1602 - Xiangyu Kong, Fan Wang, Dafeng Zhang, Jinlong Wu, Zikun Liu:
NAFBET: Bokeh Effect Transformation with Parameter Analysis Block based on NAFNet. 1603-1612 - Ding-Jiun Huang, Yu-Ting Kao, Tieh-Hung Chuang, Ya-Chun Tsai, Jing-Kai Lou, Shuen-Huei Guan:
SB-VQA: A Stack-Based Video Quality Assessment Framework for Video Enhancement. 1613-1622 - Bahri Batuhan Bilecen, Mustafa Ayazoglu:
Bicubic++: Slim, Slimmer, Slimmest Designing an Industry-Grade Super-Resolution Network. 1623-1632 - Tim Seizinger, Marcos V. Conde, Manuel Kolmet, Tom E. Bishop, Radu Timofte:
Efficient Multi-Lens Bokeh Effect Rendering and Transformation. 1633-1642 - Marcos V. Conde, Manuel Kolmet, Tim Seizinger, Tom E. Bishop, Radu Timofte, Xiangyu Kong, Dafeng Zhang, Jinlong Wu, Fan Wang, Juewen Peng, Zhiyu Pan, Chengxin Liu, Xianrui Luo, Huiqiang Sun, Liao Shen, Zhiguo Cao, Ke Xian, Chaowei Liu, Zigeng Chen, Xingyi Yang, Songhua Liu, Yongcheng Jing, Michael Bi Mi, Xinchao Wang, Zhihao Yang, Wenyi Lian, Siyuan Lai, Haichuan Zhang, Trung Hoang, Amirsaeed Yazdani, Vishal Monga, Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön, Yuxuan Zhao, Baoliang Chen, Yiqing Xu, JiXiangNiu:
Lens-to-Lens Bokeh Effect Transformation. NTIRE 2023 Challenge Report. 1643-1659 - Yanyu Mao, Nihao Zhang, Qian Wang, Bendu Bai, Wanying Bai, Haonan Fang, Peng Liu, Mingyue Li, Shengbo Yan:
Multi-level Dispersion Residual Network for Efficient Image Super-Resolution. 1660-1669 - Trung Hoang, Haichuan Zhang, Amirsaeed Yazdani, Vishal Monga:
TransER: Hybrid Model and Ensemble-based Sequential Learning for Non-homogenous Dehazing. 1670-1679 - Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön:
Refusion: Enabling Large-Size Realistic Image Restoration with Latent-Space Diffusion Models. 1680-1691 - Lei Yu, Xinpeng Li, Youwei Li, Ting Jiang, Qi Wu, Haoqiang Fan, Shuaicheng Liu:
DIPNet: Efficiency Distillation and Iterative Pruning for Image Super-Resolution. 1692-1701 - Ming Cheng, Haoyu Ma, Qiufang Ma, Xiaopeng Sun, Weiqi Li, Zhenyu Zhang, Xuhan Sheng, Shijie Zhao, Junlin Li, Li Zhang:
Hybrid Transformer and CNN Attention Network for Stereo Image Super-resolution. 1702-1711 - Weijian Deng, Hongjie Yuan, Lunhui Deng, Zengtong Lu:
Reparameterized Residual Feature Network For Lightweight Image Super-Resolution. 1712-1721 - Jinjing Li, Qirong Liang, Qipei Li, Ruipeng Gang, Ji Fang, Chi-Chen Lin, Shuang Feng, Xiaofeng Liu:
RTTLC: Video Colorization with Restored Transformer and Test-time Local Converter. 1722-1730 - Mingdeng Cao, Chong Mou, Fanghua Yu, Xintao Wang, Yinqiang Zheng, Jian Zhang, Chao Dong, Gen Li, Ying Shan, Radu Timofte, Xiaopeng Sun, Weiqi Li, Zhenyu Zhang, Xuhan Sheng, Bin Chen, Haoyu Ma, Ming Cheng, Shijie Zhao, Wanwan Cui, Tianyu Xu, Chunyang Li, Long Bao, Heng Sun, Huaibo Huang, Xiaoqiang Zhou, Yuang Ai, Ran He, Renlong Wu, Yi Yang, Zhilu Zhang, Shuohao Zhang, Junyi Li, Yunjin Chen, Dongwei Ren, Wangmeng Zuo, Qian Wang, Hao-Hsiang Yang, Yi-Chung Chen, Zhi-Kai Huang, Wei-Ting Chen, Yuan-Chun Chiang, Hua-En Chang, I-Hsiang Chen, Chia-Hsuan Hsieh, Sy-Yen Kuo, Zebin Zhang, Jiaqi Zhang, Yuhui Wang, Shuhao Cui, Junshi Huang, Li Zhu, Shuman Tian, Wei Yu, Bingchun Luo:
NTIRE 2023 Challenge on 360° Omnidirectional Image and Video Super-Resolution: Datasets, Methods and Results. 1731-1745 - Ganzorig Gankhuyag, Kihwan Yoon, Jinman Park, Haeng Seon Son, Kyoungwon Min:
Lightweight Real-Time Image Super-Resolution Network for 4K Images. 1746-1755 - Qiang Zhu, Pengfei Li, Qianhui Li:
Attention Retractable Frequency Fusion Transformer for Image Super Resolution. 1756-1763 - Ke Chen, Liangyan Li, Huan Liu, Yunzhe Li, Congling Tang, Jun Chen:
SwinFSR: Stereo Image Super-Resolution using SwinIR and Frequency Domain Knowledge. 1764-1774 - Yawei Li, Kai Zhang, Jingyun Liang, Jiezhang Cao, Ce Liu, Rui Gong, Yulun Zhang, Hao Tang, Yun Liu, Denis Demandolx, Rakesh Ranjan, Radu Timofte, Luc Van Gool:
LSDIR: A Large Scale Dataset for Image Restoration. 1775-1787 - Florin-Alexandru Vasluianu, Tim Seizinger, Radu Timofte, Shuhao Cui, Junshi Huang, Shuman Tian, Mingyuan Fan, Jiaqi Zhang, Li Zhu, Xiaoming Wei, Xiaolin Wei, Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön, Xiaoyi Dong, Xi Sheryl Zhang, Chenghua Li, Cong Leng, Woon-Ha Yeo, Wang-Taek Oh, Yeoreum Lee, Han-Cheol Ryu, Jinting Luo, Chengzhi Jiang, Mingyan Han, Qi Wu, Wenjie Lin, Lei Yu, Xinpeng Li, Ting Jiang, Haoqiang Fan, Shuaicheng Liu, Shuning Xu, Binbin Song, Xiangyu Chen, Shile Zhang, Jiantao Zhou, Zhao Zhang, Suiyi Zhao, Huan Zheng, Yangcheng Gao, Yanyan Wei, Bo Wang, Jiahuan Ren, Yan Luo, Yuki Kondo, Riku Miyata, Fuma Yasue, Taito Naruki, Norimichi Ukita, Hua-En Chang, Hao-Hsiang Yang, Yi-Chung Chen, Yuan-Chun Chiang, Zhi-Kai Huang, Wei-Ting Chen, I-Hsiang Chen, Chia-Hsuan Hsieh, Sy-Yen Kuo, Xianwei Li, Huiyuan Fu, Chunlin Liu, Huadong Ma, Binglan Fu, Huiming He, Mengjia Wang, Wenxuan She, Yu Liu, Sabari Nathan, Priya Kansal, Zhongjian Zhang, Huabin Yang, Yan Wang, Yanru Zhang, Shruti S. Phutke, Ashutosh Kulkarni, Md Raqib Khan, Subrahmanyam Murala, Santosh Kumar Vipparthi, Heng Ye, Zixi Liu, Xingyi Yang, Songhua Liu, Yinwei Wu, Yongcheng Jing, Qianhao Yu, Naishan Zheng, Jie Huang, Yuhang Long, Mingde Yao, Feng Zhao, Bowen Zhao, Nan Ye, Ning Shen, Yanpeng Cao, Tong Xiong, Weiran Xia, Dingwen Li, Shuchen Xia:
NTIRE 2023 Image Shadow Removal Challenge Report. 1788-1807 - Codruta O. Ancuti, Cosmin Ancuti, Florin-Alexandru Vasluianu, Radu Timofte, Han Zhou, Wei Dong, Yangyi Liu, Jun Chen, Huan Liu, Liangyan Li, Zijun Wu, Yubo Dong, Yuyan Li, Tian Qiu, Yu He, Yonghong Lu, Yinwei Wu, Zhenxiang Jiang, Songhua Liu, Xingyi Yang, Yongcheng Jing, Bilel Benjdira, Anas M. Ali, Anis Koubaa, Hao-Hsiang Yang, I-Hsiang Chen, Wei-Ting Chen, Zhi-Kai Huang, Yi-Chung Chen, Chia-Hsuan Hsieh, Hua-En Chang, Yuan-Chun Chiang, Sy-Yen Kuo, Yu Guo, Yuan Gao, Ryan Wen Liu, Yuxu Lu, Jingxiang Qu, Shengfeng He, Wenqi Ren, Trung Hoang, Haichuan Zhang, Amirsaeed Yazdani, Vishal Monga, Lehan Yang, Alex Jiahao Wu, Tiancheng Mai, Xiaofeng Cong, Xuemeng Yin, Xuefei Yin, Hazim Emad, Ahmed Abdallah, Yahya Yasser, Dalia Elshahat, Esraa Elbaz, Zhan Li, Wenqing Kuang, Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön, Zhao Zhang, Yanyan Wei, Junhu Wang, Suiyi Zhao, Huan Zheng, Jin Guo, Yangfan Sun, Tianli Liu, Dejun Hao, Kui Jiang, Anjali Sarvaiya, Kalpesh Prajapati, Ratnadeep Patra, Pragnesh Barik, Chaitanya Rathod, Kishor P. Upla, Kiran B. Raja, Raghavendra Ramachandra, Christoph Busch:
NTIRE 2023 HR NonHomogeneous Dehazing Challenge Report. 1808-1825 - Florin-Alexandru Vasluianu, Tim Seizinger, Radu Timofte:
WSRD: A Novel Benchmark for High Resolution Image Shadow Removal. 1826-1835 - Yu Zhang, Siqi Chen, Mingdao Wang, Xianlin Zhang, Chuang Zhu, Yue Zhang, Xueming Li:
Temporal Consistent Automatic Video Colorization via Semantic Correspondence. 1836-1845 - Wei Wu, Shuming Hu, Pengxiang Xiao, Sibin Deng, Yilin Li, Ying Chen, Kai Li:
Video Quality Assessment Based on Swin Transformer with Spatio-Temporal Feature Fusion and Data Augmentation. 1846-1854 - Bilel Benjdira, Anas M. Ali, Anis Koubaa:
Streamlined Global and Local Features Combinator (SGLC) for High Resolution Image Dehazing. 1855-1864 - Yulun Zhang, Kai Zhang, Zheng Chen, Yawei Li, Radu Timofte, Junpei Zhang, Kexin Zhang, Rui Peng, Yanbiao Ma, Licheng Jia, Huaibo Huang, Xiaoqiang Zhou, Yuang Ai, Ran He, Yajun Qiu, Qiang Zhu, Pengfei Li, Qianhui Li, Shuyuan Zhu, Dafeng Zhang, Jia Li, Fan Wang, Chunmiao Li, TaeHyung Kim, Jungkeong Kil, Eon Kim, Yeonseung Yu, Beomyeol Lee, Subin Lee, Seokjae Lim, Somi Chae, Heungjun Choi, Zhi-Kai Huang, YiChung Chen, Yuan-Chun Chiang, Hao-Hsiang Yang, Wei-Ting Chen, Hua-En Chang, I-Hsiang Chen, Chia-Hsuan Hsieh, Sy-Yen Kuo, Ui-Jin Choi, Marcos V. Conde, Sunder Ali Khowaja, Jiseok Yoon, Ik Hyun Lee, Garas Gendy, Nabil Sabor, Jingchao Hou, Guanghui He, Zhao Zhang, Baiang Li, Huan Zheng, Suiyi Zhao, Yangcheng Gao, Yanyan Wei, Jiahuan Ren, Jiayu Wei, Yanfeng Li, Jia Sun, Zhanyi Cheng, Zhiyuan Li, Xu Yao, Xinyi Wang, Danxu Li, Xuan Cui, Jun Cao, Cheng Li, Jianbin Zheng, Anjali Sarvaiya, Kalpesh Prajapati, Ratnadeep Patra, Pragnesh Barik, Chaitanya Rathod, Kishor P. Upla, Kiran B. Raja, Raghavendra Ramachandra, Christoph Busch:
NTIRE 2023 Challenge on Image Super-Resolution (×4): Methods and Results. 1865-1884 - Yu Guo, Yuan Gao, Ryan Wen Liu, Yuxu Lu, Jingxiang Qu, Shengfeng He, Wenqi Ren:
SCANet: Self-Paced Semi-Curricular Attention Network for Non-Homogeneous Image Dehazing. 1885-1894 - Han Zhou, Wei Dong, Yangyi Liu, Jun Chen:
Breaking Through the Haze: An Advanced Non-Homogeneous Dehazing Method based on Fast Fourier Convolution and ConvNeXt. 1895-1904 - Yawei Li, Yulun Zhang, Radu Timofte, Luc Van Gool, Zhijun Tu, Kunpeng Du, Hailing Wang, Hanting Chen, Wei Li, Xiaofei Wang, Jie Hu, Yunhe Wang, Xiangyu Kong, Jinlong Wu, Dafeng Zhang, Jianxing Zhang, Shuai Liu, Furui Bai, Chaoyu Feng, Hao Wang, Yuqian Zhang, Guangqi Shao, Xiaotao Wang, Lei Lei, Rongjian Xu, Zhilu Zhang, Yunjin Chen, Dongwei Ren, Wangmeng Zuo, Qi Wu, Mingyan Han, Shen Cheng, Haipeng Li, Ting Jiang, Chengzhi Jiang, Xinpeng Li, Jinting Luo, Wenjie Lin, Lei Yu, Haoqiang Fan, Shuaicheng Liu, Aditya Arora, Syed Waqas Zamir, Javier Vazquez-Corral, Konstantinos G. Derpanis, Michael S. Brown, Hao Li, Zhihao Zhao, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Bo Yang, Jingxiang Chen, Chenghua Li, Xi Zhang, Zhao Zhang, Jiahuan Ren, Zhicheng Ji, Kang Miao, Suiyi Zhao, Huan Zheng, Yanyan Wei, Kangliang Liu, Xiangcheng Du, Sijie Liu, Yingbin Zheng, Xingjiao Wu, Cheng Jin, Rajeev Irny, Sriharsha Koundinya, Vighnesh Kamath, Gaurav Khandelwal, Sunder Ali Khowaja, Jiseok Yoon, Ik Hyun Lee, Shijie Chen, Chengqiang Zhao, Huabin Yang, Zhongjian Zhang, Junjia Huang, Yanru Zhang:
NTIRE 2023 Challenge on Image Denoising: Methods and Results. 1905-1921 - Yawei Li, Yulun Zhang, Radu Timofte, Luc Van Gool, Lei Yu, Youwei Li, Xinpeng Li, Ting Jiang, Qi Wu, Mingyan Han, Wenjie Lin, Chengzhi Jiang, Jinting Luo, Haoqiang Fan, Shuaicheng Liu, Yucong Wang, Minjie Cai, Mingxi Li, Yuhang Zhang, Xianjun Fan, Yankai Sheng, Yanyu Mao, Nihao Zhang, Qian Wang, Mingjun Zheng, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Zhongbao Yang, Yan Wang, Erlin Pan, Qixuan Cai, Xinan Dai, Magauiya Zhussip, Nikolay Kalyazin, Dmitry Vyal, Xueyi Zou, Youliang Yan, Heaseo Chung, Jin Zhang, Gaocheng Yu, Feng Zhang, Hongbin Wang, Bohao Liao, Zhibo Du, Yu-Liang Wu, Gege Shi, Long Peng, Yang Wang, Yang Cao, Zhengjun Zha, Zhi-Kai Huang, Yi-Chung Chen, Yuan-Chun Chiang, Hao-Hsiang Yang, Wei-Ting Chen, Hua-En Chang, I-Hsiang Chen, Chia-Hsuan Hsieh, Sy-Yen Kuo, Xin Liu, Jiahao Pan, Hongyuan Yu, Weichen Yu, Lin Ge, Jiahua Dong, Yajun Zou, Zhuoyuan Wu, Binnan Han, Xiaolin Zhang, Heng Zhang, Xuanwu Yin, Kunlong Zuo, Weijian Deng, Hongjie Yuan, Zengtong Lu, Mingyu Ouyang, Wenzhuo Ma, Nian Liu, Hanyou Zheng, Yuantong Zhang, Junxi Zhang, Zhenzhong Chen, Garas Gendy, Nabil Sabor, Jingchao Hou, Guanghui He, Yurui Zhu, Xi Wang, Xueyang Fu, Zheng-Jun Zha, Daheng Yin, Mengyang Liu, Baijun Chen, Ao Li, Lei Luo, Kangjun Jin, Ce Zhu, Xiaoming Zhang, Chengxing Xie, Linze Li, Haiteng Meng, Tianlin Zhang, Tianrui Li, Xiaole Zhao, Zhao Zhang, Baiang Li, Huan Zheng, Suiyi Zhao, Yangcheng Gao, Jiahuan Ren, Kang Hu, Jingpeng Shi, Zhijian Wu, Dingjiang Huang, Jinchen Zhu, Hui Li, Qianru Xv, Tianle Liu, Gang Wu, Junpeng Jiang, Xianming Liu, Junjun Jiang, Mingjian Zhang, Shizhuang Weng, Jing Hu, Chengxu Wu, Qinrui Fan, Chengming Feng, Ziwei Luo, Shu Hu, Siwei Lyu, Xi Wu, Xin Wang:
NTIRE 2023 Challenge on Efficient Super-Resolution: Methods and Results. 1922-1960 - Chen Gao, Youfang Lin, Song Chang, Shuo Zhang:
Spatial-Angular Multi-Scale Mechanism for Light Field Spatial Super-Resolution. 1961-1970 - Yucong Wang, Minjie Cai:
A Single Residual Network with ESA Modules and Distillation. 1971-1981 - Alina Shutova, Egor I. Ershov, Georgy Perevozchikov, Ivan Ermakov, Nikola Banic, Radu Timofte, Richard Collins, Maria Efimova, Arseniy P. Terekhin, Simone Zini, Claudio Rota, Marco Buzzelli, Simone Bianco, Raimondo Schettini, Chunxia Lei, Tingniao Wang, Song Wang, Shuai Liu, Chaoyu Feng, Guangqi Shao, Hao Wang, Xiaotao Wang, Lei Lei, Lu Xu, Chao Zhang, Yasi Wang, Jin Guo, Yangfan Sun, Tianli Liu, Hao Dejun, Furkan Kinli, Baris Özcan, Furkan Kiraç, Hyerin Chung, Nakyung Lee, Sungkeun Kwak, Marcos V. Conde, Tim Seizinger, Florin-Alexandru Vasluianu, Omar Elezabi, Chia-Hsuan Hsieh, Wei-Ting Chen, Hao-Hsiang Yang, Zhi-Kai Huang, Hua-En Chang, I-Hsiang Chen, Yi-Chung Chen, Yuan-Chun Chiang:
NTIRE 2023 Challenge on Night Photography Rendering. 1982-1993 - Aashish Bhandari, Siddhant Bikram Shah, Surendrabikram Thapa, Usman Naseem, Mehwish Nasim:
CrisisHateMM: Multimodal Analysis of Directed and Undirected Hate Speech in Text-Embedded Images from Russia-Ukraine Conflict. 1994-2003 - Phanideep Gampa, Akash Anil Valsangkar, Shailesh Choubey, Pooja A:
Prioritised Moderation for Online Advertising. 2004-2012 - Ngoc Long Nguyen, Jérémy Anger, Axel Davy, Pablo Arias, Gabriele Facciolo:
L1BSR: Exploiting Detector Overlap for Self-Supervised Single-Image Super-Resolution of Sentinel-2 L1B Imagery. 2013-2023 - Mainak Singha, Ankit Jha, Bhupendra Solanki, Shirsha Bose, Biplab Banerjee:
APPLeNet: Visual Attention Parameterized Prompt Learning for Few-Shot Remote Sensing Image Generalization using CLIP. 2024-2034 - Roger Marí, Gabriele Facciolo, Thibaud Ehret:
Multi-Date Earth Observation NeRF: The Detail Is in the Shadows. 2035-2045 - Akhil Meethal, Eric Granger, Marco Pedersoli:
Cascaded Zoom-in Detector for High Resolution Aerial Images. 2046-2055 - Jamy Lafenetre, Ngoc Long Nguyen, Gabriele Facciolo, Thomas Eboli:
Handheld Burst Super-Resolution Meets Multi-Exposure Satellite Imagery. 2056-2064 - Thomas M. Mercier, Tasmiat Rahman, Amin Sabet:
Solar Irradiance Anticipative Transformer. 2065-2074 - Valerio Marsocci, Nicolas Gonthier, Anatol Garioud, Simone Scardapane, Clément Mallet:
GeoMultiTaskNet: remote sensing unsupervised domain adaptation using geographical coordinates. 2075-2085 - Patrick Ebel, Vivien Sainte Fare Garnot, Michael Schmitt, Jan Dirk Wegner, Xiao Xiang Zhu:
UnCRtainTS: Uncertainty Quantification for Cloud Removal in Optical Satellite Time Series. 2086-2096 - Mohamed Ali Chebbi, Ewelina Rupnik, Marc Pierrot-Deseilligny, Paul Lopes:
DeepSim-Nets: Deep Similarity Networks for Stereo Image Matching. 2097-2105 - Jamila Mifdal, Marc Tomás-Cruz, Alessandro Sebastianelli, Bartomeu Coll, Joan Duran:
Deep unfolding for hyper sharpening using a high-frequency injection module. 2106-2115 - Georgios Voulgaris, Andy Philippides, Jonathan Dolley, Jeremy Reffin, Fiona Marshall, Novi Quadrianto:
Seasonal Domain Shift in the Global South: Dataset and Deep Features Analysis. 2116-2124 - Valerie Pasquarella, Christopher F. Brown, Wanda Czerwinski, William Rucklidge:
Comprehensive quality assessment of optical satellite imagery using weakly supervised video learning. 2125-2135 - Jonathan Prexl, Michael Schmitt:
Multi-Modal Multi-Objective Contrastive Learning for Sentinel-1/2 Imagery. 2136-2144 - Joëlle Hanna, Michael Mommert, Damian Borth:
Sparse Multimodal Vision Transformer for Weakly Supervised Semantic Segmentation. 2145-2154 - Jonathan Giezendanner, Rohit Mukherjee, Matthew Purri, Mitchell Thomas, Max Mauerman, A. K. M. Saiful Islam, Beth Tellman:
Inferring the past: a combined CNN-LSTM deep learning framework to fuse satellites for historical inundation mapping. 2155-2165 - Linus Scheibenreif, Michael Mommert, Damian Borth:
Masked Vision Transformers for Hyperspectral Image Classification. 2166-2176 - Jiachen Li, Marianna Ohanyan, Vidit Goel, Shant Navasardyan, Yunchao Wei, Humphrey Shi:
VideoMatt: A Simple Baseline for Accessible Real-Time Video Matting. 2177-2186 - Guillaume Berger, Manik Dhingra, Antoine Mercier, Yashesh Savani, Sunny Panchal, Fatih Porikli:
QuickSRNet: Plain Single-Image Super-Resolution Architecture for Faster Inference on Mobile Platforms. 2187-2196 - Ruifeng Yuan, Yuhao Cheng, Yiqiang Yan, Haiyan Liu:
Real-time Segmenting Human Portrait at Anywhere. 2197-2203 - Penghao Jiang, Ke Xin, Chunxi Li, Yinsi Zhou:
High-efficiency Device-Cloud Collaborative Transformer Model. 2204-2210 - Mustafa Munir, William Avery, Radu Marculescu:
MobileViG: Graph-Based Sparse Attention for Mobile Vision Applications. 2211-2219 - Risheek Garrepalli, Jisoo Jeong, Rajeswaran C. Ravindran, Jamie Menjay Lin, Fatih Porikli:
DIFT: Dynamic Iterative Field Transforms for Memory Efficient Optical Flow. 2220-2229 - Dongning Ma, Pengfei Zhao, Xun Jiao:
PerfHD: Efficient ViT Architecture Performance Ranking using Hyperdimensional Computing. 2230-2237 - Wentao Zhu, Yufang Huang, Xiufeng Xie, Wenxian Liu, Jincan Deng, Debing Zhang, Zhangyang Wang, Ji Liu:
AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection. 2238-2247 - Yong Guo, Yaofo Chen, Yin Zheng, Qi Chen, Peilin Zhao, Junzhou Huang, Jian Chen, Mingkui Tan:
Pareto-aware Neural Architecture Generation for Diverse Computational Budgets. 2248-2258 - Ryosuke Yamada, Risa Shinoda, Hirokatsu Kataoka:
Exploring the Potential of Neural Dataset Search. 2259-2266 - Lujun Li, Anggeng Li:
A2-Aug: Adaptive Automated Data Augmentation. 2267-2274 - Lotte Hendrickx, Arne Symons, Wiebe Van Ranst, Marian Verhelst, Toon Goedemé:
Hardware-aware NAS by Genetic Optimisation with a Design Space Exploration Simulator. 2275-2283 - Andrew Hryniowski, Alexander Wong:
Systematic Architectural Design of Scale Transformed Attention Condenser DNNs via Multi-Scale Class Representational Response Similarity Analysis. 2284-2292 - Alexander Wong, Yifan Wu, Saad Abbasi, Saeejith Nair, Yuhao Chen, Mohammad Javad Shafiee:
Fast GraspNeXt: A Fast Self-Attention Neural Network Architecture for Multi-task Learning in Computer Vision Tasks for Robotic Grasping on the Edge. 2293-2297 - Soumalya Nandi, Sravanti Addepalli, Harsh Rangwani, R. Venkatesh Babu:
Certified Adversarial Robustness Within Multiple Perturbation Bounds. 2298-2305 - Yuwei Chen, Shiyong Chu:
Adversarial Defense in Aerial Detection. 2306-2313 - Zhengbao He, Tao Li, Sizhe Chen, Xiaolin Huang:
Investigating Catastrophic Overfitting in Fast Adversarial Training: A Self-fitting Perspective. 2314-2321 - Jianbo Chen, Xinwei Liu, Siyuan Liang, Xiaojun Jia, Yuan Xun:
Universal Watermark Vaccine: Universal Adversarial Perturbations for Watermark Protection. 2322-2329 - Soumyendu Sarkar, Ashwin Ramesh Babu, Sajad Mousavi, Sahand Ghorbanpour, Vineet Gundecha, Antonio Guillen, Ricardo Luna Gutierrez, Avisek Naug:
Robustness with Query-efficient Adversarial Attack using Reinforcement Learning. 2330-2337 - Hasan Abed Al Kader Hammoud, Adel Bibi, Philip H. S. Torr, Bernard Ghanem:
Don't FREAK Out: A Frequency-Inspired Approach to Detecting Backdoor Poisoned Samples in DNNs. 2338-2345 - Ren Wang, Yuxuan Li, Sijia Liu:
Exploring Diversified Adversarial Robustness in Neural Networks via Robust Mode Connectivity. 2346-2352 - Charles Godfrey, Henry Kvinge, Elise Bishoff, Myles Mckay, Davis Brown, Tim Doster, Eleanor Byler:
How many dimensions are required to find an adversarial example? 2353-2360 - Paul Gavrikov, Janis Keuper, Margret Keuper:
An Extended Study of Human-like Behavior under Adversarial Training. 2361-2368 - Zixiang Zhao, Jiang-She Zhang, Haowen Bai, Yicheng Wang, Yukun Cui, Lilun Deng, Kai Sun, Chunxia Zhang, Junmin Liu, Shuang Xu:
Deep Convolutional Sparse Coding Networks for Interpretable Image Fusion. 2369-2377 - Timothy Redgrave, Colton Crum:
Generating Adversarial Samples in Mini-Batches May Be Detrimental To Adversarial Robustness. 2378-2384 - Haomin Zhuang, Yihua Zhang, Sijia Liu:
A Pilot Study of Query-Free Adversarial Attack against Stable Diffusion. 2385-2392 - Hengyue Liang, Buyun Liang, Ju Sun, Ying Cui, Tim Mitchell:
Implications of Solution Patterns on Adversarial Robustness. 2393-2400 - Mert Kilickaya, Joaquin Vanschoren:
Are Labels Needed for Incremental Instance Learning? 2401-2409 - James Seale Smith, Junjiao Tian, Shaunak Halbe, Yen-Chang Hsu, Zsolt Kira:
A Closer Look at Rehearsal-Free Continual Learning. 2410-2420 - Abdelrahman Mohamed, Rushali Grandhe, K. J. Joseph, Salman H. Khan, Fahad Shahbaz Khan:
D3Former: Debiased Dual Distilled Transformer for Incremental Learning. 2421-2430 - Md Yousuf Harun, Jhair Gallardo, Tyler L. Hayes, Christopher Kanan:
How Efficient Are Today's Continual Learning Algorithms? 2431-2436 - Joachim Houyon, Anthony Cioppa, Yasir Ghunaim, Motasem Alfarra, Anaïs Halin, Maxim Henry, Bernard Ghanem, Marc Van Droogenbroeck:
Online Distillation with Continual Learning for Cyclic Domain Shifts. 2437-2446 - Elena Camuffo, Simone Milani:
Continual Learning for LiDAR Semantic Segmentation: Class-Incremental and Coarse-to-Fine strategies on Sparse Data. 2447-2456 - Prasanna B, Sunandini Sanyal, R. Venkatesh Babu:
Continual Domain Adaptation through Pruning-aided Domain-specific Weight Modulation. 2457-2463 - Niclas Vödisch, Daniele Cattaneo, Wolfram Burgard, Abhinav Valada:
CoVIO: Online Continual Learning for Visual-Inertial Odometry. 2464-2473 - Lama Alssum, Juan León Alcázar, Merey Ramazanova, Chen Zhao, Bernard Ghanem:
Just a Glimpse: Rethinking Temporal Information for Video Continual Learning. 2474-2483 - Xiaofan Yu, Yunhui Guo, Sicun Gao, Tajana Rosing:
SCALE: Online Self-Supervised Lifelong Learning without Prior Knowledge. 2484-2495 - Amir Nazemi, Zeyad Moustafa, Paul W. Fieguth:
CLVOS23: A Long Video Object Segmentation Dataset for Continual Learning. 2496-2505 - Chenshen Wu, Joost van de Weijer:
Density Map Distillation for Incremental Object Counting. 2506-2515 - Aristotelis Chrysakis, Marie-Francine Moens:
Simulating Task-Free Continual Learning Streams From Existing Datasets. 2516-2524 - Shikhar Srivastava, Mohammad Yaqub, Karthik Nandakumar:
Lifelong Learning of Task-Parameter Relationships for Knowledge Transfer. 2525-2534 - Chengxing Xie, Qian Ning, Weisheng Dong, Guangming Shi:
TFRGAN: Leveraging Text Information for Blind Face Restoration with Extreme Degradation. 2535-2545 - Luigi Riz, Andrea Caraffa, Matteo Bortolon, Mohamed Lamine Mekhalfi, Davide Boscaini, André Moura, José Antunes, André Dias, Hugo Silva, Andreas Leonidou, Christos Constantinides, Christos Keleshis, Dante Abate, Fabio Poiesi:
The MONET dataset: Multimodal drone thermal dataset recorded in rural scenarios. 2546-2554 - Yuren Cong, Jinhui Yi, Bodo Rosenhahn, Michael Ying Yang:
SSGVS: Semantic Scene Graph-to-Video Synthesis. 2555-2565 - Wenru Zheng, Ryota Yoshihashi, Rei Kawakami, Ikuro Sato, Asako Kanezaki:
Multi Event Localization by Audio-Visual Fusion with Omnidirectional Camera and Microphone Array. 2566-2574 - Zihui Xue, Radu Marculescu:
Dynamic Multimodal Fusion. 2575-2584 - Jae-Myung Kim, A. Sophia Koepke, Cordelia Schmid, Zeynep Akata:
Exposing and Mitigating Spurious Correlations for Cross-Modal Retrieval. 2585-2595 - Ying Wang, Jonas Pfeiffer, Nicolas Carion, Yann LeCun, Aishwarya Kamath:
Adapting Grounded Visual Question Answering Models to Low Resource Languages. 2596-2605 - Asmar Nadeem, Adrian Hilton, Robert Dawes, Graham A. Thomas, Armin Mustafa:
SEM-POS: Grammatically and Semantically Correct Video Captioning. 2606-2616 - Yiming Ma, Victor Sanchez, Soodeh Nikan, Devesh Upadhyay, Bhushan Atote, Tanaya Guha:
Robust Multiview Multimodal Driver Monitoring System Using Masked Multi-Head Self-Attention. 2617-2625 - Jun Zhu, Jiandong Jin, Zihan Yang, Xiaohao Wu, Xiao Wang:
Learning CLIP Guided Visual-Text Fusion Transformer for Video-based Pedestrian Attribute Recognition. 2626-2629 - Jia-Hong Huang, Chao-Han Huck Yang, Pin-Yu Chen, Min-Hung Chen, Marcel Worring:
Causalainer: Causal Explainer for Automatic Video Summarization. 2630-2636 - Avinash Madasu, Vasudev Lal:
Is Multimodal Vision Supervision Beneficial to Language? 2637-2642 - Giacomo Camposampiero, Loïc Houmard, Benjamin Estermann, Joël Mathys, Roger Wattenhofer:
Abstract Visual Reasoning Enabled by Language. 2643-2647 - Ekta Sood, Fabian Kögel, Philipp Müller, Dominike Thomas, Mihai Bâce, Andreas Bulling:
Multimodal Integration of Human-Like Attention in Visual Question Answering. 2648-2658 - Shiwei Jin, Ji Dai, Truong Nguyen:
Kappa Angle Regression with Ocular Counter-Rolling Awareness for Gaze Estimation. 2659-2668 - Hengfei Wang, Jun O. Oh, Hyung Jin Chang, Jin Hee Na, Minwoo Tae, Zhongqun Zhang, Sang-Il Choi:
GazeCaps: Gaze Estimation with Self-Attention-Routed Capsules. 2669-2677 - Nora Horanyi, Linfang Zheng, Eunji Chong, Ales Leonardis, Hyung Jin Chang:
Where are they looking in the 3D space? 2678-2687 - Haldun Balim, Seonwook Park, Xi Wang, Xucong Zhang, Otmar Hilliges:
EFE: End-to-end Frame-to-Gaze Estimation. 2688-2697 - Moritz Ibing, Gregor Kobsik, Leif Kobbelt:
Octree Transformer: Autoregressive 3D Shape Generation on Hierarchically Structured Sequences. 2698-2707 - Reza Asad, Manolis Savva:
3DSSR: 3D Subscene Retrieval. 2708-2716 - Chengzhi Wu, Junwei Zheng, Julius Pfrommer, Jürgen Beyerer:
Attention-based Part Assembly for 3D Volumetric Shape Modeling. 2717-2726 - Kseniya Cherenkova, Elona Dupont, Anis Kacem, Ilya Arzhannikov, Gleb Gusev, Djamila Aouada:
SepicNet: Sharp Edges Recovery by Parametric Inference of Curves in 3D Shapes. 2727-2735 - Ramesh Ashok Tabib, Nitishkumar Upasi, Tejas Anvekar, Dikshit Hegde, Uma Mudenagudi:
IPD-Net: SO(3) Invariant Primitive Decompositional Network for 3D Point Clouds. 2736-2744 - Federico Cunico, Federico Girella, Andrea Avogaro, Marco Emporio, Andrea Giachetti, Marco Cristani:
OO-dMVMT: A Deep Multi-view Multi-task Classification Framework for Real-time 3D Hand Gesture Classification and Segmentation. 2745-2754 - Gyeongsik Moon, Hongsuk Choi, Sanghyuk Chun, Jiyoung Lee, Sangdoo Yun:
Three Recipes for Better 3D Pseudo-GTs of 3D Human Mesh Estimation in the Wild. 2755-2764 - Chandra Kambhamettu:
3DSAINT Representation for 3D Point Clouds. 2765-2774 - Qiulin Chen, Jan P. Allebach:
Face Image Lighting Enhancement Using a 3D Model. 2775-2784 - Martin Sundermeyer, Tomás Hodan, Yann Labbé, Gu Wang, Eric Brachmann, Bertram Drost, Carsten Rother, Jirí Matas:
BOP Challenge 2022 on Detection, Segmentation and Pose Estimation of Specific Rigid Objects. 2785-2794 - Xinhan Di, Xiaokun Dai, Xinkang Zhang, Xinrong Chen:
Dual Attention Poser: Dual Path Body Tracking Based on Attention. 2795-2804 - Kaiwen Zheng, Jie Huang, Hu Yu, Feng Zhao:
Efficient Multi-exposure Image Fusion via Filter-dominated Fusion and Gradient-driven Unsupervised Learning. 2805-2814 - Kaiwen Zheng, Jie Huang, Man Zhou, Feng Zhao:
Asymmetric Color Transfer with Consistent Modality Learning. 2815-2823 - Dafeng Zhang, Jia Ouyang, Guanqun Liu, Xiaobing Wang, Xiangyu Kong, Zhezhu Jin:
FF-Former: Swin Fourier Transformer for Nighttime Flare Removal. 2824-2832 - Zhihao Fan, Xun Wu, Fanqing Meng, Yaqi Wu, Feng Zhang:
OTST: A Two-Phase Framework for Joint Denoising and Remosaicing in RGBW CFA. 2833-2842 - Soonyong Song, Heechul Bae:
Hard-negative Sampling with Cascaded Fine-Tuning Network to Boost Flare Removal Performance in the Nighttime Images. 2843-2852 - Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Qingpeng Zhu, Qianhui Sun, Wenxiu Sun, Chen Change Loy, Jinwei Gu, Shuai Liu, Hao Wang, Chaoyu Feng, Luyang Wang, Guangqi Shao, Chenguang Zhang, Xiaotao Wang, Lei Lei, Dafeng Zhang, Xiangyu Kong, Guanqun Liu, Mengmeng Bai, Jia Ouyang, Xiaobing Wang, Jiahui Yuan, Xinpeng Li, Chengzhi Jiang, Ting Jiang, Wenjie Lin, Qi Wu, Mingyan Han, Jinting Luo, Lei Yu, Haoqiang Fan, Shuaicheng Liu, Bo Yan, Zhuang Li, Yadong Li, Hongbin Wang, Soonyong Song, Minghan Fu, Rayyan Azam Khan, Fang-Xiang Wu, Zhao Zhang, Suiyi Zhao, Huan Zheng, Yangcheng Gao, Yanyan Wei, Jiahuan Ren, Bo Wang, Yan Luo, Shuaibo Gao, Wenhui Wu, Sicong Kang, Nikhil Akalwadi, Ankit Raichur, Vinod Patil, Allabakash Ghodesawar, Swaroop Adrashyappanamath, Amogh Joshi, Chaitra Desai, Ramesh Ashok Tabib, Ujwala Patil, Uma Mudenagudi, Sicheng Li, Ruoxi Zhu, Jiazheng Lian, Shusong Xu, Zihao Liu, Sabari Nathan, Priya Kansal:
MIPI 2023 Challenge on Nighttime Flare Removal: Methods and Results. 2853-2863 - Qingpeng Zhu, Wenxiu Sun, Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Qianhui Sun, Chen Change Loy, Jinwei Gu, Yi Yu, Yangke Huang, Kang Zhang, Meiya Chen, Yu Wang, Yongchao Li, Hao Jiang, Amrit Kumar Muduli, Vikash Kumar, Kunal Swami, Pankaj Kumar Bajpai, Yunchao Ma, Jiajun Xiao, Zhi Ling:
MIPI 2023 Challenge on RGB+ToF Depth Completion: Methods and Results. 2864-2870 - Qianhui Sun, Qingyu Yang, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu, Hongyuan Yu, Yuqing Liu, Weichen Yu, Lin Ge, Xiaolin Zhang, Qi Jia, Heng Zhang, Xuanwu Yin, Kunlong Zuo, Qi Wu, Wenjie Lin, Ting Jiang, Chengzhi Jiang, Mingyan Han, Xinpeng Li, Jinting Luo, Lei Yu, Haoqiang Fan, Shuaicheng Liu, Kunyu Wang, Chengzhi Cao, Yuanshen Guan, Jiyuan Xia, Ruikang Xu, Mingde Yao, Zhiwei Xiong:
MIPI 2023 Challenge on RGBW Fusion: Methods and Results. 2871-2877 - Qianhui Sun, Qingyu Yang, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu, Yuqing Liu, Hongyuan Yu, Weichen Yu, Zhen Dong, Binnan Han, Qi Jia, Xuanwu Yin, Kunlong Zuo, Yaqi Wu, Zhihao Fan, Fanqing Meng, Xun Wu, Jiawei Zhang, Feng Zhang, Mingyan Han, Jinting Luo, Qi Wu, Ting Jiang, Chengzhi Jiang, Wenjie Lin, Xinpeng Li, Lei Yu, Haoqiang Fan, Shuaicheng Liu:
MIPI 2023 Challenge on RGBW Remosaic: Methods and Results. 2878-2885 - Mohammad Baradaran, Robert Bergevin:
Multi-Task Learning based Video Anomaly Detection with Attention. 2886-2896 - Alessandro Flaborea, Bardh Prenkaj, Bharti Munjal, Marco Aurelio Sterpa, Dario Aragona, Luca Podo, Fabio Galasso:
Are we certain it's anomalous? 2897-2907 - Lars Heckler, Rebecca König, Paul Bergmann:
Exploring the Importance of Pretrained Feature Extractors for Unsupervised Anomaly Detection and Localization. 2917-2926 - Li-Ling Chiu, Shang-Hong Lai:
Self-Supervised Normalizing Flows for Image Anomaly Detection and Localization. 2927-2936 - Matej Grcic, Josip Saric, Sinisa Segvic:
On Advantages of Mask-level Recognition for Outlier-aware Segmentation. 2937-2947 - Mark S. Graham, Walter H. L. Pinaya, Petru-Daniel Tudosiu, Parashkev Nachev, Sébastien Ourselin, M. Jorge Cardoso:
Denoising diffusion models for out-of-distribution detection. 2948-2957 - Ziyi Yang, Iman Soltani Bozchalooi, Eric Darve:
Anomaly Detection with Domain Adaptation. 2958-2967 - Eliahu Horwitz, Yedid Hoshen:
Back to the Feature: Classical 3D Features are (Almost) All You Need for 3D Anomaly Detection. 2968-2977 - Niamh Belton, Misgina Tsighe Hagos, Aonghus Lawlor, Kathleen M. Curran:
FewSOME: One-Class Few Shot Anomaly Detection with Siamese Networks. 2978-2987 - Álvaro González-Jiménez, Simone Lionetti, Marc Pouly, Alexander A. Navarini:
SANO: Score-based Diffusion Model for Anomaly Localization in Dermatology. 2988-2994 - Yona Falinie A. Gaus, Neelanjan Bhowmik, Brian K. S. Isaac-Medina, Hubert P. H. Shum, Amir Atapour-Abarghouei, Toby P. Breckon:
Region-based Appearance and Flow Characteristics for Anomaly Detection in Infrared Surveillance Imagery. 2995-3005 - Ruian He, Shili Zhou, Ri Cheng, Yuqi Sun, Weimin Tan, Bo Yan:
Motion Matters: Difference-based Multi-scale Learning for Infrared UAV Detection. 3006-3015 - Yanyi Lyu, Zhunga Liu, Huandong Li, Dongxiu Guo, Yimin Fu:
A Real-time and Lightweight Method for Tiny Airborne Object Detection. 3016-3025 - Yifan Li, Dian Yuan, Meng Sun, Hongyu Wang, Xiaotao Liu, Jing Liu:
A Global-Local Tracking Framework Driven by Both Motion and Appearance for Infrared Anti-UAV. 3026-3035 - Qianjin Yu, Yinchao Ma, Jianfeng He, Dawei Yang, Tianzhu Zhang:
A Unified Transformer-based Tracker for Anti-UAV Tracking. 3036-3046 - Zongheng Tang, Yulu Gao, Zizheng Xun, Fengguang Peng, Yifan Sun, Si Liu, Bo Li:
Strong Detector with Simple Tracker. 3047-3053 - Xin Yang, Gang Wang, Weiming Hu, Jin Gao, Shubo Lin, Liang Li, Kai Gao, Yizheng Wang:
Video Tiny-Object Detection Guided by the Spatial-Temporal Motion Information. 3054-3063 - Jaime Spencer, C. Stella Qian, Michaela Trescakova, Chris Russell, Simon Hadfield, Erich W. Graf, Wendy J. Adams, Andrew J. Schofield, James H. Elder, Richard Bowden, Ali Anwar, Hao Chen, Xiaozhi Chen, Kai Cheng, Yuchao Dai, Huynh Thai Hoa, Sadat Hossain, Jianmian Huang, Mohan Jing, Bo Li, Chao Li, Baojun Li, Zhiwen Liu, Stefano Mattoccia, Siegfried Mercelis, Myungwoo Nam, Matteo Poggi, Xiaohua Qi, Jiahui Ren, Yang Tang, Fabio Tosi, Linh Trinh, S. M. Nadim Uddin, Khan Muhammad Umair, Kaixuan Wang, Yufei Wang, Yixing Wang, Mochu Xiang, Guangkai Xu, Wei Yin, Jun Yu, Qi Zhang, Chaoqiang Zhao:
The Second Monocular Depth Estimation Challenge. 3064-3076 - Blake VanBerlo, Brian Li, Alexander Wong, Jesse Hoey, Robert Arntfield:
Exploring the Utility of Self-Supervised Pretraining Strategies for the Detection of Absent Lung Sliding in M-Mode Lung Ultrasound. 3077-3086 - Abder-Rahman Ali, Anthony E. Samir, Peng Guo:
Self-Supervised Learning for Accurate Liver View Classification in Ultrasound Images with Minimal Labeled Data. 3087-3093 - Nick Luiken, Matteo Ravasi:
A deep learning-based approach to increase efficiency in the acquisition of ultrasonic non-destructive testing datasets. 3094-3102 - Daniel E. Shea, Sourabh Kulhare, Rachel Millin, Zohreh Laverriere, Courosh Mehanian, Charles B. Delahunt, Dipayan Banik, Xinliang Zheng, Meihua Zhu, Ye Ji, Travis Ostbye, Martha-Marie S. Mehanian, Atinuke Uwajeh, Adeseye M. Akinsete, Fen Wang, Matthew P. Horning:
Deep Learning Video Classification of Lung Ultrasound Features Associated with Pneumonia. 3103-3112 - Ayush Somani, Pragyan Banerjee, Krishna Agarwal, Manu Rastogi, Dilip K. Prasad, Anowarul Habib:
Image Inpainting with Hypergraphs for Resolution Improvement in Scanning Acoustic Microscopy. 3113-3122 - Shuning Chang, Pichao Wang, Fan Wang, Jiashi Feng, Mike Zheng Shou:
DOAD: Decoupled One Stage Action Detection Network. 3123-3132 - Saif Iftekar Sayed, Reza Ghoddoosian, Bhaskar Trivedi, Vassilis Athitsos:
A New Dataset and Approach for Timestamp Supervised Action Segmentation Using Human Object Interaction. 3133-3142 - Hacene Terbouche, Maryan Morel, Mariano Rodriguez, Alice Othmani:
Multi-Annotation Attention Model for Video Summarization. 3143-3152 - Volodymyr Fedynyak, Yaroslav Romanus, Oles Dobosevych, Igor Babin, Roman Riazantsev:
Global Motion Understanding in Large-Scale Video Object Segmentation. 3153-3162 - Kaer Huang, Kanokphan Lertniphonphan, Feng Chen, Jian Li, Zhepeng Wang:
Multi-Object Tracking by Self-supervised Learning Appearance Model. 3163-3169 - Daniel Stadler, Jürgen Beyerer:
An Improved Association Pipeline for Multi-Person Tracking. 3170-3179 - Tomoya Takahashi, Shingo Yashima, Kohta Ishikawa, Ikuro Sato, Rio Yokota:
Pixel-level Contrastive Learning of Driving Videos with Optical Flow. 3180-3187 - Kaicheng Yu, Tang Tao, Hongwei Xie, Zhiwei Lin, Tingting Liang, Bing Wang, Peng Chen, Dayang Hao, Yongtao Wang, Xiaodan Liang:
Benchmarking the Robustness of LiDAR-Camera Fusion for 3D Object Detection. 3188-3198 - Marvin Klemp, Kevin Rösch, Royden Wagner, Jannik Quehl, Martin Lauer:
LDFA: Latent Diffusion Face Anonymization for Self-driving Applications. 3199-3205 - Shubham Kedia, Yu Zhou, Sambhu H. Karumanchi:
Integrated Perception and Planning for Autonomous Vehicle Navigation: An Optimization-based Approach. 3206-3215 - Mengmeng Wang, Teli Ma, Xingxing Zuo, Jiajun Lv, Yong Liu:
Correlation Pyramid Network for 3D Single Object Tracking. 3216-3225 - Rizhao Fan, Matteo Poggi, Stefano Mattoccia:
Contrastive Learning for Depth Prediction. 3226-3237 - Yao Rong, Xiangyu Wei, Tianwei Lin, Yueyu Wang, Enkelejda Kasneci:
DynStatF: An Efficient Feature Fusion Strategy for LiDAR 3D Object Detection. 3238-3247 - Alexander Naumann, Felix Hertlein, Daniel Grimm, Maximilian Zipfl, Steffen Thoma, Achim Rettinger, Lavdim Halilaj, Juergen Luettin, Stefan Schmid, Holger Caesar:
Lanelet2 for nuScenes: Enabling Spatial Semantic Relationships and Diverse Map-based Anchor Paths. 3248-3257 - Haiyu Wu, Grace Bezold, Manuel Günther, Terrance E. Boult, Michael C. King, Kevin W. Bowyer:
Consistency and Accuracy of CelebA Attribute Values. 3258-3266 - Timo Kaiser, Christoph Reinders, Bodo Rosenhahn:
Compensation Learning in Semantic Segmentation. 3267-3278 - Yuhao Chen, Shen Zhang, Renjie Song:
Scoring Your Prediction on Unseen Data. 3279-3288 - Weiyu Feng, Seth Z. Zhao, Chuanyu Pan, Adam Chang, Yichen Chen, Zekun Wang, Allen Y. Yang:
Digital Twin Tracking Dataset (DTTD): A New RGB+Depth 3D Dataset for Longer-Range Object Tracking Applications. 3289-3298 - Shuyu Miao, Lin Zheng, Jingjing Liu, Hong Jin:
K-means Clustering Based Feature Consistency Alignment for Label-free Model Evaluation. 3299-3307 - Jihun Yoon, Min-Kook Choi:
Exploring Video Frame Redundancies for Efficient Data Sampling and Annotation in Instance Segmentation. 3308-3317 - Aboli Marathe, Deva Ramanan, Rahee Walambe, Ketan Kotecha:
WEDGE: A multi-weather autonomous driving dataset built from generative vision-language models. 3318-3327 - Sania Zahan, Syed Zulqarnain Gilani, Ghulam Mubashar Hassan, Ajmal Mian:
Human Gesture and Gait Analysis for Autism Detection. 3328-3337 - Muhammad Haseeb Aslam, Muhammad Osama Zeeshan, Marco Pedersoli, Alessandro L. Koerich, Simon Bacon, Eric Granger:
Privileged Knowledge Distillation for Dimensional Emotion Recognition in the Wild. 3338-3347 - Yao Hu, Xinyu Du, Shengbing Jiang:
Online LiDAR-to-Vehicle Alignment Using Lane Markings and Traffic Signs. 3348-3357 - Sriram Krishna, Basavaraja Shanthappa Vandrotti:
DeepSmooth: Efficient and Smooth Depth Completion. 3358-3367 - Gaowen Liu, Yuzhang Shang, Yuguang Yao, Ramana Kompella:
Network Specialization via Feature-level Knowledge Distillation. 3368-3375 - Hatem Ibrahem, Ahmed Salem, Hyun Soo Kang:
ST-RoomNet: Learning Room Layout Estimation From Single Image Through Unsupervised Spatial Transformations. 3376-3384 - Hidetomo Sakaino:
PanopticVis: Integrated Panoptic Segmentation for Visibility Estimation at Twilight and Night. 3385-3398 - Junhyeong Bak, In Kyu Park:
Light Field Synthesis from a Monocular Image using Variable LDI. 3399-3407 - Zeyu Xiao, Ruisheng Gao, Yutong Liu, Yueyi Zhang, Zhiwei Xiong:
Toward Real-World Light Field Super-Resolution. 3408-3418 - Xueting Yang, Junli Deng, Rongshan Chen, Ruixuan Cong, Wei Ke, Hao Sheng:
Disentangling Local and Global Information for Light Field Depth Estimation. 3419-3427 - Nguyen P. Nguyen, Ramakrishna Surya, Prasad Calyam, Kannappan Palaniappan, Matthew R. Maschmann, Filiz Bunyak:
CNT-NeRF: Carbon Nanotube Forest Depth Layer Decomposition in SEM Imagery using Generative Adversarial Networks. 3428-3437 - Tun Wang, Rongshan Chen, Ruixuan Cong, Da Yang, Zhenglong Cui, Fangping Li, Hao Sheng:
EPI-Guided Cost Construction Network for Light Field Disparity Estimation. 3438-3446 - Joshitha Ravishankar, Sally Khaidem, Mansi Sharma:
A Data-Driven Approach based on Dynamic Mode Decomposition for Efficient Encoding of Dynamic Light Fields. 3447-3453 - Yiming Li, Ruixuan Cong, Sizhe Wang, Mingyuan Zhao, Yang Zhang, Fangping Li, Hao Sheng:
Multi-view Semantic Information Guidance for Light Field Image Segmentation. 3454-3462 - Lin Zhong, Bangcheng Zong, Qiming Wang, Junle Yu, Wenhui Zhou:
Implicit Epipolar Geometric Function based Light Field Continuous Angular Representation. 3463-3472 - Hao Sheng, Yebin Liu, Jingyi Yu, Gaochang Wu, Wei Xiong, Ruixuan Cong, Rongshan Chen, Longzhao Guo, Yanlin Xie, Shuo Zhang, Song Chang, Youfang Lin, Wentao Chao, Xuechun Wang, Guanghui Wang, Fuqing Duan, Tun Wang, Da Yang, Zhenglong Cui, Sizhe Wang, Mingyuan Zhao, Qiong Wang, Qianyu Chen, Zhengyu Liang, Yingqian Wang, Jungang Yang, Xueting Yang, Junli Deng:
LFNAT 2023 Challenge on Light Field Depth Estimation: Methods and Results. 3473-3485 - Hernan Carrillo, Michaël Clément, Aurélie Bugeau, Edgar Simo-Serra:
Diffusart: Enhancing Line Art Colorization with Conditional Diffusion Models. 3486-3490 - Liyuan Ma, Tingwei Gao, Haibin Shen, Kejie Huang:
FreqHPT: Frequency-aware attention and flow fusion for Human Pose Transfer. 3491-3496 - Ryotaro Shimizu, Takuma Nakamura, Masayuki Goto:
Fashion-Specific Ambiguous Expression Interpretation with Partial Visual-Semantic Embedding. 3497-3502 - Vinay Kumar Verma, Dween Rabius Sanny, Shreyas Sunil Kulkarni, Prateek Sircar, Abhishek Singh, Deepak Gupta:
SkiLL: Skipping Color and Label Landscape: Self Supervised Design Representations for Products in E-commerce. 3503-3507 - Masanari Kimura, Takuma Nakamura, Yuki Saito:
SHIFT15M: Fashion-specific dataset for set-to-set matching with several distribution shifts. 3508-3513 - Mingyu Wang, Ata Mahjoubfar, Anupama Joshi:
FashionVQA: A Domain-Specific Visual Question Answering System. 3514-3519 - Rohan Sarkar, Achal Dave, Gerard Medioni, Benjamin Biggs:
Shape of You: Precise 3D shape estimations for diverse body types. 3520-3524 - Shidong Cao, Wenhao Chai, Shengyu Hao, Gaoang Wang:
Image Reference-guided Fashion Design with Structure-aware Transfer by Diffusion Models. 3525-3529 - Zhi-Song Liu, Li-Wen Wang, Wan-Chi Siu, Vicky Kalogeiton:
Name your style: text-guided artistic style transfer. 3530-3534 - Hao Tian, Yu Cao, P. Y. Mok:
DETR-based Layered Clothing Segmentation and Fine-Grained Attribute Recognition. 3535-3539 - Nikolaos Zioulis, James F. O'Brien:
KBody: Balanced monocular whole-body estimation. 3540-3545 - Surgan Jandial, Shripad V. Deshmukh, Abhinav Java, Simra Shahid, Balaji Krishnamurthy:
Gatha: Relational Loss for enhancing text-based style transfer. 3546-3551 - Mizuki Tabata, Kana Kurata, Junichiro Tamamatsu:
Shape-Net: Room Layout Estimation from Panoramic Images Robust to Occlusion using Knowledge Distillation with 3D Shapes as Additional Inputs. 3552-3561 - Jinming Su, Ruihong Yin, Shuaibin Zhang, Junfeng Luo:
Motion-state Alignment for Video Semantic Segmentation. 3571-3580 - Jinming Su, Ruihong Yin, Xingyue Chen, Junfeng Luo:
Perceive, Excavate and Purify: A Novel Object Mining Framework for Instance Segmentation. 3581-3590 - Hidetomo Sakaino:
PanopticRoad: Integrated Panoptic Road Segmentation Under Adversarial Conditions. 3591-3603 - Xi Ye, Guillaume-Alexandre Bilodeau:
A unified model for continuous conditional video prediction. 3604-3613 - Muhammad Rameez Ur Rahman, Luca Scofano, Edoardo De Matteis, Alessandro Flaborea, Alessio Sampieri, Fabio Galasso:
Best Practices for 2-Body Pose Forecasting. 3614-3624 - Haotian Xue, Antonio Torralba, Joshua B. Tenenbaum, Daniel Yamins, Yunzhu Li, Hsiao-Yu Tung:
3D-IntPhys: Towards More Generalized 3D-grounded Visual Intuitive Physics under Challenging Scenes. 3625-3635 - Francesco Ragusa, Giovanni Maria Farinella, Antonino Furnari:
StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipation. 3636-3645 - Vladan Filipovic, Dimitrije Stefanovic, Nina Pajevic, Zeljana Grbovic, Nemanja Djuric, Marko Panic:
Bush Detection for Vision-based UGV Guidance in Blueberry Orchards: Data Set and Methods. 3646-3655 - Yuyu Guo, Yancheng Bai, Daiqi Shi, Yan Cai, Wei Bian:
DPOSE: Online Keypoint-CAM Guided Inference for Driver Pose Estimation with GMM-based Balanced Sampling. 3656-3665 - Je-Seok Ham, Dae Hoe Kim, NamKyo Jung, Jinyoung Moon:
CIPF: Crossing Intention Prediction Network based on Feature Fusion Modules for Improving Pedestrian Safety. 3666-3675 - Khoa Vo, Trong-Thang Pham, Kashu Yamazaki, Minh Q. Tran, Ngan Le:
DNA: Deformable Neural Articulations Network for Template-free Dynamic 3D Human Reconstruction from Monocular RGB-D Video. 3676-3685 - Chul Gwon, Steven C. Howell:
ODSmoothGrad: Generating Saliency Maps for Object Detectors. 3686-3690 - Romain Xu-Darme, Georges Quénot, Zakaria Chihani, Marie-Christine Rousset:
Sanity checks for patch visualisation in prototype-based image classification. 3691-3696 - Sebastian Bordt, Uddeshya Upadhyay, Zeynep Akata, Ulrike von Luxburg:
The Manifold Hypothesis for Gradient-Based Explanations. 3697-3702 - Sadaf Gulshad, Teng Long, Nanne van Noord:
Hierarchical Explanations for Video Action Recognition. 3703-3708 - Anna Arias-Duart, Ettore Mariotti, Dario Garcia-Gasulla, Jose Maria Alonso-Moral:
A Confusion Matrix for Evaluating Feature Attribution Methods. 3709-3714 - Lenka Tetková, Lars Kai Hansen:
Robustness of Visual Explanations to Common Data Augmentation Methods. 3715-3720 - Nicolas M. Müller, Jochen Jacobs, Jennifer Williams, Konstantin Böttinger:
Localized Shortcut Removal. 3721-3725 - Piotr Komorowski, Hubert Baniecki, Przemyslaw Biecek:
Towards Evaluating Explanations of Vision Transformers for Medical Imaging. 3726-3732 - Syed Nouman Hasany, Caroline Petitjean, Fabrice Mériaudeau:
Seg-XRes-CAM: Explaining Spatially Local Regions in Image Segmentation. 3733-3738 - Jonas Theiner, Nils Nommensen, Jim Rhotert, Matthias Springstein, Eric Müller-Budack, Ralph Ewerth:
Analyzing Results of Depth Estimation Models with Monocular Criteria. 3739-3743 - Mazda Moayeri, Keivan Rezaei, Maziar Sanjabi, Soheil Feizi:
Text2Concept: Concept Activation Vectors Directly from Text. 3744-3749 - Pushkar Shukla, Sushil Bharati, Matthew A. Turk:
CAVLI - Using image associations to produce local concept-based explanations. 3750-3755 - Angelos Nalmpantis, Apostolos Panagiotopoulos, John Gkountouras, Konstantinos Papakostas, Wilker Aziz:
Vision DiffMask: Faithful Interpretation of Vision Transformers with Differentiable Patch Masking. 3756-3763 - Akash Guna R. T., Raul Benitez, O. K. Sikha:
Ante-Hoc Generation of Task-Agnostic Interpretation Maps. 3764-3769 - Laura O'Mahony, Vincent Andrearczyk, Henning Müller, Mara Graziani:
Disentangling Neuron Representations with Concept Vectors. 3770-3775 - Katelyn Morrison, Ankita Mehra, Adam Perer:
Shared Interest...Sometimes: Understanding the Alignment between Human Perception, Vision Architectures, and Saliency Map Techniques. 3776-3781 - Pedro Madeira, André V. Carreiro, Alex Gaudio, Luís Rosado, Filipe Soares, Asim Smailagic:
ZEBRA: Explaining rare cases through outlying interpretable concepts. 3782-3788 - Alexander Koenig, Maximilian Schambach, Johannes S. Otterbach:
Uncovering the Inner Workings of STEGO for Safe Unsupervised Semantic Segmentation. 3789-3798 - Cristiano Patrício, João C. Neves, Luís F. Teixeira:
Coherent Concept-based Explanations in Medical Image and Its Application to Skin Lesion Diagnosis. 3799-3808 - Sungtae An, Nataraj Jammalamadaka, Eunji Chong:
Maximum Entropy Information Bottleneck for Uncertainty-aware Stochastic Embedding. 3809-3818 - Frederik Pahde, Galip Ümit Yolcu, Alexander Binder, Wojciech Samek, Sebastian Lapuschkin:
Optimizing Explanations by Network Canonization and Hyperparameter Search. 3819-3828 - Maximilian Dreyer, Reduan Achtibat, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin:
Revealing Hidden Context Bias in Segmentation and Object Detection through Concept-specific Explanations. 3829-3839 - Sujan Sai Gannamaneni, Arwin Sadaghiani, Rohil Prakash Rao, Michael Mock, Maram Akila:
Investigating CLIP Performance for Meta-data Generation in AD Datasets. 3840-3850 - Andreas Bär, Jonas Uhrig, Jeethesh Pai Umesh, Marius Cordts, Tim Fingscheidt:
A Novel Benchmark for Refinement of Noisy Localization Labels in Autolabeled Datasets for Object Detection. 3851-3860 - Soumyendu Sarkar, Ashwin Ramesh Babu, Sajad Mousavi, Sahand Ghorbanpour, Vineet Gundecha, Antonio Guillen, Ricardo Luna Gutierrez, Avisek Naug:
RL-CAM: Visual Explanations for Convolutional Networks using Reinforcement Learning. 3861-3869 - Jingxing Zhou, Jürgen Beyerer:
Category Differences Matter: A Broad Analysis of Inter-Category Error in Semantic Segmentation. 3870-3880 - Galadrielle Humblot-Renaux, Sergio Escalera, Thomas B. Moeslund:
Beyond AUROC & co. for evaluating out-of-distribution detection performance. 3881-3890 - Mert Keser, Gesina Schwalbe, Azarm Nowzad, Alois Knoll:
Interpretable Model-Agnostic Plausibility Verification for 2D Object Detectors Using Domain-Invariant Concept Bottleneck Models. 3891-3900 - Gyubeom Im, Keunjoo Park, Junseok Kim, Bongki Son, Seungchul Shin, Haechang Lee:
Live Demonstration: PINK: Polarity-based Anti-flicker for Event Cameras. 3901-3902 - Sami Barchid, José Mennesson, Chaabane Djeraba:
Exploring Joint Embedding Architectures and Data Augmentations for Self-Supervised Representation Learning in Event-Based Vision. 3903-3912 - Alexander Kugele, Thomas Pfeil, Michael Pfeiffer, Elisabetta Chicca:
How Many Events Make an Object? Improving Single-frame Object Detection on the 1 Mpx Dataset. 3913-3922 - Ionut Schiopu, Radu Ciprian Bilcu:
Entropy Coding-based Lossless Compression of Asynchronous Event Sequences. 3923-3930 - Yusuke Sekikawa, Jun Nagata:
Live Demonstration: Tangentially Elongated Gaussian Belief Propagation for Event-based Incremental Optical Flow Estimation. 3931-3932 - Burak Ercan, Onur Eker, Aykut Erdem, Erkut Erdem:
EVREAL: Towards a Comprehensive Benchmark and Analysis Suite for Event-based Video Reconstruction. 3943-3952 - Thomas Dalgaty, Thomas Mesquida, Damien Joubert, Amos Sironi, Pascal Vivet, Christoph Posch:
HUGNet: Hemi-Spherical Update Graph Neural Network applied to low-latency event-based optical flow. 3953-3962 - Germain Haessig, Damien Joubert, Justin Haque, Moritz B. Milde, Tobi Delbruck, Viktor Gruev:
PDAVIS: Bio-inspired Polarization Event Camera. 3963-3972 - Tobi Delbruck, Zuowen Wang, Haiyang Mei, Germain Haessig, Damien Joubert, Justin Haque, Yingkai Chen, Moritz B. Milde, Viktor Gruev:
Live Demo: E2P-Events to Polarization Reconstruction from PDAVIS Events. 3973-3975 - William Chamorro, Joan Solà, Juan Andrade-Cetto:
Event-IMU fusion strategies for faster-than-IMU estimation throughput. 3976-3983 - Sami Arja, Alexandre Marcireau, Richard L. Balthazor, Matthew G. McHarg, Saeed Afshar, Gregory Cohen:
Density Invariant Contrast Maximization for Neuromorphic Earth Observations. 3984-3994 - Laurie Bose, Piotr Dudek, Stephen J. Carey, Jianing Chen:
Live Demonstration: SCAMP-7. 3995-3996 - Antony W. N'Dri, Thomas Barbier, Céline Teulière, Jochen Triesch:
Predictive Coding Light: learning compact visual codes by combining excitatory and inhibitory spike timing-dependent plasticity*. 3997-4006 - Wieland Morgenstern, Niklas Gard, Simon Baumann, Anna Hilsmann, Peter Eisert:
X-maps: Direct Depth Lookup for Event-based Structured Light Systems. 4007-4015 - Kenneth Chaney, Fernando Cladera Ojeda, Ziyun Wang, Anthony Bisulco, M. Ani Hsieh, Christopher M. Korpela, Vijay Kumar, Camillo J. Taylor, Kostas Daniilidis:
M3ED: Multi-Robot, Multi-Sensor, Multi-Environment Event Dataset. 4016-4023 - Gaurvi Goyal, Franco Di Pietro, Nicoló Carissimi, Arren Glover, Chiara Bartolozzi:
MoveEnet: Online High-Frequency Human Pose Estimation with an Event Camera. 4024-4033 - Ryan Page:
Live Demonstration: Integrating Event Based Hand Tracking Into TouchFree Interactions. 4034-4035 - Marco Monforte, Luna Gava, Massimiliano Iacono, Arren Glover, Chiara Bartolozzi:
Fast Trajectory End-Point Prediction with Event Cameras for Reactive Robot Control. 4036-4044 - Rui Graça, Brian McReynolds, Tobi Delbruck:
Shining light on the DVS pixel: A tutorial and discussion about biasing and optimization. 4045-4053 - Ryogo Niwa, Tatsuki Fushimi, Kenta Yamamoto, Yoichi Ochiai:
Live Demonstration: Event-based Visual Microphone. 4054-4055 - Marcin Kowalczyk, Tomasz Kryjak:
Interpolation-Based Event Visual Data Filtering Algorithms. 4056-4064 - Chiara Boretti, Philippe Bich, Fabio Pareschi, Luciano Prono, Riccardo Rovatti, Gianluca Setti:
PEDRo: an Event-based Dataset for Person Detection in Robotics. 4065-4070 - Stefano Chiavazza, Svea Marie Meyer, Yulia Sandamirskaya:
Low-latency monocular depth estimation using event timing on neuromorphic hardware. 4071-4080 - Arjun Roy, Manish Nagaraj, Chamika Mihiranga Liyanagedera, Kaushik Roy:
Live Demonstration: Real-time Event-based Speed Detection using Spiking Neural Networks. 4081-4082 - Sanket Kachole, Yusra Alkendi, Fariborz Baghaei Naeini, Dimitrios Makris, Yahya H. Zweiri:
Asynchronous Events-based Panoptic Segmentation using Graph Mixer Neural Network. 4083-4092 - Amélie Gruel, Lucía Trillo Carreras, Marina Bueno García, Ewa Kupczyk, Jean Martinet:
Frugal event data: how small is too small? A human performance assessment with shrinking data. 4093-4100 - Hugo Bulzomi, Marcel Schweiker, Amélie Gruel, Jean Martinet:
End-to-end Neuromorphic Lip Reading. 4101-4108 - Lorenzo Berlincioni, Luca Cultrera, Chiara Albisani, Lisa Cresti, Andrea Leonardo, Sara Picchioni, Federico Becattini, Alberto Del Bimbo:
Neuromorphic Event-based Facial Expression Recognition. 4109-4119 - Takuya Nakabayashi, Kunihiro Hasegawa, Masakazu Matsugu, Hideo Saito:
Event-based Blur Kernel Estimation For Blind Motion Deblurring. 4120-4128 - Yannick Schnider, Stanislaw Wozniak, Mathias Gehrig, Jules Lecomte, Axel von Arnim, Luca Benini, Davide Scaramuzza, Angeliki Pantazi:
Neuromorphic Optical Flow and Real-time Implementation with Event Cameras. 4129-4138 - Steven Abreu, Muhammed Gouda, Alessio Lugnan, Peter Bienstman:
Flow cytometry with event-based vision and spiking neuromorphic hardware. 4139-4147 - Adarsh Kumar Kosta, Marco Paul E. Apolinario, Kaushik Roy:
Live Demonstration: ANN vs SNN vs Hybrid Architectures for Event-based Real-time Gesture Recognition and Optical Flow Estimation. 4148-4149 - Pablo Rodrigo Gantier Cadena, Yeqiang Qian, Chunxiang Wang, Ming Yang:
Sparse-E2VID: A Sparse Convolutional Model for Event-Based Video Reconstruction Trained with Real Event Noise. 4150-4158 - Rajhans Singh, Ankita Shukla, Pavan K. Turaga:
Improving Shape Awareness and Interpretability in Deep Networks Using Geometric Moments. 4159-4168 - Lokender Tiwari, Brojeshwar Bhowmick, Sanjana Sinha:
GenSim: Unsupervised Generic Garment Simulator. 4169-4178 - Tejas Anvekar, Dena Bazazian:
GPr-Net: Geometric Prototypical Network for Point Cloud Few-Shot Learning. 4179-4188 - Dan Zhang, Fangfang Zhou, Yuwen Jiang, Zhengming Fu:
MM-BSN: Self-Supervised Image Denoising for Real-World with Multi-Mask based on Blind-Spot Network. 4189-4198 - Yufeng Li, Jiyang Lu, Hongming Chen, Xianhao Wu, Xiang Chen:
Dilated Convolutional Transformer for High-Quality Image Deraining. 4199-4207 - Sunhyeok Lee, Donggon Jang, Dae-Shik Kim:
Temporally Averaged Regression for Semi-Supervised Low-Light Image Enhancement. 4208-4217 - Zhentao Fan, Xianhao Wu, Xiang Chen, Yufeng Li:
Learning to See in Nighttime Driving Scenes with Inter-frequency Priors. 4218-4225 - Mustafa Ozcan, Hamza Ergezer, Mustafa Ayazoglu:
FLIGHT Mode On: A Feather-Light Network for Low-Light Image Enhancement. 4226-4235 - Weiyun Jiang, Vivek Boominathan, Ashok Veeraraghavan:
NeRT: Implicit Neural Representations for Unsupervised Atmospheric Turbulence Mitigation. 4236-4243 - Najib Ishaq, Nathan Hotaling, Nicholas Schaub:
Theia: Bleed-Through Estimation with Convolutional Neural Networks. 4244-4252 - Natalia Khanzhina, Maxim Kashirin, Andrey Filchenkov:
New Bayesian Focal Loss Targeting Aleatoric Uncertainty Estimate: Pollen Image Recognition. 4253-4262 - Björn Möller, Jan Pirklbauer, Marvin Klingner, Peer Kasten, Markus Etzkorn, Tim J. Seifert, Uta Schlickum, Tim Fingscheidt:
A Super-Resolution Training Paradigm Based on Low-Resolution Data Only to Surpass the Technical Limits of STEM and STM Microscopy. 4263-4272 - Minghao Chen, Mukesh Bangalore Renuka, Lu Mi, Jeff Lichtman, Nir Shavit, Yaron Meirovitch:
Learning to Correct Sloppy Annotations in Electron Microscopy Volumes. 4273-4284 - Maciej Sypetkowski, Morteza Rezanejad, Saber Saberian, Oren Kraus, John Urbanik, James Taylor, Ben Mabey, Mason Victors, Jason Yosinski, Alborz Rezazadeh Sereshkeh, Imran S. Haque, Berton Earnshaw:
RxRx1: A Dataset for Evaluating Experimental Batch Correction Methods. 4285-4294 - Sota Kato, Kazuhiro Hotta:
One-shot and Partially-Supervised Cell Image Segmentation Using Small Visual Prompt. 4295-4304 - Tristan Lazard, Marvin Lerousseau, Etienne Decencière, Thomas Walter:
Giga-SSL: Self-Supervised Learning for Gigapixel Images. 4305-4314 - Yue Han, Yang Lei, Viktor Shkolnikov, Daisy Xin, Alicia Auduong, Steven Barcelo, Jan P. Allebach, Edward J. Delp:
An Ensemble Method with Edge Awareness for Abnormally Shaped Nuclei Segmentation. 4315-4325 - Wolfgang M. Pernice, Michael Doron, Alex Quach, Aditya Pratapa, Sultan Kenjeyev, Nicholas De Veaux, Michio Hirano, Juan C. Caicedo:
Out of Distribution Generalization via Interventional Style Transfer in Single-Cell Microscopy. 4326-4335 - Vedrana Andersen Dahl, Anders Bjorholm Dahl:
Fast Local Thickness. 4336-4344 - Lingrui Zhang, Shuheng Zhang, Guoyang Xie, Jiaqi Liu, Hua Yan, Jinbao Wang, Feng Zheng, Yaochu Jin:
What Makes a Good Data Augmentation for Few-Shot Unsupervised Image Anomaly Detection? 4345-4354 - Guangyu Ren, Michalis Lazarou, Jing Yuan, Tania Stathaki:
Towards Automated Polyp Segmentation Using Weakly- and Semi-Supervised Learning and Deformable Transformers. 4355-4364 - JunKyu Jang, Eugene Hwang, Sung-Hyuk Park:
N-pad : Neighboring Pixel-based Industrial Anomaly Detection. 4365-4374 - Xian Yeow Lee, Lasitha Vidyaratne, Mahbubul Alam, Ahmed K. Farahat, Dipanjan Ghosh, Maria Teresa Gonzalez Diaz, Chetan Gupta:
XDNet: A Few-Shot Meta-Learning Approach for Cross-Domain Visual Inspection. 4375-4384 - Yizhou Jin, Yu Lu, Gang Zhou, Qingjie Liu, Yunhong Wang:
Glass Wool Defect Detection Using an Improved YOLOv5. 4385-4394 - Weizhi Liu, Chang Liu, Qiang Liu, Dahai Yu:
Assigned MURA Defect Generation Based on Diffusion Model. 4395-4402 - Alexander Naumann, Felix Hertlein, Laura Dörr, Kai Furmans:
Parcel3D: Shape Reconstruction from Single RGB Images for Applications in Transportation Logistics. 4403-4413 - Liang Xu, Han Zou, Takayuki Okatani:
How Do Label Errors Affect Thin Crack Detection by DNNs. 4414-4423 - Juraj Fulir, Lovro Bosnar, Hans Hagen, Petra Gospodnetic:
Synthetic Data for Defect Segmentation on Complex Metal Surfaces. 4424-4434 - Chengkan Lv, Zhengtao Zhang, Fei Shen, Feng Zhang:
Unsupervised Automatic Defect Inspection based on Image Matching and Local One-class Classification. 4435-4444 - Jing Wei, Fei Shen, Chengkan Lv, Zhengtao Zhang, Feng Zhang, Huabin Yang:
Diversified and Multi-Class Controllable Industrial Defect Synthesis for Data Augmentation and Transfer. 4445-4453 - Xiaomeng Zhu, Talha Bilal, Pär Mårtensson, Lars Hanson, Mårten Björkman, Atsuto Maki:
Towards Sim-to-Real Industrial Parts Classification with Synthetic Dataset. 4454-4463 - Faranak Shamsafar, Sunil Prasad Jaiswal, Benjamin Kelkel, Kireeti Bodduna, Klaus Illgner-Fehns:
Leveraging Multi-view Data for Improved Detection Performance: An Industrial Use Case. 4464-4471 - I-Sheng Fang, Hsiao-Chieh Wen, Chia-Lun Hsu, Po-Chung Jen, Ping-Yang Chen, Yong-Sheng Chen:
ES3Net: Accurate and Efficient Edge-based Self-Supervised Stereo Matching Network. 4472-4481 - Jef Plochaet, Toon Goedemé:
Hardware-Aware Pruning for FPGA Deep Learning Accelerators. 4482-4490 - Ethan Goan, Clinton Fookes:
Uncertainty in Real-Time Semantic Segmentation on Embedded Systems. 4491-4501 - Vivek Parmar, Sandeep Kaur Kingra, Syed Shakib Sarwar, Ziyun Li, Barbara De Salvo, Manan Suri:
Fully-Binarized Distance Computation based On-device Few-Shot Learning for XR applications. 4502-4508 - Moritz Ibing, Isaak Lim, Leif Kobbelt:
Localized Latent Updates for Fine-Tuning Vision-Language Models. 4509-4518 - James Stewart, Umberto Michieli, Mete Ozay:
Data-Free Model Pruning at Initialization via Expanders. 4519-4524 - Shuming Liu, Mengmeng Xu, Chen Zhao, Xu Zhao, Bernard Ghanem:
ETAD: Training Action Detection End to End on a Laptop. 4525-4534 - Elahe Rahimian, Golara Javadi, Frederick Tung, Gabriel Leivas Oliveira:
DynaShare: Task and Instance Conditioned Parameter Sharing for Multi-Task Learning. 4535-4543 - Robin Hesse, Simone Schaub-Meyer, Stefan Roth:
Content-Adaptive Downsampling in Convolutional Neural Networks. 4544-4553 - Zeqi Zhu, Arash Pourtaherian, Luc Waeijen, Egor Bondarev, Orlando Moreira:
STAR: Sparse Thresholded Activation under partial-Regularization for Activation Sparsity Exploration. 4554-4563 - Martin Ferianc, Miguel Rodrigues:
MIMMO: Multi-Input Massive Multi-Output Neural Network. 4564-4569 - Purbayan Kar, Vishal M. Chudasama, Naoyuki Onoe, Pankaj Wasnik:
Revisiting Class Imbalance for End-to-end Semi-Supervised Object Detection. 4570-4579 - Saurabh Kumar Jain, Sukhendu Das:
MARRS: Modern Backbones Assisted Co-training for Rapid and Robust Semi-Supervised Domain Adaptation. 4580-4589 - Manogna Sreenivas, Soma Biswas:
Similar Class Style Augmentation for Efficient Cross-Domain Few-Shot Learning. 4590-4598 - Daniel Bolya, Judy Hoffman:
Token Merging for Fast Stable Diffusion. 4599-4603 - Zhangheng Li, Yu Gong, Zhenyu Zhang, Xingyun Xue, Tianlong Chen, Yi Liang, Bo Yuan, Zhangyang Wang:
Accelerable Lottery Tickets with the Mixed-Precision Quantization. 4604-4612 - Tomer Ronen, Omer Levy, Avram Golbert:
Vision Transformers with Mixed-Resolution Tokenization. 4613-4622 - Chuanyue Shen, Letian Zhang, Zhangsihao Yang, Masood Mortazavi, Xiyun Song, Liang Peng, Heather Yu:
Envisioning a Next Generation Extended Reality Conferencing System with Efficient Photorealistic Human Rendering. 4623-4632 - Kartheek Kumar Reddy Nareddy, Mani Madhoolika Bulusu, Praveen Kumar Pokala, Chandra Sekhar Seelamantula:
Quantized Proximal Averaging Networks for Compressed Image Recovery. 4633-4643 - Hichem Sahbi:
Phase-field Models for Lightweight Graph Convolutional Networks. 4644-4650 - Yu-Hui Chen, Raman Sarokin, Juhyun Lee, Jiuqiang Tang, Chuo-Ling Chang, Andrei Kulik, Matthias Grundmann:
Speed Is All You Need: On-Device Acceleration of Large Diffusion Models via GPU-Aware Optimizations. 4651-4655 - Darshan C. Ganji, Saad Ashfaq, Ehsan Saboori, Sudhakar Sah, Saptarshi Mitra, MohammadHossein AskariHemmat, Alexander Hoffman, Ahmed Hassanien, Mathieu Léonardon:
DeepGEMM: Accelerated Ultra Low-Precision Inference on CPU Architectures using Lookup Tables. 4656-4664 - Phuoc-Hoan Charles Le, Xinlin Li:
BinaryViT: Pushing Binary Vision Transformers Towards Convolutional Models. 4665-4674 - Roland Gao:
Rethinking Dilated Convolution for Real-time Semantic Segmentation. 4675-4684 - Souvik Kundu, Yuke Zhang, Dake Chen, Peter A. Beerel:
Making Models Shallow Again: Jointly Learning to Reduce Non-Linearity and Depth for Latency-Efficient Private Inference. 4685-4689 - Haolin Jia, Qifei Wang, Omer Tov, Yang Zhao, Fei Deng, Lu Wang, Chuo-Ling Chang, Tingbo Hou, Matthias Grundmann:
BlazeStyleGAN: A Real-Time On-Device StyleGAN. 4690-4694 - Mayukh Bhattacharyya, Soumitri Chattopadhyay, Sayan Nag:
DeCAtt: Efficient Vision Transformers with Decorrelated Attention Heads. 4695-4699 - Yeonju Ro, Cong Xu, Agnieszka Ciborowska, Suparna Bhattacharya, Frankie Li, Martin Foltin:
Dataset Efficient Training with Model Ensembling. 4700-4704 - Rawwad Alhejaili, Motaz Alfarraj, Hamzah Luqman, Ali Al-Shaikhi:
Recursions Are All You Need: Towards Efficient Deep Unfolding Networks. 4705-4714 - Samir Khaki, Weihan Luo:
CFDP: Common Frequency Domain Pruning. 4715-4724 - Gyudo Park, Soohyeok Kang, Wencan Cheng, Jong Hwan Ko:
Dynamic Inference Acceleration of 3D Point Cloud Deep Neural Networks Using Point Density and Entropy. 4725-4729 - Marina Neseem, Ahmed Agiza, Sherief Reda:
AdaMTL: Adaptive Input-dependent Inference for Efficient Multi-Task Learning. 4730-4739 - Ryu Tadokoro, Ryosuke Yamada, Hirokatsu Kataoka:
Pre-training Auto-generated Volumetric Shapes for 3D Medical Image Segmentation. 4740-4745 - Kai Li, Curtis Wigington, Chris Tensmeyer, Vlad I. Morariu, Handong Zhao, Varun Manjunatha, Nikolaos Barmpalios, Yun Fu:
Improving Cross-Domain Detection with Self-Supervised Learning. 4746-4755 - Giorgos Kordopatis-Zilos, Giorgos Tolias, Christos Tzelepis, Ioannis Kompatsiaris, Ioannis Patras, Symeon Papadopoulos:
Self-Supervised Video Similarity Learning. 4756-4766 - Ashish Sinha, Jonghyun Choi:
MEnsA: Mix-up Ensemble Average for Unsupervised Multi Target Domain Adaptation on 3D Point Clouds. 4767-4777 - Wentao Zhu, Jingya Liu, Yufang Huang:
HNSSL: Hard Negative-Based Self-Supervised Learning. 4778-4787 - Jose Sosa, David C. Hogg:
Self-supervised 3D Human Pose Estimation from a Single Image. 4788-4797 - Qinwei Xu, Ruipeng Zhang, Yiyan Wu, Ya Zhang, Ning Liu, Yanfeng Wang:
SimDE: A Simple Domain Expansion Approach for Single-source Domain Generalization. 4798-4808 - Robin Schön, Katja Ludwig, Rainer Lienhart:
Impact of Pseudo Depth on Open World Object Segmentation with Minimal User Guidance. 4809-4819 - Shaobo Lin, Kun Wang, Xingyu Zeng, Rui Zhao:
An Effective Crop-Paste Pipeline for Few-shot Object Detection. 4820-4828 - Indu Panigrahi, Ryan Manzuk, Adam Maloof, Ruth Fong:
Improving Data-Efficient Fossil Segmentation via Model Editing. 4829-4838 - Robert-Jan Bruintjes, Tomasz Motyka, Jan van Gemert:
What Affects Learned Equivariance in Deep Image Recognition Models? 4839-4847 - Gyungin Shin, Samuel Albanie, Weidi Xie:
Zero-shot Unsupervised Transfer Instance Segmentation. 4848-4858 - Keval Doshi, Yasin Yilmaz:
Zero-Shot Action Recognition with Transformer-based Video Semantic Embedding. 4859-4868 - Tianyu Li, Subhankar Roy, Huayi Zhou, Hongtao Lu, Stéphane Lathuilière:
Contrast, Stylize and Adapt: Unsupervised Contrastive Learning Framework for Domain Adaptive Semantic Segmentation. 4869-4879 - Merey Ramazanova, Victor Escorcia, Fabian Caba Heilbron, Chen Zhao, Bernard Ghanem:
OWL (Observe, Watch, Listen): Audiovisual Temporal Context for Localizing Actions in Egocentric Videos. 4880-4890 - Haixu Long, Xiaolin Zhang, Yanbin Liu, Zongtai Luo, Jianbo Liu:
Mutual Exclusive Modulator for Long-Tailed Recognition. 4891-4900 - Taekyung Kim, Debasmit Das, Seokeon Choi, Minki Jeong, Seunghan Yang, Sungrack Yun, Changick Kim:
Neural Transformation Network to Generate Diverse Views for Contrastive Learning. 4901-4911 - Xiaofei Huang, Lingfei Luan, Elaheh Hatamimajoumerd, Michael Wan, Pooria Daneshvar Kakhaki, Rita Obeid, Sarah Ostadabbas:
Posture-based Infant Action Recognition in the Wild with Very Limited Data. 4912-4921 - Elena Belén Bueno-Benito, Biel Tura Vecino, Mariella Dimiccoli:
Leveraging triplet loss for unsupervised action segmentation. 4922-4930 - Fadoua Khmaissia, Hichem Frigui:
Improving Automatic Target Recognition in Low Data Regime using Semi-Supervised Learning and Generative Data Augmentation. 4931-4939 - Andrew Lu, Xudong Lin, Yulei Niu, Shih-Fu Chang:
In Defense of Structural Symbolic Representation for Video Event-Relation Prediction. 4940-4950 - Hung-Ting Su, Yulei Niu, Xudong Lin, Winston H. Hsu, Shih-Fu Chang:
Language Models are Causal Knowledge Extractors for Zero-shot Video Question Answering. 4951-4960 - Gyungin Shin, Weidi Xie, Samuel Albanie:
NamedMask: Distilling Segmenters from Complementary Foundation Models. 4961-4970 - Deepan Chakravarthi Padmanabhan, Shruthi Gowda, Elahe Arani, Bahram Zonooz:
LSFSL: Leveraging Shape Information in Few-shot Learning. 4971-4980 - Farzad Nozarian, Shashank Agarwal, Farzaneh Rezaeianaran, Danish Shahzad, Atanas Poibrenski, Christian Müller, Philipp Slusallek:
Reliable Student: Addressing Noise in Semi-Supervised 3D Object Detection. 4981-4990 - Kohei Shiba, Yusuke Mukuta, Tatsuya Harada:
Zero-shot Object Classification with Large-scale Knowledge Graph. 4991-4998 - Dani Manjah, Davide Cacciarelli, Mohamed Benkedadra, Baptiste Standaert, Gauthier Rotsart De Hertaing, Benoît Macq, Stéphane Galland, Christophe De Vleeschouwer:
Stream-Based Active Distillation for Scalable Model Deployment. 4999-5007 - Chinmaya Devaraj, Cornelia Fermüller, Yiannis Aloimonos:
Incorporating Visual Grounding In GCN For Zero-shot Learning Of Human Object Interaction Actions. 5008-5017 - Dengsheng Chen, Vince Junkai Tan, Zhilin Lu, Enhua Wu, Jie Hu:
OpenFed: A Comprehensive and Versatile Open-Source Federated Learning Framework. 5018-5026 - Huancheng Chen, Haris Vikalo:
Federated Learning in Non-IID Settings Aided by Differentially Private Synthetic Data. 5027-5036 - Ruisi Cai, Xiaohan Chen, Shiwei Liu, Jayanth Srinivasa, Myungjin Lee, Ramana Kompella, Zhangyang Wang:
Many-Task Federated Learning: A New Problem Setting and A Simple Baseline. 5037-5045 - Pretom Roy Ovi, Emon Dey, Nirmalya Roy, Aryya Gangopadhyay:
Mixed Quantization Enabled Federated Learning to Tackle Gradient Inversion Attacks. 5046-5054 - Donald Shenaj, Marco Toldo, Alberto Rigon, Pietro Zanuttigh:
Asynchronous Federated Continual Learning. 5055-5063 - Tuo Zhang, Lei Gao, Sunwoo Lee, Mi Zhang, Salman Avestimehr:
TimelyFL: Heterogeneity-aware Asynchronous Federated Learning with Adaptive Partial Training. 5064-5073 - Hassan Mkhallati, Anthony Cioppa, Silvio Giancola, Bernard Ghanem, Marc Van Droogenbroeck:
SoccerNet-Caption: Dense Video Captioning for Soccer Broadcasts Commentaries. 5074-5085 - Jan Held, Anthony Cioppa, Silvio Giancola, Abdullah Hamdi, Bernard Ghanem, Marc Van Droogenbroeck:
VARS: Video Assistant Referee System for Automated Soccer Decision Making from Multiple Views. 5086-5097 - Silvio Giancola, Anthony Cioppa, Julia Georgieva, Johsan Billingham, Andreas Serner, Kerry Peek, Bernard Ghanem, Marc Van Droogenbroeck:
Towards Active Learning for Action Spotting in Association Football Videos. 5098-5108 - Tobias Baumgartner, Stefanie Klatt:
Monocular 3D Human Pose Estimation for Sports Broadcasts using Partial Sports Field Registration. 5109-5118 - William J. McNally, Jacob Lambeth, Dustin Brekke:
Combining Physics and Deep Learning Models to Simulate the Flight of a Golf Ball. 5119-5128 - Yang Liu, Luiz G. Hafemann:
A Scale-Invariant Trajectory Simplification Method for Efficient Data Collection in Videos. 5129-5138 - Yu-Hsi Chen, Chien-Yao Wang, Cheng-Yun Yang, Hung-Shuo Chang, Youn-Long Lin, Yung-Yu Chuang, Hong-Yuan Mark Liao:
NeighborTrack: Single Object Tracking by Bipartite Matching with Neighbor Tracklets and Its Applications to Sports. 5139-5148 - Hendrik Hachmann, Bodo Rosenhahn:
Human Spine Motion Capture using Perforated Kinesiology Tape. 5149-5157 - Naga Venkata Sai Raviteja Chappa, Pha A. Nguyen, Alexander H. Nelson, Han-Seok Seo, Xin Li, Page Daniel Dobbs, Khoa Luu:
SPARTAN: Self-supervised Spatiotemporal Transformers Approach to Group Activity Recognition. 5158-5168 - Michael Deyzel, Rensu P. Theart:
One-shot skeleton-based action recognition on strength and conditioning exercises. 5169-5178 - Katja Ludwig, Julian Lorenz, Robin Schön, Rainer Lienhart:
All Keypoints You Need: Detecting Arbitrary Keypoints on the Body of Triple, High, and Long Jump Athletes. 5179-5187 - Matteo Dunnhofer, Luca Sordi, Christian Micheloni:
Visualizing Skiers' Trajectories in Monocular Videos. 5188-5198 - Magnus Ibh, Stella Grasshof, Dan Witzner Hansen, Pascal Madeleine:
TemPose: a new skeleton-based transformer model designed for fine-grained motion recognition in badminton. 5199-5208 - Yash Pandya, Kaustav Nandy, Shivam Agarwal:
Homography based Player Identification in Live Sports. 5209-5218 - Christian Keilstrup Ingwersen, Christian Mikkelstrup, Janus Nørtoft Jensen, Morten Rieger Hannemose, Anders Bjorholm Dahl:
SportsPose - A Dynamic 3D sports pose dataset. 5219-5228 - Farzaneh Askari, Ruixi Jiang, Zhiwei Li, Jiatong Niu, Yuyan Shi, James J. Clark:
Self-Supervised Video Interaction Classification using Image Representation of Skeleton Data. 5229-5238 - Hsiang-Wei Huang, Cheng-Yen Yang, Zhongyu Jiang, Pyong-Kun Kim, Kyoungoh Lee, Kwangju Kim, Samartha Ramkumar, Chaitanya Mullapudi, In-Su Jang, Chung-I Huang, Jenq-Neng Hwang:
Enhancing Multi-Camera People Tracking with Anchor-Guided Clustering and Spatio-Temporal Consistency ID Re-Assignment. 5239-5249 - Liangqi Yuan, Yunsheng Ma, Lu Su, Ziran Wang:
Peer-to-Peer Federated Continual Learning for Naturalistic Driving Action Recognition. 5250-5259 - Wenjie Yang, Zhenyu Xie, Yaoming Wang, Yang Zhang, Xiao Ma, Bing Hao:
Integrating Appearance and Spatial-Temporal Information for Multi-Camera People Tracking. 5260-5269 - Rong-Chang Li, Cong Wu, Linze Li, Zhongwei Shen, Tianyang Xu, Xiaojun Wu, Xi Li, Jiwen Lu, Josef Kittler:
Action Probability Calibration for Efficient Naturalistic Driving Action Localization. 5270-5277 - Yichen Cai, Aoran Jiao:
DACNet: A Deep Automated Checkout Network with Selective Deblurring. 5278-5286 - Yunsheng Ma, Liangqi Yuan, Amr Abdelraouf, Kyungtae Han, Rohit Gupta, Zihao Li, Ziran Wang:
M2DAR: Multi-View Multi-Scale Driver Action Recognition with Vision Transformer. 5287-5294 - Pirazh Khorramshahi, Vineet Shenoy, Rama Chellappa:
Robust and Scalable Vehicle Re-Identification via Self-Supervision. 5295-5304 - Munkhjargal Gochoo, Munkh-Erdene Otgonbold, Erkhembayar Ganbold, Jun-Wei Hsieh, Ming-Ching Chang, Ping-Yang Chen, Byambaa Dorj, Hamad Al Jassmi, Ganzorig Batnasan, Fady Alnajjar, Mohammed Abduljabbar, Fang-Pang Lin:
FishEye8K: A Benchmark and Dataset for Fisheye Camera Object Detection. 5305-5313 - Hamam Mokayed, Amirhossein Nayebiastaneh, Kanjar De, Stergios Sozos, Olle Hagner, Björn Backe:
Nordic Vehicle Dataset (NVD): Performance of vehicle detectors using newly captured NVD from UAV in different snowy weather conditions. 5314-5322 - Carlos Gómez Huélamo, Marcos V. Conde, Rafael Barea, Luis Miguel Bergasa:
Improving Multi-Agent Motion Prediction with Heuristic Goals and Motion Refinement. 5323-5332 - Long Hoang Pham, Duong Nguyen-Ngoc Tran, Huy-Hung Nguyen, Hyung-Joon Jeon, Tai Huu-Phuong Tran, Hyung-Min Jeon, Jae Wook Jeon:
Improving Deep Learning-based Automatic Checkout System Using Image Enhancement Techniques. 5333-5340 - Duong Nguyen-Ngoc Tran, Long Hoang Pham, Hyung-Joon Jeon, Huy-Hung Nguyen, Hyung-Min Jeon, Tai Huu-Phuong Tran, Jae Wook Jeon:
Robust Automatic Motorcycle Helmet Violation Detection for an Intelligent Transportation System. 5341-5349 - Armstrong Aboah, Bin Wang, Ulas Bagci, Yaw Adu-Gyamfi:
Real-time Multi-Class Helmet Violation Detection Using Few-Shot Data Sampling Technique and YOLOv8. 5350-5358 - Armstrong Aboah, Ulas Bagci, Abdul Rashid Mussah, Neema Jakisa Owor, Yaw Adu-Gyamfi:
DeepSegmenter: Temporal Action Localization for Detecting Anomalies in Untrimmed Naturalistic Driving Videos. 5359-5365 - Chun-Ming Tsai, Jun-Wei Hsieh, Ming-Ching Chang, Guan-Lin He, Ping-Yang Chen, Wei-Tsung Chang, Yi-Kuan Hsieh:
Video Analytics for Detecting Motorcyclist Helmet Rule Violations. 5366-5374 - Wei Zhou, Yinlong Qian, Zequn Jie, Lin Ma:
Multi View Action Recognition for Distracted Driver Behavior Localization. 5375-5380 - Viet Hung Duong, Quang Huy Tran, Huu Si Phuc Nguyen, Duc Quyen Nguyen, Tien Cuong Nguyen:
Helmet Rule Violation Detection for Motorcyclists using a Custom Tracking Framework and Advanced Object Detection Techniques. 5381-5390 - Ziqiang Shi, Zhongling Liu, Liu Liu, Rujie Liu, Takuma Yamamoto, Xiaoyu Mi, Daisuke Uchida:
CheckSORT: Refined Synthetic Data Combination and Optimized SORT for Automatic Retail Checkout. 5391-5398 - Yuntae Jeon, Dai Quoc Tran, Minsoo Park, Seunghee Park:
Leveraging Future Trajectory Prediction for Multi-Camera People Tracking. 5399-5408 - Bach Hoang Ngo, Dat Thanh Nguyen, Nhat-Tuong Do-Tran, Phuc Pham Huy Thien, Minh-Hung An, Tuan-Ngoc Nguyen, Loi Nguyen Hoang, Vinh Dinh Nguyen, Quang-Vinh Dinh:
Comprehensive Visual Features and Pseudo Labeling for Robust Natural Language-based Vehicle Retrieval. 5409-5418 - Dong Xie, Linhu Liu, Shengjun Zhang, Jiang Tian:
A Unified Multi-modal Structure for Retrieving Tracked Vehicles through Natural Language Descriptions. 5419-5427 - Huy Duong Le, Minh Quan Vu, Manh Tung Tran, Nguyen Van Phuc:
Triplet Temporal-based Video Recognition with Multiview for Temporal Action Localization. 5428-5434 - Xiaodong Dong, Ruijie Zhao, Hao Sun, Dong Wu, Jin Wang, Xuyang Zhou, Jiang Liu, Shun Cui, Zhongjiang He:
Multi-Attention Transformer for Naturalistic Driving Action Recognition. 5435-5441 - Andreas Specker, Jürgen Beyerer:
ReidTrack: Reid-only Multi-target Multi-camera Tracking. 5442-5452 - Erkut Akdag, Zeqi Zhu, Egor Bondarev, Peter H. N. de With:
Transformer-based Fusion of 2D-pose and Spatio-temporal Embeddings for Distracted Driver Action Recognition. 5453-5462 - Jeongho Kim, Wooksu Shin, Hancheol Park, Jong-Won Baek:
Addressing the Occlusion Problem in Multi-Camera People Tracking with Human Pose Estimation. 5463-5469 - Shun Cui, Tiantian Zhang, Hao Sun, Xuyang Zhou, Wenqing Yu, Aigong Zhen, Qihang Wu, Zhongjiang He:
An Effective Motorcycle Helmet Object Detection Framework for Intelligent Traffic Safety. 5470-5476 - Bor-Shiun Wang, Ping-Yang Chen, Yi-Kuan Hsieh, Jun-Wei Hsieh, Ming-Ching Chang, JiaXin He, Shin-You Teng, HaoYuan Yue, Yu-Chee Tseng:
PRB-FPN+: Video Analytics for Enforcing Motorcycle Helmet Laws. 5477-5485 - Zeliang Ma, Delong Liu, Zhe Cui, Yanyun Zhao:
AdaptCD: An Adaptive Target Region-based Commodity Detection System. 5486-5495 - Quang Qui-Vinh Nguyen, Huy Dinh-Anh Le, Truc Thi-Thanh Chau, Duc-Tuan Luu, Nhat Minh Chung, Synh Viet-Uyen Ha:
Multi-camera People Tracking With Mixture of Realistic and Synthetic Knowledge. 5496-5506 - Anudeep Dhonde, Prabhudev Guntur, Vinitha Palani:
Adaptive RoI with pretrained models for Automated Retail Checkout. 5507-5510 - Huy Dinh-Anh Le, Quang Qui-Vinh Nguyen, Duc Trung Luu, Truc Thi-Thanh Chau, Nhat Minh Chung, Synh Viet-Uyen Ha:
Tracked-Vehicle Retrieval by Natural Language Descriptions with Multi-Contextual Adaptive Knowledge. 5511-5519 - Zongyi Li, Runsheng Wang, He Li, Bohao Wei, Yuxuan Shi, Hefei Ling, Jiazhong Chen, Boyuan Liu, Zhongyang Li, Hanqing Zheng:
Hierarchical Clustering and Refinement for Generalized Multi-Camera Person Tracking. 5520-5529 - Arpita Vats, David C. Anastasiu:
Enhancing Retail Checkout through Video Inpainting, YOLOv8 Detection, and DeepSort Tracking. 5530-5537 - Milind Naphade, Shuo Wang, David C. Anastasiu, Zheng Tang, Ming-Ching Chang, Yue Yao, Liang Zheng, Mohammed Shaiqur Rahman, Meenakshi S. Arya, Anuj Sharma, Qi Feng, Vitaly Ablavsky, Stan Sclaroff, Pranamesh Chakraborty, Sanjita Prajapati, Alice Li, Shangru Li, Krishna Kunadharaju, Shenxin Jiang, Rama Chellappa:
The 7th AI City Challenge. 5538-5548 - Weiling Chen, Keng Teck Ma, Zi Jian Yew, Minhoe Hur, David Aik-Aun Khoo:
TEVAD: Improved video anomaly detection with captions. 5549-5559 - Arushi Rai, Adriana Kovashka:
Improving language-supervised object detection with linguistic structure analysis. 5560-5570 - Muah Seol, Jonghee Kim, Jinyoung Moon:
BMRN: Boundary Matching and Refinement Network for Temporal Moment Localization with Natural Language. 5571-5579 - Shamanthak Hegde, Soumya Jahagirdar, Shankar Gangisetty:
Making the V in Text-VQA Matter. 5580-5588 - Charani Alampalle, Shamanthak Hegde, Soumya Jahagirdar, Shankar Gangisetty:
Weakly Supervised Visual Question Answer Generation. 5589-5597 - Ahmed Sabir, Francesc Moreno-Noguer, Lluís Padró:
Visual Semantic Relatedness Dataset for Image Captioning. 5598-5606 - Maria Parelli, Alexandros Delitzas, Nikolas Hars, Georgios Vlassis, Sotiris Anagnostidis, Gregor Bachmann, Thomas Hofmann:
CLIP-Guided Vision-Language Pre-training for Question Answering in 3D Scenes. 5607-5612 - Jonghee Kim, Youngwan Lee, Jinyoung Moon:
T2V2T: Text-to-Video-to-Text Fusion for Text-to-Video Retrieval. 5613-5618 - Tejas Srinivasan, Xiang Ren, Jesse Thomason:
Curriculum Learning for Data-Efficient Vision-Language Alignment. 5619-5624 - Laia Tarrés, Gerard I. Gállego, Amanda Cardoso Duarte, Jordi Torres, Xavier Giró-i-Nieto:
Sign Language Translation from Instructional Videos. 5625-5635 - Meghna Kapoor, Suvam Patra, Badri Narayan Subudhi, Vinit Jakhetiya, Ankur Bansal:
Underwater Moving Object Detection using an End-to-End Encoder-Decoder Architecture and GraphSage with Aggregator and Refactoring. 5636-5645 - Deblina Bhattacharjee, Sabine Süsstrunk, Mathieu Salzmann:
Dense Multitask Learning to Reconfigure Comics. 5646-5655 - Maryam Daniali, Edward Kim:
Perception Over Time: Temporal Dynamics for Robust Image Understanding. 5656-5665 - Zoya Shafique, Haiyan Wang, Yingli Tian:
Nonverbal Communication Cue Recognition: A Pathway to More Accessible Communication. 5666-5674 - Sudha Velusamy, Rakesh Radarapu, Anandavardhan Hegde, Narayan Kothari:
A Light-Weight Human Eye Fixation Solution for Smartphone Applications. 5675-5680 - Bokyeung Lee, Hyunuk Shin, Bonhwa Ku, Hanseok Ko:
Frame Level Emotion Guided Dynamic Facial Expression Recognition with Emotion Grouping. 5681-5691 - Dexter Neo, Tsuhan Chen, Stefan Winkler:
Large-Scale Facial Expression Recognition Using Dual-Domain Affect Fusion for Noisy Labels. 5692-5700 - Fanglei Xue, Yifan Sun, Yi Yang:
Exploring Expression-related Self-supervised Learning and Spatial Reserve Pooling for Affective Behaviour Analysis. 5701-5708 - Sanghwa Hong, Jin-Woo Jeong:
Dynamic Noise Injection for Facial Expression Recognition In-the-Wild. 5709-5715 - Andrey V. Savchenko:
EmotiEffNets for Facial Processing in Video-based Valence-Arousal Prediction, Expression Classification and Action Unit Detection. 5716-5724 - Ziyang Zhang, Liuwei An, Zishun Cui, Ao Xu, Tengteng Dong, Yueqi Jiang, Jingyi Shi, Xin Liu, Xiao Sun, Meng Wang:
ABAW5 Challenge: A Facial Affect Recognition Approach Utilizing Transformer Encoder and Audiovisual Fusion. 5725-5734 - Ximan Li, Weihong Deng, Shan Li, Yong Li:
Compound Expression Recognition In-the-wild with AU-assisted Meta Multi-task Learning. 5735-5744 - Panagiotis Paraskevas Filntisis, George Retsinas, Foivos Paraperas Papantoniou, Athanasios Katsamanis, Anastasios Roussos, Petros Maragos:
SPECTRE: Visual Speech-Informed Perceptual 3D Facial Expression Reconstruction from Videos. 5745-5755 - Weiwei Zhou, Jiada Lu, Zhaolong Xiong, Weifeng Wang:
Leveraging TCN and Transformer for effective visual-audio fusion in continuous emotion recognition. 5756-5763 - Su Zhang, Ziyuan Zhao, Cuntai Guan:
Multimodal Continuous Emotion Recognition: A Technical Report for ABAW5. 5764-5769 - Vu Ngoc Tu, Van Thong Huynh, Trong Nghia Nguyen, Soo-Hyung Kim:
Ensemble Spatial and Temporal Vision Transformer for Action Units Detection. 5770-5776 - Feng Qiu, Bowen Ma, Wei Zhang, Yu Ding:
Multi-modal Emotion Reaction Intensity Estimation with Temporal Augmentation. 5777-5784 - Jun Yu, Renda Li, Zhongpeng Cai, Gongpeng Zhao, Guochen Xie, Jichao Zhu, Wangyuan Zhu, Qiang Ling, Lei Wang, Cong Wang, Luyu Qiu, Wei Zheng:
Local Region Perception and Relationship Learning Combined with Feature Fusion for Facial Action Unit Detection. 5785-5792 - Wei Zhang, Bowen Ma, Feng Qiu, Yu Ding:
Multi-modal Facial Affective Analysis based on Masked Autoencoder. 5793-5802 - Jun Yu, Zhongpeng Cai, Renda Li, Gongpeng Zhao, Guochen Xie, Jichao Zhu, Wangyuan Zhu, Qiang Ling, Lei Wang, Cong Wang, Luyu Qiu, Wei Zheng:
Exploring Large-scale Unlabeled Faces to Enhance Facial Expression Recognition. 5803-5810 - Jun Yu, Jichao Zhu, Wangyuan Zhu, Zhongpeng Cai, Guochen Xie, Renda Li, Gongpeng Zhao, Qiang Ling, Lei Wang, Cong Wang, Luyu Qiu, Wei Zheng:
A Dual Branch Network for Emotional Reaction Intensity Estimation. 5811-5818 - Ankith Jain Rakesh Kumar, Bir Bhanu:
Relational Edge-Node Graph Attention Network for Classification of Micro-Expressions. 5819-5828 - Joao Palotti, Gagan Narula, Lekan Raheem, Herbert Bay:
Analysis of Emotion Annotation Strength Improves Generalization in Speech Emotion Recognition Models. 5829-5837 - Jia Li, Yin Chen, Xuesong Zhang, Jiantao Nie, Ziqiang Li, Yangchen Yu, Yan Zhang, Richang Hong, Meng Wang:
Multimodal Feature Extraction and Fusion for Emotional Reaction Intensity Estimation and Expression Classification in Videos with Transformers. 5838-5844 - Aboli Marathe, Sanjana Prabhu:
t-RAIN: Robust generalization under weather-aliasing label shift attacks. 5845-5854 - Yuanyuan Deng, Xiaolong Liu, Liyu Meng, Wenqiang Jiang, Youqiang Dong, Chuanhe Liu:
Multi-modal Information Fusion for Action Unit Detection in the Wild. 5855-5862 - Xiaolong Liu, Lei Sun, Wenqiang Jiang, Fengyuan Zhang, Yuanyuan Deng, Zhaopei Huang, Liyu Meng, Yuchen Liu, Chuanhe Liu:
EVAEF: Ensemble Valence-Arousal Estimation Framework in the Wild. 5863-5871 - Chuanhe Liu, Xinjie Zhang, Xiaolong Liu, Tenggan Zhang, Liyu Meng, Yuchen Liu, Yuanyuan Deng, Wenqiang Jiang:
Facial Expression Recognition Based on Multi-modal Features for Videos in the Wild. 5872-5879 - Song Tong, Jingyi Duan, Xuefeng Liang, Takatsune Kumada, Kaiping Peng, Ryoichi Nakashima:
Inferring Affective Experience from the Big Picture Metaphor: A Two-dimensional Visual Breadth Model. 5880-5888 - Dimitrios Kollias, Panagiotis Tzirakis, Alice Baird, Alan Cowen, Stefanos Zafeiriou:
ABAW: Valence-Arousal Estimation, Expression Recognition, Action Unit Detection & Emotional Reaction Intensity Estimation Challenges. 5889-5898 - Zihan Wang, Siyang Song, Cheng Luo, Yuzhi Zhou, Shiling Wu, Weicheng Xie, Linlin Shen:
Spatial-Temporal Graph-Based AU Relationship Learning for Facial Action Unit Detection. 5899-5907 - Sridhar Sola, Darshan Gera:
Unmasking Your Expression: Expression-Conditioned GAN for Masked Face Inpainting. 5908-5916 - Onur Cezmi Mutlu, Mohammadmahdi Honarmand, Saimourya Surabhi, Dennis P. Wall:
TempT: Temporal consistency for Test-time adaptation. 5917-5923 - Bowen Ma, Wei Zhang, Feng Qiu, Yu Ding:
A Unified Approach to Facial Affect Analysis: the MAE-Face Visual Representation. 5924-5933 - Yini Fang, Liang Wu, Frederic Jumelle, Bertram E. Shi:
Integrating Holistic and Local Information to Estimate Emotional Reaction Intensity. 5934-5939 - Jonathan F. Carter, João Jorge, Bindia Venugopal, Oliver Gibson, Lionel Tarassenko:
Deep Learning-Enabled Sleep Staging From Vital Signs and Activity Measured Using a Near-Infrared Video Camera. 5940-5949 - Uldis Rubins, Aleksejs Miscuks, Yousef K. Qawqzeh, Zbignevs Marcinkevics, Andris Grabovskis:
Photoplethysmography imaging algorithm for real-time monitoring of skin perfusion maps. 5950-5956 - Lieke Dorine van Putten, Kate Emily Bamford:
Improving Systolic Blood Pressure Prediction from Remote Photoplethysmography Using a Stacked Ensemble Regressor. 5957-5964 - Fulan Li, Surendrabikram Thapa, Shreyas Bhat, Abhijit Sarkar, A. Lynn Abbott:
A Temporal Encoder-Decoder Approach to Extracting Blood Volume Pulse Signal Morphology from Face Videos. 5965-5974 - Yogesh Deshpande, Surendrabikram Thapa, Abhijit Sarkar, A. Lynn Abbott:
Camera-based Recovery of Cardiovascular Signals from Unconstrained Face Videos using an Attention Network. 5975-5984 - Nathan Vance, Jeremy Speth, Benjamin Sporrer, Patrick J. Flynn:
Promoting Generalization in Cross-Dataset Remote Photoplethysmography. 5985-5993 - Lu Niu, Jeremy Speth, Nathan Vance, Benjamin Sporrer, Adam Czajka, Patrick J. Flynn:
Full-Body Cardiovascular Sensing with Remote Photoplethysmography. 5994-6004 - Zimeng Liu, Bin Huang, Chun-Liang Lin, Chieh-Liang Wu, Changchen Zhao, Wen-Cheng Chao, Yu-Cheng Wu, Yadan Zheng, Zhiru Wang:
Contactless Respiratory Rate Monitoring For ICU Patients Based On Unsupervised Learning. 6005-6014 - Jun Seong Lee, Gyutae Hwang, Moonwook Ryu, Sang Jun Lee:
LSTC-rPPG: Long Short-Term Convolutional Network for Remote Photoplethysmography. 6015-6023 - Iskander Zhalbekov, Leonid Beynenson, Alexey Trushkov, Ivan Bulychev, Wenshuai Yin:
Frequency Tracker for Unsupervised Heart Rate Estimation. 6024-6033 - Seunghyun Kim, Kunyoung Lee, Eui Chul Lee:
Multi-View Body Image-Based Prediction of Body Mass Index and Various Body Part Sizes. 6034-6041 - Natalia Kowalczyk, Jacek Ruminski:
Respiratory Rate Estimation Based on Detected Mask Area in Thermal Images. 6042-6051 - Huaijing Shu, Lirong Ren, Liping Pan, Dongmin Huang, Hongzhou Lu, Wenjin Wang:
Single Image based Infant Body Height and Weight Estimation. 6052-6059 - Haowen Wang, Weijun Huang, Jia Huang, Guowei Wang, Hongzhou Lu, Wenjin Wang:
Camera based Eye State Estimation for ICU Patients: A Pilot Clinical Study. 6060-6067 - Chu Chu Qiu, Jing Wei Chin, Kwan Long Wong, Tsz Tai Chan, Yudong He, Richard Hau Yue So:
Remote mass facial temperature screening in varying ambient temperatures and distances. 6068-6076 - Shutao Chen, Sui Kei Ho, Jing Wei Chin, Kin Ho Luo, Tsz Tai Chan, Richard Hau Yue So, Kwan Long Wong:
Deep learning-based image enhancement for robust remote photoplethysmography in various illumination scenarios. 6077-6085 - Ismoil Odinaev, Jing Wei Chin, Kin Ho Luo, Zhang Ke, Richard Hau Yue So, Kwan Long Wong:
Optimizing Camera Exposure Control Settings for Remote Vital Sign Measurements in Low-Light Environments. 6086-6093 - Patrik Hansen, Marianela García Lozano, Farzad Kamrani, Joel Brynielsson:
Real-Time Estimation of Heart Rate in Situations Characterized by Dynamic Illumination using Remote Photoplethysmography. 6094-6103 - Fuxiang Huang, Lei Zhang:
Language Guided Local Infiltration for Interactive Image Retrieval. 6104-6113 - Menelaos Kanakis, Simon Maurer, Matteo Spallanzani, Ajad Chhatkuli, Luc Van Gool:
ZippyPoint: Fast Interest Point Detection, Description, and Matching through Mixed Precision Discretization. 6114-6123 - Han Zou, Liang Xu, Takayuki Okatani:
Geometry Enhanced Reference-based Image Super-resolution. 6124-6133 - Christiano Couto Gava, Vishal Mukunda, Tewodros Habtegebrial, Federico Raue, Sebastian Palacio, Andreas Dengel:
SphereGlue: Learning Keypoint Matching on High Resolution Spherical Images. 6134-6144 - JongMin Lee, Eunhyeok Park, Sungjoo Yoo:
Multi-scale Local Implicit Keypoint Descriptor for Keypoint Matching. 6145-6154 - Giovanni Barbarani, Mohamad Mostafa, Hajali Bayramov, Gabriele Trivigno, Gabriele Moreno Berton, Carlo Masone, Barbara Caputo:
Are Local Features All You Need for Cross-Domain Visual Place Recognition? 6155-6165 - Chia-Hui Wang, Yu-Chee Tseng, Ting-Hui Chiang, Yan-Ann Chen:
Learning Multi-scale Representations with Single-stream Network for Video Retrieval. 6166-6176 - Jiahao Chang, Jiahuan Yu, Tianzhu Zhang:
Structured Epipolar Matcher for Local Feature Matching. 6177-6186 - Amogh Tiwari, Pranav Manu, Nakul Rathore, Astitva Srivastava, Avinash Sharma:
ConVol-E: Continuous Volumetric Embeddings for Human-Centric Dense Correspondence Estimation. 6187-6195 - Alex Stoken, Kenton Fisher:
Find My Astronaut Photo: Automated Localization and Georectification of Astronaut Photography. 6196-6205 - Alexander Avery, Andreas E. Savakis:
DeepRM: Deep Recurrent Matching for 6D Pose Refinement. 6206-6214 - Nikolaos Zioulis, James F. O'Brien:
KBody: Towards general, robust, and aligned monocular whole-body estimation. 6215-6225 - Gee-Sern Hsu, Yu-Hong Lin, Chin-Cheng Chang:
Pretrained Pixel-Aligned Reference Network for 3D Human Reconstruction. 6226-6234 - Xiaoqi Wang, Yaojun Wang, Jingbo Zhao, Jing Niu:
ECA-ConvNeXt: A Rice Leaf Disease Identification Model Based on ConvNeXt. 6235-6243 - Lukas Meyer, Andreas Gilson, Oliver Scholz, Marc Stamminger:
CherryPicker: Semantic Skeletonization and Topological Reconstruction of Cherry Trees. 6244-6253 - Farah Saeed, Jin Sun, Peggy Ozias-Akins, Ye Juliet Chu, Changying Charlie Li:
PeanutNeRF: 3D Radiance Field for Peanuts. 6254-6263 - George Retsinas, Niki Efthymiou, Petros Maragos:
Mushroom Segmentation and 3D Pose Estimation from Point Clouds using Fully Convolutional Geometric Features and Implicit Pose Encoding. 6264-6271 - Shivam K. Panda, Yongkyu Lee, Mohammad Khalid Jawed:
Agronav: Autonomous Navigation Framework for Agricultural Robots and Vehicles using Semantic Segmentation and Semantic Line Detection. 6272-6281 - Dafni Anagnostopoulou, George Retsinas, Niki Efthymiou, Panayiotis Paraskevas Filntisis, Petros Maragos:
A Realistic Synthetic Mushroom Scenes Dataset. 6282-6289 - Diogo Nunes Gonçalves, José Marcato Jr., Pedro Zamboni, Hemerson Pistori, Jonathan Li, Keiller Nogueira, Wesley Nunes Gonçalves:
MTLSegFormer: Multi-task Learning with Transformers for Semantic Segmentation in Precision Agriculture. 6290-6298 - Raiyan Rahman, Christopher Indris, Tianxiao Zhang, Kaidong Li, Brian McCornack, Daniel Flippo, Ajay Sharda, Guanghui Wang:
On the Real-Time Semantic Segmentation of Aphid Clusters in the Wild. 6299-6306 - Jiachen Li, Ali Hassani, Steven Walton, Humphrey Shi:
ConvMLP: Hierarchical Convolutional MLPs for Vision. 6307-6316 - Nanxuan Zhao, Jianbo Jiao, Weidi Xie, Dahua Lin:
Cali-NCE: Boosting Cross-modal Video Representation Learning with Calibrated Alignment. 6317-6327 - Yifeng Shi, Feng Lv, Xinliang Wang, Chunlong Xia, Shaojie Li, Shujie Yang, Teng Xi, Gang Zhang:
Open-TransMind: A New Baseline and Benchmark for 1st Foundation Model Challenge of Intelligent Transportation. 6328-6335 - Ajian Liu, Zichang Tan, Yanyan Liang, Jun Wan:
Attack-Agnostic Deep Face Anti-Spoofing. 6336-6345 - Zitong Yu, Ajian Liu, Chenxu Zhao, Kevin H. M. Cheng, Xu Cheng, Guoying Zhao:
Flexible-Modal Face Anti-Spoofing: A Benchmark. 6346-6351 - Yongluo Liu, Yaowen Xu, Zhaofan Zou, Zhuming Wang, Bowen Zhang, Lifang Wu, Zhizhi Guo, Zhixiang He:
Adversarial Domain Generalization for Surveillance Face Anti-Spoofing. 6352-6360 - Hao Fang, Ajian Liu, Jun Wan, Sergio Escalera, Hugo Jair Escalante, Zhen Lei:
Surveillance Face Presentation Attack Detection Challenge. 6361-6371 - Keyao Wang, Mouxiao Huang, Guosheng Zhang, Haixiao Yue, Gang Zhang, Yu Qiao:
Dynamic Feature Queue for Surveillance Face Anti-spoofing via Progressive Training. 6372-6379 - Dong Wang, Jia Guo, Qiqi Shao, Haochi He, Zhian Chen, Chuanbao Xiao, Ajian Liu, Sergio Escalera, Hugo Jair Escalante, Zhen Lei, Jun Wan, Jiankang Deng:
Wild Face Anti-Spoofing Challenge 2023: Benchmark and Results. 6380-6391 - Yoanna Martínez-Díaz, Heydi Méndez-Vázquez, Luis S. Luevano, Miguel González-Mendoza:
Exploring the Effectiveness of Lightweight Architectures for Face Anti-Spoofing. 6392-6402 - Dingheng Zeng, Liang Gao, Hao Fang, Guohui Xiang, Yue Feng, Quan Lu:
Bandpass Filter Based Dual-stream Network for Face Anti-spoofing. 6403-6410 - Jingrui Yu, Tobias Scheck, Roman Seidel, Yukti Adya, Dipankar Nandi, Gangolf Hirtz:
Human Pose Estimation in Monocular Omnidirectional Top-View Images. 6411-6420 - Jingrui Yu, Ana Cecilia Pérez Grassi, Gangolf Hirtz:
Applications of Deep Learning for Top-View Omnidirectional Imaging: A Survey. 6421-6433 - Hao Shi, Yu Li, Kailun Yang, Jiaming Zhang, Kunyu Peng, Alina Roitberg, Yaozu Ye, Huajian Ni, Kaiwei Wang, Rainer Stiefelhagen:
FishDreamer: Towards Fisheye Semantic Completion via Unified Image Outpainting and Segmentation. 6434-6444 - Bruno Berenguel-Baeta, Antoine N. André, Guillaume Caron, Jesus Bermudez-Cameo, Josechu J. Guerrero:
Visual Gyroscope: Combination of Deep Learning Features and Direct Alignment for Panoramic Stabilization. 6445-6448 - Hengzhi Zhang, Hong Yi, Haijing Jia, Wei Wang, Makoto Odamaki:
PanoPoint: Self-Supervised Feature Points Detection and Description for 360° Panorama. 6449-6458 - Negar Nejatishahidin, Will Hutchcroft, Manjunath Narayana, Ivaylo Boyadzhiev, Yuguang Li, Naji Khosravan, Jana Kosecká, Sing Bing Kang:
Graph-CoVis: GNN-based Multi-view Panorama Global Pose Estimation. 6459-6468 - Jheng-Wei Su, Chi-Han Peng, Peter Wonka, Hung-Kuo Chu:
GPR-Net: Multi-view Layout Estimation via a Geometry-aware Panorama Registration Network. 6469-6478 - Louis Gallagher, Ganesh Sistu, Jonathan Horgan, John B. McDonald:
A System for Dense Monocular Mapping with a Fisheye Camera. 6479-6487 - Siddharth Ravi, Pau Climent-Pérez, Théo Morales, Carlo Huesca-Spairani, Kooshan Hashemifard, Francisco Flórez-Revuelta:
ODIN: An OmniDirectional INdoor dataset capturing Activities of Daily Living from multiple synchronized modalities. 6488-6497 - Marcela Mera-Trujillo, Shivang Patel, Yu Gu, Gianfranco Doretto:
Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images. 6498-6507 - Liyuan Zhu, Yuru Jia, Shengyu Huang, Nicholas Meyer, Andreas Wieser, Konrad Schindler, Jordan Aaron:
DeFlow: Self-supervised 3D Motion Estimation of Debris Flow. 6508-6517 - Ewelina Rupnik, Marc Pierrot-Deseilligny:
Pointless Global Bundle Adjustment With Relative Motions Hessians. 6518-6526 - Teng Wu, Bruno Vallet, Marc Pierrot-Deseilligny:
PSMNet-FusionX3: LiDAR-Guided Deep Learning Stereo Dense Matching On Aerial Images. 6527-6536 - Abhisek Maiti, Sander Oude Elberink, George Vosselman:
TransFusion: Multi-modal Fusion Network for Semantic Segmentation. 6537-6547 - Olaf Wysocki, Yan Xia, Magdalena Wysocki, Eleonora Grilli, Ludwig Hoegner, Daniel Cremers, Uwe Stilla:
Scan2LoD3: Reconstructing semantic 3D building models at LoD3 using ray casting and Bayesian networks. 6548-6558 - Weihang Ran, Wei Yuan, Ryosuke Shibasaki:
Few-Shot Depth Completion Using Denoising Diffusion Probabilistic Model. 6559-6567 - Maryam Jameela, Gunho Sohn, Sunghwan Yoo:
Fusion-SUNet: Spatial Layout Consistency for 3D Semantic Segmentation. 6568-6576 - Sunghwan Yoo, Yeonjeong Jeong, Maryam Jameela, Gunho Sohn:
Human Vision Based 3D Point Cloud Semantic Segmentation of Large-Scale Outdoor Scenes. 6577-6586 - Tianshu Kuai, Akash Karthikeyan, Yash Kant, Ashkan Mirzaei, Igor Gilitschenski:
CAMM: Building Category-Agnostic and Animatable 3D Models from Monocular Videos. 6587-6597 - Erik C. M. Johnson, Marc Habermann, Soshi Shimada, Vladislav Golyanik, Christian Theobalt:
Unbiased 4D: Monocular 4D Reconstruction with a Neural Deformation Model. 6598-6607 - Abed Malti:
Robust Monocular 3D Human Motion with Lasso-Based Differential Kinematics. 6608-6618 - Haidong Zhu, Zhaoheng Zheng, Wanrong Zheng, Ram Nevatia:
CAT-NeRF: Constancy-Aware Tx2Former for Dynamic Body Modeling. 6619-6628