WACV 2024: Waikoloa, HI, USA
- IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2024, Waikoloa, HI, USA, January 3-8, 2024. IEEE 2024, ISBN 979-8-3503-1892-0
- Piyush Arora, Pratik Mazumder:
Hybrid Sample Synthesis-based Debiasing of Classifier in Limited Data Setting. i-ix - Yining Ding, Andrew M. Wallace, Sen Wang:
Estimating Fog Parameters from an Image Sequence using Non-linear Optimisation. i-ix - Alon Shoshan, Ori Linial, Nadav Bhonker, Elad Hirsch, Lior Zamir, Igor Kviatkovsky, Gérard G. Medioni:
Asymmetric Image Retrieval with Cross Model Compatible Ensembles. 1-11 - Sai Aparna Aketi, Kaushik Roy:
Cross-feature Contrastive Loss for Decentralized Deep Learning on Heterogeneous Data. 12-21 - Suhas Srinath, Shankhanil Mitra, Shika Rao, Rajiv Soundararajan:
Learning Generalizable Perceptual Representations for Data-Efficient No-Reference Image Quality Assessment. 22-31 - Jayateja Kalla, Soma Biswas:
Robust Feature Learning and Global Variance-Driven Classifier Alignment for Long-Tail Class Incremental Learning. 32-41 - Amirhossein Dadashzadeh, Shuchao Duan, Alan L. Whone, Majid Mirmehdi:
PECoP: Parameter Efficient Continual Pretraining for Action Quality Assessment. 42-52 - Pierpaolo Morì, Lukas Frickenstein, Shambhavi Balamuthu Sampath, Moritz Thoma, Nael Fasfous, Manoj Rohit Vemparala, Alexander Frickenstein, Christian Unger, Walter Stechele, Daniel Mueller-Gritschneder, Claudio Passerone:
Wino Vidi Vici: Conquering Numerical Instability of 8-bit Winograd Convolution for Accurate Inference Acceleration on Edge. 53-62 - Sahil Singla, Atoosa Malemir Chegini, Mazda Moayeri, Soheil Feizi:
Data-Centric Debugging: mitigating model failures via targeted image retrieval. 63-74 - Jinfeng Wang, Sifan Song, Jionglong Su, S. Kevin Zhou:
Distortion-Disentangled Contrastive Learning. 75-85 - Xuwei Xu, Sen Wang, Yudong Chen, Yanping Zheng, Zhewei Wei, Jiajun Liu:
GTP-ViT: Efficient Vision Transformers via Graph-based Token Propagation. 86-95 - Khanh-Binh Nguyen:
SequenceMatch: Revisiting the design of weak-strong augmentations for Semi-supervised learning. 96-105 - Saurabh Kumar Jain, Sukhendu Das:
Stochastic Binary Network for Universal Domain Adaptation. 106-115 - Mathilde Caron, Neil Houlsby, Cordelia Schmid:
Location-Aware Self-Supervised Transformers for Semantic Segmentation. 116-126 - Kilian Batzner, Lars Heckler, Rebecca König:
EfficientAD: Accurate Visual Anomaly Detection at Millisecond-Level Latencies. 127-137 - Seungwook Kim, Juhong Min, Minsu Cho:
Efficient Semantic Matching with Hypercolumn Correlation. 138-147 - Jie Zhang, Masanori Suganuma, Takayuki Okatani:
Contextual Affinity Distillation for Image Anomaly Detection. 148-157 - Hojin Kim, Seunghun Lee, Hyeon Kang, Sunghoon Im:
Offline-to-Online Knowledge Distillation for Video Instance Segmentation. 158-167 - Yanda Li, Zilong Huang, Gang Yu, Ling Chen, Yunchao Wei, Jianbo Jiao:
Disentangled Pre-training for Image Matting. 168-177 - Ziqiang Shi, Rujie Liu:
Conditional Velocity Score Estimation for Image Restoration. 178-187 - Lorenzo Agnolucci, Leonardo Galteri, Marco Bertini, Alberto Del Bimbo:
ARNIQA: Learning Distortion Manifold for Image Quality Assessment. 188-197 - Bokyeung Lee, Kyungdeuk Ko, Jonghwan Hong, Hanseok Ko:
Hard Sample-aware Consistency for Low-resolution Facial Expression Recognition. 198-207 - Maxim Shugaev, Ilya Semenov, Kyle Ashley, Michael Klaczynski, Naresh Cuntoor, Mun Wai Lee, Nathan Jacobs:
ArcGeo: Localizing Limited Field-of-View Images using Cross-view Matching. 208-217 - Hiran Sarkar, Vishal M. Chudasama, Naoyuki Onoe, Pankaj Wasnik, Vineeth N. Balasubramanian:
Open-Set Object Detection By Aligning Known Class Representations. 218-227 - Inkyu Shin, Dahun Kim, Qihang Yu, Jun Xie, Hong-Seok Kim, Bradley Green, In So Kweon, Kuk-Jin Yoon, Liang-Chieh Chen:
Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation. 228-238 - Volodymyr Fedynyak, Yaroslav Romanus, Bohdan Hlovatskyi, Bohdan Sydor, Oles Dobosevych, Igor Babin, Roman Riazantsev:
DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation. 239-248 - Md Awsafur Rahman, Shaikh Anowarul Fattah:
Semi-Supervised Semantic Depth Estimation using Symbiotic Transformer and NearFarMix Augmentation. 249-258 - Vladan Stojnic, Zakaria Laskar, Giorgos Tolias:
Training Ensembles with Inliers and Outliers for Semi-supervised Active Learning. 259-268 - Samuel Black, Richard Souvenir:
Multi-view Classification Using Hybrid Fusion and Mutual Distillation. 269-279 - Jiayang Ao, Qiuhong Ke, Krista A. Ehinger:
Amodal Intra-class Instance Segmentation: Synthetic Datasets and Benchmark. 280-289 - Balamurali Murugesan, Rukhshanda Hussain, Rajarshi Bhattacharya, Ismail Ben Ayed, Jose Dolz:
Prompting classes: Exploring the Power of Prompt Class Learning in Weakly Supervised Semantic Segmentation. 290-301 - Jingwen Sun, Jing Wu, Ze Ji, Yu-Kun Lai:
RSMPNet: Relationship Guided Semantic Map Prediction. 302-311 - Rajeev Yasarla, Renliang Weng, Wongun Choi, Vishal M. Patel, Amir Sadeghian:
3SD: Self-Supervised Saliency Detection With No Labels. 312-321 - Zenglin Shi, Ying Sun, Mengmi Zhang:
Training-free Object Counting with Prompts. 322-330 - Souradeep Chakraborty, Shujon Naha, Muhammet Bastan, Amit Kumar K. C, Dimitris Samaras:
Unsupervised and semi-supervised co-salient object detection via segmentation frequency statistics. 331-341 - Zheng Xiong, Liangyu Chai, Wenxi Liu, Yongtuo Liu, Sucheng Ren, Shengfeng He:
Glance to Count: Learning to Rank with Anchors for Weakly-supervised Crowd Counting. 342-351 - Yahia Dalbah, Jean Lahoud, Hisham Cholakkal:
TransRadar: Adaptive-Directional Transformer for Real-Time Multi-View Radar Semantic Segmentation. 352-361 - Jinwoo Hwang, Philipp Benz, Pete Kim:
Booster-SHOT: Boosting Stacked Homography Transformations for Multiview Pedestrian Detection with Attention. 362-371 - Zhifeng Teng, Jiaming Zhang, Kailun Yang, Kunyu Peng, Hao Shi, Simon Reiß, Ke Cao, Rainer Stiefelhagen:
360BEV: Panoramic Semantic Mapping for Indoor Bird's-Eye View. 372-381 - Yasser Abdelaziz Dahou Djilali, Kevin McGuinness, Noel E. O'Connor:
Learning Saliency From Fixations. 382-392 - Qilei Li, Shaogang Gong:
Mitigate Domain Shift by Primary-Auxiliary Objectives Association for Generalizing Person ReID. 393-402 - Md. Motiur Rahman, Shiva Shokouhmand, Smriti Bhatt, Miad Faezipour:
MIST: Medical Image Segmentation Transformer with Convolutional Attention Mixing (CAM) Decoder. 403-412 - Cheolhyun Mun, Sanghuk Lee, Youngjung Uh, Junsuk Choe, Hyeran Byun:
Small Objects Matters in Weakly-supervised Semantic Segmentation. 413-422 - Qizhen Lan, Qing Tian:
Gradient-Guided Knowledge Distillation for Object Detectors. 423-432 - Beoungwoo Kang, Seunghun Moon, Yubin Cho, Hyunwoo Yu, Suk-Ju Kang:
MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation. 433-442 - Cheng-Hsiu Chen, Jheng-Wei Su, Min-Chun Hu, Chih-Yuan Yao, Hung-Kuo Chu:
Panelformer: Sewing Pattern Reconstruction from 2D Garment Images. 443-452 - Ruxue Wen, Hangjie Yuan, Dong Ni, Wenbo Xiao, Yaoyao Wu:
From Denoising Training to Test-Time Adaptation: Enhancing Domain Generalization for Medical Image Segmentation. 453-463 - Tariq Berrada, Camille Couprie, Karteek Alahari, Jakob Verbeek:
Guided Distillation for Semi-Supervised Instance Segmentation. 464-472 - Gwanghan Lee, Saebyeol Shin, Taeyoung Na, Simon S. Woo:
Real-Time User-guided Adaptive Colorization with Vision Transformer. 473-482 - Bin Duan, Hao Tang, Changchang Sun, Ye Zhu, Yan Yan:
Mining and Unifying Heterogeneous Contrastive Relations for Weakly-Supervised Actor-Action Segmentation. 483-492 - Yessine Khanfir, Marwa Dhiaf, Emna Ghodhbani, Ahmed Cheikh Rouhou, Yousri Kessentini:
Graph Neural Networks for End-to-End Information Extraction from Handwritten Documents. 493-501 - Lei Li:
CPSeg: Finer-grained Image Semantic Segmentation via Chain-of-Thought Language Prompting. 502-511 - Xiaobo Yang, Xiaojin Gong:
Foundation Model Assisted Weakly Supervised Semantic Segmentation. 512-521 - Ahmed Ben Saad, Gabriele Facciolo, Axel Davy:
On the Importance of Large Objects in CNN Based Object Detection Algorithms. 522-531 - Yeti Ziya Gürbüz, Ogul Can, A. Aydin Alatan:
Deep Metric Learning with Chance Constraints. 532-542 - Tajamul Ashraf, Fuzayil Bin Afzal Mir, Iqra Altaf Gillani:
TransFed: A way to epitomize Focal Modulation using Transformer-based Federated Learning. 543-552 - Yangzheng Wu, Michael A. Greenspan:
Learning Better Keypoints for Multi-Object 6DoF Pose Estimation. 553-563 - Praful Mathur, Shashi Kumar Parwani, Mrinmoy Sen, Roopa Sheshadri, Aman Sharma:
Object Aware Contrastive Prior for Interactive Image Segmentation. 564-573 - Teodora Popordanoska, Aleksei Tiulpin, Matthew B. Blaschko:
Beyond Classification: Definition and Density-based Estimation of Calibration in Object Detection. 574-583 - Jianlong Yuan, Minh Hieu Phan, Liyang Liu, Yifan Liu:
FAKD: Feature Augmented Knowledge Distillation for Semantic Segmentation. 584-594 - Han Qiu, Gongjie Zhang, Jiaxing Huang, Peng Gao, Zhang Wei, Shijian Lu:
Efficient MAE towards Large-Scale Vision Transformers. 595-604 - Saad Himmi, Vincent Parret, Ajad Chhatkuli, Luc Van Gool:
MS-EVS: Multispectral event-based vision for deep learning based face detection. 605-614 - Hyuna Cho, Injun Choi, Suha Kwak, Won Hwa Kim:
Interactive Network Perturbation between Teacher and Students for Semi-Supervised Semantic Segmentation. 615-624 - Gengyuan Zhang, Yurui Zhang, Kerui Zhang, Volker Tresp:
Can Vision-Language Models be a Good Guesser? Exploring VLMs for Times and Location Reasoning. 625-634 - Ji-Ye Jeon, Xuan Truong Nguyen, Soojung Ryu, Hyuk-Jae Lee:
USDN: A Unified Sample-wise Dynamic Network with Mixed-Precision and Early-Exit. 635-643 - Qian Xie, Ta Ying Cheng, Jia-Xing Zhong, Kaichen Zhou, Andrew Markham, Niki Trigoni:
Beyond Fusion: Modality Hallucination-based Multispectral Fusion for Pedestrian Detection. 644-653 - Fangchen Yu, Yina Xie, Lei Wu, Yafei Wen, Guozhi Wang, Shuai Ren, Xiaoxin Chen, Jianfeng Mao, Wenye Li:
DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction. 654-663 - Hasib Zunair, A. Ben Hamza:
Learning to Recognize Occluded and Small Objects with Partial Inputs. 664-673 - Razieh Kaviani Baghbaderani, Yuanxin Li, Shuangquan Wang, Hairong Qi:
Temporally-Consistent Video Semantic Segmentation with Bidirectional Occlusion-guided Feature Propagation. 674-684 - Nikhil Reddy, Mahsa Baktashmotlagh, Chetan Arora:
Domain-Aware Knowledge Distillation for Continual Model Generalization. 685-696 - Kamalakar Vijay Thakare, Debi Prosad Dogra, Heeseung Choi, Haksub Kim, Ig-Jae Kim:
Let's Observe Them Over Time: An Improved Pedestrian Attribute Recognition Approach. 697-706 - Alloy Das, Sanket Biswas, Ayan Banerjee, Josep Lladós, Umapada Pal, Saumik Bhattacharya:
Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance. 707-717 - Tianyi Zhang, Kishore Kasichainula, Yaoxin Zhuo, Baoxin Li, Jae-Sun Seo, Yu Cao:
Patch-based Selection and Refinement for Early Object Detection. 718-727 - Cagri Gungor, Adriana Kovashka:
Boosting Weakly Supervised Object Detection using Fusion and Priors from Hallucinated Depth. 728-737 - Ashutosh Kulkarni, Shruti S. Phutke, Santosh Kumar Vipparthi, Subrahmanyam Murala:
C2AIR: Consolidated Compact Aerial Image Haze Removal. 738-747 - K. N. Ajay Shastry, K. Ravi Sri Teja, Aditya Nigam, Chetan Arora:
Favoring One Among Equals - Not a Good Idea: Many-to-one Matching for Robust Transformer based Pedestrian Detection. 748-757 - Cheng Yang, Rui Xu, Ye Guo, Peixiang Huang, Yiru Chen, Wenkui Ding, Zhongyuan Wang, Hong Zhou:
Improving Vision-and-Language Reasoning via Spatial Relations Modeling. 758-767 - Chau Pham, Truong Vu, Khoi Nguyen:
LP-OVOD: Open-Vocabulary Object Detection by Linear Probing. 768-777 - Barsegh Atanyan, Levon Khachatryan, Shant Navasardyan, Yunchao Wei, Humphrey Shi:
Continuous Adaptation for Interactive Segmentation Using Teacher-Student Architecture. 778-788 - Qiyang Wan, Ruiping Wang, Xilin Chen:
Interpretable Object Recognition by Semantic Prototype Analysis. 789-798 - Gregor Köhler, Tassilo Wald, Constantin Ulrich, David Zimmerer, Paul F. Jaeger, Jörg K. H. Franke, Simon Kohl, Fabian Isensee, Klaus H. Maier-Hein:
RecycleNet: Latent Feature Recycling Leads to Iterative Decision Refinement. 799-807 - Junehyoung Kwon, Eunju Lee, Yunsung Cho, YoungBin Kim:
Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation. 808-817 - Connor Anderson, Matthew Gwilliam, Evelyn Gaskin, Ryan Farrell:
Elusive Images: Beyond Coarse Analysis for Fine-Grained Recognition. 818-828 - Xiaoyu Dong, Naoto Yokoya:
Understanding Dark Scenes by Contrasting Multi-Modal Observations. 829-839 - Abdullah Rashwan, Jiageng Zhang, Ali Taalimi, Fan Yang, Xingyi Zhou, Chaochao Yan, Liang-Chieh Chen, Yeqing Li:
MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation. 840-850 - Fangwen Wu, Jingxuan He, Yufei Yin, Yanbin Hao, Gang Huang, Lechao Cheng:
Masked Collaborative Contrast for Weakly Supervised Semantic Segmentation. 851-860 - Zizheng Yan, Yushuang Wu, Yipeng Qin, Xiaoguang Han, Shuguang Cui, Guanbin Li:
Universal Semi-supervised Model Adaptation via Collaborative Consistency Training. 861-871 - Sergi Garcia-Bordils, Dimosthenis Karatzas, Marçal Rusiñol:
STEP - Towards Structured Scene-Text Spotting. 872-881 - Zhuoming Liu, Xuefeng Hu, Ram Nevatia:
Efficient Feature Distillation for Zero-shot Annotation Object Detection. 882-891 - Shangbang Long, Siyang Qin, Yasuhisa Fujii, Alessandro Bissacco, Michalis Raptis:
Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis. 892-902 - Taotao Jing, Lichen Wang, Naji Khosravan, Zhiqiang Wan, Zachary Bessinger, Zhengming Ding, Sing Bing Kang:
iBARLE: imBalance-Aware Room Layout Estimation. 903-913 - Shuo Wang, Jing Li, Zibo Zhao, Dongze Lian, Binbin Huang, Xiaomei Wang, Zhengxin Li, Shenghua Gao:
TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic Scene Understanding. 914-923 - Peter Naylor, Diego Di Carlo, Arianna Traviglia, Makoto Yamada, Marco Fiorucci:
Implicit neural representation for change detection. 924-934 - Hei Law, Jia Deng:
Label-Free Synthetic Pretraining of Object Detectors. 935-945 - Ximeng Sun, Rameswar Panda, Chun-Fu Richard Chen, Naigang Wang, Bowen Pan, Aude Oliva, Rogério Feris, Kate Saenko:
Improved Techniques for Quantizing Deep Networks with Adaptive Bit-Widths. 946-956 - Maximilian Bernhard, Roberto Amoroso, Yannic Kindermann, Lorenzo Baraldi, Rita Cucchiara, Volker Tresp, Matthias Schubert:
What's Outside the Intersection? Fine-grained Error Analysis for Semantic Segmentation Beyond IoU. 957-966 - Hao Chen, Yonghan Dong, Zheming Lu, Yunlong Yu, Jungong Han:
Pixel Matching Network for Cross-Domain Few-Shot Segmentation. 967-976 - Joonhyun Jeong, Beomyoung Kim, Joonsang Yu, Youngjoon Yoo:
EResFD: Rediscovery of the Effectiveness of Standard Convolution for Lightweight Face Detection. 977-987 - Mir Rayat Imtiaz Hossain, Leonid Sigal, James J. Little:
Framework-agnostic Semantically-aware Global Reasoning for Segmentation. 988-998 - Arvi Jonnarth, Yushan Zhang, Michael Felsberg:
High-fidelity Pseudo-labels for Boosting Weakly-Supervised Segmentation. 999-1008 - Harsh Maheshwari, Yen-Cheng Liu, Zsolt Kira:
Missing Modality Robustness in Semi-Supervised Multi-Modal Semantic Segmentation. 1009-1019 - Jialiang Zhu, Danqing Huang, Chunyu Wang, Mingxi Cheng, Ji Li, Han Hu, Xin Geng, Baining Guo:
Unsupervised Graphic Layout Grouping with Transformers. 1020-1029 - Vuong D. Nguyen, Khadija Khaldi, Dung Nguyen, Pranav Mantini, Shishir Shah:
Contrastive Viewpoint-aware Shape Learning for Long-term Person Re-Identification. 1030-1038 - Xuan Yang, Liangzhe Yuan, Kimberly Wilber, Astuti Sharma, Xiuye Gu, Siyuan Qiao, Stephanie Debats, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Liang-Chieh Chen:
PolyMaX: General Dense Prediction with Mask Transformer. 1039-1050 - Liyang Liu, Zihan Wang, Minh Hieu Phan, Bowen Zhang, Jinchao Ge, Yifan Liu:
BPKD: Boundary Privileged Knowledge Distillation For Semantic Segmentation. 1051-1061 - Changkun Ye, Russell Tsuchida, Lars Petersson, Nick Barnes:
Label Shift Estimation for Class-Imbalance Problem: A Bayesian Approach. 1062-1071 - Aditay Tripathi, Anand Mishra, Anirban Chakraborty:
Query-guided Attention in Vision Transformers for Localizing Objects Using a Single Sketch. 1072-1081 - Yiting Li, Adam David Goodge, Fayao Liu, Chuan-Sheng Foo:
PromptAD: Zero-shot Anomaly Detection using Text Prompts. 1082-1091 - Xiaosong Wang, Ziyue Xu, Dong Yang, Leo K. Tam, Holger Roth, Daguang Xu:
Learning Quality Labels for Robust Image Classification. 1092-1101 - Haitian He, Sarah M. Erfani, Mingming Gong, Qiuhong Ke:
Learning Transferable Representations for Image Anomaly Localization Using Dense Pretraining. 1102-1111 - Xiangyong Lu, Masanori Suganuma, Takayuki Okatani:
SBCFormer: Lightweight Network Capable of Full-size ImageNet Classification at 1 FPS on Single Board Computers. 1112-1122 - Andrei-Timotei Ardelean, Tim Weyrich:
High-Fidelity Zero-Shot Texture Anomaly Localization Using Feature Correspondence Analysis. 1123-1133 - Jongwoo Park, Kumara Kahatapitiya, Donghyun Kim, Shivchander Sudalairaj, Quanfu Fan, Michael S. Ryoo:
Grafting Vision Transformers. 1134-1143 - Tao Liu, Chenshu Chen, Xi Yang, Wenming Tan:
Rethinking Knowledge Distillation with Raw Features for Semantic Segmentation. 1144-1153 - Soumya Roy, Vinay Kumar Verma, Deepak Gupta:
Efficient Expansion and Gradient Based Task Inference for Replay Free Incremental Learning. 1154-1164 - Hiroto Honda, Yusuke Uchida:
CLRerNet: Improving Confidence of Lane Detection with LaneIoU. 1165-1174 - Yuqi Hou, Zhongqun Zhang, Nora Horanyi, Jaewon Moon, Yihua Cheng, Hyung Jin Chang:
Multi-Modal Gaze Following in Conversational Scenarios. 1175-1184 - Sithu Aung, Haesol Park, Hyungjoo Jung, Junghyun Cho:
Enhancing Multi-view Pedestrian Detection Through Generalized 3D Feature Pulling. 1185-1194 - Zacharias Anastasakis, Dimitrios Mallis, Markos Diomataris, George Alexandridis, Stefanos Kollias, Vassilis Pitsikalis:
Self-Supervised Learning for Visual Relationship Detection through Masked Bounding Box Reconstruction. 1195-1204 - Sandra Kara, Hejer Ammar, Florian Chabot, Quoc-Cuong Pham:
The Background Also Matters: Background-Aware Motion-Guided Objects Discovery. 1205-1214 - Seonhoon Lee, Jong-Hwan Kim:
Semi-Supervised Scene Change Detection by Distillation from Feature-metric Alignment. 1215-1224 - Siddharth Srivastava, Gaurav Sharma:
OmniVec: Learning robust representations with cross modal sharing. 1225-1237 - Dong Yuan, Frédéric Maire, Feras Dayoub:
Cross-Attention Between Satellite and Ground Views for Enhanced Fine-Grained Robot Geo-Localization. 1238-1245 - Haoyang Fang, Boran Han, Shuai Zhang, Su Zhou, Cuixiong Hu, Wen-Ming Ye:
Data Augmentation for Object Detection via Controllable Diffusion Models. 1246-1255 - Haoye Dong, Tiange Xiang, Sravan Chittupalli, Jun Liu, Dong Huang:
Physical-space Multi-body Mesh Detection Achieved by Local Alignment and Global Dense Learning. 1256-1265 - Atif Belal, Akhil Meethal, Francisco Perdigon Romero, Marco Pedersoli, Eric Granger:
Multi-Source Domain Adaptation for Object Detection with Prototype-based Mean Teacher. 1266-1275 - Reza Azad, Leon Niggemeier, Michael Hüttemann, Amirhossein Kazerouni, Ehsan Khodapanah Aghdam, Yury Velichko, Ulas Bagci, Dorit Merhof:
Beyond Self-Attention: Deformable Large Kernel Attention for Medical Image Segmentation. 1276-1286 - Amirhossein Kazerouni, Reza Azad, Alireza Hosseini, Dorit Merhof, Ulas Bagci:
INCODE: Implicit Neural Conditioning with Prior Knowledge Embeddings. 1287-1296 - Oriol Barbany, Xiaofan Lin, Muhammet Bastan, Arnab Dhua:
ProcSim: Proxy-based Confidence for Robust Similarity Learning. 1297-1306 - Yujie Zang, Yaochen Li, Yuan Gao, Yimou Guo, Wenneng Tang, Yanxue Li, Meklit Atlaw:
Refine and Redistribute: Multi-Domain Fusion and Dynamic Label Assignment for Unbiased Scene Graph Generation. 1307-1316 - Mykhailo Shvets, Dongxu Zhao, Marc Niethammer, Roni Sengupta, Alexander C. Berg:
Joint Depth Prediction and Semantic Segmentation with Multi-View SAM. 1317-1327 - Bicheng Xu, Renjie Liao, Leonid Sigal:
Self-Supervised Relation Alignment for Scene Graph Generation. 1328-1338 - Shan Zhang, Yao Ni, Jinhao Du, Yanxia Liu, Piotr Koniusz:
Semantic Transfer from Head to Tail: Enlarging Tail Margin for Long-Tailed Visual Recognition. 1339-1349 - Savinay Nagendra, Daniel Kifer:
PatchRefineNet: Improving Binary Segmentation by Incorporating Signals from Optimal Patch-wise Binarization. 1350-1361 - Fatih Ilhan, Ka-Ho Chow, Sihao Hu, Tiansheng Huang, Selim F. Tekin, Wenqi Wei, Yanzhao Wu, Myungjin Lee, Ramana Kompella, Hugo Latapie, Gaowen Liu, Ling Liu:
Adaptive Deep Neural Network Inference Optimization with EENet. 1362-1371 - Minchul Kim, Shangqian Gao, Yen-Chang Hsu, Yilin Shen, Hongxia Jin:
Token Fusion: Bridging the Gap between Token Pruning and Token Merging. 1372-1381 - Donghyeon Lee, Eunho Lee, Youngbae Hwang:
Pruning from Scratch via Shared Pruning Module and Nuclear norm-based Regularization. 1382-1391 - Monika Wysoczanska, Michaël Ramamonjisoa, Tomasz Trzcinski, Oriane Siméoni:
CLIP-DIY: CLIP Dense Inference Yields Open-Vocabulary Semantic Segmentation For-Free. 1392-1402 - Junyoung Park, Jin Kim, Hyeongjun Kwon, Ilhoon Yoon, Kwanghoon Sohn:
Layer-wise Auto-Weighting for Non-Stationary Test-Time Adaptation. 1403-1412 - Qasim M. K. Siddiqui, Sebastian Starke, Peter Steinbach:
Uncertainty Estimation in Instance Segmentation with Star-convex Shapes. 1413-1422 - Abbas Khan, Mustaqeem Khan, Wail Gueaieb, Abdulmotaleb El-Saddik, Giulia De Masi, Fakhri Karray:
CamoFocus: Enhancing Camouflage Object Detection with Split-Feature Focal Modulation and Context Refinement. 1423-1432 - Heitor Rapela Medeiros, Fidel A. Guerrero-Peña, Masih Aminbeidokhti, Thomas Dubail, Eric Granger, Marco Pedersoli:
HalluciDet: Hallucinating RGB Modality for Person Detection Through Privileged Information. 1433-1442 - Md Raqib Khan, Priyanka Mishra, Nancy Mehta, Shruti S. Phutke, Santosh Kumar Vipparthi, Sukumar Nandi, Subrahmanyam Murala:
Spectroformer: Multi-Domain Query Cascaded Transformer Network For Underwater Image Enhancement. 1443-1452 - Luca Barsellotti, Roberto Amoroso, Lorenzo Baraldi, Rita Cucchiara:
FOSSIL: Free Open-Vocabulary Semantic Segmentation through Synthetic References Retrieval. 1453-1462 - Rajeev Yasarla, Jeya Maria Jose Valanarasu, Vishwanath S, Vishal M. Patel:
Self-Supervised Denoising Transformer with Gaussian Process. 1463-1473 - Chihiro Noguchi, Shun Fukuda, Masao Yamanaka:
Scene Text Image Super-resolution based on Text-conditional Diffusion Models. 1474-1484 - Royson Lee, Rui Li, Stylianos I. Venieris, Timothy M. Hospedales, Ferenc Huszár, Nicholas D. Lane:
Meta-Learned Kernel For Blind Super-Resolution Kernel Estimation. 1485-1494 - Aditya Chandrasekar, Manogna Sreenivas, Soma Biswas:
PhISH-Net: Physics Inspired System for High Resolution Underwater Image Enhancement. 1495-1505 - Max Ehrlich, Jon Barker, Namitha Padmanabhan, Larry Davis, Andrew Tao, Bryan Catanzaro, Abhinav Shrivastava:
Leveraging Bitstream Metadata for Fast, Accurate, Generalized Compressed Video Quality Enhancement. 1506-1516 - Alexander Krull, Hector Basevi, Benjamin Salmon, Andre Zeug, Franziska Müller, Samuel Tonks, Leela Muppala, Ales Leonardis:
Image Denoising and the Generative Accumulation of Photons. 1517-1526 - Hung-Yu Shu, Yi-Hsien Lin, Yi-Chang Lu:
Deep Plug-and-play Nighttime Non-blind Deblurring with Saturated Pixel Handling Schemes. 1527-1535 - Shao-Yu Weng, Hsuan Yuan, Yu-Syuan Xu, Ching-Chun Huang, Wei-Chen Chiu:
Best of Both Worlds: Learning Arbitrary-scale Blind Super-Resolution via Dual Degradation Representations and Cycle-Consistency. 1536-1545 - Reyhaneh Neshatavar, Mohsen Yavartanoo, Sanghyun Son, Kyoung Mu Lee:
ICF-SRSR: Invertible scale-Conditional Function for Self-Supervised Real-world Single Image Super-Resolution. 1546-1556 - Fotios Logothetis, Ignas Budvytis, Roberto Cipolla:
A Neural Height-Map Approach for the Binocular Photometric Stereo Problem. 1557-1566 - Yijie Zhou, Chao Li, Jin Liang, Tianyi Xu, Xin Liu, Jun Xu:
4K-Resolution Photo Exposure Correction at 125 FPS with ~8K Parameters. 1576-1586 - Hwayoon Lee, Kyoungkook Kang, Hyeongmin Lee, Seung-Hwan Baek, Sunghyun Cho:
UGPNet: Universal Generative Prior for Image Restoration. 1587-1597 - Jonghyuk Park, HyeonA Kim, Eunpil Park, Jae-Young Sim:
Fully-Automatic Reflection Removal for 360-Degree Images. 1598-1606 - Omri Berman, Navot Oz, David Mendlovic, Nir A. Sochen, Yafit Cohen, Iftach Klapp:
PETIT-GAN: Physically Enhanced Thermal Image-Translating Generative Adversarial Network. 1607-1616 - Xilai Li, Xiaosong Li, Tao Ye, Xiaoqi Cheng, Wuyang Liu, Haishu Tan:
Bridging the Gap between Multi-focus and Multi-modal: A Focused Integration Framework for Multi-modal Image Fusion. 1617-1626 - Yuval Haitman, Oded Bialer:
BoostRad: Enhancing Object Detection by Boosting Radar Reflections. 1627-1636 - Chen Feng, Duolikun Danier, Fan Zhang, David Bull:
RankDVQA: Deep VQA based on Ranking-inspired Hybrid Training. 1637-1647 - Lorenzo Agnolucci, Leonardo Galteri, Marco Bertini, Alberto Del Bimbo:
Reference-based Restoration of Digitized Analog Videotapes. 1648-1657 - Arnaud Barral, Pablo Arias, Axel Davy:
Fixed Pattern Noise Removal For Multi-View Single-Sensor Infrared Camera. 1658-1667 - Zhao Wang, Aoxue Li, Zhenguo Li, Qi Dou:
Efficient Transferability Assessment for Selection of Pre-trained Detectors. 1668-1678 - Alex Gomez-Villa, Bartlomiej Twardowski, Kai Wang, Joost van de Weijer:
Plasticity-Optimized Complementary Networks for Unsupervised Continual Learning. 1679-1689 - Yanshuo Wang, Jie Hong, Ali Cheraghian, Shafin Rahman, David Ahmedt-Aristizabal, Lars Petersson, Mehrtash Harandi:
Continual Test-time Domain Adaptation via Dynamic Sample Selection. 1690-1699 - Hamza Rami, Jhony H. Giraldo, Nicolas Winckler, Stéphane Lathuilière:
Source-Guided Similarity Preservation for Online Person Re-Identification. 1700-1709 - Zhaoheng Zheng, Haidong Zhu, Ram Nevatia:
CAILA: Concept-Aware Intra-Layer Adapters for Compositional Zero-Shot Learning. 1710-1720 - Imad Eddine Marouf, Enzo Tartaglione, Stéphane Lathuilière:
Mini but Mighty: Finetuning ViTs with Mini Adapters. 1721-1730 - Andrés C. Rodríguez, Stefano D'Aronco, Rodrigo Caye Daudt, Jan D. Wegner, Konrad Schindler:
Recognition of Unseen Bird Species by Learning from Field Guides. 1731-1740 - Weiqin Chuah, Ruwan B. Tennakoon, Reza Hoseinnezhad, David Suter, Alireza Bab-Hadiashar:
Single Domain Generalization via Normalised Cross-correlation Based Convolutions. 1741-1750 - Julien Nicolas, Florent Chiaroni, Imtiaz Masud Ziko, Ola Ahmad, Christian Desrosiers, Jose Dolz:
MoP-CLIP: A Mixture of Prompt-Tuned CLIP Models for Domain Incremental Learning. 1751-1761 - Zheng Gao, Chen Feng, Ioannis Patras:
Self-Supervised Representation Learning with Cross-Context Learning between Global and Hypercolumn Features. 1762-1772 - Zongshang Pang, Yuta Nakashima, Mayu Otani, Hajime Nagahara:
Revisiting Pixel-Level Contrastive Pre-Training on Scene Images. 1773-1782 - David Hart, Bryan S. Morse:
Improving Graph Networks through Selection-based Convolution. 1783-1793 - Bo Wan, Tinne Tuytelaars:
Exploiting CLIP for Zero-shot HOI Detection Requires Knowledge Distillation at Multiple Levels. 1794-1804 - Prathmesh Bele, Valay Bundele, Avigyan Bhattacharya, Ankit Jha, Gemma Roig, Biplab Banerjee:
Learning Class and Domain Augmentations for Single-Source Open-Domain Generalization. 1805-1815 - Yuang Liu, Qiang Zhou, Jing Wang, Zhibin Wang, Fan Wang, Jun Wang, Wei Zhang:
Dynamic Token-Pass Transformers for Semantic Segmentation. 1816-1825 - Grégoire Petit, Michaël Soumm, Eva Feillet, Adrian Popescu, Bertrand Delezoide, David Picard, Céline Hudelot:
An Analysis of Initial Training Strategies for Exemplar-Free Class-Incremental Learning. 1826-1836 - Wenlong Shi, Changsheng Lu, Ming Shao, Yinjie Zhang, Siyu Xia, Piotr Koniusz:
Few-shot Shape Recognition by Learning Deep Shape-aware Features. 1837-1848 - Asma Yamani, Albandari Alyami, Hamzah Luqman, Bernard Ghanem, Silvio Giancola:
Active Learning for Single-Stage Object Detection in UAV Images. 1849-1858 - Jumpei Goto, Yohei Nakata, Kiyofumi Abe, Yasunori Ishii, Takayoshi Yamashita:
Learning Intra-class Multimodal Distributions with Orthonormal Matrices. 1859-1868 - Hiromu Taketsugu, Norimichi Ukita:
Active Transfer Learning for Efficient Video-Specific Human Pose Estimation. 1869-1879 - Yun Yue, Fangzhou Lin, Guanyi Mou, Ziming Zhang:
Understanding Hyperbolic Metric Learning through Hard Negative Sampling. 1880-1892 - Sunandini Sanyal, Ashish Ramayee Asokan, Suvaansh Bhambri, Pradyumna YM, Akshay R. Kulkarni, Jogendra Nath Kundu, R. Venkatesh Babu:
Aligning Non-Causal Factors for Transformer-Based Source-Free Domain Adaptation. 1893-1902 - Niv Nayman, Avram Golbert, Asaf Noy, Lihi Zelnik-Manor:
Diverse Imagenet Models Transfer Better. 1903-1914 - Aswathnarayan Radhakrishnan, Jim Davis, Zachary Rabin, Benjamin Lewis, Matthew Scherreik, Roman Ilin:
Design Choices for Enhancing Noisy Student Self-Training. 1915-1924 - Saypraseuth Mounsaveng, Florent Chiaroni, Malik Boudiaf, Marco Pedersoli, Ismail Ben Ayed:
Bag of Tricks for Fully Test-Time Adaptation. 1925-1934 - Thomas Westfechtel, Hao-Wei Yeh, Dexuan Zhang, Tatsuya Harada:
Gradual Source Domain Expansion for Unsupervised Domain Adaptation. 1935-1944 - Soroush Seifi, Daniel Olmeda Reino, Nikolay Chumerin, Rahaf Aljundi:
OOD Aware Supervised Contrastive Learning. 1945-1955 - Yao Deng, Xiang Xiang:
Expanding Hyperspherical Space for Few-Shot Class-Incremental Learning. 1956-1965 - Filip Szatkowski, Mateusz Pyla, Marcin Przewiezlikowski, Sebastian Cygert, Bartlomiej Twardowski, Tomasz Trzcinski:
Adapt Your Teacher: Improving Knowledge Distillation for Exemplar-free Continual Learning. 1966-1976 - Matthew Inkawhich, Nathan Inkawhich, Hai Li, Yiran Chen:
Tunable Hybrid Proposal Networks for the Open World. 1977-1988 - Zheyuan Zhang, Bin Wang, Debesh Jha, Ugur Demir, Ulas Bagci:
Domain Generalization with Correlated Style Uncertainty. 1989-1998 - Fei Wu, Pablo Márquez-Neila, Mingyi Zheng, Hedyeh Rafii-Tari, Raphael Sznitman:
Correlation-aware active learning for surgery video segmentation. 1999-2009 - Keval Doshi, Amanmeet Garg, Burak Uzkent, Xiaolong Wang, Mohamed Omar:
A Multimodal Benchmark and Improved Architecture for Zero Shot Learning. 2010-2019 - Shiyao Li, Xuefei Ning, Shanghang Zhang, Lidong Guo, Tianchen Zhao, Huazhong Yang, Yu Wang:
TCP: Triplet Contrastive-relationship Preserving for Class-Incremental Learning. 2020-2029 - Seyedalireza Khoshsirat, Chandra Kambhamettu:
Improving Normalization with the James-Stein Estimator. 2030-2040 - Jeeho Hyun, Sangyun Kim, Giyoung Jeon, Seung Hwan Kim, Kyunghoon Bae, Byung Jun Kang:
ReConPatch : Contrastive Patch Representation Learning for Industrial Anomaly Detection. 2041-2050 - Skyler Seto, Barry-John Theobald, Federico Danieli, Navdeep Jaitly, Dan Busbridge:
REALM: Robust Entropy Adaptive Loss Minimization for Improved Single-Sample Test-Time Adaptation. 2051-2060 - Jiaxin Zhang, Sirui Bi, Victor Fung:
On the Quantification of Image Reconstruction Uncertainty without Training Data. 2061-2070 - Gabriel Moreira, Manuel Marques, João Paulo Costeira, Alexander G. Hauptmann:
Hyperbolic vs Euclidean Embeddings in Few-Shot Learning: Two Sides of the Same Coin. 2071-2079 - Fahim Faisal Niloy, Sk Miraj Ahmed, Dripta S. Raychaudhuri, Samet Oymak, Amit K. Roy-Chowdhury:
Effective Restoration of Source Knowledge in Continual Test Time Adaptation. 2080-2089 - Anwesha Banerjee, Liyana Sahir Kallooriyakath, Soma Biswas:
AMEND: Adaptive Margin and Expanded Neighborhood for Efficient Generalized Category Discovery. 2090-2099 - Jeongbeen Yoon, Sanghyun Kim, Suha Kwak, Minsu Cho:
Optical Flow Domain Adaptation via Target Style Transfer. 2100-2110 - Konstantin Kirchheim, Tim Gonschorek, Frank Ortmeier:
Out-of-Distribution Detection with Logical Reasoning. 2111-2120 - Ruxiao Duan, Brian Caffo, Harrison X. Bai, Haris I. Sair, Craig K. Jones:
Evidential Uncertainty Quantification: A Variance-Based Perspective. 2121-2130 - Donghyeon Kwon, Minsu Cho, Suha Kwak:
Self-supervised Learning of Semantic Correspondence Using Web Videos. 2131-2141 - Ankit Shukla, Avinash Upadhyay, Swati Bhugra, Manoj Sharma:
Opinion Unaware Image Quality Assessment via Adversarial Convolutional Variational Autoencoder. 2142-2152 - Vitjan Zavrtanik, Matej Kristan, Danijel Skocaj:
Cheating Depth: Enhancing 3D Surface Anomaly Detection via Depth Simulation. 2153-2161 - Gao Yu Lee, Tanmoy Dam, Daniel Puiu Poenar, Vu N. Duong, Md Meftahul Ferdaus:
HELA-VFA: A Hellinger Distance-Attention-based Feature Aggregation Network for Few-Shot Classification. 2162-2172 - Ohad Amosy, Gal Eyal, Gal Chechik:
Late to the party? On-demand unlabeled personalized federated learning. 2173-2182 - Bin Wang, Hongyi Pan, Armstrong Aboah, Zheyuan Zhang, Elif Keles, Drew A. Torigian, Baris Turkbey, Elizabeth A. Krupinski, Jayaram K. Udupa, Ulas Bagci:
GazeGNN: A Gaze-Guided Graph Neural Network for Chest X-ray Classification. 2183-2192 - Xinglong Sun, Humphrey Shi:
Towards Better Structured Pruning Saliency by Reorganizing Convolution. 2193-2203 - Masih Aminbeidokhti, Fidel A. Guerrero-Peña, Heitor Rapela Medeiros, Thomas Dubail, Eric Granger, Marco Pedersoli:
Domain Generalization by Rejecting Extreme Augmentations. 2204-2214 - Yaoyao Liu, Yingying Li, Bernt Schiele, Qianru Sun:
Wakening Past Concepts without Past Data: Class-Incremental Learning from Online Placebos. 2215-2224 - Solang Kim, Yuho Jeong, Joon Sung Park, Sung Whan Yoon:
MICS: Midpoint Interpolation to Learn Compact and Separated Representations for Few-Shot Class-Incremental Learning. 2225-2234 - Boon Peng Yap, Beng Koon Ng:
Group-wise Contrastive Bottleneck for Weakly-Supervised Visual Representation Learning. 2235-2244 - Seyed Mojtaba Marvasti-Zadeh, Nilanjan Ray, Nadir Erbilgin:
Training-Based Model Refinement and Representation Disagreement for Semi-Supervised Object Detection. 2245-2254 - Junde Xu, Zikai Lin, Donghao Zhou, Yaodong Yang, Xiangyun Liao, Qiong Wang, Bian Wu, Guangyong Chen, Pheng-Ann Heng:
DPPMask: Masked Image Modeling with Determinantal Point Processes. 2255-2265 - Cuong Pham, Van-Anh Nguyen, Trung Le, Dinh Q. Phung, Gustavo Carneiro, Thanh-Toan Do:
Frequency Attention for Knowledge Distillation. 2266-2275 - M. Yashwanth, Gaurav Kumar Nayak, Harsh Rangwani, Arya Singh, R. Venkatesh Babu, Anirban Chakraborty:
Minimizing Layerwise Activation Norm Improves Generalization in Federated Learning. 2276-2285 - Michalis Lazarou, Yannis Avrithis, Tania Stathaki:
Adaptive manifold for imbalanced transductive few-shot learning. 2286-2295 - Yuwen Tan, Xiang Xiang:
Cross-Domain Few-Shot Incremental Learning for Point-Cloud Recognition. 2296-2305 - Taehoon Kim, Bohyung Han:
Randomized Adversarial Style Perturbations for Domain Generalization. 2306-2314 - Xinkuan Qiu, Meina Kan, Yongbin Zhou, Yanchao Bi, Shiguang Shan:
Shape-biased CNNs are Not Always Superior in Out-of-Distribution Robustness. 2315-2324 - Udbhav Bamba, Neeraj Anand, Saksham Aggarwal, Dilip K. Prasad, Deepak K. Gupta:
Partial Binarization of Neural Networks for Budget-Aware Efficient Learning. 2325-2334 - Aral Hekimoglu, Michael Schmidt, Alvaro Marcos-Ramiro:
Monocular 3D Object Detection with LiDAR Guided Semi Supervised Active Learning. 2335-2344 - Erik Wallin, Lennart Svensson, Fredrik Kahl, Lars Hammarstrand:
Improving Open-Set Semi-Supervised Learning with Self-Supervision. 2345-2354 - Xiaofan Yu, Tajana Rosing, Yunhui Guo:
Evolve: Enhancing Unsupervised Continual Learning with Multiple Experts. 2355-2366 - Simon Klenk, David Bonello, Lukas Koestler, Nikita Araslanov, Daniel Cremers:
Masked Event Modeling: Self-Supervised Pretraining for Event Cameras. 2367-2377 - Xiang Song, Kuang Shu, Songlin Dong, Jie Cheng, Xing Wei, Yihong Gong:
Overcoming Catastrophic Forgetting for Multi-Label Class-Incremental Learning. 2378-2387 - Xingchen Zhao, Niluthpol Chowdhury Mithun, Abhinav Rajvanshi, Han-Pang Chiu, Supun Samarasekera:
Unsupervised Domain Adaptation for Semantic Segmentation with Pseudo Label Self-Refinement. 2388-2398 - Nikhil Mehta, Kevin J. Liang, Jing Huang, Fu-Jen Chu, Li Yin, Tal Hassner:
HyperMix: Out-of-Distribution Detection and Classification in Few-Shot Settings. 2399-2409 - ChiatPin Tay, Vigneshwaran Subbaraju, Thivya Kandappu:
PrivObfNet: A Weakly Supervised Semantic Segmentation Model for Data Protection. 2410-2420 - Minghai Qin, Chao Sun, Jaco Hofmann, Dejan Vucinic:
DISCO: Distributed Inference with Sparse Communications. 2421-2429 - Khanh-Binh Nguyen:
Debiasing, calibrating, and improving Semi-supervised Learning performance via simple Ensemble Projector. 2430-2439 - Kartik Gupta, Akshay Asthana:
Reducing the Side-Effects of Oscillations in Training of Quantized YOLO Networks. 2440-2449 - Joonhyeok Jang, Sunhyeok Lee, Seonghak Kim, Jung-Un Kim, Seonghyun Kim, Daeshik Kim:
Robust Unsupervised Domain Adaptation through Negative-View Regularization. 2450-2459 - Bastian Wittmann, Johannes C. Paetzold, Chinmay Prabhakar, Daniel Rueckert, Bjoern H. Menze:
Link Prediction for Flow-Driven Spatial Networks. 2460-2469 - Chi-Chih Chang, Yuan-Yao Sung, Shixing Yu, Ning-Chi Huang, Diana Marculescu, Kai-Chiang Wu:
FLORA: Fine-grained Low-Rank Architecture Search for Vision Transformer. 2470-2479 - Neelu Madan, Nicolae-Catalin Ristea, Kamal Nasrollahi, Thomas B. Moeslund, Radu Tudor Ionescu:
CL-MAE: Curriculum-Learned Masked Autoencoders. 2480-2490 - Aral Hekimoglu, Michael Schmidt, Alvaro Marcos-Ramiro:
Active Learning with Task Consistency and Diversity in Multi-Task Networks. 2491-2500 - Sejun Kim, Soonyong Gwon, Kisung Seo:
Enhancing Diverse Intra-identity Representation for Visible-Infrared Person Re-Identification. 2501-2510 - Zhuowei Li, Long Zhao, Zizhao Zhang, Han Zhang, Di Liu, Ting Liu, Dimitris N. Metaxas:
Steering Prototypes with Prompt-tuning for Rehearsal-free Continual Learning. 2511-2521 - Debanjan Goswami, Shayok Chakraborty:
Active Batch Sampling for Multi-label Classification with Binary User Feedback. 2522-2531 - Muhammad Abdullah Jamal, Omid Mohareri:
M33D: Learning 3D priors using Multi-Modal Masked Autoencoders for 2D image and video understanding. 2532-2542 - Robert A. Marsden, Mario Döbler, Bin Yang:
Universal Test-time Adaptation through Weight Ensembling, Diversity Weighting, and Prior Correction. 2543-2553 - Yuzhe Lu, Xinran Liu, Andrea Soltoggio, Soheil Kolouri:
SLoSH: Set Locality Sensitive Hashing via Sliced-Wasserstein Embeddings. 2554-2564 - Lin Zhang, Linghan Xu, Saman Motamed, Shayok Chakraborty, Fernando De la Torre:
D3GU: Multi-target Active Domain Adaptation via Enhancing Domain Alignment. 2565-2574 - Jin Hyuk Lim, SeungBum Ha, Sung Whan Yoon:
MetaVers: Meta-Learned Versatile Representations for Personalized Federated Learning. 2575-2584 - Jiahao Zhang, Bowen Wang, Liangzhi Li, Yuta Nakashima, Hajime Nagahara:
Instruct Me More! Random Prompting for Visual In-Context Learning. 2585-2594 - Soroush Abbasi Koohpayegani, Hamed Pirsiavash:
SimA: Simple Softmax-free Attention for Vision Transformers. 2595-2605 - Jona Otholt, Christoph Meinel, Haojin Yang:
Guided Cluster Aggregation: A Hierarchical Approach to Generalized Category Discovery. 2606-2615 - Nilotpal Sinha, Abd El Rahman Shabayek, Anis Kacem, Peyman Rostami, Carl Shneider, Djamila Aouada:
Hardware Aware Evolutionary Neural Architecture Search using Representation Similarity Metric. 2616-2625 - Rishabh Tiwari, Durga Sivasubramanian, Anmol Reddy Mekala, Ganesh Ramakrishnan, Pradeep Shenoy:
Using Early Readouts to Mediate Featural Bias in Distillation. 2626-2635 - Durga Sivasubramanian, Lokesh Nagalapatti, Rishabh K. Iyer, Ganesh Ramakrishnan:
Gradient Coreset for Federated Learning. 2636-2645 - Yifei Liu, Mathias Gehrig, Nico Messikommer, Marco Cannici, Davide Scaramuzza:
Revisiting Token Pruning for Object Detection and Instance Segmentation. 2646-2656 - Ran Liu, Sahil Khose, Jingyun Xiao, Lakshmi Sathidevi, Keerthan Ramnath, Zsolt Kira, Eva L. Dyer:
LatentDR: Improving Model Generalization Through Sample-Aware Latent Degradation and Restoration. 2657-2667 - Lassi Meronen, Martin Trapp, Andrea Pilzer, Le Yang, Arno Solin:
Fixing Overconfidence in Dynamic Neural Networks. 2668-2678 - Zhengfeng Lai, Haoping Bai, Haotian Zhang, Xianzhi Du, Jiulong Shan, Yinfei Yang, Chen-Nee Chuah, Meng Cao:
Empowering Unsupervised Domain Adaptation with Large-scale Pre-trained Vision-Language Models. 2679-2689 - Manogna Sreenivas, Goirik Chakrabarty, Soma Biswas:
pSTarC: Pseudo Source Guided Target Clustering for Fully Test-Time Adaptation. 2690-2698 - Arshita Gupta, Tien Bau, Joonsoo Kim, Zhe Zhu, Sumit Jha, Hrishikesh Garud:
Torque based Structured Pruning for Deep Neural Network. 2699-2708 - Vinay Kumar Verma, Nikhil Mehta, Kevin J. Liang, Aakansha Mishra, Lawrence Carin:
Meta-Learned Attribute Self-Interaction Network for Continual and Generalized Zero-Shot Learning. 2709-2719 - Jiajing Chen, Minmin Yang, Senem Velipasalar:
Letting 3D Guide the Way: 3D Guided 2D Few-Shot Image Classification. 2720-2728 - Minh Nguyen, Alan Q. Wang, Heejong Kim, Mert R. Sabuncu:
Robust Learning via Conditional Prevalence Adjustment. 2729-2738 - Piotr Teterwak, Soren Nelson, Nikoli Dryden, Dina Bashkirova, Kate Saenko, Bryan A. Plummer:
Learning to Compose SuperWeights for Neural Parameter Allocation Search. 2739-2748 - Zhi-Yi Chin, Chieh-Ming Jiang, Ching-Chun Huang, Pin-Yu Chen, Wei-Chen Chiu:
Masking Improves Contrastive Self-Supervised Learning for ConvNets, and Saliency Tells You Where. 2749-2758 - Yusuke Kanebako:
Critical Gap Between Generalization Error and Empirical Error in Active Learning. 2759-2767 - Yuki Tanaka, Shuhei M. Yoshida, Takashi Shibata, Makoto Terao, Takayuki Okatani, Masashi Sugiyama:
Appearance-Based Curriculum for Semi-Supervised Learning with Multi-Angle Unlabeled Data. 2768-2777 - Toan Nguyen, Kien Do, Bao Duong, Thin Nguyen:
Domain Generalisation via Risk Distribution Matching. 2778-2787 - Chau Pham, Piotr Teterwak, Soren Nelson, Bryan A. Plummer:
MixtureGrowth: Growing Neural Networks by Recombining Learned Parameters. 2788-2797 - Carlo Metta, Marco Fantozzi, Andrea Papini, Gianluca Amato, Matteo Bergamaschi, Silvia Giulia Galfrè, Alessandro Marchetti, Michelangelo Vegliò, Maurizio Parton, Francesco Morandin:
Increasing biases can be more efficient than increasing weights. 2798-2807 - Yewei Zhao, Hu Han, Shiguang Shan, Xilin Chen:
Deep Subdomain Alignment for Cross-domain Image Classification. 2808-2817 - Joshua Niemeijer, Manuel Schwonberg, Jan-Aike Termöhlen, Nico M. Schmidt, Tim Fingscheidt:
Generalization by Adaptation: Diffusion-Based Domain Extension for Domain-Generalized Semantic Segmentation. 2818-2828 - Chi Ian Tang, Lorena Qendro, Dimitris Spathis, Fahim Kawsar, Cecilia Mascolo, Akhil Mathur:
Kaizen: Practical self-supervised continual learning with continual fine-tuning. 2829-2838 - Alokendu Mazumder, Tirthajit Baruah, Bhartendu Kumar, Rishab Sharma, Vishwajeet Pattanaik, Punit Rathore:
Learning Low-Rank Latent Spaces with Simple Deterministic Autoencoder: Theoretical and Empirical Insights. 2839-2848 - Matteo Destro, Michael Gygli:
CycleCL: Self-supervised Learning for Periodic Videos. 2849-2858 - Dhruv Kudale, Badri Vishal Kasuba, Venkatapathy Subramanian, Parag Chaudhuri, Ganesh Ramakrishnan:
Textron: Weakly Supervised Multilingual Text Detection through Data Programming. 2859-2868 - Nathan Beck, Krishnateja Killamsetty, Suraj Kothawade, Rishabh K. Iyer:
Beyond Active Learning: Leveraging the Full Potential of Human Interaction via Auto-Labeling, Human Correction, and Human Verification. 2869-2877 - Paul Grimal, Hervé Le Borgne, Olivier Ferret, Julien Tourille:
TIAM - A Metric for Evaluating Alignment in Text-to-Image Generation. 2878-2887 - Shoma Iwai, Tomo Miyazaki, Shinichiro Omachi:
Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model. 2888-2897 - Zhaoyu Zhang, Yang Hua, Hui Wang, Seán F. McLoone:
Improving the Fairness of the Min-Max Game in GANs Training. 2898-2907 - Xichen Pan, Pengda Qin, Yuhong Li, Hui Xue, Wenhu Chen:
Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models. 2908-2918 - Gayatri Deshmukh, Onkar Susladkar, Dhruv Makwana, Sparsh Mittal, R. Sai Chandra Teja:
Textual Alchemy: CoFormer for Scene Text Understanding. 2919-2929 - Siddharth Katageri, Arkadipta De, Chaitanya Devaguptapu, V. S. S. V. Prasad, Charu Sharma, Manohar Kaul:
Synergizing Contrastive Learning and Optimal Transport for 3D Point Cloud Domain Adaptation. 2930-2939 - Peter Ebert Christensen, Vésteinn Snæbjarnarson, Andrea Dittadi, Serge J. Belongie, Sagie Benaim:
Assessing Neural Network Robustness via Adversarial Pivotal Tuning. 2940-2949 - Jordy Van Landeghem, Sanket Biswas, Matthew B. Blaschko, Marie-Francine Moens:
Beyond Document Page Classification: Design, Datasets, and Challenges. 2950-2960 - Gianni Franchi, Marwane Hariat, Xuanlong Yu, Nacim Belkhir, Antoine Manzanera, David Filliat:
InfraParis: A multi-modal and multi-task autonomous driving dataset. 2961-2971 - Md. Mehrab Tanjim, Krishna Kumar Singh, Kushal Kafle, Ritwik Sinha, Garrison W. Cottrell:
Discovering and Mitigating Biases in CLIP-based Image Editing. 2972-2981 - Xuefeng Hu, Ke Zhang, Lu Xia, Albert Chen, Jiajia Luo, Yuyin Sun, Ken Wang, Nan Qiao, Xiao Zeng, Min Sun, Cheng-Hao Kuo, Ram Nevatia:
ReCLIP: Refine Contrastive Language Image Pre-Training with Source Free Domain Adaptation. 2982-2991 - Minsoo Lee, Hyunmin Lee, Bumsoo Kim, Seunghwan Kim:
UNSPAT: Uncertainty-Guided SpatioTemporal Transformer for 3D Human Pose and Shape Estimation on Videos. 2992-3001 - Md. Amirul Islam, Seyed Shahabeddin Nabavi, Irina Kezele, Yang Wang, Yuanhao Yu, Jin Tang:
Visually Guided Audio Source Separation with Meta Consistency Learning. 3002-3011 - Pan Xie, Taiying Peng, Yao Du, Qipeng Zhang:
Sign Language Production with Latent Motion Transformer. 3012-3022 - Quanzhou Li, Jingbo Wang, Chen Change Loy, Bo Dai:
Task-Oriented Human-Object Interactions Generation with Implicit Neural Representations. 3023-3032 - Xihao Chen, Wenming Weng, Yueyi Zhang, Zhiwei Xiong:
Depth from Asymmetric Frame-Event Stereo: A Divide-and-Conquer Approach. 3033-3042 - Ananta R. Bhattarai, Matthias Nießner, Artem Sevastopolsky:
TriPlaneNet: An Encoder for EG3D Inversion. 3043-3053 - Deepti Hegde, Vishal M. Patel:
Attentive Prototypes for Source-free Unsupervised Domain Adaptive 3D Object Detection. 3054-3064 - Tarun Yenamandra, Ayush Tewari, Nan Yang, Florian Bernard, Christian Theobalt, Daniel Cremers:
FIRe: Fast Inverse Rendering using Directional and Signed Distance Functions. 3065-3075 - Thibaud Ehret, Roger Marí, Gabriele Facciolo:
A generic and flexible regularization framework for NeRFs. 3076-3085 - Ziwei Liao, Steven L. Waslander:
Multi-view 3D Object Reconstruction and Uncertainty Modelling with Neural Shape Prior. 3086-3095 - Angtian Wang, Wufei Ma, Alan L. Yuille, Adam Kortylewski:
Neural Textured Deformable Meshes for Robust Analysis-by-Synthesis. 3096-3105 - Weijian Deng, Dylan Campbell, Chunyi Sun, Shubham Kanitkar, Matthew E. Shaffer, Stephen Gould:
Ray Deformation Networks for Novel View Synthesis of Refractive Objects. 3106-3116 - Pit Henrich, Balázs Gyenes, Paul Maria Scheikl, Gerhard Neumann, Franziska Mathis-Ullrich:
Registered and Segmented Deformable Object Reconstruction from a Single View Point Cloud. 3117-3126 - Mohammed Brahimi, Bjoern Haefner, Tarun Yenamandra, Bastian Goldluecke, Daniel Cremers:
SupeRVol: Super-Resolution Shape and Reflectance Estimation in Inverse Volume Rendering. 3127-3137 - Bahareh Shakibajahromi, Edward Kim, David E. Breen:
RIMeshGNN: A Rotation-Invariant Graph Neural Network for Mesh Classification. 3138-3148 - Rahul Ahuja, Chris L. Baker, Wilko Schwarting:
OptFlow: Fast Optimization-based Scene Flow Estimation without Supervision. 3149-3158 - Byeongjun Park, Changick Kim:
Point-DynRF: Point-based Dynamic Radiance Fields from a Monocular Video. 3159-3169 - Min-Jung Kim, Gyojung Gu, Jaegul Choo:
LensNeRF: Rethinking Volume Rendering based on Thin-Lens Camera Model. 3170-3179 - Harsh Pal, Ritwik Khandelwal, Shivam Pande, Biplab Banerjee, Srikrishna Karanam:
Domain Adaptive 3D Shape Retrieval from Monocular Images. 3180-3189 - Jinbo Wu, Xiaobo Gao, Xing Liu, Zhengyang Shen, Chen Zhao, Haocheng Feng, Jingtuo Liu, Errui Ding:
HD-Fusion: Detailed Text-to-3D Generation Leveraging Multiple Noise Estimation. 3190-3199 - Tao Wang, Jing Wu, Ze Ji, Yu-Kun Lai:
Sparse Convolutional Networks for Surface Reconstruction from Noisy Point Clouds. 3200-3209 - Suresh Guttikonda, Jason R. Rambach:
Single Frame Semantic Segmentation Using Multi-Modal Spherical Images. 3210-3219 - Edgar Medina, Leyong Loh, Namrata Gurung, Kyung Hun Oh, Niels Heller:
Context-based Interpretable Spatio-Temporal Graph Convolutional Network for Human Motion Forecasting. 3220-3229 - Vibhas K. Vats, Sripad Joshi, David J. Crandall, Md. Alimoor Reza, Soon-Heung Jung:
GC-MVSNet: Multi-View, Multi-Scale, Geometrically-Consistent Multi-View Stereo. 3230-3240 - Weiyi Xie, Nathalie Willems, Shubham Patil, Yang Li, Mayank Kumar:
SAM Fewshot Finetuning for Anatomical Segmentation in Medical Images. 3241-3249 - Xu Song, Hao Kang, Atsunori Moteki, Genta Suzuki, Yoshie Kobayashi, Zhiming Tan:
MSCC: Multi-Scale Transformers for Camera Calibration. 3250-3259 - Ai Matsune, Shichen Hu, Guangquan Li, Sihan Wen, Xiantan Zhu, Zhiming Tan:
A Geometry Loss Combination for 3D Human Pose Estimation. 3260-3269 - Zizhang Wu, Yunzhe Wu, Xiaoquan Wang, Yuanzhu Gan, Jian Pu:
A Robust Diffusion Modeling Framework for Radar Camera 3D Object Detection. 3270-3280 - Mohang Zhang, Yushi Li, Rong Chen, Yushan Pan, Jia Wang, Yunzhe Wang, Rong Xiang:
WalkFormer: Point Cloud Completion via Guided Walks. 3281-3290 - Zhangsihao Yang, Kaize Ding, Huan Liu, Yalin Wang:
MGM-AE: Self-Supervised Learning on 3D Shape Using Mesh Graph Masked Autoencoders. 3291-3301 - Haorui Ji, Hui Deng, Yuchao Dai, Hongdong Li:
Unsupervised 3D Pose Estimation with Non-Rigid Structure-from-Motion Modeling. 3302-3311 - Qiuxiao Chen, Xiaojun Qi:
Residual Graph Convolutional Network for Bird's-Eye-View Semantic Segmentation. 3312-3319 - Yingfeng Wang, Zhengwei Wang, Muyu Li, Hong Yan:
3D Human Pose Estimation with Two-step Mixed-Training Strategy. 3320-3329 - Andreas Langeland Teigen, Yeonsoo Park, Annette Stahl, Rudolf Mester:
RGB-D Mapping and Tracking in a Plenoxel Radiance Field. 3330-3339 - Chengjie Huang, Vahdat Abdelzad, Sean Sedwards, Krzysztof Czarnecki:
SOAP: Cross-sensor Domain Adaptation for 3D Object Detection Using Stationary Object Aggregation Pseudo-labelling. 3340-3349 - Zhenjun Zhao:
BALF: Simple and Efficient Blur Aware Local Feature Detector. 3350-3360 - Yunsheng Ma, Juanwu Lu, Can Cui, Sicheng Zhao, Xu Cao, Wenqian Ye, Ziran Wang:
MACP: Efficient Model Adaptation for Cooperative Perception. 3361-3370 - Georg Krispel, David Schinagl, Christian Fruhwirth-Reisinger, Horst Possegger, Horst Bischof:
MAELi: Masked Autoencoder for Large-Scale LiDAR Point Clouds. 3371-3380 - Weiyi Xue, Fan Lu, Guang Chen:
HDMNet: A Hierarchical Matching Network with Double Attention for Large-scale Outdoor LiDAR Point Cloud Registration. 3381-3391 - Sebastian Koch, Pedro Hermosilla, Narunas Vaskevicius, Mirco Colosi, Timo Ropinski:
SGRec3D: Self-Supervised 3D Scene Graph Learning via Object-Level Scene Reconstruction. 3392-3402 - Kevin Lin, Chung-Ching Lin, Lin Liang, Zicheng Liu, Lijuan Wang:
MPT: Mesh Pre-Training with Transformers for Human Pose and Mesh Reconstruction. 3403-3413 - Peter Hardy, Hansung Kim:
LInKs "Lifting Independent Keypoints" - Partial Pose Lifting for Occlusion Handling with Improved Accuracy in 2D-3D Human Pose Estimation. 3414-3423 - Matthias Wödlinger, Jan Kotera, Manuel Keglevic, Jan Xu, Robert Sablatnig:
ECSIC: Epipolar Cross Attention for Stereo Image Compression. 3424-3433 - Jiahao Yang, Wufei Ma, Angtian Wang, Xiaoding Yuan, Alan L. Yuille, Adam Kortylewski:
Robust Category-Level 3D Pose Estimation from Diffusion-Enhanced Synthetic Data. 3434-3443 - Hao Zhang, Fang Li, Narendra Ahuja:
Open-NeRF: Towards Open Vocabulary NeRF Decomposition. 3444-3453 - Rafael Weilharter, Friedrich Fraundorfer:
HAMMER: Learning Entropy Maps to Create Accurate 3D Models in Multi-View Stereo. 3454-3463 - Jinyu Zhao, Jumpei Oishi, Yusuke Monno, Masatoshi Okutomi:
Polarimetric PatchMatch Multi-View Stereo. 3464-3472 - Lars Haalck, Benjamin Risse:
Solving the Plane-Sphere Ambiguity in Top-Down Structure-from-Motion. 3473-3481 - Miaowei Wang, Daniel D. Morris:
Self-Annotated 3D Geometric Learning for Smeared Points Removal. 3482-3491 - Jianwei Feng, Prateek Singhal:
3D Face Style Transfer with a Hybrid Solution of NeRF and Mesh Rasterization. 3492-3501 - Jianchun Chen, Jayakorn Vongkulbhisal, Fernando De la Torre Frade:
A Sequential Learning-based Approach for Monocular Human Performance Capture. 3502-3511 - Ahmed Abdelreheem, Kyle Olszewski, Hsin-Ying Lee, Peter Wonka, Panos Achlioptas:
ScanEnts3D: Exploiting Phrase-to-3D-Object Correspondences for Improved Visio-Linguistic Models in 3D Scenes. 3512-3522 - Zihua Liu, Yizhou Li, Masatoshi Okutomi:
Global Occlusion-Aware Transformer for Robust Stereo Matching. 3523-3532 - Renat Bashirov, Alexey Larionov, Evgeniya Ustinova, Mikhail Sidorenko, David Svitov, Ilya Zakharkin, Victor Lempitsky:
MoRF: Mobile Realistic Fullbody Avatars from a Monocular Video. 3533-3543 - Anh-Thuan Tran, Hoanh-Su Le, Suk-Hwan Lee, Ki-Ryong Kwon:
PointCT: Point Central Transformer Network for Weakly-supervised Point Cloud Semantic Segmentation. 3544-3553 - Maksim Kolodiazhnyi, Anna Vorontsova, Anton Konushin, Danila Rukhovich:
Top-Down Beats Bottom-Up in 3D Instance Segmentation. 3554-3562 - Qiuhui Chen, Qiang Fu, Hao Bai, Yi Hong:
LongFormer: Longitudinal Transformer for Alzheimer's Disease Classification with Structural MRIs. 3563-3572 - Vishal Vinod, Tanmay Shah, Dmitry Lagun:
TEGLO: High Fidelity Canonical Texture Mapping from Single-View Images. 3573-3583 - Sitian Shen, Zilin Zhu, Linqian Fan, Harry Zhang, Xinxiao Wu:
DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification. 3584-3593 - Son Tung Nguyen, Alejandro Fontán, Michael Milford, Tobias Fischer:
FocusTune: Tuning Visual Localization through Focus-Guided Sampling. 3594-3603 - Yuya Matsumoto, Gaku Nakano, Kazumine Ogura:
Indoor Visual Localization using Point and Line Correspondences in dense colored point cloud. 3604-3613 - Yeonjin Chang, Yearim Kim, Seunghyeon Seo, Jung Yi, Nojun Kwak:
Fast Sun-aligned Outdoor Scene Relighting based on TensoRF. 3614-3624 - Rémi Marsal, Florian Chabot, Angelique Loesch, William Grolleau, Hichem Sahbi:
MonoProb: Self-Supervised Monocular Depth Estimation with Interpretable Uncertainty. 3625-3634 - Akash Karthikeyan, Robert Ren, Yash Kant, Igor Gilitschenski:
AvatarOne: Monocular 3D Human Animation. 3635-3645 - Achleshwar Luthra, Shiva Souhith Gantha, Xiyun Song, Heather Yu, Zongfang Lin, Liang Peng:
Deblur-NSFF: Neural Scene Flow Fields for Blurry Dynamic Scenes. 3646-3655 - Minmin Yang, Weiheng Chai, Jiyang Wang, Senem Velipasalar:
SimpliMix: A Simplified Manifold Mixup for Few-shot Point Cloud Classification. 3656-3665 - Chushan Zhang, Jinguang Tong, Tao Jun Lin, Chuong Nguyen, Hongdong Li:
PMVC: Promoting Multi-View Consistency for 3D Scene Reconstruction. 3666-3676 - Yifan Wang, Yi Gong, Yuan Zeng:
Hyb-NeRF: A Multiresolution Hybrid Encoding for Neural Radiance Fields. 3677-3686 - Hermes McGriff, Renato Martins, Nicolas Andreff, Cédric Demonceaux:
Joint 3D Shape and Motion Estimation from Rolling Shutter Light-Field Images. 3687-3696 - Byeonghyeon Lee, Howoong Lee, Usman Ali, Eunbyung Park:
Sharp-NeRF: Grid-based Fast Deblurring Neural Radiance Fields using Sharpness Prior. 3697-3706 - Qingtao Yu, Heming Du, Chen Liu, Xin Yu:
When 3D Bounding-Box Meets SAM: Point Cloud Instance Segmentation with Weak-and-Noisy Supervision. 3707-3716 - Houda Saffi, Naima Otberdout, Youssef Hmamouche, Amal El Fallah Seghrouchni:
Auto-BPA: An Enhanced Ball-Pivoting Algorithm with Adaptive Radius using Contextual Bandits. 3717-3725 - Aashish Rai, Hiresh Gupta, Ayush Pandey, Francisco Vicente Carrasco, Shingo Jason Takagi, Amaury Aubel, Daeil Kim, Aayush Prakash, Fernando De la Torre:
Towards Realistic Generative 3D Face Models. 3726-3736 - Lahiru N. S. Wijayasingha, Homa Alemzadeh, John A. Stankovic:
Camera-Independent Single Image Depth Estimation from Defocus Blur. 3737-3746 - Jiaxu Liu, Zhengdi Yu, Toby P. Breckon, Hubert P. H. Shum:
U3DS3: Unsupervised 3D Semantic Scene Segmentation. 3747-3756 - Runsong Zhu, Di Kang, Ka-Hei Hui, Yue Qian, Shi Qiu, Zhen Dong, Linchao Bao, Pheng-Ann Heng, Chi-Wing Fu:
SSP: Semi-signed prioritized neural fitting for surface reconstruction from unoriented point clouds. 3757-3766 - Omri Ben-Dov, Pravir Singh Gupta, Victoria Fernández Abrevaya, Michael J. Black, Partha Ghosh:
Adversarial Likelihood Estimation With One-Way Flows. 3767-3776 - Sungmin Cha, Naeun Ko, Heewoong Choi, Youngjoon Yoo, Taesup Moon:
NCIS: Neural Contextual Iterative Smoothing for Purifying Adversarial Perturbations. 3777-3787 - Akshay Mehra, Yunbei Zhang, Bhavya Kailkhura, Jihun Hamm:
On the Fly Neural Style Smoothing for Risk-Averse Domain Generalization. 3788-3799 - Ashish Hooda, Neal Mangaokar, Ryan Feng, Kassem Fawaz, Somesh Jha, Atul Prakash:
D4: Detection of Adversarial Diffusion Deepfakes Using Disjoint Ensembles. 3800-3810 - Akshit Jindal, Vikram Goyal, Saket Anand, Chetan Arora:
Army of Thieves: Enhancing Black-Box Model Extraction via Ensemble based sample selection. 3811-3820 - Abhijith Sharma, Phil Munz, Apurva Narayan:
Assist Is Just as Important as the Goal: Image Resurfacing to Aid Model's Robust Prediction. 3821-3830 - Roy Ganz, Michael Elad:
CLIPAG: Towards Generator-Free Text-to-Image Generation. 3831-3841 - Mark Ofori-Oduro, Maria A. Amer:
Defending Object Detection Models against Image Distortions. 3842-3851 - Gerhard Krumpl, Henning Avenhaus, Horst Possegger, Horst Bischof:
ATS: Adaptive Temperature Scaling for Enhancing Out-of-Distribution Detection Methods. 3852-3861 - Akshayvarun Subramanya, Soroush Abbasi Koohpayegani, Aniruddha Saha, Ajinkya Tejankar, Hamed Pirsiavash:
A Closer Look at Robustness of Vision Transformers to Backdoor Attacks. 3862-3871 - Feng Wang, Senem Velipasalar, Mustafa Cenk Gursoy:
Maximum Knowledge Orthogonality Reconstruction with Gradients in Federated Learning. 3872-3881 - Marwane Hariat, Olivier Laurent, Rémi Kazmierczak, Shihao Zhang, Andrei Bursuc, Angela Yao, Gianni Franchi:
Learning to generate training datasets for robust semantic segmentation. 3882-3893 - Kira Maag, Asja Fischer:
Uncertainty-weighted Loss Functions for Improved Adversarial Attacks on Semantic Segmentation. 3894-3902 - Teng-Fang Hsiao, Bo-Lun Huang, Zi-Xiang Ni, Yan-Ting Lin, Hong-Han Shuai, Yung-Hui Li, Wen-Huang Cheng:
Natural Light Can Also be Dangerous: Traffic Sign Misinterpretation Under Adversarial Natural Light Attacks. 3903-3912 - Matías Tailanián, Marina Gardella, Álvaro Pardo, Pablo Musé:
Diffusion models meet image counter-forensics. 3913-3923 - Inder Pal Singh, Enjie Ghorbel, Anis Kacem, Arunkumar Rathinam, Djamila Aouada:
Discriminator-free Unsupervised Domain Adaptation for Multi-label Image Classification. 3924-3933 - Kenichiro Fukushi, Yoshitaka Nozaki, Kosuke Nishihara, Kentaro Nakahara:
Few-shot generative model for skeleton-based human action synthesis using cross-domain adversarial learning. 3934-3943 - Shaltiel Eloul, Fran Silavong, Sanket Kamthe, Antonios Georgiadis, Sean J. Moran:
Mixing Gradients in Neural Networks as a Strategy to Enhance Privacy in Federated Learning. 3944-3953 - Yaxin Li, Jie Ren, Han Xu, Hui Liu:
Neural Style Protection: Counteracting Unauthorized Neural Style Transfer. 3954-3963 - Gihyun Kim, Juyeop Kim, Jong-Seok Lee:
Exploring Adversarial Robustness of Vision Transformers in the Spectral Perspective. 3964-3973 - Jeonghwan Park, Paul Miller, Niall McLaughlin:
Hard-label based Small Query Black-box Adversarial Attack. 3974-3983 - Gilad Cohen, Raja Giryes:
Simple Post-Training Robustness using Test Time Augmentations and Random Forest. 3984-3994 - Esam Ghaleb, Ilya Burenko, Marlou Rasenberg, Wim T. J. L. Pouw, Peter Uhrig, Judith Holler, Ivan Toni, Asli Özyürek, Raquel Fernández:
Co-Speech Gesture Detection through Multi-Phase Sequence Labeling. 3995-4003 - Minsu Kim, Yongjun Lee, Woo Kyoung Han, Kyong Hwan Jin:
Learning Residual Elastic Warps for Image Stitching under Dirichlet Boundary Condition. 4004-4012 - Martin Nicolas Everaert, Athanasios Fitsios, Marco Bocchio, Sami Arpa, Sabine Süsstrunk, Radhakrishna Achanta:
Exploiting the Signal-Leak Bias in Diffusion Models. 4013-4022 - Håkon Hukkelås, Frank Lindseth:
Synthesizing Anyone, Anywhere, in Any Pose. 4023-4034 - Takafumi Iwaguchi, Hiroyuki Kubo, Hiroshi Kawasaki:
Specular Object Reconstruction Behind Frosted Glass by Differentiable Rendering. 4035-4044 - Hemanth Pidaparthy, Abhay Chauhan, Pavan Sudheendra:
Multi-level Attention Aggregation for Aesthetic Face Relighting. 4045-4054 - Sergey Sinitsa, Ohad Fried:
Deep Image Fingerprint: Towards Low Budget Synthetic Image Detection and Model Lineage Analysis. 4055-4064 - Evangelos Ntavelis, Mohamad Shahbazi, Iason Kastanis, Martin Danelljan, Luc Van Gool:
StyleGenes: Discrete and Efficient Latent Distributions for GANs. 4065-4074 - Minsu Kim, Jaewon Lee, Byeonghun Lee, Sunghoon Im, Kyong Hwan Jin:
Implicit Neural Image Stitching With Enhanced and Blended Feature Reconstruction. 4075-4084 - Andreas Aakerberg, Majed El Helou, Kamal Nasrollahi, Thomas B. Moeslund:
PDA-RWSR: Pixel-Wise Degradation Adaptive Real-World Super-Resolution. 4085-4095 - Sudheer Achary, Rohit Girmaji, Adhiraj Anil Deshmukh, Vineet Gandhi:
Real Time GAZED: Online Shot Selection and Editing of Virtual Cameras from Wide-Angle Monocular Video Recordings. 4096-4104 - Anuj Fulari, Satish Mulleti, Ajit Rajwade:
Unsupervised Model-based Learning for Simultaneous Video Deflickering and Deblotching. 4105-4113 - Christoph Reich, Biplob Debnath, Deep Patel, Srimat Chakradhar:
Differentiable JPEG: The Devil is in the Details. 4114-4123 - Zeyu Xiao, Yurui Zhu, Xueyang Fu, Zhiwei Xiong:
TSA2: Temporal Segment Adaptation and Aggregation for Video Harmonization. 4124-4133 - Cindy M. Nguyen, Eric R. Chan, Alexander W. Bergman, Gordon Wetzstein:
Diffusion in the Dark: A Diffusion Model for Low-Light Text Recognition. 4134-4145 - Yatao Zhong, Ilya Zharkov:
Lightweight Portrait Matting via Regional Attention and Refinement. 4146-4155 - Dongyeun Lee, Chaewon Kim, Sangjoon Yu, Jaejun Yoo, Gyeong-Moon Park:
RADIO: Reference-Agnostic Dubbing Video Synthesis. 4156-4166 - Gereon Fox, Xingang Pan, Ayush Tewari, Mohamed Elgharib, Christian Theobalt:
Unsupervised Event-Based Video Reconstruction. 4167-4176 - Wei Jiang, Wei Wang, Yue Chen:
Neural Image Compression Using Masked Sparse Visual Representation. 4177-4185 - Dong Huk Park, Grace Luo, Clayton Toste, Samaneh Azadi, Xihui Liu, Maka Karalashvili, Anna Rohrbach, Trevor Darrell:
Shape-Guided Diffusion with Inside-Outside Attention. 4186-4195 - Vishnu Sarukkai, Linden Li, Arden Ma, Christopher Ré, Kayvon Fatahalian:
Collage Diffusion. 4196-4205 - Sreenithy Chandran, Tatsuya Yatagawa, Hiroyuki Kubo, Suren Jayasuriya:
Learning-based Spotlight Position Optimization for Non-Line-of-Sight Human Localization and Posture Classification. 4206-4215 - Zhewei Huang, Ailin Huang, Xiaotao Hu, Chen Hu, Jun Xu, Shuchang Zhou:
Scale-Adaptive Feature Aggregation for Efficient Space-Time Video Super-Resolution. 4216-4227 - Pu Cao, Lu Yang, Dongxv Liu, Xiaoya Yang, Tianrui Huang, Qing Song:
What Decreases Editing Capability? Domain-Specific Hybrid Refinement for Improved GAN Inversion. 4228-4237 - Wenjie Yang, Ning Xu, Yifei Fan:
Latent-Guided Exemplar-Based Image Re-Colorization. 4238-4247 - Lukas Mehl, Andrés Bruhn, Markus Gross, Christopher Schroers:
Stereo Conversion with Disparity-Aware Warping, Compositing and Inpainting. 4248-4257 - Yi-Ting Tsai, Yu Wei Chen, Hong-Han Shuai, Ching-Chun Huang:
Arbitrary-Resolution and Arbitrary-Scale Face Super-Resolution with Implicit Representation Networks. 4258-4267 - Dawit Mureja Argaw, Junsik Kim, In So Kweon:
Blurry Video Compression: A Trade-off between Visual Enhancement and Data Compression. 4268-4278 - Ligong Han, Song Wen, Qi Chen, Zhixing Zhang, Kunpeng Song, Mengwei Ren, Ruijiang Gao, Anastasis Stathopoulos, Xiaoxiao He, Yuxiao Chen, Di Liu, Qilong Zhangli, Jindong Jiang, Zhaoyang Xia, Akash Srivastava, Dimitris N. Metaxas:
ProxEdit: Improving Tuning-Free Real Image Editing with Proximal Guidance. 4279-4289 - Boyang Wang, Bowen Liu, Shiyu Liu, Fengyu Yang:
VCISR: Blind Single Image Super-Resolution with Video Compression Synthetic Data. 4290-4300 - Kangfu Mei, Luis Figueroa, Zhe Lin, Zhihong Ding, Scott Cohen, Vishal M. Patel:
Latent Feature-Guided Diffusion Models for Shadow Removal. 4301-4310 - Ties van Rozendaal, Tushar Singhal, Hoang Le, Guillaume Sautière, Amir Said, Krishna Buska, Anjuman Raha, Dimitris Kalatzis, Hitarth Mehta, Frank Mayer, Liang Zhang, Markus Nagel, Auke J. Wiggers:
MobileNVC: Real-time 1080p Neural Video Compression on a Mobile Device. 4311-4321 - Ciprian A. Corneanu, Raghudeep Gadde, Aleix M. Martínez:
LatentPaint: Image Inpainting in Latent Space with Diffusion Models. 4322-4331 - Ortal Glatt, Yotam Ater, Woo-Shik Kim, Shira Werman, Oded Berby, Yael Zini, Shay Zelinger, Sangyoon Lee, Heejin Choi, Evgeny Soloveichik:
Beyond RGB: A Real World Dataset for Multispectral Imaging in Mobile Devices. 4332-4342 - Yizhak Ben-Shabat, Jonathan Paul, Eviatar Segev, Oren Shrout, Stephen Gould:
IKEA Ego 3D Dataset: Understanding furniture assembly actions from ego-view 3D Point Clouds. 4343-4352 - Tim J. Schoonbeek, Tim Houben, Hans Onvlee, Peter H. N. de With, Fons van der Sommen:
IndustReal: A Dataset for Procedure Step Recognition Handling Execution Errors in Egocentric Videos in an Industrial-Like Setting. 4353-4362 - Fanqing Lin, Tony R. Martinez:
Ego2HandsPose: A Dataset for Egocentric Two-hand 3D Global Pose Estimation. 4363-4371 - Nicolas Gorlo, Kenneth Blomqvist, Francesco Milano, Roland Siegwart:
ISAR: A Benchmark for Single- and Few-Shot Object Instance Segmentation and Re-Identification. 4372-4384 - Joshua Feinglass, Yezhou Yang:
Towards Addressing the Misalignment of Object Proposal Evaluation for Vision-Language Tasks via Semantic Grounding. 4385-4395 - Christiano Couto Gava, Yunmin Cho, Federico Raue, Sebastian Palacio, Alain Pagani, Andreas Dengel:
SphereCraft: A Dataset for Spherical Keypoint Detection, Matching and Camera Pose Estimation. 4396-4405 - Soumyendu Sarkar, Ashwin Ramesh Babu, Sajad Mousavi, Zachariah Carmichael, Vineet Gundecha, Sahand Ghorbanpour, Ricardo Luna Gutierrez, Antonio Guillen, Avisek Naug:
Benchmark Generation Framework with Customizable Distortions for Image Classifier Robustness. 4406-4415 - Ly Bui, Son Lam Phung, Yang Di, Hoang Thanh Le, Tran Thanh Phong Nguyen, Sandy Burden, Abdesselam Bouzerdoum:
UOW-Vessel: A Benchmark Dataset of High-Resolution Optical Satellite Images for Vessel Detection and Segmentation. 4416-4424 - Thorsten Hempel, Magnus Jung, Ahmed A. Abdelrahman, Ayoub Al-Hamadi:
NITEC: Versatile Hand-Annotated Eye Contact Dataset for Ego-Vision Interaction. 4425-4434 - Thomas Rothmeier, Werner Huber, Alois C. Knoll:
Time to Shine: Fine-Tuning Object Detection Models with Synthetic Adverse Weather Images. 4435-4444 - Janis Rosskamp, René Weller, Gabriel Zachmann:
Effects of Markers in Training Datasets on the Accuracy of 6D Pose Estimation. 4445-4454 - Zhihang Ren, Jefferson Ortega, Yifan Wang, Zhimin Chen, Yunhui Guo, Stella X. Yu, David Whitney:
VEATIC: Video-based Emotion and Affect Tracking in Context Dataset. 4455-4465 - Xiulong Liu, Zhikang Dong, Peng Zhang:
Tackling Data Bias in MUSIC-AVQA: Crafting a Balanced Dataset for Unbiased Question-Answering. 4466-4475 - Prakhar Ganesh:
An Empirical Investigation into Benchmarking Model Multiplicity for Trustworthy Machine Learning: A Case Study on Image Classification. 4476-4485 - Sai Raam Venkataraman, Rishi Sridhar Rao, S. Balasubramanian, R. Raghunatha Sarma, Chandra Sekhar Vorugunti:
Can you even tell left from right? Presenting a new challenge for VQA. 4486-4495 - Xuqian Ren, Wenjia Wang, Dingding Cai, Tuuli Tuominen, Juho Kannala, Esa Rahtu:
MuSHRoom: Multi-Sensor Hybrid Room Dataset for Joint 3D Reconstruction and Novel View Synthesis. 4496-4505 - Nathan Drenkow, Mathias Unberath:
RobustCLEVR: A Benchmark and Framework for Evaluating Robustness in Object-centric Learning. 4506-4515 - Derek Gloudemans, Gergely Zachár, Yanbing Wang, Junyi Ji, Matthew Nice, Matt Bunting, William Barbour, Jonathan Sprinkle, Benedetto Piccoli, Maria Laura Delle Monache, Alexandre M. Bayen, Benjamin Seibold, Daniel B. Work:
So you think you can track? 4516-4526 - Ensiyeh Keshtkaran, Brodie von Berg, Grant Regan, David Suter, Syed Zulqarnain Gilani:
Estimating Blood Alcohol Level Through Facial Features for Driver Impairment Assessment. 4527-4536 - Francesco Ragusa, Rosario Leonardi, Michele Mazzamuto, Claudia Bonanno, Rosario Scavo, Antonino Furnari, Giovanni Maria Farinella:
ENIGMA-51: Towards a Fine-Grained Understanding of Human Behavior in Industrial Scenarios. 4537-4547 - Tim Tarsi, Heike Adel, Jan Hendrik Metzen, Dan Zhang, Matteo Finco, Annemarie Friedrich:
SciOL and MuLMS-Img: Introducing A Large-Scale Multimodal Scientific Dataset and Models for Image-Text Tasks in the Scientific Domain. 4548-4559 - Kapitanov Alexander, Kvanchiani Karina, Nagaev Alexander, Kraynov Roman, Makhliarchuk Andrei:
HaGRID - HAnd Gesture Recognition Image Dataset. 4560-4569 - Marius Schubert, Tobias Riedlinger, Karsten Kahl, Daniel Kröll, Sebastian Schoenen, Sinisa Segvic, Matthias Rottmann:
Identifying Label Errors in Object Detection Datasets by Loss Inspection. 4570-4579 - Stanislav Panev, Emily Kim, Sai Abhishek Si Namburu, Desislava Nikolova, Celso de Melo, Fernando De la Torre, Jessica K. Hodgins:
Exploring the Impact of Rendering Method and Motion Quality on Model Performance when Using Multi-view Synthetic Data for Action Recognition. 4580-4590 - Adrian Cosma, Ion Emilian Radoi:
PsyMo: A Dataset for Estimating Self-Reported Psychological Traits from Gait. 4591-4601 - Furqan Ahmed Shaik, Abhishek Reddy Malreddy, Nikhil Reddy Billa, Kunal Chaudhary, Sunny Manchanda, Girish Varma:
IDD-AW: A Benchmark for Safe and Robust Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather. 4602-4611 - Jens Parslov, Erik Riise, Dim P. Papadopoulos:
CrashCar101: Procedural Generation for Damage Assessment. 4612-4622 - Isaac Corley, Jonathan Lwowski, Peyman Najafirad:
ZRG: A Dataset for Multimodal 3D Residential Rooftop Understanding. 4623-4631 - Shimon Malnick, Shai Avidan, Ohad Fried:
Taming Normalizing Flows. 4632-4642 - Yan Ju, Shu Hu, Shan Jia, George H. Chen, Siwei Lyu:
Improving Fairness in Deepfake Detection. 4643-4653 - Rahul Venkataramani, Parag Dutta, Vikram Melapudi, Ambedkar Dukkipati:
Causal Feature Alignment: Learning to Ignore Spurious Background Features. 4654-4662 - Seongbeom Park, Suhong Moon, Seunghyun Park, Jinkyu Kim:
Localization and Manipulation of Immoral Visual Cues for Safe Text-to-Image Generation. 4663-4672 - Ola Ahmad, Nicolas Béreux, Loïc Baret, Vahid Hashemi, Freddy Lécué:
Causal Analysis for Robust Interpretability of Neural Networks. 4673-4682 - Moreno D'Incà, Christos Tzelepis, Ioannis Patras, Nicu Sebe:
Improving Fairness using Vision-Language Driven Image Augmentation. 4683-4692 - Hao Liang, Josue Ortega Caro, Vikram Maheshri, Ankit B. Patel, Guha Balakrishnan:
Linking convolutional kernel size to generalization bias in face analysis CNNs. 4693-4703 - Shiwei Ding, Lan Zhang, Miao Pan, Xiaoyong Yuan:
PATROL: Privacy-Oriented Pruning for Collaborative Inference Against Model Inversion Attacks. 4704-4713 - Yuhang Lu, Zewei Xu, Touradj Ebrahimi:
Towards Visual Saliency Explanations of Face Verification. 4714-4723 - Marco Huber, Anh Thi Luu, Philipp Terhörst, Naser Damer:
Efficient Explainable Face Verification based on Similarity Score Argument Backpropagation. 4724-4733 - Jaisidh Singh, Harshil Bhatia, Mayank Vatsa, Richa Singh, Aparna Bharati:
SynthProv: Interpretable Framework for Profiling Identity Leakage. 4734-4744 - Guillaume Jeanneret, Loïc Simon, Frédéric Jurie:
Text-to-Image Models for Counterfactual Explanations: a Black-Box Approach. 4745-4755 - Zachariah Carmichael, Suhas Lohit, Anoop Cherian, Michael J. Jones, Walter J. Scheirer:
Pixel-Grounded Prototypical Part Networks. 4756-4767 - Ilke Demir, Umur Aybars Ciftci:
How Do Deepfakes Move? Motion Magnification for Deepfake Source Detection. 4768-4778 - Mingzhen Shao, Tolga Tasdizen, Sarang C. Joshi:
Analyzing the Domain Shift Immunity of Deep Homography Estimation. 4788-4796 - Fengyi Wu, Tianfang Zhang, Lei Li, Yian Huang, Zhenming Peng:
RPCANet: Deep Unfolding RPCA Based Infrared Small Target Detection. 4797-4806 - Tuan Hoang, Santu Rana, Sunil Gupta, Svetha Venkatesh:
Learn to Unlearn for Deep Neural Networks: Minimizing Unlearning Interference with Gradient Projection. 4807-4816 - Pedro H. V. Valois, Koichiro Niinuma, Kazuhiro Fukui:
Occlusion Sensitivity Analysis with Augmentation Subspace Perturbation in Deep Feature Space. 4817-4826 - Minxing Zhang, Ning Yu, Rui Wen, Michael Backes, Yang Zhang:
Generated Distributions Are All You Need for Membership Inference Attacks Against Generative Models. 4827-4837 - Johannes Gilg, Torben Teepe, Fabian Herzog, Philipp Wolters, Gerhard Rigoll:
Do We Still Need Non-Maximum Suppression? Accurate Confidence Estimates and Implicit Duplication Modeling with IoU-Aware Calibration. 4838-4847 - Jan Dubinski, Antoni Kowalczuk, Stanislaw Pawlak, Przemyslaw Rokita, Tomasz Trzcinski, Pawel Morawiecki:
Towards More Realistic Membership Inference Attacks on Large Diffusion Models. 4848-4857 - Giacomo Capitani, Federico Bolelli, Angelo Porrello, Simone Calderara, Elisa Ficarra:
ClusterFix: A Cluster-Based Debiasing Approach without Protected-Group Supervision. 4858-4867 - Jinyung Hong, Keun Hee Park, Theodore P. Pavlic:
Concept-Centric Transformers: Enhancing Model Interpretability through Object-Centric Concept Learning within a Shared Global Workspace. 4868-4879 - Gilad Cohen, Raja Giryes:
Membership Inference Attack Using Self Influence Functions. 4880-4889 - Bhat Dittakavi, Bharathi Callepalli, Aleti Vardhan, Sai Vikas Desai, Vineeth N. Balasubramanian:
CARE: Counterfactual-based Algorithmic Recourse for Explainable Pose Correction. 4890-4899 - Jiahang Cao, Ziqing Wang, Hanzhong Guo, Hao Cheng, Qiang Zhang, Renjing Xu:
Spiking Denoising Diffusion Probabilistic Models. 4900-4909 - Ruyu Wang, Sabrina Schmedding, Marco F. Huber:
Improving the Effectiveness of Deep Generative Data. 4910-4920 - Hai Wang, Xiaoyu Xiang, Yuchen Fan, Jing-Hao Xue:
Customizing 360-Degree Panoramas through Text-to-Image Diffusion Models. 4921-4931 - Kai Katsumata, Duc Minh Vo, Hideki Nakayama:
Label Augmentation as Inter-class Data Augmentation for Conditional Image Synthesis with Imbalanced Data. 4932-4941 - Haomiao Ni, Jiachen Liu, Yuan Xue, Sharon X. Huang:
3D-Aware Talking-Head Video Motion Transfer. 4942-4952 - Saksham Suri, Moustafa Meshry, Larry S. Davis, Abhinav Shrivastava:
GRIT: GAN Residuals for Paired Image-to-Image Translation. 4953-4963 - Hanyu Wang, Pengxiang Wu, Kevin Dela Rosa, Chen Wang, Abhinav Shrivastava:
Multimodality-guided Image Style Transfer using Cross-modal GAN Inversion. 4964-4973 - Kanghyeok Ko, Minhyeok Lee:
ZIGNeRF: Zero-shot 3D Scene Representation with Invertible Generative Neural Radiance Fields. 4974-4983 - Shashikant Verma, Aman Sharma, Roopa Sheshadri, Shanmuganathan Raman:
GraphFill: Deep Image Inpainting using Graphs. 4984-4994 - Noam Elata, Bahjat Kawar, Tomer Michaeli, Michael Elad:
Nested Diffusion Processes for Anytime Image Generation. 4995-5004 - Seoyoung Lee, Joonseok Lee:
PoseDiff: Pose-conditioned Multimodal Diffusion Model for Unbounded Scene Synthesis from Sparse Inputs. 5005-5015 - Jiwan Hur, Jaehyun Choi, Gyojin Han, Dong-Jae Lee, Junmo Kim:
Expanding Expressiveness of Diffusion Models with Limited Data via Self-Distillation based Fine-Tuning. 5016-5025 - Sandeep Manandhar, Auguste Genovesio:
One Style is All You Need to Generate a Video. 5026-5035 - Zhen Zhu, Yijun Li, Weijie Lyu, Krishna Kumar Singh, Zhixin Shu, Sören Pirk, Derek Hoiem:
Consistent Multimodal Generation via A Unified GAN Framework. 5036-5045 - Yeruru Asrar Ahmed, Anurag Mittal:
Unsupervised Co-generation of Foreground-Background Segmentation from Text-to-Image Synthesis. 5046-5057 - José Ribeiro-Gomes, Tianhui Cai, Zoltán Ádám Milacski, Chen Wu, Aayush Prakash, Shingo Takagi, Amaury Aubel, Daeil Kim, Alexandre Bernardino, Fernando De la Torre:
MotionGPT: Human Motion Synthesis with Improved Diversity and Realism via GPT-3 Prompting. 5058-5068 - Taehoon Kim, Chanhee Kang, JaeHyuk Park, Daun Jeong, ChangHee Yang, Suk-Ju Kang, Kyeongbo Kong:
Human Motion Aware Text-to-Video Generation with Explicit Camera Control. 5069-5078 - Song Wen, Hao Wang, Di Liu, Qilong Zhangli, Dimitris N. Metaxas:
Second-Order Graph ODEs for Multi-Agent Trajectory Forecasting. 5079-5088 - Michal Stypulkowski, Konstantinos Vougioukas, Sen He, Maciej Zieba, Stavros Petridis, Maja Pantic:
Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation. 5089-5098 - Rohit Gandikota, Hadas Orgad, Yonatan Belinkov, Joanna Materzynska, David Bau:
Unified Concept Editing in Diffusion Models. 5099-5108 - Zipeng Xu, Songlong Xing, Enver Sangineto, Nicu Sebe:
SpectralCLIP: Preventing Artifacts in Text-Guided Style Transfer from a Spectral Perspective. 5109-5118 - Robert Harb, Thomas Pock, Heimo Müller:
Diffusion-based generation of Histopathological Whole Slide Images at a Gigapixel scale. 5119-5128 - Xudong Wang, Li Niu, Junyan Cao, Yan Hong, Liqing Zhang:
Painterly Image Harmonization via Adversarial Residual Learning. 5129-5138 - Jaeseok Jeong, Mingi Kwon, Youngjung Uh:
Training-free Content Injection using h-space in Diffusion Models. 5139-5149 - Yuen-Fui Lau, Tianjia Zhang, Zhefan Rao, Qifeng Chen:
ENTED: Enhanced Neural Texture Extraction and Distribution for Reference-based Blind Face Restoration. 5150-5159 - Jungeun Lee, Sanghun Kim, Hansol Lee, Tserendorj Adiya, Hwasup Lim:
PIDiffu: Pixel-aligned Diffusion Model for High-Fidelity Clothed Human Reconstruction. 5160-5169 - Srikar Yellapragada, Alexandros Graikos, Prateek Prasanna, Tahsin M. Kurç, Joel H. Saltz, Dimitris Samaras:
PathLDM: Text conditioned Latent Diffusion Model for Histopathology. 5170-5179 - Yixuan Ren, Jing Shi, Zhifei Zhang, Yifei Fan, Zhe Lin, Bo He, Abhinav Shrivastava:
Content-Aware Image Color Editing with Auxiliary Color Restoration Tasks. 5180-5189 - Joshua Santoso, Christian Simon, Williem:
On Manipulating Scene Text in the Wild with Diffusion Models. 5190-5199 - Junjie Shentu, Noura Al Moubayed:
CXR-IRGen: An Integrated Vision and Language Model for the Generation of Clinically Accurate Chest X-Ray Image-Report Pairs. 5200-5209 - Adrian Suwala, Bartosz Wójcik, Magdalena Proszewska, Jacek Tabor, Przemyslaw Spurek, Marek Smieja:
Face Identity-Aware Disentanglement in StyleGAN. 5210-5219 - Zhongping Zhang, Jian Zheng, Jacob Zhiyuan Fang, Bryan A. Plummer:
Text-to-image Editing by Image Information Removal. 5220-5229 - Jeffrey Zhang, Shao-Yu Chang, Kedan Li, David A. Forsyth:
Preserving Image Properties Through Initializations in Diffusion Models. 5230-5238 - Hamza Rawal, Muhammad Junaid Ahmad, Farooq Zaman:
GC-VTON: Predicting Globally Consistent and Occlusion Aware Local Flows with Neighborhood Integrity Preservation for Virtual Try-on. 5239-5248 - Jingguo Liu, Heyu Chen, Shigang Li, Jianfeng Li:
Generation of Upright Panoramic Image from Non-upright Panoramic Image. 5249-5258 - Charles Laroche, Andrés Almansa, Eva Coupeté:
Fast Diffusion EM: a diffusion model for blind inverse problems with application to deconvolution. 5259-5269 - Hanao Li, Tian Han:
Enforcing Sparsity on Latent Space for Robust and Explainable Representations. 5270-5279 - Soumik Mukhopadhyay, Saksham Suri, Ravi Teja Gadde, Abhinav Shrivastava:
Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization. 5280-5290 - Junuk Cha, Hansol Lee, Jaewon Kim, Nhat Nguyen Bao Truong, Jae Shin Yoon, Seungryul Baek:
3D Reconstruction of Interacting Multi-Person in Clothing from a Single Image. 5291-5300 - Kai Katsumata, Duc Minh Vo, Bei Liu, Hideki Nakayama:
Revisiting Latent Space of GAN Inversion for Robust Real Image Editing. 5301-5310 - Kai Katsumata, Duc Minh Vo, Tatsuya Harada, Hideki Nakayama:
Soft Curriculum for Learning Conditional GANs with Noisy-Labeled and Uncurated Unlabeled Data. 5311-5320 - Baptiste Chopin, Hao Tang, Mohamed Daoudi:
Bipartite Graph Diffusion Model for Human Interaction Generation. 5321-5330 - Minghao Chen, Iro Laina, Andrea Vedaldi:
Training-Free Layout Control with Cross-Attention Guidance. 5331-5341 - Gabriele Valvano, Antonino Agostino, Giovanni De Magistris, Antonino Graziano, Giacomo Veneri:
Controllable Image Synthesis of Industrial Data using Stable Diffusion. 5342-5351 - Yiwen Huang, Zhiqiu Yu, Xinjie Yi, Yue Wang, James Tompkin:
Removing the Quality Tax in Controllable Face Generation. 5353-5361 - Zeyu Lu, Chengyue Wu, Xinyuan Chen, Yaohui Wang, Lei Bai, Yu Qiao, Xihui Liu:
Hierarchical Diffusion Autoencoders and Disentangled Image Manipulation. 5362-5371 - Yiangos Georgiou, Marios Loizou, Tom Kelly, Melinos Averkiou:
FacadeNet: Conditional Facade Synthesis via Selective Editing. 5372-5381 - Shen Zheng, Changjie Lu, Srinivasa G. Narasimhan:
TPSeNCE: Towards Artifact-Free Realistic Rain Generation for Deraining and Object Detection in Rain. 5382-5391 - Shanchuan Lin, Bingchen Liu, Jiashi Li, Xiao Yang:
Common Diffusion Noise Schedules and Sample Steps are Flawed. 5392-5399 - Zhaoyu Zhang, Yang Hua, Guanxiong Sun, Hui Wang, Seán F. McLoone:
Improving the Leaking of Augmentations in Data-Efficient GANs via Adaptive Negative Data Augmentation. 5400-5409 - Min Jin Chong, Krishna Kumar Singh, Yijun Li, Jingwan Lu, David A. Forsyth:
P2D: Plug and Play Discriminator for accelerating GAN frameworks. 5410-5419 - Jianjin Xu, Saman Motamed, Praneetha Vaddamanu, Chen Henry Wu, Christian Häne, Jean-Charles Bazin, Fernando De la Torre:
Personalized Face Inpainting with Diffusion Models by Parallel Visual Attention. 5420-5430 - Perla Doubinsky, Nicolas Audebert, Michel Crucianu, Hervé Le Borgne:
Semantic Generative Augmentations for Few-Shot Counting. 5431-5440 - Kunpeng Song, Ligong Han, Bingchen Liu, Dimitris N. Metaxas, Ahmed Elgammal:
StyleGAN-Fusion: Diffusion Guided Domain Adaptation of Image Generators. 5441-5451 - Dezhao Luo, Jiabo Huang, Shaogang Gong, Hailin Jin, Yang Liu:
Zero-Shot Video Moment Retrieval from Frozen Vision-Language Models. 5452-5461 - Kyle Buettner, Adriana Kovashka:
Investigating the Role of Attribute Context in Vision-Language Models for Object Recognition and Detection. 5462-5472 - Xiangxi Shi, Stefan Lee:
Benchmarking Out-of-Distribution Detection in Visual Question Answering. 5473-5483 - Yuhang He, Sangyun Shin, Anoop Cherian, Niki Trigoni, Andrew Markham:
Sound3DVDet: 3D Sound Source Detection using Multiview Microphone Array and RGB Images. 5484-5495 - Yuxin Ye, Wenming Yang, Yapeng Tian:
LAVSS: Location-Guided Audio-Visual Spatial Audio Separation. 5496-5507 - Jingru Yi, Burak Uzkent, Oana Ignat, Zili Li, Amanmeet Garg, Xiang Yu, Linda Liu:
Augment the Pairs: Semantics-Preserving Image-Caption Pair Augmentation for Grounding-Based Vision and Language Models. 5508-5518 - Elad Hirsch, Ayellet Tal:
CLID: Controlled-Length Image Descriptions with Limited Data. 5519-5529 - Shirsha Bose, Ankit Jha, Enrico Fini, Mainak Singha, Elisa Ricci, Biplab Banerjee:
StyLIP: Multi-Scale Style-Conditioned Prompt Learning for CLIP-based Domain Generalization. 5530-5540 - Lin Zhao, Hongxuan Li, Xuefei Ning, Xinru Jiang:
THInImg: Cross-modal Steganography for Presenting Talking Heads in Images. 5541-5550 - Ugur Sahin, Hang Li, Qadeer Khan, Daniel Cremers, Volker Tresp:
Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining. 5551-5561 - Xiao Hu, Basavaraj Hampiholi, Heiko Neumann, Jochen Lang:
Temporal Context Enhanced Referring Video Object Segmentation. 5562-5571 - Muntasir Wahed, Xiaona Zhou, Tianjiao Yu, Ismini Lourentzou:
Fine-Grained Alignment for Cross-Modal Recipe Retrieval. 5572-5581 - Xueting Hu, Ce Zhang, Yi Zhang, Bowen Hai, Ke Yu, Zhihai He:
Learning to Adapt CLIP for Few-Shot Monocular Depth Estimation. 5582-5591 - Jinxiang Liu, Yu Wang, Chen Ju, Chaofan Ma, Ya Zhang, Weidi Xie:
Annotation-free Audio-Visual Segmentation. 5592-5602 - Yating Xu, Conghui Hu, Gim Hee Lee:
Rethink Cross-Modal Fusion in Weakly-Supervised Audio-Visual Video Parsing. 5603-5612 - Ziwen Li, Bo Xu, Jiake Xie, Yong Tang, Cheng Lu:
SDNet: An Extremely Efficient Portrait Matting Model via Self-Distillation. 5613-5622 - Yaoxin Zhuo, Baoxin Li:
FELGA: Unsupervised Fragment Embedding for Fine-Grained Cross-Modal Association. 5623-5633 - Eunyi Lyou, Doyeon Lee, Jooeun Kim, Joonseok Lee:
Modality-Aware Representation Learning for Zero-shot Sketch-based Image Retrieval. 5634-5643 - Sheng Shen, Shijia Yang, Tianjun Zhang, Bohan Zhai, Joseph E. Gonzalez, Kurt Keutzer, Trevor Darrell:
Multitask Vision-Language Prompt Tuning. 5644-5655 - Yewon Hwang, Jong-Hwan Kim:
EASUM: Enhancing Affective State Understanding through Joint Sentiment and Emotion Modeling for Multimodal Tasks. 5656-5666 - Antonio Tejero-de-Pablos:
Complementary-Contradictory Feature Regularization against Multimodal Overfitting. 5667-5676 - Noam Rotstein, David Bensaïd, Shaked Brody, Roy Ganz, Ron Kimmel:
FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions. 5677-5688 - Jie Ruan, Yue Wu, Xiaojun Wan, Yuesheng Zhu:
Describe Images in a Boring Way: Towards Cross-Modal Sarcasm Generation. 5689-5698 - Sooyoung Park, Arda Senocak, Joon Son Chung:
Can CLIP Help Sound Source Localization? 5699-5708 - Muhammad Waleed Gondal, Jochen Gast, Inigo Alonso Ruiz, Richard Droste, Tommaso Macrì, Suren Kumar, Luitpold Staudigl:
Domain Aligned CLIP for Few-shot Classification. 5709-5718 - Ziyan Yang, Kushal Kafle, Zhe Lin, Scott Cohen, Zhihong Ding, Vicente Ordonez:
SCoRD: Subject-Conditional Relation Detection with Text-Augmented Data. 5719-5729 - Suzanne Petryk, Spencer Whitehead, Joseph E. Gonzalez, Trevor Darrell, Anna Rohrbach, Marcus Rohrbach:
Simple Token-Level Confidence Improves Caption Correctness. 5730-5740 - Zheyuan Liu, Weixuan Sun, Yicong Hong, Damien Teney, Stephen Gould:
Bi-directional Training for Composed Image Retrieval via Text Prompt Learning. 5741-5750 - Sonia Raychaudhuri, Tommaso Campari, Unnat Jain, Manolis Savva, Angel X. Chang:
MOPA: Modular Object Navigation with PointGoal Agents. 5751-5761 - Guangyue Xu, Joyce Chai, Parisa Kordjamshidi:
GIPCOL: Graph-Injected Soft Prompting for Compositional Zero-Shot Learning. 5762-5771 - Md. Mahedi Hasan, Shoaib Meraj Sami, Nasser M. Nasrabadi:
Text-Guided Face Recognition using Multi-Granularity Cross-Modal Contrastive Learning. 5772-5781 - Arka Sadhu, Ram Nevatia:
Leveraging Task-Specific Pre-Training to Reason across Images and Videos. 5782-5792 - Adnen Abdessaied, Lei Shi, Andreas Bulling:
VD-GR: Boosting Visual Dialog with Cascaded Spatial-Temporal Multi-Modal GRaphs. 5793-5802 - Yue Ruan, Han-Hung Lee, Yiming Zhang, Ke Zhang, Angel X. Chang:
TriCoLo: Trimodal Contrastive Loss for Text to Shape Retrieval. 5803-5813 - Thanh Luan Trinh, Fangge Chen, Takuya Nanri, Kei Akasaka:
3D Super-Resolution Model for Vehicle Flow Field Enrichment. 5814-5823 - Jesper Gaarsdal, Joakim Bruslund Haurum, Sune Wolff, Claus Brøndgaard Madsen:
AssemblyNet: A Point Cloud Dataset and Benchmark for Predicting Part Directions in an Exploded Layout. 5824-5833 - Anish Bhattacharya, Ratnesh Madaan, Fernando Cladera Ojeda, Sai Vemprala, Rogerio Bonatti, Kostas Daniilidis, Ashish Kapoor, Vijay Kumar, Nikolai Matni, Jayesh K. Gupta:
EvDNeRF: Reconstructing Event Data with Dynamic Neural Radiance Fields. 5834-5843 - Dhruv Makwana, Gayatri Deshmukh, Onkar Susladkar, Sparsh Mittal, R. Sai Chandra Teja:
LIVENet: A novel network for real-world low-light image denoising and enhancement. 5844-5853 - Kedan Li, Jeffrey Zhang, Shao-Yu Chang, David A. Forsyth:
Controlling Virtual Try-on Pipeline Through Rendering Policies. 5854-5863 - Giacomo D'Amicantonio, Egor Bondarev, Peter H. N. de With:
Automated Camera Calibration via Homography Estimation with GNNs. 5864-5871 - Pranav Jeevan, Akella Srinidhi, Pasunuri Prathiba, Amit Sethi:
WaveMixSR: Resource-efficient Neural Network for Image Super-resolution. 5872-5880 - Ayush Gupta, Rama Chellappa:
You Can Run but not Hide: Improving Gait Recognition with Intrinsic Occlusion Type Awareness. 5881-5890 - Pengzhan Sun, Kerui Gu, Yunsong Wang, Linlin Yang, Angela Yao:
Rethinking Visibility in Human Pose Estimation: Occluded Pose Reasoning via Transformers. 5891-5900 - Yunseong Cho, Chanwoo Kim, Hoseong Cho, Yunhoe Ku, Eunseo Kim, Muhammadjon Boboev, Joonseok Lee, Seungryul Baek:
RMFER: Semi-supervised Contrastive Learning for Facial Expression Recognition with Reaction Mashup Video. 5901-5910 - Alex Ling Yu Hung, Haoxin Zheng, Kai Zhao, Xiaoxi Du, Kaifeng Pang, Qi Miao, Steven S. Raman, Demetri Terzopoulos, Kyunghyun Sung:
CSAM: A 2.5D Cross-Slice Attention Module for Anisotropic Volumetric Medical Image Segmentation. 5911-5920 - Akshay Paruchuri, Xin Liu, Yulu Pan, Shwetak N. Patel, Daniel McDuff, Soumyadip Sengupta:
Motion Matters: Neural Motion Transfer for Better Camera Physiological Measurement. 5921-5930 - Scarlett Raine, Ross Marchant, Brano Kusy, Frédéric Maire, Tobias Fischer:
Image Labels Are All You Need for Coarse Seagrass Segmentation. 5931-5940 - Vojtech Cermák, Lukás Picek, Lukás Adam, Kostas Papafitsoros:
WildlifeDatasets: An open-source toolkit for animal re-identification. 5941-5951 - Shristi Das Biswas, Adarsh Kosta, Chamika M. Liyanagedera, Marco Paul E. Apolinario, Kaushik Roy:
HALSIE: Hybrid Approach to Learning Segmentation by Simultaneously Exploiting Image and Event Modalities. 5952-5962 - Jad Abou-Chakra, Feras Dayoub, Niko Sünderhauf:
ParticleNeRF: A Particle-Based Encoding for Online Neural Radiance Fields. 5963-5972 - Yoichiro Hisadome, Tianyi Wu, Jiawei Qin, Yusuke Sugano:
Rotation-Constrained Cross-View Feature Fusion for Multi-View Appearance-based Gaze Estimation. 5973-5982 - Depanshu Sani, Sandeep Mahato, Sourabh Saini, Harsh Kumar Agarwal, Charu Chandra Devshali, Saket Anand, Gaurav Arora, Thiagarajan Jayaraman:
SICKLE: A Multi-Sensor Satellite Imagery Dataset Annotated with Multiple Key Cropping Parameters. 5983-5992 - Nathaniel Chodosh, Deva Ramanan, Simon Lucey:
Re-Evaluating LiDAR Scene Flow. 5993-6003 - K. V. Jobin, Anand Mishra, C. V. Jawahar:
Semantic Labels-Aware Transformer Model for Searching over a Large Collection of Lecture-Slides. 6004-6013 - Mirali Purohit, Jacob B. Adler, Hannah Kerner:
ConeQuest: A Benchmark for Cone Segmentation on Mars. 6014-6023 - Chien-Yu Lin, Qichen Fu, Thomas Merth, Karren D. Yang, Anurag Ranjan:
FastSR-NeRF: Improving NeRF Efficiency on Consumer Devices with A Simple Super-Resolution Pipeline. 6024-6033 - Kshitij Nikhal, Yujunrong Ma, Shuvra S. Bhattacharyya, Benjamin S. Riggan:
HashReID: Dynamic Network with Binary Codes for Efficient Person Re-identification. 6034-6043 - Jake Deane, Sinead Kearney, Kwang In Kim, Darren Cosker:
RGBT-Dog: A Parametric Model and Pose Prior For Canine Body Analysis Data Creation. 6044-6054 - Alon Shoshan, Nadav Bhonker, Emanuel Ben Baruch, Ori Nizan, Igor Kviatkovsky, Joshua J. Engelsma, Manoj Aggarwal, Gérard G. Medioni:
FPGAN-Control: A Controllable Fingerprint Generator for Training with Synthetic Data. 6055-6064 - Xiang Zhang, Huiyuan Yang, Taoyue Wang, Xiaotian Li, Lijun Yin:
Multimodal Channel-Mixing: Channel and Spatial Masked AutoEncoder on Facial Action Unit Detection. 6065-6074 - Xing Di, Yiyu Zheng, Xiaoming Liu, Yu Cheng:
ProS: Facial Omni-Representation Learning via Prototype-based Self-Distillation. 6075-6086 - Yufeng Yin, Di Chang, Guoxian Song, Shen Sang, Tiancheng Zhi, Jing Liu, Linjie Luo, Mohammad Soleymani:
FG-Net: Facial Action Unit Detection with Generalizable Pyramidal Features. 6087-6096 - Gavriel Habib, Noa Barzilay, Or Shimshi, Rami Ben-Ari, Nir Darshan:
Watch Where You Head: A View-biased Domain Gap in Gait Recognition and Unsupervised Adaptation. 6097-6107 - Pratik Kalshetti, Parag Chaudhuri:
Intrinsic Hand Avatar: Illumination-aware Hand Appearance and Shape Reconstruction from Monocular RGB Video. 6108-6118 - Haoyu Ma, Tong Zhang, Shanlin Sun, Xiangyi Yan, Kun Han, Xiaohui Xie:
CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer. 6119-6129 - Zhongyu Jiang, Zhuoran Zhou, Lei Li, Wenhao Chai, Cheng-Yen Yang, Jenq-Neng Hwang:
Back to Optimization: Diffusion-based Zero-Shot 3D Human Pose Estimation. 6130-6140 - Arindam Dutta, Rohit Lal, Dripta S. Raychaudhuri, Calvin-Khang Ta, Amit K. Roy-Chowdhury:
POISE: Pose Guided Human Silhouette Extraction under Occlusions. 6141-6151 - Yufei Zhang, Jeffrey O. Kephart, Qiang Ji:
Incorporating Physics Principles for Precise Human Motion Prediction. 6152-6162 - Raghavendra Ramachandra, Sushma Venkatesh:
Fingervein Verification using Convolutional Multi-Head Attention Network. 6163-6172 - Raghavendra Ramachandra, Sushma Venkatesh, Naser Damer, Narayan Vetrekar, Rajendra S. Gad:
Multispectral Imaging for Differential Face Morphing Attack Detection: A Preliminary Study. 6173-6181 - Weiyuan Li, Bin Dai, Ziyi Zhou, Qi Yao, Baoyuan Wang:
Controlling Character Motions without Observable Driving Source. 6182-6191 - Chenxu Zhang, Chao Wang, Yifan Zhao, Shuo Cheng, Linjie Luo, Xiaohu Guo:
DR2: Disentangled Recurrent Representation Learning for Data-efficient Speech Video Synthesis. 6192-6202 - Marco Huber, Anh Thi Luu, Fadi Boutros, Arjan Kuijper, Naser Damer:
Bias and Diversity in Synthetic-based Face Recognition. 6203-6214 - Feng Liu, Ryan Ashbaugh, Nicholas Chimitt, Najmul Hassan, Ali Hassani, Ajay Jaiswal, Minchul Kim, Zhiyuan Mao, Christopher Perry, Zhiyuan Ren, Yiyang Su, Pegah Varghaei, Kai Wang, Stanley H. Chan, Arun Ross, Humphrey Shi, Zhangyang Wang, Anil Jain, Xiaoming Liu:
FarSight: A Physics-Driven Whole-Body Biometric System at Large Distance and Altitude. 6215-6224 - Chenyi Kuang, Jeffrey O. Kephart, Qiang Ji:
AU-Aware Dynamic 3D Face Reconstruction from Videos with Transformer. 6225-6235 - Pengfei Zhang, Deying Kong:
Handformer2T: A Lightweight Regression-based Model for Interacting Hands Pose Estimation from A Single RGB Image. 6236-6245 - Dragos-Constantin Tântaru, Elisabeta Oneata, Dan Oneata:
Weakly-supervised deepfake localization in diffusion-generated images. 6246-6256 - Meiling Fang, Naser Damer:
Face Presentation Attack Detection by Excavating Causal Clues and Adapting Embedding Statistics. 6257-6267 - Zhuoran Yu, Manchen Wang, Yanbei Chen, Paolo Favaro, Davide Modolo:
Denoising and Selecting Pseudo-Heatmaps for Semi-Supervised Human Pose Estimation. 6268-6277 - Haidong Zhu, Wanrong Zheng, Zhaoheng Zheng, Ram Nevatia:
ShARc: Shape and Appearance Recognition for Person Identification In-the-wild. 6278-6288 - Hai Phan, Cindy X. Le, Vu Le, Yihui He, Anh Totti Nguyen:
Fast and Interpretable Face Identification for Out-Of-Distribution Data Using Vision Transformers. 6289-6299 - Alexios Giazitzis, Elias N. Zois:
SigmML: Metric meta-learning for Writer Independent Offline Signature Verification in the Space of SPD Matrices. 6300-6310 - Huang-Ru Liao, Jen-Chun Lin, Chun-Yi Lee:
Progressive Hypothesis Transformer for 3D Human Mesh Recovery. 6311-6320 - Yuta Okuyama, Yuki Endo, Yoshihiro Kanamori:
DiffBody: Diffusion-based Pose and Shape Editing of Human Images. 6321-6330 - Maitreya Suin, Nithin Gopalakrishnan Nair, Chun Pong Lau, Vishal M. Patel, Rama Chellappa:
Diffuse and Restore: A Region-Adaptive Diffusion Model for Identity-Preserving Blind Face Restoration. 6331-6340 - Enes Duran, Muhammed Kocabas, Vasileios Choutas, Zicong Fan, Michael J. Black:
HMP: Hand Motion Priors for Pose and Shape Estimation from Video. 6341-6351 - Maximilian Weiherer, Finn Klein, Bernhard Egger:
Approximating Intersections and Differences Between Linear Statistical Shape Models Using Markov Chain Monte Carlo. 6352-6361 - Jeongmin Hong, Joseph Shin, Juhee Choi, Minsam Ko:
Robust Eye Blink Detection Using Dual Embedding Video Vision Transformer. 6362-6372 - Bita Azari, Angelica Lim:
EmoStyle: One-Shot Facial Expression Editing Using Continuous Emotion Parameters. 6373-6382 - Rishabh Shukla, Aditya Sinha, Vansh Singh, Harkeerat Kaur:
Vikriti-ID: A Novel Approach For Real Looking Fingerprint Data-set Generation. 6383-6391 - Kim Sung-Bin, Lee Hyun, Da Hye Hong, Suekyeong Nam, Janghoon Ju, Tae-Hyun Oh:
LaughTalk: Expressive 3D Talking Head Generation with Laughter. 6392-6401 - Zongyi Liu, Yarong Feng, Shunyan Luo, Yuan Ling, Shujing Dong, Shuyi Wang:
Detecting Content Segments from Online Sports Streaming Events: Challenges and Solutions. 6402-6411 - Quoc-Huy Tran, Ahmed Mehmood, Muhammad Ahmed, Muhammad Naufil, Anas Zafar, Andrey Konin, M. Zeeshan Zia:
Permutation-Aware Activity Segmentation via Unsupervised Frame-to-Segment Alignment. 6412-6422 - Yuerong Li, Zhengrong Xue, Huazhe Xu:
OTAS: Unsupervised Boundary Detection for Object-Centric Temporal Action Segmentation. 6423-6432 - Sha Hu, Yu Gong, Greg Mori:
Embodied Human Activity Recognition. 6433-6443 - Yutao Tang, Benjamín Béjar, René Vidal:
Semantic-aware Video Representation for Few-shot Action Recognition. 6444-6454 - Jie Zhao, Johan Edstedt, Michael Felsberg, Dong Wang, Huchuan Lu:
Leveraging the Power of Data Augmentation for Transformer-based Tracking. 6455-6464 - Felix Limanta, Kuniaki Uto, Koichi Shinoda:
CAMOT: Camera Angle-aware Multi-Object Tracking. 6465-6474 - Erik Scheurer, Jenny Schmalfuss, Alexander Lis, Andrés Bruhn:
Detection Defenses: An Empty Promise against Adversarial Patch Attacks on Optical Flow. 6475-6484 - Xinjie Li, Huijuan Xu:
Repetitive Action Counting with Motion Feature Learning. 6485-6494 - Siddhant Bansal, Chetan Arora, C. V. Jawahar:
United We Stand, Divided We Fall: UnityGraph for Unsupervised Procedure Learning from Videos. 6495-6505 - Jun-Bo Zhang, Mengbiao Zhao, Fei Yin, Cheng-Lin Liu:
Sequential Transformer for End-to-End Video Text Detection. 6506-6516 - Eadom Dessalene, Michael Maynord, Cornelia Fermüller, Yiannis Aloimonos:
Context in Human Action through Motion Complementarity. 6517-6526 - Tsukasa Shiota, Motohiro Takagi, Kaori Kumagai, Hitoshi Seshimo, Yushi Aono:
Egocentric Action Recognition by Capturing Hand-Object Contact and Object State. 6527-6537 - Ryosuke Kawamura, Hideaki Hayashi, Noriko Takemura, Hajime Nagahara:
MIDAS: Mixing Ambiguous Data with Soft Labels for Dynamic Facial Expression Recognition. 6538-6548 - Takuya Ogawa, Takashi Shibata, Toshinori Hosoi:
FRoG-MOT: Fast and Robust Generic Multiple-Object Tracking by IoU and Motion-State Associations. 6549-6558 - Chang-Lin Wan, Feng-Kai Huang, Hong-Han Shuai:
Density-Based Flow Mask Integration via Deformable Convolution for Video People Flux Estimation. 6559-6568 - Hyeonchul Jung, Seokjun Kang, Takgen Kim, HyeongKi Kim:
ConfTrack: Kalman Filter-based Multi-Person Tracking by Utilizing Confidence Score of Detection Box. 6569-6578 - Alberto Pepe, Joan Lasenby, Sven Buchholz:
CGAPoseNet+GCAN: A Geometric Clifford Algebra Network for Geometry-aware Camera Pose Regression. 6579-6589 - Michael Peven, Gregory D. Hager:
Embedding Task Structure for Action Detection. 6590-6599 - Roy Hirsch, Regev Cohen, Tomer Golany, Daniel Freedman, Ehud Rivlin:
Random Walks for Temporal Action Segmentation with Timestamp Supervision. 6600-6610 - Ruiqi Xian, Xijun Wang, Dinesh Manocha:
MITFAS: Mutual Information based Temporal Feature Alignment and Sampling for Aerial Video Action Recognition. 6611-6620 - Yasser Abdelaziz Dahou Djilali, Sanath Narayan, Eustache Le Bihan, Haithem Boussaid, Ebtesam Almazrouei, Mérouane Debbah:
Do VSR Models Generalize Beyond LRS3? 6621-6630 - Haosong Zhang, Mei Chee Leong, Liyuan Li, Weisi Lin:
PGVT: Pose-Guided Video Transformer for Fine-Grained Action Recognition. 6631-6642 - Zelun Luo, Yuliang Zou, Yijin Yang, Zane Durante, De-An Huang, Zhiding Yu, Chaowei Xiao, Li Fei-Fei, Animashree Anandkumar:
Differentially Private Video Activity Recognition. 6643-6653 - Jiachen Li, Roberto Henschel, Vidit Goel, Marianna Ohanyan, Shant Navasardyan, Humphrey Shi:
Video Instance Matting. 6654-6663 - Jiachen Li, Vidit Goel, Marianna Ohanyan, Shant Navasardyan, Yunchao Wei, Humphrey Shi:
VMFormer: End-to-End Video Matting with Transformer. 6664-6673 - Mohammed Khaleed Almansoori, Mustansar Fiaz, Hisham Cholakkal:
DDAM-PS: Diligent Domain Adaptive Mixer for Person Search. 6674-6683 - Adam Ishay, Zhun Yang, Joohyung Lee, Ilgu Kang, Dongjae Lim:
Think before You Simulate: Symbolic Reasoning to Orchestrate Neural Computation for Counterfactual Question Answering. 6684-6693 - Goutam Yelluru Gopal, Maria A. Amer:
Separable Self and Mixed Attention Transformers for Efficient Object Tracking. 6694-6703 - Shan Lin, Edgar Simo-Serra:
Restoring Degraded Old Films with Recursive Recurrent Transformer Networks. 6704-6714 - Alexandros Stergiou, Brent De Weerdt, Nikos Deligiannis:
Holistic Representation Learning for Multitask Trajectory Anomaly Detection. 6715-6725 - Debaditya Roy, Ramanathan Rajendiran, Basura Fernando:
Interaction Region Visual Transformer for Egocentric Action Anticipation. 6726-6736 - Ce Zhang, Changcheng Fu, Shijie Wang, Nakul Agarwal, Kwonjoon Lee, Chiho Choi, Chen Sun:
Object-centric Video Representation for Long-term Action Anticipation. 6737-6747 - Salman Khan, Izzeddin Teeti, Andrew Bradley, Mohamed Elhoseiny, Fabio Cuzzolin:
A Hybrid Graph Network for Complex Activity Detection in Video. 6748-6758 - Tanvir Mahmud, Chun-Hao Liu, Burhaneddin Yaman, Diana Marculescu:
SSVOD: Semi-Supervised Video Object Detection with Sparse Annotations. 6759-6768 - Cheng Huang, Yi-Lun Wu, Hong-Han Shuai, Ching-Chun Huang:
Semantic Fusion Augmentation and Semantic Boundary Detection: A Novel Approach to Multi-Target Video Moment Retrieval. 6769-6778 - Anas Al-Lahham, Nurbek Tastan, Muhammad Zaigham Zaheer, Karthik Nandakumar:
A Coarse-to-Fine Pseudo-Labeling (C2FPL) Framework for Unsupervised Video Anomaly Detection. 6779-6788 - Roei Herzig, Ofir Abramovich, Elad Ben-Avraham, Assaf Arbelle, Leonid Karlinsky, Ariel Shamir, Trevor Darrell, Amir Globerson:
PromptonomyViT: Multi-Task Prompt Learning Improves Video Transformers using Synthetic Scene Data. 6789-6801 - Hyogun Lee, Kyungho Bae, Seong Jong Ha, Yumin Ko, Gyeong-Moon Park, Jinwoo Choi:
GLAD: Global-Local View Alignment and Background Debiasing for Unsupervised Video Domain Adaptation with Large Domain Gap. 6802-6811 - Christoph Mayer, Martin Danelljan, Ming-Hsuan Yang, Vittorio Ferrari, Luc Van Gool, Alina Kuznetsova:
Beyond SOT: Tracking Multiple Generic Objects at Once. 6812-6822 - Michal Neoral, Jonás Serých, Jirí Matas:
MFT: Long-Term Tracking of Every Pixel. 6823-6833 - Hamza Karim, Keval Doshi, Yasin Yilmaz:
Real-Time Weakly Supervised Video Anomaly Detection. 6834-6842 - Radim Spetlík, Denys Rozumnyi, Jirí Matas:
Single-Image Deblurring, Trajectory and Shape Recovery of Fast Moving Objects with Denoising Diffusion Probabilistic Models. 6843-6852 - Pierre-François De Plaen, Nicola Marinello, Marc Proesmans, Tinne Tuytelaars, Luc Van Gool:
Contrastive Learning for Multi-Object Tracking with Transformers. 6853-6863 - Srijan Das, Tanmay Jain, Dominick Reilly, Pranav Balaji, Soumyajit Karmakar, Shyam Marjit, Xiang Li, Abhijit Das, Michael S. Ryoo:
Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders. 6864-6874 - Mohammed Guermal, Abid Ali, Rui Dai, François Brémond:
JOADAA: joint online action detection and action anticipation. 6875-6884 - Azin Jahedi, Maximilian Luz, Marc Rivinius, Andrés Bruhn:
CCMR: High Resolution Optical Flow Estimation via Coarse-to-Fine Context-Guided Motion Reasoning. 6885-6894 - Guy Bar-Shalom, George Leifman, Michael Elad:
Weakly-Supervised Representation Learning for Video Alignment and Analysis. 6895-6904 - Soroush Mehraban, Vida Adeli, Babak Taati:
MotionAGFormer: Enhancing 3D Human Pose Estimation with a Transformer-GCNFormer Network. 6905-6915 - Abdulrahman Kerim, Washington L. S. Ramos, Leandro Soriano Marcolino, Erickson R. Nascimento, Richard Jiang:
Leveraging Synthetic Data to Learn Video Stabilization Under Adverse Conditions. 6916-6925 - Sourabh Vasant Gothe, Vibhav Agarwal, Sourav Ghosh, Jayesh Rajkumar Vachhani, Pranay Kashyap, Barath Raj Kandur Raja:
What's in the Flow? Exploiting Temporal Motion Cues for Unsupervised Generic Event Boundary Detection. 6926-6935 - Thanos Delatolas, Vicky Kalogeiton, Dim P. Papadopoulos:
Learning the What and How of Annotation in Video Object Segmentation. 6936-6946 - Pirazh Khorramshahi, Zhe Wu, Tianchen Wang, Luke Deluccia, Hongcheng Wang:
Lightweight Delivery Detection on Doorbell Cameras. 6947-6956 - Takumi Kobayashi, Jiaxing Ye:
Spatio-temporal Filter Analysis Improves 3D-CNN For Action Classification. 6957-6966 - Ruiqi Xian, Xijun Wang, Divya Kothandaraman, Dinesh Manocha:
PMI Sampler: Patch Similarity Guided Frame Selection For Aerial Action Recognition. 6967-6976 - Giuliano Albanese, Arka Mitra, Jan-Nico Zaech, Yupeng Zhao, Ajad Chhatkuli, Luc Van Gool:
Optimizing Long-Term Robot Tracking with Multi-Platform Sensor Fusion. 6977-6987 - Yuchi Ishikawa, Masayoshi Kondo, Hirokatsu Kataoka:
Learnable Cube-based Video Encryption for Privacy-Preserving Action Recognition. 6988-6998 - Myeongjun Kim, Federica Spinola, Philipp Benz, Tae-Hoon Kim:
A*: Atrous Spatial Temporal Action Recognition for Real Time Applications. 6999-7009 - Pravin Nagar, K. N. Ajay Shastry, Jayesh Chaudhari, Chetan Arora:
SEMA: Semantic Attention for Capturing Long-Range Dependencies in Egocentric Lifelogs. 7010-7020 - Xuesong Nie, Xi Chen, Haoyuan Jin, Zhihang Zhu, Yunfeng Yan, Donglian Qi:
Triplet Attention Transformer for Spatiotemporal Predictive Learning. 7021-7030 - Thinh Phan, Khoa Vo, Duy Le, Gianfranco Doretto, Donald A. Adjeroh, Ngan Le:
ZEETAD: Adapting Pretrained Vision-Language Model for Zero-Shot End-to-End Temporal Action Detection. 7031-7040 - Moniek Smink, Haotian Liu, Dörte Döpfer, Yong Jae Lee:
Computer Vision on the Edge: Individual Cattle Identification in Real-time with ReadMyCow System. 7041-7050 - Felipe A. Lopes, Vasit Sagan, Flavio Esposito:
PlantPlotGAN: A Physics-Informed Generative Adversarial Network for Plant Disease Prediction. 7051-7060 - Robert Johanson, Christian Wilms, Ole Johannsen, Simone Frintrop:
S3AD: Semi-supervised Small Apple Detection in Orchard Environments. 7061-7070 - Komuravelli Prashanth, Jaladi Sri Harsha, Sivapuram Arun Kumar, Jaladi Srilekha:
Towards Accurate Disease Segmentation in Plant Images: A Comprehensive Dataset Creation and Network Evaluation. 7071-7079 - Anicetus Odo, Niall McLaughlin, Ilias Kyriazakis:
Automated Monitoring of Ear Biting in Pigs by Tracking Individuals and Events. 7080-7088 - Junhan Wen, Camiel R. Verschoor, Chengming Feng, Irina-Mona Epure, Thomas Abeel, Mathijs de Weerdt:
The Growing Strawberries Dataset: Tracking Multiple Objects with Biological Development over an Extended Period. 7089-7099 - Tayfun Karaderi, Tilo Burghardt, Raphael Morard, Daniela N. Schmidt:
Deep Visual-Genetic Biometrics for Taxonomic Classification of Rare Species. 7100-7110 - Lars Haalck, Sebastian Thiele, Benjamin Risse:
Tracking Tiny Insects in Cluttered Natural Environments using Refinable Recurrent Neural Networks. 7111-7120 - Srikumar Sastry, Subash Khanal, Aayush Dhakal, Di Huang, Nathan Jacobs:
BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping. 7121-7130 - Lukás Adam, Vojtech Cermák, Kostas Papafitsoros, Lukás Picek:
SeaTurtleID2022: A long-span dataset for reliable sea turtle re-identification. 7131-7141 - Eike Gebauer, Sebastian Thiele, Pierre Ouvrard, Adrien Sicard, Benjamin Risse:
Towards a Dynamic Vision Sensor-based Insect Camera Trap. 7142-7151 - Matthew Dawkins, Jack Prior, Bryon Lewis, Robin Faillettaz, Thompson Banez, Mary Salvi, Audrey K. Rollo, Julien Simon, Matthew D. Campbell, Matthew Lucero, Aashish Chaudhary, Benjamin L. Richards, Anthony Hoogs:
FishTrack23: An Ensemble Underwater Dataset for Multi-Object Tracking. 7152-7161 - Xiulong Liu, Kun Su, Eli Shlizerman:
Let the Beat Follow You - Creating Interactive Drum Sounds From Body Rhythm. 7162-7172 - Josh Myers-Dean, Yifei Fan, Brian L. Price, Wilson Chan, Danna Gurari:
Interactive Segmentation for Diverse Gesture Types Without Context. 7173-7183 - Ganning Zhao, Wenhui Cui, Suya You, C.-C. Jay Kuo:
SemST: Semantically Consistent Multi-Scale Image Translation via Structure-Texture Alignment. 7184-7194 - Matthias Springstein, Stefanie Schneider, Javad Rahnama, Julian Stalter, Maximilian Kristen, Eric Müller-Budack, Ralph Ewerth:
Visual Narratives: Large-scale Hierarchical Classification of Art-historical Images. 7195-7205 - Vikram Jamwal, Ramaneswaran S.:
Composite Diffusion: whole >= Σparts. 7206-7215 - Samuel Grieggs, C. E. M. Henderson, Sebastian Sobecki, Alexandra Gillespie, Walter J. Scheirer:
The Paleographer's Eye ex machina: Using Computer Vision to Assist Humanists in Scribal Hand Identification. 7216-7225 - William Theisen, Walter J. Scheirer:
C-CLIP: Contrastive Image-Text Encoders to Close the Descriptive-Commentative Gap. 7226-7235 - Asmar Nadeem, Adrian Hilton, Robert Dawes, Graham Thomas, Armin Mustafa:
CAD - Contextual Multi-modal Alignment for Dynamic AVQA. 7236-7248 - Offry Hayon, Stefan Münger, Ilan Shimshoni, Ayellet Tal:
ArcAid: Analysis of Archaeological Artifacts using Drawings. 7249-7259 - Zhongping Zhang, Yiwen Gu, Bryan A. Plummer, Xin Miao, Jiayi Liu, Huayan Wang:
Movie Genre Classification by Language Augmentation and Shot Sampling. 7260-7270 - Golsa Tahmasebzadeh, Matthias Springstein, Ralph Ewerth, Eric Müller-Budack:
Few-Shot Event Classification in Images using Knowledge Graphs for Prompting. 7271-7280 - Wataru Shimoda, Daichi Haraguchi, Seiichi Uchida, Kota Yamaguchi:
Towards Diverse and Consistent Typography Generation. 7281-7290 - Chunyi Sun, Yanbin Liu, Junlin Han, Stephen Gould:
NeRFEditor: Differentiable Style Decomposition for 3D Scene Editing. 7291-7300 - Ananda Padhmanabhan Suresh, Sanjana Jain, Pavit Noinongyao, Ankush Ganguly, Ukrit Watchareeruetai, Aubin Samacoïts:
FastCLIPstyler: Optimisation-free Text-based Image Style Transfer Using Style Representations. 7301-7310 - Tibor Bleidt, Sedigheh Eslami, Gerard de Melo:
ArtQuest: Countering Hidden Language Biases in ArtVQA. 7311-7320 - Ozan Unal, Dengxin Dai, Lukas Hoyer, Yigit Baran Can, Luc Van Gool:
2D Feature Distillation for Weakly- and Semi-Supervised 3D Semantic Segmentation. 7321-7330 - Jessica Maria Echterhoff, An Yan, Kyungtae Han, Amr Abdelraouf, Rohit Gupta, Julian J. McAuley:
Driving through the Concept Gridlock: Unraveling Explainability Bottlenecks in Automated Driving. 7331-7340 - Tianyuan Yuan, Yicheng Liu, Yue Wang, Yilun Wang, Hang Zhao:
StreamMapNet: Streaming Mapping Network for Vectorized Online HD Map Construction. 7341-7350 - Sri Aditya Deevi, Connor Lee, Lu Gan, Sushruth Nagesh, Gaurav Pandey, Soon-Jo Chung:
RGB-X Object Detection via Scene-Specific Fusion Modules. 7351-7360 - Trung Pham, Mehran Maghoumi, Wanli Jiang, Bala Siva Sashank Jujjavarapu, Mehdi Sajjadi, Xin Liu, Hsuan-Chu Lin, Bor-Jeng Chen, Giang Truong, Chao Fang, Junghyun Kwon, Minwoo Park:
NVAutoNet: Fast and Accurate 360° 3D Visual Perception For Self Driving. 7361-7370 - Kai Fischer, Martin Simon, Stefan Milz, Patrick Mäder:
MagneticPillars: Efficient Point Cloud Registration through Hierarchized Birds-Eye-View Cell Correspondence Refinement. 7371-7380 - Youssef Shoeb, R. Chan, Gesina Schwalbe, Azarm Nowzad, Fatma Güney, Hanno Gottschalk:
Have We Ever Encountered This Before? Retrieving Out-of-Distribution Road Obstacles from Driving Scenes. 7381-7391 - Stamatis Alexandropoulos, Christos Sakaridis, Petros Maragos:
OVeNet: Offset Vector Network for Semantic Segmentation. 7392-7403 - Mincheol Chang, Seokha Moon, Reza Mahjourian, Jinkyu Kim:
BEVMap: Map-Aware BEV Modeling for 3D Perception. 7404-7413 - Koen Vellenga, H. Joe Steinhauer, Göran Falkman, Tomas Björklund:
Evaluation of Video Masked Autoencoders' Performance and Uncertainty Estimations for Driver Action and Intention Recognition. 7414-7422 - Georg Hess, Adam Tonderski, Christoffer Petersson, Kalle Åström, Lennart Svensson:
LidarCLIP or: How I Learned to Talk to Point Clouds. 7423-7432 - Mohamed Adel Musallam, Vincent Gaudillière, Djamila Aouada:
Self-Supervised Learning for Place Representation Generalization across Appearance Changes. 7433-7443 - Pranjay Shyam, Hyunjin Yoo:
PAIR: Perception Aided Image Restoration for Natural Driving Conditions. 7444-7455 - Pranjay Shyam, Hyunjin Yoo:
Lightweight Thermal Super-Resolution and Object Detection for Robust Perception in Adverse Weather Conditions. 7456-7467 - Florence Yellin, Scott McCloskey, Cole Hill, Eric Smith, Brian Clipp:
Concurrent Band Selection and Traversability Estimation from Long-Wave Hyperspectral Imagery in Off-Road Settings. 7468-7477 - Junyao Wang, Arnav Vaibhav Malawade, Junhong Zhou, Shih-Yuan Yu, Mohammad Abdullah Al Faruque:
RS2G: Data-Driven Scene-Graph Extraction and Embedding for Robust Autonomous Perception and Scenario Understanding. 7478-7487 - Jae-Keun Lee, Jin-Hee Lee, Joohyun Lee, Soon Kwon, Heechul Jung:
Re-VoxelDet: Rethinking Neck and Head Architectures for High-Performance Voxel-based 3D Detection. 7488-7497 - Enna Sachdeva, Nakul Agarwal, Suhas Chundi, Sean Roelofs, Jiachen Li, Mykel J. Kochenderfer, Chiho Choi, Behzad Dariush:
Rank2Tell: A Multimodal Driving Dataset for Joint Importance Ranking and Reasoning. 7498-7507 - Himanshu Gupta, Oleksandr Kotlyar, Henrik Andreasson, Achim J. Lilienthal:
Robust Object Detection in Challenging Weather Conditions. 7508-7517 - Nupur Thakur, PrasanthSai Gouripeddi, Baoxin Li:
Graph(Graph): A Nested Graph-Based Framework for Early Accident Anticipation. 7518-7526 - Weijia Zhang, Dongnan Liu, Chao Ma, Tom Weidong Cai:
Alleviating Foreground Sparsity for Semi-Supervised Monocular 3D Object Detection. 7527-7537 - Prajwal Singh, Dwip Dalal, Gautam Vashishtha, Krishna P. Miyapuram, Shanmuganathan Raman:
Learning Robust Deep Visual Representations from EEG Brain Recordings. 7538-7547 - Amin Ranem, Camila González, Daniel Pinto dos Santos, Andreas M. Bucher, Ahmed E. Othman, Anirban Mukhopadhyay:
Continual atlas-based segmentation of prostate MRI. 7548-7557 - Md Mahfuzur Rahman Siddiquee, Jay Shah, Teresa Wu, Catherine D. Chong, Todd J. Schwedt, Gina Dumkrieger, Simona Nikolova, Baoxin Li:
Brainomaly: Unsupervised Neurologic Disease Detection Utilizing Unannotated T1-weighted Brain MR Images. 7558-7567 - Yaopeng Peng, Hongxiao Wang, Milan Sonka, Danny Z. Chen:
PHG-Net: Persistent Homology Guided Medical Image Classification. 7568-7577 - Neel Dey, S. Mazdak Abulnaga, Benjamin Billot, Esra Abaci Turk, Patricia Ellen Grant, Adrian V. Dalca, Polina Golland:
AnyStar: Domain randomized universal star-convex 3D instance segmentation. 7578-7588 - Jonghun Kim, Hyunjin Park:
Adaptive Latent Diffusion Model for 3D Medical Image to Image Translation: Multi-modal Magnetic Resonance Imaging Study. 7589-7598 - Wonwoo Cho, Dongmin Choi, Hyesu Lim, Jinho Choi, Saemee Choi, Hyunseok Min, Sungbin Lim, Jaegul Choo:
Slice and Conquer: A Planar-to-3D Framework for Efficient Interactive Segmentation of Volumetric Images. 7599-7608 - Farchan Hakim Raswa, Chun-Shien Lu, Jia-Ching Wang:
Attention-Guided Prototype Mixing: Diversifying Minority Context on Imbalanced Whole Slide Images Classification Learning. 7609-7618 - Joana Palés Huix, Adithya Raju Ganeshan, Johan Fredin Haslum, Magnus Söderberg, Christos Matsoukas, Kevin Smith:
Are Natural Domain Foundation Models Useful for Medical Image Classification? 7619-7628 - Meng Ye, Mikael Kanski, Dong Yang, Leon Axel, Dimitris N. Metaxas:
Unsupervised Exemplar-Based Image-to-Image Translation and Cascaded Vision Transformers for Tagged and Untagged Cardiac Cine MRI Registration. 7629-7639 - Mohammad Zalbagi Darestani, Vishwesh Nath, Wenqi Li, Yufan He, Holger R. Roth, Ziyue Xu, Daguang Xu, Reinhard Heckel, Can Zhao:
IR-FRestormer: Iterative Refinement with Fourier-Based Restormer for Accelerated MRI Reconstruction. 7640-7649 - Tobias Weber, Michael Ingrisch, Bernd Bischl, David Rügamer:
Constrained Probabilistic Mask Learning for Task-specific Undersampled MRI Reconstruction. 7650-7659 - Takuro Fujii, Hayato Nakagawa, Teppei Takeshima, Yasushi Yumura, Tomoki Hamagami:
Automated Sperm Assessment Framework and Neural Network Specialized for Sperm Video Recognition. 7660-7669 - Chamuditha Jayanga Galappaththige, Gayal Kuruppu, Muhammad Haris Khan:
Generalizing to Unseen Domains in Diabetic Retinopathy Classification. 7670-7680 - Yingying Fang, Shuang Wu, Sheng Zhang, Chaoyan Huang, Tieyong Zeng, Xiaodan Xing, Simon Walsh, Guang Yang:
Dynamic Multimodal Information Bottleneck for Multimodality Classification. 7681-7691 - Kun Han, Shanlin Sun, Thanh-Tung Le, Xiangyi Yan, Haoyu Ma, Chenyu You, Xiaohui Xie:
Hybrid Neural Diffeomorphic Flow for Shape Representation and Generation via Triplane. 7692-7702 - Abbas Omidi, Aida Mohammadshahi, Neha Gianchandani, Regan King, Lara Leijser, Roberto Souza:
Unsupervised Domain Adaptation of MRI Skull-stripping Trained on Adult Data to Newborns. 7703-7712 - Md Mostafijur Rahman, Radu Marculescu:
G-CASCADE: Efficient Cascaded Graph Convolutional Decoding for 2D Medical Image Segmentation. 7713-7722 - Johan Fredin Haslum, Christos Matsoukas, Karl-Johan Leuchowius, Kevin Smith:
Bridging Generalization Gaps in High Content Imaging Through Online Self-Supervised Domain Adaptation. 7723-7732 - Mohamed ElHabebe, Shereen Elkordi, Ahmed Gamal-Eldin, Noha Adly, Marwan Torki, Ahmed Elmasry, Islam SH Ahmed:
DR10K: Transfer Learning Using Weak Labels for Grading Diabetic Retinopathy on DR10K Dataset. 7733-7743 - Yifei Chen, Binfeng Zou, Zhaoxin Guo, Yiyu Huang, Yifan Huang, Feiwei Qin, Qinhai Li, Changmiao Wang:
SCUNet++: Swin-UNet and CNN Bottleneck Hybrid Architecture with Multi-Fusion Dense Skip Connection for Pulmonary Embolism CT Image Segmentation. 7744-7752 - Vandan Gorade, Sparsh Mittal, Debesh Jha, Ulas Bagci:
SynergyNet: Bridging the Gap between Discrete and Continuous Representations for Precise Medical Image Segmentation. 7753-7762 - Sahar Almahfouz Nasser, Nihar Gupte, Amit Sethi:
Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data. 7763-7772 - Asha Rani, Yashaswi Verma:
Activity-based Early Autism Diagnosis Using A Multi-Dataset Supervised Contrastive Learning Approach. 7773-7782 - Yan Yang, Liyuan Pan, Liu Liu, Eric A. Stone:
Convolutional Masked Image Modeling for Dense Prediction Tasks on Pathology Images. 7783-7793 - Youngbeom Yoo, Jae Young Lee, Dong-Jae Lee, Jiwoon Jeon, Junmo Kim:
Real-Time Polyp Detection in Colonoscopy using Lightweight Transformer. 7794-7804 - Amani Almalki, Longin Jan Latecki:
Self-Supervised Learning with Masked Autoencoders for Teeth Segmentation from Intra-oral 3D Scans. 7805-7815 - Alec S. Xu, Nina I. Shamsi, Lars A. Gjesteby, Laura J. Brattain:
Self-Supervised Edge Detection Reconstruction for Topology-Informed 3D Axon Segmentation and Centerline Detection. 7816-7824 - Lingrui Li, Yanfeng Zhou, Ge Yang:
Robust Source-Free Domain Adaptation for Fundus Image Segmentation. 7825-7834 - Trong-Thang Pham, Jacob Brecheisen, Anh Nguyen, Hien Nguyen, Ngan Le:
I-AI: A Controllable & Interpretable AI System for Decoding Radiologists' Intense Focus for Accurate CXR Diagnoses. 7835-7844 - Wenxuan Wang, Jing Wang, Chen Chen, Jianbo Jiao, Yuanxiu Cai, Shanshan Song, Jiangyun Li:
FreMIM: Fourier Transform Meets Masked Image Modeling for Medical Image Segmentation. 7845-7855 - Haoran Shen, Yifu Zhang, Wenxuan Wang, Chen Chen, Jing Liu, Shanshan Song, Jiangyun Li:
Med-DANet V2: A Flexible Dynamic Architecture for Efficient Medical Volumetric Segmentation. 7856-7866 - Jay Shah, Md Mahfuzur Rahman Siddiquee, Yi Su, Teresa Wu, Baoxin Li:
Ordinal Classification with Distance Regularization for Robust Brain Age Prediction. 7867-7876 - Quanfu Fan, Yilai Li, Yuguang Yao, John Cohn, Sijia Liu, Ziping Xu, Seychelle M. Vos, Michael A. Cianfrocco:
CryoRL: Reinforcement Learning Enables Efficient Cryo-EM Data Collection. 7877-7887 - Linde S. Hesse, Nicola K. Dinsdale, Ana I. L. Namburete:
Prototype Learning for Explainable Brain Age Prediction. 7888-7898 - Girish Narayanswamy, Yujia Liu, Yuzhe Yang, Chengqian Ma, Xin Liu, Daniel McDuff, Shwetak N. Patel:
BigSmall: Efficient Multi-Task Learning for Disparate Spatial and Temporal Physiological Measurements. 7899-7909 - Tianang Leng, Yiming Zhang, Kun Han, Xiaohui Xie:
Self-Sampling Meta SAM: Enhancing Few-shot Medical Image Segmentation with Meta-Learning. 7910-7920 - Kyuri Kim, Yoonho Na, Sung-Joon Ye, Jimin Lee, Sungsoo Ahn, Ji Eun Park, Hwiyoung Kim:
Controllable Text-to-Image Synthesis for Multi-Modality MR Images. 7921-7930 - Dinkar Juyal, Siddhant Shingi, Syed Ashar Javed, Harshith Padigela, Chintan Shah, Anand Sampat, Archit Khosla, John Abel, Amaro Taylor-Weiner:
SC-MIL: Supervised Contrastive Multiple Instance Learning for Imbalanced Classification in Pathology. 7931-7940 - Lisa Weijler, Florian Kowarsch, Michael Reiter, Pedro Hermosilla, Margarita Maurer-Granofszky, Michael N. Dworzak:
FATE: Feature-Agnostic Transformer-based Encoder for learning generalized embedding spaces in flow cytometry data. 7941-7949 - Yongjin Choi, Doeyoung Kwon, Seung Jun Baek:
Dual Domain Diffusion Guidance for 3D CBCT Metal Artifact Reduction. 7950-7959 - Xiangyi Yan, Shanlin Sun, Kun Han, Thanh-Tung Le, Haoyu Ma, Chenyu You, Xiaohui Xie:
AFTer-SAM: Adapting SAM with Axial Fusion Transformer for Medical Imaging Segmentation. 7960-7969 - Nhat-Tan Bui, Dinh-Hieu Hoang, Quang-Thuc Nguyen, Minh-Triet Tran, Ngan Le:
MEGANet: Multi-Scale Edge-Guided Attention Network for Weak Boundary Polyp Segmentation. 7970-7979 - Tiancheng Gu, Dongnan Liu, Zhiyuan Li, Weidong Cai:
Complex Organ Mask Guided Radiology Report Generation. 7980-7989 - Nina I. Shamsi, Alec S. Xu, Lars A. Gjesteby, Laura J. Brattain:
Improved Topological Preservation in 3D Axon Segmentation and Centerline Detection using Geometric Assessment-driven Topological Smoothing (GATS). 7990-7999 - Vinay Kumar Verma, Dween Rabius Sanny, Abhishek Singh, Deepak Gupta:
CoD: Coherent Detection of Entities from Images with Multiple Modalities. 8000-8009 - Masato Fujitake:
DTrOCR: Decoder-only Transformer for Optical Character Recognition. 8010-8020 - Jiaxin Zhang, Joy Rimchala, Lalla Mouatadid, Kamalika Das, Kumar Sricharan:
DECDM: Document Enhancement using Cycle-Consistent Diffusion Models. 8021-8030 - Amila Silva, Olga Moskvyak, Alexander Long, Ravi Garg, Stephen Gould, Gil Avraham, Anton van den Hengel:
LipAT: Beyond Style Transfer for Controllable Neural Simulation of Lipstick using Cosmetic Attributes. 8031-8040 - Kaicheng Pang, Xingxing Zou, Waikeung Wong:
Learning Visual Body-shape-Aware Embeddings for Fashion Compatibility. 8041-8050 - Junkyu Jang, Eugene Hwang, Sung-Hyuk Park:
Lost Your Style? Navigating with Semantic-Level Approach for Text-to-Outfit Retrieval. 8051-8060 - Alexander Naumann, Felix Hertlein, Laura Dörr, Kai Furmans:
TAMPAR: Visual Tampering Detection for Parcel Logistics in Postal Supply Chains. 8061-8071 - Amruth Sagar, Rishabh Srivastava, Rakshitha R. T, Venkata Kesav Venna, Ravi Kiran Sarvadevabhatla:
MAdVerse: A Hierarchical Dataset of Multi-Lingual Ads from Diverse Sources and Categories. 8072-8081 - Oliver Boyne, Gwangbin Bae, James Charles, Roberto Cipolla:
FOUND: Foot Optimization with Uncertain Normals for Surface Deformation Using Synthetic Data. 8082-8091 - K. J. Joseph, Prateksha Udhayanan, Tripti Shukla, Aishwarya Agarwal, Srikrishna Karanam, Koustava Goswami, Balaji Vasan Srinivasan:
Iterative Multi-granular Image Editing using Diffusion Models. 8092-8101 - Wenyi Wu, Qi Li, Wenliang Zhong, Junzhou Huang:
MIVC: Multiple Instance Visual Component for Visual-Language Models. 8102-8111 - Axel De Nardin, Silvia Zottin, Claudio Piciarelli, Emanuela Colombi, Gian Luca Foresti:
A One-Shot Learning Approach to Document Layout Segmentation of Ancient Arabic Manuscripts. 8112-8121 - Gerald Ebmer, Adam Loch, Minh Nhat Vu, Roberto Mecca, Germain Haessig, Christian Hartl-Nesic, Markus Vincze, Andreas Kugi:
Real-time 6-DoF Pose Estimation by an Event-based Camera using Active LED Markers. 8122-8131 - Yang Lin, Edoardo Charbon:
Spiking Neural Networks for Active Time-Resolved SPAD Imaging. 8132-8141 - Roshan Kenia, Jihane Mendil, Ahmed Jasim, Muthanna Al-Dahhan, Zhaozheng Yin:
Robust TRISO-fueled Pebble Identification by Digit Recognition. 8142-8150 - Zeqi Zhu, Arash Pourtaherian, Luc Waeijen, Ibrahim Batuhan Akkaya, Egor Bondarev, Orlando Moreira:
CATS: Combined Activation and Temporal Suppression for Efficient Network Inference. 8151-8160 - Ryan Rad:
Vision Transformer for Multispectral Satellite Imagery: Advancing Landcover Classification. 8161-8168 - Prateek Chhikara, Dhiraj Chaurasia, Yifan Jiang, Omkar Masur, Filip Ilievski:
FIRE: Food Image to REcipe generation. 8169-8179 - Siddeshwar Raghavan, Jiangpeng He, Fengqing Zhu:
Online Class-Incremental Learning For Real-World Food Image Classification. 8180-8189 - Di Chang, Yufeng Yin, Zongjian Li, Minh Tran, Mohammad Soleymani:
LibreFace: An Open-Source Toolkit for Deep Facial Expression Analysis. 8190-8200 - Zahra Babaiee, Peyman M. Kiasari, Daniela Rus, Radu Grosu:
Neural Echos: Depthwise Convolutional Filters Replicate Biological Receptive Fields. 8201-8210 - Weihao Xia, Raoul de Charette, Cengiz Öztireli, Jing-Hao Xue:
DREAM: Visual Decoding from REversing HumAn Visual SysteM. 8211-8220 - Sidike Paheding, Abel A. Reyes, A. Rajaneesh, K. S. Sajinkumar, Thomas Oommen:
MarsLS-Net: Martian Landslides Segmentation Network and Benchmark Dataset. 8221-8230 - André Luiz Buarque Vieira e Silva, Francisco Simões, Danny Kowerko, Tobias Schlosser, Felipe Battisti, Veronica Teichrieb:
Attention Modules Improve Image-Level Anomaly Detection for Industrial Inspection: A DifferNet Case Study. 8231-8240 - Marvin Burges, Sebastian Zambanini, Philipp Pirker:
CHAI: Craters in Historical Aerial Images. 8241-8250 - Rudraksh Kapil, Seyed Mojtaba Marvasti-Zadeh, Nadir Erbilgin, Nilanjan Ray:
ShadowSense: Unsupervised Domain Adaptation and Feature Fusion for Shadow-Agnostic Tree Crown Detection from RGB-Thermal Drone Imagery. 8251-8261 - Connor Greenwell, Jon Crall, Matthew Purri, Kristin J. Dana, Nathan Jacobs, Armin Hadzic, Scott Workman, Matthew J. Leotta:
WATCH: Wide-Area Terrestrial Change Hypercube. 8262-8271 - Jian Song, Hongruixuan Chen, Naoto Yokoya:
SyntheWorld: A Large-Scale Synthetic Dataset for Land Cover Mapping and Building Change Detection. 8272-8281 - Violet Felt, Justin Fletcher:
Seeing Stars: Learned Star Localization for Narrow-Field Astrometry. 8282-8290 - Justin Fletcher:
Deep Optics for Optomechanical Control Policy Design. 8291-8300 - Anindya Sarkar, Michael Lanier, Scott Alfeld, Jiarui Feng, Roman Garnett, Nathan Jacobs, Yevgeniy Vorobeychik:
A Visual Active Search Framework for Geospatial Exploration. 8301-8310 - Jiangying Qin, Ming Li, Jie Zhao, Jiageng Zhong, Hanqi Zhang:
Revolutionize the Oceanic Drone RGB Imagery with Pioneering Sun Glint Detection and Removal Techniques. 8311-8320 - Sayyedjavad Ziaratnia, Tipporn Laohakangvalvit, Midori Sugaya, Peeraya Sripian:
Multimodal Deep Learning for Remote Stress Estimation Using CCT-LSTM. 8321-8329 - Huiming Sun, Lan Fu, Jinlong Li, Qing Guo, Zibo Meng, Tianyun Zhang, Yuewei Lin, Hongkai Yu:
Defense against Adversarial Cloud Attack on Remote Sensing Salient Object Detection. 8330-8339 - Simiao Ren, Francesco Luzi, Saad Lahrichi, Kaleb Kassaw, Leslie M. Collins, Kyle Bradbury, Jordan M. Malof:
Segment anything, from space? 8340-8350 - Keiller Nogueira, Mayara Maezano Faita Pinheiro, Ana Paula Marques Ramos, Wesley Nunes Gonçalves, José Marcato Junior, Jefersson A. dos Santos:
Prototypical Contrastive Network for Imbalanced Aerial Image Segmentation. 8351-8361 - Benjamin Thérien, Chengjie Huang, Adrian Chow, Krzysztof Czarnecki:
Object Re-Identification from Point Clouds. 8362-8373 - Arkadeep Narayan Chaudhury, Leonid Keselman, Christopher G. Atkeson:
Shape from Shading for Robotic Manipulation. 8374-8383 - Sudarshan S. Harithas, Gurkirat Singh, Aneesh Chavan, Sarthak Sharma, Suraj Patni, Chetan Arora, K. Madhava Krishna:
FinderNet: A Data Augmentation Free Canonicalization aided Loop Detection and Closure technique for Point clouds in 6-DOF separation. 8384-8393 - Jack Borer, Jeremy Tschirner, Florian Ölsner, Stefan Milz:
From Chaos to Calibration: A Geometric Mutual Information Approach to Target-Free Camera LiDAR Extrinsic Calibration. 8394-8403 - Hemang Chawla, Arnav Varma, Elahe Arani, Bahram Zonooz:
Continual Learning of Unsupervised Monocular Depth from Videos. 8404-8414 - Pei-Chun Chien, Powei Liao, Eiji Fukuzawa, Jun Ohya:
Classifying Cable Tendency with Semantic Segmentation by Utilizing Real and Simulated RGB Data. 8415-8423 - Benedikt Kolbeinsson, Krystian Mikolajczyk:
Multi-Class Segmentation from Aerial Views using Recursive Noise Diffusion. 8424-8434 - Wenbo Li, Yi Wei, Yilin Shen, Hongxia Jin:
Efficient Layout-Guided Image Inpainting for Mobile Use. 8435-8444 - Clemens J. S. Schaefer, Siddharth Joshi, Shan Li, Raúl Blázquez:
Edge Inference with Fully Differentiable Quantized Mixed Precision Neural Networks. 8445-8454 - Pragya Paramita Sahu, Abhishek Raut, Jagdish Singh Samant, Mahesh Gorijala, Vignesh Lakshminarayanan, Pinaki Bhaskar:
POP-VQA - Privacy preserving, On-device, Personalized Visual Question Answering. 8455-8464 - Sangmin Woo, So-Yeong Jeon, Jinyoung Park, Minji Son, Sumin Lee, Changick Kim:
Sketch-based Video Object Localization. 8465-8474 - Ondrej Bohdal, Da Li, Shell Xu Hu, Timothy M. Hospedales:
Feed-Forward Latent Domain Adaptation. 8475-8484 - Marcos V. Conde, Florin-Alexandru Vasluianu, Radu Timofte:
BSRAW: Improving Blind RAW Image Super-Resolution. 8485-8495 - Matteo Dunnhofer, Luca Sordi, Niki Martinel, Christian Micheloni:
Tracking Skiers from the Top to the Bottom. 8496-8506 - Jarek Reynolds, Chandra Kanth Nagesh, Danna Gurari:
Salient Object Detection for Images Taken by People With Vision Impairments. 8507-8516 - Jialin Yuan, Ye Yu, Gaurav Mittal, Matthew Hall, Sandra Sajeev, Mei Chen:
Rethinking Multimodal Content Moderation from an Asymmetric Angle with Mixed-modality. 8517-8527 - Kajal Kansal, Yongkang Wong, Mohan S. Kankanhalli:
Privacy-Enhancing Person Re-identification Framework - A Dual-Stage Approach. 8528-8537 - Khiem Vuong, Robert Tamburo, Srinivasa G. Narasimhan:
Toward Planet-Wide Traffic Camera Calibration. 8538-8547 - Tai D. Nguyen, Shengbang Fang, Matthew C. Stamm:
VideoFACT: Detecting Video Forgeries Using Attention, Scene Context, and Forensic Traces. 8548-8558 - Snehashis Majhi, Rui Dai, Quan Kong, Lorenzo Garattoni, Gianpiero Francesca, François Brémond:
OE-CTST: Outlier-Embedded Cross Temporal Scale Transformer for Weakly-supervised Video Anomaly Detection. 8559-8568 - Arturo Miguel Russell Bernal, Walter J. Scheirer, Jane Cleland-Huang:
NOMAD: A Natural, Occluded, Multi-scale Aerial Dataset, for Emergency Response Scenarios. 8569-8580 - Colton Clemmer, Junhua Ding, Yunhe Feng:
PreciseDebias: An Automatic Prompt Engineering Approach for Generative AI to Mitigate Image Demographic Biases. 8581-8590 - Abid Ali, Ashish Marisetty, François Brémond:
P-Age: Pexels Dataset for Robust Spatio-Temporal Apparent Age Classification. 8591-8600 - Juan Leon Alcazar, Yazeed Alnumay, Cheng Zheng, Hassane Trigui, Sahejad Patel, Bernard Ghanem:
Learning to Read Analog Gauges from Synthetic Data. 8601-8610 - Johannes Flotzinger, Philipp Jonas Rösch, Thomas Braml:
dacl10k: Benchmark for Semantic Bridge Damage Segmentation. 8611-8620 - Achref Jaziri, Martin Mundt, Andres Fernandez Rodriguez, Visvanathan Ramesh:
Designing a Hybrid Neural System to Learn Real-world Crack Segmentation from Fractal-based Simulation. 8621-8631 - Fei Pan, Sangryul Jeon, Brian Wang, Frank McKenna, Stella X. Yu:
Zero-shot Building Attribute Extraction from Large-Scale Vision and Language Models. 8632-8641 - Sanket Kumar Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue:
Leveraging Next-Active Objects for Context-Aware Anticipation in Egocentric Videos. 8642-8651 - Sagie Benaim, Frederik Warburg, Peter Ebert Christensen, Serge J. Belongie:
Volumetric Disentanglement for 3D Scene Manipulation. 8652-8662 - Juan C. Pérez, Thu Nguyen-Phuoc, Chen Cao, Artsiom Sanakoyeu, Tomas Simon, Pablo Arbeláez, Bernard Ghanem, Ali K. Thabet, Albert Pumarola:
StyleAvatar: Stylizing Animatable Head Avatars. 8663-8672 - Zheng Chen, Zhiqi Zhang, Junsong Yuan, Yi Xu, Lantao Liu:
Show Your Face: Restoring Complete Facial Images from Partial Observations for VR Meeting. 8673-8682 - Patrick Grady, Jeremy A. Collins, Chengcheng Tang, Christopher D. Twigg, Kunal Aneja, James Hays, Charles C. Kemp:
PressureVision++: Estimating Fingertip Pressure from Diverse RGB Images. 8683-8693 - Hunor Laczkó, Meysam Madadi, Sergio Escalera, Jordi Gonzàlez:
A Generative Multi-Resolution Pyramid and Normal-Conditioning 3D Cloth Draping. 8694-8703 - Ziang Cheng, Jiayu Yang, Hongdong Li:
Stereo Matching in Time: 100+ FPS Video Stereo Matching for Extended Reality. 8704-8713