{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,3]],"date-time":"2026-05-03T02:45:44Z","timestamp":1777776344131,"version":"3.51.4"},"reference-count":962,"publisher":"Emerald","issue":"1-2","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,12,23]]},"abstract":"<jats:p>For robots to navigate and interact more richly with the world around them, they will likely require a deeper understanding of the world in which they operate. In robotics and related research fields, the study of understanding is often referred to as semantics, which dictates what does the world \u201cmean\u201d to a robot, and is strongly tied to the question of how to represent that meaning. With humans and robots increasingly operating in the same world, the prospects of human\u2013robot interaction also bring semantics and ontology of natural language into the picture. Driven by need, as well as by enablers like increasing availability of training data and computational resources, semantics is a rapidly growing research area in robotics. The field has received significant attention in the research literature to date, but most reviews and surveys have focused on particular aspects of the topic: the technical research issues regarding its use in specific robotic topics like mapping or segmentation, or its relevance to one particular application domain like autonomous driving. A new treatment is therefore required, and is also timely because so much relevant research has occurred since many of the key surveys were published. This survey therefore provides an overarching snapshot of where semantics in robotics stands today. We establish a taxonomy for semantics research in or relevant to robotics, split into four broad categories of activity, in which semantics are extracted, used, or both. Within these broad categories we survey dozens of major topics including fundamentals from the computer vision field and key robotics research areas utilizing semantics, including mapping, navigation and interaction with the world. The survey also covers key practical considerations, including enablers like increased data availability and improved computational hardware, and major application areas where semantics is or is likely to play a key role. In creating this survey, we hope to provide researchers across academia and industry with a comprehensive reference that helps facilitate future research in this exciting field.<\/jats:p>","DOI":"10.1561\/2300000059","type":"journal-article","created":{"date-parts":[[2020,12,23]],"date-time":"2020-12-23T05:19:01Z","timestamp":1608700741000},"page":"1-224","source":"Crossref","is-referenced-by-count":95,"title":["Semantics for Robotic Mapping, Perception and Interaction: A Survey"],"prefix":"10.1108","volume":"8","author":[{"given":"Sourav","family":"Garg","sequence":"first","affiliation":[{"name":"QUT Centre for Robotics and School of Electrical Engineering and Robotics, Queensland University of Technology ,","place":["Australia"]}]},{"given":"Niko","family":"S\u00fcnderhauf","sequence":"additional","affiliation":[{"name":"QUT Centre for Robotics and School of Electrical Engineering and Robotics, Queensland University of Technology ,","place":["Australia"]}]},{"given":"Feras","family":"Dayoub","sequence":"additional","affiliation":[{"name":"QUT Centre for Robotics and School of Electrical Engineering and Robotics, Queensland University of Technology ,","place":["Australia"]}]},{"given":"Douglas","family":"Morrison","sequence":"additional","affiliation":[{"name":"QUT Centre for Robotics and School of Electrical Engineering and Robotics, Queensland University of Technology ,","place":["Australia"]}]},{"given":"Akansel","family":"Cosgun","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Systems Engineering, Monash University ,","place":["Australia"]}]},{"given":"Gustavo","family":"Carneiro","sequence":"additional","affiliation":[{"name":"School of Computer Science, University of Adelaide ,","place":["Australia"]}]},{"given":"Qi","family":"Wu","sequence":"additional","affiliation":[{"name":"School of Computer Science, University of Adelaide ,","place":["Australia"]}]},{"given":"Tat-Jun","family":"Chin","sequence":"additional","affiliation":[{"name":"School of Computer Science, University of Adelaide ,","place":["Australia"]}]},{"given":"Ian","family":"Reid","sequence":"additional","affiliation":[{"name":"School of Computer Science, University of Adelaide ,","place":["Australia"]}]},{"given":"Stephen","family":"Gould","sequence":"additional","affiliation":[{"name":"College of Engineering and Computer Science, Australian National University ,","place":["Australia"]}]},{"given":"Peter","family":"Corke","sequence":"additional","affiliation":[{"name":"QUT Centre for Robotics and School of Electrical Engineering and Robotics, Queensland University of Technology ,","place":["Australia"]}]},{"given":"Michael","family":"Milford","sequence":"additional","affiliation":[{"name":"QUT Centre for Robotics and School of Electrical Engineering and Robotics, Queensland University of Technology ,","place":["Australia"]}]}],"member":"140","published-online":{"date-parts":[[2020,12,23]]},"reference":[{"issue":"6","key":"2026040113103210200_ref001","doi-asserted-by":"crossref","first-page":"1309","DOI":"10.1109\/TRO.2016.2624754","article-title":"Past, present, and future of simultaneous localization and mapping: Toward the robust-perception age","volume":"32","author":"Cadena","year":"2016","journal-title":"IEEE Transactions on Robotics"},{"issue":"2","key":"2026040113103210200_ref002","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1109\/MRA.2006.1638022","article-title":"Simultaneous localization and mapping: Part i","volume":"13","author":"Durrant-Whyte","year":"2006","journal-title":"IEEE Robotics and Automation Magazine"},{"issue":"3","key":"2026040113103210200_ref003","doi-asserted-by":"crossref","first-page":"108","DOI":"10.1109\/MRA.2006.1678144","article-title":"Simultaneous localization and mapping (SLAM): Part ii","volume":"13","author":"Bailey","year":"2006","journal-title":"IEEE Robotics and Automation Magazine"},{"issue":"3","key":"2026040113103210200_ref004","doi-asserted-by":"crossref","first-page":"52","DOI":"10.1145\/504729.504754","article-title":"Probabilistic robotics","volume":"45","author":"Thrun","year":"2002","journal-title":"Communications of the ACM"},{"key":"2026040113103210200_ref005","doi-asserted-by":"crossref","first-page":"1153","DOI":"10.1007\/978-3-319-32552-1_46","article-title":"Simultaneous localization and mapping","author":"Stachniss","year":"2016","journal-title":"Springer Handbook of Robotics"},{"key":"2026040113103210200_ref006","first-page":"477","article-title":"A review of recent developments in simultaneous localization and mapping","author":"Dissanayake","year":"2011","journal-title":"2011 6th International Conference on Industrial and Information Systems"},{"key":"2026040113103210200_ref007","doi-asserted-by":"crossref","first-page":"86","DOI":"10.1016\/j.robot.2014.12.006","article-title":"Semantic mapping for mobile robotics tasks: A survey","volume":"66","author":"Kostavelis","year":"2015","journal-title":"Robotics and Autonomous Systems"},{"key":"2026040113103210200_ref008","article-title":"FutureMapping: The computational structure of spatial AI systems","author":"Davison","year":"2018","journal-title":"arXiv preprint arXiv:1803.11288"},{"key":"2026040113103210200_ref009","article-title":"Futuremapping 2: Gaussian belief propagation for spatial ai","author":"Davison","year":"2019","journal-title":"arXiv preprint arXiv:1910.14139"},{"issue":"3","key":"2026040113103210200_ref010","doi-asserted-by":"crossref","DOI":"10.1177\/1729881420919185","article-title":"A survey of image semantics-based visual simultaneous localization and mapping: Application-oriented solutions to autonomous navigation of mobile robots","volume":"17","author":"Xia","year":"2020","journal-title":"International Journal of Advanced Robotic Systems"},{"key":"2026040113103210200_ref011","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1016\/j.neunet.2014.09.003","article-title":"Deep learning in neural networks: An overview","volume":"61","author":"Schmidhuber","year":"2015","journal-title":"Neural Networks"},{"issue":"2","key":"2026040113103210200_ref012","doi-asserted-by":"crossref","first-page":"261","DOI":"10.1007\/s11263-019-01247-4","article-title":"Deep learning for generic object detection: A survey","volume":"128","author":"Liu","year":"2020","journal-title":"International Journal of Computer Vision"},{"key":"2026040113103210200_ref013","first-page":"103","article-title":"A review of algorithms for filtering the 3D point cloud","volume":"57","author":"Han","year":"2017","journal-title":"Signal Processing: Image Communication"},{"key":"2026040113103210200_ref014","article-title":"A comprehensive review of 3d point cloud descriptors","author":"Hana","year":"2018","journal-title":"arXiv preprint arXiv:1802.02297"},{"issue":"19","key":"2026040113103210200_ref015","doi-asserted-by":"crossref","first-page":"4188","DOI":"10.3390\/s19194188","article-title":"Deep learning on point clouds and its application: A survey","volume":"19","author":"Liu","year":"2019","journal-title":"Sensors"},{"key":"2026040113103210200_ref016","article-title":"Deep learning for 3d point clouds: A survey","author":"Guo","year":"2020","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"issue":"2","key":"2026040113103210200_ref017","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3042064","article-title":"Deep learning advances in computer vision with 3d data: A survey","volume":"50","author":"Ioannidou","year":"2017","journal-title":"ACM Computing Surveys (CSUR)"},{"issue":"10","key":"2026040113103210200_ref018","doi-asserted-by":"crossref","first-page":"3782","DOI":"10.1109\/TITS.2019.2892405","article-title":"A survey on 3d object detection methods for autonomous driving applications","volume":"20","author":"Arnold","year":"2019","journal-title":"IEEE Transactions on Intelligent Transportation Systems"},{"key":"2026040113103210200_ref019","doi-asserted-by":"crossref","first-page":"2947","DOI":"10.1109\/TIP.2019.2955239","article-title":"Recent advances in 3D object detection in the era of deep neural networks: A survey","volume":"29","author":"Rahman","year":"2019","journal-title":"IEEE Transactions on Image Processing"},{"key":"2026040113103210200_ref020","first-page":"339","article-title":"A review of point clouds segmentation and classification algorithms","volume":"42","author":"Grilli","year":"2017","journal-title":"The International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences"},{"key":"2026040113103210200_ref021","doi-asserted-by":"crossref","first-page":"179 118","DOI":"10.1109\/ACCESS.2019.2958671","article-title":"A review of deep learning-based semantic segmentation for point cloud (november 2019)","volume":"7","author":"Zhang","year":"2019","journal-title":"IEEE Access"},{"key":"2026040113103210200_ref022","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1016\/j.neucom.2019.02.003","article-title":"Survey on semantic segmentation using deep learning techniques","volume":"338","author":"Lateef","year":"2019","journal-title":"Neurocomputing"},{"key":"2026040113103210200_ref023","doi-asserted-by":"crossref","DOI":"10.1109\/MGRS.2019.2937630","article-title":"A review of point cloud semantic segmentation","author":"Xie","year":"2020","journal-title":"IEEE Geoscience and Remote Sensing Magazine (GRSM)"},{"issue":"2","key":"2026040113103210200_ref024","first-page":"735","article-title":"Deep learning for semantic segmentation of 3D point cloud","volume":"42","author":"Malinverni","year":"2019","journal-title":"International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences"},{"key":"2026040113103210200_ref025","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1016\/j.asoc.2018.05.018","article-title":"A survey on deep learning techniques for image and video semantic segmentation","volume":"70","author":"Garcia-Garcia","year":"2018","journal-title":"Applied Soft Computing"},{"issue":"2","key":"2026040113103210200_ref026","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1007\/s13735-017-0141-z","article-title":"A review of semantic segmentation using deep neural networks","volume":"7","author":"Guo","year":"2018","journal-title":"International Journal of Multimedia Information Retrieval"},{"issue":"2","key":"2026040113103210200_ref027","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1007\/s11633-017-1053-3","article-title":"A survey on deep learning-based fine-grained object classification and semantic segmentation","volume":"14","author":"Zhao","year":"2017","journal-title":"International Journal of Automation and Computing"},{"key":"2026040113103210200_ref028","first-page":"1","article-title":"Deep semantic segmentation for automated driving: Taxonomy, roadmap and challenges","author":"Siam","year":"2017","journal-title":"2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC)"},{"key":"2026040113103210200_ref029","article-title":"A survey of semantic segmentation","author":"Thoma","year":"2016","journal-title":"arXiv preprint arXiv:1602.06541"},{"key":"2026040113103210200_ref030","doi-asserted-by":"crossref","first-page":"12","DOI":"10.1016\/j.jvcir.2015.10.012","article-title":"Beyond pixels: A comprehensive survey from bottom-up to semantic image segmentation and cosegmentation","volume":"34","author":"Zhu","year":"2016","journal-title":"Journal of Visual Communication and Image Representation"},{"key":"2026040113103210200_ref031","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1016\/j.robot.2019.05.013","article-title":"A survey on semantic-based methods for the understanding of human movements","volume":"119","author":"Ramirez-Amaro","year":"2019","journal-title":"Robotics and Autonomous Systems"},{"issue":"1","key":"2026040113103210200_ref032","doi-asserted-by":"crossref","DOI":"10.3390\/robotics5010008","article-title":"Extracting semantic information from visual data: A survey","volume":"5","author":"Liu","journal-title":"Robotics"},{"key":"2026040113103210200_ref033","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1016\/j.robot.2019.03.005","article-title":"A survey of knowledge representation in service robotics","volume":"118","author":"Paulius","year":"2019","journal-title":"Robotics and Autonomous Systems"},{"key":"2026040113103210200_ref034","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1007\/3-540-45113-7_29","article-title":"Towards a comprehensive survey of the semantic gap in visual image retrieval","author":"Enser","year":"2003","journal-title":"International Conference on Image and Video Retrieval"},{"issue":"1","key":"2026040113103210200_ref035","doi-asserted-by":"crossref","first-page":"262","DOI":"10.1016\/j.patcog.2006.04.045","article-title":"A survey of content-based image retrieval with high-level semantics","volume":"40","author":"Liu","year":"2007","journal-title":"Pattern Recognition"},{"issue":"1","key":"2026040113103210200_ref036","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/2906152","article-title":"Socializing the semantic gap: A comparative survey on image tag assignment, refinement, and retrieval","volume":"49","author":"Li","year":"2016","journal-title":"ACM Computing Surveys (CSUR)"},{"key":"2026040113103210200_ref037","doi-asserted-by":"crossref","first-page":"496","DOI":"10.1016\/j.procs.2015.04.020","article-title":"Reducing semantic gap in video retrieval with fusion: A survey","volume":"50","author":"Sudha","year":"2015","journal-title":"Procedia Computer Science"},{"key":"2026040113103210200_ref038","first-page":"372","article-title":"Ontology based semantic search: An introduction and a survey of current approaches","author":"Ramkumar","year":"2014","journal-title":"2014 International Conference on Intelligent Computing Applications"},{"issue":"1","key":"2026040113103210200_ref039","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1016\/j.patcog.2011.05.013","article-title":"A review on automatic image annotation techniques","volume":"45","author":"Zhang","year":"2012","journal-title":"Pattern Recognition"},{"key":"2026040113103210200_ref040","volume":"4","author":"Ehrig","year":"2006","journal-title":"Ontology Alignment: Bridging the Semantic Gap"},{"key":"2026040113103210200_ref041","doi-asserted-by":"crossref","DOI":"10.1117\/12.647755","article-title":"Mind the gap: Another look at the problem of the semantic gap in image retrieval","volume":"6073","author":"Hare","year":"2006","journal-title":"Multimedia Content Analysis, Management, and Retrieval 2006"},{"issue":"5","key":"2026040113103210200_ref042","first-page":"536","article-title":"Review and research on \u2018semantic gap\u2019 problem in the content based image retrieval","volume":"35","author":"Wen","year":"2005","journal-title":"Journal of Northwest University (Natural Science Edition)"},{"key":"2026040113103210200_ref043","first-page":"1","article-title":"Vision-based robotic grasping from object localization, object pose estimation to grasp estimation for parallel grippers: A review","author":"Du","year":"2020","journal-title":"Artificial Intelligence Review"},{"key":"2026040113103210200_ref044","article-title":"Affordances in robotic tasks\u2014A survey","author":"Ard\u00f3n","year":"2020","journal-title":"arXiv preprint arXiv:2004.07400"},{"key":"2026040113103210200_ref045","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1146\/annurev-control-101119-071628","article-title":"Robots that use language","volume":"3","author":"Tellex","year":"2020","journal-title":"Annual Review of Control, Robotics, and Autonomous Systems"},{"key":"2026040113103210200_ref046","first-page":"601","article-title":"PointNet: Deep learning on point sets for 3D classification and segmentation","author":"Qi","year":"2017","journal-title":"Proceedings\u201430th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017"},{"key":"2026040113103210200_ref047","first-page":"95","article-title":"Deep projective 3D semantic segmentation","author":"Lawin","year":"2017","journal-title":"International Conference on Computer Analysis of Images and Patterns"},{"key":"2026040113103210200_ref048","doi-asserted-by":"crossref","first-page":"2670","DOI":"10.1109\/ICPR.2016.7900038","article-title":"Point cloud labeling using 3d convolutional neural network","author":"Huang","year":"2016","journal-title":"2016 23rd International Conference on Pattern Recognition (ICPR)"},{"key":"2026040113103210200_ref049","doi-asserted-by":"publisher","first-page":"189","DOI":"10.1016\/j.cag.2017.11.010","article-title":"SnapNet: 3D point cloud semantic labeling with 2D deep segmentation networks","volume":"71","author":"Boulch","year":"2018","journal-title":"Computers and Graphics (Pergamon)"},{"key":"2026040113103210200_ref050","first-page":"458","article-title":"3DMV: Joint 3D-multi-view prediction for 3D semantic scene segmentation","author":"Dai","year":"2018","journal-title":"Proceedings of the European Conference on Computer Vision (ECCV)"},{"key":"2026040113103210200_ref051","doi-asserted-by":"crossref","first-page":"2036","DOI":"10.1109\/CVPR.2009.5206718","article-title":"Towards total scene understanding: Classification, annotation and segmentation in an automatic framework","author":"Li","year":"2009","journal-title":"2009 IEEE Conference on Computer Vision and Pattern Recognition,"},{"key":"2026040113103210200_ref052","first-page":"1","article-title":"Object Bank: A high-level image representation for scene classification and semantic feature sparsification","author":"Li","year":"2010","journal-title":"Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010, NIPS 2010"},{"key":"2026040113103210200_ref053","first-page":"702","article-title":"Describing the scene as a whole: Joint object detection, scene classification and semantic segmentation","author":"Yao","year":"2012","journal-title":"Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition"},{"issue":"3","key":"2026040113103210200_ref054","doi-asserted-by":"crossref","first-page":"222","DOI":"10.1007\/s11263-013-0636-x","article-title":"Image classification with the fisher vector: Theory and practice","volume":"105","author":"S\u00e1nchez","year":"2013","journal-title":"International Journal of Computer Vision"},{"issue":"3","key":"2026040113103210200_ref055","doi-asserted-by":"crossref","first-page":"404","DOI":"10.1134\/S1054661819030222","article-title":"Image classification model using visual bag of semantic words","volume":"29","author":"Qi","year":"2019","journal-title":"Pattern Recognition and Image Analysis"},{"key":"2026040113103210200_ref056","doi-asserted-by":"crossref","first-page":"248","DOI":"10.1109\/CVPR.2009.5206848","article-title":"Imagenet: A large-scale hierarchical image database","author":"Deng","year":"2009","journal-title":"2009 IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref057","first-page":"1097","article-title":"Imagenet classification with deep convolutional neural networks","author":"Krizhevsky","year":"2012","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2026040113103210200_ref058","first-page":"487","article-title":"Learning deep features for scene recognition using places database","author":"Zhou","year":"2014","journal-title":"MIT Web Domain"},{"issue":"6","key":"2026040113103210200_ref059","doi-asserted-by":"crossref","first-page":"1452","DOI":"10.1109\/TPAMI.2017.2723009","article-title":"Places: A 10 million image database for scene recognition","volume":"40","author":"Zhou","year":"2018","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2026040113103210200_ref060","article-title":"Object detectors emerge in deep scene CNNs","author":"Zhou","year":"2015","journal-title":"ICLR"},{"key":"2026040113103210200_ref061","article-title":"Learning deep features for discriminative localization","author":"Zhou","year":"2016","journal-title":"Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref062","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1145\/3132515.3132519","article-title":"A deep multi-modal fusion approach for semantic place prediction in social media","author":"Meng","year":"2017","journal-title":"MUSA2 2017\u2014Proceedings of the Workshop on Multimodal Understanding of Social, Affective and Subjective Attributes, Co-Located with MM 2017"},{"key":"2026040113103210200_ref063","doi-asserted-by":"publisher","first-page":"55","DOI":"10.1007\/978-3-030-30508-6_5","article-title":"Aggregating rich deep semantic features for fine-grained place classification","volume":"11729","author":"Wei","year":"2019","journal-title":"International Conference on Artificial Neural Networks"},{"key":"2026040113103210200_ref064","first-page":"5995","article-title":"Modality and component aware feature fusion for RGB-D scene classification","author":"Wang","year":"2016","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref065","first-page":"2827","article-title":"Cross modal distillation for supervision transfer","author":"Gupta","year":"2016","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref066","first-page":"11 836","article-title":"Translate-torecognize networks for rgb-d scene recognition","author":"Du","year":"2019","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"issue":"4","key":"2026040113103210200_ref067","doi-asserted-by":"crossref","first-page":"152","DOI":"10.1109\/TBDATA.2016.2515640","article-title":"Learning visual semantic relationships for efficient visual retrieval","volume":"1","author":"Hong","year":"2016","journal-title":"IEEE Transactions on Big Data"},{"key":"2026040113103210200_ref068","doi-asserted-by":"crossref","first-page":"537","DOI":"10.1016\/j.patcog.2017.11.032","article-title":"Sketch-based image retrieval with deep visual semantic descriptor","volume":"76","author":"Huang","year":"2018","journal-title":"Pattern Recognition"},{"issue":"1","key":"2026040113103210200_ref069","first-page":"1","article-title":"Latent semantic minimal hashing for image retrieval","volume":"26","author":"Hoang","year":"2009","journal-title":"IEEE Transactions on Image Processing on Image Processing"},{"key":"2026040113103210200_ref070","first-page":"5272","article-title":"Beyond instance-level image retrieval: Leveraging captions to learn a global visual representation for semantic retrieval","volume":"2017","author":"Gordo","year":"2017","journal-title":"Proceedings\u201430th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017"},{"key":"2026040113103210200_ref071","doi-asserted-by":"publisher","first-page":"199","DOI":"10.1016\/j.jvcir.2017.11.021","article-title":"Saliency-based multi-feature modeling for semantic image retrieval","volume":"50","author":"Bai","year":"2017","journal-title":"Journal of Visual Communication and Image Representation"},{"issue":"2","key":"2026040113103210200_ref072","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","article-title":"Distinctive image features from scale-invariant keypoints","volume":"60","author":"Lowe","year":"2004","journal-title":"International Journal of Computer Vision"},{"issue":"7","key":"2026040113103210200_ref073","doi-asserted-by":"crossref","first-page":"971","DOI":"10.1109\/TPAMI.2002.1017623","article-title":"Multiresolution gray-scale and rotation invariant texture classification with local binary patterns","volume":"24","author":"Ojala","year":"2002","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"issue":"3","key":"2026040113103210200_ref074","doi-asserted-by":"crossref","first-page":"569","DOI":"10.1109\/TPAMI.2014.2345401","article-title":"Global contrast based salient region detection","volume":"37","author":"Cheng","year":"2015","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2026040113103210200_ref075","first-page":"172","article-title":"Semantic image retrieval via active grounding of visual situations","volume":"2018","author":"Quinn","year":"2018","journal-title":"Proceedings\u201412th IEEE International Conference on Semantic Computing, ICSC 2018"},{"key":"2026040113103210200_ref076","doi-asserted-by":"crossref","first-page":"638","DOI":"10.1109\/WACV.2019.00073","article-title":"Hierarchy-based image embeddings for semantic image retrieval","author":"Barz","year":"2019","journal-title":"Proceedings\u20142019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019"},{"key":"2026040113103210200_ref077","volume-title":"WordNet: An Electronic Lexical Database","author":"Miller","year":"1998"},{"key":"2026040113103210200_ref078","first-page":"2278","article-title":"Sfnet: Learning object-aware semantic correspondence","author":"Lee","year":"2019","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref079","first-page":"4654","article-title":"Visual semantic reasoning for image-text matching","author":"Li","year":"2019","journal-title":"Proceedings of the IEEE International Conference on Computer Vision"},{"key":"2026040113103210200_ref080","first-page":"1","article-title":"Faster R-CNN: Towards real-time object detection with region proposal networks","author":"Ren","year":"2015","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2026040113103210200_ref081","first-page":"3984","article-title":"Finding beans in burgers: Deep semantic-visual embedding with localization","author":"Engilberge","year":"2018","journal-title":"Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition"},{"issue":"5","key":"2026040113103210200_ref082","doi-asserted-by":"crossref","first-page":"1180","DOI":"10.1109\/TCYB.2016.2539546","article-title":"Bi-level semantic representation analysis for multimedia event detection","volume":"47","author":"Chang","year":"2017","journal-title":"IEEE Transactions on Cybernetics"},{"key":"2026040113103210200_ref083","doi-asserted-by":"crossref","first-page":"1150","DOI":"10.1109\/ICCV.1999.790410","article-title":"Object recognition from local scale-invariant features","volume":"2","author":"Lowe","year":"1999","journal-title":"Proceedings of the Seventh IEEE International Conference on Computer Vision"},{"key":"2026040113103210200_ref084","first-page":"404","article-title":"Surf: Speeded up robust features","author":"Bay","year":"2006","journal-title":"European Conference on Computer Vision"},{"key":"2026040113103210200_ref085","first-page":"1440","article-title":"Fast R-CNN","volume":"2015","author":"Girshick","year":"2015","journal-title":"Proceedings of the IEEE International Conference on Computer Vision"},{"issue":"9","key":"2026040113103210200_ref086","doi-asserted-by":"crossref","first-page":"1904","DOI":"10.1109\/TPAMI.2015.2389824","article-title":"Spatial pyramid pooling in deep convolutional networks for visual recognition","volume":"37","author":"He","year":"2015","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2026040113103210200_ref087","article-title":"A fast, modular scene understanding system using context-aware object detection","author":"Cadena","year":"2015","journal-title":"Proc. International Conf. on Robotics and Automation"},{"key":"2026040113103210200_ref088","article-title":"Complexer-YOLO: Real-time 3D object detection and tracking on semantic point clouds","author":"Simon","year":"2019","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops"},{"issue":"6","key":"2026040113103210200_ref089","doi-asserted-by":"crossref","first-page":"1687","DOI":"10.1109\/TCSVT.2018.2848358","article-title":"Semantics-aware visual object tracking","volume":"29","author":"Yao","year":"2019","journal-title":"IEEE Transactions on Circuits and Systems for Video Technology"},{"key":"2026040113103210200_ref090","first-page":"5813","article-title":"Single-shot object detection with enriched semantics","author":"Zhang","year":"2018","journal-title":"Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref091","doi-asserted-by":"crossref","first-page":"8634","DOI":"10.1609\/aaai.v33i01.33018634","article-title":"Visual-semantic graph reasoning for pedestrian attribute recognition","volume":"33","author":"Li","year":"2019","journal-title":"Proceedings of the AAAI Conference on Artificial Intelligence"},{"key":"2026040113103210200_ref092","first-page":"12 224","article-title":"Autolabeling 3D objects with differentiable rendering of sdf shape priors","author":"Zakharov","year":"2020","journal-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition"},{"issue":"2","key":"2026040113103210200_ref093","first-page":"207","article-title":"Colour image segmentation: A state-of-the-art survey","volume":"67","author":"Lucchese","year":"2001","journal-title":"Proceedings-Indian National Science Academy Part A"},{"issue":"6","key":"2026040113103210200_ref094","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1109\/34.87344","article-title":"Watersheds in digital spaces: An efficient algorithm based on immersion simulations","author":"Vincent","year":"1991","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"issue":"5","key":"2026040113103210200_ref095","doi-asserted-by":"crossref","first-page":"603","DOI":"10.1109\/34.1000236","article-title":"Mean shift: A robust approach toward feature space analysis","volume":"24","author":"Comaniciu","year":"2002","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2026040113103210200_ref096","first-page":"373","article-title":"Inducing semantic segmentation from an example","author":"Schnitman","year":"2006","journal-title":"Asian Conference on Computer Vision"},{"issue":"3","key":"2026040113103210200_ref097","doi-asserted-by":"crossref","first-page":"298","DOI":"10.1109\/TCSVT.2007.890636","article-title":"Semantic image segmentation and object labeling","volume":"17","author":"Athanasiadis","year":"2007","journal-title":"IEEE Transactions on Circuits and Systems for Video Technology"},{"key":"2026040113103210200_ref098","first-page":"1485","article-title":"Pylon model for semantic segmentation","author":"Lempitsky","year":"2011","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2026040113103210200_ref099","first-page":"670","article-title":"Class segmentation and object localization with superpixel neighborhoods","author":"Fulkerson","year":"2009","journal-title":"2009 IEEE 12th International Conference on Computer Vision"},{"key":"2026040113103210200_ref100","first-page":"1","article-title":"Textonboost: Joint appearance, shape and context modeling for multi-class object recognition and segmentation","author":"Shotton","year":"2006","journal-title":"European Conference on Computer Vision"},{"issue":"11","key":"2026040113103210200_ref101","doi-asserted-by":"crossref","first-page":"1222","DOI":"10.1109\/34.969114","article-title":"Fast approximate energy minimization via graph cuts","volume":"23","author":"Boykov","year":"2001","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2026040113103210200_ref102","first-page":"640","article-title":"Fully convolutional networks for semantic segmentation","volume":"39","author":"Long","year":"2015","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2026040113103210200_ref103","article-title":"Multi-scale context aggregation by dilated convolutions","author":"Yu","year":"2016","journal-title":"ICLR"},{"issue":"12","key":"2026040113103210200_ref104","doi-asserted-by":"crossref","first-page":"2481","DOI":"10.1109\/TPAMI.2016.2644615","article-title":"SegNet: A deep convolutional encoder-decoder architecture for image segmentation","volume":"39","author":"Badrinarayanan","year":"2017","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2026040113103210200_ref105","first-page":"234","article-title":"U-net: Convolutional networks for biomedical image segmentation","author":"Ronneberger","year":"2015","journal-title":"International Conference on Medical Image Computing and Computer-Assisted Intervention"},{"key":"2026040113103210200_ref106","article-title":"Enet: A deep neural network architecture for real-time semantic segmentation","author":"Paszke","year":"2016","journal-title":"arXiv preprint arXiv:1606.02147"},{"issue":"4","key":"2026040113103210200_ref107","doi-asserted-by":"crossref","first-page":"834","DOI":"10.1109\/TPAMI.2017.2699184","article-title":"DeepLab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs","volume":"40","author":"Chen","year":"2018","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2026040113103210200_ref108","first-page":"5229","article-title":"Gated-SCNN: Gated shape CNNs for semantic segmentation","author":"Takikawa","year":"2019","journal-title":"Proceedings of the IEEE International Conference on Computer Vision"},{"issue":"9","key":"2026040113103210200_ref109","doi-asserted-by":"publisher","first-page":"973","DOI":"10.1007\/s11263-018-1072-8","article-title":"Semantic foggy scene understanding with synthetic data","volume":"126","author":"Sakaridis","year":"2018","journal-title":"International Journal of Computer Vision"},{"issue":"2","key":"2026040113103210200_ref110","doi-asserted-by":"crossref","first-page":"3580","DOI":"10.1109\/LRA.2020.2978666","article-title":"Semantic segmentation with unsupervised domain adaptation under varying weather conditions for autonomous vehicles","volume":"5","author":"Erkent","year":"2020","journal-title":"IEEE Robotics and Automation Letters"},{"key":"2026040113103210200_ref111","first-page":"3168","article-title":"Feature space optimization for semantic video segmentation","author":"Kundu","year":"2016","journal-title":"Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref112","first-page":"270","article-title":"The cityscapes dataset for semantic urban scene understanding","volume":"29","author":"Cordts","year":"2016","journal-title":"Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref113","doi-asserted-by":"crossref","DOI":"10.1109\/ICRA40945.2020.9197204","article-title":"Temporal information integration for video semantic segmentation","author":"Guarino","year":"2020","journal-title":"2020 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref114","first-page":"7158","article-title":"STD2P: RGBD semantic segmentation using spatio-temporal data-driven pooling","author":"He","year":"2017","journal-title":"Proceedings\u201430th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017"},{"key":"2026040113103210200_ref115","article-title":"Gradient and log-based active learning for semantic segmentation of crop and weed for agricultural robots","author":"Lottes","year":"2020","journal-title":"2020 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref116","first-page":"724","article-title":"A benchmark dataset and evaluation methodology for video object segmentation","author":"Perazzi","year":"2016","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref117","first-page":"221","article-title":"One-shot video object segmentation","author":"Caelles","year":"2017","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"issue":"1","key":"2026040113103210200_ref118","doi-asserted-by":"crossref","first-page":"38","DOI":"10.1109\/TIP.2017.2754941","article-title":"Video salient object detection via fully convolutional networks","volume":"27","author":"Wang","year":"2017","journal-title":"IEEE Transactions on Image Processing"},{"issue":"1","key":"2026040113103210200_ref119","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1109\/TPAMI.2017.2662005","article-title":"Saliency-aware video object segmentation","volume":"40","author":"Wang","year":"2017","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2026040113103210200_ref120","first-page":"2663","article-title":"Learning video object segmentation from static images","author":"Perazzi","year":"2017","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref121","first-page":"2701","article-title":"Learning features by watching objects move","author":"Pathak","year":"2017","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref122","article-title":"A probabilistic framework for real-time 3D segmentation using spatial, temporal, and semantic cues","volume":"12","author":"Held","year":"2016","journal-title":"Robotics: Science and Systems"},{"key":"2026040113103210200_ref123","first-page":"2961","article-title":"Mask R-CNN","author":"He","year":"2017","journal-title":"Proceedings of the IEEE International Conference on Computer Vision"},{"key":"2026040113103210200_ref124","first-page":"5221","article-title":"Deep watershed transform for instance segmentation","author":"Bai","year":"2017","journal-title":"Proceedings\u201430th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017"},{"key":"2026040113103210200_ref125","first-page":"5038","article-title":"Segmentation-aware convolutional networks using local attention masks","author":"Harley","year":"2017","journal-title":"Proceedings of the IEEE International Conference on Computer Vision"},{"key":"2026040113103210200_ref126","first-page":"2277","article-title":"Associative embedding: End-to-end learning for joint detection and grouping","author":"Newell","year":"2017","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2026040113103210200_ref127","first-page":"6409","article-title":"Mask scoring R-CNN","author":"Huang","year":"2019","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref128","first-page":"9404","article-title":"Panoptic segmentation","author":"Kirillov","year":"2019","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref129","first-page":"6399","article-title":"Panoptic feature pyramid networks","author":"Kirillov","year":"2019","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref130","first-page":"8818","article-title":"Upsnet: A unified panoptic segmentation network","author":"Xiong","year":"2019","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref131","article-title":"Fusing predictions for end-to-end panoptic segmentation","author":"Li","year":"2020"},{"key":"2026040113103210200_ref132","first-page":"8523","article-title":"Real-time panoptic segmentation from dense detections","author":"Hou","year":"2020","journal-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref133","first-page":"12 475","article-title":"Panoptic-deeplab: A simple, strong, and fast baseline for bottom-up panoptic segmentation","author":"Cheng","year":"2020","journal-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref134","article-title":"Axial-deeplab: Stand-alone axial-attention for panoptic segmentation","author":"Wang","year":"2020","journal-title":"Proceedings of the European Conference on Computer Vision (ECCV)"},{"key":"2026040113103210200_ref135","first-page":"746","article-title":"Indoor segmentation and support inference from RGBD images","author":"Silberman","year":"2012","journal-title":"European Conference on Computer Vision"},{"issue":"2","key":"2026040113103210200_ref136","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1007\/s11263-014-0777-6","article-title":"Indoor scene understanding with RGB-D images: Bottom-up segmentation, object detection and semantic segmentation","volume":"112","author":"Gupta","year":"2015","journal-title":"International Journal of Computer Vision"},{"key":"2026040113103210200_ref137","article-title":"Multi-modal auto-encoders as joint estimators for robotics scene understanding","author":"Cadena","year":"2016","journal-title":"Robotics: Science and Systems"},{"key":"2026040113103210200_ref138","article-title":"Multimodal deep learning","author":"Ngiam","year":"2011","journal-title":"ICML"},{"key":"2026040113103210200_ref139","first-page":"465","article-title":"Deep multispectral semantic scene understanding of forested environments using multimodal fusion","author":"Valada","year":"2016","journal-title":"International Symposium on Experimental Robotics"},{"key":"2026040113103210200_ref140","first-page":"1841","article-title":"Learning semantic segmentation from synthetic data: A geometrically guided input-output adaptation approach","author":"Chen","year":"2019","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref141","first-page":"384","article-title":"Identifying unknown instances for autonomous driving","author":"Wong","year":"2020","journal-title":"Conference on Robot Learning"},{"key":"2026040113103210200_ref142","first-page":"154","article-title":"Exploiting semantic information and deep matching for optical flow","author":"Bai","year":"2016","journal-title":"European Conference on Computer Vision"},{"key":"2026040113103210200_ref143","first-page":"1","article-title":"SegStereo: Exploiting semantic information for disparity estimation","author":"Yang","year":"2018","journal-title":"Eccv"},{"key":"2026040113103210200_ref144","doi-asserted-by":"crossref","DOI":"10.1109\/ICRA40945.2020.9196784","article-title":"Real-time semantic stereo matching","author":"Dovesi","year":"2020","journal-title":"2020 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref145","first-page":"7484","article-title":"Semantic stereo matching with pyramid cost volumes","author":"Wu","year":"2019","journal-title":"Iccv"},{"key":"2026040113103210200_ref146","article-title":"Semantically-guided representation learning for self-supervised monocular depth","author":"Guizilini","year":"2020","journal-title":"2020 International Conference on Learning Representations (ICLR)"},{"key":"2026040113103210200_ref147","first-page":"1","article-title":"Fast scene understanding for autonomous driving","author":"Neven","year":"2017","journal-title":"Proceedings DLVP2017"},{"key":"2026040113103210200_ref148","first-page":"290","article-title":"UberNet: Training a universal convolutional neural network for low-, mid-, and high-level vision using diverse datasets and limited memory","volume":"99","author":"Kokkinos","year":"2017","journal-title":"Proceedings\u201430th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017"},{"key":"2026040113103210200_ref149","first-page":"7101","article-title":"Real-time joint semantic segmentation and depth estimation using asymmetric annotations","author":"Nekrasov","year":"2019","journal-title":"Proceedings\u2014IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref150","first-page":"4628","article-title":"SemanticFusion: Dense 3D semantic mapping with convolutional neural networks","author":"McCormac","year":"2017","journal-title":"Proceedings\u2014IEEE International Conference on Robotics and Automation,"},{"key":"2026040113103210200_ref151","first-page":"96","article-title":"Predicting polarization beyond semantics for wearable robotics","author":"Yang","year":"2019","journal-title":"IEEE-RAS International Conference on Humanoid Robots"},{"key":"2026040113103210200_ref152","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1007\/978-3-642-03798-6_6","article-title":"The stixel world\u2014A compact medium level representation of the 3d-world","author":"Badino","year":"2009","journal-title":"Joint Pattern Recognition Symposium"},{"key":"2026040113103210200_ref153","first-page":"110","article-title":"Semantic stixels: Depth is not enough","author":"Schneider","year":"2016","journal-title":"IEEE Intelligent Vehicles Symposium, Proceedings"},{"issue":"12","key":"2026040113103210200_ref154","doi-asserted-by":"crossref","first-page":"1349","DOI":"10.1109\/34.895972","article-title":"Content-based image retrieval at the end of the early years","volume":"22","author":"Smeulders","year":"2000","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2026040113103210200_ref155","doi-asserted-by":"crossref","first-page":"123","DOI":"10.1007\/978-0-387-35561-0_9","article-title":"A user interface for emergent semantics in image databases","author":"Santini","year":"1999","journal-title":"Database Semantics"},{"key":"2026040113103210200_ref156","doi-asserted-by":"crossref","first-page":"406","DOI":"10.1109\/ICCV.1990.139562","article-title":"Steerable filters for early vision, image analysis, and wavelet decomposition","author":"Freeman","year":"1990","journal-title":"[1990] Proceedings Third International Conference on Computer Vision"},{"issue":"3","key":"2026040113103210200_ref157","doi-asserted-by":"crossref","first-page":"394","DOI":"10.1109\/TPAMI.2007.61","article-title":"Supervised learning of semantic classes for image annotation and retrieval","volume":"29","author":"Carneiro","year":"2007","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"issue":"11","key":"2026040113103210200_ref158","doi-asserted-by":"crossref","first-page":"2274","DOI":"10.1109\/TPAMI.2012.120","article-title":"Slic superpixels compared to state-of-the-art superpixel methods","volume":"34","author":"Achanta","year":"2012","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"issue":"5","key":"2026040113103210200_ref159","doi-asserted-by":"crossref","first-page":"530","DOI":"10.1109\/34.589215","article-title":"Local grayvalue invariants for image retrieval","volume":"19","author":"Schmid","year":"1997","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2026040113103210200_ref160","doi-asserted-by":"crossref","first-page":"261","DOI":"10.1007\/978-3-642-77225-2_13","article-title":"Indexing via color histograms","author":"Swain","year":"1992","journal-title":"Active Perception and Robot Vision"},{"key":"2026040113103210200_ref161","doi-asserted-by":"publisher","first-page":"1470","DOI":"10.1109\/ICCV.2003.1238663","article-title":"Video google: A text retrieval approach to object matching in videos","volume":"2","author":"Sivic","year":"2003","journal-title":"Proceedings 9th IEEE International Conference on Computer Vision"},{"issue":"12","key":"2026040113103210200_ref162","doi-asserted-by":"crossref","first-page":"1188","DOI":"10.1016\/j.robot.2009.06.010","article-title":"A comparison of loop closing techniques in monocular SLAM","volume":"57","author":"Williams","year":"2009","journal-title":"Robotics and Autonomous Systems"},{"key":"2026040113103210200_ref163","first-page":"1417","article-title":"Holistic scene understanding for 3D object detection with RGBD cameras","author":"Lin","year":"2013","journal-title":"Proceedings of the IEEE International Conference on Computer Vision"},{"key":"2026040113103210200_ref164","first-page":"3668","article-title":"Image retrieval using scene graphs","author":"Johnson","year":"2015","journal-title":"Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref165","doi-asserted-by":"crossref","first-page":"70","DOI":"10.18653\/v1\/W15-2812","article-title":"Generating semantically precise scene graphs from textual descriptions for improved image retrieval","author":"Schuster","year":"2015","journal-title":"Proceedings of the Fourth Workshop on Vision and Language"},{"key":"2026040113103210200_ref166","first-page":"3097","article-title":"Scene graph generation by iterative message passing","author":"Xu","year":"2017","journal-title":"Proceedings\u201430th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017"},{"key":"2026040113103210200_ref167","first-page":"1270","article-title":"Scene graph generation from objects, phrases and region captions","author":"Li","year":"2017","journal-title":"Proceedings of the IEEE International Conference on Computer Vision"},{"key":"2026040113103210200_ref168","first-page":"5831","article-title":"Neural motifs: Scene graph parsing with global context","author":"Zellers","year":"2018","journal-title":"Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref169","first-page":"7211","article-title":"Mapping images to scene graphs with permutation-invariant structured prediction","author":"Herzig","year":"2018","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2026040113103210200_ref170","first-page":"1912","article-title":"Image generation from scene graphs","author":"Johnson","year":"2018","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref171","first-page":"4561","article-title":"Specifying object attributes and relations in interactive scene generation","author":"Ashual","year":"2019","journal-title":"Proceedings of the IEEE International Conference on Computer Vision"},{"key":"2026040113103210200_ref172","first-page":"2580","article-title":"Scene graph prediction with limited labels","author":"Chen","year":"2019","journal-title":"Proceedings of the IEEE International Conference on Computer Vision"},{"key":"2026040113103210200_ref173","first-page":"1969","article-title":"Scene graph generation with external knowledge and image reconstruction","author":"Gu","year":"2019","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"issue":"9","key":"2026040113103210200_ref174","doi-asserted-by":"crossref","first-page":"2251","DOI":"10.1109\/TPAMI.2018.2857768","article-title":"Zero-shot learning\u2014A comprehensive evaluation of the good, the bad and the ugly","volume":"41","author":"Xian","year":"2018","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"issue":"3","key":"2026040113103210200_ref175","doi-asserted-by":"crossref","first-page":"453","DOI":"10.1109\/TPAMI.2013.140","article-title":"Attribute-based classification for zero-shot visual object categorization","volume":"36","author":"Lampert","year":"2013","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2026040113103210200_ref176","first-page":"1410","article-title":"Zero-shot learning with semantic output codes","author":"Palatucci","year":"2009","journal-title":"Advances in Neural Information Processing Systems 22\u2014Proceedings of the 2009 Conference"},{"key":"2026040113103210200_ref177","first-page":"4166","article-title":"Zero-shot learning via semantic similarity embedding","author":"Zhang","year":"2015","journal-title":"Proceedings of the IEEE International Conference on Computer Vision"},{"key":"2026040113103210200_ref178","first-page":"1043","article-title":"Zero-shot visual recognition using semantics-preserving adversarial embedding networks","author":"Chen","year":"2018","journal-title":"Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref179","first-page":"4281","article-title":"Generalized zero-shot learning via synthesized examples","author":"Kumar","year":"2018","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref180","first-page":"21","article-title":"Multi-modal cycle-consistent generalized zero-shot learning","author":"Felix","year":"2018","journal-title":"Proceedings of the European Conference on Computer Vision (ECCV)"},{"key":"2026040113103210200_ref181","first-page":"11 671","article-title":"Adaptive confidence smoothing for generalized zero-shot learning","author":"Atzmon","year":"2019","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref182","first-page":"1","article-title":"Generalised zero-shot learning with domain classification in a joint semantic and visual space","author":"Felix","year":"2019","journal-title":"2019 Digital Image Computing: Techniques and Applications (DICTA)"},{"issue":"2","key":"2026040113103210200_ref183","doi-asserted-by":"crossref","first-page":"965","DOI":"10.1109\/TIP.2018.2872916","article-title":"Zero-shot learning via category-specific visual-semantic mapping and label refinement","volume":"28","author":"Niu","year":"2019","journal-title":"IEEE Transactions on Image Processing"},{"key":"2026040113103210200_ref184","first-page":"70","article-title":"Leveraging seen and unseen semantic relationships for generative zero-shot learning","author":"Vyas","year":"2020","journal-title":"European Conference on Computer Vision"},{"issue":"11","key":"2026040113103210200_ref185","doi-asserted-by":"crossref","DOI":"10.1038\/nn.4656","article-title":"The cognitive map in humans: Spatial navigation and beyond","volume":"20","author":"Epstein","year":"2017","journal-title":"Nature Neuroscience"},{"issue":"2","key":"2026040113103210200_ref186","doi-asserted-by":"crossref","first-page":"253","DOI":"10.1162\/neco.1989.1.2.253","article-title":"A robot that walks; emergent behaviors from a carefully evolved network","volume":"1","author":"Brooks","year":"1989","journal-title":"Neural Computation"},{"key":"2026040113103210200_ref187","author":"Braitenberg","year":"1986","journal-title":"Vehicles: Experiments in Synthetic Psychology"},{"key":"2026040113103210200_ref188","first-page":"796","article-title":"Learning to coordinate behaviors","volume":"90","author":"Maes","year":"1990","journal-title":"AAAI"},{"key":"2026040113103210200_ref189","doi-asserted-by":"crossref","first-page":"479","DOI":"10.1007\/978-3-642-40686-7_32","article-title":"Efficient large-scale 3D mobile mapping and surface reconstruction of an underground mine","author":"Zlot","year":"2014","journal-title":"Field and Service Robotics"},{"key":"2026040113103210200_ref190","first-page":"176","article-title":"Visual localization within lidar maps for automated urban driving","author":"Wolcott","year":"2014","journal-title":"2014 IEEE\/RSJ International Conference on Intelligent Robots and Systems"},{"key":"2026040113103210200_ref191","doi-asserted-by":"crossref","first-page":"449","DOI":"10.1177\/0278364904042203","article-title":"Navigation and mapping in large unstructured environments","volume":"23","author":"Guivant","year":"2004","journal-title":"The International Journal of Robotics Research"},{"key":"2026040113103210200_ref192","first-page":"449","article-title":"LaneLoc: Lane marking based localization using highly accurate maps","author":"Schreiber","year":"2013","journal-title":"IEEE Intelligent Vehicles Symposium, Proceedings"},{"key":"2026040113103210200_ref193","article-title":"What localizes beneath: A metric multisensor localization and mapping system for autonomous underground mining vehicles","author":"Jacobson","year":"2020","journal-title":"Journal of Field Robotics"},{"key":"2026040113103210200_ref194","first-page":"407","article-title":"Discrete residual flow for probabilistic pedestrian behavior prediction","author":"Jain","year":"2020","journal-title":"Conference on Robot Learning"},{"key":"2026040113103210200_ref195","doi-asserted-by":"crossref","first-page":"9697","DOI":"10.1109\/ICRA.2019.8794214","article-title":"Deepsignals: Predicting intent of drivers through visual signals","author":"Frossard","year":"2019","journal-title":"2019 International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref196","first-page":"2911","article-title":"Dagmapper: Learning to map by discovering lane topology","author":"Homayounfar","year":"2019","journal-title":"Proceedings of the IEEE International Conference on Computer Vision"},{"key":"2026040113103210200_ref197","doi-asserted-by":"crossref","first-page":"1689","DOI":"10.1109\/ICRA40945.2020.9196885","article-title":"Kimera: An open-source library for real-time metric-semantic localization and mapping","author":"Rosinol","year":"2020","journal-title":"2020 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref198","first-page":"1352","article-title":"SLAM++: Simultaneous localisation and mapping at the level of objects","author":"Salas-Moreno","year":"2013","journal-title":"Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref199","first-page":"5079","article-title":"Meaningful maps with object-oriented semantic mapping","author":"Suenderhauf","year":"2017","journal-title":"IEEE International Conference on Intelligent Robots and Systems,"},{"issue":"3","key":"2026040113103210200_ref200","doi-asserted-by":"crossref","first-page":"1687","DOI":"10.1109\/LRA.2018.2801879","article-title":"X-view: Graph-based semantic multiview localization","volume":"3","author":"Gawel","year":"2018","journal-title":"IEEE Robotics and Automation Letters"},{"key":"2026040113103210200_ref201","article-title":"LoST? Appearance-invariant place recognition for opposite viewpoints using visual semantics","author":"Garg","year":"2018","journal-title":"Proceedings of Robotics: Science and Systems XIV"},{"key":"2026040113103210200_ref202","first-page":"3515","article-title":"Large-scale semantic mapping and reasoning with heterogeneous modalities","author":"Pronobis","year":"2012","journal-title":"Proceedings\u2014IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref203","article-title":"3D dynamic scene graphs: Actionable spatial perception with places, objects, and humans","author":"Rosinol","year":"2020","journal-title":"Robotics: Science and Systems"},{"key":"2026040113103210200_ref204","article-title":"Real-time simultaneous localisation and mapping with a single camera","author":"Davison","year":"2003","journal-title":"International Conference on Computer Vision"},{"issue":"6","key":"2026040113103210200_ref205","doi-asserted-by":"crossref","first-page":"1052","DOI":"10.1109\/TPAMI.2007.1049","article-title":"Monoslam: Real-time single camera SLAM","volume":"29","author":"Davison","year":"2007","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2026040113103210200_ref206","first-page":"83","article-title":"Parallel tracking and mapping on a camera phone","author":"Klein","year":"2009","journal-title":"2009 8th IEEE International Symposium on Mixed and Augmented Reality"},{"key":"2026040113103210200_ref207","article-title":"A constant-time efficient stereo SLAM system","author":"Mei","year":"2009","journal-title":"British Machine Vision Conference"},{"issue":"2","key":"2026040113103210200_ref208","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1016\/j.imavis.2012.02.009","article-title":"Visual SLAM: Why filter?","volume":"30","author":"Strasdat","year":"2012","journal-title":"Image and Vision Computing"},{"issue":"5","key":"2026040113103210200_ref209","doi-asserted-by":"crossref","first-page":"1147","DOI":"10.1109\/TRO.2015.2463671","article-title":"ORB-SLAM: A versatile and accurate monocular SLAM system","volume":"31","author":"Mur-Artal","year":"2015","journal-title":"IEEE Transactions on Robotics"},{"key":"2026040113103210200_ref210","doi-asserted-by":"crossref","first-page":"2320","DOI":"10.1109\/ICCV.2011.6126513","article-title":"DTAM: Dense tracking and mapping in real-time","author":"Newcombe","year":"2011","journal-title":"2011 International Conference on Computer Vision"},{"key":"2026040113103210200_ref211","first-page":"1449","article-title":"Semi-dense visual odometry for a monocular camera","author":"Engel","year":"2013","journal-title":"International Conference on Computer Vision"},{"key":"2026040113103210200_ref212","first-page":"15","article-title":"SVO: Fast semi-direct monocular visual odometry","author":"Forster","year":"2014","journal-title":"International Conference on Robotics and Automation"},{"issue":"3","key":"2026040113103210200_ref213","doi-asserted-by":"crossref","first-page":"611","DOI":"10.1109\/TPAMI.2017.2658577","article-title":"Direct sparse odometry","volume":"40","author":"Engel","year":"2018","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"issue":"2","key":"2026040113103210200_ref214","doi-asserted-by":"crossref","first-page":"128","DOI":"10.1364\/AOP.3.000128","article-title":"Structured-light 3d surface imaging: A tutorial","volume":"3","author":"Geng","year":"2011","journal-title":"Adv. Opt. Photon."},{"key":"2026040113103210200_ref215","doi-asserted-by":"publisher","first-page":"477","DOI":"10.1007\/978-3-642-28572-1_33","article-title":"RGB-D mapping: Using depth cameras for dense 3D modeling of indoor environments","volume":"79","author":"Henry","year":"","journal-title":"Experimental Robotics\u2014The 12th International Symposium on Experimental Robotics, ISER 2010,"},{"key":"2026040113103210200_ref216","article-title":"Kinectfusion: Real-time dense surface mapping and tracking","author":"Newcombe","year":"2011","journal-title":"International Symposium on Mixed and Augmented Reality"},{"key":"2026040113103210200_ref217","doi-asserted-by":"publisher","first-page":"1691","DOI":"10.1109\/ICRA.2012.6225199","article-title":"An evaluation of the RGB-D SLAM system","author":"Endres","year":"2012","journal-title":"IEEE International Conference on Robotics and Automation, ICRA 2012,"},{"issue":"11","key":"2026040113103210200_ref218","doi-asserted-by":"publisher","first-page":"1241","DOI":"10.1109\/TVCG.2015.2459891","article-title":"Very high frame rate volumetric integration of depth images on mobile devices","volume":"21","author":"K\u00e4hler","year":"2015","journal-title":"IEEE Trans. Vis. Comput. Graph."},{"key":"2026040113103210200_ref219","doi-asserted-by":"crossref","first-page":"500","DOI":"10.1007\/978-3-319-46484-8_30","article-title":"Real-time large-scale dense 3D reconstruction with loop closure","author":"K\u00e4hler","year":"2016","journal-title":"Computer Vision\u2014ECCV 2016\u201414th European Conference, Proceedings, Part VIII,"},{"key":"2026040113103210200_ref220","doi-asserted-by":"crossref","first-page":"824","DOI":"10.1109\/ROBOT.1985.1087348","article-title":"Visual map making for a mobile robot","volume":"2","author":"Brooks","year":"1985","journal-title":"Proceedings. 1985 IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref221","first-page":"979","article-title":"Topological mapping for mobile robots using a combination of sonar and vision sensing","volume":"94","author":"Kortenkamp","year":"1994","journal-title":"AAAI"},{"key":"2026040113103210200_ref222","first-page":"420","article-title":"An integrated navigation and motion control system for autonomous multisensory mobile robots","author":"Giralt","year":"1983","journal-title":"Autonomous Robot Vehicles"},{"issue":"3","key":"2026040113103210200_ref223","doi-asserted-by":"crossref","first-page":"271","DOI":"10.1016\/1049-9660(92)90045-5","article-title":"Fast vision-guided mobile robot navigation using model-based reasoning and prediction of uncertainties","volume":"56","author":"Kosaka","year":"1992","journal-title":"CVGIP: Image Understanding"},{"key":"2026040113103210200_ref224","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1109\/ICPR.1990.118059","article-title":"Experiments in autonomous navigation","volume":"1","author":"Fennema","year":"1990","journal-title":"[1990] Proceedings. 10th International Conference on Pattern Recognition"},{"issue":"2","key":"2026040113103210200_ref225","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1109\/70.928558","article-title":"Topological simultaneous localization and mapping (SLAM): Toward exact localization without explicit localization","volume":"17","author":"Choset","year":"2001","journal-title":"IEEE Transactions on Robotics and Automation"},{"key":"2026040113103210200_ref226","doi-asserted-by":"crossref","DOI":"10.1109\/CVPR.2004.1315088","article-title":"Visual odometry and map correlation","volume":"1","author":"Levin","year":"2004","journal-title":"Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004"},{"key":"2026040113103210200_ref227","doi-asserted-by":"crossref","first-page":"635","DOI":"10.1109\/ROBOT.2005.1570189","article-title":"SLAM-loop closing with visually salient features","author":"Newman","year":"2005","journal-title":"Proceedings of the 2005 IEEE International Conference on Robotics and Automation"},{"issue":"3","key":"2026040113103210200_ref228","doi-asserted-by":"crossref","first-page":"364","DOI":"10.1109\/TRO.2004.839228","article-title":"Vision-based global localization and mapping for mobile robots","volume":"21","author":"Se","year":"2005","journal-title":"IEEE Transactions on Robotics"},{"issue":"6","key":"2026040113103210200_ref229","doi-asserted-by":"crossref","first-page":"647","DOI":"10.1177\/0278364908090961","article-title":"Fab-map: Probabilistic localization and mapping in the space of appearance","volume":"27","author":"Cummins","year":"2008","journal-title":"The International Journal of Robotics Research"},{"key":"2026040113103210200_ref230","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.robot.2014.11.009","article-title":"Vision-based topological mapping and localization methods: A survey","volume":"64","author":"Garcia-Fidalgo","year":"2015","journal-title":"Robotics and Autonomous Systems"},{"issue":"1","key":"2026040113103210200_ref231","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/TRO.2015.2496823","article-title":"Visual place recognition: A survey","volume":"32","author":"Lowry","year":"2016","journal-title":"IEEE Transactions on Robotics"},{"issue":"1\u20132","key":"2026040113103210200_ref232","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1016\/0921-8890(91)90014-C","article-title":"A robot exploration and mapping strategy based on a semantic hierarchy of spatial representations","volume":"8","author":"Kuipers","year":"1991","journal-title":"Robotics and Autonomous Systems"},{"issue":"1","key":"2026040113103210200_ref233","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1016\/S0004-3702(97)00078-7","article-title":"Learning metric-topological maps for indoor mobile robot navigation","volume":"99","author":"Thrun","year":"1998","journal-title":"Artificial Intelligence"},{"key":"2026040113103210200_ref234","first-page":"1708","article-title":"A global topological map formed by local metric maps","volume":"3","author":"Simhon","year":"1998","journal-title":"Proceedings. 1998 IEEE\/RSJ International Conference on Intelligent Robots and Systems. Innovations in Theory, Practice and Applications (Cat. No. 98CH36190)"},{"key":"2026040113103210200_ref235","doi-asserted-by":"crossref","first-page":"1899","DOI":"10.1109\/ROBOT.2003.1241872","article-title":"An atlas framework for scalable mapping","volume":"2","author":"Bosse","year":"2003","journal-title":"2003 IEEE International Conference on Robotics and Automation (Cat. No. 03CH37422)"},{"issue":"1","key":"2026040113103210200_ref236","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1016\/S0921-8890(03)00006-X","article-title":"Hybrid simultaneous localization and map building: A natural integration of topological and metric","volume":"44","author":"Tomatis","year":"2003","journal-title":"Robotics and Autonomous Systems"},{"key":"2026040113103210200_ref237","first-page":"3738","article-title":"Closing loops without places","author":"Mei","year":"2010","journal-title":"2010 IEEE\/RSJ International Conference on Intelligent Robots and Systems"},{"issue":"2","key":"2026040113103210200_ref238","doi-asserted-by":"crossref","first-page":"432","DOI":"10.1109\/TASE.2014.2377791","article-title":"RoboEarth semantic mapping: A cloud enabled knowledge-based approach","volume":"12","author":"Riazuelo","year":"2015","journal-title":"IEEE Transactions on Automation Science and Engineering"},{"issue":"3","key":"2026040113103210200_ref239","first-page":"1785","article-title":"Scene flow propagation for semantic mapping and object discovery in dynamic street scenes","author":"Kochanov","year":"2016","journal-title":"IEEE International Conference on Intelligent Robots and Systems"},{"key":"2026040113103210200_ref240","doi-asserted-by":"crossref","DOI":"10.1109\/ICRA40945.2020.9197261","article-title":"A hierarchical framework for collaborative probabilistic semantic mapping","author":"Yue","year":"2020","journal-title":"2020 IEEE International Conference on Robotics and Automation (ICRA)"},{"issue":"1998","key":"2026040113103210200_ref241","first-page":"1306","article-title":"Semantic place classification of indoor environments with mobile robots using boosting","volume":"3","author":"Rottmann","year":"2005","journal-title":"Proceedings of the National Conference on Artificial Intelligence"},{"key":"2026040113103210200_ref242","doi-asserted-by":"crossref","first-page":"375","DOI":"10.1007\/978-3-319-08338-4_28","article-title":"Exploiting structural properties of buildings towards general semantic mapping systems","volume":"302","author":"Luperto","year":"2016","journal-title":"Advances in Intelligent Systems and Computing"},{"key":"2026040113103210200_ref243","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1007\/3-540-59119-2_166","article-title":"A desicion-theoretic generalization of on-line learning and an application to boosting","author":"Freund","year":"1995","journal-title":"European Conference on Computational Learning Theory"},{"key":"2026040113103210200_ref244","first-page":"1692","article-title":"Speeding-up multirobot exploration by considering semantic place information","volume":"2006","author":"Stachniss","year":"2006","journal-title":"Proceedings\u2014IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref245","first-page":"3999","article-title":"Learning semantic place labels from occupancy grids using CNNs","author":"Goeddel","year":"2016","journal-title":"IEEE International Conference on Intelligent Robots and Systems"},{"key":"2026040113103210200_ref246","first-page":"5729","article-title":"Place categorization and semantic mapping on a mobile robot","author":"Suenderhauf","year":"2016","journal-title":"Proceedings\u2014 IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref247","first-page":"2318","article-title":"Understand scene categories by objects: A semantic regularized scene classifier using convolutional neural networks","author":"Liao","year":"2016","journal-title":"Proceedings\u2014IEEE International Conference on Robotics and Automation"},{"issue":"5","key":"2026040113103210200_ref248","doi-asserted-by":"crossref","first-page":"1161","DOI":"10.1007\/s10514-016-9600-2","article-title":"Dynamic Bayesian network for semantic place classification in mobile robotics","volume":"41","author":"Premebida","year":"2017","journal-title":"Autonomous Robots"},{"key":"2026040113103210200_ref249","first-page":"4265","article-title":"Applying probabilistic mixture models to semantic place classification in mobile robotics","author":"Premebida","year":"2015","journal-title":"IEEE International Conference on Intelligent Robots and Systems"},{"issue":"3","key":"2026040113103210200_ref250","doi-asserted-by":"crossref","first-page":"1794","DOI":"10.1109\/LRA.2017.2705282","article-title":"Learning deep NBNN representations for robust place categorization","volume":"2","author":"Mancini","year":"2017","journal-title":"IEEE Robotics and Automation Letters"},{"issue":"3","key":"2026040113103210200_ref251","doi-asserted-by":"crossref","first-page":"2093","DOI":"10.1109\/LRA.2018.2809700","article-title":"Robust place categorization with deep domain generalization","volume":"3","author":"Mancini","year":"2018","journal-title":"IEEE Robotics and Automation Letters"},{"key":"2026040113103210200_ref252","first-page":"3511","article-title":"From pixels to buildings: End-to-end probabilistic deep networks for large-scale semantic mapping","author":"Zheng","year":"2019","journal-title":"2019 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)"},{"issue":"4","key":"2026040113103210200_ref253","doi-asserted-by":"crossref","first-page":"351","DOI":"10.1007\/s10514-012-9273-4","article-title":"PLISS: Labeling places using online change-point detection","volume":"32","author":"Ranganathan","year":"2012","journal-title":"Autonomous Robots"},{"key":"2026040113103210200_ref254","first-page":"3982","article-title":"Visual place categorization in maps","author":"Ranganathan","year":"2011","journal-title":"IEEE International Conference on Intelligent Robots and Systems"},{"issue":"4","key":"2026040113103210200_ref255","doi-asserted-by":"crossref","first-page":"468","DOI":"10.1177\/0278364911434936","article-title":"Histogram of oriented uniform patterns for robust place recognition and categorization","volume":"31","author":"Fazl-Ersi","year":"2012","journal-title":"International Journal of Robotics Research"},{"key":"2026040113103210200_ref256","article-title":"Model learning and real-time tracking using multi-resolution surfel maps","author":"St\u00fcckler","year":"2012","journal-title":"Twenty-Sixth AAAI Conference on Artificial Intelligence"},{"key":"2026040113103210200_ref257","first-page":"41","article-title":"3D semantic map-based shared control for smart wheelchair","author":"Wei","year":"2012","journal-title":"International Conference on Intelligent Robotics and Applications"},{"key":"2026040113103210200_ref258","first-page":"2228","article-title":"Building semantic object maps from sparse and noisy 3D data","author":"Gunther","year":"2013","journal-title":"IEEE International Conference on Intelligent Robots and Systems"},{"issue":"6","key":"2026040113103210200_ref259","doi-asserted-by":"crossref","first-page":"522","DOI":"10.1016\/j.robot.2008.03.005","article-title":"Bayesian space conceptualization and place classification for semantic maps in mobile robotics","volume":"56","author":"Vasudevan","year":"2008","journal-title":"Robotics and Autonomous Systems"},{"key":"2026040113103210200_ref260","first-page":"1288","article-title":"Dense reconstruction using 3D object shape priors","author":"Dame","year":"2013","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref261","first-page":"2295","article-title":"When 2.5D is not enough: Simultaneous reconstruction, segmentation and recognition on dense SLAM","author":"Tateno","year":"2016","journal-title":"Proceedings\u2014IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref262","first-page":"3567","article-title":"Visual-inertial-semantic scene representation for 3D object detection","author":"Dong","year":"2017","journal-title":"Proceedings\u201430th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017"},{"key":"2026040113103210200_ref263","first-page":"779","article-title":"You only look once: Unified, real-time object detection","author":"Redmon","year":"2016","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref264","first-page":"1766","article-title":"A conditional random field model for place and object classification","author":"Rogers","year":"2012","journal-title":"Proceedings\u2014IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref265","first-page":"4199","article-title":"Geometrically consistent plane extraction for dense indoor 3D maps segmentation","author":"Pham","year":"2016","journal-title":"IEEE International Conference on Intelligent Robots and Systems"},{"key":"2026040113103210200_ref266","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1109\/3DV.2018.00015","article-title":"Fusion++: Volumetric object-level SLAM","author":"McCormac","year":"2018","journal-title":"Proceedings\u2014 2018 International Conference on 3D Vision, 3DV 2018"},{"issue":"7","key":"2026040113103210200_ref267","doi-asserted-by":"crossref","first-page":"1757","DOI":"10.1109\/TPAMI.2012.256","article-title":"Toward open set recognition","volume":"35","author":"Scheirer","year":"2012","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"issue":"11","key":"2026040113103210200_ref268","doi-asserted-by":"crossref","first-page":"1686","DOI":"10.1109\/TPAMI.2005.224","article-title":"Open set face recognition using transduction","volume":"27","author":"Li","year":"2005","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2026040113103210200_ref269","doi-asserted-by":"crossref","first-page":"551","DOI":"10.1007\/978-0-85729-932-1_21","article-title":"Evaluation methods in face recognition","author":"Phillips","year":"2011","journal-title":"Handbook of Face Recognition"},{"issue":"11","key":"2026040113103210200_ref270","doi-asserted-by":"crossref","first-page":"2317","DOI":"10.1109\/TPAMI.2014.2321392","article-title":"Probability models for open set recognition","volume":"36","author":"Scheirer","year":"2014","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2026040113103210200_ref271","first-page":"1893","article-title":"Towards open world recognition","author":"Bendale","year":"2015","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref272","first-page":"1563","article-title":"Towards open set deep networks","author":"Bendale","year":"2016","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref273","first-page":"6835","article-title":"Incremental object database: Building 3D models from multiple partial observations","author":"Furrer","year":"2018","journal-title":"IEEE International Conference on Intelligent Robots and Systems"},{"issue":"3","key":"2026040113103210200_ref274","doi-asserted-by":"crossref","first-page":"3037","DOI":"10.1109\/LRA.2019.2923960","article-title":"Volumetric instance-aware semantic mapping and 3D object discovery","volume":"4","author":"Grinvald","year":"2019","journal-title":"IEEE Robotics and Automation Letters"},{"key":"2026040113103210200_ref275","first-page":"4465","article-title":"Real-time and scalable incremental segmentation on dense SLAM","author":"Tateno","year":"2015","journal-title":"IEEE International Conference on Intelligent Robots and Systems"},{"key":"2026040113103210200_ref276","first-page":"75","article-title":"Incremental dense semantic stereo fusion for large-scale semantic scene reconstruction","author":"Vineet","year":"2015","journal-title":"Proceedings\u2014IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref277","doi-asserted-by":"crossref","first-page":"3206","DOI":"10.1109\/ACCESS.2018.2887022","article-title":"Efficient object-oriented semantic mapping with object detector","volume":"7","author":"Nakajima","year":"2019","journal-title":"IEEE Access"},{"key":"2026040113103210200_ref278","doi-asserted-by":"crossref","first-page":"1089","DOI":"10.1109\/WACV.2019.00121","article-title":"Real-time progressive 3D semantic segmentation for indoor scenes","author":"Pham","year":"2019","journal-title":"Proceedings\u20142019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019"},{"key":"2026040113103210200_ref279","first-page":"4471","article-title":"Co-fusion: Real-time segmentation, tracking and fusion of multiple objects","author":"Runz","year":"2017","journal-title":"Proceedings\u2014IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref280","first-page":"10","article-title":"MaskFusion: Real-time recognition, tracking and reconstruction of multiple moving objects","author":"Runz","year":"2019","journal-title":"Proceedings of the 2018 IEEE International Symposium on Mixed and Augmented Reality, ISMAR 2018"},{"key":"2026040113103210200_ref281","doi-asserted-by":"crossref","DOI":"10.1109\/ICRA40945.2020.9197240","article-title":"A unified framework for piecewise semantic reconstruction in dynamic scenes via exploiting superpixel relations","author":"Di","year":"2020","journal-title":"2020 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref282","first-page":"526","article-title":"Consistent cuboid detection for semantic mapping","author":"Hashemifar","year":"2017","journal-title":"Proceedings\u2014IEEE 11th International Conference on Semantic Computing, ICSC 2017"},{"issue":"1","key":"2026040113103210200_ref283","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/LRA.2018.2866205","article-title":"QuadricSLAM: Dual quadrics from object detections as landmarks in object-oriented SLAM","volume":"4","author":"Nicholson","year":"2019","journal-title":"IEEE Robotics and Automation Letters"},{"key":"2026040113103210200_ref284","first-page":"410","article-title":"Structure aware SLAM using quadrics and planes","author":"Hosseinzadeh","year":"2018","journal-title":"Asian Conference on Computer Vision"},{"key":"2026040113103210200_ref285","doi-asserted-by":"crossref","first-page":"7123","DOI":"10.1109\/ICRA.2019.8793728","article-title":"Real-time monocular object-model aware sparse SLAM","author":"Hosseinzadeh","year":"2019","journal-title":"2019 International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref286","first-page":"3635","article-title":"Novelty detection and 3D shape retrieval using superquadrics and multiscale sampling for autonomous mobile robots","author":"Drews","year":"2010","journal-title":"Proceedings\u2014 IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref287","first-page":"920","article-title":"Validation of whole-body loco-manipulation affordances for pushability and liftability","author":"Kaiser","year":"2015","journal-title":"IEEE-RAS International Conference on Humanoid Robots"},{"key":"2026040113103210200_ref288","first-page":"453","article-title":"A scene graph based shared 3D world model for robotic applications","author":"Blumenthal","year":"2013","journal-title":"Proceedings\u2014IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref289","first-page":"398","article-title":"Categorizing object-action relations from semantic scene graphs","author":"Aksoy","year":"2010","journal-title":"Proceedings\u2014IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref290","first-page":"869","article-title":"Graph-based visual semantic perception for humanoid robots","author":"Grotz","year":"2017","journal-title":"IEEE-RAS International Conference on Humanoid Robots"},{"key":"2026040113103210200_ref291","first-page":"1","article-title":"3-D scene graph: A sparse and semantic representation of physical environments for intelligent agents","author":"Kim","year":"2019","journal-title":"IEEE Transactions on Cybernetics"},{"key":"2026040113103210200_ref292","first-page":"7462","article-title":"Semantic robot programming for goal-directed manipulation in cluttered scenes","author":"Zeng","year":"2018","journal-title":"Proceedings\u2014IEEE International Conference on Robotics and Automation"},{"issue":"8","key":"2026040113103210200_ref293","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1109\/MCG.1986.276770","article-title":"PHIGS: A standard, dynamic, interactive graphics interface","volume":"6","author":"Shuey","year":"1986","journal-title":"IEEE Computer Graphics and Applications"},{"key":"2026040113103210200_ref294","article-title":"Towards a domain specific language for a scene graph based robotic world model","author":"Blumenthal","journal-title":"Proceedings of the Fourth International Workshop on Domain-Specific Languages and Models for Robotic Systems (DSLRob)"},{"key":"2026040113103210200_ref295","doi-asserted-by":"crossref","first-page":"132","DOI":"10.1016\/j.robot.2018.12.009","article-title":"COSMO: Contextualized scene modeling with Boltzmann machines","volume":"113","author":"Bozcan","year":"2019","journal-title":"Robotics and Autonomous Systems"},{"key":"2026040113103210200_ref296","first-page":"10 870","article-title":"Spatio-temporal graph for video captioning with knowledge distillation","author":"Pan","year":"2020","journal-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref297","article-title":"Growing semantically meaningful models for visual SLAM","author":"Flint","year":"2010","journal-title":"Proc. IEEE Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref298","article-title":"A dynamic programming approach to reconstructing building interiors","author":"Flint","year":"2010","journal-title":"Proc. 11th European Conf. on Computer"},{"key":"2026040113103210200_ref299","article-title":"Manhattan scene understanding using monocular, stereo, and 3d features","author":"Flint","year":"2011","journal-title":"Proc. 13th IEEE Int. Conf. on Computer Vision"},{"key":"2026040113103210200_ref300","first-page":"703","article-title":"Joint semantic segmentation and 3D reconstruction from monocular video","author":"Kundu","year":"2014","journal-title":"European Conference on Computer Vision"},{"key":"2026040113103210200_ref301","first-page":"2631","article-title":"Dense 3D semantic mapping of indoor scenes from RGB-D images","author":"Hermans","year":"2014","journal-title":"Proceedings\u2014 IEEE International Conference on Robotics and Automation,"},{"issue":"4","key":"2026040113103210200_ref302","doi-asserted-by":"crossref","first-page":"599","DOI":"10.1007\/s11554-013-0379-5","article-title":"Dense real-time mapping of object-class semantics from RGB-D video","volume":"10","author":"St\u00fcckler","year":"2015","journal-title":"Journal of Real-Time Image Processing"},{"key":"2026040113103210200_ref303","article-title":"Semi-dense 3d semantic mapping from monocular slam","author":"Li","year":"2016","journal-title":"arXiv preprint arXiv:1611.04144"},{"key":"2026040113103210200_ref304","article-title":"ElasticFusion: Dense SLAM without a pose graph","volume":"11","author":"Whelan","year":"2015","journal-title":"Robotics: Science and Systems"},{"key":"2026040113103210200_ref305","article-title":"DA-RNN: Semantic mapping with data associated recurrent neural networks","volume":"13","author":"Xiang","year":"2017","journal-title":"Robotics: Science and Systems"},{"key":"2026040113103210200_ref306","first-page":"590","article-title":"Semantic 3D occupancy mapping through efficient high order CRFs","author":"Yang","year":"2017","journal-title":"IEEE International Conference on Intelligent Robots and Systems"},{"key":"2026040113103210200_ref307","first-page":"213","article-title":"FuseNet: Incorporating depth into semantic segmentation via fusion-based CNN architecture","author":"Hazirbas","year":"2016","journal-title":"Asian Conference on Computer Vision"},{"key":"2026040113103210200_ref308","first-page":"345","article-title":"Learning rich features from RGB-D images for object detection and segmentation","author":"Gupta","year":"2014","journal-title":"European Conference on Computer Vision"},{"key":"2026040113103210200_ref309","article-title":"Multi-view deep learning for consistent semantic mapping with RGB-D cameras","author":"Ma","year":"2017","journal-title":"IEEE International Conference on Intelligent Robots and Systems"},{"key":"2026040113103210200_ref310","first-page":"834","article-title":"LSD-SLAM: Large-scale direct monocular SLAM","author":"Engel","year":"2014","journal-title":"European Conference on Computer Vision"},{"key":"2026040113103210200_ref311","article-title":"Dense monocular reconstruction using surface normals","author":"Weerasekera","year":"2017","journal-title":"Proc. International Conference on Robotics and Automation"},{"issue":"11","key":"2026040113103210200_ref312","doi-asserted-by":"crossref","first-page":"915","DOI":"10.1016\/j.robot.2008.08.001","article-title":"Towards semantic maps for mobile robots","volume":"56","author":"N\u00fcchter","year":"2008","journal-title":"Robotics and Autonomous Systems"},{"key":"2026040113103210200_ref313","first-page":"1","article-title":"Using context to create semantic 3D models of indoor environments","author":"Xiong","year":"2010","journal-title":"British Machine Vision Conference, BMVC 2010\u2014Proceedings"},{"key":"2026040113103210200_ref314","doi-asserted-by":"crossref","first-page":"6891","DOI":"10.1109\/ICRA.2019.8793256","article-title":"Dense 3D visual mapping via semantic simplification","author":"Morreale","year":"2019","journal-title":"2019 International Conference on Robotics and Automation (ICRA)"},{"issue":"2","key":"2026040113103210200_ref315","doi-asserted-by":"crossref","first-page":"4","DOI":"10.1109\/MMUL.2012.24","article-title":"Microsoft kinect sensor and its effect","volume":"19","author":"Zhang","year":"2012","journal-title":"IEEE Multimedia"},{"key":"2026040113103210200_ref316","first-page":"4867","article-title":"Fast semantic segmentation of 3D point clouds using a dense CRF with learned parameters","author":"Wolf","year":"2015","journal-title":"Proceedings\u2014IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref317","first-page":"922","article-title":"Voxnet: A 3d convolutional neural network for real-time object recognition","author":"Maturana","year":"2015","journal-title":"2015 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)"},{"key":"2026040113103210200_ref318","article-title":"Learning where to classify in multi-view semantic segmentation","author":"Riemenschneider","year":"2014","journal-title":"European Conference on Computer Vision"},{"key":"2026040113103210200_ref319","first-page":"1534","article-title":"3D semantic parsing of large-scale indoor spaces","author":"Armeni","year":"2016","journal-title":"Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref320","article-title":"Capturing and aligning multiple 3-dimensional scenes","author":"Bell"},{"key":"2026040113103210200_ref321","article-title":"Unsupervised feature learning for classification of outdoor 3d scans","volume":"2","author":"De Deuge","year":"2013","journal-title":"Australasian Conference on Robitics and Automation"},{"key":"2026040113103210200_ref322","first-page":"1912","article-title":"3d shapenets: A deep representation for volumetric shapes","author":"Wu","year":"2015","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref323","first-page":"656","article-title":"Convolutional-recursive deep learning for 3d object classification","author":"Socher","year":"2012","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2026040113103210200_ref324","doi-asserted-by":"crossref","first-page":"4428","DOI":"10.1109\/ICRA.2012.6225239","article-title":"An occlusion-aware feature for range images","author":"Quadros","year":"2012","journal-title":"2012 IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref325","article-title":"Sliding shapes for 3d object detection in RGB-D images","volume":"2","author":"Song","year":"2014","journal-title":"European Conference on Computer Vision"},{"key":"2026040113103210200_ref326","doi-asserted-by":"crossref","first-page":"889","DOI":"10.1007\/978-3-319-08338-4_64","article-title":"3D object recognition using convolutional neural networks with transfer learning between input channels","author":"Alexandre","year":"2016","journal-title":"Intelligent Autonomous Systems 13"},{"issue":"4\u20135","key":"2026040113103210200_ref327","doi-asserted-by":"crossref","first-page":"705","DOI":"10.1177\/0278364914549607","article-title":"Deep learning for detecting robotic grasps","volume":"34","author":"Lenz","year":"2015","journal-title":"The International Journal of Robotics Research"},{"key":"2026040113103210200_ref328","article-title":"Vehicle detection from 3d lidar using fully convolutional network","author":"Li","year":"2016","journal-title":"Robotics: Science and Systems"},{"key":"2026040113103210200_ref329","first-page":"80","article-title":"Fast semantic segmentation of RGB-D scenes with gpu-accelerated deep neural networks","author":"H\u00f6ft","year":"2014","journal-title":"Joint German\/Austrian Conference on Artificial Intelligence (K\u00fcnstliche Intelligenz)"},{"issue":"7","key":"2026040113103210200_ref330","doi-asserted-by":"crossref","first-page":"1527","DOI":"10.1162\/neco.2006.18.7.1527","article-title":"A fast learning algorithm for deep belief nets","volume":"18","author":"Hinton","year":"2006","journal-title":"Neural Computation"},{"key":"2026040113103210200_ref331","first-page":"424","article-title":"3D u-net: Learning dense volumetric segmentation from sparse annotation","author":"\u00c7i\u00e7ek","year":"2016","journal-title":"International Conference on Medical Image Computing and Computer-Assisted Intervention"},{"issue":"11","key":"2026040113103210200_ref332","doi-asserted-by":"crossref","first-page":"2278","DOI":"10.1109\/5.726791","article-title":"Gradient-based learning applied to document recognition","volume":"86","author":"LeCun","year":"1998","journal-title":"Proceedings of the IEEE"},{"key":"2026040113103210200_ref333","first-page":"5648","article-title":"Volumetric and multi-view cnns for object classification on 3d data","author":"Qi","year":"2016","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref334","doi-asserted-by":"crossref","first-page":"537","DOI":"10.1109\/3DV.2017.00067","article-title":"Segcloud: Semantic segmentation of 3d point clouds","author":"Tchapmi","year":"2017","journal-title":"2017 International Conference on 3D Vision (3DV)"},{"key":"2026040113103210200_ref335","first-page":"1746","article-title":"Semantic scene completion from a single depth image","author":"Song","year":"2017","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref336","first-page":"6545","article-title":"Shape completion using 3D-encoder-predictor CNNs and shape synthesis","author":"Dai","year":"2017","journal-title":"Proceedings\u2014 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017"},{"key":"2026040113103210200_ref337","first-page":"4578","article-title":"ScanComplete: Large-scale scene completion and semantic segmentation for 3D scans","author":"Dai","year":"2018","journal-title":"Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref338","article-title":"OctNet: Learning deep 3D representations at high resolutions deep learning for 3D data shape classification","author":"Riegler","year":"2017","journal-title":"Cvpr"},{"issue":"2","key":"2026040113103210200_ref339","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1016\/0146-664X(82)90104-6","article-title":"Geometric modeling using octree encoding","volume":"19","author":"Meagher","year":"1982","journal-title":"Computer Graphics and Image Processing"},{"key":"2026040113103210200_ref340","doi-asserted-by":"publisher","first-page":"150.1","DOI":"10.5244\/C.29.150","article-title":"Sparse 3D convolutional neural networks","author":"Graham","journal-title":"Proceedings of the British Machine Vision Conference (BMVC)"},{"key":"2026040113103210200_ref341","doi-asserted-by":"crossref","first-page":"1355","DOI":"10.1109\/ICRA.2017.7989161","article-title":"Vote3deep: Fast object detection in 3d point clouds using efficient convolutional neural networks","author":"Engelcke","year":"2017","journal-title":"2017 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref342","first-page":"9224","article-title":"3d semantic segmentation with submanifold sparse convolutional networks","author":"Graham","year":"2018","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref343","article-title":"Spatially-sparse convolutional neural networks","author":"Graham","year":"2016","journal-title":"BMVA Symposium on Deep Learning for Computer Vision"},{"key":"2026040113103210200_ref344","article-title":"Voting for voting in online point cloud object detection","volume":"1","author":"Wang","year":"2015","journal-title":"Robotics: Science and Systems"},{"key":"2026040113103210200_ref345","first-page":"315","article-title":"Deep sparse rectifier neural networks","author":"Glorot","year":"2011","journal-title":"Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics"},{"key":"2026040113103210200_ref346","first-page":"4452","article-title":"Learning sparse high dimensional filters: Image filtering, dense crfs and bilateral neural networks","author":"Jampani","year":"2016","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref347","first-page":"307","article-title":"FPNN: Field probing neural networks for 3d data","author":"Li","year":"2016","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2026040113103210200_ref348","first-page":"945","article-title":"Multiview convolutional neural networks for 3d shape recognition","author":"Su","year":"2015","journal-title":"Proceedings of the IEEE International Conference on Computer Vision"},{"key":"2026040113103210200_ref349","first-page":"3887","article-title":"Tangent convolutions for dense prediction in 3d","author":"Tatarchenko","year":"2018","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref350","first-page":"863","article-title":"Escape from cells: Deep kdnetworks for the recognition of 3d point cloud models","author":"Klokov","year":"2017","journal-title":"Proceedings of the IEEE International Conference on Computer Vision"},{"key":"2026040113103210200_ref351","article-title":"PointNet++: Deep hierarchical feature learning on point sets in a metric space","author":"Qi","year":"2017","journal-title":"Advances in Neural Information Processing Systems."},{"key":"2026040113103210200_ref352","first-page":"716","article-title":"Exploring spatial context for 3D semantic segmentation of point clouds","author":"Engelmann","year":"2018","journal-title":"Proceedings\u20142017 IEEE International Conference on Computer Vision Workshops, ICCVW 2017"},{"key":"2026040113103210200_ref353","first-page":"984","article-title":"Pointwise convolutional neural networks","author":"Hua","year":"2018","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref354","first-page":"9621","article-title":"Pointconv: Deep convolutional networks on 3d point clouds","author":"Wu","year":"2019","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref355","first-page":"403","article-title":"3d recurrent neural networks with context fusion for point cloud semantic segmentation","author":"Ye","year":"2018","journal-title":"Proceedings of the European Conference on Computer Vision (ECCV)"},{"key":"2026040113103210200_ref356","first-page":"2626","article-title":"Recurrent slice networks for 3d segmentation of point clouds","author":"Huang","year":"2018","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref357","doi-asserted-by":"crossref","first-page":"746","DOI":"10.1145\/3240508.3240621","article-title":"Rgcnn: Regularized graph cnn for point cloud segmentation","author":"Te","year":"2018","journal-title":"Proceedings of the 26th ACM International Conference on Multimedia"},{"issue":"5","key":"2026040113103210200_ref358","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3326362","article-title":"Dynamic graph CNN for learning on point clouds","volume":"38","author":"Wang","year":"2019","journal-title":"ACM Transactions on Graphics (TOG)"},{"key":"2026040113103210200_ref359","first-page":"206","article-title":"Foldingnet: Point cloud auto-encoder via deep grid deformation","author":"Yang","year":"2018","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref360","doi-asserted-by":"crossref","DOI":"10.1016\/j.cviu.2020.102921","article-title":"Adversarial autoencoders for compact representations of 3D point clouds","volume":"193","author":"Zamorski","year":"2020","journal-title":"Computer Vision and Image Understanding"},{"key":"2026040113103210200_ref361","first-page":"3693","article-title":"Dynamic edge-conditioned filters in convolutional neural networks on graphs","author":"Simonovsky","year":"2017","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref362","first-page":"726","article-title":"3D graph neural networks for RGBD semantic segmentation","author":"Qi","year":"2017","journal-title":"Proceedings of the IEEE International Conference on Computer Vision"},{"key":"2026040113103210200_ref363","article-title":"3dcontextnet: Kd tree guided hierarchical learning of point clouds using local and global contextual cues","author":"Zeng","year":"2018","journal-title":"Proceedings of the European Conference on Computer Vision (ECCV)"},{"key":"2026040113103210200_ref364","first-page":"4558","article-title":"Large-scale point cloud semantic segmentation with superpoint graphs","author":"Landrieu","year":"2018","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"issue":"9","key":"2026040113103210200_ref365","doi-asserted-by":"crossref","first-page":"509","DOI":"10.1145\/361002.361007","article-title":"Multidimensional binary search trees used for associative searching","volume":"18","author":"Bentley","year":"1975","journal-title":"Communications of the ACM"},{"key":"2026040113103210200_ref366","first-page":"9397","article-title":"So-net: Self-organizing network for point cloud analysis","author":"Li","year":"2018","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"issue":"9","key":"2026040113103210200_ref367","doi-asserted-by":"crossref","first-page":"1464","DOI":"10.1109\/5.58325","article-title":"The self-organizing map","volume":"78","author":"Kohonen","year":"1990","journal-title":"Proceedings of the IEEE"},{"issue":"5","key":"2026040113103210200_ref368","doi-asserted-by":"crossref","first-page":"1255","DOI":"10.1109\/TRO.2017.2705103","article-title":"ORB-SLAM2: An open-source SLAM system for monocular, stereo, and RGB-D cameras","volume":"33","author":"Mur-Artal","year":"2017","journal-title":"IEEE Transactions on Robotics"},{"key":"2026040113103210200_ref369","first-page":"4104","article-title":"Structure-from-motion revisited","author":"Schonberger","year":"2016","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref370","first-page":"1558","article-title":"Deep non-rigid structure from motion","author":"Kong","year":"2019","journal-title":"Proceedings of the IEEE International Conference on Computer Vision"},{"key":"2026040113103210200_ref371","article-title":"SFM-net: Learning of structure and motion from video","author":"Vijayanarasimhan","year":"2017","journal-title":"arXiv preprint arXiv:1704.07804"},{"key":"2026040113103210200_ref372","first-page":"568","article-title":"Two-stream convolutional networks for action recognition in videos","author":"Simonyan","year":"2014","journal-title":"Advances in Neural Processing Systems"},{"key":"2026040113103210200_ref373","first-page":"1","article-title":"Learning realistic human actions from movies","author":"Laptev","year":"2008","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref374","first-page":"1","article-title":"Action MACH: A spatio-temporal maximum average correlation height filter for action recognition","author":"Rodriguez","year":"2008","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref375","article-title":"UCF101: A dataset of 101 human actions classes from videos in the wild","author":"Soomro","journal-title":"CoRR"},{"key":"2026040113103210200_ref376","first-page":"6299","article-title":"Quo vadis, action recognition? A new model and the kinetics dataset","author":"Carreira","year":"2017","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref377","first-page":"759","article-title":"Finding action tubes","author":"Gkioxari","year":"2015","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref378","first-page":"961","article-title":"ActivityNet: A large-scale video benchmark for human activity understanding","author":"Fabian Caba Heilbron","year":"2015","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref379","first-page":"720","article-title":"Scaling egocentric vision: The EPIC-kitchens dataset","author":"Damen","year":"2018","journal-title":"Proceedings of the European Conference on Computer Vision (ECCV)"},{"key":"2026040113103210200_ref380","first-page":"3169","article-title":"Action recognition by dense trajectories","author":"Wang","year":"2011","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref381","doi-asserted-by":"crossref","first-page":"3201","DOI":"10.1109\/CVPR.2011.5995646","article-title":"Actom sequence models for efficient action detection","author":"Gaidon","year":"2011","journal-title":"CVPR 2011"},{"issue":"3","key":"2026040113103210200_ref382","doi-asserted-by":"crossref","first-page":"219","DOI":"10.1007\/s11263-013-0677-1","article-title":"Activity representation with motion hierarchies","volume":"107","author":"Gaidon","year":"2014","journal-title":"International Journal of Computer Vision"},{"key":"2026040113103210200_ref383","article-title":"Rank pooling for action recognition","author":"Fernando","year":"2016","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2026040113103210200_ref384","first-page":"1187","article-title":"Learning end-to-end video classification with rank-pooling","author":"Fernando","year":"2016","journal-title":"Proceedings of the International Conference on Machine Learning"},{"key":"2026040113103210200_ref385","first-page":"1924","article-title":"Discriminative hierarchical rank pooling for activity recognition","author":"Fernando","year":"2016","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref386","first-page":"3034","article-title":"Dynamic image networks for action recognition","author":"Bilen","year":"2016","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref387","article-title":"Automatic annotation of everyday movements","author":"Ramanan","year":"2003","journal-title":"Advances in Neural Processing Systems"},{"key":"2026040113103210200_ref388","first-page":"915","article-title":"An approach to pose-based action recognition","author":"Wang","year":"2013","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref389","first-page":"5137","article-title":"2d\/3d pose estimation and action recognition using multitask deep learning","author":"Luvizon","year":"2018","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref390","article-title":"Semantic-level understanding of human actions and interactions using event hierarchy","author":"Park","year":"2004","journal-title":"IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops"},{"issue":"2","key":"2026040113103210200_ref391","doi-asserted-by":"crossref","first-page":"355","DOI":"10.1145\/2493432.2493511","article-title":"Towards zero-shot learning for human activity recognition using semantic attribute sequence model","author":"Cheng","year":"2013","journal-title":"UbiComp 2013\u2014Proceedings of the 2013 ACM International Joint Conference on Pervasive and Ubiquitous Computing"},{"key":"2026040113103210200_ref392","first-page":"361","article-title":"NuActiv: Recognizing unseen new activities using semantic attribute-based learning","author":"Cheng","year":"2013","journal-title":"MobiSys 2013\u2014Proceedings of the 11th Annual International Conference on Mobile Systems, Applications, and Services"},{"key":"2026040113103210200_ref393","first-page":"5043","article-title":"Automatic segmentation and recognition of human activities from observation based on semantic reasoning","author":"Ramirez-Amaro","year":"2014","journal-title":"IEEE International Conference on Intelligent Robots and Systems"},{"key":"2026040113103210200_ref394","first-page":"438","article-title":"Bootstrapping humanoid robot skills by extracting semantic representations of human-like activities from virtual reality","author":"Ramirez-Amaro","year":"2015","journal-title":"IEEE-RAS International Conference on Humanoid Robots"},{"key":"2026040113103210200_ref395","first-page":"1141","article-title":"Robust semantic representations for inferring human co-manipulation activities even with different demonstration styles","author":"Ramirez-Amaro","year":"2015","journal-title":"IEEE-RAS International Conference on Humanoid Robots"},{"key":"2026040113103210200_ref396","first-page":"456","article-title":"Enhancing human action recognition through spatio-temporal feature learning and semantic rules","author":"Ramirez-Amaro","year":"2015","journal-title":"IEEERAS International Conference on Humanoid Robots"},{"key":"2026040113103210200_ref397","doi-asserted-by":"publisher","first-page":"95","DOI":"10.1016\/j.artint.2015.08.009","article-title":"Transferring skills to humanoid robots by extracting semantic representations from observations of human activities","volume":"247","author":"Ramirez-Amaro","year":"2017","journal-title":"Artificial Intelligence"},{"issue":"1","key":"2026040113103210200_ref398","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/2939381","article-title":"Added value of gaze-exploiting semantic representation to allow robots inferring human behaviors","volume":"7","author":"Ramirez-Amaro","year":"2017","journal-title":"ACM Transactions on Interactive Intelligent Systems"},{"issue":"2","key":"2026040113103210200_ref399","doi-asserted-by":"publisher","first-page":"117","DOI":"10.1007\/s13218-019-00582-5","article-title":"A semantic-based method for teaching industrial robots new tasks","volume":"33","author":"Ramirez-Amaro","year":"2019","journal-title":"KI\u2014Kunstliche Intelligenz"},{"key":"2026040113103210200_ref400","first-page":"5060","article-title":"3D semantic trajectory reconstruction from 3D pixel continuum","author":"Yoon","year":"2018","journal-title":"Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref401","first-page":"4216","article-title":"Kinematic structure correspondences via hypergraph matching","author":"Chang","year":"2016","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"issue":"12","key":"2026040113103210200_ref402","doi-asserted-by":"crossref","first-page":"2920","DOI":"10.1109\/TPAMI.2017.2777486","article-title":"Learning kinematic structure correspondences using multi-order similarities","volume":"40","author":"Chang","year":"2018","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2026040113103210200_ref403","first-page":"250","article-title":"Computer vision for lifelogging: Characterizing everyday activities based on visual semantics","author":"Wang","year":"2018","journal-title":"Computer Vision For Assistive Healthcare"},{"key":"2026040113103210200_ref404","author":"Gibson","year":"1979","journal-title":"The Ecological Approach to Visual Perception: Classic Edition"},{"key":"2026040113103210200_ref405","doi-asserted-by":"crossref","first-page":"3140","DOI":"10.1109\/ROBOT.2003.1242073","article-title":"Learning about objects through action-initial steps towards artificial cognition","volume":"3","author":"Fitzpatrick","year":"2003","journal-title":"2003 IEEE International Conference on Robotics and Automation (Cat. No. 03CH37422)"},{"issue":"4","key":"2026040113103210200_ref406","doi-asserted-by":"crossref","first-page":"447","DOI":"10.1177\/1059712307084689","article-title":"To afford or not to afford: A new formalization of affordances toward affordance-based robot control","volume":"15","author":"\u015eahin","year":"2007","journal-title":"Adaptive Behavior"},{"key":"2026040113103210200_ref407","doi-asserted-by":"crossref","first-page":"729","DOI":"10.1109\/IROS.2007.4399469","article-title":"From primitive behaviors to goal-directed behavior using affordances","author":"Dogar","year":"2007","journal-title":"2007 IEEE\/RSJ International Conference on Intelligent Robots and Systems"},{"issue":"10","key":"2026040113103210200_ref408","doi-asserted-by":"crossref","first-page":"740","DOI":"10.1016\/j.robot.2011.05.009","article-title":"Object\u2013action complexes: Grounded abstractions of sensory\u2013motor processes","volume":"59","author":"Kr\u00fcger","year":"2011","journal-title":"Robotics and Autonomous Systems"},{"issue":"10","key":"2026040113103210200_ref409","doi-asserted-by":"crossref","first-page":"1229","DOI":"10.1177\/0278364911410459","article-title":"Learning the semantics of object-action relations by observation","volume":"30","author":"Aksoy","year":"2011","journal-title":"International Journal of Robotics Research"},{"key":"2026040113103210200_ref410","first-page":"4555","article-title":"Toward a library of manipulation actions based on semantic object-action relations","author":"Aein","year":"2013","journal-title":"IEEE International Conference on Intelligent Robots and Systems"},{"key":"2026040113103210200_ref411","doi-asserted-by":"crossref","first-page":"118","DOI":"10.1016\/j.robot.2014.11.003","article-title":"Model-free incremental learning of the semantics of manipulation actions","volume":"71","author":"Aksoy","year":"2015","journal-title":"Robotics and Autonomous Systems"},{"key":"2026040113103210200_ref412","doi-asserted-by":"crossref","DOI":"10.1609\/aaai.v29i1.9671","article-title":"Robot learning manipulation action plans by \u2018watching\u2019 unconstrained videos from the world wide web","author":"Yang","year":"2015","journal-title":"Twenty-Ninth AAAI Conference on Artificial Intelligence"},{"issue":"3","key":"2026040113103210200_ref413","doi-asserted-by":"crossref","first-page":"411","DOI":"10.1016\/j.robot.2011.07.015","article-title":"Templates for pre-grasp sliding interactions","volume":"60","author":"Kappler","year":"2012","journal-title":"Robotics and Autonomous Systems"},{"key":"2026040113103210200_ref414","first-page":"1343","article-title":"Afrob: The affordance network ontology for robots","author":"Varadarajan","year":"2012","journal-title":"2012 IEEE\/RSJ International Conference on Intelligent Robots and Systems"},{"issue":"1","key":"2026040113103210200_ref415","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1007\/s10514-014-9402-3","article-title":"Grasp quality measures: Review and performance","volume":"38","author":"Roa","year":"2015","journal-title":"Autonomous Robots"},{"key":"2026040113103210200_ref416","first-page":"348","article-title":"Robotic grasping and contact: A review","volume":"1","author":"Bicchi","year":"2000","journal-title":"Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No. 00CH37065)"},{"issue":"2","key":"2026040113103210200_ref417","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1109\/TRO.2013.2289018","article-title":"Data-driven grasp synthesis\u2014A survey","volume":"30","author":"Bohg","year":"2013","journal-title":"IEEE Transactions on Robotics"},{"issue":"1","key":"2026040113103210200_ref418","doi-asserted-by":"crossref","first-page":"1","DOI":"10.2478\/s13230-011-0012-x","article-title":"Learning grasp affordance densities","volume":"2","author":"Detry","year":"2011","journal-title":"Paladyn, Journal of Behavioral Robotics"},{"key":"2026040113103210200_ref419","doi-asserted-by":"crossref","first-page":"2287","DOI":"10.1109\/ROBOT.2010.5509126","article-title":"Refining grasp affordance models by experience","author":"Detry","year":"2010","journal-title":"2010 IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref420","article-title":"Multi-view self-supervised deep learning for 6D pose estimation in the Amazon picking challenge","author":"Zeng","year":"2017","journal-title":"ICRA"},{"key":"2026040113103210200_ref421","first-page":"5663","article-title":"Integrated grasp planning and visual object localization for a humanoid robot with five-fingered hands","author":"Morales","year":"2006","journal-title":"2006 IEEE\/ RSJ International Conference on Intelligent Robots and Systems"},{"key":"2026040113103210200_ref422","article-title":"Deep object pose estimation for semantic robotic grasping of household objects","author":"Tremblay","year":"2018","journal-title":"Conference on Robot Learning (CoRL)"},{"issue":"4","key":"2026040113103210200_ref423","doi-asserted-by":"crossref","first-page":"538","DOI":"10.1177\/0278364911436019","article-title":"Rigid 3D geometry matching for grasping of known objects in cluttered scenes","volume":"31","author":"Papazov","year":"2012","journal-title":"The International Journal of Robotics Research"},{"issue":"10","key":"2026040113103210200_ref424","doi-asserted-by":"crossref","first-page":"1284","DOI":"10.1177\/0278364911401765","article-title":"The moped framework: Object recognition and pose estimation for manipulation","volume":"30","author":"Collet","year":"2011","journal-title":"The International Journal of Robotics Research"},{"key":"2026040113103210200_ref425","first-page":"1311","article-title":"Semantic grasping: Planning robotic grasps functionally suitable for an object manipulation task","author":"Dang","year":"2012","journal-title":"IEEE International Conference on Intelligent Robots and Systems"},{"key":"2026040113103210200_ref426","first-page":"2963","article-title":"Transferring functional grasps through contact warping and local replanning","author":"Hillenbrand","year":"2012","journal-title":"2012 IEEE\/ RSJ International Conference on Intelligent Robots and Systems"},{"key":"2026040113103210200_ref427","doi-asserted-by":"crossref","first-page":"3791","DOI":"10.1109\/ICRA.2012.6224992","article-title":"Generalizing grasps across partly similar objects","author":"Detry","year":"2012","journal-title":"2012 IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref428","doi-asserted-by":"crossref","first-page":"601","DOI":"10.1109\/ICRA.2013.6630635","article-title":"Learning a dictionary of prototypical grasp-predicting parts from grasping experience","author":"Detry","year":"2013","journal-title":"2013 IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref429","first-page":"623","article-title":"Localizing handle-like grasp affordances in 3d point clouds","author":"Ten","year":"2016","journal-title":"Experimental Robotics"},{"issue":"2","key":"2026040113103210200_ref430","doi-asserted-by":"crossref","first-page":"798","DOI":"10.1109\/TASE.2015.2396014","article-title":"Learning to detect visual grasp affordance","volume":"13","author":"Song","year":"2015","journal-title":"IEEE Transactions on Automation Science and Engineering"},{"key":"2026040113103210200_ref431","article-title":"kPAM: Keypoint affordances for category-level robotic manipulation","author":"Manuelli","year":"2019","journal-title":"International Symposium on Robotics Research (ISRR)"},{"key":"2026040113103210200_ref432","article-title":"Closing the loop for robotic grasping: A real-time, generative grasp synthesis approach","author":"Morrison","year":"2018","journal-title":"Robotics: Science and Systems (RSS)"},{"key":"2026040113103210200_ref433","article-title":"Learning robust, real-time, reactive robotic grasping","author":"Morrison","year":"2019","journal-title":"The International Journal of Robotics Research"},{"key":"2026040113103210200_ref434","article-title":"Dex-net 2.0: Deep learning to plan robust grasps with synthetic point clouds and analytic grasp metrics","author":"Mahler","year":"2017","journal-title":"Robotics: Science and Systems (RSS)"},{"key":"2026040113103210200_ref435","doi-asserted-by":"crossref","DOI":"10.1126\/scirobotics.aau4984","article-title":"Learning ambidextrous robot grasping policies","author":"Mahler","year":"2019","journal-title":"Science Robotics"},{"key":"2026040113103210200_ref436","article-title":"Grasp pose detection in point clouds","author":"Ten","year":"2017","journal-title":"The International Journal of Robotics Research (IJRR)"},{"key":"2026040113103210200_ref437","doi-asserted-by":"crossref","DOI":"10.1126\/science.aat8414","article-title":"Trends and challenges in robot manipulation","volume":"364","author":"Billard","year":"2019","journal-title":"Science"},{"key":"2026040113103210200_ref438","article-title":"Cartman: The low-cost cartesian manipulator that won the amazon robotics challenge","author":"Morrison","year":"2018","journal-title":"International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref439","first-page":"1","article-title":"Robotic pick-andplace of novel objects in clutter with multi-affordance grasping and cross-domain image matching","author":"Zeng","year":"2018","journal-title":"2018 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref440","doi-asserted-by":"crossref","first-page":"3347","DOI":"10.1109\/ICRA.2018.8461195","article-title":"Fast object learning and dual-arm coordination for cluttered stowing, picking, and packing","author":"Schwarz","year":"2018","journal-title":"2018 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref441","doi-asserted-by":"crossref","first-page":"2038","DOI":"10.1109\/ICRA.2016.7487351","article-title":"Object discovery and grasp detection with a shared convolutional neural network","author":"Guo","year":"2016","journal-title":"2016 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref442","first-page":"119","article-title":"End-to-end learning of semantic grasping","author":"Jang","year":"2017","journal-title":"Conference on Robot Learning"},{"issue":"13","key":"2026040113103210200_ref443","doi-asserted-by":"crossref","first-page":"4508","DOI":"10.1523\/JNEUROSCI.5451-11.2012","article-title":"Cortical dynamics of sensorimotor integration during grasp planning","volume":"32","author":"Verhagen","year":"2012","journal-title":"Journal of Neuroscience"},{"key":"2026040113103210200_ref444","doi-asserted-by":"crossref","first-page":"1374","DOI":"10.1109\/ICRA.2015.7139369","article-title":"Affordance detection of tool parts from geometric features","author":"Myers","year":"2015","journal-title":"2015 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref445","first-page":"2765","article-title":"Detecting object affordances with convolutional neural networks","author":"Nguyen","year":"2016","journal-title":"2016 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)"},{"key":"2026040113103210200_ref446","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1109\/HUMANOIDS.2017.8239542","article-title":"Affordance detection for task-specific grasping using deep learning","author":"Kokic","year":"2017","journal-title":"2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids)"},{"key":"2026040113103210200_ref447","doi-asserted-by":"crossref","first-page":"1944","DOI":"10.1109\/ICRA.2011.5979666","article-title":"Multivariate discretization for Bayesian network structure learning in robot grasping","author":"Song","year":"2011","journal-title":"2011 IEEE International Conference on Robotics and Automation"},{"issue":"3","key":"2026040113103210200_ref448","doi-asserted-by":"crossref","first-page":"546","DOI":"10.1109\/TRO.2015.2409912","article-title":"Task-based robot grasp planning using probabilistic inference","volume":"31","author":"Song","year":"2015","journal-title":"IEEE Transactions on Robotics"},{"key":"2026040113103210200_ref449","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1007\/978-3-319-20904-3_20","article-title":"Learning human priors for task-constrained grasping","author":"Hjelm","year":"2015","journal-title":"International Conference on Computer Vision Systems"},{"key":"2026040113103210200_ref450","first-page":"5","article-title":"Towards robust grasps: Using the environment semantics for robotic object affordances","author":"Ard\u00f3n","year":"2018","journal-title":"Proceedings of the AAAI Fall Symposium on Reasoning and Learning in Real-World Systems for Long-Term Autonomy"},{"key":"2026040113103210200_ref451","first-page":"3266","article-title":"Task-oriented grasping with semantic and geometric scene understanding","author":"Detry","year":"2017","journal-title":"IEEE International Conference on Intelligent Robots and Systems"},{"key":"2026040113103210200_ref452","first-page":"4263","article-title":"Autonomous semantic mapping for robots performing everyday manipulation tasks in kitchen environments","author":"Blodow","year":"2011","journal-title":"2011 IEEE\/RSJ International Conference on Intelligent Robots and Systems"},{"issue":"1","key":"2026040113103210200_ref453","doi-asserted-by":"crossref","first-page":"254","DOI":"10.1515\/pjbr-2018-0020","article-title":"Context-aware robot navigation using interactively built semantic maps","volume":"9","author":"Cosgun","year":"2018","journal-title":"Paladyn, Journal of Behavioral Robotics"},{"issue":"10","key":"2026040113103210200_ref454","doi-asserted-by":"crossref","first-page":"1131","DOI":"10.1016\/j.robot.2012.12.007","article-title":"Inferring robot goals from violations of semantic knowledge","volume":"61","author":"Galindo","year":"2013","journal-title":"Robotics and Autonomous Systems"},{"issue":"3","key":"2026040113103210200_ref455","doi-asserted-by":"crossref","first-page":"595","DOI":"10.1109\/TCDS.2017.2717041","article-title":"What can i do with this tool? self-supervised learning of tool affordances from their 3-d geometry","volume":"10","author":"Mar","year":"2017","journal-title":"IEEE Transactions on Cognitive and Developmental Systems"},{"key":"2026040113103210200_ref456","doi-asserted-by":"crossref","first-page":"202","DOI":"10.1177\/0278364919872545","article-title":"Learning task-oriented grasping for tool manipulation from simulated self-supervision","volume":"39","author":"Fang","year":"2020","journal-title":"The International Journal of Robotics Research"},{"key":"2026040113103210200_ref457","article-title":"Asking for help using inverse semantics","author":"Tellex","year":"2014","journal-title":"Robotics: Science and Systems"},{"key":"2026040113103210200_ref458","doi-asserted-by":"crossref","DOI":"10.1609\/aaai.v25i1.7979","article-title":"Understanding natural language commands for robotic navigation and mobile manipulation","author":"Tellex","year":"2011","journal-title":"Twenty-Fifth AAAI Conference on Artificial Intelligence"},{"key":"2026040113103210200_ref459","first-page":"4451","article-title":"Temporal spatial inverse semantics for robots communicating with humans","author":"Gong","year":"2018","journal-title":"Proceedings\u2014IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref460","doi-asserted-by":"crossref","first-page":"857","DOI":"10.65109\/GYIY5789","article-title":"Robot program construction via grounded natural language semantics and simulation robotics track","volume":"2","author":"Pomarlan","year":"2018","journal-title":"Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS"},{"key":"2026040113103210200_ref461","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1016\/j.robot.2019.01.007","article-title":"Semantic reasoning in service robots using expert systems","volume":"114","author":"Savage","year":"2019","journal-title":"Robotics and Autonomous Systems"},{"key":"2026040113103210200_ref462","article-title":"Incremental semantically grounded learning from demonstration","volume":"9","author":"Niekum","year":"2013","journal-title":"Robotics: Science and Systems"},{"key":"2026040113103210200_ref463","article-title":"Unsupervised perceptual rewards for imitation learning","author":"Sermanet","year":"2017","journal-title":"Robotics: Science and Systems"},{"key":"2026040113103210200_ref464","first-page":"189","article-title":"Efficient model learning from joint-action demonstrations for human\u2013 robot collaborative tasks","author":"Nikolaidis","year":"2015","journal-title":"2015 10th ACM\/IEEE International Conference on Human\u2013Robot Interaction (HRI)"},{"key":"2026040113103210200_ref465","first-page":"2623","article-title":"Learning spatial-semantic representations from natural language descriptions and scene classifications","author":"Hemachandra","year":"2014","journal-title":"Proceedings\u2014IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref466","article-title":"Visual semantic navigation using scene priors","author":"Yang","year":"2019","journal-title":"ICLR"},{"key":"2026040113103210200_ref467","doi-asserted-by":"crossref","first-page":"1505","DOI":"10.1109\/IROS.2004.1389609","article-title":"Exploration with active loop-closing for fastSLAM","volume":"2","author":"Stachniss","year":"2004","journal-title":"2004 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No. 04CH37566)"},{"key":"2026040113103210200_ref468","first-page":"287","article-title":"An application of kullback-leibler divergence to active SLAM and exploration with particle filters","author":"Carlone","year":"2010","journal-title":"2010 IEEE\/RSJ International Conference on Intelligent Robots and Systems"},{"issue":"2","key":"2026040113103210200_ref469","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1007\/s10846-013-9981-9","article-title":"Active SLAM and exploration with particle filters using kullback-leibler divergence","volume":"75","author":"Carlone","year":"2014","journal-title":"Journal of Intelligent and Robotic Systems"},{"key":"2026040113103210200_ref470","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1016\/j.artint.2016.07.002","article-title":"Artificial cognition for social human\u2013robot interaction: An implementation","volume":"247","author":"Lemaignan","year":"2017","journal-title":"Artificial Intelligence"},{"issue":"5","key":"2026040113103210200_ref471","doi-asserted-by":"crossref","first-page":"874","DOI":"10.1109\/TRO.2007.904911","article-title":"A human aware mobile robot motion planner","volume":"23","author":"Sisbot","year":"2007","journal-title":"IEEE Transactions on Robotics"},{"issue":"6","key":"2026040113103210200_ref472","doi-asserted-by":"crossref","first-page":"1419","DOI":"10.1109\/TRO.2015.2492862","article-title":"Simulation-based behavior planning to prevent congestion of pedestrians around a robot","volume":"31","author":"Kidokoro","year":"2015","journal-title":"IEEE Transactions on Robotics"},{"key":"2026040113103210200_ref473","doi-asserted-by":"crossref","first-page":"178","DOI":"10.1109\/ICRA.2011.5980476","article-title":"Lingodroids: Studies in spatial cognition and language","author":"Schulz","year":"2011","journal-title":"2011 IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref474","article-title":"Computational modelling of embodied visual perspective-taking","author":"Fischer","year":"2019","journal-title":"IEEE Transactions on Cognitive and Developmental Systems"},{"key":"2026040113103210200_ref475","first-page":"3309","article-title":"Markerless perspective taking for humanoid robots in unconstrained environments","author":"Fischer","year":"2016","journal-title":"Proceedings of the IEEE International Conference on Robotics and Automation"},{"issue":"5","key":"2026040113103210200_ref476","doi-asserted-by":"crossref","first-page":"385","DOI":"10.1016\/j.robot.2006.02.004","article-title":"Using perspective taking to learn from ambiguous demonstrations","volume":"54","author":"Breazeal","year":"2006","journal-title":"Robotics and Autonomous Systems"},{"key":"2026040113103210200_ref477","first-page":"3674","article-title":"Vision-andlanguage navigation: Interpreting visually-grounded navigation instructions in real environments","author":"Anderson","year":"2018","journal-title":"CVPR"},{"key":"2026040113103210200_ref478","first-page":"3318","article-title":"Speaker-follower models for vision-and-language navigation","author":"Fried","year":"2018","journal-title":"Neur-IPS"},{"key":"2026040113103210200_ref479","first-page":"2610","article-title":"Learning to navigate unseen environments: Back translation with environmental dropout","author":"Tan","year":"2019","journal-title":"NAACL"},{"key":"2026040113103210200_ref480","article-title":"Self-monitoring navigation agent via auxiliary progress estimation","author":"Ma","year":"2019","journal-title":"ICLR"},{"key":"2026040113103210200_ref481","first-page":"6732","article-title":"The regretful agent: Heuristic-aided navigation through progress estimation","author":"Ma","year":"2019","journal-title":"CVPR"},{"key":"2026040113103210200_ref482","first-page":"6741","article-title":"Tactical rewind: Self-correction via backtracking in vision-and-language navigation","author":"Ke","year":"2019","journal-title":"CVPR"},{"key":"2026040113103210200_ref483","article-title":"Reverie: Remote embodied visual referring expression in real indoor environments","author":"Qi","year":"2020","journal-title":"CVPR"},{"issue":"6","key":"2026040113103210200_ref484","doi-asserted-by":"crossref","first-page":"824","DOI":"10.1016\/j.imavis.2008.07.010","article-title":"Simultaneous place and object recognition using collaborative context information","volume":"27","author":"Kim","year":"2009","journal-title":"Image and Vision Computing"},{"key":"2026040113103210200_ref485","first-page":"2792","article-title":"Simultaneous place and object recognition with mobile robot using pose encoded contextual information","author":"Luo","year":"2011","journal-title":"Proceedings\u2014IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref486","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1016\/S0079-6123(06)55002-2","article-title":"Building the gist of a scene: The role of global image features in recognition","volume":"155","author":"Oliva","year":"2006","journal-title":"Progress in Brain Research"},{"issue":"1\u20133","key":"2026040113103210200_ref487","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1177\/0278364915596589","article-title":"Localization from semantic observations via the matrix permanent","volume":"35","author":"Atanasov","year":"2016","journal-title":"International Journal of Robotics Research"},{"issue":"9","key":"2026040113103210200_ref488","doi-asserted-by":"crossref","first-page":"1627","DOI":"10.1109\/TPAMI.2009.167","article-title":"Object detection with discriminatively trained part-based models","volume":"32","author":"Felzenszwalb","year":"2009","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2026040113103210200_ref489","first-page":"1","article-title":"Semantic signatures for urban visual localization","author":"Weng","year":"2018","journal-title":"Proceedings\u2014International Workshop on Content-Based Multimedia Indexing"},{"key":"2026040113103210200_ref490","doi-asserted-by":"crossref","first-page":"21 963","DOI":"10.1109\/ACCESS.2019.2899049","article-title":"A coarse to fine indoor visual localization method using environmental semantic information","volume":"7","author":"Zhang","year":"2019","journal-title":"IEEE Access"},{"issue":"4","key":"2026040113103210200_ref491","doi-asserted-by":"crossref","first-page":"3669","DOI":"10.1109\/LRA.2018.2856274","article-title":"Learning of Holism-Landmark graph embedding for place recognition in long-term autonomy","volume":"3","author":"Han","year":"2018","journal-title":"IEEE Robotics and Automation Letters"},{"key":"2026040113103210200_ref492","first-page":"602","article-title":"GIS-assisted object detection and geospatial localization","author":"Ardeshir","year":"2014","journal-title":"European Conference on Computer Vision"},{"key":"2026040113103210200_ref493","first-page":"9","article-title":"Semantic cross-view matching","author":"Castaldo","year":"2015","journal-title":"Proceedings of the IEEE International Conference on Computer Vision Workshops"},{"key":"2026040113103210200_ref494","first-page":"1","article-title":"Visual map matching and localization using a global feature map","author":"Pink","year":"2008","journal-title":"2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops"},{"issue":"1","key":"2026040113103210200_ref495","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1115\/1.3662552","article-title":"A new approach to linear filtering and prediction problems","volume":"82","author":"Kalman","year":"1960","journal-title":"Journal of Basic Engineering"},{"key":"2026040113103210200_ref496","first-page":"3425","article-title":"Learning to align semantic segmentation and 2.5 d maps for geolocalization","author":"Armagan","year":"2017","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref497","article-title":"Semantic image based geolocation given a map","author":"Mousavian"},{"key":"2026040113103210200_ref498","first-page":"1","article-title":"Development of positioning technique using omni-directional IR camera and aerial survey data","author":"Meguro","year":"2007","journal-title":"2007 IEEE\/ASME International Conference on Advanced Intelligent Mechatronics"},{"key":"2026040113103210200_ref499","doi-asserted-by":"crossref","first-page":"409","DOI":"10.1109\/ROBOT.2009.5152262","article-title":"Dynamic programming and skyline extraction in catadioptric infrared images","author":"Bazin","year":"2009","journal-title":"2009 IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref500","article-title":"Sky segmentation with ultraviolet images can be used for navigation","author":"Stone","year":"2014","journal-title":"Robotics: Science and Systems"},{"key":"2026040113103210200_ref501","doi-asserted-by":"crossref","first-page":"5615","DOI":"10.1109\/ICRA.2016.7487780","article-title":"Skyline-based localisation for aggressively manoeuvring robots using uv sensors and spherical harmonics","author":"Stone","year":"2016","journal-title":"2016 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref502","first-page":"3816","article-title":"Skyline2gps: Localization in urban canyons using omni-skylines","author":"Ramalingam","year":"2010","journal-title":"2010 IEEE\/RSJ International Conference on Intelligent Robots and Systems"},{"issue":"3","key":"2026040113103210200_ref503","doi-asserted-by":"crossref","first-page":"213","DOI":"10.1007\/s11263-015-0830-0","article-title":"Image based geo-localization in the ALPs","volume":"116","author":"Saurer","year":"2016","journal-title":"International Journal of Computer Vision"},{"issue":"9","key":"2026040113103210200_ref504","doi-asserted-by":"crossref","first-page":"1057","DOI":"10.1177\/0278364915618766","article-title":"Routed roads: Probabilistic vision-based place recognition for changing conditions, split streets and varied viewpoints","volume":"35","author":"Pepperell","year":"2016","journal-title":"The International Journal of Robotics Research"},{"key":"2026040113103210200_ref505","first-page":"1","article-title":"Geolocating static cameras","author":"Jacobs","year":"2007","journal-title":"2007 IEEE 11th International Conference on Computer Vision"},{"key":"2026040113103210200_ref506","first-page":"3196","article-title":"VLASE: Vehicle localization by aggregating semantic edges","author":"Yu","year":"2018","journal-title":"IEEE International Conference on Intelligent Robots and Systems"},{"issue":"9","key":"2026040113103210200_ref507","doi-asserted-by":"crossref","first-page":"1704","DOI":"10.1109\/TPAMI.2011.235","article-title":"Aggregating local image descriptors into compact codes","volume":"34","author":"Jegou","year":"2011","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2026040113103210200_ref508","first-page":"5964","article-title":"Casenet: Deep category-aware semantic edge detection","author":"Yu","year":"2017","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref509","doi-asserted-by":"crossref","first-page":"101","DOI":"10.1007\/978-3-319-25781-5_6","article-title":"Semantically guided geo-location and modeling in urban environments","author":"Singh","year":"2016","journal-title":"Large-Scale Visual Geo-Localization"},{"key":"2026040113103210200_ref510","doi-asserted-by":"publisher","first-page":"10","DOI":"10.1109\/ICCV.2003.1238308","article-title":"Learning a classification model for segmentation","volume":"1","author":"Ren","year":"2003","journal-title":"Proceedings Ninth IEEE International Conference on Computer Vision"},{"key":"2026040113103210200_ref511","first-page":"1","article-title":"Decomposing a scene into geometric and semantically consistent regions","author":"Gould","year":"2009","journal-title":"2009 IEEE 12th International Conference on Computer Vision"},{"key":"2026040113103210200_ref512","article-title":"Addressing challenging place recognition tasks using generative adversarial networks","author":"Latif","year":"2018","journal-title":"ICRA"},{"key":"2026040113103210200_ref513","article-title":"Adversarial training for adverse conditions: Robust metric localisation using appearance transfer","author":"Porav","year":"2018","journal-title":"ICRA"},{"key":"2026040113103210200_ref514","article-title":"Night-to-day image translation for retrieval-based localization","author":"Anoosheh","year":"2019","journal-title":"ICRA"},{"issue":"14","key":"2026040113103210200_ref515","doi-asserted-by":"crossref","first-page":"1645","DOI":"10.1177\/0278364913499193","article-title":"Experience-based navigation for long-term localisation","volume":"32","author":"Churchill","year":"2013","journal-title":"The International Journal of Robotics Research"},{"key":"2026040113103210200_ref516","article-title":"Scalable place recognition under appearance change for autonomous driving","author":"Doan","year":"2019","journal-title":"ICCV"},{"key":"2026040113103210200_ref517","article-title":"The gist of maps-summarizing experience for lifelong localization","author":"Dymczyk","year":"2015","journal-title":"ICRA"},{"issue":"6","key":"2026040113103210200_ref518","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1109\/MC.2010.170","article-title":"Google street view: Capturing the world at street level","volume":"43","author":"Anguelov","year":"2010","journal-title":"Computer"},{"key":"2026040113103210200_ref519","article-title":"A2d2: Audi autonomous driving dataset","author":"Geyer","year":"2020","journal-title":"arXiv preprint arXiv:2004.06320"},{"key":"2026040113103210200_ref520","article-title":"One thousand and one hours: Self-driving motion prediction dataset","author":"Houston","year":"2020","journal-title":"arXiv preprint arXiv:2006. 14480"},{"key":"2026040113103210200_ref521","first-page":"2626","article-title":"Mapillary street-level sequences: A dataset for lifelong place recognition","author":"Warburg","year":"2020","journal-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref522","first-page":"111","article-title":"The global network of outdoor webcams: Properties and applications","author":"Jacobs","year":"2009","journal-title":"Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems"},{"key":"2026040113103210200_ref523","doi-asserted-by":"crossref","DOI":"10.5204\/thesis.eprints.134410","article-title":"Robust visual place recognition under simultaneous variations in viewpoint and appearance","author":"Garg","year":"2019"},{"key":"2026040113103210200_ref524","first-page":"650","article-title":"Long-term 3D localization and pose from semantic labellings","author":"Toft","year":"2018","journal-title":"Proceedings\u20142017 IEEE International Conference on Computer Vision Workshops, ICCVW 2017"},{"key":"2026040113103210200_ref525","first-page":"6863","article-title":"Improving condition- and environment-invariant place recognition with semantic place categorization","author":"Garg","year":"2017","journal-title":"IEEE International Conference on Intelligent Robots and Systems"},{"key":"2026040113103210200_ref526","doi-asserted-by":"crossref","first-page":"1643","DOI":"10.1109\/ICRA.2012.6224623","article-title":"SeqSLAM: Visual route-based navigation for sunny summer days and stormy winter nights","author":"Milford","year":"2012","journal-title":"2012 IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref527","first-page":"2614","article-title":"Semantics-aware visual localization under challenging perceptual conditions","author":"Naseer","year":"2017","journal-title":"Proceedings\u2014IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref528","first-page":"891","article-title":"Cross-view image geolocalization","author":"Lin","year":"2013","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref529","article-title":"Semantic-geometric visual place recognition: A new perspective for reconciling opposing views","author":"Garg","year":"2019","journal-title":"International Journal of Robotics Research"},{"key":"2026040113103210200_ref530","first-page":"5297","article-title":"Netvlad: CNN architecture for weakly supervised place recognition","author":"Arandjelovic","year":"2016","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref531","article-title":"Visual semantic navigation based on deep learning for indoor mobile robots","volume":"2018","author":"Wang","year":"2018","journal-title":"Complexity"},{"key":"2026040113103210200_ref532","doi-asserted-by":"crossref","first-page":"641","DOI":"10.1016\/j.robot.2015.09.006","article-title":"Semantic localization in the PCL library","volume":"75","author":"Mart\u00ednez-G\u00f3mez","year":"2016","journal-title":"Robotics and Autonomous Systems"},{"key":"2026040113103210200_ref533","first-page":"391","article-title":"Edge boxes: Locating object proposals from edges","author":"Zitnick","year":"2014","journal-title":"European Conference on Computer Vision"},{"key":"2026040113103210200_ref534","article-title":"Place recognition with convnet landmarks: Viewpoint-robust, condition-robust, training-free","author":"S\u00fcnderhauf","year":"2015","journal-title":"Proceedings of Robotics: Science and Systems XII"},{"key":"2026040113103210200_ref535","first-page":"1","article-title":"A robust semi-semantic approach for visual localization in urban environment","author":"Cascianelli","year":"2016","journal-title":"IEEE 2nd International Smart Cities Conference: Improving the Citizens Quality of Life, ISC2 2016\u2014Proceedings"},{"key":"2026040113103210200_ref536","article-title":"Semantically-aware attentive neural embeddings for image-based visual localization","author":"Seymour","year":"2019","journal-title":"Proceedings of the British Machine Vision Conference (BMVC)"},{"key":"2026040113103210200_ref537","first-page":"4297","article-title":"On the performance of convnet features for place recognition","author":"S\u00fcnderhauf","year":"2015","journal-title":"2015 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)"},{"key":"2026040113103210200_ref538","doi-asserted-by":"crossref","first-page":"3645","DOI":"10.1109\/ICRA.2018.8461051","article-title":"Don\u2019t look back: Robustifying place categorization for viewpoint-and condition-invariant place recognition","author":"Garg","year":"2018","journal-title":"2018 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref539","doi-asserted-by":"crossref","first-page":"204","DOI":"10.1109\/ECMR.2013.6698843","article-title":"Distinctive 3D surface entropy features for place recognition","author":"Fiolka","year":"2013","journal-title":"2013 European Conference on Mobile Robots"},{"issue":"4\u20135","key":"2026040113103210200_ref540","doi-asserted-by":"crossref","first-page":"674","DOI":"10.1177\/0278364914548708","article-title":"Place recognition based on matching of planar surfaces and line segments","volume":"34","author":"Cupec","year":"2015","journal-title":"The International Journal of Robotics Research"},{"key":"2026040113103210200_ref541","doi-asserted-by":"crossref","first-page":"4830","DOI":"10.1109\/ICRA.2016.7487687","article-title":"Point cloud descriptors for place recognition using sparse visual information","author":"Cieslewski","year":"2016","journal-title":"2016 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref542","doi-asserted-by":"crossref","DOI":"10.1109\/ICRA.2019.8794178","article-title":"Look no deeper: Recognizing places from opposing viewpoints under varying scene appearance using single-view depth estimation","author":"Garg","year":"2019","journal-title":"IEEE International Conference on Robotics and Automation (ICRA), 2019"},{"issue":"2","key":"2026040113103210200_ref543","doi-asserted-by":"crossref","first-page":"1525","DOI":"10.1109\/LRA.2019.2895826","article-title":"Real-time wide-baseline place recognition using depth completion","volume":"4","author":"Maffra","year":"2019","journal-title":"IEEE Robotics and Automation Letters"},{"key":"2026040113103210200_ref544","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-030-58604-1_27","article-title":"Unsupervised monocular depth estimation for night-time images using adversarial domain feature adaptation","author":"Vankadari","year":"2020","journal-title":"16th European Conference on Computer Vision (ECCV), 2020"},{"key":"2026040113103210200_ref545","article-title":"LCD\u2013line clustering and description for place recognition","author":"Taubner","year":"2020","journal-title":"arXiv preprint arXiv:2010.10867"},{"key":"2026040113103210200_ref546","first-page":"1808","article-title":"24\/7 place recognition by view synthesis","author":"Torii","year":"2015","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref547","first-page":"31","article-title":"Fine-grained segmentation networks: Self-supervised segmentation for improved long-term visual localization","author":"Larsson","year":"2019","journal-title":"Proceedings of the IEEE International Conference on Computer Vision"},{"key":"2026040113103210200_ref548","first-page":"4990","article-title":"The mapillary vistas dataset for semantic understanding of street scenes","author":"Neuhold","year":"2017","journal-title":"Proceedings of the IEEE International Conference on Computer Vision"},{"issue":"4\u20135","key":"2026040113103210200_ref549","doi-asserted-by":"crossref","first-page":"405","DOI":"10.1177\/0278364918770733","article-title":"The limits and potentials of deep learning for robotics","volume":"37","author":"S\u00fcnderhauf","year":"2018","journal-title":"The International Journal of Robotics Research"},{"key":"2026040113103210200_ref550","doi-asserted-by":"crossref","first-page":"3471","DOI":"10.1109\/ICRA.2015.7139679","article-title":"3d convolutional neural networks for landing zone detection from lidar","author":"Maturana","year":"2015","journal-title":"2015 IEEE International Conference on Robotics and Automation (ICRA)"},{"issue":"4","key":"2026040113103210200_ref551","doi-asserted-by":"crossref","first-page":"3434","DOI":"10.1109\/LRA.2018.2852843","article-title":"Rt3d: Real-time 3-d vehicle detection in lidar point cloud for autonomous driving","volume":"3","author":"Zeng","year":"2018","journal-title":"IEEE Robotics and Automation Letters"},{"key":"2026040113103210200_ref552","first-page":"379","article-title":"R-fcn: Object detection via region-based fully convolutional networks","author":"Dai","year":"2016","journal-title":"Advances in Neural Information Processing Systems"},{"issue":"2","key":"2026040113103210200_ref553","doi-asserted-by":"crossref","first-page":"865","DOI":"10.1109\/LRA.2018.2792681","article-title":"Noise-resistant deep learning for object classification in three-dimensional point clouds using a point pair descriptor","volume":"3","author":"Bobkov","year":"2018","journal-title":"IEEE Robotics and Automation Letters"},{"key":"2026040113103210200_ref554","doi-asserted-by":"crossref","first-page":"474","DOI":"10.1109\/IM.2003.1240284","article-title":"Surflet-pair-relation histograms: A statistical 3D-shape representation for rapid classification","author":"Wahl","year":"2003","journal-title":"Fourth International Conference on 3-D Digital Imaging and Modeling, 2003. 3DIM 2003. Proceedings"},{"key":"2026040113103210200_ref555","doi-asserted-by":"crossref","first-page":"3212","DOI":"10.1109\/ROBOT.2009.5152473","article-title":"Fast point feature histograms (FPFH) for 3D registration","author":"Rusu","year":"2009","journal-title":"2009 IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref556","first-page":"998","article-title":"Model globally, match locally: Efficient and robust 3D object recognition","author":"Drost","year":"2010","journal-title":"2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref557","doi-asserted-by":"crossref","first-page":"2987","DOI":"10.1109\/ROBIO.2011.6181760","article-title":"Ensemble of shape functions for 3d object classification","author":"Wohlkinger","year":"2011","journal-title":"2011 IEEE International Conference on Robotics and Biomimetics"},{"key":"2026040113103210200_ref558","doi-asserted-by":"crossref","first-page":"527","DOI":"10.1109\/3DV.2015.65","article-title":"Point pair features based object detection and pose estimation revisited","author":"Birdal","year":"2015","journal-title":"2015 International Conference on 3D Vision"},{"key":"2026040113103210200_ref559","doi-asserted-by":"crossref","first-page":"209","DOI":"10.1109\/ISMAR.2013.6671781","article-title":"Robust monocular SLAM in dynamic environments","author":"Tan","year":"2013","journal-title":"2013 IEEE International Symposium on Mixed and Augmented Reality (ISMAR)"},{"key":"2026040113103210200_ref560","first-page":"4602","article-title":"SLAM with objects using a nonparametric pose graph","author":"Mu","year":"2016","journal-title":"2016 IEEE\/ RSJ International Conference on Intelligent Robots and Systems (IROS)"},{"issue":"4","key":"2026040113103210200_ref561","doi-asserted-by":"crossref","first-page":"5189","DOI":"10.1109\/LRA.2020.3005387","article-title":"Perspective-2-ellipsoid: Bridging the gap between object detections and 6-dof camera pose","volume":"5","author":"Gaudilli\u00e8re","year":"2020","journal-title":"IEEE Robotics and Automation Letters"},{"key":"2026040113103210200_ref562","article-title":"Towards self-supervised semantic representation with a viewpoint-dependent observation model","author":"Feldman","year":"2020","journal-title":"2020 Robotics: Science and Systems Workshop on Self-Supervised Robot Learning"},{"key":"2026040113103210200_ref563","doi-asserted-by":"crossref","first-page":"669","DOI":"10.1109\/ICRA.2019.8794344","article-title":"Robust object-based SLAM for high-speed autonomous navigation","author":"Ok","year":"2019","journal-title":"2019 International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref564","doi-asserted-by":"crossref","DOI":"10.3390\/s20185150","article-title":"RGB-D object SLAM using quadrics for indoor environments","volume":"20","author":"Liao","year":"2020","journal-title":"Sensors"},{"key":"2026040113103210200_ref565","first-page":"340","article-title":"Diagnosing error in object detectors","author":"Hoiem","year":"2012","journal-title":"European Conference on Computer Vision"},{"key":"2026040113103210200_ref566","doi-asserted-by":"crossref","first-page":"4038","DOI":"10.1109\/ICRA.2012.6224734","article-title":"What could move? finding cars, pedestrians and bicyclists in 3d laser data","author":"Wang","year":"2012","journal-title":"2012 IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref567","first-page":"8445","article-title":"Pseudo-lidar from visual depth estimation: Bridging the gap in 3d object detection for autonomous driving","author":"Wang","year":"2019","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref568","article-title":"Pseudo-lidar++: Accurate depth for 3D object detection in autonomous driving","author":"You","year":"2019","journal-title":"International Conference on Learning Representations"},{"key":"2026040113103210200_ref569","article-title":"Refinedmpl: Refined monocular pseudolidar for 3D object detection in autonomous driving","author":"Vianney","year":"2019","journal-title":"arXiv preprint arXiv:1911.09712"},{"key":"2026040113103210200_ref570","first-page":"5881","article-title":"End-to-end pseudo-lidar for image-based 3D object detection","author":"Qian","year":"2020","journal-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition"},{"issue":"3","key":"2026040113103210200_ref571","doi-asserted-by":"crossref","first-page":"448","DOI":"10.1162\/neco.1992.4.3.448","article-title":"A practical Bayesian framework for backpropagation networks","volume":"4","author":"MacKay","year":"1992","journal-title":"Neural Computation"},{"key":"2026040113103210200_ref572","first-page":"1050","article-title":"Dropout as a Bayesian approximation: Representing model uncertainty in deep learning","author":"Gal","year":"2016","journal-title":"International Conference on Machine Learning"},{"key":"2026040113103210200_ref573","first-page":"6402","article-title":"Simple and scalable predictive uncertainty estimation using deep ensembles","author":"Lakshminarayanan","year":"2017","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2026040113103210200_ref574","first-page":"13 153","article-title":"A simple baseline for Bayesian uncertainty in deep learning","author":"Maddox","year":"2019","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2026040113103210200_ref575","first-page":"1","article-title":"Dropout sampling for robust object detection in open-set conditions","author":"Miller","year":"2018","journal-title":"2018 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref576","first-page":"1031","article-title":"Probabilistic object detection: Definition and evaluation","author":"Hall","year":"2020","journal-title":"The IEEE Winter Conference on Applications of Computer Vision"},{"key":"2026040113103210200_ref577","article-title":"Inferring spatial uncertainty in object detection","author":"Wang","year":"2020","journal-title":"arXiv preprint arXiv:2003.03644"},{"key":"2026040113103210200_ref578","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1109\/ICRA40945.2020.9196544","article-title":"Bayesod: A Bayesian approach for uncertainty estimation in deep object detectors","author":"Harakeh","year":"2020","journal-title":"2020 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref579","first-page":"520","article-title":"Efficient uncertainty estimation for semantic segmentation in videos","author":"Huang","year":"2018","journal-title":"Proceedings of the European Conference on Computer Vision (ECCV)"},{"key":"2026040113103210200_ref580","article-title":"Uncertainty measures and prediction quality rating for the semantic segmentation of nested multi resolution street scene images","author":"Rottmann","year":"2019","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops"},{"key":"2026040113103210200_ref581","article-title":"Performance monitoring of object detection during deployment","author":"Rahman","year":"2020","journal-title":"arXiv preprint arXiv:2009.08650"},{"key":"2026040113103210200_ref582","article-title":"The Fishyscapes benchmark: Measuring blind spots in semantic segmentation","author":"Blum","year":"2019","journal-title":"ICCV Workshops"},{"key":"2026040113103210200_ref583","doi-asserted-by":"crossref","first-page":"298","DOI":"10.1177\/0278364909356483","article-title":"Multi-modal semantic place classification","volume":"29","author":"Pronobis","year":"2010","journal-title":"International Journal of Robotics Research"},{"key":"2026040113103210200_ref584","first-page":"1","article-title":"Hierarchical multi-modal place categorization","author":"Pronobis","year":"2011","journal-title":"Proceedings of the European Conference on Mobile Robots"},{"issue":"5","key":"2026040113103210200_ref585","doi-asserted-by":"crossref","first-page":"6695","DOI":"10.3390\/s120506695","article-title":"Categorization of indoor places using the Kinect sensor","volume":"12","author":"Mozos","year":"2012","journal-title":"Sensors (Switzerland)"},{"issue":"6","key":"2026040113103210200_ref586","doi-asserted-by":"crossref","first-page":"402","DOI":"10.1080\/01691864.2015.1120242","article-title":"Local N-ary patterns: A local multi-modal descriptor for place categorization","volume":"30","author":"Jung","year":"2016","journal-title":"Advanced Robotics"},{"key":"2026040113103210200_ref587","first-page":"404","article-title":"Gray scale and rotation invariant texture classification with local binary patterns","volume":"1842","author":"Ojala","year":"2000","journal-title":"European Conference on Computer Vision"},{"issue":"6","key":"2026040113103210200_ref588","doi-asserted-by":"crossref","first-page":"96","DOI":"10.1109\/MSP.2017.2738401","article-title":"Deep multimodal learning: A survey on recent advances and trends","volume":"34","author":"Ramachandram","year":"2017","journal-title":"IEEE Signal Processing Magazine"},{"key":"2026040113103210200_ref589","article-title":"Deep multi-modal object detection and semantic segmentation for autonomous driving: Datasets, methods, and challenges","author":"Feng","year":"2020","journal-title":"IEEE Transactions on Intelligent Transportation Systems"},{"key":"2026040113103210200_ref590","first-page":"424","article-title":"3d object proposals for accurate object class detection","author":"Chen","year":"2015","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2026040113103210200_ref591","first-page":"1907","article-title":"Multi-view 3d object detection network for autonomous driving","author":"Chen","year":"2017","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref592","first-page":"1","article-title":"Joint 3d proposal generation and object detection from view aggregation","author":"Ku","year":"2018","journal-title":"2018 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)"},{"key":"2026040113103210200_ref593","first-page":"4604","article-title":"Pointpainting: Sequential fusion for 3d object detection","author":"Vora","year":"2020","journal-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref594","first-page":"641","article-title":"Deep continuous fusion for multi-sensor 3d object detection","author":"Liang","year":"2018","journal-title":"Proceedings of the European Conference on Computer Vision (ECCV)"},{"key":"2026040113103210200_ref595","first-page":"1","article-title":"Fusing bird\u2019s eye view lidar point cloud and front view camera image for 3d object detection","author":"Wang","year":"2018","journal-title":"2018 IEEE Intelligent Vehicles Symposium (IV)"},{"key":"2026040113103210200_ref596","first-page":"918","article-title":"Frustum pointnets for 3d object detection from RGB-D data","author":"Qi","year":"2018","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref597","article-title":"IPOD: Intensive point-based object detector for point cloud","author":"Yang","year":"2018","journal-title":"arXiv preprint arXiv:1812.05276"},{"key":"2026040113103210200_ref598","first-page":"1742","article-title":"Frustum convnet: Sliding frustums to aggregate local point-wise features for amodal","author":"Wang","year":"2019","journal-title":"2019 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)"},{"key":"2026040113103210200_ref599","first-page":"41","article-title":"Placer: Semantic place labels from diary data","author":"Krumm","year":"2013","journal-title":"UbiComp 2013\u2014Proceedings of the 2013 ACM International Joint Conference on Pervasive and Ubiquitous Computing"},{"key":"2026040113103210200_ref600","first-page":"11","article-title":"Placer++: Semantic place labels beyond the visit","author":"Krumm","year":"2015","journal-title":"2015 IEEE International Conference on Pervasive Computing and Communications, PerCom 2015"},{"key":"2026040113103210200_ref601","doi-asserted-by":"crossref","first-page":"1142","DOI":"10.1016\/j.neucom.2015.08.071","article-title":"The discovery of personally semantic places based on trajectory data mining","volume":"173","author":"Lv","year":"2016","journal-title":"Neurocomputing"},{"key":"2026040113103210200_ref602","article-title":"Sensor fusion for semantic place labeling","author":"Roor","year":"2017","journal-title":"VEHITS 2017\u2014Proceedings of the 3rd International Conference on Vehicle Technology and Intelligent Transport Systems"},{"key":"2026040113103210200_ref603","article-title":"Semantic place classification and mapping for autonomous agricultural robots","author":"Weiss","year":"2010","journal-title":"IEEE International Conference on Robotics and Automation, Workshop on Semantic Mapping and Autonomous Knowledge Acquisition"},{"key":"2026040113103210200_ref604","author":"Roomba i Series, iRobot,"},{"key":"2026040113103210200_ref605","author":"RS Model, Robomow Friendly House,"},{"key":"2026040113103210200_ref606","author":"Zodiac Robotic Cleaners VX42, Fluidra Group"},{"key":"2026040113103210200_ref607","author":"AUV Sentry, Woods Hole Oceanographic Institution"},{"key":"2026040113103210200_ref608","doi-asserted-by":"crossref","first-page":"251","DOI":"10.1109\/AUV.2016.7778680","article-title":"The design and 200 day per year operation of the autonomous underwater vehicle sentry","author":"Kaiser","year":"2016","journal-title":"2016 IEEE\/OES Autonomous Underwater Vehicles (AUV)"},{"key":"2026040113103210200_ref609","author":"Mars Curiosity Rover, Nasa Science Mars Exploration Program"},{"key":"2026040113103210200_ref610","author":"Manning","year":"2014","journal-title":"Mars Rover Curiosity: An Inside Account from Curiosity\u2019s Chief Engineer"},{"issue":"5","key":"2026040113103210200_ref611","doi-asserted-by":"crossref","first-page":"744","DOI":"10.1109\/TNSRE.2014.2347377","article-title":"An intelligent robotic hospital bed for safe transportation of critical neurosurgery patients along crowded hospital corridors","volume":"23","author":"Wang","year":"2014","journal-title":"IEEE Transactions on Neural Systems and Rehabilitation Engineering"},{"key":"2026040113103210200_ref612","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1007\/978-3-319-25554-5_2","article-title":"Social robots for older adults: Framework of activities for aging in place with robots","author":"Alves-Oliveira","year":"2015","journal-title":"International Conference on Social Robotics"},{"key":"2026040113103210200_ref613","author":"Da Vinci, Intuitive,"},{"issue":"4","key":"2026040113103210200_ref614","doi-asserted-by":"crossref","first-page":"519","DOI":"10.1016\/j.amjsurg.2006.06.042","article-title":"Three-dimensional imaging improves surgical performance for both novice and experienced operators using the da vinci robot system","volume":"193","author":"Byrn","year":"2007","journal-title":"The American Journal of Surgery"},{"key":"2026040113103210200_ref615","first-page":"341","article-title":"The when, where, and how: An adaptive robotic info-terminal for care home residents","author":"Hanheide","year":"2017","journal-title":"Proceedings of the 2017 ACM\/IEEE International Conference on Human\u2013Robot Interaction"},{"issue":"1","key":"2026040113103210200_ref616","doi-asserted-by":"crossref","first-page":"89","DOI":"10.1007\/s12369-018-0482-7","article-title":"Sam, an assistive robotic device dedicated to helping persons with quadriplegia: Usability study","volume":"11","author":"Fattal","year":"2019","journal-title":"International Journal of Social Robotics"},{"key":"2026040113103210200_ref617","author":"Nao, Softbank Robotics,"},{"key":"2026040113103210200_ref618","author":"Pepper, Softbank Robotics,"},{"key":"2026040113103210200_ref619","author":"Vector, Anki,"},{"key":"2026040113103210200_ref620","author":"Beam, Suitable Technologies, Inc."},{"key":"2026040113103210200_ref621","doi-asserted-by":"crossref","first-page":"40","DOI":"10.1016\/j.robot.2016.01.014","article-title":"Long-term assessment of a service robot in a hotel environment","volume":"79","author":"Pinillos","year":"2016","journal-title":"Robotics and Autonomous Systems"},{"key":"2026040113103210200_ref622","first-page":"1","article-title":"Product counting using images with application to robot-based retail stock assessment","author":"Kejriwal","year":"2015","journal-title":"2015 IEEE International Conference on Technologies for Practical Robot Applications (TePRA)"},{"issue":"5","key":"2026040113103210200_ref623","doi-asserted-by":"crossref","first-page":"897","DOI":"10.1109\/TRO.2010.2062550","article-title":"A communication robot in a shopping mall","volume":"26","author":"Kanda","year":"2010","journal-title":"IEEE Transactions on Robotics"},{"key":"2026040113103210200_ref624","first-page":"2005","article-title":"Toomas: Interactive shopping guide robots in everyday use-final implementation and experiences from long-term field trials","author":"Gross","year":"2009","journal-title":"2009 IEEE\/RSJ International Conference on Intelligent Robots and Systems"},{"key":"2026040113103210200_ref625","first-page":"1","article-title":"A tea-serving robot for office environment","author":"Kumar","year":"2014","journal-title":"ISR\/Robotik 2014; 41st International Symposium on Robotics"},{"issue":"2","key":"2026040113103210200_ref626","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1080\/1023697X.2015.1043960","article-title":"Robotics in ecommerce logistics","volume":"22","author":"Huang","year":"2015","journal-title":"HKIE Transactions"},{"key":"2026040113103210200_ref627","author":"Starship, Starship Technologies,"},{"key":"2026040113103210200_ref628","author":"Robots\u2014Your guide to the world of robots"},{"key":"2026040113103210200_ref629","doi-asserted-by":"crossref","first-page":"3354","DOI":"10.1109\/CVPR.2012.6248074","article-title":"Are we ready for autonomous driving? The kitti vision benchmark suite","author":"Geiger","year":"2012","journal-title":"2012 IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref630","first-page":"370","article-title":"The unmanned aerial vehicle benchmark: Object detection and tracking","author":"Du","year":"2018","journal-title":"Proceedings of the European Conference on Computer Vision (ECCV)"},{"key":"2026040113103210200_ref631","article-title":"Vision meets drones: A challenge","author":"Zhu","year":"2018","journal-title":"arXiv preprint arXiv:1804.07437"},{"key":"2026040113103210200_ref632","doi-asserted-by":"crossref","first-page":"108","DOI":"10.1016\/j.isprsjprs.2020.05.009","article-title":"UAVid: A semantic segmentation dataset for UAV imagery","volume":"165","author":"Lyu","year":"2020","journal-title":"ISPRS Journal of Photogrammetry and Remote Sensing"},{"key":"2026040113103210200_ref633","doi-asserted-by":"crossref","first-page":"256","DOI":"10.1016\/j.isprsjprs.2013.10.004","article-title":"Results of the ISPRS benchmark on urban object detection and 3D building reconstruction","volume":"93","author":"Rottensteiner","year":"2014","journal-title":"ISPRS Journal of Photogrammetry and Remote Sensing"},{"issue":"12","key":"2026040113103210200_ref634","doi-asserted-by":"crossref","first-page":"5547","DOI":"10.1109\/JSTARS.2016.2569162","article-title":"Processing of extremely high-resolution lidar and RGB data: Outcome of the 2015 IEEE GRSS data fusion contest\u2013part a: 2-d contest","volume":"9","author":"Campos-Taberner","year":"2016","journal-title":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing"},{"issue":"6","key":"2026040113103210200_ref635","doi-asserted-by":"crossref","first-page":"2405","DOI":"10.1109\/JSTARS.2014.2305441","article-title":"Hyperspectral and lidar data fusion: Outcome of the 2013 GRSS data fusion contest","volume":"7","author":"Debes","year":"2014","journal-title":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing"},{"key":"2026040113103210200_ref636","doi-asserted-by":"crossref","first-page":"172","DOI":"10.1109\/CVPRW.2018.00031","article-title":"Deepglobe 2018: A challenge to parse the earth through satellite images","author":"Demir","year":"2018","journal-title":"2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)"},{"key":"2026040113103210200_ref637","doi-asserted-by":"crossref","first-page":"1499","DOI":"10.1109\/WACV.2018.00168","article-title":"Ensemble knowledge transfer for semantic segmentation","author":"Nigam","year":"2018","journal-title":"2018 IEEE Winter Conference on Applications of Computer Vision (WACV)"},{"key":"2026040113103210200_ref638","doi-asserted-by":"crossref","DOI":"10.3390\/rs11111369","article-title":"Unsupervised domain adaptation using generative adversarial networks for semantic segmentation of aerial images","volume":"11","author":"Benjdira","year":"2019","journal-title":"Remote Sensing"},{"key":"2026040113103210200_ref639","first-page":"2672","article-title":"Generative adversarial nets","author":"Goodfellow","year":"2014","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2026040113103210200_ref640","first-page":"1","article-title":"Automatic detection, classification and tracking of objects in the ocean surface from UAVs using a thermal camera","author":"Leira","year":"2015","journal-title":"2015 IEEE Aerospace Conference"},{"key":"2026040113103210200_ref641","doi-asserted-by":"crossref","DOI":"10.1109\/ICNSURV.2017.8011932","article-title":"Bi-heterogeneous convolutional neural network for UAV-based dynamic scene classification","author":"Zheng","year":"2017","journal-title":"2017 Integrated Communications, Navigation and Surveillance Conference (ICNS)"},{"key":"2026040113103210200_ref642","first-page":"255","article-title":"Nature conservation drones for automatic localization and counting of animals","author":"van Gemert","year":"2014","journal-title":"European Conference on Computer Vision"},{"key":"2026040113103210200_ref643","doi-asserted-by":"crossref","DOI":"10.3390\/s18072048","article-title":"Detection of cattle using drones and convolutional neural networks","volume":"18","author":"Rivas","year":"2018","journal-title":"Sensors"},{"key":"2026040113103210200_ref644","doi-asserted-by":"crossref","DOI":"10.1609\/aaai.v32i1.11414","article-title":"Spot poachers in action: Augmenting conservation drones with automatic detection in near real time","author":"Bondi","year":"2018","journal-title":"Thirty-Second AAAI Conference on Artificial Intelligence"},{"key":"2026040113103210200_ref645","article-title":"Construction site management and control technology based on UAV visual surveillance","author":"Huang","year":"2018","journal-title":"Automation and Instrumentation"},{"key":"2026040113103210200_ref646","author":"Wing, Wing Aviation LLC,"},{"issue":"4","key":"2026040113103210200_ref647","doi-asserted-by":"crossref","first-page":"959","DOI":"10.1007\/s10712-019-09529-9","article-title":"New opportunities for forest remote sensing through ultra-high-density drone lidar","volume":"40","author":"Kellner","year":"2019","journal-title":"Surveys in Geophysics"},{"key":"2026040113103210200_ref648","article-title":"Cad2rl: Real single-image flight without a single real image","author":"Sadeghi","year":"2017","journal-title":"Robotics: Science and Systems"},{"issue":"2","key":"2026040113103210200_ref649","doi-asserted-by":"crossref","first-page":"1088","DOI":"10.1109\/LRA.2018.2795643","article-title":"Dronet: Learning to fly by driving","volume":"3","author":"Loquercio","year":"2018","journal-title":"IEEE Robotics and Automation Letters"},{"key":"2026040113103210200_ref650","first-page":"3948","article-title":"Learning to fly by crashing","author":"Gandhi","year":"2017","journal-title":"2017 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)"},{"key":"2026040113103210200_ref651","doi-asserted-by":"crossref","first-page":"2169","DOI":"10.1109\/ICRA.2017.7989250","article-title":"Learning modular neural network policies for multi-task and multi-robot transfer","author":"Devin","year":"2017","journal-title":"2017 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref652","first-page":"1537","article-title":"Policy transfer via modularity and reward guiding","author":"Clavera","year":"2017","journal-title":"2017 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)"},{"key":"2026040113103210200_ref653","first-page":"1","article-title":"Driving policy transfer via modularity and abstraction","author":"Mueller","year":"2018","journal-title":"Conference on Robot Learning"},{"issue":"1","key":"2026040113103210200_ref654","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/TRO.2019.2942989","article-title":"Deep drone racing: From simulation to reality with domain randomization","volume":"36","author":"Loquercio","year":"2019","journal-title":"IEEE Transactions on Robotics"},{"key":"2026040113103210200_ref655","doi-asserted-by":"crossref","first-page":"690","DOI":"10.1109\/ICRA.2019.8793631","article-title":"Beauty and the beast: Optimal methods meet learning for drone racing","author":"Kaufmann","year":"2019","journal-title":"2019 International Conference on Robotics and Automation (ICRA)"},{"issue":"5","key":"2026040113103210200_ref656","doi-asserted-by":"crossref","first-page":"8357","DOI":"10.1109\/JIOT.2019.2917066","article-title":"A 64-mw DNN-based visual navigation engine for autonomous nano-drones","volume":"6","author":"Palossi","year":"2019","journal-title":"IEEE Internet of Things Journal"},{"issue":"5","key":"2026040113103210200_ref657","doi-asserted-by":"crossref","first-page":"1389","DOI":"10.1109\/TRO.2020.2994881","article-title":"A real-time game theoretic planner for autonomous two-player drone racing","volume":"36","author":"Spica","year":"2020","journal-title":"IEEE Transactions on Robotics"},{"key":"2026040113103210200_ref658","first-page":"1","article-title":"Towards simulating semantic onboard UAV navigation","author":"Mandel","year":"2020","journal-title":"Proceedings of the 2020 IEEE Aerospace Conference"},{"key":"2026040113103210200_ref659","article-title":"A discourse on winning and losing [briefing slides]","author":"Boyd","year":"1987","journal-title":"Maxwell Air Force Base, AL: Air University Library (Document No. MU 43947)"},{"key":"2026040113103210200_ref660","volume-title":"A Discourse on Winning and Losing","author":"Boyd","year":"2018"},{"key":"2026040113103210200_ref661","first-page":"115","article-title":"Towards semantic context-aware drones for aerial scenes understanding","author":"Cavaliere","year":"2016","journal-title":"2016 13th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)"},{"issue":"3","key":"2026040113103210200_ref662","doi-asserted-by":"crossref","first-page":"555","DOI":"10.1109\/TSMC.2017.2757462","article-title":"Semantically enhanced UAVs to increase the aerial scene understanding","volume":"49","author":"Cavaliere","year":"2017","journal-title":"IEEE Transactions on Systems, Man, and Cybernetics: Systems"},{"key":"2026040113103210200_ref663","first-page":"1","article-title":"Relationship between UAVs and ambient objects with threat situational awareness through grid map-based ontology reasoning","author":"Jeon","year":"2019","journal-title":"International Journal of Computers and Applications"},{"issue":"4","key":"2026040113103210200_ref664","doi-asserted-by":"crossref","first-page":"602","DOI":"10.1109\/JAS.2017.7510604","article-title":"A survey of human-centered intelligent robots: Issues and challenges","volume":"4","author":"He","year":"2017","journal-title":"IEEE\/CAA Journal of Automatica Sinica"},{"key":"2026040113103210200_ref665","doi-asserted-by":"crossref","first-page":"1981","DOI":"10.1109\/ROBIO.2018.8665075","article-title":"Autonomous navigation by mobile robots in human environments: A survey","author":"Cheng","year":"2018","journal-title":"2018 IEEE International Conference on Robotics and Biomimetics (ROBIO)"},{"key":"2026040113103210200_ref666","doi-asserted-by":"crossref","first-page":"178","DOI":"10.1007\/978-3-319-70022-9_18","article-title":"A need for service robots among health care professionals in hospitals and housing services","author":"V\u00e4nni","year":"2017","journal-title":"International Conference on Social Robotics"},{"key":"2026040113103210200_ref667","first-page":"225","article-title":"Monitoring the acceptance of a social service robot in a shopping mall: First results","author":"Niemel\u00e4","year":"2017","journal-title":"Proceedings of the Companion of the 2017 ACM\/IEEE International Conference on Human\u2013Robot Interaction"},{"key":"2026040113103210200_ref668","first-page":"308","article-title":"Consumer evaluation of hotel service robots","author":"Tussyadiah","year":"2018","journal-title":"Information and Communication Technologies in Tourism 2018"},{"key":"2026040113103210200_ref669","doi-asserted-by":"crossref","DOI":"10.3389\/fneur.2017.00228","article-title":"Challenges for service robots\u2014requirements of elderly adults with cognitive impairments","volume":"8","author":"Korchut","year":"2017","journal-title":"Frontiers in Neurology"},{"key":"2026040113103210200_ref670","doi-asserted-by":"crossref","first-page":"12 913","DOI":"10.1109\/ACCESS.2018.2808369","article-title":"A review of service robots coping with uncertain information in natural language instructions","volume":"6","author":"Muthugala","year":"2018","journal-title":"IEEE Access"},{"key":"2026040113103210200_ref671","doi-asserted-by":"crossref","first-page":"174","DOI":"10.1007\/978-3-030-38724-2_13","article-title":"Service robots in the hospitality industry: An exploratory literature review","author":"Rosete","year":"2020","journal-title":"International Conference on Exploring Services Science"},{"key":"2026040113103210200_ref672","doi-asserted-by":"crossref","DOI":"10.1088\/1757-899X\/705\/1\/012003","article-title":"A review on service robots: Mechanical design and localization system","volume":"705","author":"Bakri","year":"2019","journal-title":"IOP Conference Series: Materials Science and Engineering"},{"issue":"5","key":"2026040113103210200_ref673","doi-asserted-by":"crossref","first-page":"451","DOI":"10.1057\/s41303-017-0046-1","article-title":"Service robots in hospitals: New perspectives on niche evolution and technology affordances","volume":"26","author":"Mettler","year":"2017","journal-title":"European Journal of Information Systems"},{"issue":"4","key":"2026040113103210200_ref674","doi-asserted-by":"crossref","first-page":"493","DOI":"10.1007\/s12369-017-0452-5","article-title":"Social acceptance of robots in different occupational fields: A systematic literature review","volume":"10","author":"Savela","year":"2018","journal-title":"International Journal of Social Robotics"},{"key":"2026040113103210200_ref675","article-title":"Robots in welfare services: A systematic literature review","author":"Aaen","year":"2018","journal-title":"IRIS\/SCIS Conference 2018"},{"key":"2026040113103210200_ref676","article-title":"Service robots and the changing roles of employees in restaurants: A cross cultural study","volume":"17","author":"Tuomi","year":"2019","journal-title":"E-Review of Tourism Research"},{"issue":"2","key":"2026040113103210200_ref677","first-page":"267","article-title":"Robots or frontline employees?: Exploring customers\u2019 attributions of responsibility and stability after service failure or success","volume":"31","author":"Gracia","year":"2020","journal-title":"Journal of Service Management"},{"key":"2026040113103210200_ref678","article-title":"Desiderata for planning systems in general-purpose service robots","author":"Walker","year":"2019","journal-title":"International Conference on Automated Planning and Scheduling (ICAPS) Workshop on Planning and Robotics (PlanRob)"},{"key":"2026040113103210200_ref679","author":"Care-o-bot 4, Fraunhofer Institute for Manufacturing Engineering and Automation,"},{"key":"2026040113103210200_ref680","author":"PR2, Willow Garage,"},{"key":"2026040113103210200_ref681","author":"Robot Operating System (ROS), Open Robotics,"},{"key":"2026040113103210200_ref682","first-page":"210","article-title":"Accurate pouring with an autonomous robot using an rgb-d camera","author":"Do","year":"2018","journal-title":"International Conference on Intelligent Autonomous Systems"},{"key":"2026040113103210200_ref683","doi-asserted-by":"crossref","first-page":"3671","DOI":"10.1109\/ICRA.2012.6224742","article-title":"A robot path planning framework that learns from experience","author":"Berenson","year":"2012","journal-title":"2012 IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref684","author":"SPOT, Boston Dynamics,"},{"issue":"1","key":"2026040113103210200_ref685","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1109\/TSMCB.2008.2004050","article-title":"Multisensor-based human detection and tracking for mobile service robots","volume":"39","author":"Bellotto","year":"2008","journal-title":"IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics)"},{"key":"2026040113103210200_ref686","doi-asserted-by":"crossref","DOI":"10.1109\/ICRA40945.2020.9197556","article-title":"An intelligent spraying system with deep learning-based semantic segmentation of fruit trees in orchards","author":"Kim","year":"2020","journal-title":"2020 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref687","doi-asserted-by":"crossref","DOI":"10.1109\/ICRA40945.2020.9196830","article-title":"Semantic linking maps for active visual object search","author":"Zeng","year":"2020","journal-title":"2020 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref688","doi-asserted-by":"crossref","DOI":"10.1201\/9780429297922","author":"Sharma","year":"2019","journal-title":"From Visual Surveillance to Internet of Things: Technology and Applications"},{"key":"2026040113103210200_ref689","doi-asserted-by":"crossref","DOI":"10.4324\/9781315743981-26","article-title":"Visual surveillance technologies","author":"Jones","year":"2017","journal-title":"The Routledge Handbook of Technology, Crime and Justice,"},{"key":"2026040113103210200_ref690","article-title":"Visual surveillance of natural environments: Background subtraction challenges and methods","author":"Bouwmans","year":"2019","journal-title":"From Visual Surveillance to Internet of Things: Technology and Applications,"},{"key":"2026040113103210200_ref691","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1007\/978-3-319-68533-5_1","article-title":"A survey of using biometrics for smart visual surveillance: Gait recognition","author":"Bouchrika","year":"2018","journal-title":"Surveillance in Action"},{"key":"2026040113103210200_ref692","article-title":"Anomaly detection in road traffic using visual surveillance: A survey","author":"Kumaran","year":"2019","journal-title":"arXiv preprint arXiv:1901.08292"},{"key":"2026040113103210200_ref693","first-page":"475","article-title":"Video analytics for visual surveillance and applications: An overview and survey","author":"Olatunji","year":"2019","journal-title":"Machine Learning Paradigms"},{"key":"2026040113103210200_ref694","doi-asserted-by":"crossref","first-page":"42","DOI":"10.4018\/978-1-5225-3056-5.ch003","article-title":"Applications of intelligent video analytics in the field of retail management: A study","author":"Singh","year":"2018","journal-title":"Supply Chain Management Strategies and Risk Assessment in Retail Environments"},{"issue":"2","key":"2026040113103210200_ref695","doi-asserted-by":"crossref","first-page":"283","DOI":"10.1007\/s10462-017-9545-7","article-title":"Suspicious human activity recognition: A review","volume":"50","author":"Tripathi","year":"2018","journal-title":"Artificial Intelligence Review"},{"key":"2026040113103210200_ref696","article-title":"Application of an adaptive background model for monitoring honeybees","author":"Knauer","year":"2005","journal-title":"Proceedings of Visualization, Imaging and Image Processing (VIIP),"},{"key":"2026040113103210200_ref697","article-title":"Automated visual monitoring of nesting seabirds","author":"Dickinson","year":"2010","journal-title":"Workshop on Visual Observation and Analysis of Animal and Insect Behaviour, Istanbul"},{"key":"2026040113103210200_ref698","doi-asserted-by":"crossref","first-page":"74","DOI":"10.1016\/j.cviu.2013.12.003","article-title":"A texton-based kernel density estimation approach for background modeling under extreme conditions","volume":"122","author":"Spampinato","year":"2014","journal-title":"Computer Vision and Image Understanding"},{"key":"2026040113103210200_ref699","doi-asserted-by":"crossref","first-page":"1966","DOI":"10.1109\/ROBIO.2016.7866617","article-title":"Position and direction estimation of wolf spiders, pardosa astrigera, from video images","author":"Iwatani","year":"2016","journal-title":"2016 IEEE International Conference on Robotics and Biomimetics (ROBIO)"},{"key":"2026040113103210200_ref700","article-title":"Visual surveillance of human activities: Background subtraction challenges and methods","author":"Bouwmans","year":"2019","journal-title":"From Visual Surveillance to Internet of Things: Technology and Applications"},{"key":"2026040113103210200_ref701","doi-asserted-by":"crossref","first-page":"373","DOI":"10.1016\/j.trc.2019.12.013","article-title":"Microscopic modelling of area-based heterogeneous traffic flow: Area selection and vehicle movement","volume":"111","author":"Sarkar","year":"2020","journal-title":"Transportation Research Part C: Emerging Technologies"},{"key":"2026040113103210200_ref702","first-page":"110","article-title":"Robust segmentation process to detect incidents on highways","author":"Monteiro","year":"2008","journal-title":"International Conference Image Analysis and Recognition"},{"key":"2026040113103210200_ref703","doi-asserted-by":"crossref","first-page":"1649","DOI":"10.1109\/ICCAS.2016.7832520","article-title":"Automatic parking system using background subtraction with CCTV environment international conference on control, automation and systems (ICCAS 2016)","author":"Cho","year":"2016","journal-title":"2016 16th International Conference on Control, Automation and Systems (ICCAS)"},{"key":"2026040113103210200_ref704","doi-asserted-by":"crossref","first-page":"153","DOI":"10.1016\/j.compind.2018.03.005","article-title":"Vanishing point detection for visual surveillance systems in railway platform environments","volume":"98","author":"Tarrit","year":"2018","journal-title":"Computers in Industry"},{"issue":"2","key":"2026040113103210200_ref705","doi-asserted-by":"crossref","first-page":"1201","DOI":"10.1007\/s11042-014-2364-9","article-title":"Towards automated visual surveillance using gait for identity recognition and tracking across multiple non-intersecting cameras","volume":"75","author":"Bouchrika","year":"2016","journal-title":"Multimedia Tools and Applications"},{"issue":"6","key":"2026040113103210200_ref706","doi-asserted-by":"crossref","first-page":"2882","DOI":"10.1109\/TIP.2019.2891901","article-title":"Single image defogging based on illumination decomposition for visual maritime surveillance","volume":"28","author":"Hu","year":"2019","journal-title":"IEEE Transactions on Image Processing"},{"key":"2026040113103210200_ref707","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1007\/978-981-10-5146-3_15","article-title":"Rule based visual surveillance system for the retail domain","author":"Rashmi","year":"2018","journal-title":"Proceedings of International Conference on Cognition and Recognition"},{"key":"2026040113103210200_ref708","first-page":"73","article-title":"Gait recognition using motion trajectory analysis","author":"Khan","year":"2017","journal-title":"International Conference on Computer Recognition Systems"},{"key":"2026040113103210200_ref709","first-page":"868","article-title":"Mars: A video benchmark for large-scale person re-identification","author":"Zheng","year":"2016","journal-title":"European Conference on Computer Vision"},{"key":"2026040113103210200_ref710","first-page":"2119","article-title":"Attention-aware compositional network for person re-identification","author":"Xu","year":"2018","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref711","first-page":"3931","article-title":"Attention-aware deep reinforcement learning for video face recognition","author":"Rao","year":"2017","journal-title":"Proceedings of the IEEE International Conference on Computer Vision"},{"key":"2026040113103210200_ref712","doi-asserted-by":"crossref","first-page":"471","DOI":"10.1109\/SIBGRAPI.2018.00067","article-title":"Deep face recognition: A survey","author":"Masi","year":"2018","journal-title":"2018 31st SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)"},{"issue":"2","key":"2026040113103210200_ref713","doi-asserted-by":"crossref","first-page":"166","DOI":"10.25103\/jestr.102.20","article-title":"Face recognition: A survey","volume":"10","author":"Sharif","year":"2017","journal-title":"Journal of Engineering Science and Technology Review"},{"key":"2026040113103210200_ref714","first-page":"3","article-title":"The sixth visual object tracking VOT2018 challenge results","author":"Kristan","year":"2018","journal-title":"European Conference on Computer Vision Workshops"},{"key":"2026040113103210200_ref715","article-title":"Mot16: A benchmark for multi-object tracking","author":"Milan","year":"2016","journal-title":"arXiv preprint arXiv:1603.00831"},{"key":"2026040113103210200_ref716","first-page":"4340","article-title":"Virtual worlds as proxy for multi-object tracking analysis","author":"Gaidon","year":"2016","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref717","doi-asserted-by":"crossref","first-page":"138","DOI":"10.1007\/978-3-319-27857-5_13","article-title":"A hierarchical frame-by-frame association method based on graph matching for multi-object tracking","author":"Garg","year":"2015","journal-title":"International Symposium on Visual Computing"},{"key":"2026040113103210200_ref718","article-title":"Mot20: A benchmark for multi object tracking in crowded scenes","author":"Dendorfer","year":"2020","journal-title":"arXiv preprint arXiv:2003.09003"},{"issue":"6","key":"2026040113103210200_ref719","doi-asserted-by":"crossref","first-page":"7585","DOI":"10.1007\/s11042-018-6472-9","article-title":"Abandoned or removed object detection from visual surveillance: A review","volume":"78","author":"Tripathi","year":"2019","journal-title":"Multimedia Tools and Applications"},{"issue":"3","key":"2026040113103210200_ref720","doi-asserted-by":"crossref","first-page":"362","DOI":"10.1002\/rob.21918","article-title":"A survey of deep learning techniques for autonomous driving","volume":"37","author":"Grigorescu","year":"2019","journal-title":"Journal of Field Robotics"},{"key":"2026040113103210200_ref721","first-page":"1","article-title":"When to use what data set for your self-driving car algorithm: An overview of publicly available driving datasets","author":"Yin","year":"2017","journal-title":"2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC)"},{"issue":"1\u20133","key":"2026040113103210200_ref722","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1561\/0600000079","article-title":"Computer vision for autonomous vehicles: Problems, datasets and state of the art","volume":"12","author":"Janai","year":"2020","journal-title":"Foundations and Trends\u00ae in Computer Graphics and Vision"},{"issue":"4","key":"2026040113103210200_ref723","doi-asserted-by":"crossref","first-page":"6","DOI":"10.1109\/MITS.2014.2336271","article-title":"Three decades of driver assistance systems: Review and future perspectives","volume":"6","author":"Bengler","year":"2014","journal-title":"IEEE Intelligent Transportation Systems Magazine"},{"key":"2026040113103210200_ref724","first-page":"11 621","article-title":"Nuscenes: A multimodal dataset for autonomous driving","author":"Caesar","year":"2020","journal-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref725","first-page":"2446","article-title":"Scalability in perception for autonomous driving: Waymo open dataset","author":"Sun","year":"2020","journal-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref726","article-title":"AI for Full-Self Driving","author":"Karpathy","journal-title":"Matroid, 5th Annual Scaled Machine Learning Conference"},{"key":"2026040113103210200_ref727","article-title":"P1-007: How automated vehicles will interact with road infrastructure now and in the future","author":"Milford"},{"key":"2026040113103210200_ref728","article-title":"Autonomous driving Interview with Michael Fausten, Bosch Global","author":"Fausten"},{"key":"2026040113103210200_ref729","first-page":"2147","article-title":"Monocular 3d object detection for autonomous driving","author":"Chen","year":"2016","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref730","article-title":"Reinforcement learning based control of imitative policies for near-accident driving","author":"Cao","year":"2020","journal-title":"Proceedings of Robotics: Science and Systems XVI"},{"key":"2026040113103210200_ref731","doi-asserted-by":"crossref","DOI":"10.1109\/ICRA40945.2020.9196556","article-title":"Segvoxelnet: Exploring semantic context and depth-aware features for 3D vehicle detection from point cloud","author":"Yi","year":"2020","journal-title":"2020 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref732","first-page":"1513","article-title":"3d fully convolutional network for vehicle detection in point cloud","author":"Li","year":"2017","journal-title":"2017 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)"},{"key":"2026040113103210200_ref733","first-page":"7345","article-title":"Multi-task multi-sensor fusion for 3d object detection","author":"Liang","year":"2019","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref734","first-page":"8660","article-title":"End-to-end interpretable neural motion planner","author":"Zeng","year":"2019","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref735","first-page":"3949","article-title":"Jointly learnable behavior and trajectory planning for self-driving vehicles","author":"Sadat","year":"2019","journal-title":"2019 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)"},{"key":"2026040113103210200_ref736","doi-asserted-by":"crossref","first-page":"635","DOI":"10.1109\/ICRA.2018.8462884","article-title":"End-to-end learning of multisensor 3d tracking by detection","author":"Frossard","year":"2018","journal-title":"2018 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref737","first-page":"759","article-title":"It is not the journey but the destination: Endpoint conditioned trajectory prediction","author":"Mangalam","year":"2020","journal-title":"Proceedings of the European Conference on Computer Vision (ECCV)"},{"issue":"2","key":"2026040113103210200_ref738","doi-asserted-by":"crossref","first-page":"3485","DOI":"10.1109\/LRA.2020.2976305","article-title":"Spatiotemporal relationship reasoning for pedestrian intent prediction","volume":"5","author":"Liu","year":"2020","journal-title":"IEEE Robotics and Automation Letters"},{"key":"2026040113103210200_ref739","first-page":"947","article-title":"Intentnet: Learning to predict intention from raw sensor data","author":"Casas","year":"2018","journal-title":"Conference on Robot Learning"},{"key":"2026040113103210200_ref740","doi-asserted-by":"crossref","DOI":"10.1109\/ICRA40945.2020.9196510","article-title":"Segmenting 2k-videos at 36.5 fps with 24.3 gflops: Accurate and lightweight realtime semantic segmentation network","author":"Oh","year":"2020","journal-title":"2020 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref741","doi-asserted-by":"crossref","first-page":"399","DOI":"10.1109\/3DV.2018.00053","article-title":"Efficient convolutions for real-time semantic segmentation of 3d point clouds","author":"Zhang","year":"2018","journal-title":"2018 International Conference on 3D Vision (3DV)"},{"key":"2026040113103210200_ref742","first-page":"7652","article-title":"Pixor: Real-time 3d object detection from point clouds","author":"Yang","year":"2018","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref743","first-page":"3569","article-title":"Fast and furious: Real time end-to-end 3d detection, tracking and motion forecasting with a single convolutional net","author":"Luo","year":"2018","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref744","doi-asserted-by":"crossref","DOI":"10.1109\/ICRA40945.2020.9196627","article-title":"Online camera-lidar calibration with sensor semantic information","author":"Zhu","year":"2020","journal-title":"2020 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref745","first-page":"10 316","article-title":"Learning to localize through compressed binary maps","author":"Wei","year":"2019","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref746","first-page":"605","article-title":"Learning to localize using a lidar intensity map","author":"Barsan","year":"2018","journal-title":"CoRL"},{"key":"2026040113103210200_ref747","first-page":"3102","article-title":"Deep multi-sensor lane detection","author":"Bai","year":"2018","journal-title":"2018 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)"},{"key":"2026040113103210200_ref748","first-page":"8024","article-title":"Matching adversarial networks","author":"M\u00e1ttyus","year":"2018","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref749","doi-asserted-by":"crossref","first-page":"3028","DOI":"10.1109\/ICCV.2017.327","article-title":"Torontocity: Seeing the world with a million eyes","author":"Wang","year":"2017","journal-title":"2017 IEEE International Conference on Computer Vision (ICCV)"},{"key":"2026040113103210200_ref750","first-page":"3417","article-title":"Hierarchical recurrent attention networks for structured online maps","author":"Homayounfar","year":"2018","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref751","first-page":"9512","article-title":"Convolutional recurrent network for road boundary extraction","author":"Liang","year":"2019","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref752","first-page":"396","article-title":"End-to-end deep structured models for drawing crosswalks","author":"Liang","year":"2018","journal-title":"Proceedings of the European Conference on Computer Vision (ECCV)"},{"key":"2026040113103210200_ref753","first-page":"3438","article-title":"Deeproadmapper: Extracting road topology from aerial images","author":"M\u00e1ttyus","year":"2017","journal-title":"Proceedings of the IEEE International Conference on Computer Vision"},{"key":"2026040113103210200_ref754","first-page":"146","article-title":"Hdnet: Exploiting hd maps for 3d object detection","author":"Yang","year":"2018","journal-title":"Conference on Robot Learning"},{"key":"2026040113103210200_ref755","first-page":"5304","article-title":"Exploiting sparse semantic hd maps for self-driving vehicle localization","author":"Ma","year":"2019","journal-title":"2019 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)"},{"key":"2026040113103210200_ref756","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1016\/j.cosrev.2018.03.001","article-title":"New trends on moving object detection in video images captured by a moving camera: A survey","volume":"28","author":"Yazdi","year":"2018","journal-title":"Computer Science Review"},{"key":"2026040113103210200_ref757","article-title":"Police body-worn cameras","volume":"96","author":"Stoughton","year":"2017","journal-title":"NCL Rev."},{"key":"2026040113103210200_ref758","first-page":"155","article-title":"Argus: Realistic target coverage by drones","author":"Saeed","year":"2017","journal-title":"Proceedings of the 16th ACM\/IEEE International Conference on Information Processing in Sensor Networks"},{"issue":"3","key":"2026040113103210200_ref759","doi-asserted-by":"crossref","first-page":"167","DOI":"10.5194\/gh-71-167-2016","article-title":"Domestic drones: The politics of verticality and the surveillance industrial complex","volume":"71","author":"Bracken-Roche","year":"2016","journal-title":"Geographica Helvetica"},{"key":"2026040113103210200_ref760","article-title":"Fully convolutional siamese autoencoder for change detection in UAV aerial images","author":"Mesquita","year":"2019","journal-title":"IEEE Geoscience and Remote Sensing Letters"},{"key":"2026040113103210200_ref761","doi-asserted-by":"crossref","first-page":"3674","DOI":"10.1109\/WSC.2016.7822394","article-title":"Effective visual surveillance of human crowds using cooperative unmanned vehicles","author":"Minaeian","year":"2016","journal-title":"2016 Winter Simulation Conference (WSC)"},{"key":"2026040113103210200_ref762","article-title":"Room searching performance evaluation for the jagabottm indoor surveillance robot","author":"Saad","year":"2016","journal-title":"KnE Engineering"},{"key":"2026040113103210200_ref763","first-page":"1679","article-title":"Design and implementation of surveillance robot for outdoor security","author":"Meghana","year":"2017","journal-title":"2017 2nd IEEE International Conference on Recent Trends in Electronics, Information and Communication Technology (RTEICT)"},{"key":"2026040113103210200_ref764","first-page":"43","article-title":"Robots in crisis management: A survey","author":"Kostavelis","year":"2017","journal-title":"International Conference on Information Systems for Crisis Response and Management in Mediterranean Countries"},{"issue":"1","key":"2026040113103210200_ref765","doi-asserted-by":"crossref","first-page":"249","DOI":"10.15748\/jasse.6.249","article-title":"Arbitrary viewpoint visualization for disaster response robots","volume":"6","author":"Fuchida","year":"2018","journal-title":"Journal of Advanced Simulation in Science and Engineering"},{"key":"2026040113103210200_ref766","first-page":"1","article-title":"Armatron\u2014 A wearable gesture recognition glove: For control of robotic devices in disaster management and human rehabilitation","author":"Asokan","year":"2016","journal-title":"2016 International Conference on Robotics and Automation for Humanitarian Applications (RAHA)"},{"key":"2026040113103210200_ref767","first-page":"1","article-title":"Semantic data exchange between collaborative robots in fog environment: Can coap be a choice?","author":"Dey","year":"2017","journal-title":"2017 Global Internet of Things Summit (GIoTS)"},{"key":"2026040113103210200_ref768","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1109\/APWC-on-CSE.2016.025","article-title":"Drone-assisted disaster management: Finding victims via infrared camera and lidar sensor fusion","author":"Lee","year":"2016","journal-title":"2016 3rd Asia-Pacific World Congress on Computer Science and Engineering (APWC on CSE)"},{"issue":"2\u20133","key":"2026040113103210200_ref769","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1561\/1100000049","article-title":"A survey of augmented reality","volume":"8","author":"Billinghurst","year":"2015","journal-title":"Foundations and Trends in Human-Computer Interaction"},{"key":"2026040113103210200_ref770","article-title":"The history of mobile augmented reality","author":"Arth","year":"2015","journal-title":"arXiv preprint arXiv:1505.01319"},{"issue":"3","key":"2026040113103210200_ref771","doi-asserted-by":"crossref","first-page":"234","DOI":"10.1162\/PRES_a_00264","article-title":"The most important challenge facing augmented reality","volume":"25","author":"Azuma","year":"2016","journal-title":"Presence: Teleoperators and Virtual Environments,"},{"issue":"2","key":"2026040113103210200_ref772","first-page":"1","article-title":"A survey of augmented reality technologies, applications and limitations","volume":"9","author":"Van","year":"2010","journal-title":"International Journal of Virtual Reality"},{"issue":"4","key":"2026040113103210200_ref773","first-page":"133","article-title":"Augmented reality trends in education: A systematic review of research and applications","volume":"17","author":"Bacca Acosta","year":"2014","journal-title":"Journal of Educational Technology and Society"},{"key":"2026040113103210200_ref774","article-title":"Augmented reality browser survey","volume":"1101","author":"Grubert","year":"2011"},{"key":"2026040113103210200_ref775","doi-asserted-by":"crossref","first-page":"123","DOI":"10.1109\/ISMAR.2017.29","article-title":"Recent developments and future challenges in medical mixed reality","author":"Chen","year":"2017","journal-title":"2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR)"},{"key":"2026040113103210200_ref776","doi-asserted-by":"crossref","DOI":"10.1017\/S1460396913000277","article-title":"An overview of augmented and virtual reality applications in radiotherapy and future developments enabled by modern tablet devices","volume":"13","author":"Cosentino","year":"2014","journal-title":"Journal of Radiotherapy in Practice"},{"issue":"2","key":"2026040113103210200_ref777","doi-asserted-by":"crossref","first-page":"98","DOI":"10.1016\/j.compmedimag.2013.01.009","article-title":"The state of the art of visualization in mixed reality image guided surgery","volume":"37","author":"Kersten-Oertel","year":"2013","journal-title":"Computerized Medical Imaging and Graphics"},{"issue":"4","key":"2026040113103210200_ref778","doi-asserted-by":"crossref","first-page":"537","DOI":"10.1007\/s10143-016-0732-9","article-title":"Augmented reality in neurosurgery: A systematic review","volume":"40","author":"Meola","year":"2017","journal-title":"Neurosurgical Review"},{"key":"2026040113103210200_ref779","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1007\/978-3-319-43775-0_12","article-title":"Visualization techniques for augmented reality in endoscopic surgery","author":"Wang","year":"2016","journal-title":"International Conference on Medical Imaging and Augmented Reality"},{"key":"2026040113103210200_ref780","doi-asserted-by":"crossref","first-page":"66","DOI":"10.1016\/j.media.2017.01.007","article-title":"The status of augmented reality in laparoscopic surgery as of 2016","volume":"37","author":"Bernhardt","year":"2017","journal-title":"Medical Image Analysis"},{"key":"2026040113103210200_ref781","first-page":"281","article-title":"A see through future: Augmented reality and health information systems.","author":"Monkman","year":"2015","journal-title":"ITCH"},{"key":"2026040113103210200_ref782","doi-asserted-by":"crossref","DOI":"10.7717\/peerj.469","article-title":"Augmented reality in healthcare education: An integrative review","volume":"2","author":"Zhu","year":"2014","journal-title":"PeerJ"},{"issue":"11","key":"2026040113103210200_ref783","doi-asserted-by":"crossref","first-page":"2947","DOI":"10.1109\/TVCG.2018.2868591","article-title":"Revisiting trends in augmented reality research: A review of the 2nd decade of ismar (2008\u20132017)","volume":"24","author":"Kim","year":"2018","journal-title":"IEEE Transactions on Visualization and Computer Graphics"},{"key":"2026040113103210200_ref784","doi-asserted-by":"crossref","first-page":"193","DOI":"10.1109\/ISMAR.2008.4637362","article-title":"Trends in augmented reality tracking, interaction and display: A review of ten years of ismar","author":"Zhou","year":"2008","journal-title":"2008 7th IEEE\/ACM International Symposium on Mixed and Augmented Reality"},{"issue":"6","key":"2026040113103210200_ref785","doi-asserted-by":"crossref","first-page":"34","DOI":"10.1109\/38.963459","article-title":"Recent advances in augmented reality","volume":"21","author":"Azuma","year":"2001","journal-title":"IEEE Computer Graphics and Applications"},{"issue":"4","key":"2026040113103210200_ref786","doi-asserted-by":"crossref","first-page":"355","DOI":"10.1162\/pres.1997.6.4.355","article-title":"A survey of augmented reality","volume":"6","author":"Azuma","year":"1997","journal-title":"Presence: Teleoperators and Virtual Environments"},{"key":"2026040113103210200_ref787","first-page":"757","article-title":"A head-mounted three dimensional display","author":"Sutherland","year":"1968","journal-title":"Proceedings of the December 9\u201311, 1968, Fall Joint Computer Conference, Part I"},{"key":"2026040113103210200_ref788","doi-asserted-by":"crossref","first-page":"252","DOI":"10.1109\/VR.1999.756959","article-title":"A motion-stabilized outdoor augmented reality system","author":"Azuma","year":"1999","journal-title":"Proceedings IEEE Virtual Reality (Cat. No. 99CB36316)"},{"issue":"4","key":"2026040113103210200_ref789","doi-asserted-by":"crossref","first-page":"208","DOI":"10.1007\/BF01682023","article-title":"A touring machine: Prototyping 3D mobile augmented reality systems for exploring the urban environment","volume":"1","author":"Feiner","year":"1997","journal-title":"Personal Technologies"},{"key":"2026040113103210200_ref790","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1109\/ISWC.2001.962094","article-title":"Authoring of physical models using mobile computers","author":"Baillot","year":"2001","journal-title":"Proceedings Fifth International Symposium on Wearable Computers"},{"issue":"6","key":"2026040113103210200_ref791","doi-asserted-by":"crossref","first-page":"779","DOI":"10.1016\/S0097-8493(99)00103-X","article-title":"Exploring mars: Developing indoor and outdoor user interfaces to a mobile augmented reality system","volume":"23","author":"H\u00f6llerer","year":"1999","journal-title":"Computers and Graphics"},{"key":"2026040113103210200_ref792","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1109\/ISWC.2001.962093","article-title":"Tinmith-metro: New outdoor techniques for creating city models with an augmented reality wearable computer","author":"Piekarski","year":"2001","journal-title":"Proceedings Fifth International Symposium on Wearable Computers"},{"key":"2026040113103210200_ref793","doi-asserted-by":"crossref","first-page":"168","DOI":"10.1109\/ISWC.1998.729549","article-title":"A wearable computer system with augmented reality to support terrestrial navigation","author":"Thomas","year":"1998","journal-title":"Digest of Papers. Second International Symposium on Wearable Computers (Cat. No. 98EX215)"},{"key":"2026040113103210200_ref794","article-title":"Townwear: An outdoor wearable MR system with high-precision registration","author":"Satoh","year":"2001","journal-title":"Proc. 2nd Int. Symp. on Mixed Reality, 2001"},{"key":"2026040113103210200_ref795","doi-asserted-by":"crossref","first-page":"244","DOI":"10.1109\/VR.1999.756958","article-title":"Registration for outdoor augmented reality applications using computer vision techniques and hybrid sensors","author":"Behringer","year":"1999","journal-title":"Proceedings IEEE Virtual Reality (Cat. No. 99CB36316)"},{"issue":"6","key":"2026040113103210200_ref796","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1109\/MCG.2002.1046629","article-title":"Hybrid tracking for outdoor augmented reality applications","volume":"22","author":"Ribo","year":"2002","journal-title":"IEEE Computer Graphics and Applications"},{"key":"2026040113103210200_ref797","doi-asserted-by":"crossref","first-page":"48","DOI":"10.1109\/ISMAR.2004.24","article-title":"Combining edge and texture information for real-time accurate 3d camera tracking","author":"Vacchetti","year":"2004","journal-title":"Third IEEE and ACM International Symposium on Mixed and Augmented Reality"},{"key":"2026040113103210200_ref798","doi-asserted-by":"crossref","first-page":"113","DOI":"10.1109\/ISMAR.2003.1240694","article-title":"Robust visual tracking for non-instrumental augmented reality","author":"Klein","year":"2003","journal-title":"The Second IEEE and ACM International Symposium on Mixed and Augmented Reality, Proceedings"},{"key":"2026040113103210200_ref799","doi-asserted-by":"crossref","first-page":"38","DOI":"10.1109\/ISMAR.2004.54","article-title":"Sensor fusion and occlusion refinement for tablet-based AR","author":"Klein","year":"2004","journal-title":"Third IEEE and ACM International Symposium on Mixed and Augmented Reality"},{"key":"2026040113103210200_ref800","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1109\/ISMAR.2006.297801","article-title":"Going out: Robust model-based tracking for outdoor augmented reality","author":"Reitmayr","year":"2006","journal-title":"2006 IEEE\/ ACM International Symposium on Mixed and Augmented Reality"},{"key":"2026040113103210200_ref801","doi-asserted-by":"crossref","first-page":"559","DOI":"10.1145\/2047196.2047270","article-title":"Kinectfusion: Real-time 3D reconstruction and interaction using a moving depth camera","author":"Izadi","year":"2011","journal-title":"Proceedings of the 24th Annual ACM Symposium on User Interface Software and Technology"},{"key":"2026040113103210200_ref802","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1109\/ISMAR.2014.6948422","article-title":"Dense planar SLAM","author":"Salas-Moreno","year":"2014","journal-title":"2014 IEEE International Symposium on Mixed and Augmented Reality (ISMAR)"},{"issue":"14","key":"2026040113103210200_ref803","doi-asserted-by":"crossref","first-page":"1697","DOI":"10.1177\/0278364916669237","article-title":"Elasticfusion: Real-time dense SLAM and light source estimation","volume":"35","author":"Whelan","year":"2016","journal-title":"The International Journal of Robotics Research"},{"key":"2026040113103210200_ref804","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.compedu.2013.09.004","article-title":"Experimenting with electromagnetism using augmented reality: Impact on flow student experience and educational effectiveness","volume":"71","author":"Ib\u00e1\u00f1ez","year":"2014","journal-title":"Computers and Education"},{"key":"2026040113103210200_ref805","author":"Webcam-social-shopper, Zugara,"},{"key":"2026040113103210200_ref806","author":"IKEA Place Demo AR App, Inter IKEA Systems B.V."},{"key":"2026040113103210200_ref807","author":"Charlotte Tilbury Westfield London Magic Mirror, Holition,"},{"key":"2026040113103210200_ref808","author":"Pokemon Go, Niantic, The Pokemon Company,"},{"key":"2026040113103210200_ref809","author":"Take off to your next destination with Google Maps, Google,"},{"key":"2026040113103210200_ref810","author":"Zeman","year":"2011"},{"key":"2026040113103210200_ref811","doi-asserted-by":"crossref","first-page":"850","DOI":"10.1109\/CSCS.2015.106","article-title":"Architectural design of a real-time augmented feedback system for neuromotor rehabilitation","author":"Caraiman","year":"2015","journal-title":"2015 20th International Conference on Control Systems and Computer Science"},{"issue":"3","key":"2026040113103210200_ref812","doi-asserted-by":"crossref","first-page":"199","DOI":"10.1109\/TOH.2011.32","article-title":"Integrating haptics with augmented reality in a femoral palpation and needle insertion training simulation","volume":"4","author":"Coles","year":"2011","journal-title":"IEEE Transactions on Haptics"},{"issue":"6","key":"2026040113103210200_ref813","doi-asserted-by":"crossref","first-page":"1466","DOI":"10.1109\/TBME.2014.2385874","article-title":"Training for planning tumour resection: Augmented reality and human factors","volume":"62","author":"Abhari","year":"2014","journal-title":"IEEE Transactions on Biomedical Engineering"},{"key":"2026040113103210200_ref814","doi-asserted-by":"crossref","DOI":"10.1097\/BRS.0000000000001830","article-title":"Surgical navigation technology based on augmented reality and integrated 3D intraoperative imaging: A spine cadaveric feasibility and accuracy study","volume":"41","author":"Elmi-Terander","year":"2016","journal-title":"Spine"},{"issue":"2","key":"2026040113103210200_ref815","doi-asserted-by":"crossref","first-page":"918","DOI":"10.1109\/LRA.2019.2892199","article-title":"Dense-arthroSLAM: Dense intra-articular 3-d reconstruction with robust localization prior for arthroscopy","volume":"4","author":"Marmol","year":"2019","journal-title":"IEEE Robotics and Automation Letters"},{"key":"2026040113103210200_ref816","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1016\/j.cmpb.2018.02.006","article-title":"SLAM-based dense surface reconstruction in monocular minimally invasive surgery and its application to augmented reality","volume":"158","author":"Chen","year":"2018","journal-title":"Computer Methods and Programs in Biomedicine"},{"issue":"5","key":"2026040113103210200_ref817","doi-asserted-by":"crossref","first-page":"163","DOI":"10.1049\/htl.2017.0068","article-title":"Real-time geometry-aware augmented reality in minimally invasive surgery","volume":"4","author":"Chen","year":"2017","journal-title":"Healthcare Technology Letters"},{"key":"2026040113103210200_ref818","first-page":"85","article-title":"Marker tracking and HMD calibration for a video-based augmented reality conferencing system","author":"Kato","year":"1999","journal-title":"Proceedings 2nd IEEE and ACM International Workshop on Augmented Reality (IWAR\u201999)"},{"key":"2026040113103210200_ref819","author":"ArtoolkitX, Realmax Inc."},{"key":"2026040113103210200_ref820","author":"Vuforia Engine Developer Portal, PTC Inc."},{"key":"2026040113103210200_ref821","author":"ARCore, Google Developers,"},{"key":"2026040113103210200_ref822","author":"EasyAR, VisionStar Information Technology (Shanghai) Co., Ltd.,"},{"key":"2026040113103210200_ref823","author":"Glass, Google,"},{"key":"2026040113103210200_ref824","author":"HoloLens2, Microsoft"},{"key":"2026040113103210200_ref825","doi-asserted-by":"crossref","first-page":"1292","DOI":"10.1109\/ICRA.2016.7487261","article-title":"Comparative design space exploration of dense and semi-dense SLAM","author":"Zia","year":"2016","journal-title":"2016 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref826","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1145\/2967938.2967963","article-title":"Integrating algorithmic parameters into benchmarking and design space exploration in 3D scene understanding","author":"Bodin","year":"2016","journal-title":"Proceedings of the 2016 International Conference on Parallel Architectures and Compilation"},{"key":"2026040113103210200_ref827","doi-asserted-by":"crossref","first-page":"1434","DOI":"10.1109\/IPDPSW.2017.107","article-title":"Algorithmic performance-accuracy trade-off in 3d vision applications using hypermapper","author":"Nardi","year":"2017","journal-title":"2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)"},{"issue":"11","key":"2026040113103210200_ref828","doi-asserted-by":"crossref","first-page":"2020","DOI":"10.1109\/JPROC.2018.2856739","article-title":"Navigating the landscape for real-time localization and mapping for robotics and virtual and augmented reality","volume":"106","author":"Saeedi","year":"2018","journal-title":"Proceedings of the IEEE"},{"key":"2026040113103210200_ref829","doi-asserted-by":"crossref","first-page":"347","DOI":"10.1109\/MASCOTS.2019.00045","article-title":"Practical design space exploration","author":"Nardi","year":"2019","journal-title":"2019 IEEE 27th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS)"},{"issue":"1","key":"2026040113103210200_ref830","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1109\/TCSI.2004.840093","article-title":"A general-purpose processor-per-pixel analog simd vision chip","volume":"52","author":"Dudek","year":"2005","journal-title":"IEEE Transactions on Circuits and Systems I: Regular Papers"},{"key":"2026040113103210200_ref831","author":"Moini","year":"1999","journal-title":"Vision Chips"},{"key":"2026040113103210200_ref832","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4419-6475-5","volume-title":"Focal-Plane Sensor-Processor Chips","author":"Zar\u00e1ndy","year":"2011"},{"issue":"10","key":"2026040113103210200_ref833","doi-asserted-by":"crossref","first-page":"889","DOI":"10.1016\/j.sysarc.2013.03.016","article-title":"Low power high-performance smart camera system based on scamp vision sensor","volume":"59","author":"Carey","year":"2013","journal-title":"Journal of Systems Architecture"},{"key":"2026040113103210200_ref834","article-title":"Vision chips with in-pixel processors for high-performance low-power embedded vision systems","volume":"6","author":"Martel","year":"2016","journal-title":"ASR-MOV Workshop, CGO"},{"key":"2026040113103210200_ref835","article-title":"Analog vision-neural network inference acceleration using analog simd computation in the focal plane","author":"Wong","year":"2018"},{"issue":"10","key":"2026040113103210200_ref836","doi-asserted-by":"crossref","first-page":"1629","DOI":"10.1109\/5.58356","article-title":"Neuromorphic electronic systems","volume":"78","author":"Mead","year":"1990","journal-title":"Proceedings of the IEEE"},{"issue":"3","key":"2026040113103210200_ref837","first-page":"580","article-title":"Computational sensors\u2013vision vlsi","volume":"82","author":"Aizawa","year":"1999","journal-title":"IEICE Transactions on Information and Systems"},{"key":"2026040113103210200_ref838","doi-asserted-by":"crossref","DOI":"10.3390\/computation7040063","article-title":"Field programmable gate array applications\u2014A scientometric review","volume":"7","author":"Ruiz-Rosero","year":"2019","journal-title":"Computation"},{"key":"2026040113103210200_ref839","first-page":"1","article-title":"A survey of FPGA-based accelerators for convolutional neural networks","author":"Mittal","year":"2020","journal-title":"Neural Computing and Applications"},{"key":"2026040113103210200_ref840","doi-asserted-by":"crossref","first-page":"331","DOI":"10.1016\/j.sysarc.2019.01.007","article-title":"A survey and taxonomy of FPGA-based deep learning accelerators","volume":"98","author":"Blaiech","year":"2019","journal-title":"Journal of Systems Architecture"},{"key":"2026040113103210200_ref841","first-page":"1","article-title":"A survey of FPGA based deep learning accelerators: Challenges and opportunities","author":"Wang","year":"2018","journal-title":"arXiv preprint arXiv:1901.04988"},{"key":"2026040113103210200_ref842","doi-asserted-by":"crossref","first-page":"7823","DOI":"10.1109\/ACCESS.2018.2890150","article-title":"FPGA-based accelerators of deep learning networks for learning and classification: A review","volume":"7","author":"Shawahna","year":"2018","journal-title":"IEEE Access"},{"issue":"3","key":"2026040113103210200_ref843","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3186332","article-title":"Toolflows for mapping convolutional neural networks on FPGAs: A survey and future directions","volume":"51","author":"Venieris","year":"2018","journal-title":"ACM Computing Surveys (CSUR)"},{"issue":"5","key":"2026040113103210200_ref844","doi-asserted-by":"crossref","first-page":"38","DOI":"10.1109\/6.490055","article-title":"Neuromorphic vision chips","volume":"33","author":"Koch","year":"1996","journal-title":"IEEE Spectrum"},{"key":"2026040113103210200_ref845","doi-asserted-by":"crossref","DOI":"10.1007\/s11432-017-9303-0","article-title":"Neuromorphic vision chips","volume":"61","author":"Wu","year":"2018","journal-title":"Science China Information Sciences"},{"key":"2026040113103210200_ref846","article-title":"A survey of neuromorphic computing and neural networks in hardware","author":"Schuman","year":"2017","journal-title":"arXiv preprint arXiv:1705.06963"},{"key":"2026040113103210200_ref847","first-page":"1","article-title":"Event-based row-by-row multi-convolution engine for dynamic-vision feature extraction on fpga","author":"Tapiador-Morales","year":"2018","journal-title":"2018 International Joint Conference on Neural Networks (IJCNN)"},{"issue":"3","key":"2026040113103210200_ref848","doi-asserted-by":"crossref","first-page":"246","DOI":"10.1109\/TBCAS.2011.2174152","article-title":"Energy-efficient neuron, synapse and STDP integrated circuits","volume":"6","author":"Cruz-Albrecht","year":"2012","journal-title":"IEEE Transactions on Biomedical Circuits and Systems"},{"key":"2026040113103210200_ref849","first-page":"4176","article-title":"Spiking neural network on neuromorphic hardware for energy-efficient unidimensional SLAM","author":"Tang","year":"2019","journal-title":"2019 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)"},{"key":"2026040113103210200_ref850","first-page":"219","article-title":"Flyintel\u2014A platform for robot navigation based on a brain-inspired spiking neural network","author":"Huang-Yu","year":"2019","journal-title":"2019 IEEE International Conference on Artificial Intelligence Circuits and Systems (AICAS)"},{"key":"2026040113103210200_ref851","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1016\/j.neunet.2015.09.005","article-title":"A gpuaccelerated cortical neural network model for visually guided robot navigation","volume":"72","author":"Beyeler","year":"2015","journal-title":"Neural Networks"},{"key":"2026040113103210200_ref852","doi-asserted-by":"crossref","DOI":"10.3389\/fnbot.2016.00001","article-title":"Serendipitous offline learning in a neuromorphic robot","volume":"10","author":"Stewart","year":"2016","journal-title":"Frontiers in Neurorobotics"},{"key":"2026040113103210200_ref853","doi-asserted-by":"crossref","DOI":"10.3389\/fnins.2019.00095","article-title":"Going deeper in spiking neural networks: Vgg and residual architectures","volume":"13","author":"Sengupta","year":"2019","journal-title":"Frontiers in Neuroscience"},{"key":"2026040113103210200_ref854","first-page":"173","article-title":"Homeostasis-based CNN-to-SNN conversion of inception and residual architectures","author":"Xing","year":"2019","journal-title":"International Conference on Neural Information Processing"},{"issue":"1","key":"2026040113103210200_ref855","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1007\/s11263-014-0788-3","article-title":"Spiking deep convolutional neural networks for energy-efficient object recognition","volume":"113","author":"Cao","year":"2015","journal-title":"International Journal of Computer Vision"},{"key":"2026040113103210200_ref856","first-page":"567","article-title":"SUN RGB-D: A RGB-D scene understanding benchmark suite","author":"Song","year":"2015","journal-title":"Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition"},{"key":"2026040113103210200_ref857","first-page":"92","article-title":"SceneNN: A scene meshes dataset with annotations","author":"Hua","year":"2016","journal-title":"Proceedings\u20142016 4th International Conference on 3D Vision, 3DV 2016"},{"key":"2026040113103210200_ref858","doi-asserted-by":"crossref","first-page":"667","DOI":"10.1109\/3DV.2017.00081","article-title":"Matterport3D: Learning from RGB-D data in indoor environments","author":"Chang","year":"2017","journal-title":"2017 International Conference on 3D Vision (3DV)"},{"key":"2026040113103210200_ref859","first-page":"2784","article-title":"Disentangling human dynamics for pedestrian locomotion forecasting with noisy supervision","author":"Mangalam","year":"2020","journal-title":"The IEEE Winter Conference on Applications of Computer Vision"},{"key":"2026040113103210200_ref860","first-page":"340","article-title":"Sne-roadseg: Incorporating surface normal information into semantic segmentation for accurate freespace detection","author":"Fan","year":"2020","journal-title":"European Conference on Computer Vision"},{"key":"2026040113103210200_ref861","article-title":"Unsupervised domain adaptation for semantic segmentation of NIR images through generative latent search","author":"Pandey","year":"2020","journal-title":"European Conference on Computer Vision"},{"issue":"2","key":"2026040113103210200_ref862","doi-asserted-by":"crossref","first-page":"131","DOI":"10.1177\/0278364917695640","article-title":"Robot@Home, a robotic dataset for semantic mapping of home environments","volume":"36","author":"Ruiz-Sarmiento","year":"2017","journal-title":"International Journal of Robotics Research,"},{"key":"2026040113103210200_ref863","doi-asserted-by":"crossref","first-page":"82","DOI":"10.1016\/j.neucom.2018.03.037","article-title":"Methods and datasets on semantic segmentation: A review","volume":"304","author":"Yu","year":"2018","journal-title":"Neurocomputing"},{"key":"2026040113103210200_ref864","author":"mrgloom, Awesome Semantic Segmentation, GitHub"},{"key":"2026040113103210200_ref865","first-page":"2432","article-title":"ScanNet: Richly-annotated 3D reconstructions of indoor scenes","author":"Dai","year":"2017","journal-title":"Proceedings\u201430th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017"},{"issue":"1W1","key":"2026040113103210200_ref866","doi-asserted-by":"crossref","first-page":"91","DOI":"10.5194\/isprs-annals-IV-1-W1-91-2017","article-title":"Semantic3D.Net: A new large-scale point cloud classification benchmark","volume":"4","author":"Hackel","year":"2017","journal-title":"ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences"},{"key":"2026040113103210200_ref867","article-title":"Joint 2d-3dsemantic data for indoor scene understanding","author":"Armeni","year":"2017","journal-title":"arXiv preprint arXiv:1702.01105"},{"issue":"12","key":"2026040113103210200_ref868","doi-asserted-by":"crossref","first-page":"2724","DOI":"10.1109\/TKDE.2017.2754499","article-title":"Knowledge graph embedding: A survey of approaches and applications","volume":"29","author":"Wang","year":"2017","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"issue":"3","key":"2026040113103210200_ref869","first-page":"489","article-title":"Knowledge graph refinement: A survey of approaches and evaluation methods","volume":"8","author":"Paulheim","year":"2017","journal-title":"Semantic Web"},{"key":"2026040113103210200_ref870","article-title":"Owl web ontology language overview","volume":"10","author":"McGuinness","year":"2004","journal-title":"W3C Recommendation"},{"key":"2026040113103210200_ref871","article-title":"Owl-s: Semantic markup for web services","volume":"22","author":"Martin","year":"2004","journal-title":"W3C Member Submission"},{"key":"2026040113103210200_ref872","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.websem.2018.06.003","article-title":"Sosa: A lightweight ontology for sensors, observations, samples, and actuators","volume":"56","author":"Janowicz","year":"2019","journal-title":"Journal of Web Semantics"},{"issue":"11","key":"2026040113103210200_ref873","doi-asserted-by":"crossref","first-page":"1193","DOI":"10.1016\/j.robot.2013.04.005","article-title":"Towards a core ontology for robotics and automation","volume":"61","author":"Prestes","year":"2013","journal-title":"Robotics and Autonomous Systems"},{"issue":"3","key":"2026040113103210200_ref874","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1017\/S0269888904000050","article-title":"Using ontologies to aid navigation planning in autonomous vehicles","volume":"18","author":"Schlenoff","year":"2003","journal-title":"The Knowledge Engineering Review"},{"issue":"2","key":"2026040113103210200_ref875","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1016\/0926-5805(93)90007-K","article-title":"Map representation of a large in-door environment with path planning and navigation abilities for an autonomous mobile robot with its implementation on a real robot","volume":"2","author":"Habib","year":"1993","journal-title":"Automation in Construction"},{"issue":"1\u20132","key":"2026040113103210200_ref876","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1016\/j.robot.2004.07.016","article-title":"Knowledge representation and planning for on-road driving","volume":"49","author":"Balakirsky","year":"2004","journal-title":"Robotics and Autonomous Systems"},{"key":"2026040113103210200_ref877","first-page":"77","article-title":"Modeling ontologies for robotic environments","author":"Chella","year":"2002","journal-title":"Proceedings of the 14th International Conference on Software Engineering and Knowledge Engineering"},{"issue":"1\u20132","key":"2026040113103210200_ref878","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1016\/j.robot.2004.07.017","article-title":"How task analysis can be used to derive and organize the knowledge for the control of autonomous vehicles","volume":"49","author":"Barbera","year":"2004","journal-title":"Robotics and Autonomous Systems"},{"issue":"1\u20132","key":"2026040113103210200_ref879","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1016\/j.robot.2004.07.018","article-title":"Representation and purposeful autonomous agents","volume":"49","author":"Wood","year":"2004","journal-title":"Robotics and Autonomous Systems"},{"key":"2026040113103210200_ref880","first-page":"61","article-title":"Metaknowledge for autonomous systems","author":"Epstein","year":"2004","journal-title":"Proceedings of AAAI Spring Symposium on Knowledge Representation and Ontology for Autonomous Systems"},{"key":"2026040113103210200_ref881","article-title":"An ontology-based representation for policy-governed adjustable autonomy","author":"Jung","year":"2004","journal-title":"Proceedings of the AAAI Spring Symposium on Knowledge Representation and Ontology for Autonomous Systems"},{"key":"2026040113103210200_ref882","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1145\/1096961.1096965","article-title":"A robot ontology for urban search and rescue","author":"Schlenoff","year":"2005","journal-title":"Proceedings of the 2005 ACM Workshop on Research in Knowledge Representation for Autonomous Systems"},{"key":"2026040113103210200_ref883","article-title":"Control architecture concepts and properties of an ontology devoted to exchanges in mobile robotics","author":"Dhouib","year":"2011","journal-title":"6th National Conference on Control Architectures of Robots"},{"key":"2026040113103210200_ref884","first-page":"1","article-title":"An ontology of robotics science","author":"Hallam","year":"2006","journal-title":"European Robotics Symposium 2006"},{"issue":"1","key":"2026040113103210200_ref885","first-page":"17","article-title":"What is a knowledge representation?","volume":"14","author":"Davis","year":"1993","journal-title":"AI Magazine"},{"issue":"2","key":"2026040113103210200_ref886","doi-asserted-by":"crossref","first-page":"81","DOI":"10.1177\/1059712311421219","article-title":"A review of long-term memory in natural and synthetic systems","volume":"20","author":"Wood","year":"2012","journal-title":"Adaptive Behavior"},{"issue":"4","key":"2026040113103210200_ref887","doi-asserted-by":"crossref","first-page":"1005","DOI":"10.1109\/TCDS.2017.2754143","article-title":"Dac-h3: A proactive robot cognitive architecture to acquire and express knowledge about the world and the self","volume":"10","author":"Fischer","year":"2018","journal-title":"IEEE Transactions on Cognitive and Developmental Systems"},{"key":"2026040113103210200_ref888","first-page":"3548","article-title":"Oro, a knowledge management platform for cognitive architectures in robotics","author":"Lemaignan","year":"2010","journal-title":"2010 IEEE\/RSJ International Conference on Intelligent Robots and Systems"},{"key":"2026040113103210200_ref889","first-page":"4261","article-title":"Knowrob\u2014Knowledge processing for autonomous personal robots","author":"Tenorth","year":"2009","journal-title":"2009 IEEE\/RSJ International Conference on Intelligent Robots and Systems"},{"issue":"2","key":"2026040113103210200_ref890","doi-asserted-by":"crossref","first-page":"69","DOI":"10.1016\/j.icte.2018.04.008","article-title":"Ontology based knowledge representation technique, domain modeling languages and planners for robotic path planning: A survey","volume":"4","author":"Gayathri","year":"2018","journal-title":"ICT Express"},{"key":"2026040113103210200_ref891","doi-asserted-by":"crossref","first-page":"2278","DOI":"10.1109\/IROS.2005.1545511","article-title":"Multi-hierarchical semantic maps for mobile robotics","author":"Galindo","year":"2005","journal-title":"2005 IEEE\/RSJ International Conference on Intelligent Robots and Systems"},{"key":"2026040113103210200_ref892","article-title":"Neoclassic reference manual: Version 1.0","author":"Patel-Schneider","year":"1996"},{"issue":"6","key":"2026040113103210200_ref893","doi-asserted-by":"crossref","first-page":"493","DOI":"10.1016\/j.robot.2008.03.007","article-title":"Conceptual spatial representations for indoor mobile robots","volume":"56","author":"Zender","year":"2008","journal-title":"Robotics and Autonomous Systems"},{"key":"2026040113103210200_ref894","doi-asserted-by":"crossref","first-page":"430","DOI":"10.1109\/ICHR.2010.5686350","article-title":"Knowrob-mapknowledge-linked semantic object maps","author":"Tenorth","year":"2010","journal-title":"2010 10th IEEERAS International Conference on Humanoid Robots"},{"issue":"11","key":"2026040113103210200_ref895","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1145\/219717.219745","article-title":"Cyc: A large-scale investment in knowledge infrastructure","volume":"38","author":"Lenat","year":"1995","journal-title":"Communications of the ACM"},{"key":"2026040113103210200_ref896","first-page":"605","article-title":"Common sense data acquisition for indoor mobile robots","author":"Gupta","year":"2004","journal-title":"AAAI"},{"issue":"2","key":"2026040113103210200_ref897","doi-asserted-by":"crossref","first-page":"481","DOI":"10.1109\/TASE.2014.2329556","article-title":"Rapyuta: A cloud robotics platform","volume":"12","author":"Mohanarajah","year":"2014","journal-title":"IEEE Transactions on Automation Science and Engineering"},{"key":"2026040113103210200_ref898","doi-asserted-by":"crossref","first-page":"5589","DOI":"10.1109\/ICRA.2011.5980170","article-title":"Towards semantic robot description languages","author":"Kunze","year":"2011","journal-title":"2011 IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref899","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1016\/j.knosys.2016.12.016","article-title":"Building multiversal semantic maps for mobile robot operation","volume":"119","author":"Ruiz-Sarmiento","year":"2017","journal-title":"Knowledge-Based Systems"},{"key":"2026040113103210200_ref900","first-page":"1","article-title":"Semantic image search for robotic applications","author":"Kulvicius","year":"2013","journal-title":"22nd International Workshop on Robotics in Alpe-Adria-Danube Region"},{"key":"2026040113103210200_ref901","doi-asserted-by":"crossref","DOI":"10.1016\/j.ipm.2020.102299","article-title":"Healthaid: Extracting domain targeted high precision procedural knowledge from on-line communities","volume":"57","author":"Alemu","year":"2020","journal-title":"Information Processing and Management"},{"key":"2026040113103210200_ref902","doi-asserted-by":"crossref","first-page":"429","DOI":"10.1109\/HUMANOIDS.2012.6651555","article-title":"Things are made for what they are: Solving manipulation tasks by using functional object classes","author":"Leidner","year":"2012","journal-title":"2012 12th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2012)"},{"key":"2026040113103210200_ref903","article-title":"Ppddl1. 0: An extension to pddl for expressing planning domains with probabilistic effects","volume":"2","author":"Younes","year":"2004"},{"key":"2026040113103210200_ref904","doi-asserted-by":"crossref","DOI":"10.1109\/ICRA40945.2020.9197477","article-title":"Probabilistic effect prediction through semantic augmentation and physical simulation","author":"Bauer","year":"2020","journal-title":"2020 IEEE International Conference on Robotics and Automation (ICRA)"},{"issue":"2","key":"2026040113103210200_ref905","doi-asserted-by":"crossref","first-page":"58","DOI":"10.1109\/MRA.2011.940993","article-title":"Web-enabled robots","volume":"18","author":"Tenorth","year":"2011","journal-title":"IEEE Robotics and Automation Magazine"},{"key":"2026040113103210200_ref906","doi-asserted-by":"crossref","first-page":"529","DOI":"10.1109\/Humanoids.2011.6100855","article-title":"Robotic roommates making pancakes","author":"Beetz","year":"2011","journal-title":"2011 11th IEEE-RAS International Conference on Humanoid Robots"},{"key":"2026040113103210200_ref907","doi-asserted-by":"crossref","first-page":"557","DOI":"10.1109\/Humanoids.2011.6100812","article-title":"Generalizing objects by analyzing language","author":"Tamosiunaite","year":"2011","journal-title":"2011 11th IEEE-RAS International Conference on Humanoid Robots"},{"key":"2026040113103210200_ref908","article-title":"Cloud-enabled humanoid robots","author":"Kuffner","year":"2010","journal-title":"2010 10th IEEE-RAS International Conference on Humanoid Robots (Humanoids),"},{"key":"2026040113103210200_ref909","author":"Goldberg","year":"2002","journal-title":"Beyond Webcams: An Introduction to Online Robots"},{"key":"2026040113103210200_ref910","doi-asserted-by":"crossref","first-page":"654","DOI":"10.1109\/ROBOT.1995.525358","article-title":"Desktop teleoperation via the world wide web","volume":"1","author":"Goldberg","year":"1995","journal-title":"Proceedings of 1995 IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref911","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1007\/978-3-540-79142-3_4","article-title":"What is networked robotics?","author":"McKee","year":"2008","journal-title":"Informatics in Control Automation and Robotics"},{"issue":"5","key":"2026040113103210200_ref912","doi-asserted-by":"crossref","first-page":"456","DOI":"10.1109\/MDT.2004.68","article-title":"Open-source hardware","volume":"21","author":"Davidson","year":"2004","journal-title":"IEEE Design and Test of Computers"},{"issue":"7","key":"2026040113103210200_ref913","first-page":"97","article-title":"That \u201cinternet of things\u2019 thing","volume":"22","author":"Ashton","year":"2009","journal-title":"RFID Journal"},{"issue":"15","key":"2026040113103210200_ref914","doi-asserted-by":"crossref","first-page":"2787","DOI":"10.1016\/j.comnet.2010.05.010","article-title":"The internet of things: A survey","volume":"54","author":"Atzori","year":"2010","journal-title":"Computer Networks"},{"key":"2026040113103210200_ref915","article-title":"Cloud robotics and automation: A survey of related work","author":"Goldberg","year":"2013"},{"issue":"2","key":"2026040113103210200_ref916","doi-asserted-by":"crossref","first-page":"398","DOI":"10.1109\/TASE.2014.2376492","article-title":"A survey of research on cloud robotics and automation","volume":"12","author":"Kehoe","year":"2015","journal-title":"IEEE Transactions on Automation Science and Engineering"},{"key":"2026040113103210200_ref917","first-page":"2797","article-title":"Cloud robotics: Current status and open issues","volume":"4","author":"Wan","year":"2016","journal-title":"IEEE Access"},{"issue":"3","key":"2026040113103210200_ref918","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1109\/MNET.2012.6201212","article-title":"Cloud robotics: Architecture, challenges and applications","volume":"26","author":"Hu","year":"2012","journal-title":"IEEE Network"},{"key":"2026040113103210200_ref919","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1109\/ROBOT.2010.5509307","article-title":"Raoblackwellized particle filters multi robot SLAM with unknown initial correspondences and limited communication","author":"Carlone","year":"2010","journal-title":"2010 IEEE International Conference on Robotics and Automation"},{"key":"2026040113103210200_ref920","first-page":"176","article-title":"Raoblackwellised particle filtering for dynamic Bayesian networks","author":"Doucet","year":"2000","journal-title":"Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence"},{"key":"2026040113103210200_ref921","doi-asserted-by":"crossref","first-page":"1979","DOI":"10.4018\/978-1-7998-1754-3.ch094","article-title":"An approach towards survey and analysis of cloud robotics","author":"Chowdhury","year":"2020","journal-title":"Robotic Systems: Concepts, Methodologies, Tools, and Applications"},{"key":"2026040113103210200_ref922","doi-asserted-by":"crossref","DOI":"10.3390\/robotics7030047","article-title":"A comprehensive survey of recent trends in cloud robotics architectures and applications","volume":"7","author":"Saha","year":"2018","journal-title":"Robotics"},{"key":"2026040113103210200_ref923","doi-asserted-by":"crossref","first-page":"36 662","DOI":"10.1109\/ACCESS.2018.2852295","article-title":"A study of robotic cooperation in cloud robotics: Architecture and challenges","volume":"6","author":"Chen","year":"2018","journal-title":"IEEE Access"},{"key":"2026040113103210200_ref924","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-642-41610-1_1-1","article-title":"Service-oriented software architecture for cloud robotics","author":"Koub\u00e2a","year":"2020","journal-title":"Encyclopedia of Robotics."},{"key":"2026040113103210200_ref925","article-title":"Network offloading policies for cloud robotics: A learning-based approach","author":"Chinchali","year":"2019","journal-title":"Robotics: Science and Systems"},{"key":"2026040113103210200_ref926","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1016\/j.robot.2019.06.003","article-title":"Cloud robotics law and regulation: Challenges in the governance of complex and dynamic cyber\u2013physical ecosystems","volume":"119","author":"Fosch-Villaronga","year":"2019","journal-title":"Robotics and Autonomous Systems"},{"key":"2026040113103210200_ref927","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1109\/MobileCloud.2019.00010","article-title":"Cloud, fog, and dew robotics: Architectures for next generation applications","author":"Botta","year":"2019","journal-title":"2019 7th IEEE International Conference on Mobile Cloud Computing, Services, and Engineering (MobileCloud)"},{"key":"2026040113103210200_ref928","first-page":"225","article-title":"Parallel tracking and mapping for small AR workspaces","author":"Klein","year":"2007","journal-title":"2007 6th IEEE and ACM International Symposium on Mixed and Augmented Reality"},{"key":"2026040113103210200_ref929","article-title":"CodeSLAM\u2014learning a compact, optimisable representation for dense visual SLAM","author":"Bloesch","year":"2018","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition."},{"issue":"4","key":"2026040113103210200_ref930","doi-asserted-by":"crossref","first-page":"401","DOI":"10.1016\/j.robot.2013.11.007","article-title":"C2tam: A cloud framework for cooperative tracking and mapping","volume":"62","author":"Riazuelo","year":"2014","journal-title":"Robotics and Autonomous Systems"},{"key":"2026040113103210200_ref931","doi-asserted-by":"crossref","first-page":"147","DOI":"10.1109\/ICWS.2018.00026","article-title":"Cloud-based framework for scalable and real-time multi-robot SLAM","author":"Zhang","year":"2018","journal-title":"2018 IEEE International Conference on Web Services (ICWS)"},{"key":"2026040113103210200_ref932","first-page":"729","article-title":"Multi robot object-based SLAM","author":"Choudhary","year":"2016","journal-title":"International Symposium on Experimental Robotics"},{"key":"2026040113103210200_ref933","first-page":"3185","article-title":"Multiple relative pose graphs for robust cooperative mapping","author":"Kim","year":"2010","journal-title":"2010 IEEE International Conference on Robotics and Automation"},{"issue":"7","key":"2026040113103210200_ref934","doi-asserted-by":"crossref","first-page":"1325","DOI":"10.1109\/JPROC.2006.876927","article-title":"Distributed multirobot exploration and mapping","volume":"94","author":"Fox","year":"2006","journal-title":"Proceedings of the IEEE"},{"key":"2026040113103210200_ref935","doi-asserted-by":"crossref","first-page":"17 215","DOI":"10.1109\/ACCESS.2018.2814606","article-title":"A reinforcement learning-based resource allocation scheme for cloud robotics","volume":"6","author":"Liu","year":"2018","journal-title":"IEEE Access"},{"key":"2026040113103210200_ref936","doi-asserted-by":"crossref","first-page":"13 810","DOI":"10.1109\/ACCESS.2018.2811762","article-title":"Dronetrack: Cloud-based real-time object tracking using unmanned aerial vehicles over the internet","volume":"6","author":"Koub\u00e2a","year":"2018","journal-title":"IEEE Access"},{"key":"2026040113103210200_ref937","doi-asserted-by":"crossref","first-page":"46","DOI":"10.1016\/j.adhoc.2018.09.013","article-title":"Dronemap planner: A service-oriented cloud-based management system for the internet-of-drones","volume":"86","author":"Koub\u00e2a","year":"2019","journal-title":"Ad Hoc Networks"},{"key":"2026040113103210200_ref938","first-page":"1151","article-title":"Fast-SLAM 2.0: An improved particle filtering algorithm for simultaneous localization and mapping that provably converges","author":"Montemerlo","year":"2003","journal-title":"IJCAI"},{"key":"2026040113103210200_ref939","doi-asserted-by":"crossref","first-page":"412","DOI":"10.1016\/j.compeleceng.2017.11.012","article-title":"Fastslam 2.0 tracking and mapping as a cloud robotics service","volume":"69","author":"Ali","year":"2018","journal-title":"Computers and Electrical Engineering"},{"issue":"2","key":"2026040113103210200_ref940","doi-asserted-by":"crossref","first-page":"319","DOI":"10.1007\/s10489-018-1277-0","article-title":"Cloud robot: Semantic map building for intelligent service task","volume":"49","author":"Wu","year":"2019","journal-title":"Applied Intelligence"},{"key":"2026040113103210200_ref941","author":"Apache CloudStack, The Apache Software Foundation"},{"key":"2026040113103210200_ref942","article-title":"Decentralized robot-cloud architecture for an autonomous transportation system in a smart factory","author":"Martin","year":"2017","journal-title":"SEMANTICS Workshops"},{"key":"2026040113103210200_ref943","article-title":"From sensor to processing networks: Optimal estimation with computation and communication latency","author":"Ballotta","year":"2020","journal-title":"21st IFAC World Congress"},{"key":"2026040113103210200_ref944","doi-asserted-by":"crossref","first-page":"148","DOI":"10.1016\/j.neunet.2019.09.010","article-title":"Tree-CNN: A hierarchical deep convolutional neural network for incremental learning","volume":"121","author":"Roy","year":"2020","journal-title":"Neural Networks"},{"key":"2026040113103210200_ref945","first-page":"1131","article-title":"Class-incremental learning via deep model consolidation","author":"Zhang","year":"2020","journal-title":"The IEEE Winter Conference on Applications of Computer Vision"},{"key":"2026040113103210200_ref946","doi-asserted-by":"crossref","DOI":"10.1109\/ICRA40945.2020.9196827","article-title":"Fast, compact and highly scalable visual place recognition through sequence-based matching of overloaded representations","author":"Garg","year":"2020","journal-title":"2020 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref947","article-title":"Pre-training tasks for embedding-based large-scale retrieval","author":"Chang","year":"2020","journal-title":"2020 International Conference on Learning Representations (ICLR)"},{"key":"2026040113103210200_ref948","doi-asserted-by":"crossref","DOI":"10.1109\/ICRA40945.2020.9197383","article-title":"Efficient covisibility-based image matching for large-scale sfm","author":"Ye","year":"2020","journal-title":"2020 IEEE International Conference on Robotics and Automation (ICRA)"},{"key":"2026040113103210200_ref949","doi-asserted-by":"crossref","DOI":"10.1016\/j.patcog.2020.107537","article-title":"Graph-based parallel large scale structure from motion","author":"Chen","year":"2020","journal-title":"Pattern Recognition"},{"key":"2026040113103210200_ref950","article-title":"One ring to rule them all: Certifiably robust geometric perception with outliers","author":"Yang","year":"2020","journal-title":"Advances in Neural Information Processing Systems"},{"issue":"2","key":"2026040113103210200_ref951","doi-asserted-by":"crossref","first-page":"1127","DOI":"10.1109\/LRA.2020.2965893","article-title":"Graduated non-convexity for robust spatial perception: From non-minimal solvers to global outlier rejection","volume":"5","author":"Yang","year":"2020","journal-title":"IEEE Robotics and Automation Letters"},{"key":"2026040113103210200_ref952","article-title":"Robotic services for new paradigm smart cities based on decentralized technologies","volume":"4","author":"Kapitonov","year":"2019","journal-title":"Ledger"},{"key":"2026040113103210200_ref953","doi-asserted-by":"crossref","first-page":"159","DOI":"10.1016\/j.future.2017.03.034","article-title":"Smart city and IoT","volume":"76","author":"Kim","year":"2017","journal-title":"Future Generation Computer Systems"},{"issue":"1535","key":"2026040113103210200_ref954","doi-asserted-by":"crossref","first-page":"3527","DOI":"10.1098\/rstb.2009.0157","article-title":"Role of expressive behaviour for robots that learn from people","volume":"364","author":"Breazeal","year":"2009","journal-title":"Philosophical Transactions of the Royal Society B: Biological Sciences"},{"key":"2026040113103210200_ref955","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1016\/j.robot.2018.10.006","article-title":"A 2 ML: A general human-inspired motion language for anthropomorphic arms based on movement primitives","volume":"111","author":"Fang","year":"2019","journal-title":"Robotics and Autonomous Systems"},{"key":"2026040113103210200_ref956","doi-asserted-by":"crossref","DOI":"10.1007\/s12369-009-0034-2","article-title":"Towards the development of emotional dancing humanoid robots","volume":"1","author":"Or","year":"2009","journal-title":"International Journal of Social Robotics"},{"key":"2026040113103210200_ref957","first-page":"1","article-title":"Tonight we improvise!: Real-time tracking for human-robot improvisational dance","author":"Jochum","year":"2019","journal-title":"Proceedings of the 6th International Conference on Movement and Computing"},{"key":"2026040113103210200_ref958","doi-asserted-by":"crossref","first-page":"2203","DOI":"10.1109\/IROS.2004.1389736","article-title":"Effective emotional expressions with expression humanoid robot we-4rii: Integration of humanoid robot hand rch-1","volume":"3","author":"Miwa","year":"2004","journal-title":"2004 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No. 04CH37566)"},{"key":"2026040113103210200_ref959","doi-asserted-by":"crossref","first-page":"263","DOI":"10.1109\/FG.2018.00046","article-title":"Emotion-preserving representation learning via generative adversarial network for multi-view facial expression recognition","author":"Lai","year":"2018","journal-title":"2018 13th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2018)"},{"key":"2026040113103210200_ref960","first-page":"328","article-title":"Expression of emotions by a service robot: A pilot study","author":"Giambattista","year":"2016","journal-title":"International Conference of Design, User Experience, and Usability"},{"key":"2026040113103210200_ref961","article-title":"Robot navigation in unseen spaces using an abstract map","author":"Talbot","year":"2020","journal-title":"IEEE Transactions on Cognitive and Developmental Systems"},{"key":"2026040113103210200_ref962","doi-asserted-by":"crossref","DOI":"10.1109\/ICRA40945.2020.9196563","article-title":"Object-oriented semantic graph based natural question generation","author":"Moon","year":"2020","journal-title":"2020 IEEE International Conference on Robotics and Automation (ICRA)"}],"container-title":["Foundations and Trends\u00ae in Robotics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/ftrob\/article-pdf\/8\/1-2\/1\/10906161\/2300000059en.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/www.emerald.com\/ftrob\/article-pdf\/8\/1-2\/1\/10906161\/2300000059en.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T18:22:17Z","timestamp":1777486937000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.emerald.com\/ftrob\/article\/8\/1-2\/1\/1321389\/Semantics-for-Robotic-Mapping-Perception-and"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,12,23]]},"references-count":962,"journal-issue":{"issue":"1-2","published-print":{"date-parts":[[2020,12,23]]}},"URL":"https:\/\/doi.org\/10.1561\/2300000059","relation":{},"ISSN":["1935-8253","1935-8261"],"issn-type":[{"value":"1935-8253","type":"print"},{"value":"1935-8261","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,12,23]]}}}