{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,10]],"date-time":"2026-04-10T18:00:38Z","timestamp":1775844038668,"version":"3.50.1"},"reference-count":57,"publisher":"Institution of Engineering and Technology (IET)","issue":"13","license":[{"start":{"date-parts":[[2019,10,21]],"date-time":"2019-10-21T00:00:00Z","timestamp":1571616000000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/onlinelibrary.wiley.com\/termsAndConditions#vor"}],"content-domain":{"domain":["ietresearch.onlinelibrary.wiley.com"],"crossmark-restriction":true},"short-container-title":["IET Image Processing"],"published-print":{"date-parts":[[2019,11]]},"abstract":"<jats:p>In this study, the authors propose a framework SUGAMAN (Supervised and Unified framework using Grammar and Annotation Model for Access and Navigation). SUGAMAN is a Hindi word meaning \u2018easy passage from one place to another\u2019. SUGAMAN synthesises textual description from a given floor plan image, usable by visually impaired to navigate by understanding the arrangement of rooms and furniture. It is the first framework for describing a floor plan and giving direction for obstacle\u2010free movement within a building. The model learns five classes of room categories from 1355 room image samples under a supervised learning paradigm. These learned annotations are fed into a description synthesis framework to yield a holistic description of a floor plan image. Authors demonstrate the performance of various supervised classifiers on room learning and provided a comparative analysis of system generated and human\u2010written descriptions. The contribution of this study includes a novel framework for description generation from document images with graphics while proposing a new feature representing the floor plans, text annotations for a publicly available data set, and an algorithm for door to door obstacle avoidance navigation. This work can be applied to areas like understanding floor plans and design of historical monuments, and retrieval.<\/jats:p>","DOI":"10.1049\/iet-ipr.2018.5627","type":"journal-article","created":{"date-parts":[[2019,10,7]],"date-time":"2019-10-07T22:42:44Z","timestamp":1570488164000},"page":"2623-2635","update-policy":"https:\/\/doi.org\/10.1002\/crossmark_policy","source":"Crossref","is-referenced-by-count":12,"title":["SUGAMAN: describing floor plans for visually impaired by annotation learning and proximity\u2010based grammar"],"prefix":"10.1049","volume":"13","author":[{"given":"Shreya","family":"Goyal","sequence":"first","affiliation":[{"name":"Department of Computer Science and Engineering Indian Institute of Technology Jodhpur India"}]},{"given":"Satya","family":"Bhavsar","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering Indian Institute of Technology Jodhpur India"}]},{"given":"Shreya","family":"Patel","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering Indian Institute of Technology Jodhpur India"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3431-0483","authenticated-orcid":false,"given":"Chiranjoy","family":"Chattopadhyay","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering Indian Institute of Technology Jodhpur India"}]},{"given":"Gaurav","family":"Bhatnagar","sequence":"additional","affiliation":[{"name":"Department of Mathematics Indian Institute of Technology Jodhpur India"}]}],"member":"265","published-online":{"date-parts":[[2019,10,21]]},"reference":[{"key":"e_1_2_11_2_1","unstructured":"1997 IWGR Nancy France A.K. Chhabra Graphic symbol recognition: an overview 68 79"},{"key":"e_1_2_11_3_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-84996-208-7_2"},{"issue":"3","key":"e_1_2_11_4_1","first-page":"391","article-title":"Isolating symbols from connection lines in a class of engineering drawings","volume":"27","author":"Yu Y.","year":"1994","journal-title":"PR"},{"key":"e_1_2_11_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.161351"},{"key":"e_1_2_11_6_1","unstructured":"1991 The Int. Conf. on Document Analysis and Recognition (ICDAR) St. Malo France C. Lai R. Kasturi Detection of dashed lines in engineering drawings and maps"},{"key":"e_1_2_11_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.677283"},{"key":"e_1_2_11_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.1995.602052"},{"key":"e_1_2_11_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.9112"},{"key":"e_1_2_11_10_1","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-45869-7_24"},{"key":"e_1_2_11_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.584100"},{"key":"e_1_2_11_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/356625.356627"},{"key":"e_1_2_11_13_1","unstructured":"2007 GREC Curitiba Brazi R.J. Qureshi J.\u2010Y. Ramel D. Barret Spotting symbols in line drawing images using graph representations 91 103"},{"issue":"3","key":"e_1_2_11_14_1","first-page":"752","article-title":"A symbol spotting approach in graphical documents by hashing serialized graphs","volume":"46","author":"Dutta A.","year":"2013","journal-title":"PR"},{"key":"e_1_2_11_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2001.990517"},{"key":"e_1_2_11_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP.2002.1038171"},{"key":"e_1_2_11_17_1","unstructured":"1998 Machine Learning: ECML\u201098 Chemnitz Germany T. Joachims Text categorization with support vector machines: learning with many relevant features 137 142"},{"key":"e_1_2_11_18_1","unstructured":"2016 Workshop on Document Analysis Systems Santorini Greece V. Yadav N. Ragot Text extraction in document images: highlight on using corner points"},{"key":"e_1_2_11_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2008.275"},{"key":"e_1_2_11_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2011.153"},{"key":"e_1_2_11_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2011.177"},{"key":"e_1_2_11_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/DAS.2012.22"},{"key":"e_1_2_11_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSMC.2012.6377689"},{"key":"e_1_2_11_24_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10032-013-0215-2"},{"key":"e_1_2_11_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/1815330.1815352"},{"key":"e_1_2_11_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2002.1017623"},{"key":"e_1_2_11_27_1","unstructured":"2018 2018 IEEE 14th Int. Colloquium on Signal Processing & Its Applications (CSPA) Penang Malaysia S. Goyal C. Chattopadhyay G. Bhatnagar Plan2text: A framework for describing building floor plan images from first person perspective"},{"key":"e_1_2_11_28_1","doi-asserted-by":"publisher","DOI":"10.1613\/jair.4900"},{"key":"e_1_2_11_29_1","unstructured":"2015 Computer Vision and Pattern Recognition (CVPR) Boston USA O. Vinyals A. Toshev S. Bengio Show and tell: A neural image caption generator"},{"key":"e_1_2_11_30_1","unstructured":"2015 Int. Conf. on Machine Learning (ICML) Lille France K. Xu J. Ba R. Kiros Show attend and tell: neural image caption generation with visual attention"},{"key":"e_1_2_11_31_1","volume-title":"Collective generation of natural image descriptions","author":"Kuznetsova P.","year":"2012"},{"key":"e_1_2_11_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298932"},{"key":"e_1_2_11_33_1","unstructured":"2014 Neural Information Processing Systems (NIPS) Montreal Canada A. Karpathy A. Joulin L. Fei Fei Deep fragment embeddings for bidirectional image sentence mapping"},{"key":"e_1_2_11_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/2479787.2479801"},{"key":"e_1_2_11_35_1","unstructured":"2011 Computer Vision and Pattern Recognition (CVPR) Colorado Springs USA G. Kulkarni V. Premraj S. Dhar Baby talk: understanding and generating image descriptions"},{"key":"e_1_2_11_36_1","unstructured":"2010 European Conf. on Computer Vision (ECCV) Crete Greece A. Farhadi M. Hejrati M.A. Sadeghi Every picture tells a story: generating sentences from images"},{"key":"e_1_2_11_37_1","unstructured":"2014 British Machine Vision Conf. (BMVC) Nottingham UK Y. Verma C. Jawahar Im2text and text2im: associating images and texts for cross\u2010modal retrieval 2"},{"key":"e_1_2_11_38_1","unstructured":"2015 Int. Conf. on Computer Vision (ICCV) Santiago Chile Y. Zhu R. Kiros R. Zemel Aligning books and movies: towards story\u2010like visual explanations by watching movies and reading books"},{"key":"e_1_2_11_39_1","doi-asserted-by":"crossref","unstructured":"2013 The 2013 Conf. on Empirical Methods in Natural Language Processing (EMNLP) Seattle USA D. Elliott F. Keller Image description using visual dependency representations 1292 1302","DOI":"10.18653\/v1\/D13-1128"},{"key":"e_1_2_11_40_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00138-017-0825-7"},{"key":"e_1_2_11_41_1","volume-title":"BLEU: a method for automatic evaluation of machine translation","author":"Papineni K.","year":"2002"},{"key":"e_1_2_11_42_1","volume-title":"ROUGE: A package for automatic evaluation of summaries","author":"Lin C.\u2010Y.","year":"2004"},{"key":"e_1_2_11_43_1","unstructured":"2011 Proc. of the 6th Workshop on Statistical Machine Translation Edinburgh Scotland UK July 30\u201331 M. Denkowski A. Lavie Meteor 1.3: automatic metric for reliable optimization and evaluation of machine translation systems 85 91"},{"key":"e_1_2_11_44_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P14-2074"},{"key":"e_1_2_11_45_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10032-015-0247-x"},{"key":"e_1_2_11_46_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10032-010-0120-x"},{"key":"e_1_2_11_47_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10032-014-0236-5"},{"key":"e_1_2_11_48_1","unstructured":"2017 The Int. Conf. on Document Analysis and Recognition (ICDAR) Kyoto Japan D. Sharma N. Gupta C. Chattopadhyay DANIEL: a deep architecture for automatic analysis and retrieval of building floor plans"},{"key":"e_1_2_11_49_1","doi-asserted-by":"publisher","DOI":"10.1613\/jair.3994"},{"key":"e_1_2_11_50_1","unstructured":"ChenX. FangH. andLinT.\u2010Y.et al: \u2018Microsoft coco captions: data collection and evaluation server\u2019 arXiv preprint arXiv:1504.00325 2015"},{"key":"e_1_2_11_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPR.2016.7899999"},{"key":"e_1_2_11_52_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:VISI.0000029664.99615.94"},{"key":"e_1_2_11_53_1","unstructured":"2006 European Conf. on Computer Vision (ECCV) Graz Austria H. Bay T. Tuytelaars L. Van Gool Surf: speeded up robust features 404 417"},{"issue":"2","key":"e_1_2_11_54_1","first-page":"313","article-title":"Building a large annotated corpus of English: the penn treebank","volume":"19","author":"Marcus M.P.","year":"1993","journal-title":"Comput. Linguist."},{"key":"e_1_2_11_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.1962.1057692"},{"key":"e_1_2_11_56_1","doi-asserted-by":"publisher","DOI":"10.5194\/isprsarchives-XLI-B4-339-2016"},{"key":"e_1_2_11_57_1","unstructured":"2011 Proc. GiDM 2011 Antalya Turkey L. Liu S. Zlatanova A\u2019 door\u2010to\u2010door\u2019 path\u2010finding approach for indoor navigation"},{"key":"e_1_2_11_58_1","doi-asserted-by":"publisher","DOI":"10.1145\/2533810.2533819"}],"container-title":["IET Image Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1049\/iet-ipr.2018.5627","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/full-xml\/10.1049\/iet-ipr.2018.5627","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/ietresearch.onlinelibrary.wiley.com\/doi\/pdf\/10.1049\/iet-ipr.2018.5627","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,28]],"date-time":"2025-10-28T12:28:19Z","timestamp":1761654499000},"score":1,"resource":{"primary":{"URL":"https:\/\/ietresearch.onlinelibrary.wiley.com\/doi\/10.1049\/iet-ipr.2018.5627"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,10,21]]},"references-count":57,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2019,11]]}},"alternative-id":["10.1049\/iet-ipr.2018.5627"],"URL":"https:\/\/doi.org\/10.1049\/iet-ipr.2018.5627","archive":["Portico"],"relation":{},"ISSN":["1751-9659","1751-9667"],"issn-type":[{"value":"1751-9659","type":"print"},{"value":"1751-9667","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,10,21]]},"assertion":[{"value":"2018-06-05","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2019-09-04","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2019-10-21","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}