{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,8]],"date-time":"2026-02-08T06:27:38Z","timestamp":1770532058678,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":33,"publisher":"ACM","license":[{"start":{"date-parts":[[2018,10,15]],"date-time":"2018-10-15T00:00:00Z","timestamp":1539561600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"National Natural Science Foundation of China","award":["61433018"],"award-info":[{"award-number":["61433018"]}]},{"name":"National Natural Science Foundation of China","award":["61375027"],"award-info":[{"award-number":["61375027"]}]},{"name":"National Key Research and Development Plan","award":["2016YFB1001200"],"award-info":[{"award-number":["2016YFB1001200"]}]},{"name":"the Innovation Method Fund of China","award":["2016IM010200"],"award-info":[{"award-number":["2016IM010200"]}]},{"name":"National Natural Science Foundation of China-Research Grant Council of Hong Kong (RGC) joint fund","award":["61531166002 N_CUHK404\/15"],"award-info":[{"award-number":["61531166002 N_CUHK404\/15"]}]},{"name":"National Social Science Foundation of China","award":["3&ZD189"],"award-info":[{"award-number":["3&ZD189"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2018,10,15]]},"DOI":"10.1145\/3240508.3240575","type":"proceedings-article","created":{"date-parts":[[2018,10,18]],"date-time":"2018-10-18T17:52:08Z","timestamp":1539885128000},"page":"136-144","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":20,"title":["Inferring User Emotive State Changes in Realistic Human-Computer Conversational Dialogs"],"prefix":"10.1145","author":[{"given":"Runnan","family":"Li","sequence":"first","affiliation":[{"name":"Tsinghua University, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhiyong","family":"Wu","sequence":"additional","affiliation":[{"name":"Tsinghua University, Shenzhen, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jia","family":"Jia","sequence":"additional","affiliation":[{"name":"Tsinghua University, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jingbei","family":"Li","sequence":"additional","affiliation":[{"name":"Tsinghua University, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wei","family":"Chen","sequence":"additional","affiliation":[{"name":"Sogou, Inc., Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Helen","family":"Meng","sequence":"additional","affiliation":[{"name":"The Chinese University of Hong Kong, Hong Kong, Hong Kong"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2018,10,15]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Mart'in Abadi and Ashish Agarwal et al. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. (2015). https:\/\/www.tensorflow.org\/ Software available from tensorflow.org.  Mart'in Abadi and Ashish Agarwal et al. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. (2015). https:\/\/www.tensorflow.org\/ Software available from tensorflow.org."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"crossref","unstructured":"Ebru Arisoy and Murat Saraclar. 2016. Compositional Neural Network Language Models for Agglutinative Languages.. In INTERSPEECH. 3494--3498.  Ebru Arisoy and Murat Saraclar. 2016. Compositional Neural Network Language Models for Agglutinative Languages.. In INTERSPEECH. 3494--3498.","DOI":"10.21437\/Interspeech.2016-1239"},{"key":"e_1_3_2_1_3_1","first-page":"203","article-title":"Support vector regression","volume":"11","author":"Basak Debasish","year":"2007","unstructured":"Debasish Basak , Srimanta Pal , and Dipak Chandra Patranabis . 2007 . Support vector regression . Neural Information Processing-Letters and Reviews , Vol. 11 , 10 (2007), 203 -- 224 . Debasish Basak, Srimanta Pal, and Dipak Chandra Patranabis. 2007. Support vector regression. Neural Information Processing-Letters and Reviews, Vol. 11, 10 (2007), 203--224.","journal-title":"Neural Information Processing-Letters and Reviews"},{"key":"e_1_3_2_1_4_1","volume-title":"IEMOCAP: Interactive emotional dyadic motion capture database. Language resources and evaluation","author":"Busso Carlos","year":"2008","unstructured":"Carlos Busso , Murtaza Bulut , Chi-Chun Lee , Abe Kazemzadeh , Emily Mower , Samuel Kim , Jeannette N Chang , Sungbok Lee , and Shrikanth S Narayanan . 2008 . IEMOCAP: Interactive emotional dyadic motion capture database. Language resources and evaluation , Vol. 42 , 4 (2008), 335. Carlos Busso, Murtaza Bulut, Chi-Chun Lee, Abe Kazemzadeh, Emily Mower, Samuel Kim, Jeannette N Chang, Sungbok Lee, and Shrikanth S Narayanan. 2008. IEMOCAP: Interactive emotional dyadic motion capture database. Language resources and evaluation, Vol. 42, 4 (2008), 335."},{"key":"e_1_3_2_1_5_1","volume-title":"Learning to learn","author":"Caruana Rich","unstructured":"Rich Caruana . 1998. Multitask learning . In Learning to learn . Springer , 95--133. Rich Caruana. 1998. Multitask learning. In Learning to learn. Springer, 95--133."},{"key":"e_1_3_2_1_6_1","volume-title":"mbox","author":"Franccois Chollet","year":"2015","unstructured":"Franccois Chollet et al mbox . 2015 . Keras . https:\/\/github.com\/fchollet\/keras. (2015). Franccois Chollet et almbox. 2015. Keras. https:\/\/github.com\/fchollet\/keras. (2015)."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"crossref","unstructured":"Christoph Feichtenhofer Axel Pinz and AP Zisserman. 2016. Convolutional two-stream network fusion for video action recognition. (2016).  Christoph Feichtenhofer Axel Pinz and AP Zisserman. 2016. Convolutional two-stream network fusion for video action recognition. (2016).","DOI":"10.1109\/CVPR.2016.213"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.293"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2015.7178872"},{"key":"e_1_3_2_1_10_1","volume-title":"Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980","author":"Kingma Diederik P","year":"2014","unstructured":"Diederik P Kingma and Jimmy Ba . 2014 . Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014). Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)."},{"key":"e_1_3_2_1_11_1","volume-title":"mbox","author":"Ron Kohavi","year":"1995","unstructured":"Ron Kohavi et al mbox . 1995 . A study of cross-validation and bootstrap for accuracy estimation and model selection. In Ijcai, Vol. 14 . Montreal, Canada , 1137--1145. Ron Kohavi et almbox. 1995. A study of cross-validation and bootstrap for accuracy estimation and model selection. In Ijcai, Vol. 14. Montreal, Canada, 1137--1145."},{"key":"e_1_3_2_1_12_1","volume-title":"A concordance correlation coefficient to evaluate reproducibility. Biometrics","author":"Lawrence I","year":"1989","unstructured":"I Lawrence and Kuei Lin . 1989. A concordance correlation coefficient to evaluate reproducibility. Biometrics ( 1989 ), 255--268. I Lawrence and Kuei Lin. 1989. A concordance correlation coefficient to evaluate reproducibility. Biometrics (1989), 255--268."},{"key":"e_1_3_2_1_13_1","volume-title":"Toward detecting emotions in spoken dialogs","author":"Lee Chul Min","year":"2005","unstructured":"Chul Min Lee and Shrikanth S Narayanan . 2005. Toward detecting emotions in spoken dialogs . IEEE transactions on speech and audio processing, Vol. 13 , 2 ( 2005 ), 293--303. Chul Min Lee and Shrikanth S Narayanan. 2005. Toward detecting emotions in spoken dialogs. IEEE transactions on speech and audio processing, Vol. 13, 2 (2005), 293--303."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1007\/11573548_66"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2663204.2663236"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.5555\/2029556.2029563"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.25080\/Majora-7b98e3ed-003"},{"key":"e_1_3_2_1_18_1","unstructured":"Albert Mehrabian. 1980. Basic Dimensions for a General Psychological Theory Implications for Personality Social Environmental and Developmental Studies .  Albert Mehrabian. 1980. Basic Dimensions for a General Psychological Theory Implications for Personality Social Environmental and Developmental Studies ."},{"key":"e_1_3_2_1_19_1","volume-title":"Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781","author":"Mikolov Tomas","year":"2013","unstructured":"Tomas Mikolov , Kai Chen , Greg Corrado , and Jeffrey Dean . 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 ( 2013 ). Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2017.7952552"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-1081"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2016.0055"},{"key":"e_1_3_2_1_23_1","first-page":"3603","article-title":"Emotion recognition from speech","volume":"3","author":"Rao K Sreenivasa","year":"2012","unstructured":"K Sreenivasa Rao , Tummala Pavan Kumar , Kusam Anusha , Bathina Leela , Ingilela Bhavana , and SVSK Gowtham . 2012 . Emotion recognition from speech . International Journal of Computer Science and Information Technologies , Vol. 3 , 2 (2012), 3603 -- 3607 . K Sreenivasa Rao, Tummala Pavan Kumar, Kusam Anusha, Bathina Leela, Ingilela Bhavana, and SVSK Gowtham. 2012. Emotion recognition from speech. International Journal of Computer Science and Information Technologies, Vol. 3, 2 (2012), 3603--3607.","journal-title":"International Journal of Computer Science and Information Technologies"},{"key":"e_1_3_2_1_24_1","volume-title":"Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC)","author":"Rozgic Viktor","year":"2012","unstructured":"Viktor Rozgic , Sankaranarayanan Ananthakrishnan , Shirin Saleem , Rohit Kumar , and Rohit Prasad . 2012 . Ensemble of svm trees for multimodal emotion recognition . In Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC) , 2012 Asia-Pacific. IEEE, 1--4. Viktor Rozgic, Sankaranarayanan Ananthakrishnan, Shirin Saleem, Rohit Kumar, and Rohit Prasad. 2012. Ensemble of svm trees for multimodal emotion recognition. In Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific. IEEE, 1--4."},{"key":"e_1_3_2_1_25_1","volume-title":"Proceedings.(ICASSP'04)","volume":"1","author":"Schuller Bj\u00f6rn","year":"2004","unstructured":"Bj\u00f6rn Schuller , Gerhard Rigoll , and Manfred Lang . 2004 . Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture. In Acoustics, Speech, and Signal Processing, 2004 . Proceedings.(ICASSP'04) . IEEE International Conference on , Vol. 1 . IEEE, I--577. Bj\u00f6rn Schuller, Gerhard Rigoll, and Manfred Lang. 2004. Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture. In Acoustics, Speech, and Signal Processing, 2004. Proceedings.(ICASSP'04). IEEE International Conference on, Vol. 1. IEEE, I--577."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/1363686.1364052"},{"key":"e_1_3_2_1_27_1","volume-title":"Thulac: An efficient lexical analyzer for chinese. Technical Report. Technical Report.","author":"Sun Maosong","year":"2016","unstructured":"Maosong Sun , Xinxiong Chen , Kaixu Zhang , Zhipeng Guo , and Zhiyuan Liu . 2016 . Thulac: An efficient lexical analyzer for chinese. Technical Report. Technical Report. Maosong Sun, Xinxiong Chen, Kaixu Zhang, Zhipeng Guo, and Zhiyuan Liu. 2016. Thulac: An efficient lexical analyzer for chinese. Technical Report. Technical Report."},{"key":"e_1_3_2_1_28_1","unstructured":"Ilya Sutskever Oriol Vinyals and Quoc V Le. 2014. Sequence to sequence learning with neural networks. In Advances in neural information processing systems. 3104--3112.   Ilya Sutskever Oriol Vinyals and Quoc V Le. 2014. Sequence to sequence learning with neural networks. In Advances in neural information processing systems. 3104--3112."},{"key":"e_1_3_2_1_29_1","volume-title":"Sixteenth Annual Conference of the International Speech Communication Association .","author":"Swietojanski Pawel","year":"2015","unstructured":"Pawel Swietojanski , Peter Bell , and Steve Renals . 2015 . Structured output layer with auxiliary targets for context-dependent acoustic modelling . In Sixteenth Annual Conference of the International Speech Communication Association . Pawel Swietojanski, Peter Bell, and Steve Renals. 2015. Structured output layer with auxiliary targets for context-dependent acoustic modelling. In Sixteenth Annual Conference of the International Speech Communication Association ."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1007\/s12559-015-9326-z"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/5.58337"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.5555\/2209816.2210598"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2012.2225812"}],"event":{"name":"MM '18: ACM Multimedia Conference","location":"Seoul Republic of Korea","acronym":"MM '18","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 26th ACM international conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3240508.3240575","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3240508.3240575","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:57:34Z","timestamp":1750208254000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3240508.3240575"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,10,15]]},"references-count":33,"alternative-id":["10.1145\/3240508.3240575","10.1145\/3240508"],"URL":"https:\/\/doi.org\/10.1145\/3240508.3240575","relation":{},"subject":[],"published":{"date-parts":[[2018,10,15]]},"assertion":[{"value":"2018-10-15","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}