INTERSPEECH 2013: Lyon, France
- Frédéric Bimbot, Christophe Cerisara, Cécile Fougeron, Guillaume Gravier, Lori Lamel, François Pellegrino, Pascal Perrier:
14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013, Lyon, France, August 25-29, 2013. ISCA 2013
Systems for Search/Retrieval of Speech Documents
- Xavier Anguera:
Information retrieval-based dynamic time warping. 1-5 - Dogan Can, Shrikanth S. Narayanan:
On the computation of document frequency statistics from spoken corpora using factor automata. 6-10 - Kouichi Katsurada, Seiichi Miura, Kheang Seng, Yurie Iribe, Tsuneo Nitta:
Acceleration of spoken term detection using a suffix array by assigning optimal threshold values to sub-keywords. 11-14 - Arindam Mandal, Julien van Hout, Yik-Cheung Tam, Vikramjit Mitra, Yun Lei, Jing Zheng, Dimitra Vergyri, Luciana Ferrer, Martin Graciarena, Andreas Kathol, Horacio Franco:
Strategies for high accuracy keyword detection in noisy channels. 15-19 - Alberto Abad, Luis Javier Rodríguez-Fuentes, Mikel Peñagarikano, Amparo Varona, Germán Bordel:
On the calibration and fusion of heterogeneous spoken term detection systems. 20-24 - Shiro Narumi, Kazuma Konno, Takuya Nakano, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee:
Intensive acoustic models constructed by integrating low-occurrence models for spoken term detection. 25-28
Speech Analysis I-IV
- John Kane, Irena Yanushevskaya, John Dalton, Christer Gobl, Ailbhe Ní Chasaide:
Using phonetic feature extraction to determine optimal speech regions for maximising the effectiveness of glottal source analysis. 29-33 - Hideki Kawahara, Masanori Morise, Tomoki Toda, Ryuichi Nisimura, Toshio Irino:
Beyond bandlimited sampling of speech spectral envelope imposed by the harmonic structure of voiced sounds. 34-38 - JeeSok Lee, Frank K. Soong, Hong-Goo Kang:
A source-filter based adaptive harmonic model and its application to speech prosody modification. 39-43 - K. Ramesh, S. R. Mahadeva Prasanna, D. Govind:
Detection of glottal opening instants using Hilbert envelope. 44-48 - Dhananjaya N. Gowda, Jouni Pohjalainen, Mikko Kurimo, Paavo Alku:
Robust formant detection using group delay function and stabilized weighted linear prediction. 49-53 - Thomas Hézard, Thomas Hélie, Boris Doval:
A source-filter separation algorithm for voiced sounds based on an exact anticausal/causal pole decomposition for the class of periodic signals. 54-58
Language and Dialect Recognition
- Weiwei Liu, Wei-Qiang Zhang, Zhiyi Li, Jia Liu:
Parallel absolute-relative feature based phonotactic language recognition. 59-63 - Mireia Díez, Amparo Varona, Mikel Peñagarikano, Luis Javier Rodríguez-Fuentes, Germán Bordel:
Dimensionality reduction of phone log-likelihood ratio features for spoken language recognition. 64-68 - Jeff Z. Ma, Bing Zhang, Spyros Matsoukas, Sri Harish Reddy Mallidi, Feipeng Li, Hynek Hermansky:
Improvements in language identification on the RATS noisy speech corpus. 69-73 - Mehdi Soufifar, Lukás Burget, Oldrich Plchot, Sandro Cumani, Jan Cernocký:
Regularized subspace n-gram model for phonotactic ivector extraction. 74-78 - Hamid Behravan, Ville Hautamäki, Tomi Kinnunen:
Foreign accent detection from spoken Finnish using i-vectors. 79-83 - Mitchell McLaren, Aaron Lawson, Yun Lei, Nicolas Scheffer:
Adaptive Gaussian backend for robust language identification. 84-88
ASR - Neural Networks
- Matthias Paulik:
Lattice-based training of bottleneck feature extraction neural networks. 89-93 - Jonas Gehring, Wonkyum Lee, Kevin Kilgour, Ian R. Lane, Yajie Miao, Alex Waibel:
Modular combination of deep neural networks for acoustic modeling. 94-98 - Shuo-Yiin Chang, Nelson Morgan:
Informative spectro-temporal bottleneck features for noise-robust speech recognition. 99-103 - Zhi-Jie Yan, Qiang Huo, Jian Xu:
A scalable approach to using DNN-derived features in GMM-HMM based acoustic modeling for LVCSR. 104-108 - Shakti P. Rath, Daniel Povey, Karel Veselý, Jan Cernocký:
Improved feature processing for deep neural networks. 109-113 - Oriol Vinyals, Nelson Morgan:
Deep vs. wide: depth on a budget for robust speech recognition. 114-118
Speech Acoustics
- Angelika Braun:
An early case of "VOT". 119-122 - Robert Allen Fox, Ewa Jacewicz, Jessica Hart:
Pitch pattern variations in three regional varieties of American English. 123-127 - Jean-Sylvain Liénard, Claude Barras:
Fine-grain voice strength estimation from vowel spectral cues. 128-132 - Elizabeth Godoy, Catherine Mayo, Yannis Stylianou:
Linking loudness increases in normal and Lombard speech to decreasing vowel formant separation. 133-137 - Kunitoshi Motoki:
Three-dimensional rectangular vocal-tract model with asymmetric wall impedances. 138-142 - Manu Airaksinen, Brad H. Story, Paavo Alku:
Quasi closed phase analysis for glottal inverse filtering. 143-147
Paralinguistic Challenge (Special Session)
- Björn W. Schuller, Stefan Steidl, Anton Batliner, Alessandro Vinciarelli, Klaus R. Scherer, Fabien Ringeval, Mohamed Chetouani, Felix Weninger, Florian Eyben, Erik Marchi, Marcello Mortillaro, Hugues Salamin, Anna Polychroniou, Fabio Valente, Samuel Kim:
The INTERSPEECH 2013 computational paralinguistics challenge: social signals, conflict, emotion, autism. 148-152 - Artur Janicki:
Non-linguistic vocalisation recognition based on hybrid GMM-SVM approach. 153-157 - Jieun Oh, Eunjoon Cho, Malcolm Slaney:
Characteristic contours of syllabic-level units in laughter. 158-162 - Teun F. Krikke, Khiet P. Truong:
Detection of nonverbal vocalizations using Gaussian mixture models: looking for fillers and laughter in conversational speech. 163-167 - Johannes Wagner, Florian Lingenfelser, Elisabeth André:
Using phonetic patterns for detecting social cues in natural conversations. 168-172 - Rahul Gupta, Kartik Audhkhasi, Sungbok Lee, Shrikanth S. Narayanan:
Paralinguistic event detection from speech using probabilistic time-series smoothing and masking. 173-177 - Gouzhen An, David Guy Brizan, Andrew Rosenberg:
Detecting laughter and filled pauses using syllable-based features. 178-181 - Daniel Bone, Theodora Chaspari, Kartik Audhkhasi, James Gibson, Andreas Tsiartas, Maarten Van Segbroeck, Ming Li, Sungbok Lee, Shrikanth S. Narayanan:
Classifying language-related developmental disorders from speech cues: the promise and the potential confounds. 182-186 - Katrin Kirchhoff, Yuzong Liu, Jeff A. Bilmes:
Classification of developmental disorders from speech signals using submodular feature selection. 187-190 - Meysam Asgari, Alireza Bayestehtashk, Izhak Shafran:
Robust and accurate features for detecting and diagnosing autism spectrum disorders. 191-194 - David Martínez González, Dayana Ribas, Eduardo Lleida, Alfonso Ortega, Antonio Miguel:
Suprasegmental information modelling for autism disorder spectrum and specific language impairment classification. 195-199 - Félix Grèzes, Justin Richards, Andrew Rosenberg:
Let me finish: automatic conflict detection using speaker overlap. 200-204 - Vidhyasaharan Sethu, Julien Epps, Eliathamby Ambikairajah, Haizhou Li:
GMM based speaker variability compensated system for INTERSPEECH 2013 ComParE emotion challenge. 205-209 - Okko Räsänen, Jouni Pohjalainen:
Random subset feature selection in automatic recognition of developmental disorders, affective states, and level of conflict from speech. 210-214 - Hung-yi Lee, Ting-Yao Hu, How Jing, Yun-Fan Chang, Yu Tsao, Yu-Cheng Kao, Tsang-Long Pao:
Ensemble of machine learning and acoustic segment model techniques for speech emotion and autism spectrum disorders recognition. 215-219 - Gábor Gosztolya, Róbert Busa-Fekete, László Tóth:
Detecting autism, emotions and social signals using AdaBoost. 220-224
Perception of Prosody
- Oliver Niebuhr:
Resistance is futile - the intonation between continuation rise and calling contour in German. 225-229 - Hansjörg Mixdorff, Oliver Niebuhr:
The influence of F0 contour continuity on prominence perception. 230-234 - Caroline L. Smith, Paul Edmunds:
Native English listeners' perceptions of prosody in L1 and L2 reading. 235-238 - Chiharu Tsurutani, Dean Luo:
Naturalness judgement of L2 Mandarin Chinese - does timing matter? 239-242 - Daniel Aalto, Juraj Simko, Martti Vainio:
Language background affects the strength of the pitch bias in a duration discrimination task. 243-247 - Margaret Zellers:
Pitch and lengthening as cues to turn transition in Swedish. 248-252 - Maria Paola Bissiri, Margaret Zellers:
Perception of glottalization in varying pitch contexts across languages. 253-257 - Michael Walsh, Katrin Schweitzer, Nadja Schauffler:
Exemplar-based pitch accent categorisation using the generalized context model. 258-262 - Bettina Braun, Yuki Asano:
Double contrast is signalled by prenuclear and nuclear accent types alone, not by f0-plateaux. 263-266 - Susana Correia, Sónia Frota, Joseph Butler, Marina Vigário:
Word stress perception in European Portuguese. 267-271 - Denis Arnold, Petra Wagner, R. Harald Baayen:
Using generalized additive models and random forests to model prosodic prominence in German. 272-276 - Hartmut R. Pfitzinger, Hansjörg Mixdorff:
Perceiving speech rate differences between natural and time-scale modified utterances. 277-281
Prosody, Phonetics of Language Varieties
- Plínio A. Barbosa, Anders Eriksson, Joel Åkesson:
On the robustness of some acoustic parameters for signalling word stress across styles in Brazilian Portuguese. 282-286 - Shao-Ren Lyu, Ho-hsien Pan:
Reexamine the sandhi rules and the merging tones in Hakka language. 287-290 - Marija Tabain, Richard Beare, Andrew Butcher:
A preliminary spectral analysis of palatal and velar stop bursts in Pitjantjatjara. 291-295 - Shakuntala Mahanta, A. I. Twaha:
Presentational focus realisation in Nalbaria variety of Assamese. 296-299 - Marisa Cruz, Sónia Frota:
On the relation between intonational phrasing and pitch accent distribution. Evidence from European Portuguese varieties. 300-304 - Rena Nemoto, Martine Adda-Decker:
How are word-final schwas different in the north and south of France? 305-309 - Simone Ashby, Sílvia Barbosa, Catarina Silva, Paulino Fumo, José Pedro Ferreira:
Modeling postcolonial language varieties: challenges and lessons learned from Mozambican Portuguese. 310-314 - Heete Sahkai, Mari-Liis Kalvik, Meelis Mihkla:
Prosody of contrastive focus in Estonian. 315-319 - Thomas Kisler, Uwe D. Reichel:
Exploring the connection of acoustic and distinctive features. 320-324 - Conceição Cunha, Jonathan Harrington, Phil Hoole:
A physiological analysis of the tense/lax vowel contrast in two varieties of German. 325-329 - Einar Meister, Lya Meister:
Production of Estonian quantity contrasts by native speakers of Finnish. 330-334 - Yohann Meynadier, Yulia Gaydina:
Aerodynamic and durational cues of phonological voicing in whisper. 335-339 - Uwe D. Reichel:
Information theoretic syllable structure and its relation to the c-center effect. 340-344 - Bistra Andreeva, William J. Barry, Jacques C. Koreman:
The Bulgarian stressed and unstressed vowel system. A corpus study. 345-348
Speech Synthesis I, II
- Santitham Prom-on, Peter Birkholz, Yi Xu:
Training an articulatory synthesizer with continuous acoustic data. 349-353 - Géza Kiss, Jan P. H. van Santen:
Estimating speaker-specific intonation patterns using the linear alignment model. 354-358 - June Sig Sung, Doo Hwa Hong, Hyun Woo Koo, Nam Soo Kim:
Factored maximum likelihood kernelized regression for HMM-based singing voice synthesis. 359-363 - Shinnosuke Takamichi, Tomoki Toda, Yoshinori Shiga, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
Improvements to HMM-based speech synthesis based on parameter generation with rich context models. 364-368 - Toru Nakashika, Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki:
Voice conversion in high-order eigen space using deep belief nets. 369-372 - Hanna Silén, Jani Nurminen, Elina Helander, Moncef Gabbouj:
Voice conversion for non-parallel datasets using dynamic kernel partial least squares regression. 373-377 - Takashi Nose, Misa Kanemoto, Tomoki Koriyama, Takao Kobayashi:
A style control technique for singing voice synthesis based on multiple-regression HSMM. 378-382 - Florian Hinterleitner, Christoph Norrenbrock, Sebastian Möller, Ulrich Heute:
Predicting the quality of text-to-speech systems from a large-scale feature set. 383-387 - Jani Nurminen, Hanna Silén, Moncef Gabbouj:
Speaker-specific retraining for enhanced compression of unit selection text-to-speech databases. 388-391 - Mark A. Huckvale, Julian Leff, Geoff Williams:
Avatar therapy: an audio-visual dialogue system for treating auditory hallucinations. 392-396 - Prasanna Kumar Muthukumar, Alan W. Black, H. Timothy Bunnell:
Optimizations and fitting procedures for the Liljencrants-Fant model for statistical parametric speech synthesis. 397-401 - Dirk Hovy, Gopala Krishna Anumanchipalli, Alok Parlikar, Caroline Vaughn, Adam C. Lammert, Eduard H. Hovy, Alan W. Black:
Analysis and modeling of "focus" in context. 402-406
Perception, Dialectal Differences
- Thi Anh Xuan Tran, Viet Son Nguyen, Eric Castelli, René Carré:
Production and perception of pseudo-V1CV2 outside the vowel triangle: speech illusion effects. 407-411 - Maria Candea, Martine Adda-Decker, Lori Lamel:
Recent evolution of non-standard consonantal variants in French broadcast news. 412-416 - Frank Zimmerer, Rei Yasuda, Henning Reetz:
Architekt or archtekt? Perception of devoiced vowels produced by Japanese speakers of German. 417-420 - Andrew R. Plummer, Lucie Ménard, Benjamin Munson, Mary E. Beckman:
Comparing vowel category response surfaces over age-varying maximal vowel spaces within and across language communities. 421-425 - Molly Babel, Grant McGuire:
Perceived vocal attractiveness across dialects is similar but not uniform. 426-430 - Hongyan Wang, Vincent J. van Heuven:
Mutual intelligibility of American, Chinese and Dutch-accented speakers of English tested by SUS and SPIN sentences. 431-435
Speech Enhancement - Single Channel
- Xugang Lu, Yu Tsao, Shigeki Matsuda, Chiori Hori:
Speech enhancement based on deep denoising autoencoder. 436-440 - Hiroshi Saruwatari, Suzumi Kanehara, Ryoichi Miyazaki, Kiyohiro Shikano, Kazunobu Kondo:
Musical noise analysis for Bayesian minimum mean-square error speech amplitude estimators based on higher-order statistics. 441-445 - Nikolay Lyubimov, Mikhail Kotov:
Non-negative matrix factorization with linear constraints for single-channel speech enhancement. 446-450 - Hung-Wei Tseng, Srikanth Vishnubhotla, Mingyi Hong, Xiangfeng Wang, Jinjun Xiao, Zhi-Quan Luo, Tao Zhang:
A single channel speech enhancement approach by combining statistical criterion and multi-frame sparse dictionary learning. 451-455 - Majid Mirbagheri, Yanbo Xu, Sahar Akram, Shihab A. Shamma:
Speech enhancement using convolutive nonnegative matrix factorization with cosparsity regularization. 456-459 - Matthew C. McCallum, Bernard J. Guillemin:
Joint stochastic-deterministic Wiener filtering with recursive Bayesian estimation of deterministic speech. 460-464
Dialog Modeling
- Juha Knuuttila, Okko Räsänen, Unto K. Laine:
Automatic self-supervised learning of associations between speech and text. 465-469 - Lucie Daubigney, Matthieu Geist, Olivier Pietquin:
Particle swarm optimisation of spoken dialogue system strategies. 470-474 - Pierre Lison:
Model-based Bayesian reinforcement learning for dialogue management. 475-479 - Fabrizio Ghigi, M. Inés Torres, Raquel Justo, José-Miguel Benedí:
Evaluating spoken dialogue models under the interactive pattern recognition framework. 480-484 - Yun-Nung Chen, Florian Metze:
Multi-layer mutually reinforced random walk with hidden parameters for improved multi-party meeting summarization. 485-489 - Pei-hao Su, Yow-Bang Wang, Tsung-Hsien Wen, Tien-han Yu, Lin-Shan Lee:
A recursive dialogue game framework with optimal policy offering personalized computer-assisted language learning. 490-494
ASR - Lexical, Prosodic and Cross/Multi-Lingual
- Stefan Hahn, Patrick Lehnen, Simon Wiesler, Ralf Schlüter, Hermann Ney:
Improving LVCSR with hidden conditional random fields for grapheme-to-phoneme conversion. 495-499 - Van Hai Do, Xiong Xiao, Engsiong Chng, Haizhou Li:
Context-dependent phone mapping for LVCSR of under-resourced languages. 500-504 - Ramya Rasipuram, Mathew Magimai-Doss:
Improving grapheme-based ASR by probabilistic lexical modeling approach. 505-509 - Petr Motlícek, David Imseng, Philip N. Garner:
Crosslingual tandem-SGMM: exploiting out-of-language data for acoustic model and feature level adaptation. 510-514 - Ngoc Thang Vu, Tanja Schultz:
Multilingual multilayer perceptron for rapid language adaptation between and across language families. 515-519 - Andrew Rosenberg:
Modeling prosodic sequences with k-means and Dirichlet process GMMs. 520-524
Phonetic Convergence
- Antje Schweitzer, Natalie Lewandowski:
Convergence of articulation rate in spontaneous speech. 525-529 - Jennifer S. Pardo:
Phonetic convergence in shadowed speech: a comparison of perceptual and acoustic measures. 530-534 - Marcin Wlodarczak, Juraj Simko, Petra Wagner:
Pitch and duration as a basis for entrainment of overlapped speech onsets. 535-538 - Francesca Bonin, Céline De Looze, Sucheta Ghosh, Emer Gilmartin, Carl Vogel, Anna Polychroniou, Hugues Salamin, Alessandro Vinciarelli, Nick Campbell:
Investigating fine temporal dynamics of prosodic and lexical accommodation. 539-543 - Jeesun Kim, Ruben Demirdjian, Chris Davis:
Spontaneous and explicit speech imitation. 544-547 - Václav Jonás Podlipský, Sárka Simácková, Katerina Chládková:
Imitation interacts with one's second-language phonology but it does not operate cross-linguistically. 548-552
Speech Production, Acquisition and Development I, II
- Po-jen Hsieh:
Prosodic markings of semantic predictability in Taiwan Mandarin. 553-557 - Rüdiger Hoffmann, Dieter Mehnert, Rolf Dietzel:
How did it work? Historic phonetic devices explained by coeval photographs. 558-562 - Lea S. Kohtz, Oliver Niebuhr:
Eliciting speech with sentence lists - a critical evaluation with special emphasis on segmental anchoring. 563-567 - Yuguang Wang, Jianwu Dang, Xi Chen, Jianguo Wei, Hongcui Wang, Kiyoshi Honda:
An MRI-based acoustic study of Mandarin vowels. 568-571 - Daniel Hirst:
Melody metrics for prosodic typology: comparing English, French and Chinese. 572-576 - Michael I. Proctor, Louis Goldstein, Adam C. Lammert, Dani Byrd, Asterios Toutios, Shrikanth S. Narayanan:
Velic coordination in French nasals: a real-time magnetic resonance imaging study. 577-581 - Mark A. Huckvale, Amrita Sharma:
Learning to imitate adult speech with the KLAIR virtual infant. 582-586 - Jorge C. Lucero, Jean Schoentgen, Mara Behlau:
Physics-based synthesis of disordered voices. 587-591 - Sonia D'Apolito, Barbara Gili Fivela:
Place assimilation and articulatory strategies: the case of sibilant sequences in French as L1 and L2. 592-596 - Barbara Samlowski, Petra Wagner, Bernd Möbius:
Effects of lexical class and lemma frequency on German homographs. 597-601 - Leonardo Lancia, Heriberto Avelino, Daniel Voigt:
Measuring laryngealization in running speech: interaction with contrastive tones in Yalálag Zapotec. 602-606 - Erin Rusaw:
A neural oscillator model of speech timing and rhythm. 607-611 - Nicole Wong, Maojing Fu, Zhi-Pei Liang, Ryan Shosted, Bradley P. Sutton:
Observations of perseverative coarticulation in lateral approximants using MRI. 612-616
General Topics in ASR
- Vishwa Gupta, Gilles Boulianne:
Comparing computation in Gaussian mixture and neural network based large-vocabulary speech recognition. 617-621 - Daniel Stein, Jochen Schwenninger, Michael Stadtschnitzer:
Simultaneous perturbation stochastic approximation for automatic speech recognition. 622-626 - David Sheffield, Michael J. Anderson, Yunsup Lee, Kurt Keutzer:
Hardware/software codesign for mobile speech recognition. 627-631 - Yangyang Shi, Martha A. Larson, Pascal Wiggers, Catholijn M. Jonker:
Exploiting the succeeding words in recurrent neural network language models. 632-636 - Amir Hossein Harati Nejad Torbati, Joseph Picone, Marc Sobel:
Speech acoustic unit segmentation using hierarchical Dirichlet processes. 637-641 - Munir Georges, Stephan Kanthak, Dietrich Klakow:
Transducer-based speech recognition with dynamic language models. 642-646 - Yotaro Kubo, Takaaki Hori, Atsushi Nakamura:
A method for structure estimation of weighted finite-state transducers and its application to grapheme-to-phoneme conversion. 647-651 - Denis Jouvet, Dominique Fohr:
Combining forward-based and backward-based decoders for improved speech recognition performance. 652-656 - Olivier Siohan, Michiel Bacchiani:
iVector-based acoustic data selection. 657-661 - Xin Lei, Andrew W. Senior, Alexander Gruenstein, Jeffrey Sorensen:
Accurate and compact large vocabulary speech recognition on mobile devices. 662-665 - Cyril Allauzen, Michael Riley:
Pre-initialized composition for large-vocabulary speech recognition. 666-670 - Evelyn Kurniawati, Sapna George:
Speaker dependent activation keyword detector based on GMM-UBM. 671-674 - Hasim Sak, Yun-Hsuan Sung, Françoise Beaufays, Cyril Allauzen:
Written-domain language modeling for automatic speech recognition. 675-679
Voice Activity Detection and Speech Segmentation
- Maarten Versteegh, Louis ten Bosch:
Detecting words in speech using linear separability in a bag-of-events vector space. 680-684 - Matt Burlick, Dimitrios Dimitriadis, Eric Zavesky:
On the improvement of multimodal voice activity detection. 685-689 - Jürgen T. Geiger, Florian Eyben, Nicholas W. D. Evans, Björn W. Schuller, Gerhard Rigoll:
Using linguistic information to detect overlapping speech. 690-694 - Jiaxing Ye, Takumi Kobayashi, Masahiro Murakawa, Tetsuya Higuchi:
Incremental acoustic subspace learning for voice activity detection using harmonicity-based features. 695-699 - Hoon Chung, Sung Joo Lee, Yunkeun Lee:
Endpoint detection using weighted finite state transducer. 700-703 - Maarten Van Segbroeck, Andreas Tsiartas, Shrikanth S. Narayanan:
A robust frontend for VAD: exploiting contextual, discriminative and spectral cues of human voice. 704-708 - Martin Graciarena, Abeer Alwan, Dan Ellis, Horacio Franco, Luciana Ferrer, John H. L. Hansen, Adam Janin, Byung Suk Lee, Yun Lei, Vikramjit Mitra, Nelson Morgan, Seyed Omid Sadjadi, T. J. Tsai, Nicolas Scheffer, Lee Ngee Tan, Benjamin Williams:
All for one: feature combination for highly channel-degraded speech activity detection. 709-713 - Maxime Le Coz, Julien Pinquier, Régine André-Obrecht:
Superposed speech localisation using frequency tracking. 714-717 - Andreas Tsiartas, Theodora Chaspari, Nassos Katsamanis, Prasanta Kumar Ghosh, Ming Li, Maarten Van Segbroeck, Alexandros Potamianos, Shrikanth S. Narayanan:
Multi-band long-term signal variability features for robust voice activity detection. 718-722 - Narimene Lezzoum, Ghyslain Gagnon, Jérémie Voix:
A low-complexity voice activity detector for smart hearing protection of hyperacusic persons. 723-727 - Neville Ryant, Mark Liberman, Jiahong Yuan:
Speech activity detection on YouTube using deep neural networks. 728-731 - François G. Germain, Dennis L. Sun, Gautham J. Mysore:
Speaker and noise independent voice activity detection. 732-736 - T. J. Tsai, Adam Janin:
Confidence-based scoring: a useful diagnostic tool for detection tasks. 737-741 - Yasuaki Kanai, Shota Morita, Masashi Unoki:
Concurrent processing of voice activity detection and noise reduction using empirical mode decomposition and modulation spectrum analysis. 742-746
Show and Tell Sessions 1-3
- Samer Al Moubayed, Jonas Beskow, Gabriel Skantze:
The furhat social companion talking head. 747-749 - Rodolphe Gelin, Gabriele Barbieri:
Audition: the most important sense for humanoid robots? 750-751 - Thomas Hueber:
Ultraspeech-player: intuitive visualization of ultrasound articulatory data for speech therapy and pronunciation training. 752-753 - Jieun Oh, Ge Wang:
Laughter modulation: from speech to speech-laugh. 754-755 - Daniel M. Bikel, Keith B. Hall:
Refr: an open-source reranker framework. 756-758 - Alessandro Sosi, Fabio Brugnara, Luca Cristoforetti, Marco Matassoni, Mirco Ravanelli, Maurizio Omologo:
Embedding speech recognition to control lights. 759-760 - Geoffrey S. Meltzner, James T. Heaton, Yunbin Deng:
The MUTE silent speech recognition system. 761-763 - James M. Scobbie, Alice Turk, Christian Geng, Simon King, Robin J. Lickley, Korin Richmond:
The Edinburgh Speech Production Facility DoubleTalk corpus. 764-766 - Dmitry Sityaev, Jonathan Hotz, Vadim Snitkovsky:
Lexee: a cloud-based platform for building and deploying voice-enabled mobile applications. 767-769 - Slim Ouni:
Visualizing articulatory data with VisArtico. 770-772 - Mariette Soury, Clément Gossart, Martine Adda-Decker, Laurence Devillers:
A tool to elicit and collect multicultural and multimodal laughter. 773-774 - Robert Schleicher, Tilo Westermann, Jinjin Li, Moritz Lawitschka, Benjamin Mateev, Ralf Reichmuth, Sebastian Möller:
Design of a mobile app for interspeech conferences: towards an open tool for the spoken language community. 775-777
Discourse, Intonation, Prosody
- Anders Eriksson, Plínio A. Barbosa, Joel Åkesson:
The acoustics of word stress in Swedish: a function of stress level, speaking style and word accent. 778-782 - Amandine Michelas, Cristel Portes, Maud Champagne-Lavau:
Intonational contrasts encode speaker's certainty in neutral vs. incredulity declarative questions in French. 783-787 - Yuichi Ishimoto, Mika Enomoto, Hitoshi Iida:
Prosodic changes pre-announcing a syntactic completion point in Japanese utterance. 788-792 - Candide Simard:
Prosodic encoding of declarative, interrogative and imperative sentences in Jaminjung, a language of Australia. 793-797 - Anne Vullinghs, Martijn Goudbeek, Emiel Krahmer:
Crosslinguistic priming in interactive reference: evidence for conceptual alignment in speech production. 798-802 - Spyros Kousidis, David Schlangen, Stavros Skopeteas:
A cross-linguistic study on turn-taking and temporal alignment in verbal interaction. 803-807
Source Separation
- Emad M. Grais, Hakan Erdogan:
Discriminative nonnegative dictionary learning using cross-coherence penalties for single channel source separation. 808-812 - Han-Gyu Kim, Gil-Jin Jang, Jeong-Sik Park, Yung-Hwan Oh:
Monaural speech segregation based on pitch track correction using an ensemble Kalman filter. 813-816 - Ngoc Thuy Tran, William G. Cowley, André Pollok:
Voice activity classification for automatic bi-speaker adaptive beamforming in speech separation. 817-821 - Keisuke Kinoshita, Mehrez Souden, Tomohiro Nakatani:
Blind source separation using spatially distributed microphones based on microphone-location dependent source activities. 822-826 - Tom Barker, Tuomas Virtanen:
Non-negative tensor factorisation of modulation spectrograms for monaural sound source separation. 827-831 - Mario Kaoru Watanabe, Pejman Mowlaee:
Iterative sinusoidal-based partial phase reconstruction in single-channel source separation. 832-836
Paralinguistic Information I, II
- Xiao Yao, Takatoshi Jitsuhiro, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda:
Classification of speech under stress by modeling the aerodynamics of the laryngeal ventricle. 837-841 - Rachel Rakov, Andrew Rosenberg:
"Sure, I did the right thing": a system for sarcasm detection in speech. 842-846 - Stefan Scherer, Giota Stratou, Jonathan Gratch, Louis-Philippe Morency:
Investigating voice quality as a speaker-independent indicator of depression and PTSD. 847-851 - Thomas Pellegrini, Annika Hämäläinen, Philippe Boula de Mareüil, Michael Tjalve, Isabel Trancoso, Sara Candeias, Miguel Sales Dias, Daniela Braga:
A corpus-based study of elderly and young speakers of European Portuguese: acoustic correlates and their impact on speech recognition performance. 852-856 - Nicholas Cummins, Julien Epps, Vidhyasaharan Sethu, Michael Breakspear, Roland Goecke:
Modeling spectral variability for the classification of depressed speech. 857-861 - Verónica Pérez-Rosas, Rada Mihalcea:
Sentiment analysis of online spoken reviews. 862-866
ASR - Robustness Against Noise I-III
- Ahmed Hussen Abdelaziz, Steffen Zeiler, Dorothea Kolossa:
Using twin-HMM-based audio-visual speech enhancement as a front-end for robust audio-visual speech recognition. 867-871 - James Gibson, Maarten Van Segbroeck, Antonio Ortega, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Spectro-temporal directional derivative features for automatic speech recognition. 872-875 - Xiong Xiao, Engsiong Chng, Haizhou Li:
Attribute-based histogram equalization (HEQ) and its adaptation for robust speech recognition. 876-880 - Vikas Joshi, N. Vishnu Prasad, Srinivasan Umesh:
Modified cepstral mean normalization - transforming to utterance specific non-zero mean. 881-885 - Vikramjit Mitra, Horacio Franco, Martin Graciarena:
Damped oscillator cepstral coefficients for robust speech recognition. 886-890 - Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
Regularized MVDR spectrum estimation-based robust feature extractors for speech recognition. 891-895
Neural Basis of Speech Perception
- Víctor Poblete, Néstor Becerra Yoma, Richard M. Stern:
Optimization of sigmoidal rate-level function based on acoustic features. 896-900 - Makiko Sadakata, Loukianos Spyrou, Mizuki Shingai, Kaoru Sekiyama:
Composing auditory ERPs: cross-linguistic comparison of auditory change complex for Japanese fricative consonants. 901-905 - Nathalie Bedoin, Jennifer Krzonowski, Emmanuel Ferragne:
How voicing, place and manner of articulation differently modulate event-related potentials associated with response inhibition. 906-910 - Ludovic Bellier, Michel Mazzuca, Hung Thai-Van, Anne Caclin, Rafael Laboissière:
Categorization of speech in early auditory evoked responses. 911-915 - Anna Dora Manca, Mirko Grimaldi:
Perception and production of Italian vowels: an ERP study. 916-920 - Ann-Kathrin Grohe, Bettina Braun:
Implicit learning leads to familiarity effects for intonation but not for voice. 921-924
Spoofing and Countermeasures for Automatic Speaker Verification (Special Session)
- Nicholas W. D. Evans, Tomi Kinnunen, Junichi Yamagishi:
Spoofing and countermeasures for automatic speaker verification. 925-929 - Rosa González Hautamäki, Tomi Kinnunen, Ville Hautamäki, Timo Leino, Anne-Maria Laukkanen:
I-vectors meet imitators: on vulnerability of speaker verification systems against voice mimicry. 930-934 - Marta Gomez-Barrero, Javier Gonzalez-Dominguez, Javier Galbally, Joaquin Gonzalez-Rodriguez:
Security evaluation of i-vector based speaker verification systems against hill-climbing attacks. 935-939 - Federico Alegre, Ravichander Vipperla, Asmaa Amehraye, Nicholas W. D. Evans:
A new speaker verification spoofing countermeasure based on local binary patterns. 940-944 - Zvi Kons, Hagai Aronowitz:
Voice transformation-based spoofing of text-dependent speaker verification systems. 945-949 - Zhizheng Wu, Anthony Larcher, Kong-Aik Lee, Engsiong Chng, Tomi Kinnunen, Haizhou Li:
Vulnerability evaluation of speaker verification under voice conversion spoofing: the effect of text constraints. 950-954
Speech Production, Acquisition and Development I, II
- Masako Fujimoto, Tatsuya Kitamura, Hiroaki Hatano, Ichiro Fujimoto:
Timing differences in articulation between voiced and voiceless stop consonants: an analysis of cine-MRI data. 955-958 - Adam C. Lammert, Vikram Ramanarayanan, Michael I. Proctor, Shrikanth S. Narayanan:
Vocal tract cross-distance estimation from real-time MRI using region-of-interest analysis. 959-962 - Apoorv Reddy Arrabothu, Nivedita Chennupati, B. Yegnanarayana:
Syllable nuclei detection using perceptually significant features. 963-967 - Fang-Ying Hsieh, Louis Goldstein, Dani Byrd, Shrikanth S. Narayanan:
Truncation of pharyngeal gesture in English diphthong [aɪ]. 968-972 - Zhaojun Yang, Vikram Ramanarayanan, Dani Byrd, Shrikanth S. Narayanan:
The effect of word frequency and lexical class on articulatory-acoustic coupling. 973-977 - Kimiko Yamakawa, Shigeaki Amano:
Discrimination between fricative and affricate in Japanese using time and spectral domain variables. 978-981 - Polina Drozdova, Catia Cucchiarini, Helmer Strik:
L2 syntax acquisition: the effect of oral and written computer assisted practice. 982-986 - Rosario Signorello, Didier Demolin:
The physiological use of the charismatic voice in political speech. 987-991 - Ralph L. Rose:
Crosslinguistic corpus of hesitation phenomena: a corpus for investigating first and second language speech performance. 992-996 - Simon Preuß, Christiane Neuschaefer-Rube, Peter Birkholz:
Real-time control of a 2D animation model of the vocal tract using optopalatography. 997-1001 - Jessica Siddins, Jonathan Harrington, Felicitas Kleber, Ulrich Reubold:
The influence of accentuation and polysyllabicity on compensatory shortening in German. 1002-1006 - Hongwei Ding, Rüdiger Hoffmann:
An investigation of vowel epenthesis in Chinese learners' production of German consonants. 1007-1011 - Korin Richmond, Zhen-Hua Ling, Junichi Yamagishi, Benigno Uria:
On the evaluation of inversion mapping performance in the acoustic domain. 1012-1016
Speech Synthesis I, II
- Tatsuma Ishihara, Hirokazu Kameoka, Kota Yoshizato, Daisuke Saito, Shigeki Sagayama:
Probabilistic speech F0 contour model incorporating statistical vocabulary model of phrase-accent command sequence. 1017-1021 - Ian Vince McLoughlin, Jingjie Li, Yan Song:
Reconstruction of continuous voiced speech from whispers. 1022-1026 - Daniel R. van Niekerk, Etienne Barnard:
Generating fundamental frequency contours for speech synthesis in Yorùbá. 1027-1031 - Elias Azarov, Maxim Vashkevich, Denis Likhachov, Alexander A. Petrovsky:
Real-time voice conversion using artificial neural networks with rectified linear units. 1032-1036 - Oraphan Krityakien, Keikichi Hirose, Nobuaki Minematsu:
Generation of fundamental frequency contours for Thai speech synthesis using tone nucleus model. 1037-1041 - Langzhou Chen, Norbert Braunschweiler:
Unsupervised speaker and expression factorization for multi-speaker expressive synthesis of ebooks. 1042-1046 - Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka, Satoshi Takahashi:
Which resemblance is useful to predict phrase boundary rise labels for Japanese expressive text-to-speech synthesis, numerically-expressed stylistic or distribution-based semantic? 1047-1051 - Jinfu Ni, Yoshinori Shiga, Chiori Hori, Yutaka Kidawara:
A targets-based superpositional model of fundamental frequency contours applied to HMM-based speech synthesis. 1052-1056 - Kazuhiro Kobayashi, Hironori Doi, Tomoki Toda, Tomoyasu Nakano, Masataka Goto, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
An investigation of acoustic features for singing voice conversion based on perceptual age. 1057-1061 - Bajibabu Bollepalli, Tuomo Raitio, Paavo Alku:
Effect of MPEG audio compression on HMM-based speech synthesis. 1062-1066 - Hironori Doi, Tomoki Toda, Tomoyasu Nakano, Masataka Goto, Satoshi Nakamura:
Evaluation of a singing voice conversion method based on many-to-many eigenvoice conversion. 1067-1071 - Tomoki Koriyama, Takashi Nose, Takao Kobayashi:
Statistical nonparametric speech synthesis using sparse Gaussian processes. 1072-1076 - Amir Mohammadi, Cenk Demiroglu:
Hybrid nearest-neighbor/cluster adaptive training for rapid speaker adaptation in statistical speech synthesis systems. 1077-1081 - João P. Cabral:
Uniform concatenative excitation model for synthesising speech without voiced/unvoiced classification. 1082-1086
Metadata, Evaluation and Resources I, II
- Matthias Sperber, Graham Neubig, Christian Fügen, Satoshi Nakamura, Alex Waibel:
Efficient speech transcription through respeaking. 1087-1091 - Samuel Kim, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Annotation and classification of political advertisements. 1092-1096 - Ryuichiro Higashinaka, Kohji Dohsaka, Hideki Isozaki:
Using role play for collecting question-answer pairs for dialogue agents. 1097-1100 - Yoshiko Arimoto, Kazuo Okanoya:
Individual differences of emotional expression in speaker's behavioral and autonomic responses. 1101-1105 - Ina Wechsung, Benjamin Weiss, Christine Kühnel, Patrick Ehrenbrink, Sebastian Möller:
Development and validation of the conversational agents scale (CAS). 1106-1110 - Giuseppe Riccardi, Arindam Ghosh, S. A. Chowdhury, Ali Orkan Bayer:
Motivational feedback in crowdsourcing: a case study in speech transcription. 1111-1115 - Charles Fox, Yulan Liu, Erich Zwyssig, Thomas Hain:
The Sheffield Wargames Corpus. 1116-1120 - Anuj Kumar, Florian Metze, Wenyi Wang, Matthew Kam:
Formalizing expert knowledge for developing accurate speech recognizers. 1121-1125 - Samer Al Moubayed, Jens Edlund, Joakim Gustafson:
Analysis of gaze and speech patterns in three-party quiz game interaction. 1126-1130 - Olivier Galibert:
Methodologies for the evaluation of speaker diarization and automatic speech recognition in the presence of overlapping speech. 1131-1134 - Abhijeet Sangwan, Lakshmish Kaushik, Chengzhu Yu, John H. L. Hansen, Douglas W. Oard:
'Houston, we have a solution': using NASA Apollo program to advance speech and language processing technology. 1135-1139
Speech Technology for Speech and Hearing Disorders I, II
- Robin Hofe, Jie Bai, Lam Aun Cheah, Stephen R. Ell, James M. Gilbert, Roger K. Moore, Phil D. Green:
Performance of the MVOCA silent speech interface across multiple speakers. 1140-1143 - Gustavo Andrade-Miranda, Juan Ignacio Godino-Llorente:
Automatic glottal tracking from high-speed digital images using a continuous normalized cross correlation. 1144-1148 - Tobias Bocklet, Stefan Steidl, Elmar Nöth, Sabine Skodda:
Automatic evaluation of Parkinson's speech - acoustic, prosodic and voice related cues. 1149-1153 - Luiza Orosanu, Denis Jouvet:
Comparison of approaches for an efficient phonetic decoding. 1154-1158 - Heidi Christensen, Phil D. Green, Thomas Hain:
Learning speaker-specific pronunciations of disordered speech. 1159-1163 - Verónica López-Ludeña, Rubén San Segundo, Carlos González-Morcillo, Juan Carlos López, E. Ferreiro:
Adapting a speech into sign language translation system to a new domain. 1164-1168
Speech Analysis I-IV
- Elizabeth Godoy, Maria Koutsogiannaki, Yannis Stylianou:
Assessing the intelligibility impact of vowel space expansion via clear speech-inspired frequency warping. 1169-1173 - Jesper Jensen, Cees H. Taal:
Prediction of intelligibility of noisy and time-frequency weighted speech based on mutual information between amplitude envelopes. 1174-1178 - Emma Jokinen, Marko Takanen, Paavo Alku:
Frequency-adaptive post-filtering for intelligibility enhancement of narrowband telephone speech. 1179-1183 - Junfeng Li, Fei Chen, Masato Akagi, Yonghong Yan:
Comparative investigation of objective speech intelligibility prediction measures for noise-reduced signals in Mandarin and Japanese. 1184-1187 - Andrew Hines, Jan Skoglund, Anil C. Kokaram, Naomi Harte:
Monitoring the effects of temporal clipping on VoIP speech quality. 1188-1192 - Jiahong Yuan:
The spectral dynamics of vowels in Mandarin Chinese. 1193-1197
Discriminative Training Methods for Language Modeling
- Holger Schwenk:
CSLM - a modular open-source continuous space language modeling toolkit. 1198-1202 - Yangyang Shi, Mei-Yuh Hwang, Kaisheng Yao, Martha A. Larson:
Speed up of recurrent neural network language models with sentence independent subsampling stochastic gradient descent. 1203-1207 - Shuangyu Chang, Michael Levit, Partha Parthasarathy, Benoît Dumoulin:
Improving unsupervised language model adaptation with discriminative data filtering. 1208-1212 - Akio Kobayashi, Takahiro Oku, Yuya Fujita, Shoei Sato:
Lightly supervised training for risk-based discriminative language models. 1213-1217 - Erinç Dikici, Emily Tucker Prud'hommeaux, Brian Roark, Murat Saraçlar:
Investigation of MT-based ASR confusion models for semi-supervised discriminative language modeling. 1218-1222 - Takanobu Oba, Atsunori Ogawa, Takaaki Hori, Hirokazu Masataki, Atsushi Nakamura:
Unsupervised discriminative language modeling using error rate estimator. 1223-1227
ASR - Adaptive Training
- Shakti P. Rath, Lukás Burget, Martin Karafiát, Ondrej Glembek, Jan Cernocký:
A region-specific feature-space transformation for speaker adaptation and singularity analysis of jacobian matrix. 1228-1232 - Yongqiang Wang, Mark J. F. Gales:
An explicit independence constraint for factorised adaptation in speech recognition. 1233-1237 - Oscar Saz, Thomas Hain:
Asynchronous factorisation of speaker and background with feature transforms in speech recognition. 1238-1242 - Kai Yu, Hainan Xu:
Cluster adaptive training with factorized decision trees for speech recognition. 1243-1247 - Ossama Abdel-Hamid, Hui Jiang:
Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition. 1248-1252 - Keith Kintzley, Aren Jansen, Hynek Hermansky:
Text-to-speech inspired duration modeling for improved whole-word acoustic models. 1253-1257
Speech Acquisition and Development
- Adele Gregory, Marija Tabain, Michael Robb:
Duration of early vocalisations. 1258-1262 - Jing Yang, Robert Allen Fox:
Acoustic development of vowel production in American English children. 1263-1267 - Clément Moulin-Frier, Pierre-Yves Oudeyer:
The role of intrinsic motivations in learning sensorimotor vocal mappings: a developmental robotics study. 1268-1272 - Valérie Hazan, Michèle Pettinato:
Children's timing and repair strategies for communication in adverse listening conditions. 1273-1277 - Guillaume Barbier, Pascal Perrier, Lucie Ménard, Yohan Payan, Mark K. Tiede, Joseph S. Perkell:
Speech planning as an index of speech motor control maturity. 1278-1282 - Melissa Kinsman, Fangfang Li:
The relationship between gender-differentiated productions of /s/ and gender role behaviour in young children. 1283-1286
Articulatory Data Acquisition and Processing (Special Session)
- Jeffrey Berry, Luciano Fadiga:
Data-driven design of a sentence list for an articulatory speech corpus. 1287-1291 - Yinghua Zhu, Asterios Toutios, Shrikanth S. Narayanan, Krishna S. Nayak:
Faster 3D vocal tract real-time MRI using constrained reconstruction. 1292-1296 - Claudia Canevari, Leonardo Badino, Luciano Fadiga, Giorgio Metta:
Relevance-weighted-reconstruction of articulatory features in deep-neural-network-based acoustic-to-articulatory mapping. 1297-1301 - Fabian Tomaschek, Martijn Wieling, Denis Arnold, R. Harald Baayen:
Word frequency, vowel length and vowel quality in speech production: an EMA study of the importance of experience. 1302-1306 - Samuel S. Silva, António J. S. Teixeira, Catarina Oliveira, Paula Martins:
Towards a systematic and quantitative analysis of vocal tract data. 1307-1311 - Colin Vaz, Vikram Ramanarayanan, Shrikanth S. Narayanan:
A two-step technique for MRI audio enhancement using dictionary learning and wavelet packet analysis. 1312-1315 - Massimo Stella, Antonio Stella, Francesco Sigona, Paolo Bernardini, Mirko Grimaldi, Barbara Gili Fivela:
Electromagnetic articulography with AG500 and AG501. 1316-1320 - Pierre Badin, Julián Andrés Valdés Vargas, Arielle Koncki, Laurent Lamalle, Christophe Savariaux:
Development and implementation of fiducial markers for vocal tract MRI imaging and speech articulatory modelling. 1321-1325 - Susanne Schötz, Johan Frid, Lars Gustafsson, Anders Löfqvist:
Functional data analysis of tongue articulation in palatal vowels: Gothenburg and Malmöhus Swedish /iː, yː, ̟ʉː/. 1326-1330 - Jordan R. Green, Jun Wang, David L. Wilson:
SMASH: a tool for articulatory data processing and analysis. 1331-1335
Topics in Speech Perception and Emotion
- Jen-Chun Lin, Chung-Hsien Wu, Wen-Li Wei:
Emotion recognition of conversational affective speech using temporal course modeling. 1336-1340 - Rene Altrov, Hille Pajupuu, Jaan Pajupuu:
The role of empathy in the recognition of vocal emotions. 1341-1344 - Angèle Brunellière, Sophie Dufour:
Electrophysiological evidence for benefits of imitation during the processing of spoken words embedded in sentential contexts. 1345-1349 - Rintaro Ogane, Masaaki Honda:
Compensatory speech response to time-scale altered auditory feedback. 1350-1354 - Tin Lay Nwe, Trung Hieu Nguyen, Dilip Kumar Limbu:
Bhattacharyya distance based emotional dissimilarity measure in multi-dimensional space for emotion classification. 1355-1359 - Thiago de M. Prego, Amaro A. de Lima, Sergio L. Netto:
On the enhancement of dereverberation algorithms based on a perceptual evaluation criterion. 1360-1364 - Carlos Gussenhoven, Wencui Zhou:
Revisiting pitch slope and height effects on perceived duration. 1365-1369 - Hélène Guiraud, Emmanuel Ferragne, Nathalie Bedoin, Véronique Boulenger:
Adaptation to natural fast speech and time-compressed speech in children. 1370-1374 - Andreas Windmann, Juraj Simko, Britta Wrede, Petra Wagner:
Modeling durational incompressibility. 1375-1379 - Caroline Émond, Lucie Ménard, Marty Laforest:
Perceived prosodic correlates of smiled speech in spontaneous data. 1380-1383 - Alexander Raake, Katrin Schoenenberg, Janto Skowronek, Sebastian Egger:
Predicting speech quality based on interactivity and delay. 1384-1388 - Charlotte Kouklia, Nicolas Audibert:
Perceptual, acoustic and electroglottographic correlates of 3 aggressive attitudes in French: a pilot study. 1389-1393
Discourse and Machine Learning, Paralinguistic and Nonlinguistic Cues
- Mohamed Morchid, Georges Linarès, Marc El-Bèze, Renato De Mori:
Theme identification in telephone service conversations using quaternions of speech features. 1394-1398 - Hrishikesh Rao, Jonathan C. Kim, Agata Rozga, Mark A. Clements:
Detection of laughter in children's speech using spectral and prosodic acoustic features. 1399-1403 - Khiet P. Truong:
Classification of cooperative and competitive overlaps in speech using cues from the context, overlapper, and overlappee. 1404-1408 - Samuel Kim, Fabio Valente, Alessandro Vinciarelli:
Annotation and detection of conflict escalation in political debates. 1409-1413 - Florian Schiel, Mary Stevens, Uwe D. Reichel, Francesco Cutugno:
Machine learning of probabilistic phonological pronunciation rules from the Italian CLIPS corpus. 1414-1418 - Barbara Baumeister, Florian Schiel:
Human perception of alcoholic intoxication in speech. 1419-1423 - Luying Hou, Yuan Jia, Aijun Li:
Phonetic manifestation and influence of zero anaphora in Chinese reading texts. 1424-1428 - Salima Harrat, Mourad Abbas, Karima Meftouh, Kamel Smaïli:
Diacritics restoration for Arabic dialect texts. 1429-1433 - Marcin Wlodarczak, Petra Wagner:
Effects of talk-spurt silence boundary thresholds on distribution of gaps and overlaps. 1434-1437 - Tatiana Kachkovskaia, Nina B. Volskaya, Pavel A. Skrelin:
Final lengthening in Russian: a corpus-based study. 1438-1442 - Uwe D. Reichel:
From segmentation bootstrapping to transcription-to-word conversion. 1443-1447 - Geneviève Caelen-Haumont, Katarina Bartkova:
Manual and automatic tone annotation: the case of an endangered language from North Vietnam "Mo Piu". 1448-1452 - Laetitia Leonarduzzi, Sophie Herment:
Non-canonical syntactic structures in discourse: tonality, tonicity and tones in English (semi-)spontaneous speech. 1453-1457 - Elnaz Nouri, Sunghyun Park, Stefan Scherer, Jonathan Gratch, Peter J. Carnevale, Louis-Philippe Morency, David R. Traum:
Prediction of strategy and outcome as negotiation unfolds by using basic verbal and behavioral features. 1458-1461
Language Identification, Speaker Diarization
- Johann Poignant, Laurent Besacier, Viet Bac Le, Sophie Rosset, Georges Quénot:
Unsupervised naming of speakers in broadcast TV: using written names, pronounced names or both? 1462-1466 - Hervé Bredin, Johann Poignant:
Integer linear programming for speaker diarization and cross-modal identification in TV broadcast. 1467-1471 - Andrea DeMarco, Stephen J. Cox:
Native accent classification via i-vectors and speaker compensation fusion. 1472-1476 - Mickael Rouvier, Grégor Dupuy, Paul Gay, Elie Khoury, Téva Merlin, Sylvain Meignier:
An open-source state-of-the-art toolbox for broadcast news diarization. 1477-1481 - Zvi Kons, Orith Toledo-Ronen:
Audio event classification using deep neural networks. 1482-1486 - Wei-Bin Liang, Chung-Hsien Wu, Chun-Shan Hsu:
Code-Switching event detection based on delta-BIC using phonetic eigenvoice models. 1487-1491 - Naoki Hirayama, Koichiro Yoshino, Katsutoshi Itoyama, Shinsuke Mori, Hiroshi G. Okuno:
Automatic estimation of dialect mixing ratio for dialect speech recognition. 1492-1496 - Luis Javier Rodríguez-Fuentes, Niko Brümmer, Mikel Peñagarikano, Amparo Varona, Germán Bordel, Mireia Díez:
The Albayzin 2012 language recognition evaluation. 1497-1501 - Kyu Jeong Han, Sriram Ganapathy, Ming Li, Mohamed Kamal Omar, Shrikanth S. Narayanan:
TRAP language identification system for RATS phase II evaluation. 1502-1506 - Aaron Lawson, Mitchell McLaren, Yun Lei, Vikramjit Mitra, Nicolas Scheffer, Luciana Ferrer, Martin Graciarena:
Improving language identification robustness to highly channel-degraded speech through multiple system fusion. 1507-1510
Metadata, Evaluation and Resources I, II
- Jindrich Matousek, Daniel Tihelka:
Annotation errors detection in TTS corpora. 1511-1515 - Imran Ahmed, Sunil Kumar Kopparapu:
Technique for automatic sentence level alignment of long speech and transcripts. 1516-1519 - Sarah Hoffmann, Beat Pfister:
Text-to-speech alignment of long recordings using universal phone models. 1520-1524 - Adriana Stan, Peter Bell, Junichi Yamagishi, Simon King:
Lightly supervised discriminative training of grapheme models for improved sentence-level alignment of speech and text data. 1525-1529 - Ashtosh Sapru, Hervé Bourlard:
Automatic social role recognition in professional meetings using conditional random fields. 1530-1534 - Christoph Draxler, Hanna S. Feiser:
Same same but different - an acoustical comparison of the automatic segmentation of high quality and mobile telephone speech. 1535-1539
Speech Synthesis - Prosody and Emotion
- Yongguo Kang, Jian Li, Yan Deng, Miaomiao Wang:
Multi-centroidal duration generation algorithm for HMM-based TTS. 1540-1543 - Tuomo Raitio, Antti Suni, Jouni Pohjalainen, Manu Airaksinen, Martti Vainio, Paavo Alku:
Analysis and synthesis of shouted speech. 1544-1548 - Tomohiro Nagata, Hiroki Mori, Takashi Nose:
Robust estimation of multiple-regression HMM parameters for dimension-based expressive dialogue speech synthesis. 1549-1553 - Sandrine Brognaux, Benjamin Picart, Thomas Drugman:
A new prosody annotation protocol for live sports commentaries. 1554-1558 - Mahnoosh Mehrabani, Taniya Mishra, Alistair Conkie:
Unsupervised prominence prediction for speech synthesis. 1559-1563 - Marcela Charfuelan, Ingmar Steiner:
Expressive speech synthesis in MARY TTS using audiobook data and EmotionML. 1564-1568
Spoken Language Information Retrieval
- Nigel G. Ward, Steven D. Werner:
Using dialog-activity similarity for spoken information retrieval. 1569-1573 - I-Fan Chen, Chin-Hui Lee:
A hybrid HMM/DNN approach to keyword spotting of short words. 1574-1578 - Jonathan Wintrode:
Leveraging locality for topic identification of conversational speech. 1579-1583 - Grégory Senay, Benjamin Bigot, Richard Dufour, Georges Linarès, Corinne Fredouille:
Person name spotting by combining acoustic matching and LDA topic models. 1584-1588 - György Szaszák, András Beke:
Using phonological phrase segmentation to improve automatic keyword spotting for the highly agglutinating Hungarian language. 1589-1593 - Larry P. Heck, Dilek Hakkani-Tür, Gökhan Tür:
Leveraging knowledge graphs for web-scale unsupervised semantic parsing. 1594-1598
Speaker Recognition I, II
- Sandro Cumani, Pietro Laface:
Fast and memory effective i-vector extraction using a factorized sub-space. 1599-1603 - Konstantin Simonchik, Andrey Shulipa, Timur Pekhovsky:
Effective estimation of a multi-session speaker model using information on signal parameters. 1604-1608 - Ville Hautamäki, Kong-Aik Lee, David A. van Leeuwen, Rahim Saeidi, Anthony Larcher, Tomi Kinnunen, Taufiq Hasan, Seyed Omid Sadjadi, Gang Liu, Hynek Boril, John H. L. Hansen, Benoit G. B. Fauve:
Automatic regularization of cross-entropy cost for speaker recognition fusion. 1609-1613 - Ming Li, Jangwon Kim, Prasanta Kumar Ghosh, Vikram Ramanarayanan, Shrikanth S. Narayanan:
Speaker verification based on fusion of acoustic and articulatory information. 1614-1618 - David A. van Leeuwen, Niko Brümmer:
The distribution of calibrated likelihood-ratios in speaker recognition. 1619-1623 - Finnian Kelly, Niko Brümmer, Naomi Harte:
Eigenageing compensation for speaker verification. 1624-1628
Multimodal Speech Perception
- Grozdana Erjavec, Denis Legros:
Effects of mouth-only and whole-face displays on audio-visual speech perception in noise: is the vision of a talker's full face truly the most efficient solution? 1629-1633 - Kaisa Tiippana, Mikko Tiainen, Lari Vainio, Martti Vainio:
Acoustic and visual phonetic features in the McGurk effect - an audiovisual speech illusion. 1634-1638 - Chris Davis, Jeesun Kim:
The effect of visual speech timing and form cues on the processing of speech and nonspeech. 1639-1642 - Ganesh Attigodu Chandrashekara, Frédéric Berthommier, Olha Nahorna, Jean-Luc Schwartz:
Effect of context, rebinding and noise, on audiovisual speech fusion. 1643-1647 - Albert Rilliard, Donna Erickson, Takaaki Shochi, João Antônio de Moraes:
Social face to face communication - American English attitudinal prosody. 1648-1652 - Gérard Bailly, Amélie Rochet-Capellan, Coriandre Vilain:
Adaptation of respiratory patterns in collaborative reading. 1653-1657
Speech Analysis I-IV
- John Kane, Stefan Scherer, Louis-Philippe Morency, Christer Gobl:
A comparative study of glottal open quotient estimation techniques. 1658-1662 - Christian H. Kasess, Wolfgang Kreuzer:
Estimation of multiple-branch vocal tract models: the influence of prior assumptions. 1663-1667 - Jürgen T. Geiger, Florian Eyben, Björn W. Schuller, Gerhard Rigoll:
Detecting overlapping speech with long short-term memory recurrent neural networks. 1668-1672 - Akira Sasou:
Evaluation of fundamental validity in applying AR-HMM with automatic topology generation to pathology voice analysis. 1673-1676 - Nagaraj Adiga, S. R. M. Prasanna:
Significance of instants of significant excitation for source modeling. 1677-1681 - Devanshu Arya, Anant Raj, Rajesh M. Hegde:
Significance of variable height-bandwidth group delay filters in the spectral reconstruction of speech. 1682-1686 - Hemant A. Patil, Tanvina B. Patel:
Nonlinear prediction of speech signal using Volterra-Wiener series. 1687-1691 - Aharon Satt, Alexander Sorin, Orith Toledo-Ronen, Oren Barkan, Ioannis Kompatsiaris, Athina Kokonozi, Magda Tsolaki:
Evaluation of speech-based protocol for detection of early-stage dementia. 1692-1696 - Elias Azarov, Maxim Vashkevich, Alexander A. Petrovsky:
Instantaneous harmonic representation of speech using multicomponent sinusoidal excitation. 1697-1701 - Onur Babacan, Thomas Drugman, Nicolas D'Alessandro, Nathalie Henrich, Thierry Dutoit:
A quantitative comparison of glottal closure instant estimation algorithms on a large variety of singing sounds. 1702-1706 - Jorge Andrés Gómez García, Juan Ignacio Godino-Llorente, Germán Castellanos-Domínguez:
Automatic gender recognition in normal and pathological speech. 1707-1711 - Shanqing Cai, H. Timothy Bunnell, Rupal Patel:
Unsupervised vocal-tract length estimation through model-based acoustic-to-articulatory inversion. 1712-1716 - Sayeh Mirzaei, Hugo Van hamme, Yaser Norouzi:
Model order estimation using Bayesian NMF for discovering phone patterns in spoken utterances. 1717-1721
ASR - Feature Extraction
- László Tóth:
Convolutional deep rectifier neural nets for phone recognition. 1722-1726 - Hans-Günter Hirsch:
Pitch synchronous spectral analysis for a pitch dependent recognition of voiced phonemes - PISAR. 1727-1731 - José Luis Oropeza Rodríguez:
New parameters for automatic speech recognition based on the mammalian cochlea model using resonance analysis. 1732-1736 - Navdeep Jaitly, Geoffrey E. Hinton:
Using an autoencoder with deformable templates to discover features for automated speech recognition. 1737-1740 - Ching-feng Yeh, Hung-yi Lee, Lin-Shan Lee:
Speaking rate normalization with lattice-based context-dependent phoneme duration modeling for personalized speech recognizers on mobile devices. 1741-1745 - Jun Qi, Dong Wang, Javier Tejedor:
Subspace models for bottleneck features. 1746-1750 - Jun Qi, Dong Wang, Ji Xu, Javier Tejedor:
Bottleneck features based on gammatone frequency cepstral coefficients. 1751-1755 - Pavel Golik, Patrick Doetsch, Hermann Ney:
Cross-entropy vs. squared error training: a theoretical and experimental comparison. 1756-1760 - Vaishali Patil, Preeti Rao:
Acoustic features for detection of phonemic aspiration in voiced plosives. 1761-1765 - Dimitri Palaz, Ronan Collobert, Mathew Magimai-Doss:
Estimating phoneme class conditional probabilities from raw speech signal using convolutional neural networks. 1766-1770 - Javier Mikel Olaso, M. Inés Torres:
Hierarchical models based on a continuous acoustic space to identify phonological features. 1771-1775 - Vikrant Singh Tomar, Richard C. Rose:
Locality sensitive hashing for fast computation of correlational manifold learning based feature space transformations. 1776-1780 - Thomas Schatz, Vijayaditya Peddinti, Francis R. Bach, Aren Jansen, Hynek Hermansky, Emmanuel Dupoux:
Evaluating speech features with the minimal-pair ABX task: analysis of the classical MFC/PLP pipeline. 1781-1785
ASR - Pronunciation, Prosodic and New Paradigms
- Chen-Yu Chiang, Sabato Marco Siniscalchi, Sin-Horng Chen, Chin-Hui Lee:
Knowledge integration for improving performance in LVCSR. 1786-1790 - Martin Heckmann:
Inter-speaker variability in audio-visual classification of word prominence. 1791-1795 - Shilin Liu, Khe Chai Sim:
Parameter clustering for temporally varying weight regression for automatic speech recognition. 1796-1800 - Tanel Alumäe, Rena Nemoto:
Phone duration modeling using clustering of rich contexts. 1801-1805 - Farzaneh Ahmadi, Mousa Ahmadi, Ian McLoughlin:
Human mouth state detection using low frequency ultrasound. 1806-1810 - Kun Li, Xiaojun Qian, Shiyin Kang, Helen Meng:
Lexical stress detection for L2 English speech using deep belief networks. 1811-1815 - Yanmin Qian, Jia Liu:
MLP-HMM two-stage unsupervised training for low-resource languages on conversational telephone speech recognition. 1816-1820 - Josef R. Novak, Nobuaki Minematsu, Keikichi Hirose:
Failure transitions for joint n-gram models and G2P conversion. 1821-1825 - Hirokazu Kameoka, Kota Yoshizato, Tatsuma Ishihara, Yasunori Ohishi, Kunio Kashino, Shigeki Sagayama:
Generative modeling of speech F0 contours. 1826-1830 - Marelie H. Davel, Charl Johannes van Heerden, Etienne Barnard:
G2P variant prediction techniques for ASR and STD. 1831-1835 - Jin Jin, Joseph Tepperman:
Rhythm analysis of second-language speech through low-frequency auditory features. 1836-1839 - Yuzong Liu, Katrin Kirchhoff:
Graph-based semi-supervised learning for phone and segment classification. 1840-1843 - Ao Shen, Neil Cooke, Martin J. Russell:
Selective use of gaze information to improve ASR performance in noisy environments by cache-based class language model adaptation. 1844-1848 - Ossama Abdel-Hamid, Li Deng, Dong Yu, Hui Jiang:
Deep segmental neural networks for speech recognition. 1849-1853 - Martine Coene, Annemiek Hammer, Wojtek Kowalczyk, Louis ten Bosch, Bart Vaerenberg, Paul Govaerts:
Quantifying cross-linguistic variation in grapheme-to-phoneme mapping. 1854-1857
Show and Tell Sessions 1-3
- Florian Metze, Eric Fosler-Lussier, Rebecca Bates:
The speech recognition virtual kitchen. 1858-1860 - John Chen, Shufei Wen, Vivek Kumar Rangarajan Sridhar, Srinivas Bangalore:
Multilingual web conferencing using speech-to-speech translation. 1861-1863 - Emmanuel Ferragne, Sébastien Flavier, Christian Fressard:
ROCme! software for the recording and management of speech corpora. 1864-1865 - Felix Burkhardt:
Voice search in mobile applications with the rootvole framework. 1866-1868 - John S. Novak III, Jason Archer, Valeriy Shafiro, Robert V. Kenyon, Jason Leigh:
On-line audio dilation for human interaction. 1869-1871 - Pejman Mowlaee, Mario Kaoru Watanabe, Rahim Saeidi:
Phase-aware single-channel speech enhancement. 1872-1874 - Hiroko Hirano, Ibuki Nakamura, Nobuaki Minematsu, Masayuki Suzuki, Chieko Nakagawa, Noriko Nakamura, Yukinori Tagawa, Keikichi Hirose, Hiroya Hashimoto:
A free online accent and intonation dictionary for teachers and learners of Japanese. 1875-1876 - Maria Astrinaki, Junichi Yamagishi, Simon King, Nicolas D'Alessandro, Thierry Dutoit:
Reactive accent interpolation through an interactive map application. 1877-1878 - Kay Berkling:
A non-experts user interface for obtaining automatic diagnostic spelling evaluations for learners of the German writing system. 1879-1881
Dialog Systems
- Tatsuya Kawahara, Soichiro Hayashi, Katsuya Takanashi:
Estimation of interest and comprehension level of audience through multi-modal behaviors in poster conversations. 1882-1885 - Wenping Hu, Yao Qian, Frank K. Soong:
A new DNN-based high quality pronunciation evaluation for computer-aided language learning (CALL). 1886-1890 - Joaquin Planells, Lluís F. Hurtado, Encarna Segarra, Emilio Sanchis:
A multi-domain dialog system to integrate heterogeneous spoken dialog systems. 1891-1895 - Yuki Todo, Ryota Nishimura, Kazumasa Yamamoto, Seiichi Nakagawa:
Development and evaluation of spoken dialog systems with one or two agents. 1896-1900 - Gabriel Skantze, Catharine Oertel, Anna Hjalmarsson:
User feedback in human-robot interaction: prosody, gaze and timing. 1901-1905 - Yongxin Taylor Xi, Matthias Paulik, Venkata Ramana Rao Gadde, Ananth Sankar:
KPCatcher - a keyphrase extraction system for enterprise videos. 1906-1910
Speech Analysis I-IV
- Malcolm Slaney, Elizabeth Shriberg, Jui-Ting Huang:
Pitch-gesture modeling using subband autocorrelation change detection. 1911-1915 - P. Gangamohan, Sudarsana Reddy Kadiri, B. Yegnanarayana:
Analysis of emotional speech at subsegmental level. 1916-1920 - Masanori Morise, Hideki Kawahara, Kenji Ozawa:
Periodicity extraction for voiced sounds with multiple periodicity. 1921-1925 - John H. Taylor, Ben Milner:
Modelling and estimation of the fundamental frequency of speech using a hidden Markov model. 1926-1930 - Jouni Pohjalainen, Paavo Alku:
Extended weighted linear prediction using the autocorrelation snapshot - a robust speech analysis method and its application to recognition of vocal emotions. 1931-1935 - Meysam Asgari, Izhak Shafran:
Improving the accuracy and the robustness of harmonic model for pitch estimation. 1936-1940
ASR - Pronunciation Variants and Modeling
- Meixu Song, Qingqing Zhang, Jielin Pan, Yonghong Yan:
Discriminative pronunciation modeling based on minimum phone error training. 1941-1945 - Keigo Kubo, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Grapheme-to-phoneme conversion based on adaptive regularization of weight vectors. 1946-1950 - Tofigh Naghibi, Sarah Hoffmann, Beat Pfister:
An efficient method to estimate pronunciation from multiple utterances. 1951-1955 - Willem D. Basson, Marelie H. Davel:
Category-based phoneme-to-grapheme transliteration. 1956-1960 - Preethi Jyothi, Eric Fosler-Lussier, Karen Livescu:
Discriminative training of WFST factors with application to pronunciation modeling. 1961-1965 - Penny Karanasou, François Yvon, Thomas Lavergne, Lori Lamel:
Discriminative training of a phoneme confusion model for a dynamic lexicon in ASR. 1966-1970
Speaker Recognition Evaluation
- Craig S. Greenberg, Vincent M. Stanford, Alvin F. Martin, Meghana Yadagiri, George R. Doddington, John J. Godfrey, Jaime Hernandez-Cordero:
The 2012 NIST speaker recognition evaluation. 1971-1975 - Niko Brümmer, George R. Doddington:
Likelihood-ratio calibration using prior-weighted proper scoring rules. 1976-1980 - Luciana Ferrer, Mitchell McLaren, Nicolas Scheffer, Yun Lei, Martin Graciarena, Vikramjit Mitra:
A noise-robust system for NIST 2012 speaker recognition evaluation. 1981-1985 - Rahim Saeidi, Kong-Aik Lee, Tomi Kinnunen, Tawfik Hasan, Benoit G. B. Fauve, Pierre-Michel Bousquet, Elie Khoury, Pablo Luis Sordo Martinez, Jia Min Karen Kua, Changhuai You, Hanwu Sun, Anthony Larcher, Padmanabhan Rajan, Ville Hautamäki, Cemal Hanilçi, Billy Braithwaite, Rosa González Hautamäki, Seyed Omid Sadjadi, Gang Liu, Hynek Boril, Navid Shokouhi, Driss Matrouf, Laurent El Shafey, Pejman Mowlaee, Julien Epps, Tharmarajah Thiruvaran, David A. van Leeuwen, Bin Ma, Haizhou Li, John H. L. Hansen, Jean-François Bonastre, Sébastien Marcel, John S. D. Mason, Eliathamby Ambikairajah:
I4U submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification. 1986-1990 - Hanwu Sun, Bin Ma:
Improved unsupervised NAP training dataset design for speaker recognition. 1991-1995 - Daniele Colibro, Claudio Vair, Kevin Farrell, Nir Krause, Gennady Karvitsky, Sandro Cumani, Pietro Laface:
Nuance - Politecnico di Torino's 2012 NIST speaker recognition evaluation system. 1996-2000
Physiology and Models of Speech Production
- Gang Chen, Marc Garellek, Jody Kreiman, Bruce R. Gerratt, Abeer Alwan:
A perceptually and physiologically motivated voice source model. 2001-2005 - Caitlin Smith, Michael I. Proctor, Khalil Iskarous, Louis Goldstein, Shrikanth S. Narayanan:
Stable articulatory tasks and their variable formation: Tamil retroflex consonants. 2006-2009 - Vikram Ramanarayanan, Adam C. Lammert, Louis Goldstein, Shrikanth S. Narayanan:
Articulatory settings facilitate mechanically advantageous motor control of vocal tract articulators. 2010-2013 - Amélie Rochet-Capellan, Susanne Fuchs:
The interplay of linguistic structure and breathing in German spontaneous speech. 2014-2018 - Takayuki Arai:
Physical models of the vocal tract with a flapping tongue for flap and liquid sounds. 2019-2023 - Yves Laprie, Matthieu Loosvelt, Shinji Maeda, Rudolph Sock, Fabrice Hirsch:
Articulatory copy synthesis from cine x-ray films. 2024-2028
Speech Science in End-User Applications
- Jerome R. Bellegarda:
Large-scale personal assistant technology deployment: the Siri experience. 2029-2033 - Benjamin Weiss, Simon Willkomm, Sebastian Möller:
Evaluating an adaptive dialog system for the public. 2034-2038 - Jort F. Gemmeke, Bart Ons, Netsanet M. Tessema, Hugo Van hamme, Janneke van de Loo, Guy De Pauw, Walter Daelemans, Jonathan Huyghe, Jan Derboven, Lode Vuegen, Bert Van Den Broeck, Peter Karsmakers, Bart Vanrumste:
Self-taught assistive vocal interfaces: an overview of the ALADIN project. 2039-2043 - Florian Eyben, Felix Weninger, Björn W. Schuller:
Affect recognition in real-life acoustic conditions - a new perspective on feature selection. 2044-2048 - Emanuele Principi, Stefano Squartini, Francesco Piazza, Danilo Fuselli, Maurizio Bonifazi:
A distributed system for recognizing home automation commands and distress calls in the Italian language. 2049-2053 - Nina Zinovieva, Xiaodan Zhuang, Pat Peterson, Joe Alwan, Rohit Prasad:
Probabilistic trainable segmenter for call center audio using multiple features. 2054-2058 - Felix Burkhardt, Hans Ulrich Nägeli:
Voice search in mobile applications and the use of linked open data. 2059-2061 - Michel Vacher, Benjamin Lecouteux, Dan Istrate, Thierry Joubert, François Portet, Mohamed A. Sehili, Pedro Chahuara:
Evaluation of a real-time voice order recognition system from multiple audio channels in a home. 2062-2064 - Frédéric Aman, Michel Vacher, Solange Rossato, François Portet:
In-home detection of distress calls: the case of aged users. 2065-2067 - Ding Liu, Anthea Cheung, Anna Margolis, Patrick Redmond, Jun-Won Suh, Chao Wang:
Data driven methods for utterance semantic tagging. 2068-2070 - Evandro Gouvêa, Antonio Moreno-Daniel, A. Reddy, Rathinavelu Chengalvarayan, David L. Thomson, Andrej Ljolje:
The AT&T speech API: a study on practical challenges for customized speech to text service. 2071-2073 - Bart D'hoore, Alfred Wiesen:
In-vehicle destination entry by voice: practical aspects. 2074-2076
Perception of Non Native Sounds
- Aurore Gautreau, Michel Hoen, Fanny Meunier:
Intelligibility at a multilingual cocktail party: effect of concurrent language knowledge. 2077-2080 - Ewa Jacewicz, Robert Allen Fox:
Regional accents affect speech intelligibility in a multitalker environment. 2081-2085 - Shinichi Tokuma, Won Tokuma:
Perception of English minimal pairs in noise by Japanese listeners: does clear speech for L2 listeners help? 2086-2090 - Bianca Sisinni, Paola Escudero, Mirko Grimaldi:
Salento Italian listeners' perception of American English vowels. 2091-2094 - Andréia Schurt Rauber, Anabela Rato, Denise Cristina Kluge, Giane Rodrigues dos Santos:
TP 3.1 software: a tool for designing audio, visual, and audiovisual perceptual training tasks and perception tests. 2095-2098 - Fei Chen, Junfeng Li, Lena L. N. Wong, Yonghong Yan:
Effect of linguistic masker on the intelligibility of Mandarin sentences. 2099-2102 - Kyuwon Moon, Meghan Sumner:
The learning and generalization of contrasts consistent or inconsistent with native biases. 2103-2107 - Jia Ying, Jason A. Shaw, Catherine T. Best:
L2 English learners' recognition of words spoken in familiar versus unfamiliar English accents. 2108-2112 - Janice Wing Sze Wong:
The effects of perceptual and/or productive training on the perception and production of English vowels /ɪ/ and /iː/ by Cantonese ESL learners. 2113-2117 - Natalia Kartushina, Ulrich H. Frauenfelder:
On the role of L1 speech production in L2 perception: evidence from Spanish learners of French. 2118-2122 - Pierre A. Hallé, Natalia Kartushina, Juan Segui, Ulrich H. Frauenfelder:
Looking for lexical feedback effects in /tl/→/kl/ repairs. 2123-2127 - Catherine T. Best, Jason A. Shaw, Elizabeth Clancy:
Recognizing words across regional accents: the role of perceptual assimilation in lexical competition. 2128-2132
Speech Disorders - Data and Methodology
- David Martínez González, Phil D. Green, Heidi Christensen:
Dysarthria intelligibility assessment in a factor analysis total variability space. 2133-2137 - Alain Ghio, Médéric Gasquet-Cyrus, Juliette Roquel, Antoine Giovanni:
Perceptual interference between regional accent and voice/speech disorders. 2138-2142 - Ingrida Balciuniene:
Linguistic disfluency in narrative speech: evidence from story-telling in 6-year olds. 2143-2146 - Benjamin Munson:
Assessing the utility of judgments of children's speech production made by untrained listeners in uncontrolled listening environments. 2147-2151 - Tanja Kocjancic Antolík, Cécile Fougeron:
Consonant distortions in dysarthria due to Parkinson's disease, amyotrophic lateral sclerosis and cerebellar ataxia. 2152-2156 - Marine Verdurand, Solange Rossato, Lionel Granjon, Daria Balbo, Claudio Zmarich:
Study of coarticulation and F2 transitions in French and Italian adult stutterers. 2157-2161 - Renee Peje Clapham, Corina J. van As-Brooks, Michiel W. M. van den Brekel, Frans J. M. Hilgers, R. J. J. H. van Son:
Automatic tracheoesophageal voice typing using acoustic parameters. 2162-2166 - Julie Mauclair, Lionel Koenig, Marina Robert, Peggy Gatignol:
Burst-based features for the classification of pathological voices. 2167-2171 - Brian S. Helfer, Thomas F. Quatieri, James R. Williamson, Daryush D. Mehta, Rachelle Horwitz, Bea Yu:
Classification of depression state based on articulatory precision. 2172-2176 - Kathleen C. Fraser, Frank Rudzicz, Elizabeth Rochon:
Using text and acoustic features to diagnose progressive aphasia and its subtypes. 2177-2181
Search and Computational Issues in LVCSR
- Tanel Alumäe:
Multi-domain neural network language model. 2182-2186 - Yanhua Long, Mark J. F. Gales, Pierre Lanchantin, Xunying Liu, Matthew Stephen Seigel, Philip C. Woodland:
Improving lightly supervised training for broadcast transcription. 2187-2191 - Christophe Cerisara, Alejandra Lorenzo, Pavel Král:
Weakly supervised parsing with rules. 2192-2196 - Markus Nußbaum-Thom, Eugen Beck, Tamer Alkhouli, Ralf Schlüter, Hermann Ney:
Relative error bounds for statistical classifiers based on the f-divergence. 2197-2201 - Melvin Jose Johnson Premkumar, Ngoc Thang Vu, Tanja Schultz:
Experiments towards a better LVCSR system for Tamil. 2202-2206 - Kwanchiva Thangthai, Ananlada Chotimongkol, Chai Wutiwiwatchai:
A hybrid language model for open-vocabulary Thai LVCSR. 2207-2211 - Jen-Tzung Chien, Ying-Lan Chang:
Hierarchical Pitman-Yor and Dirichlet process for language model. 2212-2216 - Taichi Asami, Satoshi Kobashikawa, Hirokazu Masataki, Osamu Yoshioka, Satoshi Takahashi:
Unsupervised confidence calibration using examples of recognized words and their contexts. 2217-2221 - Zoltán Tüske, Ralf Schlüter, Hermann Ney:
Multilingual hierarchical MRASTA features for ASR. 2222-2226 - Harry M. Chang:
Heuristic selection of training sentences from historical TV guide for semi-supervised LM adaptation. 2227-2231 - Dominique Fohr, Odile Mella:
Combination of random indexing based language model and n-gram language model for speech recognition. 2232-2236 - Yajie Miao, Florian Metze:
Improving low-resource CD-DNN-HMM using dropout and multilingual DNN training. 2237-2241 - Long Qin, Alexander I. Rudnicky:
Finding recurrent out-of-vocabulary words. 2242-2246 - Justin T. Chiu, Alexander I. Rudnicky:
Using conversational word bursts in spoken term detection. 2247-2251
Speech and Hearing Disorders
- Audrey Acher, Marc Sato, Laurent Lamalle, Coriandre Vilain, Arnaud Attye, Alexandre Krainik, Georges Bettega, Christian Adrien Righini, Brice Carlot, Muriel Brix, Pascal Perrier:
Brain activations in speech recovery process after intra-oral surgery: an fMRI study. 2252-2256 - Christophe Mertens, Jean Schoentgen, Francis Grenez, Sabine Skodda:
Acoustic and perceptual analysis of vocal tremor. 2257-2261 - Charturong Tantibundhit, Chutamanee Onsuwan, Nittayapa Klangpornkun, P. Phienphanich, Tanawan Saimai, Nantaporn Saimai, P. Pitathawatchai, Chai Wutiwiwatchai:
Lexical tone perception in Thai normal-hearing adults and those using hearing aids: a case study. 2262-2266 - Takayuki Kagomiya, Seiji Nakagawa:
Evaluation of a bone-conducted ultrasonic hearing aid in vocal emotion transmission. 2267-2271 - Luigia Garrapa, Davide Bottari, Mirko Grimaldi, Francesco Pavani, Andrea Calabrese, Michele De Benedetto, Silvano Vitale:
Processing of /i/ and /u/ in Italian cochlear-implant children: a behavioral and neurophysiologic study. 2272-2276 - Stefano Cosentino, Tiago H. Falk, David McAlpine:
Predicting the bilateral advantage in cochlear implantees using a non-intrusive speech intelligibility measure. 2277-2281
Speech and Audio Segmentation
- Zhen Huang, You-Chi Cheng, Kehuang Li, Ville Hautamäki, Chin-Hui Lee:
A blind segmentation approach to acoustic event detection based on i-vector. 2282-2286 - Van Zyl van Vuuren, Louis ten Bosch, Thomas Niesler:
A dynamic programming framework for neural network-based automatic speech segmentation. 2287-2291 - RaviShankar Prasad, B. Yegnanarayana:
Acoustic segmentation of speech using zero time liftering (ZTL). 2292-2296 - Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
Unsupervised mining of acoustic subword units with segment-level Gaussian posteriorgrams. 2297-2301 - Ozlem Kalinli:
Combination of auditory attention features with phone posteriors for better automatic phoneme segmentation. 2302-2305 - Jiahong Yuan, Neville Ryant, Mark Liberman, Andreas Stolcke, Vikramjit Mitra, Wen Wang:
Automatic phonetic segmentation using boundary models. 2306-2310
Speech Synthesis - Various Topics
- Thi Thu Trang Nguyen, Christophe d'Alessandro, Albert Rilliard, Do Dat Tran:
HMM-based TTS for Hanoi Vietnamese: issues in design and evaluation. 2311-2315 - Tuomo Raitio, John Kane, Thomas Drugman, Christer Gobl:
HMM-based synthesis of creaky voice. 2316-2320 - Xiaoxuan Wang, Khe Chai Sim:
Integrating conditional random fields and joint multi-gram model with syllabic features for grapheme-to-phone conversion. 2321-2325 - Patrick Lehnen, Alexandre Allauzen, Thomas Lavergne, François Yvon, Stefan Hahn, Hermann Ney:
Structure learning in hidden conditional random fields for grapheme-to-phoneme conversion. 2326-2330 - Adriana Stan, Oliver Watts, Yoshitaka Mamiya, Mircea Giurgiu, Robert A. J. Clark, Junichi Yamagishi, Simon King:
TUNDRA: a multilingual corpus of found data for TTS research created with light supervision. 2331-2335 - Ranniery Maia, Mark J. F. Gales, Yannis Stylianou, Masami Akamine:
Minimum mean squared error based warped complex cepstrum analysis for statistical parametric speech synthesis. 2336-2340
ASR - Discriminative Training
- Yasser Hifny:
Augmented conditional random fields modeling based on discriminatively trained features. 2341-2344 - Karel Veselý, Arnab Ghoshal, Lukás Burget, Daniel Povey:
Sequence-discriminative training of deep neural networks. 2345-2349 - Weibin Zhang, Pascale Fung:
Discriminatively trained sparse inverse covariance matrices for low resource acoustic modeling. 2350-2354 - Yuuki Tachioka, Shinji Watanabe:
Discriminative training of acoustic models for system combination. 2355-2359 - Yan Huang, Dong Yu, Yifan Gong, Chaojun Liu:
Semi-supervised GMM and DNN acoustic model training with multi-system combination and confidence re-calibration. 2360-2364 - Jian Xue, Jinyu Li, Yifan Gong:
Restructuring of deep neural network acoustic models with singular value decomposition. 2365-2369
L2 Acquisition, Multilingualism
- Nancy F. Chen, Vivaek Shivakumar, Mahesh Harikumar, Bin Ma, Haizhou Li:
Large-scale characterization of Mandarin pronunciation errors made by native speakers of European languages. 2370-2374 - Véronique Delvaux, Kathy Huet, Myriam Piccaluga, Bernard Harmegnies:
Production training in second language acquisition: a comparison between objective measures and subjective judgments. 2375-2379 - Nicole Netelenbos, Fangfang Li:
The production and perception of voice onset time in English-speaking children enrolled in a French immersion program. 2380-2384 - Pepi Burgos, Catia Cucchiarini, Roeland van Hout, Helmer Strik:
Pronunciation errors by Spanish learners of Dutch: a data-driven study for ASR-based pronunciation training. 2385-2389 - Calbert Graham, Brechtje Post:
Realisation of tonal alignment in the English of Japanese-English late bilinguals. 2390-2394 - Agathe Benoist-Lucy, Claire Pillot-Loiseau:
The influence of language and speech task upon creaky voice use among six young American women learning French. 2395-2399
Child Computer Interaction (Special Session)
- Daniel Bone, Chi-Chun Lee, Theodora Chaspari, Matthew P. Black, Marian E. Williams, Sungbok Lee, Pat Levitt, Shrikanth S. Narayanan:
Acoustic-prosodic, turn-taking, and language cues in child-psychologist interactions for varying social demand. 2400-2404 - Hynek Boril, Qian Zhang, Pongtep Angkititrakul, John H. L. Hansen, Dongxin Xu, Jill Gilkerson, Jeffrey A. Richards:
A preliminary study of child vocalization on a parallel corpus of US and Shanghainese toddlers. 2405-2409 - Felix Claus, Hamurabi Gamboa Rosales, Rico Petrick, Horst-Udo Hain, Rüdiger Hoffmann:
A survey about databases of children's speech. 2410-2414 - Vassiliki Kouloumenta, Manolis Perakakis, Alexandros Potamianos:
Affective evaluation of multimodal dialogue games for preschoolers using physiological signals. 2415-2419 - Md. Jahangir Alam, Yazid Attabi, Pierre Dumouchel, Patrick Kenny, Douglas D. O'Shaughnessy:
Amplitude modulation features for emotion recognition from speech. 2420-2424 - Daniel Bone, Chi-Chun Lee, Vikram Ramanarayanan, Shrikanth S. Narayanan, Renske S. Hoedemaker, Peter C. Gordon:
Analyzing eye-voice coordination in rapid automatized naming. 2425-2429 - Theodora Chaspari, Emily Mower Provost, Shrikanth S. Narayanan:
Analyzing the structure of parent-moderated narratives from children with ASD using an entity-based approach. 2430-2434 - Keelan Evanini, Xinhao Wang:
Automated speech scoring for non-native middle school students with multiple task types. 2435-2439 - Saeid Safavi, Peter Jancovic, Martin J. Russell, Michael J. Carey:
Identification of gender from children's speech by computers and humans. 2440-2444 - Takayuki Arai:
On why Japanese /r/ sounds are difficult for children to acquire. 2445-2449
Speaker Recognition I, II
- Achintya Kumar Sarkar, Claude Barras:
Anchor and UBM-based multi-class MLLR m-vector system for speaker verification. 2450-2454 - Leibny Paola García-Perera, Bhiksha Raj, Juan Arturo Nolazco-Flores:
Ensemble approach in speaker verification. 2455-2459 - Jun Wang, Dong Wang, Xiaojun Wu, Thomas Fang Zheng, Javier Tejedor:
Sequential model adaptation for speaker verification. 2460-2464 - Ahilan Kanagasundaram, David Dean, Javier Gonzalez-Dominguez, Sridha Sridharan, Daniel Ramos, Joaquin Gonzalez-Rodriguez:
Improving short utterance based i-vector speaker recognition using source and utterance-duration normalization techniques. 2465-2469 - Hagai Aronowitz, Oren Barkan:
On leveraging conversational data for building a text dependent speaker verification system. 2470-2473 - Wei-Qiang Zhang, Zhiyi Li, Weiwei Liu, Jia Liu:
THU-EE system fusion for the NIST 2012 speaker recognition evaluation. 2474-2478 - Daniel Garcia-Romero, Alan McCree:
Subspace-constrained supervector PLDA for speaker verification. 2479-2483 - Cong-Thanh Do, Claude Barras, Viet Bac Le, Achintya Kumar Sarkar:
Augmenting short-term cepstral features with long-term discriminative features for speaker verification of telephone data. 2484-2488 - Padmanabhan Rajan, Tomi Kinnunen, Cemal Hanilçi, Jouni Pohjalainen, Paavo Alku:
Using group delay functions from all-pole models for speaker recognition. 2489-2493 - José Portelo, Alberto Abad, Bhiksha Raj, Isabel Trancoso:
Secure binary embeddings of front-end factor analysis for privacy preserving speaker verification. 2494-2498 - Jalil Taghia, Zhanyu Ma, Arne Leijon:
On von-Mises Fisher mixture model in text-independent speaker identification. 2499-2503 - Mireia Díez, Amparo Varona, Mikel Peñagarikano, Luis Javier Rodríguez-Fuentes, Germán Bordel:
Using phone log-likelihood ratios as features for speaker recognition. 2504-2508 - Jesús Antonio Villalba López, Mireia Díez, Amparo Varona, Eduardo Lleida:
Handling recordings acquired simultaneously over multiple channels with PLDA. 2509-2513 - Xiao Fang, Najim Dehak, James R. Glass:
Bayesian distance metric learning on i-vector for speaker verification. 2514-2518 - Rosa González Hautamäki, Ville Hautamäki, Padmanabhan Rajan, Tomi Kinnunen:
Merging human and automatic system decisions to improve speaker recognition performance. 2519-2523
Dialog Systems and Applications I, II
- Kaisheng Yao, Geoffrey Zweig, Mei-Yuh Hwang, Yangyang Shi, Dong Yu:
Recurrent neural networks for language understanding. 2524-2528 - Korbinian Riedhammer, Van Hai Do, James Hieronymus:
A study on LVCSR and keyword search for Tagalog. 2529-2533 - Sharifa Alghowinem, Roland Goecke, Michael Wagner, Julien Epps, Gordon Parker, Michael Breakspear:
Characterising depressed speech for classification. 2534-2538 - Benjamin Bigot, Grégory Senay, Georges Linarès, Corinne Fredouille, Richard Dufour:
Combining acoustic name spotting and continuous context models to improve spoken person name recognition in speech. 2539-2543 - I-Fan Chen, Chin-Hui Lee:
A resource-dependent approach to word modeling for keyword spotting. 2544-2548 - Kathryn Womack, Cecilia Ovesdotter Alm, Cara Calvelli, Jeff B. Pelz, Pengcheng Shi, Anne R. Haake:
Markers of confidence and correctness in spoken medical narratives. 2549-2553 - Ibuki Nakamura, Nobuaki Minematsu, Masayuki Suzuki, Hiroko Hirano, Chieko Nakagawa, Noriko Nakamura, Yukinori Tagawa, Keikichi Hirose, Hiroya Hashimoto:
Development of a web framework for teaching and learning Japanese prosody: OJAD (online Japanese accent dictionary). 2554-2558 - Elizabeth Shriberg, Andreas Stolcke, Suman V. Ravuri:
Addressee detection for dialog systems using temporal and spectral dimensions of speaking style. 2559-2563 - Hiroaki Hatano, Miyako Kiso, Carlos Toshinori Ishi:
Analysis of factors involved in the choice of rising or non-rising intonation in question utterances appearing in conversational speech. 2564-2568 - Asli Celikyilmaz, Gökhan Tür, Dilek Hakkani-Tür:
IsNL? a discriminative approach to detect natural language like queries for conversational understanding. 2569-2573 - Jian Cheng, Nikhil Bojja, Xin Chen:
Automatic accent quantification of Indian speakers of English. 2574-2578 - Gökhan Tür, Anoop Deoras, Dilek Hakkani-Tür:
Semantic parsing using word confusion networks with conditional random fields. 2579-2583 - Sofia Strömbergsson, Anna Hjalmarsson, Jens Edlund, David House:
Timing responses to questions in dialogue. 2584-2588 - Martin Karafiát, Frantisek Grézl, Mirko Hannemann, Karel Veselý, Jan Cernocký:
BUT BABEL system for spontaneous Cantonese. 2589-2593 - Atta Norouzian, Richard C. Rose, Aren Jansen:
Semi-supervised manifold learning approaches for spoken term verification. 2594-2598 - Ying Li, Pascale Fung:
Language modeling for mixed language speech recognition using weighted phrase extraction. 2599-2603
Spoken Machine Translation and Speech Natural Language Processing I, II
- Janto Skowronek, Julian Herlinghaus, Alexander Raake:
Quality assessment of asymmetric multiparty telephone conferences: a systematic method from technical degradations to perceived impairments. 2604-2608 - Keisuke Imoto, Suehiro Shimauchi, Hisashi Uematsu, Hitoshi Ohmuro:
User activity estimation method based on probabilistic generative model of acoustic event sequence with user activity and its subordinate categories. 2609-2613 - Takatomo Kano, Shinnosuke Takamichi, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Generalizing continuous-space translation of paralinguistic information. 2614-2618 - Masaya Ohgushi, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
An empirical comparison of joint optimization techniques for speech translation. 2619-2623 - Mari Ostendorf, Sangyun Hahn:
A sequential repetition model for improved disfluency detection. 2624-2628 - Henrique Medeiros, Helena Moniz, Fernando Batista, Isabel Trancoso, Luís Nunes:
Disfluency detection based on prosodic features for university lectures. 2629-2633 - Bernd T. Meyer:
What's the difference? comparing humans and machines on the Aurora 2 speech recognition task. 2634-2638 - Michele Gubian, Lou Boves, Maarten Versteegh:
Calibration of distance measures for unsupervised query-by-example. 2639-2643 - Diego Castán, Murat Akbacak:
Indexing multimedia documents with acoustic concept recognition lattices. 2644-2648 - Spyros Kousidis, Thies Pfeiffer, David Schlangen:
MINT.tools: tools and adaptors supporting acquisition, annotation and analysis of multimodal corpora. 2649-2653
Show and Tell Sessions 1-3
- Robert A. J. Clark:
Simple4all. 2654-2656 - Grégoire Pointeau, Maxime Petit, Xavier Hinaut, Guillaume Gibert, Peter Ford Dominey:
On-line learning of lexical items and grammatical constructions via speech, gaze and action-based human-robot interaction. 2657-2659 - Haruko Miyakoda:
Development of a pronunciation training system based on auditory-visual elements. 2660-2661 - Elias Azarov, Maxim Vashkevich, Denis Likhachov, Alexander A. Petrovsky:
Real-time and non-real-time voice conversion systems with web interfaces. 2662-2663 - E. Csala, Géza Németh, Csaba Zainkó:
Application of the NAO humanoid robot in the treatment of bone marrow-transplanted children (demo). 2664-2666 - Vincent Wan, Robert Anderson, Art Blokland, Norbert Braunschweiler, Langzhou Chen, BalaKrishna Kolluru, Javier Latorre, Ranniery Maia, Björn Stenger, Kayoko Yanagisawa, Yannis Stylianou, Masami Akamine, Mark J. F. Gales, Roberto Cipolla:
Photo-realistic expressive text to talking head synthesis. 2667-2669 - Ian Maddieson, Sébastien Flavier, Egidio Marsico, François Pellegrino:
Demonstration of LAPSyd: Lyon-Albuquerque phonological systems database. 2670-2671 - Suzanne Boyce, Marisha Speights, Keiko Ishikawa, Joel MacAuslan:
SpeechMark acoustic landmark tool: application to voice pathology. 2672-2674 - Laurence Catanese, Nathan Souviraà-Labastie, Bingqing Qu, Sébastien Campion, Guillaume Gravier, Emmanuel Vincent, Frédéric Bimbot:
MODIS: an audio motif discovery software. 2675-2677
Language Model Adaptation
- Md. Akmal Haidar, Douglas D. O'Shaughnessy:
Fitting long-range information using interpolated distanced n-grams and cache models into a latent dirichlet language model for speech recognition. 2678-2682 - Yi-Wen Chen, Bo-Han Hao, Kuan-Yu Chen, Berlin Chen:
Incorporating proximity information for relevance language modeling in speech recognition. 2683-2687 - Ali Orkan Bayer, Giuseppe Riccardi:
Instance-based on-line language model adaptation. 2688-2692 - André Mansikkaniemi, Mikko Kurimo:
Unsupervised topic adaptation for morph-based speech recognition. 2693-2697 - Tim Schlippe, Lukasz Gren, Ngoc Thang Vu, Tanja Schultz:
Unsupervised language model adaptation for automatic speech recognition of broadcast news using web 2.0. 2698-2702 - Tsung-Hsien Wen, Aaron Heidel, Hung-yi Lee, Yu Tsao, Lin-Shan Lee:
Recurrent neural network based language model personalization by social network crowdsourcing. 2703-2707
Spoken Language Summarization and Understanding
- Moataz El Ayadi, Mohamed Afify:
Language-independent call routing using the large margin estimation principle. 2708-2712 - Anoop Deoras, Ruhi Sarikaya:
Deep belief network based semantic taggers for spoken language understanding. 2713-2717 - Bassam Jabaian, Fabrice Lefèvre:
Error-corrective discriminative joint decoding of automatic spoken language transcription and understanding. 2718-2722 - Catherine Lai, Jean Carletta, Steve Renals:
Detecting summarization hot spots in meetings using group level involvement and turn-taking features. 2723-2727 - Sz-Rung Shiang, Hung-yi Lee, Lin-Shan Lee:
Supervised spoken document summarization based on structured support vector machine with utterance clusters as hidden variables. 2728-2732 - Ioannis Klasinas, Alexandros Potamianos, Elias Iosif, Spiros Georgiladakis, Gianluca Mameli:
Web data harvesting for speech understanding grammar induction. 2733-2737
Speech Synthesis - Multimodal and Articulatory Synthesis
- Asterios Toutios, Shrikanth S. Narayanan:
Articulatory synthesis of French connected speech from EMA data. 2738-2742 - Xinjian Zhang, Lijuan Wang, Gang Li, Frank Seide, Frank K. Soong:
A new language independent, photo-realistic talking head driven by voice only. 2743-2747 - Chaoyang Wang, Lijuan Wang, Yasuyuki Matsushita, Bojun Huang, Magnetro Chen, Frank K. Soong:
Binocular photometric stereo acquisition and reconstruction for 3D talking head applications. 2748-2752 - Thomas Hueber, Gérard Bailly, Pierre Badin, Frédéric Elisei:
Speaker adaptation of an acoustic-articulatory inversion model using cascaded Gaussian mixture regressions. 2753-2757 - Atef Ben Youssef, Hiroshi Shimodaira, David Adam Braude:
Articulatory features for speech-driven head motion synthesis. 2758-2762 - David Adam Braude, Hiroshi Shimodaira, Atef Ben Youssef:
Template-warping based speech driven head motion synthesis. 2763-2767
Speaker Diarization and Recognition
- Anthony Larcher, Jean-François Bonastre, Benoit G. B. Fauve, Kong-Aik Lee, Christophe Lévy, Haizhou Li, John S. D. Mason, Jean-Yves Parfait:
ALIZE 3.0 - open source toolkit for state-of-the-art speaker recognition. 2768-2772 - Mohammed Senoussaoui, Patrick Kenny, Pierre Dumouchel, Najim Dehak:
New cosine similarity scorings to implement gender-independent speaker verification. 2773-2777 - Delphine Charlet, Corinne Fredouille, Géraldine Damnati, Grégory Senay:
Improving speaker identification in TV-shows using person name detection in overlaid text and speech. 2778-2782 - Mary Tai Knox, Nikki Mirghafori, Gerald Friedland:
Exploring methods of improving speaker accuracy for speaker diarization. 2783-2787 - Ryan Price, Sangeeta Biswas, Koichi Shinoda:
Combining deep speaker specific representations with GMM-SVM for speaker verification. 2788-2792 - Carola Schindler, Christoph Draxler:
Using spectral moments as a speaker specific feature in nasals and fricatives. 2793-2796
Models of Speech Perception
- Raphaël Laurent, Jean-Luc Schwartz, Pierre Bessière, Julien Diard:
A computational model of perceptuo-motor processing in speech perception: learning to imitate and categorize synthetic CV syllables. 2797-2801 - Rachel M. Theodore:
Talker-specific perceptual processing: influences on internal category structure. 2802-2806 - María Luisa García Lecumberri, Máté Attila Tóth, Yan Tang, Martin Cooke:
Elicitation and analysis of a corpus of robust noise-induced word misperceptions in Spanish. 2807-2811 - Anne Cutler, Laurence Bruggeman:
Vocabulary structure and spoken-word recognition: evidence from French reveals the source of embedding asymmetry. 2812-2816 - Odile Bagou, Ulrich H. Frauenfelder:
How do multiple sublexical cues converge in lexical segmentation? an artificial language learning study. 2817-2821 - Louis ten Bosch, Lou Boves, Mirjam Ernestus:
Towards an end-to-end computational model of speech comprehension: simulating a lexical decision task. 2822-2826
Paralinguistic Information I, II
- Sven Ewan Shepstone, Zheng-Hua Tan, Søren Holdt Jensen:
Demographic recommendation by means of group profile elicitation using speaker age and gender recognition. 2827-2831 - Nikos Malandrakis, Shiva Sundaram, Alexandros Potamianos:
Affective classification of generic audio clips using regression models. 2832-2836 - Je Hun Jeon, Duc Le, Rui Xia, Yang Liu:
A preliminary study of cross-lingual emotion recognition from speech: automatic classification versus human perception. 2837-2840 - Wenjing Han, Haifeng Li, Huabin Ruan, Lin Ma, Jiayin Sun, Björn W. Schuller:
Active learning for dimensional speech emotion recognition. 2841-2845 - Finnian Kelly, Naomi Harte:
Auditory detectability of vocal ageing and its effect on forensic automatic speaker recognition. 2846-2850 - Firoj Alam, Giuseppe Riccardi:
Comparative study of speaker personality traits recognition in conversational and broadcast news speech. 2851-2855 - Zixing Zhang, Jun Deng, Erik Marchi, Björn W. Schuller:
Active learning by label uncertainty for acoustic emotion recognition. 2856-2860 - Bo Xiao, Panayiotis G. Georgiou, Zac E. Imel, David C. Atkins, Shrikanth S. Narayanan:
Modeling therapist empathy and vocal entrainment in drug addiction counseling. 2861-2865 - Chiaki Miyazaki, Ryuichiro Higashinaka, Toshiro Makino, Yoshihiro Matsuo:
Estimating callers' levels of knowledge in call center dialogues. 2866-2870 - Juan Pablo Arias, Carlos Busso, Néstor Becerra Yoma:
Energy and F0 contour modeling with functional data analysis for emotional speech detection. 2871-2875 - Taniya Mishra, Dimitrios Dimitriadis:
Incremental emotion recognition. 2876-2880 - Cemal Hanilçi, Tomi Kinnunen, Padmanabhan Rajan, Jouni Pohjalainen, Paavo Alku, Figen Ertas:
Comparison of spectrum estimators in speaker verification: mismatch conditions induced by vocal effort. 2881-2885 - Rui Xia, Yang Liu:
Using denoising autoencoder for emotion recognition. 2886-2889
Speech and Audio Signal Processing
- Georgios Athanasopoulos, Werner Verhelst:
A phase-modified approach for TDE-based acoustic localization. 2890-2894 - Wei Xue, Shan Liang, Wenju Liu:
Interference robust DOA estimation of human speech by exploiting historical information and temporal correlation. 2895-2899 - Naomi Harte, Sadhbh Murphy, David J. Kelly, Nicola M. Marples:
Identifying new bird species from differences in birdsong. 2900-2904 - Yuri Nishigaki, Ken-Ichi Sakakibara, Masanori Morise, Ryuichi Nisimura, Toshio Irino, Hideki Kawahara:
Controlling "shout" expression in a Japanese POP singing performance: analysis and suppression study. 2905-2909 - Mahnoosh Mehrabani, John H. L. Hansen:
Dimensionality analysis of singing speech based on locality preserving projections. 2910-2914 - Md. Khademul Islam Molla, Keikichi Hirose:
Audio classification using dominant spatial patterns in time-frequency space. 2915-2919 - Tse-En Lin, Chung-Chien Hsu, Yi-Cheng Chen, Jian-Hueng Chen, Tai-Shih Chi:
Spectro-temporal modulation based singing detection combined with pitch-based grouping for singing voice separation. 2920-2923 - Jimmy Ludeña-Choez, Ascensión Gallardo-Antolín:
NMF-based temporal feature integration for acoustic event classification. 2924-2928 - Shourabh Rawat, Peter F. Schulam, Susanne Burger, Duo Ding, Yipei Wang, Florian Metze:
Robust audio-codebooks for large-scale event detection in consumer videos. 2929-2933 - M. Umair Bin Altaf, Taras Butko, Biing-Hwang Juang:
Person identification using biometric markers from footsteps sound. 2934-2938 - Wiktor Mlynarski:
Learning binaural spectrogram features for azimuthal speaker localization. 2939-2942 - Youssef Oualil, Friedrich Faubel, Dietrich Klakow:
An unsupervised Bayesian classifier for multiple speaker detection and localization. 2943-2947 - Rupayan Chakraborty, Climent Nadeu:
Joint recognition and direction-of-arrival estimation of simultaneous meeting-room acoustic events. 2948-2952 - Xiaodan Zhuang, Shuang Wu, Pradeep Natarajan, Rohit Prasad, Prem Natarajan:
Audio self organized units for high-level event detection. 2953-2957
ASR - Robustness Against Noise I-III
- Yu-Chen Kao, Berlin Chen:
Distribution-based feature normalization for robust speech recognition leveraging context and dynamics cues. 2958-2962 - Shilin Liu, Khe Chai Sim:
An investigation of temporally varying weight regression for noise robust speech recognition. 2963-2967 - Yang Li, Xunying Liu, Lan Wang:
Feature space generalized variable parameter HMMs for noise robust recognition. 2968-2972 - Philemon Brakel, Dirk Stroobandt, Benjamin Schrauwen:
Bidirectional truncated recurrent neural networks for efficient speech denoising. 2973-2977 - Ehsan Variani, Feipeng Li, Hynek Hermansky:
Multi-stream recognition of noisy speech with performance monitoring. 2978-2981 - Masakiyo Fujimoto, Tomohiro Nakatani:
Model-based noise suppression using unsupervised estimation of hidden Markov model for non-stationary noise. 2982-2986 - Karan Nathwani, Rajesh M. Hegde:
Joint noise cancellation and dereverberation using multi-channel linearly constrained minimum variance filter. 2987-2991 - Marc Delcroix, Yotaro Kubo, Tomohiro Nakatani, Atsushi Nakamura:
Is speech enhancement pre-processing still relevant when using deep neural networks for acoustic modeling? 2992-2996 - Hsin-Ju Hsieh, Berlin Chen, Jeih-weih Hung:
Histogram equalization of real and imaginary modulation spectra for noise-robust speech recognition. 2997-3001 - Bo Li, Yu Tsao, Khe Chai Sim:
An investigation of spectral restoration algorithms for deep neural networks based noise robust speech recognition. 3002-3006 - Ulpu Remes:
Bounded conditional mean imputation with an approximate posterior. 3007-3011 - Xiaodong Cui, Vaibhava Goel, Brian Kingsbury:
Mixtures of Bayesian joint factor analyzers for noise robust automatic speech recognition. 3012-3016 - Gang Liu, Dimitrios Dimitriadis, Enrico Bocchieri:
Robust speech enhancement techniques for ASR in non-stationary noise and dynamic environments. 3017-3021
Linguistic Systems, Phonetics-Phonology Interface
- Ian Maddieson, Sébastien Flavier, Egidio Marsico, Christophe Coupé, François Pellegrino:
LAPSyd: Lyon-Albuquerque phonological systems database. 3022-3026 - Plínio A. Barbosa:
The duration compensation issue revisited. 3027-3031 - Yoon Mi Oh, François Pellegrino, Christophe Coupé, Egidio Marsico:
Cross-language comparison of functional load for vowels, consonants, and tones. 3032-3036 - Kikuo Maekawa:
Notes on so-called inter-speaker difference in spontaneous speech: the case of Japanese voiced obstruent. 3037-3041 - Christopher Carignan, Ryan Shosted, Maojing Fu, Zhi-Pei Liang, Bradley P. Sutton:
The role of the pharynx and tongue in enhancement of vowel nasalization: a real-time MRI investigation of French nasal vowels. 3042-3046 - Margaret E. L. Renwick, Ladan Baghai-Ravary, Rosalind Temple, John S. Coleman:
Assimilation of word-final nasals to following word-initial place of articulation in UK English. 3047-3051
Speech Synthesis - Voice Conversion
- Ling-Hui Chen, Zhen-Hua Ling, Yan Song, Li-Rong Dai:
Joint spectral distribution modeling using restricted Boltzmann machines for voice conversion. 3052-3056 - Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Engsiong Chng, Haizhou Li:
Exemplar-based unit selection for voice conversion utilizing temporal information. 3057-3061 - Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang, Sin-Horng Chen:
Alleviating the over-smoothing problem in GMM-based voice conversion with discriminative training. 3062-3066 - Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
A hybrid approach to electrolaryngeal speech enhancement based on spectral subtraction and statistical voice conversion. 3067-3071 - Takuto Moriguchi, Tomoki Toda, Motoaki Sano, Hiroshi Sato, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
A digital signal processor implementation of silent/electrolaryngeal speech enhancement based on real-time statistical voice conversion. 3072-3076 - Sandesh Aryal, Daniel Felps, Ricardo Gutierrez-Osuna:
Foreign accent conversion through voice morphing. 3077-3081
Large Vocabulary Continuous Speech Recognition Systems
- Kartik Audhkhasi, Andreas M. Zavou, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Empirical link between hypothesis diversity and fusion performance in an ensemble of automatic speech recognition systems. 3082-3086 - Peter Bell, Hitoshi Yamamoto, Pawel Swietojanski, Youzheng Wu, Fergus McInnes, Chiori Hori, Steve Renals:
A lecture transcription system combining neural network acoustic and language models. 3087-3091 - Hagen Soltau, Hong-Kwang Kuo, Lidia Mangu, George Saon, Tomás Beran:
Neural network acoustic models for the DARPA RATS program. 3092-3096 - Nicola Ueffing, Maximilian Bisani, Paul Vozila:
Improved models for automatic punctuation prediction for spoken and written text. 3097-3101 - Anindya Roy, Lori Lamel, Thiago Fraga-Silva, Jean-Luc Gauvain, Ilya Oparin:
Some issues affecting the transcription of Hungarian broadcast audio. 3102-3106 - Pavel Golik, Zoltán Tüske, Ralf Schlüter, Hermann Ney:
Development of the RWTH transcription system for Slovenian. 3107-3111
Robust Speaker Recognition I, II
- Naoyuki Kanda, Ryu Takeda, Yasunari Obuchi:
Noise robust speaker verification with delta cepstrum normalization. 3112-3116 - David Vandyke, Michael Wagner, Roland Goecke:
R-norm: improving inter-speaker variability modelling at the score level via regression score normalisation. 3117-3121 - Tomi Kinnunen, Md. Jahangir Alam, Pavel Matejka, Patrick Kenny, Jan Cernocký, Douglas D. O'Shaughnessy:
Frequency warping and robust speaker verification: a comparison of alternative mel-scale representations. 3122-3126 - Taufiq Hasan, John H. L. Hansen:
Acoustic factor analysis based universal background model for robust speaker verification in noise. 3127-3131 - Jesús Antonio Villalba López, Eduardo Lleida, Alfonso Ortega, Antonio Miguel:
A new Bayesian network to assess the reliability of speaker verification decisions. 3132-3136 - Weizhong Zhu, Sibel Yaman, Jason W. Pelecanos:
The IBM RATS phase II speaker recognition system: overview and analysis. 3137-3141
Acoustic and Articulatory Cues in Speech Perception
- Jason A. Shaw, Michael D. Tyler, Benjawan Kasisopa, Yuan Ma, Michael I. Proctor, Chong Han, Donald Derrick, Denis K. Burnham:
Vowel identity conditions the time course of tone recognition. 3142-3146 - Odette Scharenborg, Esther Janse:
Changes in the role of intensity as a cue for fricative categorisation. 3147-3151 - Keiichi Yasu, Takayuki Arai, Kei Kobayashi, Mitsuko Shindo:
Weighting of acoustic cues shifts to frication duration in identification of fricatives/affricates when auditory properties are degraded due to aging. 3152-3156 - Jiayin Gao, Pierre A. Hallé:
Duration as a secondary cue for perception of voicing and tone in Shanghai Chinese. 3157-3161 - Marie Dekerle, Fanny Meunier, Marie-Ange N'Guyen, Estelle Gillet-Perret, Delphine Lassus-Sangosse, Sophie Donnadieu:
Development of central auditory processes and their links with language skills in typically developing children. 3162-3166 - Léo Varnet, Kenneth Knoblauch, Fanny Meunier, Michel Hoen:
Show me what you listen to! Auditory classification images can reveal the processing of fine acoustic cues during speech categorization. 3167-3171
Speech Production - Data and Models
- Fabian Brackhane, Jürgen Trouvain:
The organ stop "vox humana" as a model for a vowel synthesiser. 3172-3176 - Prasanta Kumar Ghosh, Shrikanth S. Narayanan:
Information theoretic acoustic feature selection for acoustic-to-articulatory inversion. 3177-3181 - Dita Fejlová, David Lukes, Radek Skarnitzl:
Formant contours in Czech vowels: speaker-discriminating potential. 3182-3186 - Shen Liu, Jianguo Wei, Xin Wang, Wenhuan Lu, Qiang Fang, Jianwu Dang:
An anisotropic diffusion filter based on multidirectional separability. 3187-3190 - Radek Skarnitzl, Pavel Sturm, Pavel Machac:
The phonological voicing contrast in Czech: an EPG study of phonated and whispered fricatives. 3191-3195 - Shinji Maeda, Yves Laprie:
Vowel and prosodic factor dependent variations of vocal-tract length. 3196-3200 - Tijl Grootswagers, Karen Dijkstra, Louis ten Bosch, Alex Brandmeyer, Makiko Sadakata:
Word identification using phonetic features: towards a method to support multivariate fMRI speech decoding. 3201-3205 - Dhananjaya N. Gowda, Mikko Kurimo:
Analysis of breathy, modal and pressed phonation based on low frequency spectral density. 3206-3210 - Keiichi Tajima, Kuniyoshi Tanaka, Andrew Martin, Reiko Mazuka:
Is the vowel length contrast in Japanese exaggerated in infant-directed speech? 3211-3215 - Gang Chen, Robin A. Samlan, Jody Kreiman, Abeer Alwan:
Investigating the relationship between glottal area waveform shape and harmonic magnitudes through computational modeling and laryngeal high-speed videoendoscopy. 3216-3220 - Jonathan C. Kim, Hrishikesh Rao, Mark A. Clements:
Formant frequency tracking using Gaussian mixtures with maximum a posteriori adaptation. 3221-3225 - Rei Yasuda, Frank Zimmerer:
Devoicing of vowels in German, a comparison of Japanese and German speakers. 3226-3229 - Caitlin Smith, Adam C. Lammert:
Identifying consonantal tasks via measures of tongue shaping: a real-time MRI investigation of the production of vocalized syllabic /l/ in American English. 3230-3233
Speech Enhancement
- Feng Deng, Changchun Bao, Feng Bao:
A speech enhancement method by coupling speech detection and spectral amplitude estimation. 3234-3238 - Chenxi Zheng, Wai-Yip Chan:
Late reverberation suppression using MMSE modulation spectral estimation. 3239-3243 - M. A. Tugtekin Turan, Engin Erzin:
A new statistical excitation mapping for enhancement of throat microphone recordings. 3244-3248 - Nicoleta Roman, Michael I. Mandel:
Classification based binaural dereverberation. 3249-3253 - Seon Man Kim, Hong Kook Kim:
Target-to-non-target directional ratio estimation based on dual-microphone phase differences for target-directional speech enhancement. 3254-3258 - Xugang Lu, Shigeki Matsuda, Chiori Hori:
Speech spectrum restoration based on conditional restricted Boltzmann machine. 3259-3263 - Faheem Khan, Ben Milner:
Speaker separation using visual speech features and single-channel audio. 3264-3268 - Wei-Lun Chuang, Kah-Meng Cheong, Chung-Chien Hsu, Tai-Shih Chi:
Spectral modulation sensitivity based perceptual acoustic echo cancellation. 3269-3273 - Vinayak Abrol, Pulkit Sharma, Anil Kumar Sao:
Speech enhancement using compressed sensing. 3274-3278 - Emad M. Grais, Hakan Erdogan:
Spectro-temporal post-enhancement using MMSE estimation in NMF based single-channel source separation. 3279-3283 - Kantapon Kaewtip, Lee Ngee Tan, Abeer Alwan:
A pitch-based spectral enhancement technique for robust speech processing. 3284-3288 - Matthew C. McCallum, Bernard J. Guillemin:
Stochastic-deterministic signal modelling for the tracking of pitch in noise and speech mixtures using factorial HMMs. 3289-3293 - Shay Maymon, Etienne Marcheret, Vaibhava Goel:
Restoration of clipped signals with application to speech recognition. 3294-3297 - Yasufumi Uezu, Keisuke Kinoshita, Mehrez Souden, Tomohiro Nakatani:
On the robustness of distributed EM based BSS in asynchronous distributed microphone array scenarios. 3298-3302
ASR - Acoustic Modeling
- Jingzhou Yang, Rogier C. van Dalen, Mark J. F. Gales:
Infinite support vector machines in speech recognition. 3303-3307 - Diego Giuliani, Fabio Brugnara:
An on-line incremental speaker adaptation technique for audio stream transcription. 3308-3312 - Dominic Telaar, Mark C. Fuhs:
Accent- and speaker-specific polyphone decision trees for non-native speech recognition. 3313-3316 - Simon Wiesler, Jinyu Li, Jian Xue:
Investigations on Hessian-free optimization for cross-entropy training of deep neural networks. 3317-3321 - Masahiro Saiko, Shigeki Matsuda, Ken Hanazawa, Ryosuke Isotani, Chiori Hori:
Cross-lingual acoustic model adaptation based on transfer vector field smoothing with MAP. 3322-3326 - Hiroshi Fujimura, Yusuke Shinohara, Takashi Masuko:
N-best rescoring by phoneme classifiers using subclass adaboost algorithm. 3327-3331 - Tetsuji Ogawa, Feipeng Li, Hynek Hermansky:
Stream selection and integration in multistream ASR using GMM-based performance monitoring. 3332-3336 - Néstor Becerra Yoma, Claudio Garretón, Fernando Huenupán, Ignacio Catalan, Jorge Wuth:
VTLN based on the linear interpolation of contiguous mel filter-bank energies. 3337-3341 - Fabian Triefenbach, Azarakhsh Jalalvand, Kris Demuynck, Jean-Pierre Martens:
Context-dependent modeling and speaker normalization applied to reservoir-based phone recognition. 3342-3346 - Thiago Fraga-Silva, Jean-Luc Gauvain, Lori Lamel:
Interpolation of acoustic models for speech recognition. 3347-3351 - Muhammad Ali Tahir, Heyun Huang, Ralf Schlüter, Hermann Ney, Louis ten Bosch, Bert Cranen, Lou Boves:
Training log-linear acoustic models in higher-order polynomial feature space for speech recognition. 3352-3355 - Venkata Neelima Parinam, Chandra Sekhar Vootkuri, Stephen A. Zahorian:
Comparison of spectral analysis methods for automatic speech recognition. 3356-3360 - D. Rama Sanand, Torbjørn Svendsen:
Synthetic speaker models using VTLN to improve the performance of children in mismatched speaker conditions for ASR. 3361-3365 - Ossama Abdel-Hamid, Li Deng, Dong Yu:
Exploring convolutional neural network structures and optimization techniques for speech recognition. 3366-3370
Special Event: ESCA/ISCA Anniversary
- Joseph Mariani, Patrick Paroubek, Gil Francopoulo, Marine Delaborde:
Rediscovering 25 years of discoveries in spoken language processing: a preliminary ISCA archive analysis. 3371-3403
Language Modeling for Conversational Speech
- M. Ali Basha Shaik, Amr El-Desoky Mousa, Ralf Schlüter, Hermann Ney:
Feature-rich sub-lexical language models using a maximum entropy approach for German LVCSR. 3404-3408 - Amr El-Desoky Mousa, M. Ali Basha Shaik, Ralf Schlüter, Hermann Ney:
Morpheme level hierarchical Pitman-Yor class-based language models for LVCSR of morphologically rich languages. 3409-3413 - Benjamin Lambert, Bhiksha Raj, Rita Singh:
Discriminatively trained dependency language modeling for conversational speech recognition. 3414-3418 - Yujing Si, Qingqing Zhang, Ta Li, Jielin Pan, Yonghong Yan:
Prefix tree based n-best list re-scoring for recurrent neural network language model used in speech recognition system. 3419-3423 - Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Cross-domain paraphrasing for improving language modelling using out-of-domain data. 3424-3428 - Ryo Masumura, Hirokazu Masataki, Takanobu Oba, Osamu Yoshioka, Satoshi Takahashi:
Viterbi decoding for latent words language models using Gibbs sampling. 3429-3433
Speech Enhancement and Coding
- Tom Bäckström:
Computationally efficient objective function for algebraic codebook optimization in ACELP. 3434-3438 - Sebastian Möller, Emilia Kelaidi, Friedemann Köster, Nicolas Côté, Patrick Bauer, Tim Fingscheidt, Thomas Schlien, Hannu Pulakka, Paavo Alku:
Speech quality prediction for artificial bandwidth extension algorithms. 3439-3443 - Bingyin Xia, Changchun Bao:
Speech enhancement with weighted denoising auto-encoder. 3444-3448 - Milos Cernak, Xingyu Na, Philip N. Garner:
Syllable-based pitch encoding for low bit rate speech coding with recognition/synthesis architecture. 3449-3452 - Nguyen Duc Duy, Masayuki Suzuki, Nobuaki Minematsu, Keikichi Hirose:
Artificial bandwidth extension based on regularized piecewise linear mapping with discriminative region weighting and long-span features. 3453-3457 - Bong-Ki Lee, Chungsoo Lim, Jihwan Park, Joon-Hyuk Chang:
Enhanced muting method in packet loss concealment of ITU-T G.722 employing optimized sigmoid function. 3458-3462
Spoken Machine Translation and Speech Natural Language Processing I, II
- Benoît Favre, Kyla Cheung, Siavash Kazemian, Adam Lee, Yang Liu, Cosmin Munteanu, Ani Nenkova, Dennis Ochei, Gerald Penn, Stephen Tratz, Clare R. Voss, Frauke Zeller:
Automatic human utility evaluation of ASR systems: does WER really predict performance? 3463-3467 - Vivek Kumar Rangarajan Sridhar, John Chen, Srinivas Bangalore:
Corpus analysis of simultaneous interpretation data for improving real time speech translation. 3468-3472 - Eunah Cho, Christian Fügen, Teresa Herrmann, Kevin Kilgour, Mohammed Mediani, Christian Mohr, Jan Niehues, Kay Rottmann, Christian Saam, Sebastian Stüker, Alex Waibel:
A real-world system for simultaneous translation of German lectures. 3473-3477 - Dekai Wu, Karteek Addanki, Markus Saers:
Freestyle: a challenge-response system for hip hop lyrics via unsupervised induction of stochastic transduction grammars. 3478-3482 - Andreas Tsiartas, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Toward transfer of acoustic cues of emphasis across languages. 3483-3486 - Tomoki Fujita, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Simple, lexicalized choice of translation timing for simultaneous speech translation. 3487-3491
ASR - Robustness Against Noise I-III
- Liang Lu, Arnab Ghoshal, Steve Renals:
Noise adaptive training for subspace Gaussian mixture models. 3492-3496 - George Saon, Samuel Thomas, Hagen Soltau, Sriram Ganapathy, Brian Kingsbury:
The IBM speech activity detection system for the DARPA RATS program. 3497-3501 - Armin Sehr, Takuya Yoshioka, Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Roland Maas, Walter Kellermann:
Conditional emission densities for combining speech enhancement and recognition systems. 3502-3506 - Martin Wolf, Climent Nadeu:
Channel selection using n-best hypothesis for multi-microphone ASR. 3507-3511 - Takaaki Ishii, Hiroki Komiyama, Takahiro Shinozaki, Yasuo Horiuchi, Shingo Kuroiwa:
Reverberant speech recognition based on denoising autoencoder. 3512-3516 - Shay Maymon, Pierre L. Dognin, Xiaodong Cui, Vaibhava Goel:
Adaptive stereo-based stochastic mapping. 3517-3521
Articulatory and Acoustic Cues of Speech Prosody
- Thi Lan Nguyen, Alexis Michaud, Do Dat Tran, Dang-Khoa Mac:
The interplay of intonation and complex lexical tones: how speaker attitudes affect the realization of glottalization on Vietnamese sentence-final particles. 3522-3526 - Ailbhe Ní Chasaide, Irena Yanushevskaya, John Kane, Christer Gobl:
The voice prominence hypothesis: the interplay of F0 and voice source features in accentuation. 3527-3531 - Albert Lee, Yi Xu, Santitham Prom-on:
Mora-based pre-low raising in Japanese pitch accent. 3532-3536 - Hélène Loevenbruck, Mohamed Ameur Ben Jannet, Mariapaola D'Imperio, Mathilde Spini, Maud Champagne-Lavau:
Prosodic cues of sarcastic speech in French: slower, higher, wider. 3537-3541 - Lucie Ménard, Annie Leclerc, Mark K. Tiede, Amélie Prémont, Christine Turgeon, Paméla Trudeau-Fisette, Dominique Côté:
Correlates of contrastive focus in congenitally blind adults and sighted adults. 3542-3546 - Laurianne Georgeton, Nicolas Audibert:
Is protrusion of French rounded vowels affected by prosodic positions? 3547-3551
Intelligibility-Enhancing Speech Modifications (Special Session)
- Martin Cooke, Catherine Mayo, Cassia Valentini-Botinhao:
Intelligibility-enhancing speech modifications: the Hurricane Challenge. 3552-3556 - Daniel Erro, Tudor-Catalin Zorila, Yannis Stylianou, Eva Navas, Inma Hernáez:
Statistical synthesizer with embedded prosodic and spectral modifications to generate highly intelligible speech in noise. 3557-3561 - Antti Suni, Reima Karhila, Tuomo Raitio, Mikko Kurimo, Martti Vainio, Paavo Alku:
Lombard modified text-to-speech synthesis for improved intelligibility: submission for the Hurricane Challenge 2013. 3562-3566 - Cassia Valentini-Botinhao, Junichi Yamagishi, Simon King, Yannis Stylianou:
Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise. 3567-3571 - Elizabeth Godoy, Yannis Stylianou:
Increasing speech intelligibility via spectral shaping with frequency warping and dynamic range compression plus transient enhancement. 3572-3576 - Henning F. Schepker, Jan Rennies, Simon Doclo:
Improving speech intelligibility in noise by SII-dependent preprocessing using frequency-dependent amplification and dynamic range compression. 3577-3581 - Cees H. Taal, Jesper Jensen:
SII-based speech preprocessing for intelligibility improvement in noise. 3582-3586 - Mengqiu Zhang, Petko Nikolov Petkov, W. Bastiaan Kleijn:
Rephrasing-based speech intelligibility enhancement. 3587-3591 - Vincent Aubanel, Martin Cooke:
Information-preserving temporal reallocation of speech in the presence of fluctuating maskers. 3592-3596 - Petko Nikolov Petkov, W. Bastiaan Kleijn:
Preservation of speech spectral dynamics enhances intelligibility. 3597-3601 - Henk Brouckxon, Werner Verhelst:
An overview of the VUB entry for the 2013 Hurricane Challenge. 3602-3604 - Reiko Takou, Nobumasa Seiyama, Atsushi Imai:
Improvement of speech intelligibility by reallocation of spectral energy. 3605-3607
Speech Technology for Speech and Hearing Disorders I, II
- Bart Vaerenberg, Louis ten Bosch, Wojtek Kowalczyk, Martine Coene, Herwig De Smet, Paul J. Govaerts:
Language-universal speech audiometry with automated scoring. 3608-3612 - Annemiek Hammer, Bart Vaerenberg, Wojtek Kowalczyk, Louis ten Bosch, Martine Coene, Paul J. Govaerts:
Balancing word lists in speech audiometry through large spoken language corpora. 3613-3616 - Verónica López-Ludeña, Rubén San Segundo, Javier Ferreiros, José M. Pardo, E. Ferreiro:
Developing an information system for deaf. 3617-3621 - Myung Jong Kim, Joohong Yoo, Hoirin Kim:
Dysarthric speech recognition using dysarthria-severity-dependent and speaker-adaptive models. 3622-3626 - Ghulam Muhammad, Moutasem Melhem:
Voice pathology detection and classification using MPEG-7 audio low-level features. 3627-3631 - Abdellah Kacha, Francis Grenez, Jean Schoentgen:
Empirical mode decomposition-based spectral acoustic cues for disordered voices analysis. 3632-3636 - Ryo Aihara, Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki:
Exemplar-based individuality-preserving voice conversion for articulation disorders in noisy environments. 3637-3641 - Heidi Christensen, Magda B. Aniol, Peter Bell, Phil D. Green, Thomas Hain, Simon King, Pawel Swietojanski:
Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech. 3642-3645 - Guangting Mai, James W. Minett, William S.-Y. Wang:
Effects of envelope filter cutoff frequency on the intelligibility of Mandarin noise-vocoded speech in babble noise: implications for cochlear implants. 3646-3650
Robust Speaker Recognition I, II
- Kong-Aik Lee, Anthony Larcher, Chang Huai You, Bin Ma, Haizhou Li:
Multi-session PLDA scoring of i-vector for partially open-set speaker detection. 3651-3655 - Keith W. Godin, Seyed Omid Sadjadi, John H. L. Hansen:
Impact of noise reduction and spectrum estimation on noise robust speaker identification. 3656-3660 - Takanori Yamada, Longbiao Wang, Atsuhiko Kai:
Improvement of distant-talking speaker identification using bottleneck features of DNN. 3661-3664 - Alessio Brutti, Maurizio Omologo:
Geometric contamination for GMM/UBM speaker verification in reverberant environments. 3665-3669 - Richard D. McClanahan, Phillip L. De Leon:
Towards a more efficient SVM supervector speaker verification system using Gaussian reduction and a tree-structured hash. 3670-3673 - Ahilan Kanagasundaram, David Dean, Javier Gonzalez-Dominguez, Sridha Sridharan, Daniel Ramos, Joaquin Gonzalez-Rodriguez:
Improving the PLDA based speaker verification in limited microphone data conditions. 3674-3678 - Jesús Antonio Villalba López, Eduardo Lleida, Alfonso Ortega, Antonio Miguel:
The I3a speaker recognition system for NIST SRE12: post-evaluation analysis. 3679-3683 - Themos Stafylakis, Patrick Kenny, Pierre Ouellet, Javier Perez, Marcel Kockmann, Pierre Dumouchel:
Text-dependent speaker recognition using PLDA with uncertainty propagation. 3684-3688 - Sri Harish Reddy Mallidi, Sriram Ganapathy, Hynek Hermansky:
Robust speaker recognition using spectro-temporal autoregressive models. 3689-3693 - Padmanabhan Rajan, Tomi Kinnunen, Ville Hautamäki:
Effect of multicondition training on i-vector PLDA configurations for speaker recognition. 3694-3697 - Mitchell McLaren, Victor Abrash, Martin Graciarena, Yun Lei, Jan Pesán:
Improving robustness to compressed speech in speaker recognition. 3698-3702 - Vikramjit Mitra, Mitchell McLaren, Horacio Franco, Martin Graciarena, Nicolas Scheffer:
Modulation features for noise robust speaker identification. 3703-3707 - Ville Hautamäki, You-Chi Cheng, Padmanabhan Rajan, Chin-Hui Lee:
Minimax i-vector extractor for short duration speaker verification. 3708-3712 - Mike Fowler, Mark McCurry, Jonathan Bramsen, Kehinde Dunsin, Jeremiah Remus:
Standoff speaker recognition: effects of recording distance mismatch on speaker recognition system performance. 3713-3716
Dialog Systems and Applications I, II
- Sofia Strömbergsson, Christina Tånnander:
Correlates to intelligibility in deviant child speech - comparing clinical evaluations to audience response system-based evaluations by untrained listeners. 3717-3721 - Kathryn Womack, Cecilia Ovesdotter Alm, Cara Calvelli, Jeff B. Pelz, Pengcheng Shi, Anne R. Haake:
Using linguistic analysis to characterize conceptual units of thought in spoken medical narratives. 3722-3726 - Francesco Cutugno, Alberto Finzi, Michelangelo Fiore, Enrico Leone, Silvia Rossi:
Interacting with robots via speech and gestures, an integrated architecture. 3727-3731 - Mohamed Hatmi, Christine Jacquin, Emmanuel Morin, Sylvain Meignier:
Incorporating named entity recognition into the speech transcription process. 3732-3736 - Teppei Ohno, Tomoyosi Akiba:
DTW-distance-ordered spoken term detection. 3737-3741 - Sangkeun Jung, Seung-Hoon Na:
Refining sentence similarity with discourse information in dialog system. 3742-3746 - Ryohei Nakatani, Tetsuya Takiguchi, Yasuo Ariki:
Two-step correction of speech recognition errors based on n-gram and long contextual information. 3747-3750 - Sumit Negi, Ramnath Balasubramanyan, Santanu Chaudhury:
Inferring actor communities from videos. 3751-3755 - Xavier Bost, Marc El-Bèze, Renato De Mori:
Multiple topic identification in telephone conversations. 3756-3760 - Wei Chen, Sankaranarayanan Ananthakrishnan, Rohit Prasad, Prem Natarajan:
Variable-span out-of-vocabulary named entity detection. 3761-3765 - Andrew L. Kun, Oskar Palinko, Zeljko Medenica, Peter A. Heeman:
On the feasibility of using pupil diameter to estimate cognitive load changes for in-vehicle spoken dialogues. 3766-3770 - Grégoire Mesnil, Xiaodong He, Li Deng, Yoshua Bengio:
Investigation of recurrent-neural-network architectures and learning methods for spoken language understanding. 3771-3775 - Xiaohu Liu, Ruhi Sarikaya, Chris Brockett, Chris Quirk, William B. Dolan:
Paraphrase features to improve natural language understanding. 3776-3779 - Dilek Hakkani-Tür, Asli Celikyilmaz, Larry P. Heck, Gökhan Tür:
A weakly-supervised approach for discovering new user intents from search query logs. 3780-3784 - Puyang Xu, Ruhi Sarikaya:
Exploiting shared information for multi-intent natural language sentence classification. 3785-3789
Special Event: ESCA/ISCA Anniversary
- Hiroya Fujisaki:
An inter- and cross-disciplinary perspective of spoken language processing. - Roger K. Moore:
Progress and prospects for speech technology: what ordinary people think.