INTERSPEECH 2010: Makuhari, Japan
- Takao Kobayashi, Keikichi Hirose, Satoshi Nakamura:
11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010, Makuhari, Chiba, Japan, September 26-30, 2010. ISCA 2010
Keynotes
- Steve J. Young:
Still talking to machines (cognitively speaking). 1-10
- Tohru Ifukube:
Sound-based assistive technology supporting "seeing", "hearing" and "speaking" for the disabled and the elderly. 11-19
- Chiu-yu Tseng:
Beyond sentence prosody. 20-29
Special Session: Models of Speech - In Search of Better Representations
- Hosung Nam, Vikramjit Mitra, Mark Tiede, Elliot Saltzman, Louis Goldstein, Carol Y. Espy-Wilson, Mark Hasegawa-Johnson:
A procedure for estimating gestural scores from natural speech. 30-33
- Yen-Liang Shue, Gang Chen, Abeer Alwan:
On the interdependencies between voice quality, glottal gaps, and voice-source related acoustic measures. 34-37
- Hideki Kawahara, Masanori Morise, Toru Takahashi, Hideki Banno, Ryuichi Nisimura, Toshio Irino:
Simplification and extension of non-periodic excitation source representations for high-quality speech manipulation systems. 38-41
- Sadao Hiroya, Takemi Mochida:
Phase equalization-based autoregressive model of speech signals. 42-45
- Yi Xu, Santitham Prom-on:
Articulatory-functional modeling of speech prosody: a review. 46-49
- Humberto M. Torres, Hansjörg Mixdorff, Jorge A. Gurlekian, Hartmut R. Pfitzinger:
Two new estimation methods for a superpositional intonation model. 50-53
ASR: Acoustic Models I-III
- Simon Wiesler, Georg Heigold, Markus Nußbaum-Thom, Ralf Schlüter, Hermann Ney:
A discriminative splitting criterion for phonetic decision trees. 54-57
- Mark J. F. Gales, Kai Yu:
Canonical state models for automatic speech recognition. 58-61
- Pierre L. Dognin, John R. Hershey, Vaibhava Goel, Peder A. Olsen:
Restructuring exponential family mixture models. 62-65
- Françoise Beaufays, Vincent Vanhoucke, Brian Strope:
Unsupervised discovery and training of maximally dissimilar cluster models. 66-69
- Khe Chai Sim:
Probabilistic state clustering using conditional random field for context-dependent acoustic modelling. 70-73
- Xie Sun, Yunxin Zhao:
Integrate template matching and statistical modeling for speech recognition. 74-77
- George Saon, Hagen Soltau:
Boosting systems for LVCSR. 1341-1344
- Vaibhava Goel, Tara N. Sainath, Bhuvana Ramabhadran, Peder A. Olsen, David Nahamoo, Dimitri Kanevsky:
Incorporating sparse representation phone identification features in automatic speech recognition using exponential families. 1345-1348
- Xin Chen, Yunxin Zhao:
Integrating MLP features and discriminative training in data sampling based ensemble acoustic modeling. 1349-1352
- Jui-Ting Huang, Mark Hasegawa-Johnson:
Semi-supervised training of Gaussian mixture models by conditional entropy minimization. 1353-1356
- Guangchuan Shi, Yu Shi, Qiang Huo:
A study of irrelevant variability normalization based training and unsupervised online adaptation for LVCSR. 1357-1360
- Roger Hsiao, Florian Metze, Tanja Schultz:
Improvements to generalized discriminative feature transformation for speech recognition. 1361-1364
- Karel Veselý, Lukás Burget, Frantisek Grézl:
Parallel training of neural networks for speech recognition. 2934-2937
- Rita Singh, Benjamin Lambert, Bhiksha Raj:
The use of sense in unsupervised training of acoustic models for ASR systems. 2938-2941
- Jun Du, Yu Hu, Hui Jiang:
Boosted mixture learning of Gaussian mixture HMMs for speech recognition. 2942-2945
- Volker Leutnant, Reinhold Haeb-Umbach:
On the exploitation of hidden Markov models and linear dynamic models in a hybrid decoder architecture for continuous speech recognition. 2946-2949
- Alberto Abad, Thomas Pellegrini, Isabel Trancoso, João Paulo Neto:
Context dependent modelling approaches for hybrid speech recognizers. 2950-2953
- Yotaro Kubo, Shinji Watanabe, Atsushi Nakamura, Tetsunori Kobayashi:
A regularized discriminative training method of acoustic models derived by minimum relative entropy discrimination. 2954-2957
- Hank Liao, Christopher Alberti, Michiel Bacchiani, Olivier Siohan:
Decision tree state clustering with word and syllable features. 2958-2961
- Hiroshi Fujimura, Takashi Masuko, Mitsuyoshi Tachimori:
A duration modeling technique with incremental speech rate normalization. 2962-2965
- Martin Wöllmer, Yang Sun, Florian Eyben, Björn W. Schuller:
Long short-term memory networks for noise robust speech recognition. 2966-2969
- Tsuneo Nitta, Takayuki Onoda, Masashi Kimura, Yurie Iribe, Kouichi Katsurada:
One-model speech recognition and synthesis based on articulatory movement HMMs. 2970-2973
- Xiaodong Cui, Jian Xue, Pierre L. Dognin, Upendra V. Chaudhari, Bowen Zhou:
Acoustic modeling with bootstrap and restructuring for low-resourced languages. 2974-2977
- Tetsuo Kosaka, Keisuke Goto, Takashi Ito, Masaharu Katoh:
Lecture speech recognition by combining word graphs of various acoustic models. 2978-2981
- Khe Chai Sim, Shilin Liu:
Semi-parametric trajectory modelling using temporally varying feature mapping for speech recognition. 2982-2985
- Dong Yu, Li Deng:
Deep-structured hidden conditional random fields for phonetic recognition. 2986-2989
- Jonathan Malkin, Jeff A. Bilmes:
Semi-supervised learning for improved expression of uncertainty in discriminative classifiers. 2990-2993
- Peder A. Olsen, Vaibhava Goel, Charles A. Micchelli, John R. Hershey:
Modeling posterior probabilities using the linear exponential family. 2994-2997
Spoken Dialogue Systems I, II
- Fabrice Lefèvre, François Mairesse, Steve J. Young:
Cross-lingual spoken language understanding from unaligned data using discriminative classification models and machine translation. 78-81
- Rajesh Balchandran, Leonid Rachevsky, Bhuvana Ramabhadran, Miroslav Novak:
Techniques for topic detection based processing in spoken dialog systems. 82-85
- Senthilkumar Chandramohan, Matthieu Geist, Olivier Pietquin:
Optimizing spoken dialogue management with fitted value iteration. 86-89
- Filip Jurcícek, Blaise Thomson, Simon Keizer, François Mairesse, Milica Gasic, Kai Yu, Steve J. Young:
Natural belief-critic: a reinforcement algorithm for parameter estimation in statistical spoken dialogue systems. 90-93
- Alexander Schmitt, Michael Scholz, Wolfgang Minker, Jackson Liscombe, David Suendermann:
Is it possible to predict task completion in automated troubleshooters?. 94-97
- David Suendermann, Jackson Liscombe, Roberto Pieraccini:
Minimally invasive surgery for spoken dialog systems. 98-101
Spoken Dialogue Systems II
- Ramón López-Cózar, David Griol:
New technique to enhance the performance of spoken dialogue systems based on dialogue states-dependent language models and grammatical rules. 2998-3001
- Lluís F. Hurtado, Joaquin Planells, Encarna Segarra, Emilio Sanchis, David Griol:
A stochastic finite-state transducer approach to spoken dialog management. 3002-3005
- Romain Laroche, Philippe Bretier, Ghislain Putois:
Enhanced monitoring tools and online dialogue optimisation merged into a new spoken dialogue system design experience. 3006-3009
- Romain Laroche, Ghislain Putois, Philippe Bretier:
Optimising a handcrafted dialogue system design. 3010-3013
- Felix Putze, Tanja Schultz:
Utterance selection for speech acts in a cognitive tourguide scenario. 3014-3017
- Gabriel Parent, Maxine Eskénazi:
Lexical entrainment of real users in the let's go spoken dialog system. 3018-3021
- Silvia Quarteroni, Meritxell González, Giuseppe Riccardi, Sebastian Varges:
Combining user intention and error modeling for statistical dialog simulators. 3022-3025
- Jaakko Hakulinen, Markku Turunen, Raúl Santos de la Cámara, Nigel T. Crook:
Parallel processing of interruptions and feedback in companions affective dialogue system. 3026-3029
- Antoine Raux, Neville Mehta, Deepak Ramachandran, Rakesh Gupta:
Dynamic language modeling using Bayesian networks for spoken dialog systems. 3030-3033
- Sunao Hara, Norihide Kitaoka, Kazuya Takeda:
Automatic detection of task-incompleted dialog for spoken dialog system based on dialog act n-gram. 3034-3037
- Wei-Bin Liang, Chung-Hsien Wu, Yu-Cheng Hsiao:
Dialogue act detection in error-prone spoken dialogue systems using partial sentence tree and latent dialogue act matrix. 3038-3041
- Tatsuya Kawahara, Kouhei Sumi, Zhi-Qiang Chang, Katsuya Takanashi:
Detection of hot spots in poster conversations based on reactive tokens of audience. 3042-3045
- Yoichi Matsuyama, Shinya Fujie, Hikaru Taniyama, Tetsunori Kobayashi:
Psychological evaluation of a group communication activation robot in a party game. 3046-3049
- Kyoko Matsuyama, Kazunori Komatani, Ryu Takeda, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno:
Analyzing user utterances in barge-in-able spoken dialogue system for improving identification accuracy. 3050-3053
- Mattias Heldner, Jens Edlund, Julia Hirschberg:
Pitch similarity in the vicinity of backchannels. 3054-3057
- Khiet P. Truong, Ronald Poppe, Dirk Heylen:
A rule-based backchannel prediction model using pitch and pause information. 3058-3061
Speech Perception: Factors Influencing Perception
- Paul Boersma, Katerina Chládková:
Detecting categorical perception in continuous discrimination data. 102-105
- Titia Benders, Paola Escudero:
The interrelation between the stimulus range and the number of response categories in vowel categorization. 106-109
- Marie Nilsenová, Martijn Goudbeek, Luuk Kempen:
The relation between pitch perception preference and emotion identification. 110-113
- Takashi Otake, James M. McQueen, Anne Cutler:
Competition in the perception of spoken Japanese words. 114-117
- Makiko Sadakata, Lotte van der Zanden, Kaoru Sekiyama:
Influence of musical training on perception of L2 speech. 118-121
- Donald Derrick, Bryan Gick:
Full body aero-tactile integration in speech perception. 122-125
Prosody: Models
- Tomás Dubeda, Katalin Mády:
Nucleus position within the intonation phrase: a typological study of English, Czech and Hungarian. 126-129
- Yong-cheol Lee, Satoshi Nambu:
Focus-sensitive operator or focus inducer: always and only. 130-133
- Jiahong Yuan, Mark Liberman:
F0 declination in English and Mandarin broadcast news speech. 134-137
- Katrin Schweitzer, Michael Walsh, Bernd Möbius, Hinrich Schütze:
Frequency of occurrence effects on pitch accent realisation. 138-141
- César González Ferreras, Carlos Vivaracho-Pascual, David Escudero Mancebo, Valentín Cardeñoso-Payo:
On the automatic toBI accent type identification from data. 142-145
- Andrew Rosenberg:
AutoBI - a tool for automatic toBI annotation. 146-149
Speech Synthesis: Unit Selection and Others
- Volker Strom, Simon King:
A classifier-based target cost for unit selection speech synthesis trained on perceptual data. 150-153
- Wei Zhang, Xiaodong Cui:
Applying scalable phonetic context similarity in unit selection of concatenative text-to-speech. 154-157
- Mitsuaki Isogai, Hideyuki Mizuno:
Speech database reduction method for corpus-based TTS system. 158-161
- Heng Lu, Zhen-Hua Ling, Si Wei, Li-Rong Dai, Ren-Hua Wang:
Automatic error detection for unit selection speech synthesis using log likelihood ratio based SVM classifier. 162-165
- Hanna Silén, Elina Helander, Jani Nurminen, Konsta Koppinen, Moncef Gabbouj:
Using robust viterbi algorithm and HMM-modeling in unit selection TTS to replace units of poor quality. 166-169
- Yeon-Jun Kim, Marc C. Beutnagel:
Automatic detection of abnormal stress patterns in unit selection synthesis. 170-173
- Daniel Tihelka, Jirí Kala, Jindrich Matousek:
Enhancements of viterbi search for fast unit selection synthesis. 174-177
- Thomas Ewender, Beat Pfister:
Accurate pitch marking for prosodic modification of speech segments. 178-181
- Shifeng Pan, Meng Zhang, Jianhua Tao:
A novel hybrid approach for Mandarin speech synthesis. 182-185
- Josafá de Jesus Aguiar Pontes, Sadaoki Furui:
Modeling liaison in French by using decision trees. 186-189
- Jian Luan, Jian Li:
Improvement on plural unit selection and fusion. 190-193
- Alok Parlikar, Alan W. Black, Stephan Vogel:
Improving speech synthesis of machine translation output. 194-197
- Ghislain Putois, Jonathan Chevelu, Cédric Boidin:
Paraphrase generation to improve text-to-speech synthesis. 198-201
ASR: Search, Decoding and Confidence Measures I, II
- Chang Woo Han, Shin Jae Kang, Chul Min Lee, Nam Soo Kim:
Phone mismatch penalty matrices for two-stage keyword spotting via multi-pass phone recognizer. 202-205
- Petr Motlícek, Fabio Valente, Philip N. Garner:
English spoken term detection in multilingual recordings. 206-209
- Icksang Han, Chiyoun Park, Jeongmi Cho, Jeongsu Kim:
A hybrid approach to robust word lattice generation via acoustic-based word detection. 210-213
- Volker Steinbiss, Martin Sundermeyer, Hermann Ney:
Direct observation of pruning errors (DOPE): a search analysis tool. 214-217
- David Rybach, Michael Riley:
Direct construction of compact context-dependency transducers from data. 218-221
- Miroslav Novak:
Incremental composition of static decoding graphs with label pushing. 222-225
- Zhanlei Yang, Wenju Liu:
A novel path extension framework using steady segment detection for Mandarin speech recognition. 226-229
- Ralf Schlüter, Markus Nußbaum-Thom, Hermann Ney:
On the relation of Bayes risk, word error, and word posteriors in ASR. 230-233
- David Nolden, Hermann Ney, Ralf Schlüter:
Time conditioned search in automatic speech recognition reconsidered. 234-237
- Satoshi Kobashikawa, Taichi Asami, Yoshikazu Yamaguchi, Hirokazu Masataki, Satoshi Takahashi:
Efficient data selection for speech recognition based on prior confidence estimation using speech and context independent models. 238-241
- Atsunori Ogawa, Atsushi Nakamura:
A novel confidence measure based on marginalization of jointly estimated error cause probabilities. 242-245
- Julien Fayolle, Fabienne Moreau, Christian Raymond, Guillaume Gravier, Patrick Gros:
CRF-based combination of contextual features to improve a posteriori word-level confidence measures. 1942-1945
- Martin Wöllmer, Florian Eyben, Björn W. Schuller, Gerhard Rigoll:
Recognition of spontaneous conversational speech using long short-term memory phoneme predictions. 1946-1949
- Thomas Pellegrini, Isabel Trancoso:
Improving ASR error detection with non-decoder based features. 1950-1953
- Ladan Golipour, Douglas D. O'Shaughnessy:
Phoneme classification and lattice rescoring based on a k-NN approach. 1954-1957
- Jeff A. Bilmes, Hui Lin:
Online adaptive learning for speech recognition decoding. 1958-1961
- Takaaki Hori, Shinji Watanabe, Atsushi Nakamura:
Improvements of search error risk minimization in viterbi beam search for speech recognition. 1962-1965
Special-Purpose Speech Applications
- Robin Hofe, Stephen R. Ell, Michael J. Fagan, James M. Gilbert, Phil D. Green, Roger K. Moore, Sergey I. Rybchenko:
Evaluation of a silent speech interface based on magnetic sensing. 246-249
- Rubén San Segundo, Verónica López-Ludeña, Raquel Martín, Syaheerah L. Lutfi, Javier Ferreiros, Ricardo de Córdoba, José Manuel Pardo:
Advanced speech communication system for deaf people. 250-253
- Sethserey Sam, Eric Castelli, Laurent Besacier:
Unsupervised acoustic model adaptation for multi-origin non native ASR. 254-257
- Dilek Hakkani-Tür, Dimitra Vergyri, Gökhan Tür:
Speech-based automated cognitive status assessment. 258-261
- Toru Imai, Shinichi Homma, Akio Kobayashi, Takahiro Oku, Shoei Sato:
Speech recognition with a seamlessly updated language model for real-time closed-captioning. 262-265
- Takuya Nishimoto, Takayuki Watanabe:
The comparison between the deletion-based methods and the mixing-based methods for audio CAPTCHA systems. 266-269
- Martine Adda-Decker, Lori Lamel, Natalie D. Snoeren:
Comparing mono- & multilingual acoustic seed models for a low e-resourced language: a case-study of luxembourgish. 270-273
- R. J. J. H. van Son, Irene Jacobi, Frans J. M. Hilgers:
Manipulating treacheoesophageal speech. 274-277
- David Imseng, Hervé Bourlard, Mathew Magimai-Doss:
Towards mixed language speech recognition systems. 278-281
- Etienne Barnard, Johan Schalkwyk, Charl Johannes van Heerden, Pedro J. Moreno:
Voice search for development. 282-285
- Gina-Anne Levow, Susan Duncan, Edward T. King:
Cross-cultural investigation of prosody in verbal feedback in interactional rapport. 286-289
- Mary Tai Knox, Gerald Friedland:
Multimodal speaker diarization using oriented optical flow histograms. 290-293
- Catherine Middag, Yvan Saeys, Jean-Pierre Martens:
Towards an ASR-free objective analysis of pathological speech. 294-297
Speech Analysis
- Keith W. Godin, John H. L. Hansen:
Session variability contrasts in the MARP corpus. 298-301
- Kazuhiro Kondo, Yusuke Takano:
Estimation of two-to-one forced selection intelligibility scores by speech recognizers using noise-adapted models. 302-305
- Thomas Schaaf, Florian Metze:
Analysis of gender normalization using MLP and VTLN features. 306-309
- Guillaume Aimetti, Roger K. Moore, Louis ten Bosch:
Discovering an optimal set of minimally contrasting acoustic speech units: a point of focus for whole-word pattern matching. 310-313
- Themos Stafylakis, Xavier Anguera:
Improvements to the equal-parameter BIC for speaker diarization. 314-317
- Nima Mesgarani, Samuel Thomas, Hynek Hermansky:
A multistream multiresolution framework for phoneme recognition. 318-321
- Giampiero Salvi, Fabio Tesser, Enrico Zovato, Piero Cosi:
Cluster analysis of differential spectral envelopes on emotional speech. 322-325
- Samuel R. Bowman, Karen Livescu:
Modeling pronunciation variation with context-dependent articulatory feature decision trees. 326-329
- Bhiksha Raj, Kevin W. Wilson, Alexander Krueger, Reinhold Haeb-Umbach:
Ungrounded independent non-negative factor analysis. 330-333
- John R. Hershey, Peder A. Olsen, Steven J. Rennie:
Signal interaction and the devil function. 334-337
Systems for LVCSR
- Yuya Akita, Masato Mimura, Graham Neubig, Tatsuya Kawahara:
Semi-automated update of automatic transcription system for the Japanese national congress. 338-341
- Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Language model cross adaptation for LVCSR system combination. 342-345
- Shinji Watanabe, Takaaki Hori, Atsushi Nakamura:
Large vocabulary continuous speech recognition using WFST-based linear classifier for structured data. 346-349
- Pavel Kveton, Miroslav Novak:
Accelerating hierarchical acoustic likelihood computation on graphics processors. 350-353
- Jiulong Shan, Genqing Wu, Zhihong Hu, Xiliu Tang, Martin Jansche, Pedro J. Moreno:
Search by voice in Mandarin Chinese. 354-357
- Thomas Hain, Lukás Burget, John Dines, Philip N. Garner, Asmaa El Hannani, Marijn Huijbregts, Martin Karafiát, Mike Lincoln, Vincent Wan:
The AMIDA 2009 meeting transcription system. 358-361
Speaker Characterization and Recognition I-IV
- William M. Campbell, Zahi N. Karam:
Simple and efficient speaker comparison using approximate KL divergence. 362-365
- Hanwu Sun, Bin Ma, Chien-Lin Huang, Trung Hieu Nguyen, Haizhou Li:
The IIR NIST SRE 2008 and 2010 summed channel speaker recognition systems. 366-369
- Chien-Lin Huang, Hanwu Sun, Bin Ma, Haizhou Li:
Speaker characterization using long-term and temporal information. 370-373
- Sergio Perez-Gomez, Daniel Ramos, Javier Gonzalez-Dominguez, Joaquin Gonzalez-Rodriguez:
Score-level compensation of extreme speech duration variability in speaker verification. 374-377
- Alberto Abad, Isabel Trancoso:
Speaker recognition experiments using connectionist transformation network features. 378-381
- Yun Lei, John H. L. Hansen:
Speaker recognition using supervised probabilistic principal component analysis. 382-385
- Benjamin Bigot, Julien Pinquier, Isabelle Ferrané, Régine André-Obrecht:
Looking for relevant features for speaker role recognition. 1057-1060
- Marcel Kockmann, Lukás Burget, Ondrej Glembek, Luciana Ferrer, Jan Cernocký:
Prosodic speaker verification using subspace multinomial models with intersession compensation. 1061-1064
- Eryu Wang, Kong-Aik Lee, Bin Ma, Haizhou Li, Wu Guo, Li-Rong Dai:
The estimation and kernel metric of spectral correlation for text-independent speaker verification. 1065-1068
- Rahim Saeidi, Pejman Mowlaee, Tomi Kinnunen, Zheng-Hua Tan, Mads Græsbøll Christensen, Søren Holdt Jensen, Pasi Fränti:
Improving monaural speaker identification by double-talk detection. 1069-1072
- B. Avinash, S. Guruprasad, B. Yegnanarayana:
Exploring subsegmental and suprasegmental features for a text-dependent speaker verification in distant speech signals. 1073-1076
- Qingsong Liu, Wei Huang, Dongxing Xu, Hongbin Cai, Beiqian Dai:
A fast implementation of factor analysis for speaker verification. 1077-1080
- Ce Zhang, Rong Zheng, Bo Xu:
An investigation into direct scoring methods without SVM training in speaker verification. 1437-1440
- Reda Jourani, Khalid Daoudi, Régine André-Obrecht, Driss Aboutajdine:
Large margin Gaussian mixture models for speaker identification. 1441-1444
- Rong Zheng, Bo Xu:
On the use of Gaussian component information in the generative likelihood ratio estimation for speaker verification. 1445-1448
- Man-Wai Mak, Wei Rao:
Acoustic vector resampling for GMMSVM-based speaker verification. 1449-1452
- Konstantin Biatov:
A fast speaker indexing using vector quantization and second order statistics with adaptive threshold computation. 1453-1456
- Gang Wang, Xiaojun Wu, Thomas Fang Zheng:
Using phoneme recognition and text-dependent speaker verification to improve speaker segmentation for Chinese speech. 1457-1460
- Claudio Garretón, Néstor Becerra Yoma:
On enhancing feature sequence filtering with filter-bank energy transformation in speaker verification with telephone speech. 1461-1464
- Donglai Zhu, Bin Ma, Kong-Aik Lee, Cheung-Chi Leung, Haizhou Li:
MAP estimation of subspace transform for speaker recognition. 1465-1468
- Ayeh Jafari, Ramji Srinivasan, Danny Crookes, Ji Ming:
A longest matching segment approach for text-independent speaker recognition. 1469-1472
- Ville Hautamäki, Tomi Kinnunen, Mohaddeseh Nosratighods, Kong-Aik Lee, Bin Ma, Haizhou Li:
Approaching human listener accuracy with modern speaker verification. 1473-1476
- Jouni Pohjalainen, Rahim Saeidi, Tomi Kinnunen, Paavo Alku:
Extended weighted linear prediction (XLP) analysis of speech and its application to speaker verification in adverse conditions. 1477-1480
- Guoli Ye, Brian Mak:
The use of subvector quantization and discrete densities for fast GMM computation for speaker verification. 1481-1484
- Fred S. Richardson, Joseph P. Campbell:
Transcript-dependent speaker recognition using mixer 1 and 2. 2102-2105
- Thomas Drugman, Thierry Dutoit:
On the potential of glottal signatures for speaker recognition. 2106-2109
- R. Padmanabhan, Hema A. Murthy:
Acoustic feature diversity and speaker verification. 2110-2113
- Omid Dehzangi, Bin Ma, Engsiong Chng, Haizhou Li:
A discriminative performance metric for GMM-UBM speaker identification. 2114-2117
- Xavier Anguera, Jean-François Bonastre:
A novel speaker binary key derived from anchor models. 2118-2121
- Weiqiang Zhang, Yan Deng, Liang He, Jia Liu:
Variant time-frequency cepstral features for speaker recognition. 2122-2125
- Ning Wang, P. C. Ching, Tan Lee:
Exploitation of phase information for speaker recognition. 2126-2129
- Yanhua Long, Li-Rong Dai, Bin Ma, Wu Guo:
Effects of the phonological relevance in speaker verification. 2130-2133
- Gabriel Hernández Sierra, Jean-François Bonastre, Driss Matrouf, José R. Calvo:
Topological representation of speech for speaker recognition. 2134-2137
- Seyed Omid Sadjadi, John H. L. Hansen:
Assessment of single-channel speech enhancement techniques for speaker identification under mismatched conditions. 2138-2141
- Xiang Zhang, Chuan Cao, Lin Yang, Hongbin Suo, Jianping Zhang, Yonghong Yan:
Speaker recognition using the resynthesized speech via spectrum modeling. 2142-2145
Source Separation
- Robert Peharz, Michael Stark, Franz Pernkopf, Yannis Stylianou:
A factorial sparse coder model for single channel source separation. 386-389
- Yasmina Benabderrahmane, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Oriented PCA method for blind speech separation of convolutive mixtures. 390-393
- Hsin-Lung Hsieh, Jen-Tzung Chien:
Online Gaussian process for nonstationary speech separation. 394-397
- Meng Yu, Wenye Ma, Jack Xin, Stanley J. Osher:
Convexity and fast speech extraction by split bregman method. 398-401
- Wenye Ma, Meng Yu, Jack Xin, Stanley J. Osher:
Reducing musical noise in blind source separation by time-domain sparse filters and split bregman method. 402-405
- John Woodruff, Rohit Prabhavalkar, Eric Fosler-Lussier, DeLiang Wang:
Combining monaural and binaural evidence for reverberant speech segregation. 406-409
Speech Synthesis: HMM-Based Speech Synthesis I, II
- Heiga Zen:
Speaker and language adaptive training for HMM-based polyglot speech synthesis. 410-413
- Kai Yu, Heiga Zen, François Mairesse, Steve J. Young:
Context adaptive training with factorized decision trees for HMM-based speech synthesis. 414-417
- Junichi Yamagishi, Oliver Watts, Simon King, Bela Usabaev:
Roles of the average voice in speaker-adaptive HMM-based speech synthesis. 418-421
- Yao Qian, Zhi-Jie Yan, Yi-Jian Wu, Frank K. Soong, Xin Zhuang, Shengyi Kong:
An HMM trajectory tiling (HTT) approach to high quality TTS. 422-425
- Yining Chen, Zhi-Jie Yan, Frank K. Soong:
A perceptual study of acceleration parameters in HMM-based TTS. 426-429
- Shuji Yokomizo, Takashi Nose, Takao Kobayashi:
Evaluation of prosodic contextual factors for HMM-based speech synthesis. 430-433
- Slava Shechtman, Alexander Sorin:
Sinusoidal model parameterization for HMM-based TTS system. 805-808
- Yoshinori Shiga, Tomoki Toda, Shinsuke Sakai, Hisashi Kawai:
Improved training of excitation for HMM-based parametric speech synthesis. 809-812
- June Sig Sung, Doo Hwa Hong, Kyung Hwan Oh, Nam Soo Kim:
Excitation modeling based on waveform interpolation for HMM-based speech synthesis. 813-816
- Xin Zhuang, Yao Qian, Frank K. Soong, Yi-Jian Wu, Bo Zhang:
Formant-based frequency warping for improving speaker adaptation in HMM TTS. 817-820
- Hongwei Hu, Martin J. Russell:
Improved modelling of speech dynamics using non-linear formant trajectories for HMM-based speech synthesis. 821-824
- Zhen-Hua Ling, Yu Hu, Li-Rong Dai:
Global variance modeling on the log power spectrum of LSPs for HMM-based speech synthesis. 825-828
- Matt Shannon, William Byrne:
Autoregressive clustering for HMM speech synthesis. 829-832
- Nicholas Pilkington, Heiga Zen:
An implementation of decision tree-based context clustering on graphics processing units. 833-836
- Alexander Gutkin, Xavi Gonzalvo, Stefan Breuer, Paul Taylor:
Quantized HMMs for low footprint text-to-speech synthesis. 837-840
- Oliver Watts, Junichi Yamagishi, Simon King:
The role of higher-level linguistic features in HMM-based speech synthesis. 841-844
- Ayami Mase, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
HMM-based singing voice synthesis system using pitch-shifted pseudo training data. 845-848
- Jinfu Ni, Hisashi Kawai:
An unsupervised approach to creating web audio contents-based HMM voices. 849-852
- Tomoki Koriyama, Takashi Nose, Takao Kobayashi:
Conversational spontaneous speech synthesis using average voice model. 853-856
Multi-Modal Signal Processing
- Jonas Hörnstein, José Santos-Victor:
Learning words and speech units through natural interactions. 434-437
- Qingju Liu, Wenwu Wang, Philip J. B. Jackson:
Bimodal coherence based scale ambiguity cancellation for target speech extraction and enhancement. 438-441
- Hiroaki Kawashima, Yu Horii, Takashi Matsuyama:
Speech estimation in non-stationary noise environments using timing structures between mouth movements and sound signals. 442-445
- Lijuan Wang, Xiaojun Qian, Wei Han, Frank K. Soong:
Synthesizing photo-real talking head via trajectory-guided sample selection. 446-449
- Victoria M. Florescu, Lise Crevier-Buchman, Bruce Denby, Thomas Hueber, Antonia Colazo-Simon, Claire Pillot-Loiseau, Pierre Roussel-Ragot, Cédric Gendrot, Sophie Quattrocchi:
Silent vs vocalized articulation for a portable ultrasound-based silent speech interface. 450-453
- Gregor Hofer, Korin Richmond:
Comparison of HMM and TMDN methods for lip synchronisation. 454-457
Paralanguage
- Florian Schiel, Christian Heinrich, Veronika Neumeyer:
Rhythm and formant features for automatic alcohol detection. 458-461
- Irena Yanushevskaya, Christer Gobl, John Kane, Ailbhe Ní Chasaide:
An exploration of voice source correlates of focus. 462-465
- James D. Harnsberger, Rahul Shrivastav, W. S. Brown Jr.:
Modeling perceived vocal age in american English. 466-469
- Marie-José Caraty, Claude Montacié:
Multivariate analysis of vocal fatigue in continuous reading. 470-473
- Alexander Kain, Jan P. H. van Santen:
Frequency-domain delexicalization using surrogate vowels. 474-477
- Florian Metze, Anton Batliner, Florian Eyben, Tim Polzehl, Björn W. Schuller, Stefan Steidl:
Emotion recognition using imperfect speech recognition. 478-481
- Gang Liu, Yun Lei, John H. L. Hansen:
A novel feature extraction strategy for multi-stream robust emotion identification. 482-485
- Asterios Toutios, Utpala Musti, Slim Ouni, Vincent Colotte, Brigitte Wrobel-Dautcourt, Marie-Odile Berger:
Setup for acoustic-visual speech synthesis by concatenating bimodal units. 486-489
- Bart Jochems, Martha A. Larson, Roeland Ordelman, Ronald Poppe, Khiet P. Truong:
Towards affective state modeling in narrative and conversational settings. 490-493
- Narichika Nomoto, Hirokazu Masataki, Osamu Yoshioka, Satoshi Takahashi:
Detection of anger emotion in dialog speech using prosody feature and temporal relation of utterances. 494-497
- Benjamin Roustan, Marion Dohen:
Gesture and speech coordination: the influence of the relationship between manual gesture and speech. 498-501
- Hynek Boril, Seyed Omid Sadjadi, Tristan Kleinschmidt, John H. L. Hansen:
Analysis and detection of cognitive load and frustration in drivers' speech. 502-505
- Akira Sasou, Yasuharu Hashimoto, Katsuhiko Sakaue:
Acoustic-based recognition of head gestures accompanying speech. 506-509
- Sandro Castronovo, Angela Mahr, Margarita Pentcheva, Christian A. Müller:
Multimodal dialog in the car: combining speech and turn-and-push dial to control comfort functions. 510-513
- Danil Korchagin, Philip N. Garner, Petr Motlícek:
Hands free audio analysis from home entertainment. 514-517
- Shaikh Mostafa Al Masum, Antonio Rui Ferreira Rebordão, Keikichi Hirose:
Affective story teller: a TTS system for emotional expressivity. 518-521
ASR: Speaker Adaptation, Robustness Against Reverberation
- Shweta Ghai, Rohit Sinha:
Enhancing children's speech recognition under mismatched condition by explicit acoustic normalization. 522-525
- Bo Li, Khe Chai Sim:
Comparison of discriminative input and output transformations for speaker adaptation in the hybrid NN/HMM systems. 526-529
- Ravichander Vipperla, Steve Renals, Joe Frankel:
Augmentation of adaptation data. 530-533
- Lukás Machlica, Zbynek Zajíc, Ludek Müller:
Discriminative adaptation based on fast combination of DMAP and dfMLLR. 534-537
- Doddipatla Rama Sanand, Ralf Schlüter, Hermann Ney:
Revisiting VTLN using linear transformation on conventional MFCC. 538-541
- Toyohiro Hayashi, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Speaker adaptation based on nonlinear spectral transform for speech recognition. 542-545
- Tetsuo Kosaka, Takashi Ito, Masaharu Katoh, Masaki Kohda:
Speaker adaptation based on system combination using speaker-class models. 546-549
- Yongwon Jeong, Young Rok Song, Hyung Soon Kim:
Speaker adaptation in transformation space using two-dimensional PCA. 550-553
- Jan Trmal, Jan Zelinka, Ludek Müller:
On speaker adaptive training of artificial neural networks. 554-557
- Yongjun He, Jiqing Han:
Model synthesis for band-limited speech recognition. 558-561
- Takahiro Fukumori, Masanori Morise, Takanobu Nishiura:
Performance estimation of reverberant speech recognition based on reverberant criteria RSR-dn with acoustic parameters. 562-565
- Armin Sehr, Christian Hofmann, Roland Maas, Walter Kellermann:
A novel approach for matched reverberant training of HMMs using data pairs. 566-569
- Hari Krishna Maganti, Marco Matassoni:
An auditory based modulation spectral feature for reverberant speech recognition. 570-573
- Martin Wolf, Climent Nadeu:
On the potential of channel selection for recognition of reverberated speech with multiple microphones. 574-577
- Randy Gomez, Tatsuya Kawahara:
An improved wavelet-based dereverberation for robust automatic speech recognition. 578-581
- Rico Petrick, Thomas Fehér, Masashi Unoki, Rüdiger Hoffmann:
Methods for robust speech recognition in reverberant environments: a comparison. 582-585
Language Learning, TTS, and Other Applications
- Masayuki Suzuki, Yu Qiao, Nobuaki Minematsu, Keikichi Hirose:
Integration of multilayer regression analysis with structure-based pronunciation assessment. 586-589 - Joost van Doremalen, Catia Cucchiarini, Helmer Strik:
Using non-native error patterns to improve pronunciation verification. 590-593 - Dean Luo, Yu Qiao, Nobuaki Minematsu, Yutaka Yamauchi, Keikichi Hirose:
Regularized-MLLR speaker adaptation for computer-assisted language learning system. 594-597 - Kuniaki Hirabayashi, Seiichi Nakagawa:
Automatic evaluation of English pronunciation by Japanese speakers using various acoustic features and pattern recognition techniques. 598-601 - Hsien-Cheng Liao, Jiang-Chun Chen, Sen-Chia Chang, Ying-Hua Guan, Chin-Hui Lee:
Decision tree based tone modeling with corrective feedbacks for automatic Mandarin tone assessment. 602-605 - Jingli Lu, Ruili Wang, Liyanage C. De Silva, Yang Gao, Jia Liu:
CASTLE: a computer-assisted stress teaching and learning environment for learners of English as a second language. 606-609 - Shen Huang, Hongyan Li, Shijin Wang, Jiaen Liang, Bo Xu:
Automatic reference independent evaluation of prosody quality using multiple knowledge fusions. 610-613 - Su-Youn Yoon, Mark Hasegawa-Johnson, Richard Sproat:
Landmark-based automated pronunciation error detection. 614-617 - Zhiwei Shuang, Shiyin Kang, Yong Qin, Li-Rong Dai, Lianhong Cai:
HMM based TTS for mixed language text. 618-621 - Hui Liang, John Dines:
An analysis of language mismatch in HMM state mapping-based cross-lingual speaker adaptation. 622-625 - Tatsuya Kawahara, Norihiro Katsumaru, Yuya Akita, Shinsuke Mori:
Classroom note-taking system for hearing impaired students using automatic speech recognition adapted to lectures. 626-629 - Paul R. Dixon, Sadaoki Furui:
Exploring web-browser based runtimes engines for creating ubiquitous speech interfaces. 630-632
Pitch and Glottal-Waveform Estimation and Modeling I, II
- Xuejing Sun, Sameer Gadre:
Efficient three-stage pitch estimation for packet loss concealment. 633-636 - Keiichi Funaki:
On evaluation of the f0 estimation based on time-varying complex speech analysis. 637-640 - Feng Huang, Tan Lee:
Pitch estimation in noisy speech based on temporal accumulation of spectrum peaks. 641-644 - Tianyu T. Wang, Thomas F. Quatieri:
Multi-pitch estimation by a joint 2-d representation of pitch and pitch dynamics. 645-648 - Pirros Tsiakoulis, Alexandros Potamianos:
On the effect of fundamental frequency on amplitude and frequency modulation patterns in speech resonances. 649-652 - M. Shahidur Rahman, Tetsuya Shimamura:
Pitch determination using autocorrelation function in spectral domain. 653-656 - Thomas Drugman, Thierry Dutoit:
Chirp complex cepstrum-based decomposition for asynchronous glottal analysis. 657-660 - Alan Ó Cinnéide, David Dorran, Mikel Gainza, Eugene Coyle:
Exploiting glottal formant parameters for glottal inverse filtering and parameterization. 661-664 - Nicolas Sturmel, Christophe d'Alessandro, Boris Doval:
Glottal parameters estimation on speech using the zeros of the z-transform. 665-668 - Sri Harish Reddy Mallidi, Kishore Prahallad, Suryakanth V. Gangashetty, B. Yegnanarayana:
Significance of pitch synchronous analysis for speaker recognition using AANN models. 669-672 - Gang Chen, Xue Feng, Yen-Liang Shue, Abeer Alwan:
On using voice source measures in automatic gender classification of children's speech. 673-676 - Wei Chu, Abeer Alwan:
SAFE: a statistical algorithm for F0 estimation for both clean and noisy speech. 2590-2593 - Jung Ook Hong, Patrick J. Wolfe:
Robust and efficient pitch estimation using an iterative ARMA technique. 2594-2597 - Yasunori Ohishi, Hirokazu Kameoka, Daichi Mochihashi, Hidehisa Nagano, Kunio Kashino:
Statistical modeling of F0 dynamics in singing voices based on Gaussian processes with multiple oscillation bases. 2598-2601 - Martin Heckmann, Claudius Gläser, Frank Joublin, Kazuhiro Nakadai:
Applying geometric source separation for improved pitch extraction in human-robot interaction. 2602-2605 - John Kane, Mark Kane, Christer Gobl:
A spectral LF model based approach to voice source parameterisation. 2606-2609 - Thomas Drugman, Thierry Dutoit:
Glottal-based analysis of the lombard effect. 2610-2613
Open Vocabulary Spoken Document Retrieval (Special Session)
- Yoshiaki Itoh, Hiromitsu Nishizaki, Xinhui Hu, Hiroaki Nanjo, Tomoyosi Akiba, Tatsuya Kawahara, Seiichi Nakagawa, Tomoko Matsui, Yoichi Yamashita, Kiyoaki Aikawa:
Constructing Japanese test collections for spoken term detection. 677-680 - Satoshi Natori, Hiromitsu Nishizaki, Yoshihiro Sekiguchi:
Japanese spoken term detection using syllable transition network derived from multiple speech recognizers' outputs. 681-684 - Sha Meng, Weiqiang Zhang, Jia Liu:
Combining Chinese spoken term detection systems via side-information conditioned linear logistic regression. 685-688 - Taisuke Kaneko, Tomoyosi Akiba:
Metric subspace indexing for fast spoken term detection. 689-692 - Chun-an Chan, Lin-Shan Lee:
Unsupervised spoken-term detection with spoken queries using segment-based dynamic time warping. 693-696 - Daniel Schneider, Timo Mertens, Martha A. Larson, Joachim Köhler:
Contextual verification for open vocabulary spoken term detection. 697-700 - Javier Tejedor, Doroteo T. Toledano, Miguel Bautista, Simon King, Dong Wang, José Colás:
Augmented set of features for confidence estimation in spoken term detection. 701-704 - Xinhui Hu, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Cluster-based language model for spoken document retrieval using NMF-based document clustering. 705-708
Robust ASR
- Rogier C. van Dalen, Mark J. F. Gales:
Asymptotically exact noise-corrupted speech likelihoods. 709-712 - Ramón Fernandez Astudillo, Reinhold Orglmeister:
A MMSE estimator in mel-cepstral domain for robust large vocabulary automatic speech recognition using uncertainty propagation. 713-716 - Bhiksha Raj, Tuomas Virtanen, Sourish Chaudhuri, Rita Singh:
Non-negative matrix factorization based compensation of music for automatic speech recognition. 717-720 - Kris Demuynck, Xueru Zhang, Dirk Van Compernolle, Hugo Van hamme:
Feature versus model based noise robustness. 721-724 - Ji Hun Park, Seon Man Kim, Jae Sam Yoon, Hong Kook Kim, Sung Joo Lee, Yunkeun Lee:
SNR-based mask compensation for computational auditory scene analysis applied to speech recognition in a car environment. 725-728 - Chanwoo Kim, Richard M. Stern, Kiwan Eom, Jaewon Lee:
Automatic selection of thresholds for signal separation algorithms based on interaural delay. 729-732
Language and Dialect Identification
- Florian Verdet, Driss Matrouf, Jean-François Bonastre, Jean Hennebert:
Channel detectors for system fusion in the context of NIST LRE 2009. 733-736 - Rong Tong, Bin Ma, Haizhou Li, Engsiong Chng:
Selecting phonotactic features for language recognition. 737-740 - Abualsoud Hanani, Michael J. Carey, Martin J. Russell:
Improved language recognition using mixture components statistics. 741-744 - Mikel Peñagarikano, Amparo Varona, Luis Javier Rodríguez-Fuentes, Germán Bordel:
Using cross-decoder co-occurrences of phone n-grams in SVM-based phonotactic language recognition. 745-748 - Oscar Koller, Alberto Abad, Isabel Trancoso, Céu Viana:
Exploiting variety-dependent phones in portuguese variety identification applied to broadcast news transcription. 749-752 - Fadi Biadsy, Julia Hirschberg, Michael Collins:
Dialect recognition using a phone-GMM-supervector-based SVM kernel. 753-756
Technologies for Learning and Education
- Xiaojun Qian, Frank K. Soong, Helen M. Meng:
Discriminative acoustic model for improving mispronunciation detection and diagnosis in computer-aided pronunciation training (CAPT). 757-760 - Liang-Yu Chen, Jyh-Shing Roger Jang:
Automatic pronunciation scoring using learning to rank and DP-based score segmentation. 761-764 - Wai Kit Lo, Shuang Zhang, Helen M. Meng:
Automatic derivation of phonological rules for mispronunciation detection in a computer-assisted pronunciation training system. 765-768 - Minh Duong, Jack Mostow:
Adapting a duration synthesis model to rate children's oral reading prosody. 769-772 - Su-Youn Yoon, Lei Chen, Klaus Zechner:
Predicting word accuracy for the automatic speech recognition of non-native speech. 773-776 - Taotao Zhu, Dengfeng Ke, Zhenbiao Chen, Bo Xu:
A new approach for automatic tone error detection in strong accented Mandarin based on dominant set. 777-780
Emotional Speech
- S. R. Mahadeva Prasanna, D. Govind:
Analysis of excitation source information in emotional speech. 781-784 - Dongrui Wu, Thomas D. Parsons, Shrikanth S. Narayanan:
Acoustic feature analysis in speech emotion primitives estimation. 785-788 - Lan-Ying Yeh, Tai-Shih Chi:
Spectro-temporal modulations for robust speech emotion recognition. 789-792 - Chi-Chun Lee, Matthew Black, Athanasios Katsamanis, Adam C. Lammert, Brian R. Baucom, Andrew Christensen, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Quantification of prosodic entrainment in affective spontaneous spoken interactions of married couples. 793-796 - Emily Mower, Kyu Jeong Han, Sungbok Lee, Shrikanth S. Narayanan:
A cluster-profile representation of emotion using agglomerative hierarchical clustering. 797-800 - Björn W. Schuller, Laurence Devillers:
Incremental acoustic valence recognition: an inter-corpus perspective on features, matching, and performance in a gating paradigm. 801-804
New Paradigms in ASR I, II
- Xiaodong Wang, Kunihiko Owa, Makoto Shozakai:
Mandarin digit recognition assisted by selective tone distinction. 857-860 - Kazuhiko Abe, Sakriani Sakti, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Brazilian portuguese acoustic model training based on data borrowing from other language. 861-864 - Ngoc Thang Vu, Tim Schlippe, Franziska Kraus, Tanja Schultz:
Rapid bootstrapping of five eastern european languages using the rapid language adaptation toolkit. 865-868 - Houwei Cao, Tan Lee, P. C. Ching:
Cross-lingual speaker adaptation via Gaussian component mapping. 869-872 - Mohamed Elmahdy, Rainer Gruhn, Wolfgang Minker, Slim Abdennadher:
Cross-lingual acoustic modeling for dialectal Arabic speech recognition. 873-876 - Samuel Thomas, Sriram Ganapathy, Hynek Hermansky:
Cross-lingual and multi-stream posterior features for low resource LVCSR systems. 877-880 - Shiva Sundaram, Jerome R. Bellegarda:
Latent perceptual mapping: a new acoustic modeling framework for speech recognition. 881-884 - Richard Dufour, Fethi Bougares, Yannick Estève, Paul Deléglise:
Unsupervised model adaptation on targeted speech segments for LVCSR system combination. 885-888 - Irene Ayllón Clemente, Martin Heckmann, Alexander Denecke, Britta Wrede, Christian Goerick:
Incremental word learning using large-margin discriminative training and variance floor estimation. 889-892 - Tuomas Virtanen, Jort F. Gemmeke, Antti Hurmalainen:
State-based labelling for a sparse representation of speech and its application to robust speech recognition. 893-896 - Mirko Hannemann, Stefan Kombrink, Martin Karafiát, Lukás Burget:
Similarity scoring for recognizing repeated out-of-vocabulary words. 897-900 - Dino Seppi, Dirk Van Compernolle:
Data pruning for template-based automatic speech recognition. 901-904 - Man-Hung Siu, Herbert Gish, Arthur Chan, William Belfield:
Improved topic classification and keyword discovery using an HMM-based speech recognizer trained without supervision. 2838-2841 - Dimitri Kanevsky, Tara N. Sainath, Bhuvana Ramabhadran, David Nahamoo:
An analysis of sparseness and regularization in exemplar-based methods for speech classification. 2842-2845 - Abdel-rahman Mohamed, Dong Yu, Li Deng:
Investigation of full-sequence training of deep belief networks for speech recognition. 2846-2849 - Yow-Bang Wang, Lin-Shan Lee:
Mandarin tone recognition using affine-invariant prosodic features and tone posteriorgram. 2850-2853 - Geoffrey Zweig, Patrick Nguyen, Jasha Droppo, Alex Acero:
Continuous speech recognition with a TF-IDF acoustic model. 2854-2857 - Geoffrey Zweig, Patrick Nguyen:
SCARF: a segmental conditional random field toolkit for speech recognition. 2858-2861
Speech Production: Various Approaches
- Akiko Amano-Kusumoto, John-Paul Hosom, Alexander Kain:
Speaking style dependency of formant targets. 905-908 - Tatsuya Kitamura:
Similarity of effects of emotions on the speech organ configuration with and without speaking. 909-912 - Daniel Bone, Samuel Kim, Sungbok Lee, Shrikanth S. Narayanan:
A study of intra-speaker and inter-speaker affective variability using electroglottograph and inverse filtered glottal waveforms. 913-916 - Ken-Ichi Sakakibara, Hiroshi Imagawa, Miwako Kimura, Hisayuki Yokonishi, Niro Tayama:
Modal analysis of vocal fold vibrations using laryngotopography. 917-920 - Martti Vainio, Matti Airas, Juhani Järvikivi, Paavo Alku:
Laryngeal voice quality in the expression of focus. 921-924 - Masako Fujimoto, Kikuo Maekawa, Seiya Funatsu:
Laryngeal characteristics during the production of geminate consonants. 925-928 - Julien Cisonni, Kazunori Nozaki, Annemie Van Hirtum, Shigeo Wada:
Numerical study of turbulent flow-induced sound production in presence of a tooth-shaped obstacle: towards sibilant [s] physical modeling. 929-932 - Iris Hanique, Barbara Schuppler, Mirjam Ernestus:
Morphological and predictability effects on schwa reduction: the case of dutch word-initial syllables. 933-936 - Samer Al Moubayed, Gopal Ananthakrishnan:
Acoustic-to-articulatory inversion based on local regression. 937-940 - Mirjam Broersma:
Korean lenis, fortis, and aspirated stops: effect of place of articulation on acoustic realization. 941-944 - Toru Nakashika, Ryuki Tachibana, Masafumi Nishimura, Tetsuya Takiguchi, Yasuo Ariki:
Speech synthesis by modeling harmonics structure with multiple function. 945-948 - Makoto Otani, Tatsuya Hirahara:
Physics of body-conducted silent speech - production, propagation and representation of non-audible murmur. 949-952
Speech Enhancement
- Subhojit Chakladar, Nam Soo Kim, Yu Gwang Jin, Tae Gyoon Kang:
Multichannel noise reduction using low order RTF estimate. 953-956 - Inho Lee, Jongsung Yoon, Yoonjae Lee, Hanseok Ko:
Reinforced blocking matrix with cross channel projection for speech enhancement. 957-960 - Ning Cheng, Wenju Liu, Lan Wang:
Masking property based microphone array post-filter design. 961-964 - Yusuke Sato, Tetsuya Hoya, Hovagim Bakardjian, Andrzej Cichocki:
Reduction of broadband noise in speech signals by multilinear subspace analysis. 965-968 - Jungpyo Hong, Seung Ho Han, Sangbae Jeong, Minsoo Hahn:
Novel probabilistic control of noise reduction for improved microphone array beamforming. 969-972 - Kai Li, Qiang Fu, Yonghong Yan:
Speech enhancement using improved generalized sidelobe canceller in frequency domain with multi-channel postfiltering. 973-976 - Jani Even, Carlos Toshinori Ishi, Hiroshi Saruwatari, Norihiro Hagita:
Close speaker cancellation for suppression of non-stationary background noise for hands-free speech interface. 977-980 - Ajay Srinivasamurthy, Thippur V. Sreenivas:
Multi-channel iterative dereverberation based on codebook constrained iterative multi-channel wiener filter. 981-984 - Anand Joseph Xavier Medabalimi, Sri Harish Reddy Mallidi, B. Yegnanarayana:
Speaker-dependent mapping of source and system features for enhancement of throat microphone speech. 985-988 - Jun Cai, Stefano Marini, Pierre Malarme, Francis Grenez, Jean Schoentgen:
An analytic modeling approach to enhancing throat microphone speech commands for keyword spotting. 989-992 - Stephen So, Kamil K. Wójcicki, Kuldip K. Paliwal:
Single-channel speech enhancement using kalman filtering in the modulation domain. 993-996 - Miao Yao, Weiqian Liang:
Integrated feedback and noise reduction algorithm in digital hearing aids via oscillation detection. 997-1000 - Charles Mercier, Roch Lefebvre:
A blind signal-to-noise ratio estimator for high noise speech recordings. 1001-1004
Fact and Replica of Speech Production (Special Session)
- Hiroshi Imagawa, Ken-Ichi Sakakibara, Isao T. Tokuda, Mamiko Otsuka, Niro Tayama:
Estimation of glottal area function using stereo-endoscopic high-speed digital imaging. 1005-1008 - Kazunori Nozaki, Youhei Ohnishi, Takashi Suda, Shigeo Wada, Shinji Shimojo:
Toward aero-acoustical analysis of the sibilant /s/: an oral cavity modeling. 1009-1012 - Kunitoshi Motoki:
Effects of wall impedance on transmission and attenuation of higher-order modes in vocal-tract model. 1013-1016 - Peter Birkholz, Bernd J. Kröger, Christiane Neuschaefer-Rube:
Articulatory synthesis and perception of plosive-vowel syllables with virtual consonant targets. 1017-1020 - Kotaro Fukui, Toshihiro Kusano, Yoshikazu Mukaeda, Yuto Suzuki, Atsuo Takanishi, Masaaki Honda:
Speech robot mimicking human articulatory motion. 1021-1024 - Takayuki Arai:
Mechanical vocal-tract models for speech dynamics. 1025-1028 - Michael C. Brady:
Prosodic timing analysis for articulatory re-synthesis using a bank of resonators with an adaptive oscillator. 1029-1032
ASR: Language Modeling
- Ahmad Emami, Stanley F. Chen, Abraham Ittycheriah, Hagen Soltau, Bing Zhao:
Decoding with shrinkage-based language models. 1033-1036 - Stanley F. Chen, Stephen M. Chu:
Enhanced word classing for model M. 1037-1040 - Junho Park, Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Improved neural network based language modelling and adaptation. 1041-1044 - Tomás Mikolov, Martin Karafiát, Lukás Burget, Jan Cernocký, Sanjeev Khudanpur:
Recurrent neural network based language model. 1045-1048 - Preethi Jyothi, Eric Fosler-Lussier:
Discriminative language modeling using simulated ASR errors. 1049-1052 - Graham Neubig, Masato Mimura, Shinsuke Mori, Tatsuya Kawahara:
Learning a language model from continuous speech. 1053-1056
Single-Channel Speech Enhancement
- Stephen So, Kuldip K. Paliwal:
Fast converging iterative kalman filtering for speech enhancement using long and overlapped tapered windows with large side lobe attenuation. 1081-1084 - Xuejing Sun, Kuan-Chieh Yen, Rogerio Guedes Alves:
Robust noise estimation using minimum correction with harmonicity control. 1085-1088 - Mahdi Triki:
New insights into subspace noise tracking. 1089-1092 - Mahdi Triki, Kees Janse:
Bias considerations for minimum subspace noise tracking. 1093-1096 - Ji Ming, Ramji Srinivasan, Danny Crookes:
A corpus-based approach to speech enhancement from nonstationary noise. 1097-1100 - Zhe Chen, You-Chi Cheng, Fuliang Yin, Chin-Hui Lee:
Bandwidth expansion of speech based on wavelet transform modulus maxima vector mapping. 1101-1104
Speech Synthesis: Miscellaneous Topics
- Kalu U. Ogbureke, Peter Cahill, Julie Carson-Berndsen:
Hidden Markov models with context-sensitive observations for grapheme-to-phoneme conversion. 1105-1108 - Brian Langner, Stephan Vogel, Alan W. Black:
Evaluating a dialog language generation system: comparing the mountain system to other NLG approaches. 1109-1112 - Wesley Mattheyses, Lukas Latacz, Werner Verhelst:
Active appearance models for photorealistic visual speech synthesis. 1113-1116 - Jerome R. Bellegarda:
Latent affective mapping: a novel framework for the data-driven analysis of emotion in text. 1117-1120 - Anna C. Janska, Robert A. J. Clark:
Native and non-native speaker judgements on the quality of synthesized speech. 1121-1124 - Dominic Espinosa, Michael White, Eric Fosler-Lussier, Chris Brew:
Machine learning for text selection with expressive unit-selection voices. 1125-1128
Prosody: Basics & Applications
- Alexei V. Ivanov, Giuseppe Riccardi, Sucheta Ghosh, Sara Tonelli, Evgeny A. Stepanov:
Acoustic correlates of meaning structure in conversational speech. 1129-1132 - Nicolas Obin, Xavier Rodet, Anne Lacheret:
HMM-based prosodic structure model using rich linguistic context. 1133-1136 - Charlotte Wollermann, Bernhard Schröder, Ulrich Schade:
Audiovisual congruence and pragmatic focus marking. 1137-1140 - Margaret Zellers, Michele Gubian, Brechtje Post:
Redescribing intonational categories with functional data analysis. 1141-1144 - Shen Huang, Hongyan Li, Shijin Wang, Jiaen Liang, Bo Xu:
Exploring goodness of prosody by diverse matching templates. 1145-1148 - Mickael Rouvier, Richard Dufour, Georges Linarès, Yannick Estève:
A language-identification inspired method for spontaneous speech detection. 1149-1152 - Gérard Bailly, Amélie Lelong:
Speech dominoes and phonetic convergence. 1153-1156 - Mátyás Brendel, Riccardo Zaccarelli, Laurence Devillers:
A quick sequential forward floating feature selection algorithm for emotion detection from speech. 1157-1160 - Géza Kiss, Jan P. H. van Santen:
Automated vocal emotion recognition using phoneme class specific features. 1161-1164 - Adrian Pass, Jianguo Zhang, Darryl Stewart:
Feature selection for pose invariant lip biometrics. 1165-1168 - Hussein Hussein, Rüdiger Hoffmann:
Signal-based accent and phrase marking using the fujisaki model. 1169-1172 - Jangwon Kim, Sungbok Lee, Shrikanth S. Narayanan:
A study of interplay between articulatory movement and prosodic characteristics in emotional speech production. 1173-1176
ASR: Feature Extraction I, II
- Shang-wen Li, Liang-Che Sun, Lin-Shan Lee:
Improved phoneme recognition by integrating evidence from spectro-temporal and cepstral features. 1177-1180 - Suman V. Ravuri, Nelson Morgan:
Using spectro-temporal features to improve AFE feature extraction for ASR. 1181-1184 - Ibon Saratxaga, Inma Hernáez, Igor Odriozola, Eva Navas, Iker Luengo, Daniel Erro:
Using harmonic phase information to improve ASR rate. 1185-1188 - Kazumasa Yamamoto, Eiichi Sueyoshi, Seiichi Nakagawa:
Speech recognition using long-term phase information. 1189-1192 - Jan Zelinka, Jan Trmal, Ludek Müller:
Low-dimensional space transforms of posteriors in speech recognition. 1193-1196 - Christian Plahl, Ralf Schlüter, Hermann Ney:
Hierarchical bottle neck features for LVCSR. 1197-1200 - Frantisek Grézl, Martin Karafiát:
Hierarchical neural net architectures for feature extraction in ASR. 1201-1204 - Vivek Kumar Rangarajan Sridhar, Rohit Prasad, Prem Natarajan:
Mutual information analysis for feature and sensor subset selection in surface electromyography based speech recognition. 1205-1208 - Bernd T. Meyer, Birger Kollmeier:
Learning from human errors: prediction of phoneme confusions based on modified ASR training. 1209-1212 - Bo Li, Khe Chai Sim:
Hidden logistic linear regression for support vector machine based phone verification. 2614-2617 - Tim Ng, Bing Zhang, Long Nguyen:
Jointly optimized discriminative features for speech recognition. 2618-2621 - Florian Müller, Alfred Mertins:
Invariant integration features combined with speaker-adaptation methods. 2622-2625 - Mark Raugas, Vivek Kumar Rangarajan Sridhar, Rohit Prasad, Prem Natarajan:
Multi resolution discriminative models for subvocalic speech recognition. 2626-2629 - Fabio Valente, Mathew Magimai-Doss, Christian Plahl, Suman V. Ravuri, Wen Wang:
A comparative large scale study of MLP features for Mandarin ASR. 2630-2633 - Cong-Thanh Do, Dominique Pastor, Gaël Le Lan, André Goalic:
Recognizing cochlear implant-like spectrally reduced speech with HMM-based ASR: experiments with MFCCs and PLP coefficients. 2634-2637
Speech Perception: Cross Language and Age
- Kazuhiro Kondo, Takayuki Kanda, Yosuke Kobayashi, Hiroyuki Yagyu:
Speech intelligibility of diagonally localized speech with competing noise using bone-conduction headphones. 1213-1216 - Pierre L. Divenyi:
Masking of vowel-analog transitions by vowel-analog distracters. 1217-1220 - François Pellegrino, Emmanuel Ferragne, Fanny Meunier:
2010, a speech oddity: phonetic transcription of reversed speech. 1221-1224 - Hsin-Yi Lin, Janice Fon:
Perception on pitch reset at discourse boundaries. 1225-1228 - Marjorie Dole, Michel Hoen, Fanny Meunier:
Effect of spatial separation on speech-in-noise comprehension in dyslexic adults. 1229-1232 - Ellen Marklund, Francisco Lacerda, Anna Ericsson:
Speech categorization context effects in seven- to nine-month-old infants. 1233-1236 - Diane Kewley-Port, Larry E. Humes, Daniel Fogerty:
Changes in temporal processing of speech across the adult lifespan. 1237-1240 - Jared Bernstein, Jian Cheng, Masanori Suzuki:
Fluency and structural complexity as predictors of L2 oral proficiency. 1241-1244 - Marco van de Ven, Benjamin V. Tucker, Mirjam Ernestus:
Semantic facilitation in bilingual everyday speech comprehension. 1245-1248 - Bo-ren Hsieh, Ho-hsien Pan:
L2 experience and non-native vowel categorization of L1-Mandarin speakers. 1249-1252 - Mirjam Wester:
Cross-lingual talker discrimination. 1253-1256 - Takashi Otake:
Dajare is not the lowest form of wit. 1257-1260
SLP Systems
- Rafael Torres, Shota Takeuchi, Hiromichi Kawanami, Tomoko Matsui, Hiroshi Saruwatari, Kiyohiro Shikano:
Comparison of methods for topic classification in a speech-oriented guidance system. 1261-1264 - Pere Comas, Jordi Turmo, Lluís Màrquez:
Using dependency parsing and machine learning for factoid question answering on spoken documents. 1265-1268 - Carolina Parada, Abhinav Sethy, Mark Dredze, Frederick Jelinek:
A spoken term detection framework for recovering out-of-vocabulary words using the web. 1269-1272 - Hung-yi Lee, Chia-Ping Chen, Ching-feng Yeh, Lin-Shan Lee:
Improved spoken term detection by discriminative training of acoustic models based on user relevance feedback. 1273-1276 - Sebastian Tschöpel, Daniel Schneider:
A lightweight keyword and tag-cloud retrieval algorithm for automatic speech recognition transcripts. 1277-1280 - Noboru Kanedera, Tetsuo Funada, Seiichi Nakagawa:
Lecture subtopic retrieval by retrieval keyword expansion using subordinate concept. 1281-1284 - Hiroaki Nanjo, Yusuke Iyonaga, Takehiko Yoshimi:
Spoken document retrieval for oral presentations integrating global document similarities into local document similarities. 1285-1288 - Joseph Polifroni, Stephanie Seneff:
Combining word-based features, statistical language models, and parsing for named entity recognition. 1289-1292 - Azeddine Zidouni, Sophie Rosset, Hervé Glotin:
Efficient combined approach for named entity recognition in spoken language. 1293-1296 - Sree Harsha Yella, Vasudeva Varma, Kishore Prahallad:
Prominence based scoring of speech segments for automatic speech-to-speech summarization. 1297-1300 - Zihan Liu, Lei Xie, Wei Feng:
Maximum lexical cohesion for fine-grained news story segmentation. 1301-1304 - Xiaoxuan Wang, Lei Xie, Bin Ma, Engsiong Chng, Haizhou Li:
Phoneme lattice based texttiling towards multilingual story segmentation. 1305-1308
Quality of Experiencing Speech Services (Special Session)
- Anton Schlesinger, Marinus M. Boone:
The characterization of the relative information content by spectral features for the objective intelligibility assessment of nonlinearly processed speech. 1309-1312 - Marcel Wältermann, Alexander Raake, Sebastian Möller:
Analytical assessment and distance modeling of speech transmission quality. 1313-1316 - Nicolas Côté, Vincent Koehl, Valérie Gautier-Turbin, Alexander Raake, Sebastian Möller:
An intrusive super-wideband speech quality model: DIAL. 1317-1320 - Sebastian Egger, Raimund Schatz, Stefan Scherer:
It takes two to tango - assessing the impact of delay on conversational interactivity on perceived speech quality. 1321-1324 - Sebastian Möller, Florian Hinterleitner, Tiago H. Falk, Tim Polzehl:
Comparison of approaches for instrumentally predicting the quality of text-to-speech systems. 1325-1328 - Imre Kiss, Joseph Polifroni, Chao Wang, Ghinwa F. Choueiter, Mike Phillips:
A hybrid architecture for mobile voice user interfaces. 1329-1332 - Markku Turunen, Jaakko Hakulinen, Tomi Heimonen:
Assessment of spoken and multimodal applications: lessons learned from laboratory and field studies. 1333-1336 - Klaus-Peter Engelbrecht, Hamed Ketabdar, Sebastian Möller:
Improving cross database prediction of dialogue quality using mixture of experts. 1337-1340
Language Processing
- Camille Guinaudeau, Guillaume Gravier, Pascale Sébillot:
Improving ASR-based topic segmentation of TV programs with confidence measures and semantic relations. 1365-1368 - Saturnino Luz, Jing Su:
The relevance of timing, pauses and overlaps in dialogues: detecting topic changes in scenario based meetings. 1369-1372 - Richard Dufour, Benoît Favre:
Semi-supervised part-of-speech tagging in speech applications. 1373-1376 - Frédéric Tantini, Christophe Cerisara, Claire Gardent:
Memory-based active learning for French broadcast news. 1377-1380 - Dan Gillick:
Can conversational word usage be used to predict speaker demographics?. 1381-1384 - Chao-Hong Liu, Chung-Hsien Wu:
Prosodic word-based error correction in speech recognition using prosodic word expansion and contextual information. 1385-1388
Speech and Audio Segmentation
- Sarah Hoffmann, Beat Pfister:
Fully automatic segmentation for prosodic speech corpora. 1389-1392 - Vahid Khanagha, Khalid Daoudi, Oriol Pont, Hussein M. Yahia:
A novel text-independent phonetic segmentation algorithm based on the microcanonical multiscale formalism. 1393-1396 - You-Yu Lin, Yih-Ru Wang, Yuan-Fu Liao:
Phone boundary detection using sample-based acoustic parameters. 1397-1400 - Utpala Musti, Asterios Toutios, Slim Ouni, Vincent Colotte, Brigitte Wrobel-Dautcourt, Marie-Odile Berger:
HMM-based automatic visual speech segmentation using facial data. 1401-1404 - David Wang, Robert Vogt, Sridha Sridharan:
Bayes factor based speaker segmentation for speaker diarization. 1405-1408 - Qiang Huang, Stephen J. Cox:
Using high-level information to detect key audio events in a tennis game. 1409-1412
Prosody: Analysis
- Catherine Lai:
What do you mean, you're uncertain?: the interpretation of cue words and rising intonation in dialogue. 1413-1416 - Yi-Fen Liu, Shu-Chuan Tseng, Jyh-Shing Roger Jang, C.-H. Alvin Chen:
Coping imbalanced prosodic unit boundary detection with linguistically-motivated prosodic features. 1417-1420 - Zhigang Chen, Guoping Hu, Wei Jiang:
Improving prosodic phrase prediction by unsupervised adaptation and syntactic features extraction. 1421-1424 - Yujia Li, Tan Lee:
Perception-based automatic approximation of F0 contours in Cantonese speech. 1425-1428 - Raul Fernandez, Bhuvana Ramabhadran:
Discriminative training and unsupervised adaptation for labeling prosodic events with limited training data. 1429-1432 - Erin Cvejic, Jeesun Kim, Chris Davis, Guillaume Gibert:
Prosody for the eyes: quantifying visual prosody using guided principal component analysis. 1433-1436
Systems for LVCSR and Rich Transcription
- Naveen Parihar, Ralf Schlüter, David Rybach, Eric A. Hansen:
Parallel lexical-tree based LVCSR on multi-core processors. 1485-1488 - Jike Chong, Ekaterina Gonina, Kisun You, Kurt Keutzer:
Exploring recognition network representations for efficient speech inference on highly parallel platforms. 1489-1492 - Diamantino Caseiro:
WFST compression for automatic speech recognition. 1493-1496 - Ivan Bulyko:
Speech recognizer optimization under speed constraints. 1497-1500 - Florian Metze, Roger Hsiao, Qin Jin, Udhyakumar Nallasamy, Tanja Schultz:
The 2010 CMU GALE speech-to-text system. 1501-1504 - Tin Lay Nwe, Hanwu Sun, Bin Ma, Haizhou Li:
Speaker diarization in meeting audio for single distant microphone. 1505-1508 - Fernando Batista, Helena Moniz, Isabel Trancoso, Hugo Meinedo, Ana Isabel Mata, Nuno J. Mamede:
Extending the punctuation module for European Portuguese. 1509-1512 - Sakriani Sakti, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Utilizing a noisy-channel approach for Korean LVCSR. 1513-1516 - Markus Nußbaum-Thom, Simon Wiesler, Martin Sundermeyer, Christian Plahl, Stefan Hahn, Ralf Schlüter, Hermann Ney:
The RWTH 2009 quaero ASR evaluation system for English and German. 1517-1520
Phonetics
- Benjamin Munson, Renata Solum:
When is indexical information about speech activated? Evidence from a cross-modal priming experiment. 1521-1524 - Benjamin Munson:
The influence of actual and perceived sexual orientation on diadochokinetic rate in women and men. 1525-1528 - Kristine M. Yu:
Laryngealization and features for Chinese tonal recognition. 1529-1532 - Viet Son Nguyen, Eric Castelli, René Carré:
Production and perception of Vietnamese short vowels in V1V2 context. 1533-1536 - Gertraud Fenk-Oczlon, August Fenk:
Measuring basic tempo across languages and some implications for speech rhythm. 1537-1540 - Yukari Hirata, Shigeaki Amano:
Durational structure of Japanese single/geminate stops in three- and four-mora words spoken at varied rates. 1541-1544 - Shin-ichiro Sano, Tomohiko Ooigawa:
Distribution and trichotomic realization of voiced velars in Japanese - an experimental study. 1545-1548 - Jagoda Sieczkowska, Bernd Möbius, Grzegorz Dogil:
Specification in context - devoicing processes in Polish, French, American English and German sonorants. 1549-1552 - Kuniko Y. Nielsen:
Phonetic imitation of Japanese vowel devoicing. 1553-1556 - Mary Stevens, John Hajek:
Post-aspiration in standard Italian: some first cross-regional acoustic evidence. 1557-1560 - Mirko Grimaldi, Andrea Calabrese, Francesco Sigona, Luigia Garrapa, Bianca Sisinni:
Articulatory grounding of Southern Salentino harmony processes. 1561-1564 - Yuuki Tanida, Taiji Ueno, Satoru Saito, Matthew A. Lambon Ralph:
Effects of accent typicality and phonotactic frequency on nonword immediate serial recall performance in Japanese. 1565-1567 - Osamu Fujimura:
How abstract is phonetics? 1568-1571
Speech Production: Vocal Tract Modeling and Imaging
- Adam C. Lammert, Michael I. Proctor, Shrikanth S. Narayanan:
Data-driven analysis of realtime vocal tract MRI using correlated image regions. 1572-1575 - Michael I. Proctor, Daniel Bone, Athanasios Katsamanis, Shrikanth S. Narayanan:
Rapid semi-automatic segmentation of real-time magnetic resonance images for parametric vocal tract analysis. 1576-1579 - Yoon-Chul Kim, Shrikanth S. Narayanan, Krishna S. Nayak:
Improved real-time MRI of oral-velar coordination using a golden-ratio spiral view order. 1580-1583 - Erik Bresch, Athanasios Katsamanis, Louis Goldstein, Shrikanth S. Narayanan:
Statistical multi-stream modeling of real-time MRI articulatory speech data. 1584-1587 - Gopal Ananthakrishnan, Pierre Badin, Julián Andrés Valdés Vargas, Olov Engwall:
Predicting unseen articulations from multi-speaker articulatory models. 1588-1591 - Chao Qin, Miguel Á. Carreira-Perpiñán:
Estimating missing data sequences in x-ray microbeam recordings. 1592-1595 - Chao Qin, Miguel Á. Carreira-Perpiñán, Mohsen Farhadloo:
Adaptation of a tongue shape model by local feature transformations. 1596-1599 - Sungbok Lee, Shrikanth S. Narayanan:
Vocal tract contour analysis of emotional speech by the functional data curve representation. 1600-1603 - Adam C. Lammert, Louis Goldstein, Khalil Iskarous:
Locally-weighted regression for estimating the forward kinematics of a geometric vocal tract model. 1604-1607 - Michael Reimer, Frank Rudzicz:
Identifying articulatory goals from kinematic data using principal differential analysis. 1608-1611 - Zuheng Ming, Denis Beautemps, Gang Feng, Sébastien Schmerber:
Estimation of speech lip features from discrete cosinus transform. 1612-1615 - Farzaneh Ahmadi, Ian Vince McLoughlin, Hamid R. Sharifzadeh:
Autoregressive modelling for linear prediction of ultrasonic speech. 1616-1619
Speech Intelligibility Enhancement for All Ages, Health Conditions and Environments (Special Session)
- Takayuki Arai, Nao Hodoshima:
Enhanced speech yielding higher intelligibility for all listeners and environments. 1620-1623 - Seyed Omid Sadjadi, Sanjay A. Patil, John H. L. Hansen:
Quality conversion of non-acoustic signals for facilitating human-to-human speech communication under harsh acoustic conditions. 1624-1627 - Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
The use of air-pressure sensor in electrolaryngeal speech enhancement based on statistical voice conversion. 1628-1631 - Gibak Kim, Philipos C. Loizou:
A new binary mask based on noise constraints for improved speech intelligibility. 1632-1635 - Yan Tang, Martin Cooke:
Energy reallocation strategies for speech enhancement in known noise conditions. 1636-1639 - Jing Chen, Thomas Baer, Brian C. J. Moore:
Effects of enhancement of spectral changes on speech quality and subjective speech intelligibility. 1640-1643
ASR: Acoustic Model Adaptation
- Catherine Breslin, K. K. Chin, Mark J. F. Gales, Kate M. Knill, Haitian Xu:
Prior information for rapid speaker adaptation. 1644-1647 - Jonas Lööf, Ralf Schlüter, Hermann Ney:
Discriminative adaptation for log-linear acoustic models. 1648-1651 - Dimitra Vergyri, Lori Lamel, Jean-Luc Gauvain:
Automatic speech recognition of multiple accented English data. 1652-1655 - Jinyu Li, Yu Tsao, Chin-Hui Lee:
Shrinkage model adaptation in automatic speech recognition. 1656-1659 - Jinyu Li, Dong Yu, Yifan Gong, Li Deng:
Unscented transform with online distortion estimation for HMM adaptation. 1660-1663 - Michael L. Seltzer, Alex Acero:
HMM adaptation using linear spline interpolation with integrated spline parameter training for robust speech recognition. 1664-1667
SLP Systems for Information Extraction/Retrieval
- Dong Wang, Simon King, Nicholas W. D. Evans, Raphaël Troncy:
CRF-based stochastic pronunciation modeling for out-of-vocabulary spoken term detection. 1668-1671 - Chia-Ping Chen, Hung-yi Lee, Ching-feng Yeh, Lin-Shan Lee:
Improved spoken term detection by feature space pseudo-relevance feedback. 1672-1675 - Aren Jansen, Kenneth Church, Hynek Hermansky:
Towards spoken term discovery at scale with zero resources. 1676-1679 - Evandro B. Gouvêa, Tony Ezzat:
Vocabulary independent spoken query: a case for subword units. 1680-1683 - Shih-Hsiang Lin, Yao-Ming Yeh, Berlin Chen:
Extractive speech summarization - from the view of decision theory. 1684-1687 - Gabriel Murray, Giuseppe Carenini, Raymond T. Ng:
The impact of ASR on abstractive vs. extractive meeting summaries. 1688-1691
Speech Representation
- Li Deng, Michael L. Seltzer, Dong Yu, Alex Acero, Abdel-rahman Mohamed, Geoffrey E. Hinton:
Binary coding of speech spectrograms using a deep auto-encoder. 1692-1695 - Juhan Nam, Gautham J. Mysore, Joachim Ganseman, Kyogu Lee, Jonathan S. Abel:
A super-resolution spectrogram using coupled PLCA. 1696-1699 - Georgios Tzedakis, Yannis Pantazis, Olivier Rosec, Yannis Stylianou:
Fast least-squares solution for sinusoidal, harmonic and quasi-harmonic models. 1700-1703 - Afsaneh Asaei, Hervé Bourlard, Philip N. Garner:
Sparse component analysis for speech recognition in multi-speaker environment. 1704-1707 - Trond Skogstad, Torbjørn Svendsen:
Intra-frame variability as a predictor of frame classifiability. 1708-1711 - Tetsuya Shimamura, Ngoc Dinh Nguyen:
Autocorrelation and double autocorrelation based spectral representations for a noisy word recognition system. 1712-1715
Voice Conversion
- Elina Helander, Hanna Silén, Joaquín Míguez, Moncef Gabbouj:
Maximum a posteriori voice conversion using sequential Monte Carlo methods. 1716-1719 - Pierre Lanchantin, Xavier Rodet:
Dynamic model selection for spectral voice conversion. 1720-1723 - Takashi Nose, Takao Kobayashi:
Speaker-independent HMM-based voice conversion using quantized fundamental frequency. 1724-1727 - Daisuke Saito, Shinji Watanabe, Atsushi Nakamura, Nobuaki Minematsu:
Probabilistic integration of joint density model and speaker model for voice conversion. 1728-1731 - Zhizheng Wu, Tomi Kinnunen, Engsiong Chng, Haizhou Li:
Text-independent F0 transformation with non-parallel data for voice conversion. 1732-1735 - Xiaodan Zhuang, Lijuan Wang, Frank K. Soong, Mark Hasegawa-Johnson:
A minimum converted trajectory error (MCTE) approach to high quality speech-to-lips conversion. 1736-1739
Prosody: Language-Specific Models
- Anastasia Karlsson, David House, Jan-Olof Svantesson, Damrong Tayanin:
Influence of lexical tones on intonation in Kammu. 1740-1743 - Satoshi Nambu, Yong-cheol Lee:
Phonetic realization of second occurrence focus in Japanese. 1744-1747 - Jianjing Kuang:
Prosodic grouping and relative clause disambiguation in Mandarin. 1748-1751 - Ya Li, Jianhua Tao, Meng Zhang, Shifeng Pan, Xiaoying Xu:
Text-based unstressed syllable prediction in Mandarin. 1752-1755 - Tomás Dubeda:
"flat pitch accents" in Czech. 1756-1759 - Tomás Dubeda:
Positional variability of pitch accents in Czech. 1760-1763 - Shyamal Kr. Das Mandal, Arup Saha, Tulika Basu, Keikichi Hirose, Hiroya Fujisaki:
Modeling of sentence-medial pauses in Bangla readout speech: occurrence and duration. 1764-1767 - Adrian Leemann, Lucy Zuberbühler:
Declarative sentence intonation patterns in 8 Swiss German dialects. 1768-1771 - Je Hun Jeon, Yang Liu:
Syllable-level prominence detection with acoustic evidence. 1772-1775 - Sankalan Prasad, Kalika Bali:
Prosody cues for classification of the discourse particle "hã" in Hindi. 1776-1779 - Yuan Jia, Aijun Li:
Interaction of syntax-marked focus and wh-question induced focus in standard Chinese. 1780-1783 - Samer Al Moubayed, Jonas Beskow:
Prominence detection in Swedish using syllable correlates. 1784-1787 - Na Zhi, Daniel Hirst, Pier Marco Bertinetto:
Automatic analysis of the intonation of a tone language: applying the Momel algorithm to spontaneous standard Chinese (Beijing). 1788-1791 - Raymond W. M. Ng, Cheung-Chi Leung, Ville Hautamäki, Tan Lee, Bin Ma, Haizhou Li:
Towards long-range prosodic attribute modeling for language recognition. 1792-1795 - Robert Schubert, Oliver Jokisch, Diane Hirschfeld:
A modified parameterization of the Fujisaki model. 1796-1799
ASR: Language Modeling and Speech Understanding I
- Saeedeh Momtazi, Friedrich Faubel, Dietrich Klakow:
Within and across sentence boundary language model. 1800-1803 - Ruhi Sarikaya, Stanley F. Chen, Abhinav Sethy, Bhuvana Ramabhadran:
Impact of word classing on shrinkage-based language models. 1804-1807 - Stanislas Oger, Vladimir Popescu, Georges Linarès:
Combination of probabilistic and possibilistic language models. 1808-1811 - Brandon Ballinger, Cyril Allauzen, Alexander Gruenstein, Johan Schalkwyk:
On-demand language model interpolation for mobile speech input. 1812-1815 - Tim Schlippe, Chenfei Zhu, Jan Gebhardt, Tanja Schultz:
Text normalization based on statistical machine translation and internet user support. 1816-1819 - Tanel Alumäe, Mikko Kurimo:
Efficient estimation of maximum entropy language models with n-gram features: an SRILM extension. 1820-1823 - Christian Gillot, Christophe Cerisara, David Langlois, Jean Paul Haton:
Similar n-gram language model. 1824-1827 - Markpong Jongtaveesataporn, Sadaoki Furui:
Topic and style-adapted language modeling for Thai broadcast news ASR. 1828-1831 - Ahmad Emami, Hong-Kwang Jeff Kuo, Imed Zitouni, Lidia Mangu:
Augmented context features for Arabic speech recognition. 1832-1835 - Lucía Ortega, Isabel Galiano, Lluís F. Hurtado, Emilio Sanchis, Encarna Segarra:
A statistical segment-based approach for spoken language understanding. 1836-1839 - Benjamin Lecouteux, Raphaël Rubino, Georges Linarès:
Improving back-off models with bag of words and hollow-grams. 2418-2421 - Ciprian Chelba, Thorsten Brants, Will Neveitt, Peng Xu:
Study on interaction between entropy pruning and Kneser-Ney smoothing. 2422-2425 - Hitoshi Yamamoto, Ken Hanazawa, Kiyokazu Miki, Koichi Shinoda:
Dynamic language model adaptation using keyword category classification. 2426-2429 - Welly Naptali, Masatoshi Tsuchiya, Seiichi Nakagawa:
Integration of cache-based model and topic dependent class model with soft clustering and soft voting. 2430-2433 - Frédéric Duvert, Renato de Mori:
Conditional models for detecting lambda-functions in a spoken language understanding system. 2434-2437 - Md. Akmal Haidar, Douglas D. O'Shaughnessy:
Novel weighting scheme for unsupervised language model adaptation using latent Dirichlet allocation. 2438-2441 - Qun Feng Tan, Kartik Audhkhasi, Panayiotis G. Georgiou, Emil Ettelaie, Shrikanth S. Narayanan:
Automatic speech recognition system channel modeling. 2442-2445 - Takanobu Oba, Takaaki Hori, Atsushi Nakamura:
Round-robin discrimination model for reranking ASR hypotheses. 2446-2449 - Hasim Sak, Murat Saraclar, Tunga Güngör:
On-the-fly lattice rescoring for real-time automatic speech recognition. 2450-2453
First and Second Language Acquisition
- Angela Cooper, Yue Wang:
Cantonese tone word learning by tone and non-tone language speakers. 1840-1843 - Anne Cutler, Janise Shanley:
Validation of a training method for L2 continuous-speech segmentation. 1844-1847 - Jiahong Yuan:
Linguistic rhythm in foreign accent. 1848-1849 - Mee Sonu, Keiichi Tajima, Hiroaki Kato, Yoshinori Sagisaka:
The effect of a word embedded in a sentence and speaking rate variation on the perceptual training of geminate and singleton consonant distinction. 1850-1853 - Chiharu Tsurutani:
Foreign accent matters most when timing is wrong. 1854-1857 - Hyejin Hong, Jina Kim, Minhwa Chung:
Effects of Korean learners' consonant cluster reduction strategies on English speech recognition performance. 1858-1861 - June S. Levitt, William F. Katz:
The effects of EMA-based augmented visual feedback on the English speakers' acquisition of the Japanese flap: a perceptual study. 1862-1865 - Hinako Masuda, Takayuki Arai:
Perception of voiceless fricatives by Japanese listeners of advanced and intermediate level English proficiency. 1866-1869 - Lya Meister, Einar Meister:
Perception of Estonian vowel categories by native and non-native speakers. 1870-1873 - Qin Shi, Kun Li, Shilei Zhang, Stephen M. Chu, Ji Xiao, Zhijian Ou:
Spoken English assessment system for non-native speakers using acoustic and prosodic features. 1874-1877 - Elena E. Lyakso, Olga V. Frolova, Anna V. Kurazhova, Julia S. Gaikova:
Russian infants and children's sounds and speech corpuses for language acquisition studies. 1878-1881 - Julia Monnin, Hélène Loevenbruck:
Language-specific influence on phoneme development: French and Drehu data. 1882-1885 - Jeffrey J. Holliday, Mary E. Beckman, Chanelle Mays:
Did you say susi or shushi? measuring the emergence of robust fricative contrasts in English- and Japanese-acquiring children. 1886-1889
Spoken Language Resources, Systems and Evaluation I, II
- Josef R. Novak, Paul R. Dixon, Sadaoki Furui:
An empirical comparison of the t3, juicer, HDecode and sphinx3 decoders. 1890-1893 - Philip N. Garner, John Dines:
Tracter: a lightweight dataflow framework. 1894-1897 - Marelie H. Davel, Febe de Wet:
Verifying pronunciation dictionaries using conflict analysis. 1898-1901 - Brandon Roy, Soroush Vosoughi, Deb Roy:
Automatic estimation of transcription accuracy and difficulty. 1902-1905 - Benjamin Lambert, Rita Singh, Bhiksha Raj:
Creating a linguistic plausibility dataset with non-expert annotators. 1906-1909 - Xinhui Hu, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Construction and evaluations of an annotated Chinese conversational corpus in travel domain for the language model of speech recognition. 1910-1913 - Thad Hughes, Kaisuke Nakajima, Linne Ha, Atul Vasu, Pedro J. Moreno, Mike LeBeau:
Building transcribed speech corpora quickly and cheaply for many languages. 1914-1917 - Heidi Christensen, Jon Barker, Ning Ma, Phil D. Green:
The CHiME corpus: a resource and a challenge for computational hearing in multisource environments. 1918-1921 - Wen Cao, Dongning Wang, Jinsong Zhang, Ziyu Xiong:
Developing a Chinese L2 speech database of Japanese learners with narrow-phonetic labels for computer assisted pronunciation training. 1922-1925 - Shogo Ishikawa, Shinya Kiriyama, Yoichi Takebayashi, Shigeyoshi Kitazawa:
How children acquire situation understanding skills?: a developmental analysis utilizing multimodal speech behavior corpus. 1926-1929 - Ina Wechsung, Stefan Schaffer, Robert Schleicher, Anja Naumann, Sebastian Möller:
The influence of expertise and efficiency on modality selection strategies and perceived mental effort. 1930-1933 - Christine Kühnel, Benjamin Weiss, Sebastian Möller:
Parameters describing multimodal interaction - definitions and three usage scenarios. 1934-1937 - Alexander Zgorzelski, Alexander Schmitt, Tobias Heinroth, Wolfgang Minker:
Repair strategies on trial: which error recovery do users like best?. 1938-1941 - Maryam Kamvar, Doug Beeferman:
Say what? why users choose to speak their web queries. 1966-1969 - Jonathan Teutenberg, Catherine Inez Watson:
The effect of audience familiarity on the perception of modified accent. 1970-1973 - Korin Richmond, Robert A. J. Clark, Susan Fitt:
On generating combilex pronunciations via morphological analysis. 1974-1977 - Florian Gödde, Sebastian Möller:
Say it as you mean it - analyzing free user comments in the VOICE awards corpus. 1978-1981 - Viktor Rozgic, Bo Xiao, Athanasios Katsamanis, Brian R. Baucom, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
A new multichannel multi modal dyadic interaction database. 1982-1985 - Dau-Cheng Lyu, Tien Ping Tan, Engsiong Chng, Haizhou Li:
SEAME: a Mandarin-English code-switching speech corpus in South-East Asia. 1986-1989
Speech Production: Analysis
- Daniel Felps, Christian Geng, Michael Berger, Korin Richmond, Ricardo Gutierrez-Osuna:
Relying on critical articulators to estimate vocal tract spectra in an articulatory-acoustic database. 1990-1993 - Vikram Ramanarayanan, Dani Byrd, Louis Goldstein, Shrikanth S. Narayanan:
Investigating articulatory setting - pauses, ready position, and rest - using real-time MRI. 1994-1997 - Chao Qin, Miguel Á. Carreira-Perpiñán:
Articulatory inversion of American English /ɹ/ by conditional density modes. 1998-2001 - Atef Ben Youssef, Pierre Badin, Gérard Bailly:
Can tongue be recovered from face? the answer of data-driven statistical models. 2002-2005 - Francisco Torreira, Mirjam Ernestus:
Phrase-medial vowel devoicing in spontaneous French. 2006-2009 - Chierh Cheng, Yi Xu, Michele Gubian:
Exploring the mechanism of tonal contraction in Taiwan Mandarin. 2010-2013
Paralanguage Cognition
- Benjamin Weiss, Felix Burkhardt:
Voice attributes affecting likability perception. 2014-2017 - Kristiina Jokinen, Kazuaki Harada, Masafumi Nishida, Seiichi Yamamoto:
Turn-alignment using eye-gaze and speech in conversational interaction. 2018-2021 - Tet Fei Yap, Julien Epps, Eliathamby Ambikairajah, Eric H. C. Choi:
An investigation of formant frequencies for cognitive load classification. 2022-2025 - Martijn Goudbeek, Mirjam Broersma:
Language specific effects of emotion on phoneme duration. 2026-2029 - Matthew Black, Athanasios Katsamanis, Chi-Chun Lee, Adam C. Lammert, Brian R. Baucom, Andrew Christensen, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Automatic classification of married couples' behavior using audio features. 2030-2033 - Gideon Kowadlo, Patrick Ye, Ingrid Zukerman:
Influence of gestural salience on the interpretation of spoken requests. 2034-2037
Robust ASR Against Noise
- Vikramjit Mitra, Hosung Nam, Carol Y. Espy-Wilson, Elliot Saltzman, Louis Goldstein:
Robust word recognition using articulatory trajectories and gestures. 2038-2041 - Takeshi Yamada, Tomohiro Nakajima, Nobuhiko Kitawaki, Shoji Makino:
Performance estimation of noisy speech recognition considering recognition task complexity. 2042-2045 - Friedrich Faubel, Dietrich Klakow:
Estimating noise from noisy speech features with a monte carlo variant of the expectation maximization algorithm. 2046-2049 - Satoshi Tamura, Eriko Hishikawa, Wataru Taguchi, Satoru Hayamizu:
Template-based spectral estimation using microphone array for speech recognition. 2050-2053 - Aleem Mushtaq, Yu Tsao, Chin-Hui Lee:
A particle filter feature compensation approach to robust speech recognition. 2054-2057 - Chanwoo Kim, Richard M. Stern:
Nonlinear enhancement of onset for robust speech recognition. 2058-2061 - Shirin Badiezadegan, Richard C. Rose:
Mask estimation in non-stationary noise environments for missing feature based robust speech recognition. 2062-2065 - Lae-Hoon Kim, Kyung-Tae Kim, Mark Hasegawa-Johnson:
Robust automatic speech recognition with decoder oriented ideal binary mask estimation. 2066-2069 - Gökhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Hiroshi Tsujino, Jun-ichi Imura:
A robust speech recognition system against the ego noise of a robot. 2070-2073 - Kuo-Hao Wu, Chia-Ping Chen:
Empirical mode decomposition for noise-robust automatic speech recognition. 2074-2077 - Wooil Kim, Jun-Won Suh, John H. L. Hansen:
An effective feature compensation scheme tightly matched with speech recognizer employing SVM-based GMM generation. 2078-2081 - Jort F. Gemmeke, Tuomas Virtanen:
Artificial and online acquired noise dictionaries for noise robust ASR. 2082-2085 - Akira Saito, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Voice activity detection based on conditional random fields using multiple features. 2086-2089 - Yong Zhao, Biing-Hwang Juang:
A comparative study of noise estimation algorithms for VTS-based robust speech recognition. 2090-2093 - Frank Seide, Pei Zhao:
On using missing-feature theory with cepstral features - approximations to the multivariate integral. 2094-2097 - Yang Sun, Jort F. Gemmeke, Bert Cranen, Louis ten Bosch, Lou Boves:
Using a DBN to integrate sparse classification and GMM-based ASR. 2098-2101
Voice Conversion and Speech Synthesis
- Axel Röbel:
Shape-invariant speech transformation with the phase vocoder. 2146-2149 - Kayoko Yanagisawa, Mark A. Huckvale:
A phonetic alternative to cross-language voice conversion in a text-dependent context: evaluation of speaker identity. 2150-2153 - Esther Klabbers, Alexander Kain, Jan P. H. van Santen:
Evaluation of speaker mimic technology for personalizing SGD voices. 2154-2157 - Kumi Ohta, Tomoki Toda, Yamato Ohtani, Hiroshi Saruwatari, Kiyohiro Shikano:
Adaptive voice-quality control based on one-to-many eigenvoice conversion. 2158-2161 - Fernando Villavicencio, Jordi Bonada:
Applying voice conversion to concatenative singing-voice synthesis. 2162-2165 - Miaomiao Wang, Miaomiao Wen, Keikichi Hirose, Nobuaki Minematsu:
Improved generation of fundamental frequency in HMM-based speech synthesis using generation process model. 2166-2169 - Ming Lei, Yi-Jian Wu, Frank K. Soong, Zhen-Hua Ling, Li-Rong Dai:
A hierarchical F0 modeling method for HMM-based speech synthesis. 2170-2173 - Javier Latorre, Mark J. F. Gales, Heiga Zen:
Training a parametric-based logF0 model with the minimum generation error criterion. 2174-2177 - Miaomiao Wen, Miaomiao Wang, Keikichi Hirose, Nobuaki Minematsu:
Improving Mandarin segmental duration prediction with automatically extracted syntax features. 2178-2181 - Daniel R. van Niekerk, Etienne Barnard:
An intonation model for TTS in Sepedi. 2182-2185 - Michael Pucher, Dietmar Schabus, Junichi Yamagishi:
Synthesis of fast speech with interpolation of adapted HSMMs and its evaluation by blind and sighted listeners. 2186-2189 - Gabriel Webster, Sacha Krstulovic, Kate M. Knill:
A comparison of pronunciation modeling approaches for HMM-TTS. 2190-2193 - Zhen-Hua Ling, Korin Richmond, Junichi Yamagishi:
HMM-based text-to-articulatory-movement prediction and analysis of critical articulators. 2194-2197
Detection, Classification, and Segmentation
- Jiaxing Ye, Takumi Kobayashi, Tetsuya Higuchi:
Audio-based sports highlight detection by Fourier local auto-correlations. 2198-2201 - Hynek Boril, Abhijeet Sangwan, Taufiq Hasan, John H. L. Hansen:
Automatic excitement-level detection for sports highlights generation. 2202-2205 - Jörg-Hendrik Bach, Jörn Anemüller:
Detecting novel objects in acoustic scenes through classifier incongruence. 2206-2209 - Stavros Ntalampiras, Ilyas Potamitis, Nikos Fakotakis:
A multidomain approach for automatic home environmental sound classification. 2210-2213 - Patrick Cardinal, Vishwa Gupta, Gilles Boulianne:
Content-based advertisement detection. 2214-2217 - Stavros Ntalampiras, Ilyas Potamitis, Nikos Fakotakis:
Identification of abnormal audio events based on probabilistic novelty detection. 2218-2221 - Norbert Braunschweiler, Mark J. F. Gales, Sabine Buchholz:
Lightly supervised recognition for automatic alignment of large coherent speech recordings. 2222-2225 - Oshry Ben-Harush, Itshak Lapidot, Hugo Guterman:
Incremental diarization of telephone conversations. 2226-2229 - Srikanth Cherla, V. Ramasubramanian:
Audio analytics by template modeling and 1-pass DP based decoding. 2230-2233 - Mariusz Ziólko, Jakub Galka, Bartosz Ziólko, Tomasz Drwiega:
Perceptual wavelet decomposition for speech segmentation. 2234-2237 - Venkatesh Keri, Kishore Prahallad:
A comparative study of constrained and unconstrained approaches for segmentation of speech signal. 2238-2241 - Morgan Sonderegger, Joseph Keshet:
Automatic discriminative measurement of voice onset time. 2242-2245 - Yi Ren Leng, Tran Huy Dat, Norihide Kitaoka, Haizhou Li:
Selective gammatone filterbank feature for robust sound event recognition. 2246-2249
Compressive Sensing for Speech and Language Processing (Special Session)
- Allen Y. Yang, Zihan Zhou, Yi Ma, Shankar Sastry:
Towards a robust face recognition system using compressive sensing. 2250-2253 - Tara N. Sainath, Bhuvana Ramabhadran, David Nahamoo, Dimitri Kanevsky, Abhinav Sethy:
Sparse representation features for speech recognition. 2254-2257 - Abhinav Sethy, Tara N. Sainath, Bhuvana Ramabhadran, Dimitri Kanevsky:
Data selection for language modeling using sparse representations. 2258-2261 - Jort F. Gemmeke, Ulpu Remes, Kalle J. Palomäki:
Observation uncertainty measures for sparse imputation. 2262-2265 - Tara N. Sainath, Sameer Maskey, Dimitri Kanevsky, Bhuvana Ramabhadran, David Nahamoo, Julia Hirschberg:
Sparse representations for text categorization. 2266-2269 - Garimella S. V. S. Sivaram, Sriram Ganapathy, Hynek Hermansky:
Sparse auto-associative neural networks: theory and application to speech recognition. 2270-2273
ASR: Lexical and Pronunciation Modeling
- Chi Hu, Xiaodan Zhuang, Mark Hasegawa-Johnson:
FSM-based pronunciation modeling using articulatory phonological code. 2274-2277 - Denis Jouvet, Dominique Fohr, Irina Illina:
Detailed pronunciation variant modeling for speech transcription. 2278-2281 - Line Adde, Bert Réveil, Jean-Pierre Martens, Torbjørn Svendsen:
A minimum classification error approach to pronunciation variation modeling of non-native proper names. 2282-2285 - Antoine Laurent, Sylvain Meignier, Téva Merlin, Paul Deléglise:
Acoustics-based phonetic transcription method for proper nouns. 2286-2289 - Tim Schlippe, Sebastian Ochs, Tanja Schultz:
Wiktionary as a source for automatic pronunciation extraction. 2290-2293 - Ibrahim Badr, Ian McGraw, James R. Glass:
Learning new word pronunciations from spoken examples. 2294-2297
Speaker Recognition and Diarization
- I-Fan Chen, Shih-Sian Cheng, Hsin-Min Wang:
Phonetic subspace mixture model for speaker diarization. 2298-2301 - Martin Zelenák, Carlos Segura, Javier Hernando:
Overlap detection for speaker diarization by fusing spectral and spatial features. 2302-2305 - Alfred Dielmann, Giulia Garau, Hervé Bourlard:
Floor holder detection and end of speaker turn prediction in meetings. 2306-2309 - Carlos Vaquero, Alfonso Ortega, Jesús Antonio Villalba López, Antonio Miguel, Eduardo Lleida:
Confidence measures for speaker segmentation and their relation to speaker verification. 2310-2313 - Anthony Larcher, Christophe Lévy, Driss Matrouf, Jean-François Bonastre:
Decoupling session variability modelling and speaker characterisation. 2314-2317 - Cheung-Chi Leung, Donglai Zhu, Kong-Aik Lee, Bin Ma, Haizhou Li:
Incorporating MAP estimation and covariance transform for SVM based speaker recognition. 2318-2321
Speech and Audio Classification
- Stéphane Rossignol, Olivier Pietquin:
Single-speaker/multi-speaker co-channel speech classification. 2322-2325 - Oriol Vinyals, Gerald Friedland, Nelson Morgan:
Discriminative training for hierarchical clustering in speaker diarization. 2326-2329 - Jürgen T. Geiger, Frank Wallhoff, Gerhard Rigoll:
GMM-UBM based open-set online speaker diarization. 2330-2333 - Ladan Golipour, Douglas D. O'Shaughnessy:
A segment-based non-parametric approach for monophone recognition. 2334-2337 - Taras Butko, Climent Nadeu:
A fast one-pass-training feature selection technique for GMM-based acoustic event detection with audio-visual data. 2338-2341 - Nobuhide Yamakawa, Tetsuro Kitahara, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Effects of modelling within- and between-frame temporal variations in power spectra on non-verbal sound recognition. 2342-2345
Emotion Recognition
- Ling He, Margaret Lech, Nicholas B. Allen:
On the importance of glottal flow spectral energy for the recognition of emotions in speech. 2346-2349 - Laurence Devillers, Christophe Vaudable, Clément Chastagnol:
Real-life emotion-related states detection in call centers: a cross-corpora study. 2350-2353 - Ali Hassan, Robert I. Damper:
Multi-class and hierarchical SVMs for emotion recognition. 2354-2357 - David Philippou-Hübner, Bogdan Vlasenko, Tobias Grosser, Andreas Wendemuth:
Determining optimal features for emotion recognition from speech by applying an evolutionary algorithm. 2358-2361 - Martin Wöllmer, Angeliki Metallinou, Florian Eyben, Björn W. Schuller, Shrikanth S. Narayanan:
Context-sensitive multimodal emotion recognition from speech and facial expression using bidirectional LSTM modeling. 2362-2365 - Kartik Audhkhasi, Shrikanth S. Narayanan:
Data-dependent evaluator modeling and its application to emotional valence classification from speech. 2366-2369
Speech Coding, Modeling, and Transmission
- Zhanyu Ma, Arne Leijon:
Modelling speech line spectral frequencies with dirichlet mixture models. 2370-2373 - Zhanyu Ma, Arne Leijon:
PDF-optimized LSF vector quantization based on beta mixture models. 2374-2377 - José Enrique García Laínez, Alfonso Ortega, Antonio Miguel, Eduardo Lleida:
Non-linear predictive vector quantization of feature vectors for distributed speech recognition. 2378-2381 - Lasse Laaksonen, Mikko Tammi, Vladimir Malenovsky, Tommy Vaillancourt, Mi Suk Lee, Tomofumi Yamanashi, Masahiro Oshikiri, Claude Lamblin, Balázs Kövesi, Lei Miao, Deming Zhang, Jon Gibbs, Holly Francois:
Superwideband extension of G.718 and G.729.1 speech codecs. 2382-2385 - José L. Carmona, Angel M. Gomez, Antonio M. Peinado, José L. Pérez-Córdoba, José A. González:
A multipulse FEC scheme based on amplitude estimation for CELP codecs over packet networks. 2386-2389 - Anssi Rämö, Henri Toukomaa:
Voice quality evaluation of recent open source codecs. 2390-2393 - Bengt J. Borgström, Per Henrik Borgström, Abeer Alwan:
Efficient HMM-based estimation of missing features, with applications to packet loss concealment. 2394-2397 - Xiaoqiang Xiao, Robert M. Nickel:
Speech inventory based discriminative training for joint speech enhancement and low-rate speech coding. 2398-2401 - Qipeng Gong, Peter Kabal:
Quality-based playout buffering with FEC for conversational VoIP. 2402-2405 - Masatsune Tamura, Takehiko Kagoshima, Masami Akamine:
Sub-band basis spectrum model for pitch-synchronous log-spectrum and phase based on approximation of sparse coding. 2406-2409 - Sundar Harshavardhan, Chandra Sekhar Seelamantula, Thippur V. Sreenivas:
A multimodal density function estimation approach to formant tracking. 2410-2413 - Heikki Rasilo, Unto K. Laine, Okko Johannes Räsänen:
Estimation studies of vocal tract shape trajectory using a variable length and lossy Kelly-Lochbaum model. 2414-2417
Speech Perception: Processing and Intelligibility
- Serajul Haque, Roberto Togneri:
A feature extraction method for automatic speech recognition based on the cochlear nucleus. 2454-2457 - Samuel Thomas, Kailash Patil, Sriram Ganapathy, Nima Mesgarani, Hynek Hermansky:
A phoneme recognition framework based on auditory spectro-temporal receptive fields. 2458-2461 - Amy V. Beeston, Guy J. Brown:
Perceptual compensation for effects of reverberation in speech identification: a computer model based on auditory efferent processing. 2462-2465 - Barbara Schuppler, Mirjam Ernestus, Wim A. van Dommelen, Jacques C. Koreman:
Predicting human perception and ASR classification of word-final [t] by its acoustic sub-segmental properties. 2466-2469 - Matthew Robertson, Guy J. Brown, Wendy Lecluyse, Manasa Panda, Christine M. Tan:
A speech-in-noise test based on spoken digits: comparison of normal and impaired listeners using a computer model. 2470-2473 - Takayuki Kagomiya, Seiji Nakagawa:
Evaluation of bone-conducted ultrasonic hearing-aid regarding transmission of paralinguistic information: a comparison with cochlear implant simulator. 2474-2477 - Tim Jürgens, Stefan Fredelake, Ralf M. Meyer, Birger Kollmeier, Thomas Brand:
Challenging the speech intelligibility index: macroscopic vs. microscopic prediction of sentence recognition in normal and hearing-impaired listeners. 2478-2481 - Verena N. Uslar, Thomas Brand, Mirko Hanke, Rebecca Carroll, Esther Ruigendijk, Cornelia Hamann, Birger Kollmeier:
Does sentence complexity interfere with intelligibility in noise? Evaluation of the Oldenburg linguistically and audiologically controlled sentence test (OLACS). 2482-2485 - Juan-Pablo Ramirez, Hamed Ketabdar, Alexander Raake:
Intelligibility predictions for speech against fluctuating masker. 2486-2489 - Masashi Ito, Keiji Ohara, Akinori Ito, Masafumi Yano:
An effect of formant amplitude in vowel perception. 2490-2493 - Christopher I. Petkov, Benjamin Wilson:
Functional imaging of brain regions sensitive to communication sounds in primates. 2494-2497
Spoken Language Understanding and Spoken Language Translation I, II
- Ye-Yi Wang:
Strategies for statistical spoken language understanding with small amount of data - an empirical study. 2498-2501 - Bassam Jabaian, Laurent Besacier, Fabrice Lefèvre:
Investigating multiple approaches for SLU portability to a new language. 2502-2505 - Anja Austermann, Seiji Yamada, Kotaro Funakoshi, Mikio Nakano:
Learning naturally spoken commands for a robot. 2506-2509 - Amparo Albalate, Aparna Suchindranath, David Suendermann, Wolfgang Minker:
A semi-supervised cluster-and-label approach for utterance classification. 2510-2513 - Silvia Quarteroni, Giuseppe Riccardi:
Classifying dialog acts in human-human and human-machine spoken conversations. 2514-2517 - Fei Liu, Yang Liu:
Exploring speaker characteristics for meeting summarization. 2518-2521 - Shasha Xie, Hui Lin, Yang Liu:
Semi-supervised extractive speech summarization via co-training algorithm. 2522-2525 - Asli Celikyilmaz, Dilek Hakkani-Tür:
Extractive summarization using a latent variable model. 2526-2529 - Emil Ettelaie, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Hierarchical classification for speech-to-speech translation. 2530-2533 - Matthias Paulik, Alex Waibel:
Rapid development of speech translation using consecutive interpretation. 2534-2537 - Sameer Maskey, Steven J. Rennie, Bowen Zhou:
Combining many alignments for speech to speech translation. 2538-2541 - Pierre Gotab, Géraldine Damnati, Frédéric Béchet, Lionel Delphin-Poulat:
Online SLU model adaptation with a partial oracle. 2862-2865 - Om Deshmukh, Harish Doddala, Ashish Verma, Karthik Visweswariah:
Role of language models in spoken fluency evaluation. 2866-2869 - Sibel Yaman, Dilek Hakkani-Tür, Gökhan Tür:
Social role discovery from spoken language using dynamic Bayesian networks. 2870-2873 - Michelle Hewlett Sanchez, Gökhan Tür, Luciana Ferrer, Dilek Hakkani-Tür:
Domain adaptation and compensation for emotion detection. 2874-2877 - Sankaranarayanan Ananthakrishnan, Rohit Prasad, Prem Natarajan:
Phrase alignment confidence for statistical machine translation. 2878-2881 - Ian R. Lane, Alex Waibel:
Named-entity projection and data-driven morphological decomposition for field maintainable speech-to-speech translation systems. 2882-2885
Social Signals in Speech (Special Session)
- Paul M. Brunet, Marcela Charfuelan, Roderick Cowie, Marc Schröder, Hastings Donnan, Ellen Douglas-Cowie:
Detecting politeness and efficiency in a cooperative social interaction. 2542-2545 - Nick Campbell, Stefan Scherer:
Comparing measures of synchrony and alignment in dialogue speech timing with respect to turn-taking activity. 2546-2549 - Emina Kurtic, Guy J. Brown, Bill Wells:
Resources for turn competition in overlap in multi-party conversations: speech rate, pausing and duration. 2550-2553 - Khiet P. Truong, Dirk Heylen:
Disambiguating the functions of conversational sounds with prosody: the case of 'yeah'. 2554-2557 - Marcela Charfuelan, Marc Schröder, Ingmar Steiner:
Prosody and voice quality of vocal social signals: the case of dominance in scenario meetings. 2558-2561 - Daniel Neiberg, Joakim Gustafson:
The prosody of Swedish conversational grunts. 2562-2565
Physiology and Pathology of Spoken Language
- Christophe Mertens, Francis Grenez, Lise Crevier-Buchman, Jean Schoentgen:
Reliable tracking based on speech sample salience of vocal cycle length perturbations. 2566-2569 - Hideki Kasuya, Hajime Yoshida, Satoshi Ebihara, Hiroki Mori:
Longitudinal changes of selected voice source parameters. 2570-2573 - Ali Alpan, Jean Schoentgen, Youri Maryn, Francis Grenez:
Automatic perceptual categorization of disordered connected speech. 2574-2577 - Heejin Kim, Panying Rong, Torrey M. Loucks, Mark Hasegawa-Johnson:
Kinematic analysis of tongue movement control in spastic dysarthria. 2578-2581 - Irene Jacobi, Lisette van der Molen, Maya van Rossum, Frans J. M. Hilgers:
Pre- and short-term posttreatment vocal functioning in patients with advanced head and neck cancer treated with concomitant chemoradiotherapy. 2582-2585 - Joan K. Y. Ma, Rüdiger Hoffmann:
Acoustic analysis of intonation in Parkinson's disease. 2586-2589
Speaker Diarization
- Carlos Vaquero, Oriol Vinyals, Gerald Friedland:
A hybrid approach to online speaker diarization. 2638-2641 - Simon Bozonnet, Nicholas W. D. Evans, Xavier Anguera, Oriol Vinyals, Gerald Friedland, Corinne Fredouille:
System output combination for improved speaker diarization. 2642-2645 - Simon Bozonnet, Nicholas W. D. Evans, Corinne Fredouille, Dong Wang, Raphaël Troncy:
An integrated top-down/bottom-up approach to speaker diarization. 2646-2649 - Deepu Vijayasenan, Fabio Valente, Hervé Bourlard:
Advances in fast multistream diarization based on the information bottleneck framework. 2650-2653 - Giulia Garau, Alfred Dielmann, Hervé Bourlard:
Audio-visual synchronisation for speaker diarisation. 2654-2657 - Kyu Jeong Han, Shrikanth S. Narayanan:
An improved cluster model selection method for agglomerative hierarchical speaker clustering using incremental Gaussian mixture models. 2658-2661 - Nigel G. Ward, Olac Fuentes, Alejandro Vega:
Dialog prediction for a general model of turn-taking. 2662-2665 - Tobias Herbig, Franz Gerl, Wolfgang Minker:
Speaker tracking in an unsupervised speech controlled system. 2666-2669 - Paula Lopez-Otero, Laura Docío Fernández, Carmen García-Mateo:
MultiBIC: an improved speaker segmentation technique for TV shows. 2670-2673
Multi-Modal ASR, Including Audio-Visual ASR
- John-Paul Hosom, Tom Jakobs, Allen Baker, Susan Fager:
Automatic speech recognition for assistive writing in speech supplemented word prediction. 2674-2677 - Alexey Karpov, Andrey Ronzhin, Konstantin Markov, Milos Zelezný:
Viseme-dependent weight optimization for CHMM-based audio-visual speech recognition. 2678-2681 - Louis H. Terry, Karen Livescu, Janet B. Pierrehumbert, Aggelos K. Katsaggelos:
Audio-visual anticipatory coarticulation modeling by human and machine. 2682-2685 - Matthias Janke, Michael Wand, Tanja Schultz:
Impact of lack of acoustic feedback in EMG-based silent speech recognition. 2686-2689 - Chong-Jia Ni, Wenju Liu, Bo Xu:
Using prosody to improve Mandarin automatic speech recognition. 2690-2693 - Satoshi Tamura, Masato Ishikawa, Takashi Hashiba, Shin'ichi Takeuchi, Satoru Hayamizu:
A robust audio-visual speech recognition using audio-visual voice activity detection. 2694-2697 - Dorothea Kolossa, Jike Chong, Steffen Zeiler, Kurt Keutzer:
Efficient manycore CHMM speech recognition for audiovisual and multistream data. 2698-2701 - Takami Yoshida, Kazuhiro Nakadai:
Two-layered audio-visual integration in voice activity detection and automatic speech recognition for robots. 2702-2705 - Panikos Heracleous, Norihiro Hagita:
Non-audible murmur recognition based on fusion of audio and visual streams. 2706-2709
Speaker and Language Recognition
- Mohamed Faouzi BenZeghiba, Jean-Luc Gauvain, Lori Lamel:
Improved n-gram phonotactic models for language recognition. 2710-2713 - Sirinoot Boonsuk, Donglai Zhu, Bin Ma, Atiwong Suchato, Proadpran Punyabukkana, Nattanun Thatphithakkul, Chai Wutiwiwatchai:
A study of term weighting in phonotactic approach to spoken language recognition. 2714-2717 - Sabato Marco Siniscalchi, Jeremy Reed, Torbjørn Svendsen, Chin-Hui Lee:
Exploiting context-dependency and acoustic resolution of universal speech attribute models in spoken language recognition. 2718-2721 - David Imseng, Mathew Magimai-Doss, Hervé Bourlard:
Hierarchical multilayer perceptron based language identification. 2722-2725 - Alvin F. Martin, Craig S. Greenberg:
The NIST 2010 speaker recognition evaluation. 2726-2729 - Shih-Sian Cheng, I-Fan Chen, Hsin-Min Wang:
Bayesian speaker recognition using Gaussian mixture model and Laplace approximation. 2730-2733 - Tomi Kinnunen, Rahim Saeidi, Johan Sandberg, Maria Hansson-Sandsten:
What else is new than the Hamming window? Robust MFCCs for speaker recognition via multitapering. 2734-2737 - Achintya Kumar Sarkar, Srinivasan Umesh:
Fast computation of speaker characterization vector using MLLR and sufficient statistics in anchor model framework. 2738-2741 - Zahi N. Karam, William M. Campbell:
Graph-embedding for speaker recognition. 2742-2745 - Chang Huai You, Haizhou Li, Kong-Aik Lee:
A hybrid modeling strategy for GMM-SVM speaker recognition with adaptive relevance factor. 2746-2749 - Sundar Harshavardhan, Thippur V. Sreenivas:
Robust mixture modeling using t-distribution: application to speaker ID. 2750-2753 - Chi-Sang Jung, Kyu Jeong Han, Hyunson Seo, Shrikanth S. Narayanan, Hong-Goo Kang:
A variable frame length and rate algorithm based on the spectral kurtosis measure for speaker verification. 2754-2757
Source Localization and Separation
- Kohei Hayashida, Masanori Morise, Takanobu Nishiura:
Near field sound source localization based on cross-power spectrum phase analysis with multiple microphones. 2758-2761 - Jinho Choi, Chang D. Yoo:
A maximum a posteriori sound source localization in reverberant and noisy conditions. 2762-2765 - Tomohiro Nakatani, Shoko Araki, Takuya Yoshioka, Masakiyo Fujimoto:
Multichannel source separation based on source location cue with log-spectral shaping by hidden Markov source model. 2766-2769 - Duc Thanh Chau, Junfeng Li, Masato Akagi:
A DOA estimation algorithm based on equalization-cancellation theory. 2770-2773 - Tania Habib, Harald Romsdorfer:
Concurrent speaker localization using multi-band position-pitch (M-PoPi) algorithm with spectro-temporal pre-processing. 2774-2777 - Ji-Hyun Song, Kyu-Ho Lee, Yun-Sik Park, Sang-Ick Kang, Joon-Hyuk Chang:
On using Gaussian mixture model for double-talk detection in acoustic echo suppression. 2778-2781 - Cemil Demir, A. Taylan Cemgil, Murat Saraclar:
Catalog-based single-channel speech-music separation. 2782-2785 - Ke Hu, DeLiang Wang:
Unvoiced speech segregation based on CASA and spectral subtraction. 2786-2789 - Ke Hu, DeLiang Wang:
Unsupervised sequential organization for cochannel speech separation. 2790-2793
INTERSPEECH 2010 Paralinguistic Challenge (Special Session)
- Björn W. Schuller, Stefan Steidl, Anton Batliner, Felix Burkhardt, Laurence Devillers, Christian A. Müller, Shrikanth S. Narayanan:
The INTERSPEECH 2010 paralinguistic challenge. 2794-2797 - Florian Lingenfelser, Johannes Wagner, Thurid Vogt, Jonghwa Kim, Elisabeth André:
Age and gender classification from speech using decision level fusion and ensemble based techniques. 2798-2801 - Je Hun Jeon, Rui Xia, Yang Liu:
Level of interest sensing in spoken dialog using multi-level fusion of acoustic and lexical evidence. 2802-2805 - Phuoc Nguyen, Trung Le, Dat Tran, Xu Huang, Dharmendra Sharma:
Fuzzy support vector machines for age and gender classification. 2806-2809 - Rok Gajsek, Janez Zibert, Tadej Justin, Vitomir Struc, Bostjan Vesnicer, France Mihelic:
Gender and affect recognition based on GMM and GMM-UBM modeling with relevance MAP estimation. 2810-2813 - Royi Porat, Dan Lange, Yaniv Zigel:
Age recognition based on speech signals using weights supervector. 2814-2817 - Hugo Meinedo, Isabel Trancoso:
Age and gender classification using fusion of acoustic and prosodic features. 2818-2821 - Marcel Kockmann, Lukás Burget, Jan Cernocký:
Brno University of Technology system for INTERSPEECH 2010 paralinguistic challenge. 2822-2825 - Ming Li, Chi-Sang Jung, Kyu Jeong Han:
Combining five acoustic level modeling methods for automatic speaker age and gender recognition. 2826-2829 - Tobias Bocklet, Georg Stemmer, Viktor Zeißler, Elmar Nöth:
Age and gender recognition based on multiple systems - early vs. late fusion. 2830-2833 - Michael Feld, Felix Burkhardt, Christian A. Müller:
Automatic speaker age and gender recognition in the car for tailoring dialog and mobile services. 2834-2837
Signal Processing for Music and Song
- Kiyoaki Aikawa, Junko Uenuma, Tomoko Akitake:
Acoustic correlates of voice quality improvement by voice training. 2886-2889 - Minghui Dong, Paul Y. Chan, Ling Cen, Haizhou Li, Jason Teo, Ping Jen Kua:
Phonetic segmentation of singing voice using MIDI and parallel speech. 2890-2893 - Keijiro Saino, Makoto Tachibana, Hideki Kenmochi:
A singing style modeling system for singing voice synthesizers. 2894-2897 - Jingzhou Yang, Jia Liu, Weiqiang Zhang:
A fast query by humming system based on notes. 2898-2901 - Seokhwan Jo, Sihyun Joo, Chang D. Yoo:
Melody pitch estimation based on range estimation and candidate extraction using harmonic structure model. 2902-2905 - Jihoon Park, Kwang-Ki Kim, Jeongil Seo, Minsoo Hahn:
Modified spatial audio object coding scheme with harmonic extraction and elimination structure for interactive audio service. 2906-2909
Modeling First Language Acquisition
- Christina Bergmann, Michele Gubian, Lou Boves:
Modelling the effect of speaker familiarity and noise on infant word recognition. 2910-2913 - Kouki Miyazawa, Hideaki Kikuchi, Reiko Mazuka:
Unsupervised learning of vowels from continuous speech based on self-organized phoneme acquisition model. 2914-2917 - Andrew R. Plummer, Mary E. Beckman, Mikhail Belkin, Eric Fosler-Lussier, Benjamin Munson:
Learning speaker normalization using semisupervised manifold alignment. 2918-2921 - Okko Johannes Räsänen:
Fully unsupervised word learning from continuous speech using transitional probabilities of atomic acoustic events. 2922-2925 - Louis ten Bosch, Lou Boves:
Language acquisition and cross-modal associations: computational simulation of the result of infant studies. 2926-2929 - Maarten Versteegh, Louis ten Bosch, Lou Boves:
Active word learning under uncertain input conditions. 2930-2933
Discourse and Dialogue
- Rémi Lavalley, Chloé Clavel, Patrice Bellot, Marc El-Bèze:
Combining text categorization and dialog modeling for speaker role identification on call center conversations. 3062-3065 - Akira Nakamura, Satoru Hayamizu:
Topic-dependent n-gram models based on optimization of context lengths in LDA. 3066-3069 - Nicolas Obin, Volker Dellwo, Anne Lacheret, Xavier Rodet:
Expectations for discourse genre identification: a prosodic study. 3070-3073 - Ramón Granell, Stephen G. Pulman, Carlos D. Martínez-Hinarejos, José-Miguel Benedí:
Dialogue act tagging and segmentation with a single perceptron. 3074-3077 - Yasuhisa Fujii, Kazumasa Yamamoto, Seiichi Nakagawa:
Improving the readability of class lecture ASR results using a confusion network. 3078-3081
Voice Activity and Turn Detection
- Sang-Kyun Kim, Jae-Hun Choi, Sang-Ick Kang, Ji-Hyun Song, Joon-Hyuk Chang:
Toward detecting voice activity employing soft decision in second-order conditional MAP. 3082-3085 - Xugang Lu, Masashi Unoki, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Voice activity detection in a regularized reproducing kernel Hilbert space. 3086-3089 - Ji Wu, Xiao-Lei Zhang, Wei Li:
A new VAD framework using statistical model and human knowledge based empirical rule. 3090-3093 - Mark C. Huggins, Brett Y. Smolenski, Aaron D. Lawson:
Adaptive high accuracy approaches to speech activity detection in noisy and hostile audio environments. 3094-3097 - Prasanta Kumar Ghosh, Andreas Tsiartas, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Robust voice activity detection in stereo recording with crosstalk. 3098-3101 - Masakiyo Fujimoto, Shinji Watanabe, Tomohiro Nakatani:
Voice activity detection using frame-wise model re-estimation method based on Gaussian pruning with weight normalization. 3102-3105 - Bowon Lee, Debargha Muhkerjee:
Spectral entropy-based voice activity detector for videoconferencing systems. 3106-3109 - David Dean, Sridha Sridharan, Robert Vogt, Michael Mason:
The QUT-NOISE-TIMIT corpus for the evaluation of voice activity detection algorithms. 3110-3113 - Tao Yu, John H. L. Hansen:
A Bayesian approach to voice activity detection using multiple statistical models and discriminative training. 3114-3117 - Houman Ghaemmaghami, Brendan Baker, Robert Vogt, Sridha Sridharan:
Noise robust voice activity detection using features extracted from the time-domain autocorrelation function. 3118-3121 - Tasuku Oonishi, Koji Iwano, Sadaoki Furui:
VAD-measure-embedded decoder with online model adaptation. 3122-3125 - Shiwen Deng, Jiqing Han:
Robust statistical voice activity detection using a likelihood ratio sign test. 3126-3129 - Alexei V. Ivanov, Giuseppe Riccardi:
Automatic turn segmentation in spoken conversations. 3130-3133 - Yohei Kawaguchi, Masahito Togami, Yasunari Obuchi:
Turn taking-based conversation detection by using DOA estimation. 3134-3137