default search action
EUROSPEECH 1995: Madrid, Spain
- Fourth European Conference on Speech Communication and Technology, EUROSPEECH 1995, Madrid, Spain, September 18-21, 1995. ISCA 1995
Keynotes
- Kenneth N. Stevens:
Applying phonetic knowledge to lexical access. 3-14 - William A. Ainsworth:
Auditory mechanisms for speech perception. 171-178 - Hervé Bourlard:
Towards increasing speech recognition error rates. 883-894 - Sadaoki Furui:
Flexible speech recognition. 1595-1604
Prosody Modelling in ASR
- Takatoshi Jitsuhiro, Tomokazu Yamada, Shigeki Sagayama:
Syllabic duration control for vocabulary-free speech recognition. 15-18 - Kazuyuki Takagi, Shuichi Itahashi:
Effectiveness of pause information in the content word detection of spoken dialogues. 19-22 - Kazuhiro Kondo:
Connected Japanese digit recognition with pitch accent-dependent models. 23-26 - Claude Barras, Marie-José Caraty, Claude Montacié:
Temporal control and training selection for HMM-based system. 27-30 - Keikichi Hirose, Xinhui Hu:
HMM-based tone recognition of Chinese trisyllables using double codebooks on fundamental frequency and waveform power. 31-34
Wideband Coding
- C. Murgia, Gang Feng, Catherine Quinquis, Alain Le Guyader:
Very low delay and high quality coding of 20 hz -15 khz speech at 64 kbit/S. 37-40 - Shigeaki Sasaki, Akitoshi Kataoka, Takehiro Moriya:
Wideband CELP coder at 16-kbit/s with 10-ms frame. 41-44 - A. W. Black, Ian A. Atkinson, Ahmet M. Kondoz, Barry G. Evans:
High quality 14.1kb/s wideband speech coder. 45-48 - Victoria Abreu-Sernández, Domingo Docampo-Amoedo:
A multipulse-deconvolution codec for wideband speech. 49-52
Hardware and Systems for Speech Processing
- E. Rohwer:
An advanced multi-DSP platform for speech technology integration in computer telephony applications. 58-62 - Rafael Ciria, Rafael Sarmiento de Sotomayor, Cristina Aguila, José Parera, Juan Santos:
Voice processing architecture for computer-telephony integration. 63-66 - Laura Ortiz-Balbuena, Héctor M. Pérez Meana, Alejandro Martínez-González, Luís Nino de Rivera, Mariko Nakano-Miyatake:
Fast convergent analog adaptive filter. 67-70 - Manuel A. Leandro, Álvaro Villegas, José Manuel Pardo:
Efficient isolated word recognition in Spanish based on static modeling. 71-74 - Jean-Luc Cochard, Olivier Oppizzi:
Reliability in a multi-agent spoken language recognition system. 75-78
Discriminative Training I, II
- Tomoko Matsui, Sadaoki Furui:
A study of speaker adaptation based on minimum classification error training. 81-84 - Albino Nogueiras Rodríguez, José B. Mariño:
Maximum likelihood based discriminative training of acoustic models. 85-88 - Hugues Leprieur, Patrick Haffner:
Discriminant learning with minimum memory loss for improved non-vocabulary rejection. 89-92 - Cesar Martín del Alamo, F. Javier Caminero-Gil, Celinda de la Torre-Munilla, Luis A. Hernández Gómez:
Codebook weights adaptation for discriminative training of SCHMM-based speech recognition systems. 93-96 - Kyungmin Na, Bumki Jeon, Dong-Il Chang, Soo-Ik Chae, Souguil Ann:
Discriminative training of hidden Markov models using overall risk criterion and reduced gradient method. 97-100 - Qiang Huo, Chorkin Chan:
Discriminative training of HMM based speech recognizer with gradient projection method. 101-104 - Javier Hernando Pericas, J. Ayarte, Enric Monte:
Optimization of speech parameter weighting for CDHMM word recognition. 105-108 - Mazin G. Rahim, Chin-Hui Lee, Biing-Hwang Juang:
Discriminative utterance verification for connected digits recognition. 529-532 - Antonio M. Peinado, Antonio J. Rubio, José C. Segura, Victoria E. Sánchez, Jesús Esteban Díaz Verdejo:
MCE estimation of VQ parameters for MVQHMM speech recognition. 533-536 - Wolfgang Reichl, Günther Ruske:
Discriminative training for continuous speech recognition. 537-540 - Kuldip K. Paliwal, Michiel Bacchiani, Yoshinori Sagisaka:
Minimum classification error training algorithm for feature extractor and pattern classifier in speech recognition. 541-544
Auditory Modeling in Speech Recognition
- Stephan Euler:
Integrated optimization of feature transformation for speech recognition. 109-112 - Andrew C. Morris, José M. Pardo:
Phoneme transition detection and broad classification using a simple model based on the function of onset detector cells found in the cochlear nucleus. 115-118 - Eric Fragnière, André van Schaik, Eric A. Vittoz:
Linear predictive coding of speech using an analogue cochlear model. 119-122 - Edward Jones, Eliathamby Ambikairajah:
Pitch extraction of telephone bandwidth speech using a place-temporal approach. 123-126 - Markus Bodden, Timothy R. Anderson:
A binaural selectivity model for speech recognition. 127-130 - Cristina Dobrin, Petri Haavisto, Kari Laurila, Jaakko Astola:
Speech recognition experiments in a noisy environment using auditory system modelling. 131-134
Applications and Systems
- Frédéric Berthommier, Georg F. Meyer:
Source separation by a functional model of amplitude demodulation. 135-138 - Vratislav Davidek, Pavel Sovka, Jiri Sika:
Real-time implementation of spectral subtraction algorithm for suppression of acoustic noise in speech. 141-144 - Bert Van Coile, Hans-Wilhelm Rühl, L. Vogten, M. Thoone, S. Goß, D. Delaey, E. Moons, Jacques M. B. Terken, Jan-Roelof de Pijper, Marianne Kugler, P. Kaufholz, Regina Krüger, Steven Leys, S. Willems:
Speech synthesis for the new pan-european traffic message control system RDS-TMC. 145-148 - Roberto Pacifici, G. Manca:
Echo cancelling in speech recognition systems. 149-152 - Teodoro Calonge Cano, Luis Alonso, Rui Ralha, A. L. Sánchez:
Parallel implementation of an hybrid neural network used for speech recognition task. 153-156 - M. Li, J. T. Proudfoot:
Hardware design of LPC coding for speech feature extraction. 157-160 - Henning Bergmann, Hans-Hermann Hamer, Andreas Noll, Annedore Paeseler, Horst Tomaschewski:
Modularization in task-specific language modelling. 161-164 - Carlos Avendaño, Hynek Hermansky, Eric A. Wan:
Beyond NYQUIST: towards the recovery of broad-bandwidth speech from narrow-bandwidth speech. 165-168
Large Vocabulary
- David Pye, Philip C. Woodland, Steve J. Young:
Large vocabulary multilingual speech recognition using HTK. 181-184 - Lori Lamel, Martine Adda-Decker, Jean-Luc Gauvain:
Issues in Large Vocabulary, Multilingual Speech Recognition. 185-188 - James Barnett, Paul G. Bamberg, Martin Held, Juan Huerta, Linda Manganaro, Adam Weiss:
Comparative performance in large-vocabulary isolated-word recognition in five european languages. 189-192 - Julie Brousseau, Caroline Drouin, George F. Foster, Pierre Isabelle, Roland Kuhn, Yves Normandin, Pierre Plamondon:
French speech recognition in an automatic dictation system for translators: the transtalk project. 193-196 - Christian Dugast, Xavier L. Aubert, Reinhard Kneser:
The Philips large-vocabulary recognition system for american English, French, and German. 197-200 - Sung-Chien Lin, Lee-Feng Chien, Keh-Jiann Chen, Lin-Shan Lee:
A syllable-based very-large-vocabulary voice retrieval system for Chinese databases with textual attributes. 203-206 - Michael Riley, Andrej Ljolje, Donald Hindle, Fernando Pereira:
The AT&t 60,000 word speech-to-text system. 207-210 - Tai-Hsuan Ho, Hsin-Min Wang, Lee-Feng Chien, Keh-Jiann Chen, Lin-Shan Lee:
Fast and accurate continuous speech recognition for Chinese language with very large vocabulary. 211-214 - Zuoying Wang, Jun Wu, Xi Xiao, Jin Quo:
Methods towards the very large vocabulary Chinese speech recognition. 215-218 - Gary D. Cook, Anthony J. Robinson:
Utterance clustering for large vocabulary continuous speech recognition. 219-222
Speech Coding I
- Thomas Eriksson, Jan Linden, Jan Skoglund:
Vector quantization of glottal pulses. 225-228 - Michele Festa, Daniele Sereno:
A speech coding algorithm based on prototypes interpolation with critical bands and phase coding. 229-232 - Dionysis E. Tsoukalas, Jiannis Mouropoulos, George Kokkinakis:
Very low-bitrate speech coding using perceptually-derived spectral data. 233-236 - Lorenzo Piazzo:
A new very low bit rate speech coder: the step decomposition vocoder. 237-240 - Ian A. Atkinson, Ahmet M. Kondoz, Barry G. Evans:
Time envelope LP vocoder: a new coding technique at very low bit rates. 241-244 - Dan Stefanoiu, Radwan Kastantin, Gang Feng:
Speech coding based on the discrete-time wavelet transform and human auditory system properties. 661-664 - F. J. Ancin, M. L. Larreategui, B. L. Burrows, Rolando A. Carrasco:
Wavelets for low bit rate speech coding applications. 665-669 - Elimberaza Mandridake, Rachid Atay, Mohamed Najim:
Adaptive speech vector coding with a multiresolution hierarchical codebook. 669-672 - Andrei Popescu, Nicolas Moreau:
Subband analysis-by-synthesis coding. 673-676 - Clifford I. Parris, Danny Wong, Francois Chambon:
A robust 2.4kb/s LP-MBE with iterative LP modelling. 677-680 - M. S. Torres-Guijarro, Francisco Javier Casajús-Quirós:
Improved transient representation and quantization for sinusoidal speech coders. 681-684 - Eric W. M. Yu, Cheung-Fat Chan:
Efficient multiband excitation linear predictive coding of speech at 1.6 kbps. 685-688 - Bruno Wery, Stephane Deketelaere:
Voice coding in the MSBN satellite communication system. 689-692 - Barry M. G. Cheetham, Xiaoqin Sun, W. T. K. Wong:
Spectral envelope estimation for low bit-rate sinusoidal speech coders. 693-696
Speech Signal Processing / Wavelets
- Israel Cohen, Shalom Raz, David Malah:
Shift-invariant adaptive local trigonometric decomposition. 247-250 - Paul Micallef, Edward H. S. Chilton:
Spectral envelope of speech using wavelets. 251-254 - Andrzej Drygajlo, Nicolas Thevoz:
Multiresolution speech analysis using fast time-varying orthogonal wavelet packet transform algorithms. 255-258 - Maria Rangoussi, Flemming Pedersen:
Second- and third-order wigner distributions in hierarchical recognition of speech phonemes. 259-262 - Gaafar M. K. Saleh, Mahesan Niranjan, William J. Fitzgerald:
The use of maximum a posteriori parameters in linear prediction of speech. 263-268
Applications of Speech Technology
- William C. G. Ortel:
Observed long-term changes in customer calling patterns in a telephone application using automatic speech recognition. 269-272 - Ayman Asadi, David M. Lubensky, L. Madhavrao, Jayant M. Naik, Vijay Raman, George Vysotsky:
Combining speech algorithms into a "natural" application of speech technology for telephone network services. 273-276 - Yevgeny Ludovik, Valeriy Sibirtsev:
Intelligent answering machine-secretary. 277-280 - Kyung-ho Loken-Kim, Young-Duk Park, Suguru Mizunashi, Laurel Fais, Tsuyoshi Morimoto:
Verbal-gestural behaviors in multimodal spoken language interpreting telecommunications. 281-284 - Jung-Kuei Chen, Lin-Shan Lee, Frank K. Soong:
Large vocabulary, word-based Mandarin dictation system. 285-288
Visual Speech
- Bertrand Le Goff, Thierry Guiard-Marigny, Christian Benoît:
Read my lips... and my jaw! how intelligible are the components of a speaker's face? 291-294 - Angela Fuster Duran:
Mcgurk effect in Spanish and German listeners: influences of visual cues in the perception of Spanish and German conflicting audio-visual stimuli. 295-298 - Jonas Beskow:
Rule-based visual speech synthesis. 299-302 - Fabio Lavagetto, Paolo Lavagetto:
A new algorithm for visual synthesis of speech. 303-306 - Harouna Kabré:
Audiovisual speech recognition using the fuzzy shape filters model. 307-310
Speaker Recognition I-III
- Jialong He, Li Liu, Günther Palm:
On the use of features from prediction residual signals in speaker identification. 313-316 - Kai Tat Ng, Haizhou Li, Jean Paul Haton:
Some nonparametric distance measures in speaker verification. 317-320 - Michael J. Carey, Graham Tattersall, Eluned S. Parris:
Adaptive transforms for speaker recognition. 321-324 - Kai Tat Ng, Jian Su, Bingzheng Xu:
Speaker recognition with discriminative speaker VQ models. 325-328 - A. Federico, Andrea Paoloni:
Parametric speaker recognition over large population of telephonic voices. 329-332 - Toomas Altosaar, Einar Meister:
Speaker recognition experiments in Estonian using multi-layer feed-forward neural nets. 333-336 - Ivan Magrin-Chagnolleau, Jean-François Bonastre, Frédéric Bimbot:
Effect of utterance duration and phonetic content on speaker identification using second-order statistical methods. 337-340 - Pavel V. Labulin, Sergey L. Koval, Andrey N. Raev:
Automatic speaker recognition using formants-based nearest-neighbour distance measure. 341-344 - Mohammad Mehdi Homayounpour, Gérard Chollet:
Discrimination of voices of twins and siblings for speaker verification. 345-348 - Kay M. Berkling, Etienne Barnard:
Theoretical error prediction for a language identification system using optimal phoneme clustering. 351-354 - Jesper Ø. Olsen:
Separation of speakers in audio data. 355-358 - J.-L. Bonifas, Inma Hernáez Rioja, Borja Etxebarria Gonzalez, S. Saoudi:
Text-dependent speaker verification using dynamic time warping and vector quantization of LSF. 359-362 - Haizhou Li, Jean Paul Haton, Yifan Gong:
On MMI learning of Gaussian mixture for speaker models. 363-366 - Yifan Gong:
Evaluation of Bayes decision approach to automatic determination of thresholds for speaker verification. 367-370 - Daniele Falavigna:
Comparison of different HMM based methods for speaker verification. 371-374 - J. Sheikhzadegan, M. Tebiani, M. Lotfizad, Mahmood R. Roohani:
Speaker classification by neural network for short utteranses using phoneme groups in Farsi. 375-378 - Jean-Luc Le Floch, Claude Montacié, Marie-José Caraty:
Speaker recognition experiments on the NTIMIT database. 379-382 - Michael Wagner, John S. Mason, J. Bruce Millar:
Speaker identification using vector quantisation with codeword-specific derivative coding. 383-386 - Haizhou Li, Jean Paul Haton, Jian Su, Yifan Gong:
Speaker recognition with temporal transition models. 617-620 - Tomoko Matsui, Tomohito Kanno, Sadaoki Furui:
Speaker recognition using HMM composition in noisy environments. 621-624 - ChiWei Che, Qiguang Lin:
Speaker recognition using HMM with experiments on the yoho database. 625-628 - Kin Yu, John S. Mason, John Oglesby:
Speaker recognition models. 629-632 - Thierry Artières, Patrick Gallinari:
Multi-state predictive neural networks for text-independent speaker recognition. 633-636
Voice Source Analysis and Modelling
- Francesco Beritelli, Salvatore Casale, Marco Russo:
A voiced/unvoiced speech discrimination technique based on fuzzy logic. 389-392 - Vassilios Darsinos, Christophe d'Alessandro, B. Yegnanarayana:
Evaluation of a periodic/aperiodic speech decomposition algorithm. 393-396 - Jean Rouat, Yong Chun Liu, Daniel Morissette:
A pitch determination and voiced/unvoiced decision algorithm for noisy speech. 397-400 - Léonard Janer:
Modulated Gaussian wavelet transform based speech analyser (MGWTSA) pitch detection algorithm (PDA). 401-404 - M. L. Larreategui, F. J. Ancin, Rolando A. Carrasco:
An improved epoch detection algorithm based on sinusoidal modelling of speech. 409-412 - Vassilios Darsinos, Dimitrios Galanis, George Kokkinakis:
A method for fully automatic analysis and modelling of voice source characteristics. 413-416 - Hartmut R. Pfitzinger:
Dynamic vowel quality: a new determination formalism based on perceptual experiments. 417-420 - Sumio Ohno, Hiroya Fujisaki:
A method for quantitative analysis of the local speech rate. 421-424
Voice Personality Characteristics in TTS
- Ki-Seung Lee, Dae Hee Youn, Il-Whan Cha:
Voice personality transformation using an orthogonal vector space conversion. 427-430 - Makoto Hashimoto, Norio Higuchi:
Spectral mapping for voice conversion using speaker selection and vector field smoothing. 431-435 - Norio Higuchi, Makoto Hashimoto:
Analysis of acoustic features affecting speaker identification. 435-438 - Masato Akagi, Taw Ienaga:
Speaker individualities in fundamental frequency contours and its control. 439-442 - King-fai Lam, Cheung-Fat Chan:
Interpolating MBE v/UV mixture function for high quality synthesis of speech. 443-447 - Yannis Stylianou, Olivier Cappé, Eric Moulines:
Statistical methods for voice quality transformation. 447-450 - Yannis Stylianou, Jean Laroche, Eric Moulines:
High-quality speech modification based on a harmonic + noise model. 451-454 - Sahar E. Bou-Ghazah, John H. L. Hansen:
Source generator based stressed speech perturbation. 455-458
Robust Speech Recognition in Noise
- Néstor Becerra Yoma, Fergus R. McInnes, Mervyn A. Jack:
Improved algorithms for speech recognition in noise using lateral inhibition and SNR weighting. 461-464 - Olivier Siohan, Yifan Gong, Jean Paul Haton:
Noise adaptation using linear regression for continuous noisy speech recognition. 465-468 - Ruikang Yang, Markku Majaniemi, Petri Haavisto:
Dynamic parameter compensation for speech recognition in noise. 469-472 - Andrzej Drygajlo, Nathalie Virag, Gregoire Cosendai:
Robust speech recognition in noise using speech enhancement based on masking properties of the auditory system and adaptive HMM. 473-476 - Dong Yu, Taiyi Huang:
Canonical correlation based compensation approach for robust speech recognition in noisy environment. 477-480 - Pedro J. Moreno, Bhiksha Raj, Richard M. Stern:
A unified approach for robust speech recognition. 481-484
Modelling and Training for Robust Recognition
- Harald Singer, Kuldip K. Paliwal, Tomohiko Beppu, Yoshinori Sagisaka:
Effect of rasta-type processing for speech recognition with speaking-rate mismatches. 487-490 - Nikki Mirghafori, Eric Fosler, Nelson Morgan:
Fast speakers in large vocabulary continuous speech recognition: analysis & antidotes. 491-494 - Wu Chou, Mazin G. Rahim, Eric R. Buhrke:
Signal conditioned minimum error rate training. 495-498 - Takeshi Matsumura, Shoichi Matsunaga:
Non-uniform unit HMMS for speech recognition. 499-502 - Ananth Sankar, Françoise Beaufays, Vassilios Digalakis:
Training data clustering for improved speech recognition. 503-506 - Qiang Huo, Chorkin Chan:
On the use of bi-directional contextual dependence in acoustic modeling for speech recognition. 507-510 - Tomohiko Beppu, Kiyoaki Aikawa:
Spontaneous speech recognition using dynamic CEPSTRA incorporating forward and backward masking effect. 511-514 - Mohamed Afify, Yifan Gong, Jean Paul Haton:
Stochastic trajectory models for speech recognition: an extension to modelling time correlation. 515-518 - Ben P. Milner, Saeed Vaseghi:
An analysis of cepstral-time matrices for noise and channel robust speech recognition. 519-522 - Zeév Rivlin:
A confidence measure for acoustic likelihood scores. 523-526
Semantic Interpretation, ELSNET
- Alain Biem, Erik McDermott, Shigeru Katagiri:
A discriminative filter bank model for speech recognition. 545-548 - Holger Stahl, Johannes Müller:
A stochastic grammar for isolated representation of syntactic and semantic knowledge. 551-554 - Esther Levin, Roberto Pieraccini:
Concept-based spontaneous speech understanding system. 555-558 - Antonio Bonafonte, José B. Mariño, Eduardo Lleida:
Semantic decoding of speech in constrained domains. 559-562 - Marcello Federico, Fabrizio Vernesoni:
A speech understanding architecture for an information query system. 563-566 - Josef G. Bauer, Holger Stahl, Johannes Müller:
A one-pass search algorithm for understanding natural spoken time utterances by stochastic models. 567-570
Data based Text-to-Speech
- Robert E. Donovan, Philip C. Woodland:
Improvements in an HMM-based speech synthesiser. 573-576 - Yoshiharu Itoh, Makoto Hashimoto, Norio Higuchi:
Sub-phonemic optimal path search for concatenative speech synthesis. 577-580 - Alan W. Black, Nick Campbell:
Optimising selection of units from speech databases for concatenative synthesis. 581-584 - Eduardo López Gonzalo, Luis A. Hernández Gómez:
Automatic data-driven prosodic modeling for text-to-speech. 585-588 - F. Mana, Silvia Quazza:
Text-to-speech oriented automatic learning of Italian prosody. 589-592
Prosody in Text-to-Speech
- Andrew P. Breen:
A simple method of predicting the duration of syllables. 595-598 - Marcel Riedi:
A neural-network-based model of segmental duration for speech synthesis. 599-602 - Frédéric Beaugendre:
Generating French intonation at different speaking rates. 603-606 - Evelyne Tzoukermann, Olivier Soumoy:
Segmental duration in French text-to-speech synthesis. 607-610 - Merle Horne, Marcus Filipsson:
Developing the prosodic component for Swedish speech synthesis. 611-614
Speaker Verification
- Anand R. Setlur, Thomas Jacobs:
Results of a speaker verification service trial using HMM models. 639-642 - Han-Sheng Liou, Richard J. Mammone:
Application of phonetic weighting to the neural tree network based speaker recognition system. 643-646 - Douglas A. Reynolds, Beth A. Carlson:
Text-dependent speaker verification using decoupled and integrated speaker and speech recognizers. - Jean-Luc Gauvain, Lori Lamel, B. Prouts:
Experiments with speaker verification over the telephone. 651-654
CELP Coding
- Sofia Moreno Perez, Ramón García Gómez:
Improved CELP algorithm suited for various speech coding applications. 699-702 - S. A. Atungsiri, Ahmet M. Kondoz, Barry G. Evans:
Comparative study of two codecs for an enhanced GSM system. 703-707 - Siu-pun Chui, Cheung-Fat Chan:
Fast low-delay CELP coding of speech at 8kbps. 707-710 - Hong-Goo Kang, Jeong Tae Seo, Il-Whan Cha, Dae Hee Youn:
A low bit-rate speech coder using the perceptual properties of the human ear. 711-714 - Silvio Cucchi, Marco Fratti:
Very fast CELP coding using stochastic innovations. 715-718 - Mustapha Bouraoui, W. Glass, Gang Feng:
Fast codebook search algorithm based on hamming ECC for algebraic CELP speech coding. 719-722 - Tim Fingscheidt, Thomas Wiechers, Eckhard Delfs:
Implementation aspects of the GSM half-rate speech codec. 723-726 - Scott D. Watson, Barry M. G. Cheetham, W. T. K. Wong, A. V. Lewis:
Low and variable bit-rate speech coding for ATM networks. 727-730 - Andrei Popescu, Nicolas Moreau, Claude Lamblin:
A differential encoding method for the LTP delay in CELP coders. 731-733
Formant Analysis
- Philipp Schmid, Etienne Barnard:
Robust, n-best formant tracking. 737-740 - Fabrice Plante, William A. Ainsworth:
Formant tracking using reassigned spectrum. 741-744 - Jean Schoentgen, Sorin Ciocea:
Direct calculation of the vocal tract area function from measured formant frequencies. 745-748 - Don X. Sun:
Robust estimation of spectral center-of-gravity trajectories using mixture spline models. 749-752 - Jeng-Shyang Pan, Fergus R. McInnes, Mervyn A. Jack:
Bound for minkowski metric based on LP distortion measure. 753-756 - Keiichi Tokuda, Takashi Masuko, Tetsuya Yamada, Takao Kobayashi, Satoshi Imai:
An algorithm for speech parameter generation from continuous mixture HMMs with dynamic features. 757-760 - Hywel B. Richards, John S. Mason, Melvyn J. Hunt, John S. Bridle:
Deriving articulatory representations of speech. 761-764
Acoustic-Phonetic Modelling
- Xavier L. Aubert, Christian Dugast:
Improved acoustic-phonetic modeling in philips' dictation system by handling liaisons and multiple pronunciations. 767-770 - Nelson Morgan, Su-Lin Wu, Hervé Bourlard:
Digit recognition with stochastic perceptual speech models. 771-774 - R. N. V. Sitaram, Thippur V. Sreenivas:
On incorporating phonemic constraints in hidden Markov models for speech recognition. 775-778 - Saga Chang, Sin-Horng Chen:
An improvement on syllable-based continuous Mandarin speech recognition via using inter-syllable boundary models. 779-782 - Torbjørn Svendsen, Frank K. Soong, Heiko Purnhagen:
Optimizing baseforms for HMM-based speech recognition. 783-787 - Jorge Alvarez-Cercadillo, Chin-Hui Lee, Luis A. Hernández Gómez:
Acoustic modeling of context dependent units, for large vocabulary speech recognition in Spanish. 787-790 - Shigeki Okawa, Katsuhiko Shirai:
Estimation of statistical phoneme center and its application to accurate phoneme modelling. 791-794 - Zhishun Li, Patrick Kenny, Douglas D. O'Shaughnessy:
Hybrid hidden Markov models in speech recognition. 795-798 - Luciano Fissore, Franco Ravera, Pietro Laface:
Acoustic-phonetic modeling for flexible vocabulary speech recognition. 799-802 - Pierre Dumouchel, Douglas D. O'Shaughnessy:
Segmental duration and HMM modeling. 803-806 - A. Ghio, Mario Rossi:
A knowledge-based model for speaker-independent, acoustic-phonetic decoding. 807-810
Spoken Language Resources I
- Ea-Ee Jan, Piergiorgio Svaizer, James L. Flanagan:
A database for microphone array experimentation. 813-816 - Terri Lander, Ronald A. Cole, Beatrice T. Oshika, Mike Noel:
The OGI 22 language telephone speech corpus. 817-820 - Ronald A. Cole, Mike Noel, Terri Lander, T. Durham:
New telephone speech corpora at CSLU. 821-824 - Els den Os, T. I. Boogaart, Lou Boves, Esther Klabbers:
The Dutch polyphone corpus. 825-828 - The Onomastica interlanguage pronunciation lexicon.
- J. Bertenstam, Mats Blomberg, Rolf Carlson, Kjell Elenius, Björn Granström, Joakim Gustafson, Sheri Hunnicutt, Jesper Högberg, Roger Lindell, Lennart Neovius, Lennart Nord, Antonio de Serpa-Leitao, Nikko Ström:
The waxholm application database. 833-836 - Fabrice Plante, Georg F. Meyer, William A. Ainsworth:
A pitch extraction reference database. 837-840 - Richard Winski, Roger K. Moore, Dafydd Gibbon:
Eagles spoken language working group: overview and results. 841-845 - Celinda de la Torre-Munilla, Luis A. Hernández Gómez, Daniel Tapias Merino:
CEUDEX: a data base oriented to context-dependent units training in Spanish for continuous speech recognition. 845-848 - Karmele López de Ipiña, Inés Torres, Lourdes Oñederra:
Design of a phonetic corpus for a speech database in basque language. 851-854 - Vito Pirrelli, Stefano Federici:
"you'd better say nothing than say something wrong": analogy, accuracy and text-to-speech applications. 855-858 - A. Misheva, S. Dimitrova, V. Filipov, E. Grigoreva, M. Nikov, Peter Roach, Simon Arnfield:
Bulgarian speech database: a pilot study. 859-863 - Wolfgang J. Hess, Klaus J. Kohler, Hans-Günther Tillmann:
The Phondat-verbmobil speech corpus. 863-866 - Dominic S. F. Chan, Adrian Fourcin, Dafydd Gibbon, Björn Granström, Mark A. Huckvale, George Kokkinakis, Knut Kvale, Lori Lamel, Børge Lindberg, Asunción Moreno, Jiannis Mouropoulos, Francesco Senia, Isabel Trancoso, Corin 't Veld, Jerome Zeiliger:
EUROM - a spoken language resource for the EU - the SAM projects. 867-870 - Gernot A. Fink, Michaela Johanntokrax, Brigitte Schaffranietz:
A flexible formal language for the orthographic transcription of spontaneous spoken dialogues. 871-874 - Hsiao-Chuan Wang:
Design and implementation of Mandarin speech database in taiwan. 875-877
Search Methods I-II
- Philippe Morin, Ted H. Applebaum:
Word hypothesizer based on reliably detected phoneme similarity regions. 897-900 - Stefan Ortmanns, Hermann Ney:
Experimental analysis of the search space for 20 000-word speech recognition. 901-904 - Ming-Sheng Wang, Satoshi Imai:
Speech parsing by downward request search based on the divide and conquer method. 905-908 - Claire Waast, Lalit R. Bahl, Marc El-Bèze:
Fast match based on decision tree. 909-913 - Yoshiaki Noda, Shigeki Sagayama:
Fast and accurate beam search using forward heuristic functions in HMM-LR speech recognition. 913-916 - Don Colton, Mark A. Fanty, Ronald A. Cole:
Utterance verification improves closed-set recognition and out-of-vocabulary rejection. 1067-1071 - Víctor M. Jiménez, Andrés Marzal, Jean Monné:
A comparison of two exact algorithms for finding the n-best sentence hypotheses in continuous speech recognition. 1071-1074 - Kazuya Takeda, Shingo Kuroiwa, Masaki Naito, Seiichi Yamamoto:
Top-down speech detection and n-best meaning search in a voice activated telephone extension system. 1075-1078 - Frank Seide:
Fast likelihood computation for continuous-mixture densities using a tree-based nearest neighbor search. 1079-1083 - Peter Beyerlein, Meinhard Ullrich:
Hamming distance approximation for a fast log-likelihood computation for mixture densities. 1083-1086 - Yasuhiro Komori, Masayuki Yamada, Hiroki Yamamoto, Yasunori Ohora:
An efficient output probability computation for continuous HMM using rough and detail models. 1087-1090 - Jürgen Fritsch, Ivica Rogina, Tilo Sloboda, Alex Waibel:
Speeding up the score computation of HMM speech regognizers with the bucket voronoi intersection algorithm. 1091-1094
Analysis for Speech Recognition I-III
- Jan Nouza:
On the speech feature selection problem: are dynamic features more important than the static ones? 919-922 - Climent Nadeu, Pau Pachès-Leal, Biing-Hwang Juang:
Filtering the time sequence of spectral parameters for speaker-independent CDHMM word recognition. 923-926 - Dimitris Tambakas, Nikos Fakotakis, George Kokkinakis:
Robust phoneme prototype extraction for speech recognition. 927-930 - Melvyn J. Hunt:
The even transform: a variance-equalizing orthogonal transformation and its application to speech recognition. 931-934 - Kai Hübener:
Using segmental coefficients in HMM speech recognition. 935-938 - Felix Freitag, Enric Monte, Javier Hernando Pericas:
On the use of the derivative of the pole trajectories of the LPC analysis parameter sequence as an alternative to delta parameters. 1373-1376 - P. V. S. Rao, R. Raveendran:
On the dual role of sequence directionality and coherence in a spectral predictive discrimination model. 1377-1380 - Climent Nadeu, Javier Hernando, Mónica Gorricho:
On the decorrelation of filter-bank energies in speech recognition. 1381-1384 - Jean-Claude Junqua, Dominique Fohr, Jean-François Mari, Ted H. Applebaum, Brian A. Hanson:
Time derivatives, cepstrai normaiization, and spectral parameter filtering for continuously spelled names over the telephone. 1385-1388 - Linda Djezzar:
Some new considerations about the spectral form of French stop bursts. 1389-1392 - P. V. S. Rao, R. Raveendran:
Characterization of spectral transition region by various prediction approaches for discriminating stop consonants. 1393-1396 - A. Vorstermanst, Jean-Pierre Martens, Bert Van Coile:
Fast automatic segmentation and labeling: results on TIMIT and EUROMO. 1397-1400 - Gordon Ramsay, Li Deng:
Maximum-likelihood estimation for articulatory speech recognition using a stochastic target model. 1401-1404 - John Sirigos, Nikos Fakotakis, George Kokkinakis:
A comparison of several speech parameters for speaker independent speech recognition and speaker recognition. 1407-1410 - Nabil N. Bitar, Carol Y. Espy-Wilson:
Speech parameterization based on phonetic features: application to speech recognition. 1411-1414 - Klaus Beulen, Lutz Welling, Hermann Ney:
Experiments with linear feature extraction in speech recognition. 1415-1418 - Christian-Michael Westendorf:
Nonlinear feature transformation based on statistical phoneme modeling. 1419-1422 - Ramesh R. Sarukkai, Dana H. Ballard:
The distance set representation of speech segments. 1427-1430 - Simon Dobrisek, R. Mihelic, Nikola Pavesic:
Multi-variate mixture probability density modelling of VQ codebook using gradient descent algorithm. 1431-1434
Perception/Pitch
- Agaath M. C. Sluijter, Vincent J. van Heuven:
Intensity and vocal effort as cues in the perception of stress. 941-944 - Mariusz Owsianny:
The effect of voice pitch on perception of synthetic Polish vowels. 945-948 - David House:
Perception of prepausal tonal contours: implications for automatic stylization of intonation. 949-952 - Tomio Takara:
Experimental study on perception of the glottal explosive of the Japanese ryukyu dialect. 953-956 - Christophe d'Alessandro, Sophie Rosset, O. Piot:
Measurement of pitch perception for F0 glides. 957-960
Perception/Context
- S. Wauquier-Gravelines:
Interferences between phonemes: evidence for "perceptual domains" in continuous speech perception. 963-966 - R. J. J. H. van Son, Louis C. W. Pols:
The influence of local context on the identification of vowels and consonants. 967-970 - William A. Ainsworth:
Effect of preceding noise duration on the perception of voiced plosives and vowels. 971-974 - Anne Bonneau, Linda Djezzar, Yves Laprie:
Influence of a prior knowledge of the vocalic context on stop burst perception. 975-978 - Wilma van Donselaar:
Listeners' use of the 'information-accentuation' interdependence in processing implicit and explicit references. 979-982
Prosody: Models/Modelling Prosody
- Hiroya Fujisaki, Sumio Ohno:
Analysis and modeling of fundamental frequency contours of English utterances. 985-988 - Pascale Nicolas, Daniel Hirst:
Symbolic coding of higher-level characteristics of fundamental frequency curves. 989-992 - Kenneth N. Ross, Mari Ostendorf:
A dynamical system model for recognizing intonation patterns. 993-996 - Andrew J. Hunt:
Syntactic influence on prosodic phrasing in the framework of the link grammar. 997-1000 - Johanneke Caspers, Vincent J. van Heuven:
Effects of time pressure on the choice of accent-lending and boundary-marking pitch configurations in dutch. 1001-1004
Prosody I-III
- Mayumi Sakata, Keikichi Hirose:
Analysis and synthesis of prosodic features in spoken dialogue of Japanese. 1007-1010 - Nick Campbell:
Prosodic influence on segmental quality. 1011-1014 - Bojan Petek:
Towards voice-interactive telephone services in slovenia: on prosody of digits using the sociolinguistic framework. 1015-1018 - Gregor Möhler, Grzegorz Dogil:
Test environment for the two level model of Germanic prominence. 1019-1022 - Nancy A. Daly-Kelly:
Linguistic and acoustic characteristics of pause intervals in spontaneous speech. 1023-1026 - Yoichi Yamashita, Riichiro Mizoguchi:
Modeling the contextual effects on prosody in dialog. 1329-1332 - Ralf Kompe, Andreas Kießling, Heinrich Niemann, Elmar Nöth, Ernst Günter Schukat-Talamazzini, A. Zottmann, Anton Batliner:
Prosodic scoring of word hypotheses graphs. 1333-1336 - Stefan Harbeck, Andreas Kießling, Ralf Kompe, Heinrich Niemann, Elmar Nöth:
Robust pitch period detection using dynamic programming with an ANN cost function. 1337-1340 - Toshio Hirai, Norio Higuchi, Yoshinori Sagisaka:
Automatic detection of major phrase boundaries using statistical properties of superpositional F0 control model parameters. 1341-1344 - Paul Taylor:
Using neural networks to locate pitch accents. 1345-1348 - Helmer Strik:
The relation between physiological signals and F0: a quantitative analysis method. 2027-2030 - Masanobu Abe:
Analysis of prosodic characteristics in speech advisories and their application to speech output. 2031-2034 - Michel Guitton, F. Javier Caminero-Gil, Joel Crestel, Laure Charonnat:
Pitch and elocution rate of diverts speech. 2035-2038 - Volker Strom:
Detection of accents, phrase boundaries and sentence modality in German with prosodic features. 2039-2042 - Yann Morlec, Gérard Bailly, Véronique Aubergé:
Synthesis and evaluation of intonation with a superposition model. 2043-2046 - Marcus L. Fach, Wolfgang Wokurek:
Pitch accent classification of fundamental frequency contours by hidden Markov models. 2047-2050 - Dik J. Hermes:
Measuring the perceptual similarity of pitch contours. 2051-2054 - Philippe Langlais:
Microprosodic study of isolated French word corpora. 2055-2058
Quantization of Spectral Parameters
- Kuldip K. Paliwal:
Interpolation properties of linear prediction parametric representations. 1029-1032 - H. B. Choi, W. T. K. Wong, Barry M. G. Cheetham, Colin C. Goodyear:
Interpolation of spectral information for low bit rate speech coding. 1033-1036 - Ladan Baghai-Ravary, Steve W. Beet, Osman Tokhit:
Adaptive flux interpolation, flow-based prediction, delta or delta-delta coefficients: which is best? 1037-1040 - Balázs Kövesi, Samir Saoudi, Jean-Marc Boucher, Z. Reguly:
LSP Markov model for reducing the complexity of vector quantization. 1041-1044 - Hamid Reza Sadegh Mohammadi, W. Harvey Holmes:
Predictive delta adaptive scalar quantization: an efficient method for coding the short-term speech spectrum. 1045-1048 - Dong-Il Chang, Kyungmin Na, Souguil Ann:
Conditional split vector quantization of LSP parameterswith multiple search. 1049-1052 - Stefan Bruhn:
Matrix product quantization for very-low-rate mobile speech communication. 1053-1056 - Ulrich Balss, Herbert Reininger, Holger Schalk, Dietrich Wolf:
Robust vector quantization for low bit rate speech coding. 1057-1060 - H. C. Ng, S. H. Leung:
Scalar quantization of LSF parameters at 28 bits/frame. 1061-1064
Systems and Evaluation
- Juana M. Gutiérrez-Arriola, Francisco M. Gimenez de los Galanes, Mohammad Hasan Savoji:
Improvement of the quality of speech synthesis by analysis using segmentation and modeling of the excitation signal. 1097-1100 - Przemyslaw Dymarski, Slawomir Kuklinski, Siawomir Kula:
A text-to-speech synthesizer for the Polish language. 1101-1104 - Carsten Jürgens, Mirko Wunderlich:
A comparison of different speech units for the German TTS-system tubsy. 1105-1108 - Cristina Delogu, Andrea Paoloni, Paola Ridolfi:
Confusions among Italian consonants in good and in telephone conditions: differences between text-to-speech systems and natural speech with noise. 1109-1112 - Briony Williams:
Text-to-speech synthesis for welsh and welsh English. 1113-1117 - Ove Andersen, Paul Dalsgaard:
Multi-lingual testing of a self-learning approach to phonemic transcription of orthography. 1117-1120
Adaption in Speech Recognition
- V. Beattie, S. Edmondson, D. Miller, Y. Patel, G. Talvola:
An integrated multi-dialect speech recognition system with optional speaker adaptation. 1123-1126 - Leonardo Neumeyer, Ananth Sankar, Vassilios Digalakis:
A comparative study of speaker adaptation techniques. 1127-1130 - George Zavaliagkos, Richard M. Schwartz, John W. McDonough, John Makhoul:
Adaptation algorithms for large scale HMM recognizers. 1131-1135 - Shoichi Matsunaga, Tetsuo Kosaka, Tohru Shimizu:
Speaking-style and speaker adaptation for the recognition of spontaneous dialogue speech. 1135-1138 - Stefan Dobler, Hans-Wilhelm Rühl:
Speaker adaptation for telephone based speech dialogue systems. 1139-1143 - Koichi Shinoda, Takao Watanabe:
Speaker adaptation with autonomous control using tree structure. 1143-1146 - Masahiro Tonomura, Tetsuo Kosaka, Shoichi Matsunaga, Akito Monden:
Speaker adaptation fitting training data size and contents. 1147-1150 - H. C. Choi, Robin W. King:
Direct and joint-space approaches to the use of spectral transformation for speaker adaptation in continuous speech recognition. 1151-1154 - C. J. Leggetter, Philip C. Woodland:
Flexible speaker adaptation for large vocabulary speech recognition. 1155-1158
Spoken Dialogue
- Bernhard Kaspar, Georg Fries, Karlheinz Schuhmacher, Antje Wirth:
Faust - a directory assistance demonstrator. 1161-1164 - Keiko Watanuki, Fumio Togawa:
Some signals of emotional arousal: analysis of conversations using a multimodal interaction database. 1165-1168 - Gösta Bruce, Björn Granström, Marcus Filipsson, Kjell Gustafson, Merle Horne, David House, Birgitta Lastow, Paul Touati:
Speech synthesis in spoken dialogue research. 1169-1172 - Mary Zajicek, Ken Brownsey, Simon Lippmann, Patrice Palau, Phyl Greenhead:
Dynamically created dialogues for automated telephone answering using uncertain reasoning and linguistic theory. 1173-1176 - Yasuhisa Niimi, Yutaka Kobayashi:
Modeling dialogue control strategies to relieve speech recognition errors. 1177-1180 - Anders Baekgaard:
Constraining of input media in a spoken dialogue system. 1181-1184 - Jana Klecková, Václav Matousek, Jana Netrvalova:
An Automatic Creation of the Language Model for the Spontaneous Czech Speech Recognizer. 1185-1188 - Mary O'Kane, P. E. Kenne, Hamish G. Pearcy:
A grammar of conversational English. 1189-1192 - Eric Brison, Nadine Vigouroux:
Robust comprehension in a spoken dialog system. 1193-1196 - Youngjik Lee, Young-Sum Kim, Jung-Chul Lee, Joon-Hyung Ryoo, Jae-Woo Yang:
Korean-Japanese speech translation system for hotel reservation - Korean front desk side. 1197-1200
Miscellaneous on Language Processing
- Sung-Chien Lin, Lee-Feng Chien, Keh-Jiann Chen, Lin-Shan Lee:
Unconstrained speech retrieval for Chinese document databases with very large vocabulary and unlimited domains. 1203-1206 - Cristina Delogu, Andrea Di Carlo, Rino Falcone:
Integrating partial syntactical analysis and plan recognition for understanding DB natural language queries. 1207-1211 - Yuri A. Kosarev:
Communicative language model: structure and functioning. 1211-1214 - Adelaide Stevenur, Patrick Gallinari:
Integrating heuristic preferences into a neural understanding system. 1215-1218 - Björn Gambäck, Martin Eineborg, Mikael Eriksson, Barbro Ekholm, Bertil Lyberg, Tomas Svensson:
A language interface to a polyphone-based speech synthesizer. 1219-1223 - Georg Niklfeld, Hannes Pirker, Harald Trost:
Using two-level morphology as a generator-synthesizer interface for concept-to-speech. 1223-1226 - Myoung-Wan Koo, Il-Hyun Sohn, Woosung Kim, Du-Seong Chang:
KT-STS: a speech translation system for hotel reservation and a continuous speech recognition system for speech translation. 1227-1230 - Juan Miguel Vilar, Andrés Marzal, Enrique Vidal:
Learning language translation in limited domains using finite-state models: some extensions and improvements. 1231-1234 - David G. Novick, Karen Ward, Benjamin Corliss:
The effect of context on the intelligibility of dialogue. 1235-1238
Language Modelling I, II
- Ute Kilian, Fritz Class, Alfred Kaltenmeier, Peter Regel-Brietzmann:
Representation of a finite state grammar as bigram language model for continuous speech recognition. 1241-1244 - M. Generet, Hermann Ney, Frank Wessel:
Extensions of absolute discounting for language modeling. 1245-1248 - Loreia Moisa, Egidio P. Giachin:
Automatic clustering of words for probabilistic language models. 1249-1253 - Sven C. Martin, Jörg Liermann, Hermann Ney:
Algorithms for bigram and trigram word clustering. 1253-1256 - Joerg P. Ueberla:
More efficient clustering of n-grams for statistical language modeling. 1257-1260 - Uta Naeve, Gudrun Socher, Gernot A. Fink, Franz Kummert, Gerhard Sagerer:
Generation of language models using the results of image analysis. 1739-1742 - Gudrun Flach:
Modelling pronunciation variability for special domains. 1743-1746 - Nick Cremelie, Jean-Pierre Martens:
On the use of pronunciation rules for improved word recognition. 1747-1750 - Jin'ichi Murakami:
Reducing memory requirements and computational costs for the baum-welch algorithm and application to automatic stochastic network grammar acquisition. 1751-1754 - Stefan Besling, Hans-Günter Meier:
Language model speaker adaptation. 1755-1758 - Jerneja Gros, R. Mihelic, Nikola Pavesic:
Sentence hypothesisation using NG-gram models. 1759-1762 - Ronald Rosenfeld:
Optimizing lexical and N-gram coverage via judicious use of linguistic data. 1763-1766 - Marcus Spies:
A language model for compound words in speech recognition. 1767-1770
Benchmarking and Assessment I-II
- Johannes Müller, Holger Stahl:
Collecting and analyzing spoken utterances for a speech controlled application. 1437-1440 - Hiromi Nagabuchi, Akira Takahashi, Mineyoshi Ogawa:
Speech intelligibility and loudness assessment in a wireless personal communication. 1441-1444 - Kate S. Hone, Robert W. Series, Chris Baber:
An experimental investigation of the input and error correction strategies used by subjects entering digits with the AURIX speech recogniser. 1445-1448 - Karim Belhoula:
Goal-directed generation of intelligibility test vocabularies in the framework of names synthesis. 1449-1452 - Reinhold Haeb-Umbach, Stephan Gamm:
Human factors of a voice-controlled car stereo. 1453-1456 - Niels Ole Bernsen, Hans Dybkjær, Laila Dybkjær:
Exploring the limits of system-directed dialogue, dialogue evaluation of the danish dialogue system. 1457-1460 - David A. van Leeuwen, Leo-Geert van den Berg, Herman J. M. Steeneken:
Human benchmarks for speaker independent large vocabulary recognition performance. 1461-1464 - Alison Simons:
Predictive assesment for speaker independent isolated word recognisers. 1465-1468 - Satoshi Kobayashi, Shigeyoshi Kitazawa:
Consistency of inter-transcribers' transcription. 1263-1266 - H. Klaus, A. Niebank:
Comparison of reference system approaches for the quality assessment of synthesized speech. 1267-1270 - Herman J. M. Steeneken, David A. van Leeuwen:
Multi-lingual assessment of speaker independent large vocabulary speech-recognition systems: THE SQALE-PROJECT. 1271-1274 - Katarina Bartkova, Dominique Dubois, Denis Jouvet, Jean Monné:
Error analysis on field data and improved garbage HMM modelling. 1275-1278 - E. J. A. Verheijen, Floris L. van Nes, L. M. de Bruyn, Arie Hasman, Jan Willem Arends:
Interference of speech recognition feedback during diagnostic tasks. 1279-1282
Production/Biomechanics
- Béatrice Vaxelaire:
Geometric and temporal constraints in the production of French consonant sequences x-ray and acoustic data for French. 1285-1288 - Rafael Laboissière, Vittorio Sanguineti, Yohan Payan:
On the biomechanical control variables of the tongue during speech movements. 1289-1292 - Soumya Bouabana, Shinji Maeda:
Multipulse LPC modeling of articulatory movements: analysis and interpretation. 1293-1296 - Gaël Richard, M. Liu, D. Snider, H. Duncan, Qiguang Lin, James L. Flanagan, Stephen E. Levinson, Donald Davis, Scott Slimon:
Numerical simulations of fluid flow in the vocal tract. 1297-1300 - Hisayoshi Suzuki, Takayoshi Nakai, Hiroshi Sakakibara:
3-d fem analysis of sound propagation in the nasal tract. 1301-1304
Phonetics and Phonology I
- Pilar Prieto, Chilin Shih:
Effects of tonal clash on downstepped h* accents in Spanish. 1307-1310 - Mariko Kondo:
The effect of two factors related to speaking tempo on vowel devoicing in Japanese. 1311-1314 - Rob Goedemans, Vincent J. van Heuven:
Duration perception in subsyllabic constituents. 1315-1318 - Dick R. van Bergem:
Experimental evidence for a comprehensive theory of vowel reduction. 1319-1322 - John J. Ohala:
Clear speech does not exaggerate phonemic contrast. 1323-1326 - Susan Fitt:
The pronunciation of unfamiliar native and non-native town names. 2227-2230 - Joakim Gustafson:
Using two-level morphology to transcribe Swedish names. 2231-2234 - Didier Demolin, Jean-Marie Hombert, Véronique Lecuit, Christoph Segebarth, Alain Soquet:
An MRI study of French vowels. 2235-2238 - Marie-Josep Solé, E. Estebas:
Connected speech processes: a cross-linguistic study. 2239-2242 - Sabine Deligne, François Yvon, Frédéric Bimbot:
Variable-length sequence matching for phonetic transcription using joint multigrams. 2243-2246 - Gary N. Tajchman, Eric Fosler, Daniel Jurafsky:
Building multiple pronunciation models for novel words using exploratory computational phonology. 2247-2250 - Lourdes Aguilar, Maria Machuca:
Pragmatic factors affecting the phonetic properties of diphthongs. 2251-2254 - Edda Farnetani:
The spatial and the temporal dimensions of consonant reduction in conversational Italian. 2255-2258 - Steven Gillis, Georges De Schutter, Jo Verhoeven:
Neutralization of consonant length: the case of dutch intervocalic stops. 2259-2262 - Manabu Kotani, Haruya Matsumoto:
Sound perception between two languages based on analyses of onomatopoeic expression. 2263-2266
Language Identification
- Yonghong Yan, Etienne Barnard:
An approach to language identification with enhanced language model. 1351-1354 - Peter Nowell, Roger K. Moore:
The application of dynamic programming techniques to non-word based topic spotting. 1355-1358 - Itahashi Shuichi, Du Liang:
Language identification based on speech fundamental frequency. 1359-1362 - Michael A. Lund, Herbert Gish:
Two novel language model estimation techniques for statistical language identification. 1363-1366 - HingKeung Kwan, Keikichi Hirose:
Recognized phoneme-based N-gram modeling in automatic language identification. 1367-1370
Pattern Recognition and HMM's
- Amparo Varona, Inés Torres, Francisco Casacuberta:
Discriminative-transitional/steady units for Spanish continuous speech recognition. 1471-1474 - Ji Ming, Peter O'Boyle, Francis Jack Smith:
An HMM with optimized segment-dependent observations for speech recognition. 1475-1478 - Thierry Moudenc, Denis Jouvet, Jean Monné:
Improving recognition performances on field data with an a-priori segmentation of the speech signal. 1479-1482 - Lutz Welling, Hermann Ney, Andreas Eiden, C. Forbrig:
Connected digit recognition using statistical template matching. 1483-1486 - Markus Falkhausen, Herbert Reininger, Dietrich Wolf:
Calculation of distance measures between hidden Markov models. 1487-1490 - Jia-Lin Shen, Lin-Shan Lee:
A chernoff distance based segmental probability model (CD-SPM) approach for Mandarin syllable recognition. 1491-1494 - María José Castro, Federico Prat, Pablo Aibar, Francisco Casacuberta:
Geometric pattern recognition techniques for acoustic-phonetic decoding of Spanish continuous speech. 1495-1498 - Enrico Bocchieri, Giuseppe Riccardi:
State tying of triphone HMM's for the 1994 AT&t ARPA ATIS recognizer. 1499-1502 - Azarshid Farhat, Douglas D. O'Shaughnessy:
A shared-distribution approach in a hidden Markov model-based continuous speech recognition system. 1503-1506 - Javier Ferreiros, José M. Pardo:
Preliminary experimentation of different methods for continuous speech recognition in Spanish. 1507-1510
Compensations for Unknown Channels
- Gerhard Doblinger:
Computationally efficient speech enhancement by spectral minima tracking in subbands. 1513-1516 - Josep M. Salavedra, Javier Hernando, Enrique Masgrau, Asunción Moreno:
Robust hos-based techniques applied to speech recognition and enhancement. 1517-1520 - Kuldip K. Paliwal:
A maximum likelihood equalization technique for robust speech recognition in adverse environments. 1521-1524 - Gérard Faucon, Régine Le Bouquin-Jeannès:
Joint system for acoustic echo cancellation and noise reduction. 1525-1528 - Jane Chang, Victor Zue:
A study of speech recognition system robustness to microphone variations. 1529-1532 - Alexandros Potamianos, Li Lee, Richard C. Rose:
A feature-space transformation for telephone based speech recognition. 1533-1536 - Ricardo de Córdoba, Xavier Menéndez-Pidal, Javier Macías Guarasa, Ascensión Gallardo-Antolín, José Manuel Pardo:
Development and improvement of a real-time ASR system for isolated digits in Spanish over the telephone line. 1537-1540 - Jen-Tzung Chien, Lee-Min Lee, Hsiao-Chuan Wang:
Channel estimation for reference model adaptation in telephone speech recognition. 1541-1544 - Hiroshi Yasukawa:
Enhancement of telephone speech quality by simple spectrum extrapolation method. 1545-1548 - Peter Handel:
Low-distortion spectral subtraction for speech enhancement. 1549-1552
Overcoming the Effects of Noise and Variability
- Zhihong Hu, Etienne Barnard, Ronald A. Cole:
Transition-based feature extraction within frame-based recognition. 1555-1558 - Laurent Girin, Gang Feng, Jean-Luc Schwartz:
Noisy speech enhancement with filters estimated from the speaker's lips. 1559-1562 - Ali Adjoudani, Christian Benoît:
Audio-visual speech recognition compared across two architectures. 1563-1566 - Sharlene A. Liu:
Noise effects on landmark detection in a speech recognition system. 1567-1570 - Laure Charonnat, Joel Crestel, Michel Glutton, Herve Chuberre:
AR identification of the vocal filter from noisy hyperbaric speech signals. 1571-1574 - Pavel Sovka, Petr Pollák:
The study of speech/pause detectors for speech enhancement methods. 1575-1578 - Robert Sokol, Guy Mercier:
Neural-fuzzy network for phonetic features recognition. 1579-1582 - A. Shamsoddini, P. N. Denbigh:
A system for speech separation. 1583-1586 - Yunxin Zhao:
Hierarchical mixture models and phonological rules in open-vocabulary speech recognition. 1587-1590
Modelling Segments and Units
- Antonio Bonafonte, Rafael Estany, Eugenio Vives:
Study of subword units for Spanish speech recognition. 1607-1610 - Wendy J. Holmes, Martin J. Russell:
Speech recognition using a linear dynamic segmental HMM. 1611-1614 - Kazumasa Yamamoto, Seiichi Nakagawa:
Comparative evaluation of segmental unit input HMM and conditional density HMM. 1615-1618 - Shoichi Matsunaga, Takeshi Matsumura, Harald Singer:
Continuous speech recognition using non-uniform unit based acoustic and language models. 1619-1622 - C. Simon Blackburn, Steve J. Young:
Towards improved speech recognition using a speech production model. 1623-1626
Out of Vocabulary Recognition
- Rafid A. Sukkar, Chin-Hui Lee, Biing-Hwang Juang:
A vocabulary independent discriminatively trained method for rejection of non-keywords in sub word based speech recognition. 1629-1632 - Juan Carlos Torrecilla, Daniel Tapias Merino, F. Javier Caminero-Gil, Luis Villarrubia:
Rejection techniques based on context independent subword units. 1633-1636 - Pablo Fetter, Fritz Class, Udo Haiber, Alfred Kaltenmeier, Ute Kilian, Peter Regel-Brietzmann:
Detection of unknown words in spontaneous speech. 1637-1640 - Atsushi Nakamura:
A minimum error training of garbage model for keyword spotter with artificially generated training data. 1641-1644 - I. Lee Hetherington:
New words: effect on recognition performance and incorporation issues. 1645-1648
Neural Networks I-IV
- David O. Baldwin, Georg F. Meyer:
Improving speech recognition using speaker classification. 1651-1654 - Axel Glaeser:
Modular neural networks with task-specific input parameters for speakerindependent speech recognition. 1655-1658 - Gerhard Rigoll, Christoph Neukirchen, Jörg Rottland:
Large vocabulary speaker-independent continuous speech recognition with a new hybrid system based on MMI-neural networks. 1659-1662 - Hervé Bourlard, Yochai Konig, Nelson Morgan:
REMAP: recursive estimation and maximization of a posteriori probabilities in connectionist speech recognition. 1663-1666 - Tan Lee, P. C. Ching, Lai-Wan Chan:
An RNN based speech recognition system with discriminative training. 1667-1670 - M. Asunción Castaño, Enrique Vidal, Francisco Casacuberta:
Preliminary experiments for automatic speech understanding through simple recurrent networks. 1673-1676 - Ha-Jin Yu, Yung-Hwan Oh:
A neural network using non-uniform units for continuous speech recognition. 1677-1680 - Horacio Franco, Vassilios Digalakis:
Temporal correlation modeling in a hybrid neural network/hidden Markov model speech recognizer. 1681-1684 - Laurent Buniet, Dominique Fohr:
Continuous speech segmentation with the gamma memory model. 1685-1688 - Xavier Menéndez-Pidal, Ricardo de Córdoba, Javier Ferreiros, José Manuel Pardo:
Incorporating fuzzy modelling in a hybrid HMM-ANNs system for CSR tasks. 1689-1692 - Wolfgang Reichl, S. Harengel, Franz Wolfertstetter, Günther Ruske:
Neural networks for nonlinear discriminant analysis in continuous speech recognition. 2163-2166 - Gerhard Rigoll:
Speech recognition experiments with a new multilayer LVQ network (MLVQ). 2167-2170 - João Paulo Neto, Luís B. Almeida, Mike Hochberg, Ciro Martins, Luís Nunes, Steve Renals, Tony Robinson:
Speaker-adaptation for hybrid HMM-ANN continuous speech recognition system. 2171-2174 - Premysl Puzrla, Frédéric Bimbot, Christoph Windheuser:
Distributed binary representations for word recognition by TDNN-DTW hybrid systems. 2175-2178 - Yolande Anglade:
A robust discrimination method based on selectively trained neural networks for confusable words in noisy conditions. 2179-2182 - Victor Abrash, Horacio Franco, Ananth Sankar, Michael Cohen:
Connectionist speaker normalization and adaptation. 2183-2186 - Pedro L. Galindo:
A competitive algorithm for training HMM for speech recognition. 2187-2190 - Martin Paping, Hans Marti, Mark Renfer:
Predictive connectionist speech recognition with a new discriminant learning algorithm. 2193-2196 - Antonio J. Rubio, Ronan G. Reilly:
Preliminary results on speech signal segmentation with recurrent neural networks. 2197-2200 - Klára Vicsi, Attila Vig:
Text independent neural network/rule based hybrid, continuous speech recognition. 2201-2204 - Alfred Ying Pang Ng, P. C. Ching, Lai-Wan Chan:
Automatic recognition of Cantonese lexical tones in connected speech by multi-layer perceptron. 2205-2208 - Dave Abberley, Phil D. Green:
Combining HMM processing and formant measurements in automatic speech recognition. 2209-2212 - Kyungmin Na, Jekwan Ryu, Dong-Il Chang, Soo-Ik Chae, Souguil Ann:
Recurrent neural prediction models for speech recognition. 2213-2216 - Linda Djezzar, Jean Paul Haton:
Exploiting acoustic-phonetic knowledge and neural networks for stop recognition. 2217-2220 - Pedro Gómez, Victoria Rodellar, Agustín Álvarez, Jesús Bobadilla, Jesus Bernal, Victor Nieto Lluis, Mercedes Pérez:
Estimation of speech formant-dynamics using neural networks. 2221-2224
Human Word Recognition
- Willy Jongenburger, Vincent J. van Heuven:
The role of linguistic stress in the time course of word recognition in stress-accent languages. 1695-1698 - Béatrice de Gelder, Paul Bertelson, Jean Vroomen, Hsuan Chin Chen:
Inter-language differences in the mcgurk effect for dutch and Cantonese listeners. 1699-1702 - Takashi Otake, Sally M. Davis, Anne Cutler:
Listeners representations of within-word structure: a cross-linguistic and cross-dialectal investigation. 1703-1706 - James M. McQueen, Ethan Cox:
The use of phonotactic constraints in the segmentation of dutch. 1707-1710 - Jean Vroomen, Béatrice de Gelder:
Lexical inhibition in spoken word recognition. 1711-1714
Pathology and Communications I-II
- Maurílio Nunes Vieira, Fergus R. McInnes, Mervyn A. Jack, Arnold G. D. Maran, Colin Watson, Moira Little:
Methodological aspects in a multimedia database of vocal fold pathologies. 1867-1870 - V. Udayashankara, A. P. Shivaprasad:
The application of volterra LMS adaptive filtering to speech enhancement for the hearing impaired. 1871-1874 - Paolo Rosso, J. H. Wright, M. Smith:
CAPDA: managing intelligibility in children and young adults with down's syndrome or speech disorders. 1875-1878 - Alan Wrench, Mary S. Jackson, David S. Soutar, A. Gerry Robertson, Janet MacKenzie Beck:
Evaluation of a system for segmental speech quality assessment: voiceless fricatives. 1879-1882 - Bernard Teston, Benoit Galindo:
A diagnostic and rehabilitation aid workstation for speech and voice pathologies. 1883-1886 - Géza Németh, Gábor Olaszy, Laszlo Pataki, Luis A. Hernández Gómez, Diamantino Freitas:
Improvement, evaluation and testing of a low cost multilingual portable speaking aid for the speech impaired. 1887-1890 - Dirk Michaelis, Hans Werner Strube:
Empirical study to test the independence of different acoustic voice parameters on a large voice database. 1891-1894 - Sergio Daniel Cano-Ortiz, Daniel I. Escobedo Beceiro, Manuel Socarras Reyes:
The spectral analysis of infant cry: an initial approximation. 1895-1898 - Nobumasa Seiyama, Akira Nakamura, Atsushi Imai, Tohru Takagi, Eiichi Miyasaka:
Portable speech rate conversion system. 1717-1720 - Jialu Zhang:
A field test of sivo aid in China. 1721-1724 - Takayuki Arai, Keiko Okazaki, Setsuko Imatomi, Yuichi Yoshida:
Analysis for palatalized articulation of [s] sounds using synthetic speech. 1725-1728 - Krzysztof Marasek:
An attempt to classify LX signals. 1729-1732 - Jean Schoentgen, Raoul De Guchteneere:
Time series analysis of glottal cycle lengths of healthy and dysphonic speakers. 1733-1736
Language Modelling III and Syntactic Analysis
- Ernst Günter Schukat-Talamazzini, R. Hendrych, Ralf Kompe, Heinrich Niemann:
Permugram language models. 1773-1776 - Steffen Staab:
GLR-parsing of word lattices using a beam search method. 1777-1780 - Stephanie Seneff, Michael K. McCandless, Victor Zue:
Integrating natural language into the word graph search for simultaneous speech recognition and understanding. 1781-1784 - Joshua Koppelman, Stephen Della Pietra, Mark Epstein, Salim Roukos, Todd Ward:
A statistical approach to language modelling for the ATIS task. 1785-1788 - Gareth J. F. Jones, Harvey Lloyd-Thomas, Jeremy H. Wright:
Lattice parsing and application of integrated language models for speech recognition. 1789-1792 - Manny Rayner, Peter J. Wyard:
Robust parsing of n-best speech hypothesis lists using a general grammar-based language model. 1793-1796 - Fabio Brugnara, Mauro Cettolo:
Improvements in tree-based language model representation. 1797-1800 - P. E. Kenne, Mary O'Kane, Hamish G. Pearcy:
Language modeling of spontaneous speech in a court context. 1801-1804
Linguistics Aspects of TTS
- Chilin Shih:
Study of vowel variations for a Mandarin speech synthesizer. 1807-1810 - Eric Sanders, Paul Taylor:
Using statistical models to predict phrase boundaries for speech synthesis. 1811-1814 - Mark Tatham, Eric Lewis:
Naturalness in a high-level synthetic speech system. 1815-1818 - Katherine Morton, Mark Tatham:
Pragmatic effects in speech synthesis. 1819-1822 - Hansjörg Mixdorff, Hiroya Fujisaki:
A scheme for a model-based synthesis by rule of F0 contours of German utterances. 1823-1826 - Yasushi Ishikawa, Kunio Nakajima:
Speech synthesis by rule based on synthesis units considering prosodic features. 1827-1830 - Joop Kerkhoff, Toni C. M. Rietveld:
The generation of prosody in the nijmegen rule oriented speech synthesis system. 1831-1834 - Vincent Pean, Anne Lacheret-Dujour:
Phonological rules modelling style variations of 'e' caduc in French parisian spontaneous speech for text-to-speech synthesis. 1835-1838
Articulatory Modelling in Synthesis
- Dongbing Wei, J. W. Devaney, Colin C. Goodyear:
Voiced diphone synthesis using a parametric model and formant based mapping. 1841-1844 - Debra Yarrington, H. Timothy Bunnell, Gene Ball:
Robust automatic extraction of diphones with variable boundaries. 1845-1848 - Andrew Richard Greenwood:
Generation of articulatory synthesiser parameters from formant frequencies using a cubic mapping function. 1849-1852 - Raymond N. J. Veldhuis, I. J. M. Bogaert, N. J. C. Lous:
Two-mass models for speech synthesis. 1853-1856 - Mats Båvegård:
Introducing a parametric consonantal model to the articulatory speech synthesiser. 1857-1860 - Nobuyuki Katae, Tatsuro Matsumoto, Shinta Kimura, Mitsuko Kaseda, Takayuki Ohyama:
High-quality Japanese text-to-speech system: NARSYS. 1861-1864
Production
- Mohamed Yeou:
An investigation of locus equations as a source of information for consonantal place. 1901-1904 - Nathalie Parlangeau, Régine André-Obrecht, Alain Marchal:
Automatic labelling of multi-sensor speech database: issues and perspectives. 1905-1908 - R. J. J. H. van Son, Louis C. W. Pols:
What does consonant reduction look like, if it exists? 1909-1912 - Gérard Bailly, Louis-Jean Boë, Nathalie Vallée, Pierre Badin:
Articulatori-acoustic vowel prototypes for speech production. 1913-1916 - Michel Pitermann, Jean Schoentgen:
Experimental study of the target theory of vowel production. 1917-1920 - Rudolph Sock, Pascal Perrier, Anders Löfqvist:
Comparing tongue kinematic and acoustic phasing patterns for vowel quantity contrasts in WOLOF. 1921-1924 - Pascal Perrier, Lian Apostol, Yohan Payan:
Evaluation of a vowel normalisation procedure based on speech production knowledge. 1925-1928 - Santha Sampath:
Dialect specific features of australian English diphthongs in spontaneous speech. 1929-1932 - Christophe Vescovi, Eric Castelli, Xavier Pelorson:
Adaptation of a two-mass model of the vocal cords to a particular speaker. 1933-1936
Spoken Dialogue Systems
- Stephen Springer, Sara Basson, Ashok Kalyanswamy, Edward Man, Dina Yashchin:
The money talks interactive speech technology assessment: a report from the field. 1939-1942 - Jacques Siroux, Marc Guyomard, Y. Jolly, Franck Multon, Christophe Remondeau:
Speech and tactile-based georal system. 1943-1946 - Norman M. Fraser, J. H. Simon Thornton:
Vocalist: a robust, portable spoken language dialogue system for telephone applications. 1947-1950 - Masami Suzuki, Naomi Inoue, Fumihiro Yato, Kazuya Takeda, Seiichi Yamamoto:
A prototype of a Japanese-Korean realtime speech translation system. 1951-1954 - B. L. Zeigler, Baruch Mazor:
Query-response relationships in the oasis speech-recognition system. 1955-1958
Spoken Dialogue Systems II (Development and Tools)
- Lori Lamel, Sophie Rosset, Samir Bennacef, Hélène Bonneau-Maynard, Laurence Devillers, Jean-Luc Gauvain:
Development of spoken language corpora for travel information. 1961-1964 - Giovanni Flammia, Victor Zue:
Empirical evaluation of human performance and agreement in parsing discourse constituents in spoken dialogue. 1965-1968 - Tsuneo Nitta, Mika Amamiya, Hiroyuki Kamio, Hiroshi Matsu'ura, Arisa Uchiyama, Masafumi Tamura:
Multimodal spoken dialogue systems and rapid-prototyping. 1969-1972 - Lars Bo Larsen:
Development and evaluation of a spoken dialogue for a telephone based transaction system. 1973-1976 - Hermann Hild, Alex Waibel:
Integrating spelling into spoken dialogue recognition. 1977-1980
Overview of Robust Speech Recognition Techniques
- Mark J. F. Gales, Steve J. Young:
The application of parallel model combination to a large vocabulary dictation task. 1983-1986 - Chafic Mokbel, Denis Jouvet, Jean Monné:
Blind equalization using adaptive filtering for improving speech recognition over telephone. 1987-1990 - Yuchang Cao, Sridha Sridharan, Miles Moody:
Speech-seeking microphone array with multi-stage processing. 1991-1994 - Halewijn Vereecken, Jean-Pierre Martens:
Recognition of noisy speech using an auditory model. 1995-1998 - Brian D. Womack, John H. L. Hansen:
Stress independent robust HMM speech recognition using neural network stress classification. 1999-2002
Speech Processing with Multiple Microphones
- Klaus Linhard:
Speech enhancement using two versions of the noisy speech signal. 2005-2008 - Rainer Martin:
Design and optimization of a two microphone speech enhancement system. 2009-2012 - Martin Drews:
Time delay estimation for microphone array speech enhancement systems. 2013-2016 - Yuchang Cao, Sridha Sridharan, Miles Moody:
Speech enhancement by eigen decomposition with two-channel observations. 2017-2020 - Diego Giuliani, Marco Matassoni, Maurizio Omologo, Piergiorgio Svaizer:
Robust continuous speech recognition using a microphone array. 2021-2024
Prosody: Models and Data
- Joaquim Llisterri, Rafael Mann, Carme de la Mota, Antonio Rios:
Factors affecting F0 peak displacement in Spanish. 2061-2064 - Véronique Aubergé, Gérard Bailly:
Generation of intonation: a global approach. 2065-2068 - Peter E. Czigler, Dawn M. Behne:
Prevocalic consonant duration in Swedish: effects of vowel quality and postvocalic place of articulation. 2069-2072 - P. Vitez, Véronique Aubergé:
Intonation gesture of slovene: first indications. 2073-2076 - Barbara Heuft, Thomas Portele:
Intrinsic prosodic values and segmental context. 2077-2080 - Andy Tams, Mark Tatham, Julian H. Page:
Describing speech styles using prosody: a pilot study. 2081-2084 - C. Franchon Cabrera:
Stress and intonation in Spanish for affirmative and interrogative sentences. 2085-2088 - Michael Lehning:
Statistical methods for the automatic labelling of German prosody. 2089-2092
Out of Vocabulary and Connected Speech
- Atsuhiko Kai, Seiichi Nakagawa:
Investigation on unknown word processing and strategies for spontaneous speech understanding. 2095-2098 - F. Javier Caminero-Gil, Celinda de la Torre-Munilla, Luis A. Hernández Gómez, Cesar Martín del Alamo:
New n-best based rejection techniques for improving a real-time telephonic connected word recognition system. 2099-2102 - Hiroyuki Sakamoto, Shoichi Matsunaga:
Detection of unknown words using garbage cluster models for continuous speech recognition. 2103-2106 - A. Jusek, Gernot A. Fink, Franz Kummert, Heike Rautenstrauch, Gerhard Sagerer:
Detection of unknown words and its evaluation. 2107-2110 - Seung-Bae Lee, Lag-Yong Kim, Min-Seong Kim, Jong-Seok Lee, Shin-Wook Kang:
Minimum duration constrained non-keyword modeling and rejection for word spotting. 2111-2114 - Sari Accaino, Bart D'hoore, Johan Vantieghem, Dirk Van Compernolle:
Rejection capabilities for HMM-based speech recognizers. 2115-2118 - Joan Salavedra, Claus Jacobsen, Mazin G. Rahim, Ilija Zeljkovic, Jay G. Wilpon:
Multi-lingual connected digits recognition. 2119-2122 - Celinda de la Torre-Munilla, Luis A. Hernández Gómez, F. Javier Caminero-Gil, Cesar Martín del Alamo:
Recognition of spontaneously spoken connected numbers in Spanish over the telephone line. 2123-2126
Word Spotting
- Rachida El Méliani, Douglas D. O'Shaughnessy:
Lexical fillers for task-independent-training based keyword spotting and detection of new words. 2129-2132 - Michael J. Carey, Eluned S. Parris:
Topic spotting with task independent models. 2133-2136 - Toshiyuki Hanazawa, Yoshiharu Abe, Kunio Nakajima:
Phrase spotting using pitch pattern information. 2137-2140 - Yoshiaki Itoh, Jiro Kiyama, Ryuichi Oka:
Speech understanding and speech retrieval for TV news by using connected word spotting. 2141-2144 - J. T. Foote, Gareth J. F. Jones, Karen Sparck Jones, Steve J. Young:
Talker-independent keyword spotting for information retrieval. 2145-2148 - Philippe Jeanrenaud, Man-Hung Siu, Herbert Gish:
Large vocabulary word scoring as a basis for transcription generation. 2149-2152 - Jin'ichi Murakami:
New word spotting algorithm based on forward decoding. 2153-2156 - H. Klemm, Fritz Class, Ute Kilian:
Word- and phrase spotting with syllable-based garbage modelling. 2157-2160
Perception
- Li Liu, Jialong He, Günther Palm:
Influence of short-time phase on the perception of stop consonants. 2269-2272 - Andrew Simpson, Valérie Hazan:
Enhancing the perceptual salience of information-rich regions of natural intervocalic consonants. 2273-2276 - R. J. J. H. van Son:
A method to quantify the error distribution in confusion matrices. 2277-2280 - Luis E. López-Bascuas:
Speech and nonspeech signal densities for the perception of temporal order. 2281-2284 - Eduardo Sá Marta, Fernando Perdigão, Luís Vieira de Sá:
Researching the processing structures of human phoneme recognition by analysis of natural stop-consonant-vowel utterances that elicit correct recognition through unusual acoustic patterns. 2285-2288 - Kirk A. Widdison:
The perception of voicing in Spanish sibilants. 2289-2292 - Elzbieta B. Slawinski:
Development of the perception of initial prevocalic [r] and [l] by English children. 2293-2296 - Francesco Cutugno, Renata Savy:
On phonetic boundaries across categories for synthetic and natural vocalic speech sounds. 2297-2300 - Athanassios Protopapas, Steven Finney, Peter D. Eimas:
Effects of syllabic position in the perception of spoken English. 2301-2304
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.