default search action
IberSPEECH 2018: Barcelona, Spain
- Jordi Luque, Antonio Bonafonte, Francesc Alías Pujol, António J. S. Teixeira:
Fourth International Conference, IberSPEECH 2018, Barcelona, Spain, 21-23 November 2018, Proceedings. ISCA 2018
Speaker Recognition
- Victoria Mingote, Antonio Miguel, Alfonso Ortega, Eduardo Lleida:
Differentiable Supervector Extraction for Encoding Speaker and Phrase Information in Text Dependent Speaker Verification. 1-5 - Ignacio Viñals, Alfonso Ortega, Antonio Miguel, Eduardo Lleida:
Phonetic Variability Influence on Short Utterances in Speaker Verification. 6-9 - Umair Khan, Pooyan Safari, Javier Hernando:
Restricted Boltzmann Machine Vectors for Speaker Clustering. 10-14 - Esther Rituerto-González, Ascensión Gallardo-Antolín, Carmen Peláez-Moreno:
Speaker Recognition under Stress Conditions. 15-19
Keynote 1
- Tanja Schultz:
Bio signal-based Spoken Communication.
Topics on Speech Technologies
- Alp Öktem, Mireia Farrús, Antonio Bonafonte:
Bilingual Prosodic Dataset Compilation for Spoken Language Translation. 20-24 - Baybars Külebi, Alp Öktem:
Building an Open Source Automatic Speech Recognition System for Catalan. 25-29 - Oriol Barbany, Antonio Bonafonte, Santiago Pascual:
Multi-Speaker Neural Vocoder. 30-34 - Andrés Piñeiro Martín, Carmen García-Mateo, Laura Docío Fernández:
Improving the Automatic Speech Recognition through the improvement of Laguage Models. 35-39 - Mónica Domínguez, Alicia Burga, Mireia Farrús, Leo Wanner:
Towards expressive prosody generation in TTS for reading aloud applications. 40-44 - Alejandro Gómez Alanís, Antonio M. Peinado, José Andrés González López, Angel M. Gomez:
Performance evaluation of front- and back-end techniques for ASV spoofing detection systems based on deep features. 45-49 - Igor Odriozola, Inma Hernáez, Eva Navas, Luis Serrano, Jon Sánchez:
The observation likelihood of silence: analysis and prospects for VAD applications. 50-54 - Christian Salamea, Ricardo de Córdoba, Luis Fernando D'Haro, Rubén San Segundo, Javier Ferreiros:
On the use of Phone-based Embeddings for Language Recognition. 55-59 - Laura Cross Vila, Carlos Escolano, José A. R. Fonollosa, Marta R. Costa-jussà:
End-to-End Speech Translation with the Transformer. 60-63 - Javier Darna-Sequeiros, Doroteo T. Toledano:
Audio event detection on Google's Audio Set database: Preliminary results using different types of DNNs. 64-67 - Mikel de Velasco, Raquel Justo, Josu Antón, Mikel Carrilero, M. Inés Torres:
Emotion Detection from Speech and Text. 68-71 - Darío Tilves Santiago, Ian Benderitter, Carmen García-Mateo:
Experimental Framework Design for Sign Language Automatic Recognition. 72-76 - Cassio T. Batista, Ana Larissa Dias, Nelson C. Sampaio Neto:
Baseline Acoustic Models for Brazilian Portuguese Using Kaldi Tools. 77-81
ASR & Speech Applications
- Paula Lopez-Otero, Laura Docío Fernández:
Converted Mel-Cepstral Coefficients for Gender Variability Reduction in Query-by-Example Spoken Document Retrieval. 82-86 - Pablo Gimeno, Ignacio Viñals, Alfonso Ortega, Antonio Miguel, Eduardo Lleida:
A Recurrent Neural Network Approach to Audio Segmentation for Broadcast Domain Data. 87-91 - Emilio Granell, Carlos David Martínez-Hinarejos, Verónica Romero:
Improving Transcription of Manuscripts with Multimodality and Interaction. 92-96 - Cristian Tejedor García, Valentín Cardeñoso-Payo, María J. Machuca, David Escudero Mancebo, Antonio Ríos, Takuya Kimura:
Improving Pronunciation of Spanish as a Foreign Language for L1 Japanese Speakers with Japañol CAPT Tool. - Conrad Bernath, Aitor Álvarez, Haritz Arzelus, Carlos David Martínez:
Exploring E2E speech recognition systems for new languages. 102-106
Speech & Language Technologies Applied to Health
- Sneha Raman, Inma Hernáez, Eva Navas, Luis Serrano:
Listening to Laryngectomees: A study of Intelligibility and Self-reported Listening Effort of Spanish Oesophageal Speech. 107-111 - Mario Corrales-Astorgano, Pastora Martínez-Castilla, David Escudero Mancebo, Lourdes Aguilar, César González Ferreras, Valentín Cardeñoso-Payo:
Towards an automatic evaluation of the prosody of people with Down syndrome. 112-116 - Santiago Pascual, Antonio Bonafonte, Joan Serrà, José Andrés González López:
Whispered-to-voiced Alaryngeal Speech Conversion with Generative Adversarial Networks. 117-121 - Luis Serrano, David Tavarez, Xabier Sarasola, Sneha Raman, Ibon Saratxaga, Eva Navas, Inma Hernáez:
LSTM based voice conversion for laryngectomees. 122-126 - Zuzanna Parcheta, Carlos David Martínez-Hinarejos:
Sign Language Gesture Classification using Neural Networks. 127-131
Synthesis, Production & Analysis
- Marc Freixes, Marc Arnela, Joan Claudi Socoró, Francesc Alías Pujol, Oriol Guasch:
Influence of tense, modal and lax phonation on the three-dimensional finite element synthesis of vowel [A]. 132-136 - Conceição Cunha, Samuel S. Silva, António J. S. Teixeira, Catarina Oliveira, Paula Martins, Arun A. Joseph, Jens Frahm:
Exploring Advances in Real-time MRI for Speech Production Studies of European Portuguese. 137-141 - Juan M. Martín-Doñas, Iván López-Espejo, Angel M. Gomez, Antonio M. Peinado:
A postfiltering approach for dual-microphone smartphones. 142-146 - Xabier Sarasola, Eva Navas, David Tavarez, Luis Serrano, Ibon Saratxaga:
Speech and monophonic singing segmentation using pitch parameters. 147-151 - Santiago Pascual, Antonio Bonafonte, Joan Serrà:
Self-Attention Linguistic-Acoustic Decoder. 152-156
Keynote 2
- Rob Clark:
Synthesizing variation in prosody for Text-to-Speech.
Special Session: Show & Tell
- Cristian Tejedor García, Valentín Cardeñoso-Payo, David Escudero Mancebo:
Japañol: a mobile application to help improving Spanish pronunciation by Japanese native speakers. 157-158
Special Session: Ongoing Research Projects
- Juan Manuel Espín, Roberto Font, Juan Francisco Inglés-Romero, Cristina Vicente-Chicote:
Towards the Application of Global Quality-of-Service Metrics in Biometric Systems. 159-160 - David Escudero Mancebo, Valentín Cardeñoso-Payo:
Incorporation of a Module for Automatic Prediction of Oral Productions Quality in a Learning Video Game. 161-162 - José Andrés González López, Phil D. Green, Damian T. Murphy, Amelia Jane Gully, James M. Gilbert:
Silent Speech: Restoring the Power of Speech to People whose Larynx has been Removed. 163-165 - Inma Hernáez, Eva Navas, Jose Antonio Municio Martín, Javier Gomez Suárez:
RESTORE Project: REpair, STOrage and REhabilitation of speech. 166-169 - Asunción Moreno, Antonio Bonafonte, Igor Jauk, Laia Tarrés, Victor Pereira:
Corpus for Cyberbullying Prevention. 170-171 - M. Inés Torres, Gérard Chollet, César Montenegro, Jofre Tenorio-Laranga, Olga Gordeeva, Anna Esposito, Cornelius Glackin, Stephan Schlögl, Olivier Deroo, Begoñya Fernández-Ruanova, Roberto Santana, Maria Stylianou Korsnes, Fred Lindner, Daria Kyslitska, Miriam Reiner, Gennaro Cordasco, Mari Aksnes, Raquel Justo:
EMPATHIC, Expressive, Advanced Virtual Coach to Improve Independent Healthy-Life-Years of the Elderdy. 172-173
Special Session: PhD Thesis
- Emilio Granell, Carlos David Martínez-Hinarejos, Verónica Romero:
Advances on the Transcription of Historical Manuscripts based on Multimodality, Interactivity and Crowdsourcing. 174-178 - Alicia Lozano-Diez, Joaquin Gonzalez-Rodriguez, Javier Gonzalez-Dominguez:
Bottleneck and Embedding Representation of Speech for DNN-based Language and Speaker Recognition. 179-183 - Omid Ghahabi:
Deep Learning for i-Vector Speaker and Language Recognition: A Ph.D. Thesis Overview. 184-188 - Igor Jauk:
Unsupervised Learning for Expressive Speech Synthesis. 189-193
Albayzin Challenges: Multimodal Diarization
- Benjamin Maurice, Hervé Bredin, Ruiqing Yin, Jose Patino, Héctor Delgado, Claude Barras, Nicholas W. D. Evans, Camille Guinaudeau:
ODESSA/PLUMCOT at Albayzin Multimodal Diarization Challenge 2018. 194-198 - Miquel Angel India Massana, Itziar Sagastiberri, Ponç Palau, Elisa Sayrol, Josep Ramon Morros, Javier Hernando:
UPC Multimodal Speaker Diarization System for the 2018 Albayzin Challenge. 199-203 - Eduardo Ramos-Muguerza, Laura Docío Fernández, José Luis Alba-Castro:
The GTM-UVIGO System for Audiovisual Diarization. 204-207
Albayzin Challenges: Speaker Diarization
- Diego Castán, Mitchell McLaren, Mahesh Kumar Nandwana:
The SRI International STAR-LAB System Description for IberSPEECH-RTVE 2018 Speaker Diarization Challenge. 208-210 - Jose Patino, Héctor Delgado, Ruiqing Yin, Hervé Bredin, Claude Barras, Nicholas W. D. Evans:
ODESSA at Albayzin Speaker Diarization Challenge 2018. 211-215 - Omid Ghahabi, Volker Fischer:
EML Submission to Albayzin 2018 Speaker Diarization Challenge. 216-219 - Ignacio Viñals, Pablo Gimeno, Alfonso Ortega, Antonio Miguel, Eduardo Lleida:
In-domain Adaptation Solutions for the RTVE 2018 Diarization Challenge. 220-223 - Alicia Lozano-Diez, Beltran Labrador, Diego de Benito, Pablo Ramirez, Doroteo T. Toledano:
DNN-based Embeddings for Speaker Diarization in the AuDIaS-UAM System for the Albayzin 2018 IberSPEECH-RTVE Evaluation. 224-226 - Edward L. Campbell, Gabriel Hernández, José Ramón Calvo de Lara:
CENATAV Voice-Group Systems for Albayzin 2018 Speaker Diarization Evaluation Campaign. 227-230 - Abbas Khosravani, Cornelius Glackin, Nazim Dugan, Gérard Chollet, Nigel Cannings:
The Intelligent Voice System for the IberSPEECH-RTVE 2018 Speaker Diarization Challenge. 231-235 - Zili Huang, L. Paola García-Perera, Jesús Villalba, Daniel Povey, Najim Dehak:
JHU Diarization System Description. 236-239
Albayzin Challenges: Search on Speech
- Paula Lopez-Otero, Laura Docío Fernández:
GTM-IRLab Systems for Albayzin 2018 Search on Speech Evaluation. 240-244 - Maria Cabello, Doroteo T. Toledano, Javier Tejedor:
AUDIAS-CEU: A Language-independent approach for the Query-by-Example Spoken Term Detection task of the Search on Speech ALBAYZIN 2018 evaluation. 245-248 - Luis Javier Rodríguez-Fuentes, Mikel Peñagarikano, Amparo Varona, Germán Bordel:
GTTS-EHU Systems for the Albayzin 2018 Search on Speech Evaluation. 249-253 - Ana R. Montalvo, Jose M. Ramirez, Alejandro Roble, José R. Calvo:
Cenatav Voice Group System for Albayzin 2018 Search on Speech Evaluation. 254-256
Albayzin Challenges: Speech to Text
- Javier Jorge, Adria A. Martinez-Villaronga, Pavel Golik, Adrià Giménez, Joan Albert Silvestre-Cerdà, Patrick Doetsch, Vicent Andreu Císcar, Hermann Ney, Alfons Juan, Albert Sanchís:
MLLP-UPV and RWTH Aachen Spanish ASR Systems for the IberSpeech-RTVE 2018 Speech-to-Text Transcription Challenge. 257-261 - Juan M. Perero-Codosero, Javier Antón-Martín, Daniel Tapias Merino, Eduardo López Gonzalo, Luis A. Hernández Gómez:
Exploring Open-Source Deep Learning ASR for Speech-to-Text TV program transcription. 262-266 - Haritz Arzelus, Aitor Álvarez, Conrad Bernath, Eneritz García, Emilio Granell, Carlos David Martínez-Hinarejos:
The Vicomtech-PRHLT Speech Transcription Systems for the IberSPEECH-RTVE 2018 Speech to Text Transcription Challenge. 267-271 - Nazim Dugan, Cornelius Glackin, Gérard Chollet, Nigel Cannings:
Intelligent Voice ASR system for Iberspeech 2018 Speech to Text Transcription Challenge. 272-276 - Laura Docío Fernández, Carmen García-Mateo:
The GTM-UVIGO System for Albayzin 2018 Speech-to-Text Evaluation. 277-280
Text & NLP Applications
- Anna Pompili, Alberto Abad, David Martins de Matos, Isabel Pavão Martins:
Topic coherence analysis for the classification of Alzheimer's disease. 281-285 - Eszter Iklódi, Gábor Recski, Gábor Borbély, María José Castro Bleda:
Building a global dictionary for semantic technologies. 286-290 - Juan María Garrido, Marta Codina, Kimber Fodge:
TransDic, a public domain tool for the generation of phonetic dictionaries in standard and dialectal Spanish and Catalan. 291-295 - Jorge Llombart, Antonio Miguel, Alfonso Ortega, Eduardo Lleida:
Wide Residual Networks 1D for Automatic Text Punctuation. 296-300 - Eugénio Ribeiro, Ricardo Ribeiro, David Martins de Matos:
End-to-End Multi-Level Dialog Act Recognition. 301-305
Keynote 3
- Lluís Màrquez:
Automatic Question Answering: Problem Solved?
Round Table
- Marta R. Costa-jussà:
Panel discussion on Speech technologies: Industry and Academy.
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.