default search action
23rd SPECOM 2021: St. Petersburg, Russia
- Alexey Karpov, Rodmonga Potapova:
Speech and Computer - 23rd International Conference, SPECOM 2021, St. Petersburg, Russia, September 27-30, 2021, Proceedings. Lecture Notes in Computer Science 12997, Springer 2021, ISBN 978-3-030-87801-6 - Jahangir Alam, Abderrahim Fathan, Woo Hyun Kang:
Text-Independent Speaker Verification Employing CNN-LSTM-TDNN Hybrid Networks. 1-13 - Jahangir Alam, Abderrahim Fathan, Woo Hyun Kang:
End-to-End Voice Spoofing Detection Employing Time Delay Neural Networks and Higher Order Statistics. 14-25 - Nuno Almeida, Conceição Cunha, Samuel S. Silva, António Teixeira:
Assessing Velar Gestures Timing in European Portuguese Nasal Vowels with RT-MRI Data. 26-35 - Nuno Almeida, Diogo Cunha, Samuel S. Silva, António Teixeira:
Designing and Deploying an Interaction Modality for Articulatory-Based Audiovisual Speech Synthesis. 36-49 - Arash Amani, Mohammad MohammadAmini, Hadi Veisi:
Kurdish Spoken Dialect Recognition Using X-Vector Speaker Embedding. 50-57 - Yu Bai, Cristian Tejedor García, Ferdy Hubers, Catia Cucchiarini, Helmer Strik:
An ASR-Based Tutor for Learning to Read: How to Optimize Feedback to First Graders. 58-69 - Peter Birkholz, Christian Kleiner:
Velocity Differences Between Velum Raising and Lowering Movements. 70-80 - Natalia Bogdanova-Beglarian, Olga Blinova, Tatiana Y. Sherstinova, Tatiana Sulimova:
Pragmatic Markers of Russian Everyday Speech: Invariants in Dialogue and Monologue. 81-90 - Vincent Brignatz, Jarod Duret, Driss Matrouf, Mickael Rouvier:
Language Adaptation for Speaker Recognition Systems Using Contrastive Learning. 91-99 - Pierre Champion, Denis Jouvet, Anthony Larcher:
Evaluating X-Vector-Based Speaker Anonymization Under White-Box Assessment. 100-111 - Myrsini Christidou, Alexandra Vioni, Nikolaos Ellinas, Georgios Vamvoukakis, Konstantinos Markopoulos, Panos Kakoulidis, June Sig Sung, Hyoungmin Park, Aimilios Chalamandaris, Pirros Tsiakoulis:
Improved Prosodic Clustering for Multispeaker and Speaker-Independent Phoneme-Level Prosody Control. 112-123 - Adam Chýlek, Jan Svec, Lubos Smídl:
Initial Experiments on Question Answering from the Intrinsic Structure of Oral History Archives. 124-133 - Debadatta Dash, Paul Ferrari, Karinne Berstis, Jun Wang:
Imagined, Intended, and Spoken Speech Envelope Synthesis from Neuromagnetic Signals. 134-145 - Maria Dayter, Elena I. Riekhakaynen:
What Causes Phonetic Reduction in Russian Speech: New Evidence from Machine Learning Algorithms. 146-156 - Mikhail Dolgushin, Dayana Ismakova, Yuliya Bidulya, Igor Krupkin, Galina Barskaya, Anastasiya Lesiv:
Toxic Comment Classification Service in Social Network. 157-165 - Denis Dresvyanskiy, Wolfgang Minker, Alexey Karpov:
Deep Learning Based Engagement Recognition in Highly Imbalanced Data. 166-178 - Anna Dunashova:
Intraspeaker Variability of a Professional Lecturer: Ageing, Genre, Pragmatics vs. Voice Acting (Case Study). 179-189 - Abderrahim Fathan, Jahangir Alam, Woo Hyun Kang:
An Ensemble Approach for the Diagnosis of COVID-19 from Speech and Cough Sounds. 190-201 - Sahar Ghannay, Antoine Caubrière, Salima Mdhaffar, Gaëlle Laperrière, Bassam Jabaian, Yannick Estève:
Where Are We in Semantic Concept Extraction for Spoken Language Understanding? 202-213 - Parismita Gogoi, Sishir Kalita, Wendy Lalhminghlui, Priyankoo Sarmah, S. R. M. Prasanna:
Learning Mizo Tones from F0 Contours Using 1D-CNN. 214-225 - Ivan Gruber, Marek Hrúz, Pavel Ircing, Petr Neduchal, Tomás Zítka, Miroslav Hlavác, Zbynek Zajíc, Jan Svec, Martin Bulín:
OCR Improvements for Images of Multi-page Historical Documents. 226-237 - Ivan Gruber, Marek Hrúz, Milos Zelezný, Alexey Karpov:
X-Bridge: Image-to-Image Translation with Reconstruction Capabilities. 238-249 - Hien Thi Ha, Ales Horák:
Who is Selling to Whom - Feature Evaluation for Multi-block Classification in Invoice Information Extraction. 250-261 - Abner Hernandez, Seung Hee Yang:
Multimodal Corpus Analysis of Autoblog 2020: Lecture Videos in Machine Learning. 262-270 - Juan Hussain, Christian Huber, Sebastian Stüker, Alexander Waibel:
Text and Synthetic Data for Domain Adaptation in End-to-End Speech Recognition. 271-278 - Anosha Ignatius, Uthayasanker Thayasivam:
Speaker-Invariant Speech-to-Intent Classification for Low-Resource Languages. 279-290 - Denis Ivanko, Dmitry Ryumin, Alexandr Axyonov, Alexey M. Kashevnik:
Speaker-Dependent Visual Command Recognition in Vehicle Cabin: Methodology and Evaluation. 291-302 - Joshua Jansen van Vueren, Thomas Niesler:
Optimised Code-Switched Language Model Data Augmentation in Four Under-Resourced South African Languages. 303-316 - Virender Kadyan, Hemant Kumar Kathania, Prajjval Govil, Mikko Kurimo:
Synthesis Speech Based Data Augmentation for Low Resource Children ASR. 317-326 - Irina S. Kipyatkova:
End-to-End Russian Speech Recognition Models with Multi-head Attention. 327-335 - Konstantinos Klapsas, Nikolaos Ellinas, June Sig Sung, Hyoungmin Park, Spyros Raptis:
Word-Level Style Control for Expressive, Non-attentive Speech Synthesis. 336-347 - Liliya Komalova, Diana Kulagina:
Perceiving Speech Aggression with and without Textual Context on Twitter Social Network Site. 348-359 - Roman Korostik, Javier Latorre, Sivanand Achanta, Yannis Stylianou:
Assessing Speaker Interpolation in Neural Text-to-Speech. 360-371 - Denis Likhachov, Maxim Vashkevich, Elias Azarov, Katsiaryna Malhina, Yuliya Rushkevich:
A Mobile Application for Detection of Amyotrophic Lateral Sclerosis via Voice Analysis. 372-383 - Elena E. Lyakso, Olga V. Frolova, Nersisson Ruban, A. Mary Mekala:
Child's Emotional Speech Classification by Human Across Two Languages: Russian & Tamil. 384-396 - Olesia Makhnytkina, Aleksey Grigorev, Aleksander Nikolaev:
Analysis of Dialogues of Typically Developing Children, Children with Down Syndrome and ASD Using Machine Learning Methods. 397-406 - Ali Raheem Mandeel, Mohammed Salah Al-Radhi, Tamás Gábor Csapó:
Speaker Adaptation with Continuous Vocoder-Based DNN-TTS. 407-416 - Yuri Matveev, Anton Matveev, Olga V. Frolova, Elena E. Lyakso:
Automatic Recognition of the Psychoneurological State of Children: Autism Spectrum Disorders, Down Syndrome, Typical Development. 417-425 - Salima Mdhaffar, Marc Tommasi, Yannick Estève:
Study on Acoustic Model Personalization in a Context of Collaborative Learning Constrained by Privacy Preservation. 426-436 - Muhammadjon Musaev, Saida Mussakhojayeva, Ilyos Khujayorov, Yerbolat Khassanov, Mannon Ochilov, Huseyin Atakan Varol:
USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition Experiments. 437-447 - Saida Mussakhojayeva, Yerbolat Khassanov, Huseyin Atakan Varol:
A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English. 448-459 - Sergis Nicolaou, Lambros Mavrides, Georgina Tryfou, Kyriakos Tolias, Konstantinos P. Panousis, Sotirios Chatzis, Sergios Theodoridis:
Dialog Speech Sentiment Classification for Imbalanced Datasets. 460-471 - Tijana V. Nosek, Sinisa Suzic, Mia Vujovic, Darko Pekar, Milan Secujski, Vlado Delic:
Explicit Control of the Level of Expressiveness in DNN-Based Speech Synthesis by Embedding Interpolation. 472-482 - Dariya Novokhrestova, Evgeny Kostuchenko, Ilya A. Hodashinsky, Lidiya N. Balatskaya:
Experimental Analysis of Expert and Quantitative Estimates of Syllable Recordings in the Process of Speech Rehabilitation. 483-491 - Edvin Pakoci, Branislav M. Popovic:
Methods for Using Class Based N-gram Language Models in the Kaldi Toolkit. 492-503 - Ankur T. Patil, Harsh Kotta, Rajul Acharya, Hemant A. Patil:
Spectral Root Features for Replay Spoof Detection in Voice Assistants. 504-515 - Rodmonga Potapova, Tatyana Agibalova, Vsevolod Potapov, Olga Tuchina:
Influence of the Aggressive Internet Environment on Cognitive Personality Disorders (in Relation to the Russian Young Generation of Users). 516-527 - Rodmonga Potapova, Vsevolod Potapov, Nataliya Lebedeva, Ekaterina Karimova, Nikolay Bobrov:
Media Content vs Nature Stimuli Influence on Human Brain Activity. 528-539 - Valeriya Prokaeva, Elena I. Riekhakaynen, Vladislav I. Zubov:
Can Your Eyes Tell Us Why You Hesitate? Comparing Reading Aloud in Russian as L1 and Japanese as L2. 540-552 - Josef V. Psutka, Ales Prazák, Jan Vanek:
Recognition of Heavily Accented and Emotional Speech of English and Czech Holocaust Survivors Using Various DNN Architectures. 553-564 - Mathias Quillot, Richard Dufour, Jean-François Bonastre:
Assessing Speaker-Independent Character Information for Acted Voices. 565-576 - Mathias Quillot, Jarod Duret, Richard Dufour, Mickael Rouvier, Jean-François Bonastre:
Influence of Speaker Pre-training on Character Voice Representation. 577-588 - Ilyos Rabbimov, Sami Kobilov, Iosif Mporas:
Opinion Classification via Word and Emoji Embedding Models with LSTM. 589-601 - Aku Rouhe, Astrid Van Camp, Mittul Singh, Hugo Van hamme, Mikko Kurimo:
An Equal Data Setting for Attention-Based Encoder-Decoder and HMM/DNN Models: A Case Study in Finnish ASR. 602-613 - Lyudmila V. Savchenko, Andrey V. Savchenko:
Speaker-Aware Training of Speech Emotion Classifier with Speaker Recognition. 614-625 - Andrey V. Savinkov, Vladimir V. Bochkarev, Anna V. Shevlyakova, Stanislav Khristoforov:
Neural Network Recognition of Russian Noun and Adjective Cases in the Google Books Ngram Corpus. 626-637 - Vered Silber-Varod, Mária Gósy, Anat Lerner:
Is It a Filler or a Pause? A Quantitative Analysis of Filled Pauses in Hebrew. 638-648 - Shrishti Singh, Kuldeep Khoria, Hemant A. Patil:
Modified Group Delay Function Using Different Spectral Smoothing Techniques for Voice Liveness Detection. 649-659 - Tatiana Sokoreva, Tatiana Shevchenko, Mariya Chyrvonaya:
Complex Rhythm Adjustments in Multilingual Code-Switching Across Mandarin, English and Russian. 660-669 - Mohammad Soleymanpour, Michael T. Johnson, Jeffrey Berry:
Increasing the Precision of Dysarthric Speech Intelligibility and Severity Level Estimate. 670-679 - Lauri Tavi, Tomi Kinnunen, Einar Meister, Rosa González Hautamäki, Anton Malmi:
Articulation During Voice Disguise: A Pilot Study. 680-691 - Elena Timofeeva, Elena Evseeva, Valeriia Zaluskaia, Vlada Kapranova, Sergei Astapov, Vladimir Kabarov:
Improvement of Speaker Number Estimation by Applying an Overlapped Speech Detector. 692-703 - Paras Tiwari, Sawan Rai:
Mind Your Tweet: Abusive Tweet Detection. 704-715 - Marián Trnka, Sakhia Darjaa, Milan Rusko, Meilin Schaper, Tim H. Stelkens-Kobsch:
Speaker Authorization for Air Traffic Control Security. 716-725 - Ana Rita Valente, Catarina Oliveira, Luciana Albuquerque, António Teixeira, Plínio A. Barbosa:
Prosodic Changes with Age: A Longitudinal Study on a Famous European Portuguese Native Speaker. 726-736 - Loes van Bemmel, Wieke Harmsen, Catia Cucchiarini, Helmer Strik:
Automatic Selection of the Most Characterizing Features for Detecting COPD in Speech. 737-748 - Ewald van der Westhuizen, Trideba Padhi, Thomas Niesler:
Multilingual Training Set Selection for ASR in Under-Resourced Malian Languages. 749-760 - Jan Volín, Markéta Rezácková, Jindrich Matousek:
Human and Transformer-Based Prosodic Phrasing in Two Speech Genres. 761-772 - Roman Vygon, Nikolay Mikhaylovskiy:
Learning Efficient Representations for Keyword Spotting with Triplet Loss. 773-785 - Tobias Watzel, Ludwig Kürzinger, Lujun Li, Gerhard Rigoll:
Regularized Forward-Backward Decoder for Attention Models. 786-794 - Tobias Watzel, Ludwig Kürzinger, Lujun Li, Gerhard Rigoll:
Induced Local Attention for Transformer Models in Speech Recognition. 795-806 - Zbynek Zajíc, Marie Kunesová, Ludek Müller:
Applying EEND Diarization to Telephone Recordings from a Call Center. 807-817 - Svetlana Zimina, Vera Evdokimova:
Acoustic Characteristics of Speech Entrainment in Dialogues in Similar Phonetic Sequences. 818-825 - Ismail Rasim Ülgen, Mustafa Erden, Levent M. Arslan:
Predicting Biometric Error Behaviour from Speaker Embeddings and a Fast Score Normalization Scheme. 826-836
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.