Alexis Moinet
2020 – today
- 2024
- [i19] Mateusz Lajszczak, Guillermo Cámbara, Yang Li, Fatih Beyhan, Arent van Korlaar, Fan Yang, Arnaud Joly, Álvaro Martín-Cortinas, Ammar Abbas, Adam Michalski, Alexis Moinet, Sri Karlapati, Ewa Muszynska, Haohan Guo, Bartosz Putrycz, Soledad López Gambino, Kayeon Yoo, Elena Sokolova, Thomas Drugman:
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data. CoRR abs/2402.08093 (2024)
- 2023
- [c33] Ammar Abbas, Sri Karlapati, Bastian Schnell, Penny Karanasou, Marcel Granero Moya, Amith Nagaraj, Ayman Boustati, Nicole Peinelt, Alexis Moinet, Thomas Drugman:
eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer. INTERSPEECH 2023: 3387-3391
- [c32] Marcel Granero Moya, Penny Karanasou, Sri Karlapati, Bastian Schnell, Nicole Peinelt, Alexis Moinet, Thomas Drugman:
A Comparative Analysis of Pretrained Language Models for Text-to-Speech. SSW 2023: 14-20
- [c31] Arnaud Joly, Marco Nicolis, Ekaterina Peterova, Alessandro Lombardi, Ammar Abbas, Arent van Korlaar, Aman Hussain, Parul Sharma, Alexis Moinet, Mateusz Lajszczak, Penny Karanasou, Antonio Bonafonte, Thomas Drugman, Elena Sokolova:
Controllable Emphasis with zero data for text-to-speech. SSW 2023: 113-119
- [i18] Ammar Abbas, Sri Karlapati, Bastian Schnell, Penny Karanasou, Marcel Granero Moya, Amith Nagaraj, Ayman Boustati, Nicole Peinelt, Alexis Moinet, Thomas Drugman:
eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer. CoRR abs/2306.11327 (2023)
- [i17] Arnaud Joly, Marco Nicolis, Ekaterina Peterova, Alessandro Lombardi, Ammar Abbas, Arent van Korlaar, Aman Hussain, Parul Sharma, Alexis Moinet, Mateusz Lajszczak, Penny Karanasou, Antonio Bonafonte, Thomas Drugman, Elena Sokolova:
Controllable Emphasis with zero data for text-to-speech. CoRR abs/2307.07062 (2023)
- [i16] Marcel Granero Moya, Penny Karanasou, Sri Karlapati, Bastian Schnell, Nicole Peinelt, Alexis Moinet, Thomas Drugman:
A Comparative Analysis of Pretrained Language Models for Text-to-Speech. CoRR abs/2309.01576 (2023)
- 2022
- [c30] Mateusz Lajszczak, Animesh Prasad, Arent van Korlaar, Bajibabu Bollepalli, Antonio Bonafonte, Arnaud Joly, Marco Nicolis, Alexis Moinet, Thomas Drugman, Trevor Wood, Elena Sokolova:
Distribution Augmentation for Low-Resource Expressive Text-To-Speech. ICASSP 2022: 8307-8311
- [c29] Sri Karlapati, Penny Karanasou, Mateusz Lajszczak, Syed Ammar Abbas, Alexis Moinet, Peter Makarov, Ray Li, Arent van Korlaar, Simon Slangen, Thomas Drugman:
CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer. INTERSPEECH 2022: 3363-3367
- [c28] Peter Makarov, Syed Ammar Abbas, Mateusz Lajszczak, Arnaud Joly, Sri Karlapati, Alexis Moinet, Thomas Drugman, Penny Karanasou:
Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody. INTERSPEECH 2022: 3368-3372
- [c27] Syed Ammar Abbas, Thomas Merritt, Alexis Moinet, Sri Karlapati, Ewa Muszynska, Simon Slangen, Elia Gatti, Thomas Drugman:
Expressive, Variable, and Controllable Duration Modelling in TTS. INTERSPEECH 2022: 4546-4550
- [c26] Dino Rattcliffe, You Wang, Alex Mansbridge, Penny Karanasou, Alexis Moinet, Marius Cotescu:
Cross-lingual Style Transfer with Conditional Prior VAE and Style Loss. INTERSPEECH 2022: 4586-4590
- [i15] Mateusz Lajszczak, Animesh Prasad, Arent van Korlaar, Bajibabu Bollepalli, Antonio Bonafonte, Arnaud Joly, Marco Nicolis, Alexis Moinet, Thomas Drugman, Trevor Wood, Elena Sokolova:
Distribution augmentation for low-resource expressive text-to-speech. CoRR abs/2202.06409 (2022)
- [i14] Sri Karlapati, Penny Karanasou, Mateusz Lajszczak, Ammar Abbas, Alexis Moinet, Peter Makarov, Ray Li, Arent van Korlaar, Simon Slangen, Thomas Drugman:
CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer. CoRR abs/2206.13443 (2022)
- [i13] Ammar Abbas, Thomas Merritt, Alexis Moinet, Sri Karlapati, Ewa Muszynska, Simon Slangen, Elia Gatti, Thomas Drugman:
Expressive, Variable, and Controllable Duration Modelling in TTS. CoRR abs/2206.14165 (2022)
- [i12] Peter Makarov, Ammar Abbas, Mateusz Lajszczak, Arnaud Joly, Sri Karlapati, Alexis Moinet, Thomas Drugman, Penny Karanasou:
Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody. CoRR abs/2206.14643 (2022)
- 2021
- [c25] Sri Karlapati, Ammar Abbas, Zack Hodari, Alexis Moinet, Arnaud Joly, Penny Karanasou, Thomas Drugman:
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech. ICASSP 2021: 6573-6577
- [c24] Zack Hodari, Alexis Moinet, Sri Karlapati, Jaime Lorenzo-Trueba, Thomas Merritt, Arnaud Joly, Ammar Abbas, Penny Karanasou, Thomas Drugman:
Camp: A Two-Stage Approach to Modelling Prosody in Context. ICASSP 2021: 6578-6582
- [c23] Penny Karanasou, Sri Karlapati, Alexis Moinet, Arnaud Joly, Ammar Abbas, Simon Slangen, Jaime Lorenzo-Trueba, Thomas Drugman:
A Learned Conditional Prior for the VAE Acoustic Space of a TTS System. Interspeech 2021: 3620-3624
- [c22] Ammar Abbas, Bajibabu Bollepalli, Alexis Moinet, Arnaud Joly, Penny Karanasou, Peter Makarov, Simon Slangen, Sri Karlapati, Thomas Drugman:
Multi-Scale Spectrogram Modelling for Neural Text-to-Speech. SSW 2021: 177-182
- [i11] Penny Karanasou, Sri Karlapati, Alexis Moinet, Arnaud Joly, Ammar Abbas, Simon Slangen, Jaime Lorenzo-Trueba, Thomas Drugman:
A learned conditional prior for the VAE acoustic space of a TTS system. CoRR abs/2106.10229 (2021)
- [i10] Ammar Abbas, Bajibabu Bollepalli, Alexis Moinet, Arnaud Joly, Penny Karanasou, Peter Makarov, Simon Slangen, Sri Karlapati, Thomas Drugman:
Multi-Scale Spectrogram Modelling for Neural Text-to-Speech. CoRR abs/2106.15649 (2021)
- 2020
- [j5] Marius Cotescu, Thomas Drugman, Goeric Huybrechts, Jaime Lorenzo-Trueba, Alexis Moinet:
Voice Conversion for Whispered Speech Synthesis. IEEE Signal Process. Lett. 27: 186-190 (2020)
- [c21] Orazio Angelini, Alexis Moinet, Kayoko Yanagisawa, Thomas Drugman:
Singing Synthesis: With a Little Help from my Attention. INTERSPEECH 2020: 1221-1225
- [c20] Sri Karlapati, Alexis Moinet, Arnaud Joly, Viacheslav Klimkov, Daniel Sáez-Trigueros, Thomas Drugman:
CopyCat: Many-to-Many Fine-Grained Prosody Transfer for Neural Text-to-Speech. INTERSPEECH 2020: 4387-4391
- [i9] Sri Karlapati, Alexis Moinet, Arnaud Joly, Viacheslav Klimkov, Daniel Saez-Trigueros, Thomas Drugman:
CopyCat: Many-to-Many Fine-Grained Prosody Transfer for Neural Text-to-Speech. CoRR abs/2004.14617 (2020)
- [i8] Thomas Drugman, Thomas Dubuisson, Alexis Moinet, Nicolas D'Alessandro, Thierry Dutoit:
Glottal source estimation robustness: A comparison of sensitivity of voice source estimation techniques. CoRR abs/2005.11682 (2020)
- [i7] Sri Karlapati, Ammar Abbas, Zack Hodari, Alexis Moinet, Arnaud Joly, Penny Karanasou, Thomas Drugman:
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech. CoRR abs/2011.02252 (2020)
- [i6] Jonas Rohnke, Thomas Merritt, Jaime Lorenzo-Trueba, Adam Gabrys, Vatsal Aggarwal, Alexis Moinet, Roberto Barra-Chicote:
Parallel WaveNet conditioned on VAE latent vectors. CoRR abs/2012.09703 (2020)
2010 – 2019
- 2019
- [c19] Jaime Lorenzo-Trueba, Thomas Drugman, Javier Latorre, Thomas Merritt, Bartosz Putrycz, Roberto Barra-Chicote, Alexis Moinet, Vatsal Aggarwal:
Towards Achieving Robust Universal Neural Vocoding. INTERSPEECH 2019: 181-185
- [i5] Thomas Drugman, Goeric Huybrechts, Viacheslav Klimkov, Alexis Moinet:
Traditional Machine Learning for Pitch Detection. CoRR abs/1903.01290 (2019)
- [i4] Marius Cotescu, Thomas Drugman, Goeric Huybrechts, Jaime Lorenzo-Trueba, Alexis Moinet:
Voice Conversion for Whispered Speech Synthesis. CoRR abs/1912.05289 (2019)
- [i3] Orazio Angelini, Alexis Moinet, Kayoko Yanagisawa, Thomas Drugman:
Singing Synthesis: with a little help from my attention. CoRR abs/1912.05881 (2019)
- [i2] Thomas Drugman, Alexis Moinet, Thierry Dutoit, Geoffrey Wilfart:
Using a Pitch-Synchronous Residual Codebook for Hybrid HMM/Frame Selection Speech Synthesis. CoRR abs/1912.12887 (2019)
- 2018
- [j4] Thomas Drugman, Goeric Huybrechts, Viacheslav Klimkov, Alexis Moinet:
Traditional Machine Learning for Pitch Detection. IEEE Signal Process. Lett. 25(11): 1745-1749 (2018)
- [c18] Thomas Merritt, Bartosz Putrycz, Adam Nadolski, Tianjun Ye, Daniel Korzekwa, Wiktor Dolecki, Thomas Drugman, Viacheslav Klimkov, Alexis Moinet, Andrew Breen, Rafal Kuklinski, Nikko Strom, Roberto Barra-Chicote:
Comprehensive Evaluation of Statistical Speech Waveform Synthesis. SLT 2018: 325-331
- [c17] Viacheslav Klimkov, Alexis Moinet, Adam Nadolski, Thomas Drugman:
Parameter Generation Algorithms for Text-To-Speech Synthesis with Recurrent Neural Networks. SLT 2018: 626-631
- [i1] Thomas Merritt, Bartosz Putrycz, Adam Nadolski, Tianjun Ye, Daniel Korzekwa, Wiktor Dolecki, Thomas Drugman, Viacheslav Klimkov, Alexis Moinet, Andrew Breen, Rafal Kuklinski, Nikko Strom, Roberto Barra-Chicote:
Comprehensive evaluation of statistical speech waveform synthesis. CoRR abs/1811.06296 (2018)
- 2017
- [c16] Viacheslav Klimkov, Adam Nadolski, Alexis Moinet, Bartosz Putrycz, Roberto Barra-Chicote, Thomas Merritt, Thomas Drugman:
Phrase Break Prediction for Long-Form Reading TTS: Exploiting Text Structure Information. INTERSPEECH 2017: 1064-1068
- 2016
- [c15] Gabriel Urbain, Christian Frisson, Alexis Moinet, Thierry Dutoit:
A Semantic and Content-Based Search User Interface for Browsing Large Collections of Foley Sounds. Audio Mostly Conference 2016: 272-277
- 2015
- [c14] Kevin El Haddad, Hüseyin Çakmak, Alexis Moinet, Stéphane Dupont, Thierry Dutoit:
An HMM approach for synthesizing amused speech with a controllable intensity of smile. ISSPIT 2015: 7-11
- 2013
- [c13] Nicolas D'Alessandro, Joëlle Tilmanne, Maria Astrinaki, Thomas Hueber, Rasmus Dall, Thierry Ravet, Alexis Moinet, Hüseyin Çakmak, Onur Babacan, Adela Barbulescu, Valentin Parfait, Victor Huguenin, Emine Sümeyye Kalayci, Qiong Hu:
Reactive Statistical Mapping: Towards the Sketching of Performative Control with Data. eNTERFACE 2013: 20-49
- [c12] Christian Frisson, Stéphane Dupont, Alexis Moinet, Cécile Picard-Limpens, Thierry Ravet, Xavier Siebert, Thierry Dutoit:
VideoCycle: User-Friendly Navigation by Similarity in Video Databases. MMM (2) 2013: 550-553
- [c11] Maria Astrinaki, Nicolas D'Alessandro, Loïc Reboursière, Alexis Moinet, Thierry Dutoit:
MAGE 2.0: New Features and its Application in the Development of a Talking Guitar. NIME 2013: 547-550
- [c10] Maria Astrinaki, Alexis Moinet, Junichi Yamagishi, Korin Richmond, Zhen-Hua Ling, Simon King, Thierry Dutoit:
Mage - reactive articulatory feature control of HMM-based parametric speech synthesis. SSW 2013: 207-211
- [c9] Maria Astrinaki, Alexis Moinet, Junichi Yamagishi, Korin Richmond, Zhen-Hua Ling, Simon King, Thierry Dutoit:
Mage - HMM-based speech synthesis reactively controlled by the articulators. SSW 2013: 243
- 2012
- [j3] Joëlle Tilmanne, Alexis Moinet, Thierry Dutoit:
Stylistic gait synthesis based on hidden Markov models. EURASIP J. Adv. Signal Process. 2012: 72 (2012)
- [c8] Christian Frisson, Stéphane Dupont, Julien Leroy, Alexis Moinet, Thierry Ravet, Xavier Siebert, Thierry Dutoit:
LoopJam: turning the dance floor into a collaborative instrumental map. NIME 2012
- 2010
- [j2] Jérôme Urbain, Radoslaw Niewiadomski, Elisabetta Bevacqua, Thierry Dutoit, Alexis Moinet, Catherine Pelachaud, Benjamin Picart, Joëlle Tilmanne, Johannes Wagner:
AVLaughterCycle. J. Multimodal User Interfaces 4(1): 47-58 (2010)
- [c7] Jérôme Urbain, Elisabetta Bevacqua, Thierry Dutoit, Alexis Moinet, Radoslaw Niewiadomski, Catherine Pelachaud, Benjamin Picart, Joëlle Tilmanne, Johannes Wagner:
The AVLaughterCycle Database. LREC 2010
2000 – 2009
- 2009
- [c6] Thomas Drugman, Alexis Moinet, Thierry Dutoit, Geoffrey Wilfart:
Using a pitch-synchronous residual codebook for hybrid HMM/frame selection speech synthesis. ICASSP 2009: 3793-3796
- [c5] Malorie Charlier, Yamato Ohtani, Tomoki Toda, Alexis Moinet, Thierry Dutoit:
Cross-language voice conversion based on eigenvoices. INTERSPEECH 2009: 1635-1638
- 2008
- [j1] Nicolas D'Alessandro, Onur Babacan, Baris Bozkurt, Thomas Dubuisson, Andre Holzapfel, Loïc Kessous, Alexis Moinet, Maxime Vlieghe:
RAMCESS 2.X framework - expressive voice analysis for realtime and accurate synthesis of singing. J. Multimodal User Interfaces 2(2): 133-144 (2008)
- [c4] Thomas Drugman, Thomas Dubuisson, Nicolas D'Alessandro, Alexis Moinet, Thierry Dutoit:
Voice source parameters estimation by fitting the glottal formant and the inverse filtering open phase. EUSIPCO 2008: 1-5
- [c3] Thomas Drugman, Thomas Dubuisson, Alexis Moinet, Nicolas D'Alessandro, Thierry Dutoit:
Glottal Source Estimation Robustness - A Comparison of Sensitivity of Voice Source Estimation Techniques. SIGMAP 2008: 202-207
- 2007
- [c2] Thierry Dutoit, Andre Holzapfel, Matthieu Jottrand, Alexis Moinet, Javier Pérez, Yannis Stylianou:
Towards a Voice Conversion System Based on Frame Selection. ICASSP (4) 2007: 513-516
- [c1] Nicolas D'Alessandro, Alexis Moinet, Thomas Dubuisson, Thierry Dutoit:
Causal/anticausal Decomposition for mixed-phase Description of brass and Bowed String sounds. ICMC 2007
last updated on 2024-08-03 20:10 CEST by the dblp team
all metadata released as open data under CC0 1.0 license