SC-GlowTTS: an efficient zero-shot multi-speaker text-to-speech model (2021)
- Authors:
- Casanova, Edresson
- Shulby, Christopher Dane
- Gölge, Eren
- Müller, Nicolas Michael
- Oliveira, Frederico Santos de - Universidade Federal de Goiás (UFG)
- Candido Junior, Arnaldo - Universidade Tecnológica Federal do Paraná (UTFPR)
- Soares, Anderson da Silva - Universidade Federal de Goiás (UFG)
- Aluísio, Sandra Maria
- Ponti, Moacir Antonelli
- USP affiliated authors: ALUISIO, SANDRA MARIA - ICMC ; PONTI, MOACIR ANTONELLI - ICMC ; CASANOVA, EDRESSON - ICMC
- Unidade: ICMC
- DOI: 10.21437/Interspeech.2021-1774
- Subjects: PROCESSAMENTO DE LINGUAGEM NATURAL; REDES NEURAIS; RECONHECIMENTO DE TEXTO; RECONHECIMENTO DE VOZ
- Keywords: zero-shot multi-speaker TTS; text-to-speech; multi-speaker modeling; zero-shot voice conversion
- Agências de fomento:
- Language: Inglês
- Imprenta:
- Source:
- Título do periódico: Proceedings
- Conference titles: Annual Conference of the International Speech Communication Association - INTERSPEECH
- Este periódico é de assinatura
- Este artigo é de acesso aberto
- URL de acesso aberto
- Cor do Acesso Aberto: green
-
ABNT
CASANOVA, Edresson et al. SC-GlowTTS: an efficient zero-shot multi-speaker text-to-speech model. 2021, Anais.. Baixas: ISCA, 2021. Disponível em: https://doi.org/10.21437/Interspeech.2021-1774. Acesso em: 19 set. 2024. -
APA
Casanova, E., Shulby, C. D., Gölge, E., Müller, N. M., Oliveira, F. S. de, Candido Junior, A., et al. (2021). SC-GlowTTS: an efficient zero-shot multi-speaker text-to-speech model. In Proceedings. Baixas: ISCA. doi:10.21437/Interspeech.2021-1774 -
NLM
Casanova E, Shulby CD, Gölge E, Müller NM, Oliveira FS de, Candido Junior A, Soares A da S, Aluísio SM, Ponti MA. SC-GlowTTS: an efficient zero-shot multi-speaker text-to-speech model [Internet]. Proceedings. 2021 ;[citado 2024 set. 19 ] Available from: https://doi.org/10.21437/Interspeech.2021-1774 -
Vancouver
Casanova E, Shulby CD, Gölge E, Müller NM, Oliveira FS de, Candido Junior A, Soares A da S, Aluísio SM, Ponti MA. SC-GlowTTS: an efficient zero-shot multi-speaker text-to-speech model [Internet]. Proceedings. 2021 ;[citado 2024 set. 19 ] Available from: https://doi.org/10.21437/Interspeech.2021-1774 - ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion
- TTS applied to the generation of datasets for automatic speech recognition
- Speech2Phone: a novel and efficient method for training speaker recognition models
- TTS-portuguese corpus: a corpus for speech synthesis in brazilian portuguese
- YourTTS: towards zero-shot multi-speaker TTS and zero-shot voice conversion for everyone
- Deep learning approaches for speech synthesis and speaker verification
- Evaluating sentence segmentation in different datasets of neuropsychological language tests in brazilian portuguese
- Transfer learning and data augmentation techniques to the COVID-19 identification tasks in ComParE 2021
- Evaluating semantic similarity methods to build semantic predictability norms of reading data
- Brazilian portuguese speech recognition using Wav2vec 2.0
Informações sobre o DOI: 10.21437/Interspeech.2021-1774 (Fonte: oaDOI API)
Download do texto completo
Tipo | Nome | Link | |
---|---|---|---|
3057702.pdf |
How to cite
A citação é gerada automaticamente e pode não estar totalmente de acordo com as normas