YourTTS: towards zero-shot multi-speaker TTS and zero-shot voice conversion for everyone (2022)
- Authors:
- USP affiliated authors: PONTI, MOACIR ANTONELLI - ICMC ; CASANOVA, EDRESSON - ICMC
- Unidade: ICMC
- Subjects: APRENDIZADO COMPUTACIONAL; PROCESSAMENTO DE LINGUAGEM NATURAL; RECONHECIMENTO DA FALA; SÍNTESE DE FALA
- Keywords: cross-lingual zero-shot multi-speaker TTS; text-to-speech; cross-lingual zero-shot voice conversion; speaker adaptation
- Agências de fomento:
- Language: Inglês
- Imprenta:
- Publisher: Microtome Publishing
- Publisher place: Brookline
- Date published: 2022
- Source:
- Título: Proceedings of Machine Learning Research : PMLR
- ISSN: 1938-7228
- Volume/Número/Paginação/Ano: v. 162, p. 2709-2720, 2022
- Conference titles: International Conference on Machine Learning - ICML
-
ABNT
CASANOVA, Edresson et al. YourTTS: towards zero-shot multi-speaker TTS and zero-shot voice conversion for everyone. Proceedings of Machine Learning Research : PMLR. Brookline: Microtome Publishing. Disponível em: https://proceedings.mlr.press/v162/casanova22a.html. Acesso em: 29 abr. 2025. , 2022 -
APA
Casanova, E., Weber, J., Shulby, C. D., Candido Junior, A., Gölge, E., & Ponti, M. A. (2022). YourTTS: towards zero-shot multi-speaker TTS and zero-shot voice conversion for everyone. Proceedings of Machine Learning Research : PMLR. Brookline: Microtome Publishing. Recuperado de https://proceedings.mlr.press/v162/casanova22a.html -
NLM
Casanova E, Weber J, Shulby CD, Candido Junior A, Gölge E, Ponti MA. YourTTS: towards zero-shot multi-speaker TTS and zero-shot voice conversion for everyone [Internet]. Proceedings of Machine Learning Research : PMLR. 2022 ; 162 2709-2720.[citado 2025 abr. 29 ] Available from: https://proceedings.mlr.press/v162/casanova22a.html -
Vancouver
Casanova E, Weber J, Shulby CD, Candido Junior A, Gölge E, Ponti MA. YourTTS: towards zero-shot multi-speaker TTS and zero-shot voice conversion for everyone [Internet]. Proceedings of Machine Learning Research : PMLR. 2022 ; 162 2709-2720.[citado 2025 abr. 29 ] Available from: https://proceedings.mlr.press/v162/casanova22a.html - SC-GlowTTS: an efficient zero-shot multi-speaker text-to-speech model
- ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion
- Transfer learning and data augmentation techniques to the COVID-19 identification tasks in ComParE 2021
- TTS applied to the generation of datasets for automatic speech recognition
- TTS-portuguese corpus: a corpus for speech synthesis in brazilian portuguese
- Speech2Phone: a novel and efficient method for training speaker recognition models
- Brazilian portuguese speech recognition using Wav2vec 2.0
- Síntese de fala aplicada à geração de conjunto de dados para reconhecimento automático de fala
- Desenvolvimento de um modelo de reconhecimento de voz para o português brasileiro com poucos dados utilizando o Wav2vec 2.0
- BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus
Download do texto completo
Tipo | Nome | Link | |
---|---|---|---|
3114020.pdf | Direct link |
How to cite
A citação é gerada automaticamente e pode não estar totalmente de acordo com as normas