ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion (2023)
- Authors:
- USP affiliated authors: ALUISIO, SANDRA MARIA - ICMC ; PONTI, MOACIR ANTONELLI - ICMC ; CASANOVA, EDRESSON - ICMC
- Unidade: ICMC
- DOI: 10.21437/Interspeech.2023-496
- Subjects: SÍNTESE DE FALA; PROCESSAMENTO DE LINGUAGEM NATURAL
- Agências de fomento:
- Language: Inglês
- Imprenta:
- Source:
- Título: Proceedings
- Conference titles: Annual Conference of the International Speech Communication Association - INTERSPEECH
- Este periódico é de assinatura
- Este artigo é de acesso aberto
- URL de acesso aberto
- Cor do Acesso Aberto: green
-
ABNT
CASANOVA, Edresson et al. ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion. 2023, Anais.. Baixas: ISCA, 2023. Disponível em: https://doi.org/10.21437/Interspeech.2023-496. Acesso em: 28 dez. 2025. -
APA
Casanova, E., Shulby, C. D., Korolev, A., Candido Junior, A., Soares, A. da S., Aluísio, S. M., & Ponti, M. A. (2023). ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion. In Proceedings. Baixas: ISCA. doi:10.21437/Interspeech.2023-496 -
NLM
Casanova E, Shulby CD, Korolev A, Candido Junior A, Soares A da S, Aluísio SM, Ponti MA. ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion [Internet]. Proceedings. 2023 ;[citado 2025 dez. 28 ] Available from: https://doi.org/10.21437/Interspeech.2023-496 -
Vancouver
Casanova E, Shulby CD, Korolev A, Candido Junior A, Soares A da S, Aluísio SM, Ponti MA. ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion [Internet]. Proceedings. 2023 ;[citado 2025 dez. 28 ] Available from: https://doi.org/10.21437/Interspeech.2023-496 - SC-GlowTTS: an efficient zero-shot multi-speaker text-to-speech model
- Speech2Phone: a novel and efficient method for training speaker recognition models
- TTS-portuguese corpus: a corpus for speech synthesis in brazilian portuguese
- TTS applied to the generation of datasets for automatic speech recognition
- YourTTS: towards zero-shot multi-speaker TTS and zero-shot voice conversion for everyone
- Deep learning approaches for speech synthesis and speaker verification
- Evaluating sentence segmentation in different datasets of neuropsychological language tests in brazilian portuguese
- Transfer learning and data augmentation techniques to the COVID-19 identification tasks in ComParE 2021
- Evaluating semantic similarity methods to build semantic predictability norms of reading data
- Brazilian portuguese speech recognition using Wav2vec 2.0
Informações sobre o DOI: 10.21437/Interspeech.2023-496 (Fonte: oaDOI API)
Download do texto completo
| Tipo | Nome | Link | |
|---|---|---|---|
| 3156016.pdf |
How to cite
A citação é gerada automaticamente e pode não estar totalmente de acordo com as normas
