Deep learning approaches for speech synthesis and speaker verification (2021)
- Authors:
- USP affiliated authors: ALUISIO, SANDRA MARIA - ICMC ; CASANOVA, EDRESSON - ICMC
- Unidade: ICMC
- DOI: 10.11606/9786587596198
- Subjects: APRENDIZADO COMPUTACIONAL; SÍNTESE DE FALA
- Keywords: Speech Technologies; Speech Synthesis; Speaker Verification; Deep Learning approaches.
- Language: Inglês
- Imprenta:
- Source:
- Título: Acoustic communication : an interdisciplinary approach
- Volume/Número/Paginação/Ano: 210 p
- Este periódico é de acesso aberto
- Este artigo é de acesso aberto
- URL de acesso aberto
- Cor do Acesso Aberto: gold
- Licença: cc-by-nc-sa
-
ABNT
CASANOVA, Edresson e SHULBY, Christopher Dane e ALUÍSIO, Sandra Maria. Deep learning approaches for speech synthesis and speaker verification. Acoustic communication : an interdisciplinary approach. Tradução . São Paulo: IP-USP, 2021. . Disponível em: https://doi.org/10.11606/9786587596198. Acesso em: 14 nov. 2024. -
APA
Casanova, E., Shulby, C. D., & Aluísio, S. M. (2021). Deep learning approaches for speech synthesis and speaker verification. In Acoustic communication : an interdisciplinary approach. São Paulo: IP-USP. doi:10.11606/9786587596198 -
NLM
Casanova E, Shulby CD, Aluísio SM. Deep learning approaches for speech synthesis and speaker verification [Internet]. In: Acoustic communication : an interdisciplinary approach. São Paulo: IP-USP; 2021. [citado 2024 nov. 14 ] Available from: https://doi.org/10.11606/9786587596198 -
Vancouver
Casanova E, Shulby CD, Aluísio SM. Deep learning approaches for speech synthesis and speaker verification [Internet]. In: Acoustic communication : an interdisciplinary approach. São Paulo: IP-USP; 2021. [citado 2024 nov. 14 ] Available from: https://doi.org/10.11606/9786587596198 - Evaluating sentence segmentation in different datasets of neuropsychological language tests in brazilian portuguese
- SC-GlowTTS: an efficient zero-shot multi-speaker text-to-speech model
- ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion
- Evaluating semantic similarity methods to build semantic predictability norms of reading data
- TTS applied to the generation of datasets for automatic speech recognition
- Speech2Phone: a novel and efficient method for training speaker recognition models
- TTS-portuguese corpus: a corpus for speech synthesis in brazilian portuguese
- Desenvolvimento de um modelo de reconhecimento de voz para o português brasileiro com poucos dados utilizando o Wav2vec 2.0
- BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus
- Brazilian portuguese speech recognition using Wav2vec 2.0
Informações sobre o DOI: 10.11606/9786587596198 (Fonte: oaDOI API)
Download do texto completo
Tipo | Nome | Link | |
---|---|---|---|
3077149.pdf | Direct link |
How to cite
A citação é gerada automaticamente e pode não estar totalmente de acordo com as normas