Speech2Phone: a novel and efficient method for training speaker recognition models (2021)
- Authors:
- USP affiliated authors: ALUISIO, SANDRA MARIA - ICMC ; PONTI, MOACIR ANTONELLI - ICMC ; CASANOVA, EDRESSON - ICMC
- Unidade: ICMC
- DOI: 10.1007/978-3-030-91699-2_39
- Subjects: PROCESSAMENTO DE LINGUAGEM NATURAL; RECONHECIMENTO DE VOZ; APRENDIZADO COMPUTACIONAL
- Keywords: Speaker verification; Speaker recognition; Speaker identification
- Agências de fomento:
- Language: Inglês
- Imprenta:
- Source:
- Título: Lecture Notes in Artificial Intelligence
- ISSN: 0302-9743
- Volume/Número/Paginação/Ano: v. 13074, p. 572-585, 2021
- Conference titles: Brazilian Conference on Intelligent Systems - BRACIS
- Este periódico é de assinatura
- Este artigo NÃO é de acesso aberto
- Cor do Acesso Aberto: closed
-
ABNT
CASANOVA, Edresson et al. Speech2Phone: a novel and efficient method for training speaker recognition models. Lecture Notes in Artificial Intelligence. Cham: Springer. Disponível em: https://doi.org/10.1007/978-3-030-91699-2_39. Acesso em: 28 dez. 2025. , 2021 -
APA
Casanova, E., Candido Junior, A., Shulby, C. D., Oliveira, F. S. de, Gris, L. R. S., Silva, H. P. da, et al. (2021). Speech2Phone: a novel and efficient method for training speaker recognition models. Lecture Notes in Artificial Intelligence. Cham: Springer. doi:10.1007/978-3-030-91699-2_39 -
NLM
Casanova E, Candido Junior A, Shulby CD, Oliveira FS de, Gris LRS, Silva HP da, Aluísio SM, Ponti MA. Speech2Phone: a novel and efficient method for training speaker recognition models [Internet]. Lecture Notes in Artificial Intelligence. 2021 ; 13074 572-585.[citado 2025 dez. 28 ] Available from: https://doi.org/10.1007/978-3-030-91699-2_39 -
Vancouver
Casanova E, Candido Junior A, Shulby CD, Oliveira FS de, Gris LRS, Silva HP da, Aluísio SM, Ponti MA. Speech2Phone: a novel and efficient method for training speaker recognition models [Internet]. Lecture Notes in Artificial Intelligence. 2021 ; 13074 572-585.[citado 2025 dez. 28 ] Available from: https://doi.org/10.1007/978-3-030-91699-2_39 - SC-GlowTTS: an efficient zero-shot multi-speaker text-to-speech model
- ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion
- TTS-portuguese corpus: a corpus for speech synthesis in brazilian portuguese
- TTS applied to the generation of datasets for automatic speech recognition
- YourTTS: towards zero-shot multi-speaker TTS and zero-shot voice conversion for everyone
- Deep learning approaches for speech synthesis and speaker verification
- Evaluating sentence segmentation in different datasets of neuropsychological language tests in brazilian portuguese
- Transfer learning and data augmentation techniques to the COVID-19 identification tasks in ComParE 2021
- Evaluating semantic similarity methods to build semantic predictability norms of reading data
- Brazilian portuguese speech recognition using Wav2vec 2.0
Informações sobre o DOI: 10.1007/978-3-030-91699-2_39 (Fonte: oaDOI API)
Download do texto completo
| Tipo | Nome | Link | |
|---|---|---|---|
| 3057253.pdf |
How to cite
A citação é gerada automaticamente e pode não estar totalmente de acordo com as normas
