An end-to-end deep learning approach for video captioning through mobile devices (2024)
- USP affiliated authors: CESAR JUNIOR, ROBERTO MARCONDES - IME ; DAMACENO, RAFAEL JEFERSON PEZZUTO - IME
- Unit: IME
- DOI: 10.1007/978-3-031-49018-7_51
- Subjects: COMPUTER VISION; DEEP LEARNING
- Keywords: Video captioning; Mobile device
- Language: English
- Source:
- Title: Lecture Notes in Computer Science
- ISSN: 0302-9743
- Volume/Number/Pages/Year: v. 14469, p. 715-729, 2024
- Conference titles: Iberoamerican Congress on Pattern Recognition - CIARP
- This journal is subscription-based
- This article is NOT open access
- Open access color: closed
ABNT
DAMACENO, Rafael Jeferson Pezzuto and CÉSAR JÚNIOR, Roberto Marcondes. An end-to-end deep learning approach for video captioning through mobile devices. Lecture Notes in Computer Science. Cham: Springer, 2024. Available at: https://doi.org/10.1007/978-3-031-49018-7_51. Accessed: 27 Dec. 2025.
APA
Damaceno, R. J. P., & César Júnior, R. M. (2024). An end-to-end deep learning approach for video captioning through mobile devices. Lecture Notes in Computer Science. Cham: Springer. doi:10.1007/978-3-031-49018-7_51
NLM
Damaceno RJP, César Júnior RM. An end-to-end deep learning approach for video captioning through mobile devices [Internet]. Lecture Notes in Computer Science. 2024;14469:715-729. [cited 2025 Dec 27]. Available from: https://doi.org/10.1007/978-3-031-49018-7_51
Vancouver
Damaceno RJP, César Júnior RM. An end-to-end deep learning approach for video captioning through mobile devices [Internet]. Lecture Notes in Computer Science. 2024;14469:715-729. [cited 2025 Dec 27]. Available from: https://doi.org/10.1007/978-3-031-49018-7_51
- SideSeeing: a multimodal dataset and tools for sidewalk assessment
- A mobile device framework for video captioning using multimodal neural networks
- Tactile path guidance via weakly supervised visual attention
- Video cropping using salience maps: a case study on a sidewalk dataset
- Towards a method for evaluating bus stop infrastructure with street level images and large language models
- Computação e inovação: ampliando fronteiras para solução de desafios no Brasil
- A Fourier-wavelet representation of 2-D shapes: sexual dimorphism in the Japanese cranial base
- Segmentation of similar images using graph matching and community detection
- ISMM 2007 special issue
- On the ternary spatial relation "between"
Information about the DOI: 10.1007/978-3-031-49018-7_51 (Source: oaDOI API)
How to cite
The citation is generated automatically and may not fully comply with the standards
