Source: Proceedings. Conference titles: Brazilian Conference on Intelligent Systems - BRACIS. Unidade: IME
Subjects: MODELOS PARA PROCESSOS ESTOCÁSTICOS, APRENDIZADO COMPUTACIONAL
ABNT
LOVATTO, Ângelo Gregório e BUENO, Thiago Pereira e BARROS, Leliane Nunes de. Gradient estimation in model-based reinforcement learning: a study on linear quadratic environments. 2021, Anais.. Cham: Springer, 2021. Disponível em: https://doi.org/10.1007/978-3-030-91702-9_3. Acesso em: 06 nov. 2024.APA
Lovatto, Â. G., Bueno, T. P., & Barros, L. N. de. (2021). Gradient estimation in model-based reinforcement learning: a study on linear quadratic environments. In Proceedings. Cham: Springer. doi:10.1007/978-3-030-91702-9_3NLM
Lovatto ÂG, Bueno TP, Barros LN de. Gradient estimation in model-based reinforcement learning: a study on linear quadratic environments [Internet]. Proceedings. 2021 ;[citado 2024 nov. 06 ] Available from: https://doi.org/10.1007/978-3-030-91702-9_3Vancouver
Lovatto ÂG, Bueno TP, Barros LN de. Gradient estimation in model-based reinforcement learning: a study on linear quadratic environments [Internet]. Proceedings. 2021 ;[citado 2024 nov. 06 ] Available from: https://doi.org/10.1007/978-3-030-91702-9_3