Q Learning - epsilon greedy + temporal difference Off policy (Wall Following)
This package implements Q Learning Algorithm using epsilon greedy and temporal difference update for a robot in a simulated environment. It includes scripts for training and testing the Q-learning algorithm, as well as launch files for running the simulation in training and testing modes.
Видео Q Learning - epsilon greedy + temporal difference Off policy (Wall Following) канала Shashank Walke
Видео Q Learning - epsilon greedy + temporal difference Off policy (Wall Following) канала Shashank Walke
Комментарии отсутствуют
Информация о видео
25 февраля 2025 г. 7:58:19
00:03:50
Другие видео канала