Загрузка...

Q-Learning Agent Analysis - Reinforcement Learning p.3

Welcome to part 3 of the Reinforcement Learning series as well as part 3 of the Q learning parts. Up to this point, we've successfully made a Q-learning algorithm that navigates the OpenAI MountainCar environment. The issue now is, we have a lot of parameters here that we might want to tune. Being able to beat the game is one thing, but we might want to beat it quicker, and maybe even try to explore ways to learn faster. In order to do this, we need to start shedding some light onto what exactly we're doing.

Text-based tutorial and sample code: https://pythonprogramming.net/q-learning-analysis-reinforcement-learning-python-tutorial/

Channel membership: https://www.youtube.com/channel/UCfzlCWGWYyIQ0aLC5w48gBQ/join
Discord: https://discord.gg/sentdex
Support the content: https://pythonprogramming.net/support-donate/
Twitter: https://twitter.com/sentdex
Instagram: https://instagram.com/sentdex
Facebook: https://www.facebook.com/pythonprogramming.net/
Twitch: https://www.twitch.tv/sentdex

#reinforcementlearning #machinelearning #python

Видео Q-Learning Agent Analysis - Reinforcement Learning p.3 канала sentdex

Q-learning q learning reinforcement learning rl tutorial machine learning python

Комментарии отсутствуют

Информация о видео

5 июня 2019 г. 19:56:52

00:24:59

Правообладателям

Жалоба на материал Недопустимый материал Нарушение авторских прав

Комментарии

Поделиться

Другие видео канала

Kivy with Python tutorial Part 12 - Adding better Navigation

Visualizing Neural Network Internals

AI-Based Game Engines with GameGAN research paper

Workers and Pylons - Python AI in StarCraft II tutorial p.2

Python Plays GTA V: Reboot - Announcement

Neural Networks from Scratch (NNFS) in Print!

Running our Reinforcement Learning Agent - Self-driving cars with Carla and Python p.5

Can we simulate a real robot?

Building Neural Network Model - Python AI in StarCraft II tutorial p.10

Pickling and Scaling - Practical Machine Learning Tutorial with Python p.6

Basic Stock data Manipulation - Python Programming for Finance p.3

Determining other vehicle distances & collision warning (object detection) - Self Driving Cars p.18

[See Description] Programming for Finance with Python and Quantopian and Zipline Part 1

[See Description] Understanding Hedgefund and other financial Objectives - Python for Finance 13

Scikit Learn Machine Learning Tutorial for investing with Python p. 14

Generative Model Basics - Unconventional Neural Networks p.1

Optimizing Neural Network Structures with Keras-Tuner

Normalizing and creating sequences Crypto RNN - Deep Learning w/ Python, TensorFlow and Keras p.9

Adding a tutorial option - Tkinter tutorial Python 3.4 part 19

Testing VLMs and LLMs for robotics w/ the Jetson Thor devkit

Research - Algorithmic Trading with Python and Quantopian p. 4

Все заметки Новая заметка Страницу в заметки

Страницу в закладки Мои закладки

На информационно-развлекательном портале SALDA.WS применяются cookie-файлы. Нажимая кнопку Принять, вы подтверждаете свое согласие на их использование.

О CookiesНапомнить позжеПринять