List: Reinforcement Learning | Curated by Mauricio Arancibia | Medium

Mauricio Arancibia

Jul 6, 2022

9 stories

Reinforcement Learning

This story is no longer available

Arthur Juliani

Simple Reinforcement Learning with Tensorflow Part 0: Q-Learning with Tables and Neural Networks

For this tutorial in my Reinforcement Learning series, we are going to be exploring a family of RL algorithms called Q-Learning algorithms…

Aug 25, 2016

In HackerNoon.com by Rudy Gilman

Intuitive RL: Intro to Advantage-Actor-Critic (A2C)

Reinforcement learning (RL) practitioners have produced a number of excellent tutorials. Most, however, describe RL in terms of…

Jan 9, 2018

Markel Sanz Ausin

Introduction to Reinforcement Learning. Part 5: Policy Gradients (original Spanish title: Introducción al aprendizaje por refuerzo. Parte 5: políticas de gradiente)

Policy gradient algorithms, with runnable code and a mathematical derivation. Artificial intelligence.

Nov 25, 2020

Markel Sanz Ausin

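Introduction to Reinforcement Learning. Part 3: Q-Learning with Neural Networks, the DQN Algorithm (original Spanish title: Introducción al aprendizaje por refuerzo. Parte 3: Q-Learning con redes neuronales, algoritmo DQN.)

In Part 2 we saw that the Q-Learning algorithm works very well when the environment is simple and the function Q(s,a) can be represented…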

Apr 3, 2020

Markel Sanz Ausin

Introduction to Reinforcement Learning. Part 4: Double DQN and Dueling DQN (original Spanish title: Introducción al aprendizaje por refuerzo. Parte 4: Double DQN y Dueling DQN.)

In Part 3 we saw how the DQN algorithm works and how it can learn to solve complex problems. In this part…

Apr 14, 2020

Markel Sanz Ausin

Introduction to Reinforcement Learning. Part 2: Q-Learning (original Spanish title: Introducción al aprendizaje por refuerzo. Parte 2: Q-Learning.)

In Part 1 we described the multi-armed bandit problem and introduced several concepts, such as the state, the action, the…

Mar 29, 2020

Markel Sanz Ausin

Introduction to Reinforcement Learning. Part 1: The Multi-Armed Bandit Problem (original Spanish title: Introducción al aprendizaje por refuerzo. Parte 1: el problema del bandido multibrazo.)

Artificial intelligence with reinforcement learning for the multi-armed bandit problem

Mar 22, 2020

In Sigmoid by Rishabh Anand

A Brief Introduction to Markov Chains

A general guide on what makes the Markov Decision Process tick

Feb 27, 2019

Mauricio Arancibia

AI Engineer, Drummer, Lover of Science Fiction Reading. 🧠+🤖 Visit me at http://www.neuraldojo.org
