Example: air traffic controller

Asynchronous Methods For Deep Reinforcement Learning

Found 2 free book(s)
A arXiv:1611.01578v2 [cs.LG] 15 Feb 2017

A arXiv:1611.01578v2 [cs.LG] 15 Feb 2017

arxiv.org

asynchronous parameter updates in order to speed up the learning process of the controller (Dean et al., 2012). We use a parameter-server scheme where we have a parameter server of Sshards, that store the shared parameters for Kcontroller replicas. Each controller replica samples mdifferent child architectures that are trained in parallel.

  Learning, Asynchronous

Python code for Artificial Intelligence: Foundations of ...

Python code for Artificial Intelligence: Foundations of ...

artint.info

1 Python code for Artificial Intelligence: Foundations of Computational Agents David L. Poole and Alan K. Mackworth Version 0.9.0 of July 2, 2021.

Similar queries