Asynchronous Methods for Deep Reinforcement …

Asynchronous Methods for deep Reinforcement LearningVolodymyr Puigdom nech P. DeepMind2 Montreal Institute for learning Algorithms (MILA), University of MontrealAbstractWeproposeaconceptuallysi mpleandlightweight framework for deep reinforce-ment learning that uses Asynchronous gradientdescent for optimization of deep neural networkcontrollers. We present Asynchronous variants offour standard Reinforcement learning algorithmsand show that parallel actor-learners have astabilizing effect on training allowing all fourmethods to successfully train neural best performing method, anasynchronous variant of actor-critic, surpassesthe current state-of-the-art on the Atari domainwhile training for half the time on a singlemulti-core CPU instead of a GPU.

Asynchronous Methods for Deep Reinforcement Learning One way of propagating rewards faster is by using n-step returns (Watkins,1989;Peng & Williams,1996).

Fullscreen Download

Tags:

Methods, Learning, Deep, Propagating, Reinforcement, Asynchronous, Asynchronous methods for deep reinforcement, Asynchronous methods for deep reinforcement learning

Information

Domain:

Source:

Link to this page:

Please notify us if you found a problem with this document:

Spam in document Broken preview Other abuse

Transcription of Asynchronous Methods for Deep Reinforcement …

Related search queries

Plants from Specialized, Propagating, Basic Techniques for Propagating Plants, ToolingTooling, DuPont, Polyvinyl fluoride, International treaty on plant genetic resources, AN-1811 Bluetooth Antenna Design Rev

PDF4PRO ^⚡AMP

Modern search engine that looking for books and documents around the web

Asynchronous Methods for Deep Reinforcement …

Tags:

Information

Transcription of Asynchronous Methods for Deep Reinforcement …

Related search queries

Asynchronous Methods for Deep Reinforcement …

Tags:

Information

Documents from same domain

Related documents

Related search queries