Example: dental hygienist
Asynchronous Methods for Deep Reinforcement Learning
ment learning methods with linear function approximation. Parallelism was used to speed up large matrix operations but not to parallelize the collection of experience or sta-bilize learning. (Grounds & Kudenko,2008) proposed a parallel version of the Sarsa algorithm that uses multiple separate actor-learners to accelerate training. Each actor-
Download Asynchronous Methods for Deep Reinforcement Learning
Information
Domain:
Source:
Link to this page: