Example: biology
Deterministic Policy Gradient Algorithms

Deterministic Policy Gradient Algorithms

Back to document page

(ajs) = P[ajs; ] that stochastically selects action ain state saccording to parameter vector . Policy gradient algorithms typically proceed by sampling this stochastic policy and adjusting the policy parameters in the direction of greater cumulative reward.

  Deterministic

Download Deterministic Policy Gradient Algorithms


Information

Domain:

Source:

Link to this page:

Please notify us if you found a problem with this document:

Other abuse

Advertisement

Related search queries