Example: tourism industry

Search results with tag "Gradient methods for reinforcement learning"

Policy Gradient Methods for Reinforcement Learning with ...

Policy Gradient Methods for Reinforcement Learning with ...

homes.cs.washington.edu

Richard S. Sutton, David McAllester, Satinder Singh, Yishay Mansour AT&T Labs { Research, 180 Park Avenue, Florham Park, NJ 07932 Abstract Function approximation is essential to reinforcement learning, but the standard approach of approximating a value function and deter-mining a policy from it has so far proven theoretically intractable.

  Methods, Learning, Reinforcement, Derating, Sutton, Gradient methods for reinforcement learning

Similar queries