Search results with tag "Gradient methods for reinforcement learning"
Policy Gradient Methods for Reinforcement Learning with ...
homes.cs.washington.edu
Richard S. Sutton, David McAllester, Satinder Singh, Yishay Mansour. AT&T Labs - Research, 180 Park Avenue, Florham Park, NJ 07932.
Abstract: Function approximation is essential to reinforcement learning, but the standard approach of approximating a value function and determining a policy from it has so far proven theoretically intractable.