Example: tourism industry
Search results with tag "Maximum entropy inverse reinforcement learning"
Maximum Entropy Inverse Reinforcement Learning
www.aaai.orgRecovering the agent’s exact reward weights is an ill-posed problem; many reward weights, including degenera-cies (e.g., all zeroes), make demonstrated trajectories opti-mal. Ratliff, Bagnell, & Zinkevich (2006) cast this problem as one of structured maximum margin prediction (MMP). They consider a class of loss functions that directly measure