Example: bachelor of science

CS 188: Artificial Intelligence Example: Grid World

If there is a wall in the direction the agent would have been taken, the agent stays put The agent receives rewards each time step Small “living” reward each step (can be negative) Big rewards come at the end (good or bad) Goal: maximize sum of (discounted) rewards Recap: MDPs Markov decision processes: States S

Tags:

  Intelligence, Walls, Artificial, Rewards, Artificial intelligence, Cs 188

Information

Domain:

Source:

Link to this page:

Please notify us if you found a problem with this document:

Other abuse

Advertisement

Transcription of CS 188: Artificial Intelligence Example: Grid World

Related search queries