Example: air traffic controller
Probability Amp Stochastic Processes
Found 1 free book(s)Reinforcement Learning: Theory and Algorithms
rltheorybook.github.ioA transition function P: SA! ( S), where ( S) is the space of probability distributions over S(i.e., the probability simplex). P(s0js;a) is the probability of transitioning into state s0upon taking action ain state s. We use P s;ato denote the vector P( s;a). A reward function r: SA!