Search results with tag "Dif ference learning"

Deep Reinforcement Learning with Double Q-learning - arXiv

arxiv.org

using Q-learning (Watkins, 1989), a form of temporal difference learning (Sutton, 1988). Most interesting problems are too large to learn all action values in all states separately. Instead, we can learn a parameterized value function Q(s, a; θ_t). The standard Q-learning update for the parameters after taking action A_t in state S_t and ...

Learning, Double, Double q learning, Ference, Dif ference learning
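
The excerpt above is cut off before it states the update itself. As a rough illustration only, the sketch below shows the textbook semi-gradient Q-learning step for a parameterized value function, assuming a linear form Q(s, a; θ) = θ[a] · φ(s). The function names, the feature vectors, and the default step size and discount are hypothetical choices made for this example and are not taken from the paper.

```python
from typing import List


def q_value(theta: List[List[float]], features: List[float], action: int) -> float:
    """Linear action value: dot product of the action's weights with the state features."""
    return sum(w * x for w, x in zip(theta[action], features))


def q_learning_update(
    theta: List[List[float]],
    features: List[float],
    action: int,
    reward: float,
    next_features: List[float],
    alpha: float = 0.1,   # step size (assumed value)
    gamma: float = 0.99,  # discount factor (assumed value)
    terminal: bool = False,
) -> float:
    """Apply one semi-gradient Q-learning step in place and return the TD error."""
    # Target: immediate reward plus the discounted greedy value of the next state.
    target = reward
    if not terminal:
        target += gamma * max(
            q_value(theta, next_features, a) for a in range(len(theta))
        )
    td_error = target - q_value(theta, features, action)
    # For a linear Q, the gradient with respect to theta[action] is the feature vector.
    theta[action] = [
        w + alpha * td_error * x for w, x in zip(theta[action], features)
    ]
    return td_error
```

Only the weights of the action actually taken change in this linear setting, and the target is treated as a constant when the gradient is taken, which is why the step is usually called semi-gradient.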
