Example: stock market

Search results with tag "Soft actor critic"

Soft Actor-Critic: Off-Policy Maximum Entropy Deep ...

Soft Actor-Critic: Off-Policy Maximum Entropy Deep ...

proceedings.mlr.press

sensitivity (Duan et al.,2016;Henderson et al.,2017). We explore how to design an efficient and stable model-free deep RL algorithm for continuous state and action spaces. To that end, we draw on the maximum entropy framework, which augments the standard maximum reward

  States, 2017, Soft, Citric, Henderson, Actors, Soft actor critic

Similar queries