Search results with tag "Soft actor critic"
Soft Actor-Critic: Off-Policy Maximum Entropy Deep ...
proceedings.mlr.press
... sensitivity (Duan et al., 2016; Henderson et al., 2017). We explore how to design an efficient and stable model-free deep RL algorithm for continuous state and action spaces. To that end, we draw on the maximum entropy framework, which augments the standard maximum reward ...