Example: air traffic controller
17 Mirror Descent
stance, for minimizing linear functions over the probability simplex Dn, we saw in §16.4.1 that the generic gradient descent algorithm does significantly worse than the specialized Hedge algorithm.Show that not only the analysis but the algorithm is bad.This suggests ask-ing: can we somehow change gradient descent to adapt to the “geometry ...
Tags:
Information
Domain:
Source:
Link to this page: