Second-Order Methods for Policy Search in Reinforcement Learning


  • Wrote a thesis aimed at improving the convergence rate and theoretical guarantees of policy search methods
  • Evaluated second-order methods such as Newton's method and approximations motivated by the natural gradient
  • Empirically compared algorithms that use the exact Hessian with those that use approximations of it motivated by Fisher information