Hamid Reza Maei

Home * People * Hamid Reza Maei

[[FILE:HamidRezaMaei.jpg|border|right|thumb|link=https://lifeboat.com/ex/bios.hamid.reza.maei
 * Hamid Reza Maei ]]

Hamid Reza Maei, an Iranian born physicist, computer scientist and software engineer. He holds a Ph.D. in 2011 from University of Alberta , where he was member of the reinforcement learning and artificial intelligence (RLAI) group at the Department of Computing Science to create reinforcement learning algorithms for large-scale problems and was involved in the development of a new family of temporal-difference learning algorithms suitable for value function approximation. The goal of these algorithms was to bring us closer to the development of a universal prediction learning algorithm suitable for learning experientially grounded knowledge of the world.

=Selected Publications=

2008 ...

 * Richard Sutton, Csaba Szepesvári, Hamid Reza Maei (2008). A Convergent O(n) Algorithm for Off-policy Temporal-difference Learning with Linear Function Approximation. NIPS 2008, pdf
 * Hamid Reza Maei, Csaba Szepesvári, Shalabh Bhatnagar, Doina Precup, David Silver, Richard Sutton (2009). Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation. NIPS 2009, pdf
 * Richard Sutton, Hamid Reza Maei, Doina Precup, Shalabh Bhatnagar, David Silver, Csaba Szepesvári, Eric Wiewiora. (2009). Fast Gradient-Descent Methods for Temporal-Difference Learning with Linear Function Approximation. ICML 2009

2010 ...

 * Hamid Reza Maei, Csaba Szepesvári, Shalabh Bhatnagar, Richard Sutton (2010). Toward Off-Policy Learning Control with Function Approximation. ICML 2010, pdf
 * Hamid Reza Maei, Richard Sutton (2010). GQ(λ): A general gradient algorithm for temporal-difference prediction learning with eligibility traces. AGI 2010
 * Hamid Reza Maei (2011). Gradient Temporal-Difference Learning Algorithms. Ph.D. thesis, University of Alberta, advisor Richard Sutton
 * Zheng Wen, Daniel O'Neill, Hamid Reza Maei (2014). Optimal Demand Response Using Device Based Reinforcement Learning. arXiv:1401.1549
 * Hamid Reza Maei (2018). Convergent Actor-Critic Algorithms Under Off-Policy Training and Function Approximation. arXiv:1802.07842

=External Links=
 * Hamid Maei‬ - ‪Google Scholar‬
 * Lifeboat Foundation Bios: Hamid Reza Maei

=References= Up one level