Hamid Reza Maei
Revision as of 14:03, 12 April 2021 by GerdIsenberg (talk | contribs) (Created page with "'''Home * People * Hamid Reza Maei''' FILE:HamidRezaMaei.jpg|border|right|thumb|link=https://lifeboat.com/ex/bios.hamid.reza.maei | Hamid Reza Maei <ref>...")
Home * People * Hamid Reza Maei
Hamid Reza Maei,
an Iranian born physicist, computer scientist and software engineer. He holds a Ph.D. in 2011 from University of Alberta
[2],
where he was member of the reinforcement learning and artificial intelligence (RLAI) group
[3] at the Department of Computing Science to create reinforcement learning algorithms for large-scale problems and was involved in the development of a new family of temporal-difference learning algorithms suitable for value function approximation.
The goal of these algorithms was to bring us closer to the development of a universal prediction learning algorithm suitable for learning experientially grounded knowledge of the world [4].
Selected Publications
2008 ...
- Richard Sutton, Csaba Szepesvári, Hamid Reza Maei (2008). A Convergent O(n) Algorithm for Off-policy Temporal-difference Learning with Linear Function Approximation. NIPS 2008, pdf
- Hamid Reza Maei, Csaba Szepesvári, Shalabh Bhatnagar, Doina Precup, David Silver, Richard Sutton (2009). Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation. NIPS 2009, pdf
- Richard Sutton, Hamid Reza Maei, Doina Precup, Shalabh Bhatnagar, David Silver, Csaba Szepesvári, Eric Wiewiora. (2009). Fast Gradient-Descent Methods for Temporal-Difference Learning with Linear Function Approximation. ICML 2009
2010 ...
- Hamid Reza Maei, Csaba Szepesvári, Shalabh Bhatnagar, Richard Sutton (2010). Toward Off-Policy Learning Control with Function Approximation. ICML 2010, pdf
- Hamid Reza Maei, Richard Sutton (2010). GQ(λ): A general gradient algorithm for temporal-difference prediction learning with eligibility traces. AGI 2010
- Hamid Reza Maei (2011). Gradient Temporal-Difference Learning Algorithms. Ph.D. thesis, University of Alberta, advisor Richard Sutton
- Zheng Wen, Daniel O'Neill, Hamid Reza Maei (2014). Optimal Demand Response Using Device Based Reinforcement Learning. arXiv:1401.1549
- Hamid Reza Maei (2018). Convergent Actor-Critic Algorithms Under Off-Policy Training and Function Approximation. arXiv:1802.07842
External Links
References
- ↑ Lifeboat Foundation Bios: Hamid Reza Maei
- ↑ Hamid Reza Maei (2011). Gradient Temporal-Difference Learning Algorithms. Ph.D. thesis, University of Alberta, advisor Richard Sutton
- ↑ Reinforcement Learning and Artificial Intelligence (RLAI)
- ↑ Lifeboat Foundation Bios: Hamid Reza Maei
- ↑ dblp: Hamid Reza Maei