Home * People * Hamid Reza Maei

Hamid Reza Maei ^[1]

Hamid Reza Maei,
an Iranian born physicist, computer scientist and software engineer. He holds a Ph.D. in 2011 from University of Alberta ^[2], where he was member of the reinforcement learning and artificial intelligence (RLAI) group ^[3] at the Department of Computing Science to create reinforcement learning algorithms for large-scale problems and was involved in the development of a new family of temporal-difference learning algorithms suitable for value function approximation. The goal of these algorithms was to bring us closer to the development of a universal prediction learning algorithm suitable for learning experientially grounded knowledge of the world ^[4].

Selected Publications

^[5]

2008 ...

Richard Sutton, Csaba Szepesvári, Hamid Reza Maei (2008). A Convergent O(n) Algorithm for Off-policy Temporal-difference Learning with Linear Function Approximation. NIPS 2008, pdf
Hamid Reza Maei, Csaba Szepesvári, Shalabh Bhatnagar, Doina Precup, David Silver, Richard Sutton (2009). Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation. NIPS 2009, pdf
Richard Sutton, Hamid Reza Maei, Doina Precup, Shalabh Bhatnagar, David Silver, Csaba Szepesvári, Eric Wiewiora. (2009). Fast Gradient-Descent Methods for Temporal-Difference Learning with Linear Function Approximation. ICML 2009

2010 ...

Hamid Reza Maei, Csaba Szepesvári, Shalabh Bhatnagar, Richard Sutton (2010). Toward Off-Policy Learning Control with Function Approximation. ICML 2010, pdf
Hamid Reza Maei, Richard Sutton (2010). GQ(λ): A general gradient algorithm for temporal-difference prediction learning with eligibility traces. AGI 2010
Hamid Reza Maei (2011). Gradient Temporal-Difference Learning Algorithms. Ph.D. thesis, University of Alberta, advisor Richard Sutton
Zheng Wen, Daniel O'Neill, Hamid Reza Maei (2014). Optimal Demand Response Using Device Based Reinforcement Learning. arXiv:1401.1549
Hamid Reza Maei (2018). Convergent Actor-Critic Algorithms Under Off-Policy Training and Function Approximation. arXiv:1802.07842

External Links

References

↑ Lifeboat Foundation Bios: Hamid Reza Maei
↑ Hamid Reza Maei (2011). Gradient Temporal-Difference Learning Algorithms. Ph.D. thesis, University of Alberta, advisor Richard Sutton
↑ Reinforcement Learning and Artificial Intelligence (RLAI)
↑ Lifeboat Foundation Bios: Hamid Reza Maei
↑ dblp: Hamid Reza Maei

Up one level

[1] Lifeboat Foundation Bios: Hamid Reza Maei

[2] Hamid Reza Maei (2011). Gradient Temporal-Difference Learning Algorithms. Ph.D. thesis, University of Alberta, advisor Richard Sutton

[3] Reinforcement Learning and Artificial Intelligence (RLAI)

[4] Lifeboat Foundation Bios: Hamid Reza Maei

[5] : Hamid Reza Maei

[1]

[2]

[3]

[4]

[5]

Hamid Reza Maei

Selected Publications

2008 ...

2010 ...

External Links

References

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools