Changes

Jump to: navigation, search

Hamid Reza Maei

4,448 bytes added, 14:03, 12 April 2021
Created page with "'''Home * People * Hamid Reza Maei''' FILE:HamidRezaMaei.jpg|border|right|thumb|link=https://lifeboat.com/ex/bios.hamid.reza.maei | Hamid Reza Maei <ref>..."
'''[[Main Page|Home]] * [[People]] * Hamid Reza Maei'''

[[FILE:HamidRezaMaei.jpg|border|right|thumb|link=https://lifeboat.com/ex/bios.hamid.reza.maei
| Hamid Reza Maei <ref>[https://lifeboat.com/ex/bios.hamid.reza.maei Lifeboat Foundation Bios: Hamid Reza Maei]</ref> ]]

'''Hamid Reza Maei''',<br/>
an Iranian born physicist, computer scientist and software engineer. He holds a Ph.D. in 2011 from [[University of Alberta]]
<ref>[[Hamid Reza Maei]] ('''2011'''). ''[https://era.library.ualberta.ca/items/fd55edcb-ce47-4f84-84e2-be281d27b16a Gradient Temporal-Difference Learning Algorithms]''. Ph.D. thesis, [[University of Alberta]], advisor [[Richard Sutton]]</ref>,
where he was member of the [[Reinforcement Learning|reinforcement learning]] and [[Artificial Intelligence|artificial intelligence]] (RLAI) group
<ref>[http://rlai.ualberta.ca/ Reinforcement Learning and Artificial Intelligence (RLAI)]</ref> at the ''Department of Computing Science'' to create reinforcement learning algorithms for large-scale problems and was involved in the development of a new family of [[Temporal Difference Learning|temporal-difference learning]] algorithms suitable for value function approximation.
The goal of these algorithms was to bring us closer to the development of a universal prediction learning algorithm suitable for learning experientially grounded knowledge of the world <ref>[https://lifeboat.com/ex/bios.hamid.reza.maei Lifeboat Foundation Bios: Hamid Reza Maei]</ref>.

=Selected Publications=
<ref>[http://dblp.uni-trier.de/pers/hd/m/Maei:Hamid_Reza dblp: Hamid Reza Maei]</ref>
==2008 ...==
* [[Richard Sutton]], [[Csaba Szepesvári]], [[Hamid Reza Maei]] ('''2008'''). ''A Convergent O(n) Algorithm for Off-policy Temporal-difference Learning with Linear Function Approximation''. [https://dblp.uni-trier.de/db/conf/nips/nips2008.html#SuttonSM08 NIPS 2008], [https://proceedings.neurips.cc/paper/2008/file/e0c641195b27425bb056ac56f8953d24-Paper.pdf pdf]
* [[Hamid Reza Maei]], [[Csaba Szepesvári]], [[Shalabh Bhatnagar]], [[Doina Precup]], [[David Silver]], [[Richard Sutton]] ('''2009'''). ''Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation''. [https://dblp.uni-trier.de/db/conf/nips/nips2009.html#MaeiSBPSS09 NIPS 2009], [https://papers.nips.cc/paper/2009/file/3a15c7d0bbe60300a39f76f8a5ba6896-Paper.pdf pdf]
* [[Richard Sutton]], [[Hamid Reza Maei]], [[Doina Precup]], [[Shalabh Bhatnagar]], [[David Silver]], [[Csaba Szepesvári]], [[Eric Wiewiora]]. ('''2009'''). ''[https://dl.acm.org/doi/10.1145/1553374.1553501 Fast Gradient-Descent Methods for Temporal-Difference Learning with Linear Function Approximation]''. [https://dblp.uni-trier.de/db/conf/icml/icml2009.html#SuttonMPBSSW09 ICML 2009]
==2010 ...==
* [[Hamid Reza Maei]], [[Csaba Szepesvári]], [[Shalabh Bhatnagar]], [[Richard Sutton]] ('''2010'''). ''Toward Off-Policy Learning Control with Function Approximation''. [https://dblp.uni-trier.de/db/conf/icml/icml2010.html#MaeiSBS10 ICML 2010], [https://icml.cc/Conferences/2010/papers/627.pdf pdf]
* [[Hamid Reza Maei]], [[Richard Sutton]] ('''2010'''). ''[https://www.researchgate.net/publication/215990384_GQlambda_A_general_gradient_algorithm_for_temporal-difference_prediction_learning_with_eligibility_traces GQ(λ): A general gradient algorithm for temporal-difference prediction learning with eligibility traces]''. [https://agi-conf.org/2010/ AGI 2010]
* [[Hamid Reza Maei]] ('''2011'''). ''[https://era.library.ualberta.ca/items/fd55edcb-ce47-4f84-84e2-be281d27b16a Gradient Temporal-Difference Learning Algorithms]''. Ph.D. thesis, [[University of Alberta]], advisor [[Richard Sutton]]
* [https://dblp.uni-trier.de/pid/96/780.html Zheng Wen], [https://dblp.uni-trier.de/pid/26/209.html Daniel O'Neill], [[Hamid Reza Maei]] ('''2014'''). ''Optimal Demand Response Using Device Based Reinforcement Learning''. [https://arxiv.org/abs/1401.1549 arXiv:1401.1549]
* [[Hamid Reza Maei]] ('''2018'''). ''Convergent Actor-Critic Algorithms Under Off-Policy Training and Function Approximation''. [https://arxiv.org/abs/1802.07842 arXiv:1802.07842]

=External Links=
* [https://scholar.google.com/citations?user=BXUTo4AAAAAJ Hamid Maei‬ - ‪Google Scholar‬]
* [https://lifeboat.com/ex/bios.hamid.reza.maei Lifeboat Foundation Bios: Hamid Reza Maei]

=References=
<references />
'''[[People|Up one level]]'''
[[Category:Physicist|Maei]]
[[Category:Researcher|Maei]]

Navigation menu