Changes

Newer edit →

István Szita

5,148 bytes added, 14:27, 23 October 2018

Created page with "'''Home * People * István Szita''' '''István Szita''',<br/> a Hungarian computer scientist and software engineer at Google Zurich. He holds a Ph.D. fr..."

'''[[Main Page|Home]] * [[People]] * István Szita'''

'''István Szita''',<br/>
a Hungarian computer scientist and software engineer at [[Google]] Zurich. He holds a Ph.D. from [https://en.wikipedia.org/wiki/E%C3%B6tv%C3%B6s_Lor%C3%A1nd_University Eötvös Loránd University] on [[Reinforcement Learning|reinforcement learning]] (RL) <ref>[[István Szita]] ('''2007'''). ''Rewarding Excursions: Extending Reinforcement Learning to Complex Domains''. Ph.D. thesis, supervisor [http://people.inf.elte.hu/lorincz/ András Lőrincz], [https://en.wikipedia.org/wiki/E%C3%B6tv%C3%B6s_Lor%C3%A1nd_University Eötvös Loránd University], [http://web.eotvos.elte.hu/szityu/papers/SzitaThesis.pdf pdf]</ref>,
and was postdoctoral researcher at [[Maastricht University]], the RL3 group <ref>[http://www.cs.rutgers.edu/rl3/index.html RL3, Rutgers Laboratory for Real-Life Reinforcement Learning]</ref> of [https://en.wikipedia.org/wiki/Rutgers_University Rutgers University], and the [[ University of Alberta]].
Beside RL, his research interests include [[Games|games]], and [https://en.wikipedia.org/wiki/Recurrent_neural_network recurrent neural nets].

=Selected Publications=
<ref>[http://web.eotvos.elte.hu/szityu/pubs.html István Szita - Publications]</ref> <ref>[https://dblp.uni-trier.de/pers/hd/s/Szita:Istv=aacute=n dblp: István Szita]</ref>
==2006 ...==
* [[István Szita]], [http://people.inf.elte.hu/lorincz/ András Lőrincz] ('''2006'''). ''Learning Tetris Using the Noisy Cross-Entropy Method''. Neural Computation 18, [http://web.eotvos.elte.hu/szityu/papers/SzitaLorincz05Learning.pdf pdf] <ref>[https://en.wikipedia.org/wiki/Tetris Tetris from Wikipedia]</ref>
* [[István Szita]] ('''2007'''). ''Rewarding Excursions: Extending Reinforcement Learning to Complex Domains''. Ph.D. thesis, supervisor [http://people.inf.elte.hu/lorincz/ András Lőrincz], [https://en.wikipedia.org/wiki/E%C3%B6tv%C3%B6s_Lor%C3%A1nd_University Eötvös Loránd University], [http://web.eotvos.elte.hu/szityu/papers/SzitaThesis.pdf pdf]
* [[Guillaume Chaslot]], [[Sander Bakkes]], [[István Szita]], [[Pieter Spronck]] ('''2008'''). ''Monte-Carlo Tree Search: A New Framework for Game AI''. [http://sander.landofsand.com/publications/AIIDE08_Chaslot.pdf pdf]
* [[Guillaume Chaslot]], [[Mark Winands]], [[István Szita]], [[Jaap van den Herik]]. ('''2008'''). ''Cross-entropy for Monte-Carlo Tree Search''. [[ICGA Journal#31_3|ICGA Journal, Vol. 31, No. 3]], [http://www.personeel.unimaas.nl/m-winands/documents/crossmc.pdf pdf]
* [[István Szita]], [[Marc Ponsen]], [[Pieter Spronck]] ('''2008'''). ''Keeping Adaptive Game AI interesting''. [http://www.cgamesusa.com/08/ CGames 2008], [http://web.eotvos.elte.hu/szityu/papers/SzitaPonsenSpronck08Interesting.pdf pdf draft], [http://ticc.uvt.nl/~pspronck/pubs/CGAMES08Szita.pdf pdf]
* [[István Szita]], [http://people.inf.elte.hu/lorincz/ András Lőrincz] ('''2008'''). ''The Many Faces of Optimism: a Unifying Approach''. [http://www.informatik.uni-trier.de/~ley/db/conf/icml/icml2008.html#SzitaL08 ICML 2008], [http://web.eotvos.elte.hu/szityu/papers/SzitaLorincz08Many.pdf pdf] <ref>[http://videolectures.net/icml08_szita_mfo/ The Many Faces of Optimism: a Unifying Approach - videolectures.net]</ref>
* [[István Szita]], [[Guillaume Chaslot]], [[Pieter Spronck]] ('''2009'''). ''Monte-Carlo Tree Search in Settlers of Catan''. [[Advances in Computer Games 12]], [http://ticc.uvt.nl/~pspronck/pubs/ACG12Szita.pdf pdf] <ref>[https://en.wikipedia.org/wiki/The_Settlers_of_Catan The Settlers of Catan from Wikipedia]</ref>
==2010 ...==
* [[István Szita]], [[Csaba Szepesvári]] ('''2010'''). ''Model-based reinforcement learning with nearly tight exploration complexity bounds''. [http://www.informatik.uni-trier.de/~ley/db/conf/icml/icml2010.html#SzitaS10 ICML 2010]
* [[István Szita]], [[Csaba Szepesvári]] ('''2011'''). ''Agnostic KWIK learning and efficient approximate reinforcement learning''. [http://www.informatik.uni-trier.de/~ley/db/journals/jmlr/jmlrp19.html#SzitaS11 Journal of Machine Learning Research - Proceedings Track 19]
* [[István Szita]] ('''2012'''). ''[http://link.springer.com/chapter/10.1007%2F978-3-642-27645-3_17 Reinforcement Learning in Games]''. in [[Marco Wiering]], [http://martijnvanotterlo.nl/ Martijn Van Otterlo] ('''2012'''). ''[https://scholar.google.com/citations?view_op=view_citation&hl=en&user=xVas0I8AAAAJ&citation_for_view=xVas0I8AAAAJ:abG-DnoFyZgC Reinforcement learning: State-of-the-art]''. [http://link.springer.com/book/10.1007/978-3-642-27645-3 Adaptation, Learning, and Optimization, Vol. 12], [https://en.wikipedia.org/wiki/Springer_Science%2BBusiness_Media Springer]
* [[Thomas J. Walsh]], [[István Szita]], [[Carlos Diuk]], [[Michael L. Littman]] ('''2012'''). ''Exploring compact reinforcement-learning representations with linear regression''. [https://arxiv.org/abs/1205.2606 arXiv:1205.2606]

=External Links=
* [https://www.facebook.com/szityu Istvan Szita | Facebook]
* [http://videolectures.net/istvan_szita/ Istvan Szita - Eötvös Loránd University - videolectures.net]

=References=
<references />
'''[[People|Up one level]]'''

GerdIsenberg

Bureaucrats, Administrators

25,161

edits

Changes

István Szita

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools