Changes

SPSA

430 bytes added, 17:36, 5 September 2020
* [[Levente Kocsis]], [[Csaba Szepesvári]], [[Mark Winands]] ('''2005'''). ''[http://link.springer.com/chapter/10.1007/11922155_4 RSPSA: Enhanced Parameter Optimization in Games]''. [[Advances in Computer Games 11]], [http://www.sztaki.hu/~szcsaba/papers/rspsa_acg.pdf pdf], [https://dke.maastrichtuniversity.nl/m.winands/documents/RSPSA.pdf pdf]
* [[Levente Kocsis]], [[Csaba Szepesvári]] ('''2006'''). ''[http://link.springer.com/article/10.1007/s10994-006-6888-8 Universal Parameter Optimisation in Games Based on SPSA]''. [https://en.wikipedia.org/wiki/Machine_Learning_%28journal%29 Machine Learning], Special Issue on Machine Learning and Games, Vol. 63, No. 3, [http://www.sztaki.hu/~szcsaba/papers/kocsis_acg05.pdf pdf]
* [https://dblp.org/pid/70/382.html Mohammed Shahid Abdulla], [[Shalabh Bhatnagar]] ('''2007'''). ''[https://link.springer.com/article/10.1007/s10626-006-0003-y Reinforcement Learning Based Algorithms for Average Cost Markov Decision Processes]''. [https://www.springer.com/journal/10626 Discrete Event Dynamic Systems], Vol. 17, No. 1
* [http://dblp.uni-trier.de/pers/hd/m/Maryak:John_L= John L. Maryak], [http://dblp.uni-trier.de/pers/hd/c/Chin:Daniel_C= Daniel C. Chin] ('''2008'''). ''Global Random Optimization by Simultaneous Perturbation Stochastic Approximation''. [[IEEE#TAC|IEEE Transactions on Automatic Control]], Vol. 53, No. 3, [http://www.jhuapl.edu/spsa/PDF-SPSA/Maryak_Chin_IEEETAC08.pdf pdf]
* [http://dblp.uni-trier.de/pers/hd/s/Song_0001:Qing Qing Song], [[James C. Spall]], [http://dblp.uni-trier.de/pers/hd/s/Soh:Yeng_Chai Yeng Chai Soh], [http://dblp.uni-trier.de/pers/hd/n/Ni:Jie Jie Ni] ('''2008'''). ''Robust Neural Network Tracking Controller Using Simultaneous Perturbation Stochastic Approximation''. [[IEEE#NN|IEEE Transactions on Neural Networks]], Vol. 19, No. 5, [https://pdfs.semanticscholar.org/3f2a/4d69ca8adbbc072d82af58b3c750621d36ab.pdf 2003 pdf] » [[Neural Networks]]
==2010 ...==
* [[Shalabh Bhatnagar]], [https://dblp.org/pid/31/10493.html H. L. Prasad], [https://scholar.google.co.in/citations?user=Q1YXWpoAAAAJ&hl=en L.A. Prashanth] ('''2013'''). ''[https://link.springer.com/book/10.1007/978-1-4471-4285-0 Stochastic Recursive Algorithms for Optimization: Simultaneous Perturbation Methods]''. [https://www.springer.com/series/642 Lecture Notes in Control and Information Sciences], Vol. 434, [https://en.wikipedia.org/wiki/Springer_Science%2BBusiness_Media Springer]
* [[Qi Wang]] ('''2013'''). ''[https://jscholarship.library.jhu.edu/handle/1774.2/36955 Optimization with Discrete Simultaneous Perturbation Stochastic Approximation Using Noisy Loss Function Measurements]''. Ph.D. thesis, [https://en.wikipedia.org/wiki/Johns_Hopkins_University Johns Hopkins University], advisor [[James C. Spall]]
* [https://scholar.google.com/citations?user=nqDASHMAAAAJ Pushpendre Rastogi], [http://dblp.uni-trier.de/pers/hd/z/Zhu:Jingyi Jingyi Zhu], [[James C. Spall]] ('''2016'''). ''Efficient implementation of Enhanced Adaptive Simultaneous Perturbation Algorithms''. [http://dblp.uni-trier.de/db/conf/ciss/ciss2016.html#RastogiZS16 CISS 2016], [http://www.cs.jhu.edu/~prastog3/res/rastogi2016efficient.pdf pdf]