Search results
- .../www.talkchess.com/forum/viewtopic.php?t=44165 Help with Best-First Select-Formula] by [[Srdja Matovic]], [[CCC]], June 23, 2012 ...nce], [https://www.allmusic.com/artist/chris-fletcher-mn0000384453/credits Chris Fletcher] 11 KB (1,526 words) - 14:42, 18 November 2021
- ...pled in all states and the action-values are represented discretely <ref>[[Chris Watkins]], [[Peter Dayan]] ('''1992'''). ''[http://www.gatsby.ucl.ac.uk/~da ... * [[Chris Watkins]] ('''1989'''). ''Learning from Delayed Rewards''. Ph.D. thesis, [h... 54 KB (7,025 words) - 12:47, 14 March 2022
- Each prediction is a single number, derived from a formula using adjustable weights of features, for instance a [[Neural Networks|neur ...orum3/viewtopic.php?f=7&t=77053 TD learning by self play (TD-Gammon)] by [[Chris Whittington]], [[CCC]], April 10, 2021 46 KB (6,248 words) - 13:59, 23 May 2021
- ...djust the amount of exploration and incorporates the sqrt(2) from the UCB1 formula ... The first component of the UCB1 formula above corresponds to exploitation, as it is high for moves with high averag... 25 KB (3,413 words) - 12:36, 23 October 2020
- Toga II '''1.4.1SE''' by [[Chris Formula]] <span id="1.4beta5c"></span>based on Toga II '''1.4beta5c''' by Thomas Ga... 12 KB (1,677 words) - 09:45, 7 October 2021