Changes

Jump to: navigation, search

Temporal Difference Learning

23 bytes added, 10:00, 19 May 2018
m
no edit summary
Each prediction is a single number, derived from a formula using adjustable weights of features, for instance a [[Neural Networks|neural network]] most simply a single neuron [[Neural Networks#Perceptron|perceptron]], that is a linear [[Evaluation|evaluation]] function ...
[[File:TDLForula1.jpg|none|text-bottom]]
[[FILE:sigDeri.jpg|right|thumb|Sigmoid and Derivative]]
... with the [[Pawn Advantage, Win Percentage, and Elo|pawn advantage]] converted to a [[Pawn Advantage, Win Percentage, and Elo|winning probability]] by the standard [https://en.wikipedia.org/wiki/Sigmoid_function sigmoid squashing function], also topic in [[Automated Tuning#LogisticRegression|logistic regression]] in the domain of [[Supervised Learning|supervised learning]] and [[Automated Tuning|automated tuning]], ...
[[File:TDLForula2.jpg|none|text-bottom]]

Navigation menu