Changes

Temporal Difference Learning

23 bytes added, 10:00, 19 May 2018

m

no edit summary

Each prediction is a single number, derived from a formula using adjustable weights of features, for instance a [[Neural Networks|neural network]] most simply a single neuron [[Neural Networks#Perceptron|perceptron]], that is a linear [[Evaluation|evaluation]] function ...

[[File:TDLForula1.jpg|none|text-bottom]]

[[FILE:sigDeri.jpg|right|thumb|Sigmoid and Derivative]]

... with the [[Pawn Advantage, Win Percentage, and Elo|pawn advantage]] converted to a [[Pawn Advantage, Win Percentage, and Elo|winning probability]] by the standard [https://en.wikipedia.org/wiki/Sigmoid_function sigmoid squashing function], also topic in [[Automated Tuning#LogisticRegression|logistic regression]] in the domain of [[Supervised Learning|supervised learning]] and [[Automated Tuning|automated tuning]], ...

[[File:TDLForula2.jpg|none|text-bottom]]

GerdIsenberg

Bureaucrats, Administrators

25,161

edits

Changes

Temporal Difference Learning

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools