Endgame Tablebases (EGTBs or EGTs) are precalculated endgame tables generated by an exhaustive retrograde analysis. During game play and/or search, a chess program that recognizes a specific material composition can probe these tables, or in principle compute them, to determine the outcome of positions definitively, so that they act as an oracle providing the optimal moves. The tables are usually divided into separate files for each material configuration and side to move. What all tablebase formats have in common is that every possible piece placement has its own unique index number, which gives the location in the file where the information about that position is stored. Positions with castling rights could be represented in an EGTB but typically are not.

The term Tablebase, as opposed to endgame database, was coined by Steven Edwards when he introduced his Edwards' Tablebases, which use a data vector and a single file access for both sides to move. They addressed several shortcomings of Ken Thompson's earlier databases, which covered only positions with the weaker, and hence potentially losing, side to move ^[2] .

Purposes

When a chess game reaches the endgame phase, it may soon finish as a draw or a win/loss. Both humans and chess engines can play out the endgame by continuing to search and applying some endgame rules. The game can be finished this way, but not perfectly: computing moves takes time, and the win may take more moves than necessary. Sometimes the winning side loses its chance because of the 50-move rule and/or blunders. Both humans and chess engines may badly miss their chances if they do not know the endgame they are playing well enough.

An EGTB can help in this phase. It is a kind of dictionary of all endgame positions, which can answer immediately for a given position:

Whether it is a draw or a win/loss
How far (how many moves) it is from mating/being mated
What the best move is

The main advantages of using EGTBs:

Engines don’t have to search but simply retrieve data, so they can move instantly
Moves are perfect, leading to the win by the shortest line
Engines don’t need knowledge of, or code for, those endgames

The main disadvantages of using EGTBs:

Their data are typically very large to download and store
Probing them may slow down the search
They require complicated code for probing

Data

An EGTB is a set of endgames. Each endgame is a set of records holding information about positions. All positions in an endgame must have the same material. Typically each position's record is an integer that tells how far that position is from mating/being mated or from conversion (depending on the metric).

Those integers directly answer two of the EGTB's questions: 1) whether a given position is a draw or a win/loss, and 2) how far it is from mating/being mated or from conversion. The third question, the best move in a position, can be answered indirectly: generate all legal moves, make each of them, probe the resulting positions, and compare the probed scores; the move leading to the best score is the best one.

Theoretically, more information could be added to each position’s record. However, because of the large number of positions involved, any additional information may make the whole EGTB significantly larger. So far, none of the popular EGTBs store any extra information; their records contain those integers only.

All records of an endgame are simply organized as two arrays (one array per side to move). In other words, each item in those arrays is an integer, and its index in the array maps to a unique position. For a given chess position, we calculate its index and then access its record (the integer) directly via that index, without searching.
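The two-array layout can be sketched as follows. This is only an illustration in Python: the names `make_endgame` and `probe`, the one-byte value encoding, and the table size of 64 * 64 are assumptions made for the example, not the format of any real EGTB.

```python
from array import array

# Hypothetical constants for this sketch: which array belongs to which side.
WHITE, BLACK = 0, 1

def make_endgame(num_positions):
    """Create two zero-filled arrays of unsigned bytes, one per side to move."""
    return [array("B", bytes(num_positions)) for _ in (WHITE, BLACK)]

def probe(endgame, side_to_move, index):
    """Return the stored integer for a position by direct array access."""
    return endgame[side_to_move][index]

# A toy endgame with 64 * 64 slots per side; store one value and read it back.
endgame = make_endgame(64 * 64)
endgame[WHITE][1234] = 7
print(probe(endgame, WHITE, 1234))
```

The point of the layout is that a probe is a single array access once the index is known; no search through the records is needed.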

Endgame files

Those arrays of an endgame can be stored in files. Typically each array is stored in one file. However, sometimes they are combined into one larger file or split into several files so that they are easier to copy or manage. The files may be compressed or uncompressed. If they are compressed, the data is typically divided into small blocks that are compressed separately, so that extracting a single position does not require loading and decompressing all the data.

In memory

Some chess engines generate EGTBs on the fly and keep all data in memory without saving them to or loading them from files. Because of the limited start-up time, they can generate and use only very small EGTBs.

Sizes

The size of an endgame depends on:

The number of pieces involved: the more pieces, the bigger the size
The types of pieces: in standard chess, the more Pawns, the smaller the size
The type of metric
The largest value: the integer must be large enough to hold the largest value. Different endgames have different largest values, so some endgames need only one byte per item while others need two
Indexing: the algorithm that maps between array items and positions
The compression ratio

So far, the size of an EGTB has been the most important factor in its success.
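To give a feel for how quickly the number of pieces dominates the size, here is a rough, uncompressed upper bound. It assumes the simplest possible indexing (64 squares per piece, two sides to move, no symmetry or legality reductions); `naive_size_bytes` is a hypothetical helper, and real EGTBs are far smaller thanks to better indexing and compression.

```python
def naive_size_bytes(num_pieces, bytes_per_item=1, sides=2):
    """Upper bound: every piece on any of 64 squares, one table per side to move."""
    return sides * (64 ** num_pieces) * bytes_per_item

for n in (3, 4, 5):
    print(n, naive_size_bytes(n))
# 3 524288
# 4 33554432
# 5 2147483648
```

Each additional piece multiplies this bound by 64, and moving from one to two bytes per item doubles it again.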

Metrics

DTM

Depth to Mate: 'Mate' is the ultimate goal and ends the game. For each position that is represented, 'DTM' indicates the theoretical value, and the number of winner's moves to 'Mate' if won/lost - assuming that the winner is minimizing and the loser is maximizing DTM.

DTC

Depth to Conversion: 'Conversion' is the changing of the force on the board, by capture and/or Pawn-conversion. For each position that is represented, 'DTC' indicates the theoretical value, and the number of winner's moves to the goal of 'Won Conversion' if won/lost - assuming that the winner is minimizing and the loser is maximizing DTC. DTC ≤ DTM.

DTZ

Depth to Zeroing (of the ply count): The ply-count is zeroed if and only if a Pawn is pushed or a capture is made. For each position that is represented, 'DTZ' indicates the theoretical value, and the number of winner's moves to the goal of 'Win and ply-count zeroed' if won/lost - assuming that the winner is minimizing and the loser is maximizing DTZ. DTZ ≤ DTC ≤ DTM.

DTZ50

Depth to Zeroing (of the ply count) in the context of the 50-Move Rule: The DTC, DTM and DTZ metrics do not consider the current FIDE 50-move rule whereby a game in which no irreversible moves have been played for 100 plies can be claimed as a draw by the player 'on move'. In this metric, in addition to the DTC/DTM/DTZ draws, positions requiring more than 50 winner's moves to zero the ply-count are defined as draws. DTZ ≤ DTZ50 if DTZ50 ≠ draw. 'DTZ50' is one of a set of 'DTZk' rules: a k-move rule effectively applies if there are only k moves left before a draw-claim might be made. k1 ≤ k2 implies DTZk2 ≤ DTZk1 if DTZk1 ≠ draw.

Note that, because depth is measured in winner's moves rather than plies, this metric does not define as draw a position in which 50 winner's moves and 51 loser's moves are required to zero the ply-count, despite the fact that the loser would claim a draw after 100 plies had been played in that endgame.

DTR

Depth by the (k-move) Rule: 'The Rule' referred to here is a notional set of k-move rules. For each position that is represented, 'DTR' indicates the theoretical value and, for won/lost positions, the least k for which DTZk is not draw - assuming that the winner is minimising and the loser is maximising DTR. DTZ ≤ DTR. No DTR EGTs have been created to date ^[3] .

DTZR

Depth to Zeroing move, given that DTR is being minimaxed: If DTR = draw, then DTZR = draw. Otherwise, DTZR indicates the theoretical value, and the number of winner's moves to the zeroing of the ply count, given that the two sides are minimaxing first DTR and then DTZR. DTZR would be used in conjunction with DTR to progress a deep win in the context of a k-move rule because DTR can remain unchanged over a sequence of moves. DTZR ≤ DTR, but it is not always the case that DTZ ≤ DTZR as may seem intuitively obvious: the loser may zero the move-count to maximise DTR.

Summary

To summarize, each tablebase metric has its strengths and weaknesses. Nalimov's DTM EGTs are widely and commercially available, and used by many chess engines or chess GUIs. DTC EGTs are more easily computed and stored as they involve smaller depth-ranges and are more compressible. For endgames with five or fewer pieces on the board, KNNKP comes closest to requiring more than one byte per position to store DTM: maxDTM = 115 (1-0) and 73 (0-1), necessitating 190 values in the EGT. For KPPKP, maxDTM = 127 (1-0) but only 42 (0-1). But for certain 6-piece endgames, one byte is not enough: maxDTM = 262 for a KRNKNN position. The DTC and DTZ metrics increasingly postpone the need for two bytes per position in an uncompressed EGT, but KRBNKQN has maxDTC = maxDTZ = 517 (0-1), both wtm and btm.
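The byte counts quoted above follow from how many distinct codes an uncompressed entry must distinguish. A small sketch of that arithmetic, where `bytes_per_position` is a hypothetical helper and the only assumption is that each entry occupies a whole number of bytes:

```python
def bytes_per_position(num_values):
    """Smallest whole number of bytes able to encode num_values distinct codes."""
    bits = (num_values - 1).bit_length()
    return max(1, (bits + 7) // 8)

print(bytes_per_position(190))  # the 190 values quoted for KNNKP fit in one byte
print(bytes_per_position(263))  # depths 0..262 (KRNKNN) need two bytes
```

One byte covers at most 256 codes, which is why a maximum depth of 262 forces a second byte while 190 values still fit in one.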

In extremis, the unmoderated use of DTM, DTC or DTZ EGTs will let slip a winnable position in the context of the 50-move rule ... hence the marginal need for the DTR and DTZR metrics, coupled with the ability to recompute DTR/DTZR EGTs in the context of a limited move-budget.

Bitbases

See main article: Endgame Bitbases

Inside the search much denser tables with value {won, draw, loss, invalid} or boolean {win, not_win} information are sufficient with respect to bounds. Denser tables are more efficiently used, given that memory and cache are limited resources. Boolean Win/Not_Win Bitbases require only one bit per position (and if necessary, further enquiry if draw is to be distinguished from loss with respect to bounds). Value Bitbases require two bits per position.
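The packing of one or two bits per position can be sketched like this. The class name `BitBase`, its methods, and the numeric codes for won/draw/loss/invalid are assumptions chosen for the example; real bitbase formats define their own layouts.

```python
# Hypothetical 2-bit value codes for this sketch.
WON, DRAW, LOSS, INVALID = 0, 1, 2, 3

class BitBase:
    """Packs 1-bit or 2-bit entries densely into a bytearray."""

    def __init__(self, num_positions, bits_per_entry):
        assert bits_per_entry in (1, 2)
        self.bits = bits_per_entry
        self.per_byte = 8 // bits_per_entry          # 8 or 4 entries per byte
        self.mask = (1 << bits_per_entry) - 1
        size = (num_positions + self.per_byte - 1) // self.per_byte
        self.data = bytearray(size)

    def store(self, index, value):
        byte, slot = divmod(index, self.per_byte)
        shift = slot * self.bits
        cleared = self.data[byte] & ~(self.mask << shift) & 0xFF
        self.data[byte] = cleared | ((value & self.mask) << shift)

    def load(self, index):
        byte, slot = divmod(index, self.per_byte)
        return (self.data[byte] >> (slot * self.bits)) & self.mask

wdl = BitBase(10, 2)        # value bitbase: two bits per position
wdl.store(5, LOSS)
win = BitBase(10, 1)        # boolean bitbase: one bit per position
win.store(9, 1)
print(wdl.load(5), len(wdl.data), win.load(9), len(win.data))
```

Ten positions occupy three bytes at two bits each and two bytes at one bit each, compared with ten bytes for a one-byte-per-position table.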

Indexing

Every position is assigned a unique index that specifies the location of the information stored about it. The main purpose of indexing is to locate and read information (from databases), such as the distance to mate, for a given position based on its index.

There are two main aims when generating the index number. First, the simpler the algorithm that generates the index from a given position, the faster both probing and generating the tablebase will be. Second, the range of index numbers should be relatively small, to keep the file as small as possible.

The most straightforward way is to use 6 bits (64 squares, numbered 0-63) for every piece on the board:

index = position_white_King + position_black_King * 64

for each piece {
    index = index * 64 + position
}
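A runnable version of the pseudocode above might look like the following sketch. It assumes squares are numbered 0-63 with a1 = 0, b1 = 1, ... h8 = 63, and that the remaining pieces are passed in a fixed order; the function names are made up for the example.

```python
def naive_index(white_king, black_king, other_pieces):
    """Map a position to an index using 6 bits (one of 64 squares) per piece."""
    index = white_king + black_king * 64
    for square in other_pieces:
        index = index * 64 + square
    return index

def naive_index_space(num_other_pieces):
    """Number of distinct indexes this scheme can produce."""
    return 64 ** (2 + num_other_pieces)

# KRK example: white King e1 (4), black King e8 (60), white Rook a1 (0).
print(naive_index(4, 60, [0]))   # 246016
print(naive_index_space(1))      # 262144
```

Many of the 262144 slots correspond to impossible or redundant positions, which is what the reductions discussed next try to eliminate.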

Almost all work in this field aims to reduce the index space and the data size (e.g., by compression). The above indexing can be optimised in several ways.

Remove redundant symmetrical indexes:

A chess board can be rotated and mirrored (if pawns are on the board, only a reflection about the vertical axis, i.e. left-right mirroring, is possible).
For pieces of the same type, arrangements that differ only by swapping identical pieces can be omitted.

Remove illegal indexes:

Pieces may not stand on the same square.
The side not to move may not be in check.
The two Kings may not stand on adjacent squares (the distance between them must be greater than one).
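Some of these reductions are easy to express in code. The sketch below assumes squares numbered 0-63 with a1 = 0 and h8 = 63, and it covers only the overlap test, the King-adjacency test, and left-right mirroring (the one symmetry that remains valid with pawns on the board). The check test is left out because it needs attack generation. All function names are hypothetical.

```python
def file_of(square):
    return square % 8

def rank_of(square):
    return square // 8

def has_overlap(squares):
    """True if two pieces stand on the same square."""
    return len(set(squares)) < len(squares)

def kings_adjacent(white_king, black_king):
    """True if the Kings touch, i.e. their distance is not greater than one."""
    file_gap = abs(file_of(white_king) - file_of(black_king))
    rank_gap = abs(rank_of(white_king) - rank_of(black_king))
    return max(file_gap, rank_gap) <= 1

def mirror_left_right(square):
    """Reflect a square about the vertical axis (a-file <-> h-file)."""
    return square ^ 7

def canonical_with_pawns(squares):
    """Mirror the whole position so that the first piece (the white King
    in this sketch) ends up on files a-d."""
    if file_of(squares[0]) > 3:
        return [mirror_left_right(s) for s in squares]
    return list(squares)

print(kings_adjacent(4, 12), kings_adjacent(4, 60))  # True False
print(canonical_with_pawns([7, 63, 15]))             # [0, 56, 8]
```

With the white King restricted to files a-d, only half of its squares need index slots; pawnless endgames can restrict it further using the other symmetries.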

Retrieving

Probe

When a chess program (e.g., a chess engine or a chess GUI) works with an EGTB, it needs to retrieve data (integers) for given chess positions. This process is called probing and has the following steps:

Convert the given chess position into an index via the indexing algorithm
Access the data (typically organised as arrays of integers) using the index to retrieve the integer for that position

If the data of that endgame is not yet in memory, the probe usually has to do some extra work:

From the data index, calculate the block index
Read that block from storage into memory
Uncompress the block (if it is compressed)
Convert the endgame index into an offset within the block and retrieve the needed data
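The block-based steps can be sketched as follows. This is a simplified illustration: it uses zlib and a fixed block size of 4096 one-byte entries, and a Python list stands in for the file on storage. Real formats use their own compression schemes and block layouts; the names here are hypothetical.

```python
import zlib

BLOCK_SIZE = 4096  # entries per block; an arbitrary choice for this sketch

def compress_table(values, block_size=BLOCK_SIZE):
    """Split a table of one-byte entries into independently compressed blocks."""
    return [zlib.compress(values[start:start + block_size])
            for start in range(0, len(values), block_size)]

def probe_compressed(blocks, index, block_size=BLOCK_SIZE):
    """Probe one entry by decompressing only the block that contains it."""
    block_index, offset = divmod(index, block_size)   # which block, where in it
    block = zlib.decompress(blocks[block_index])      # read + uncompress
    return block[offset]                              # retrieve the needed data

table = bytes(i % 251 for i in range(10000))          # dummy data
blocks = compress_table(table)
print(len(blocks), probe_compressed(blocks, 5000))    # 3 231
```

Smaller blocks make each probe cheaper to decompress but usually compress less well, so the block size is a trade-off.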

Search

After probing, the chess engine has an integer for the queried position. Depending on the metric, the number may be a distance to mate or a distance to conversion, and it tells whether the position is a draw, a win, or a loss for the querying side. Sometimes that state (draw/win/loss) is enough for the search. Sometimes, however, the engine also needs to know the best move. The solution is to generate all legal moves from that position, make them one by one, and probe the EGTB for the value of each resulting position. Finally, the values of all child positions are compared to find the best move.
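The comparison of child positions can be sketched as below. The sketch assumes a DTM-like metric and that each child has already been probed, giving a tuple (move, wdl, depth) from the point of view of the opponent, who is to move after our move: wdl is +1 if the opponent wins, 0 for a draw, -1 if the opponent loses. Move generation and the probing itself are outside the sketch, and the move names are made up.

```python
def best_move(children):
    """Pick the best move from probed children.

    children: list of (move, wdl, depth), where wdl and depth describe the
    position after the move from the opponent's point of view.
    """
    def score(child):
        _, wdl, depth = child
        if wdl == -1:            # opponent is lost: we win, prefer the fastest
            return (2, -depth)
        if wdl == 0:             # draw
            return (1, 0)
        return (0, depth)        # opponent wins: we lose, prefer the slowest
    return max(children, key=score)[0]

children = [("Ka2", 0, 0), ("Rb8", -1, 5), ("Rb7", -1, 3), ("Kb1", 1, 4)]
print(best_move(children))       # Rb7
```

The winner minimizes the depth and the loser maximizes it, which matches the minimizing/maximizing convention used in the metric definitions.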

Speed

Typically, probing requires some computation, and it may have to access storage and decompress data, which is usually slow. An engine that probes EGTBs during search may therefore slow the whole search down. Sometimes the benefit of probing EGTBs is not enough to make up for the slower search. That’s why probing should be planned and implemented carefully. EGTB files may have to be kept on fast storage and/or served through large system caches. The good point is that the engine can stop searching a branch once it has been resolved by an EGTB probe.

Brief Info

Brief information on some popular endgame tablebases:

Format	Metric	1st published	5 men	6 men	7 men
Thompson	DTC	1991	2.5 GiB (not completed)	-	-
Edwards	DTM	1994	56 GiB (estimated)	-	-
Nalimov	DTM	1998	7.1 GiB	1.2 TiB	-
Scorpio	WDL	2005	214 MiB	48.1 GiB	-
Gaviota	DTM	2008	6.5 GiB	-	-
Lomonosov	DTM	2012	-	-	140 TiB
Syzygy	WDL + DTZ50	2013 / 2018	939 MiB	150.2 GB	17 TB

Formats

Generators

7-man EGTBs

2005

In 2005, Yakov Konoval started to collaborate with Marc Bourzutschky on constructing 7-man EGTBs. Yakov wrote the generator, and Marc wrote a verification program and some utilities for extracting the data. In October 2005, Yakov Konoval and Marc Bourzutschky announced that a position in the KRRNkrr ending requires 290 moves to convert to a simpler winning endgame ^[4] . The old record was 243 moves from a position in a rook and knight versus two knights endgame, discovered by Lewis Stiller in 1991 ^[5] .

2006

In March 2006, the wizards of 7-men endgames, Marc Bourzutschky and Yakov Konoval, found a 330-move win in KQBNkqb ^[6] and in May 2006 a 517-move win in KQNkrbn ^[7] .

2012

In July 2012, Vladimir Makhnychev and Victor Zakharov from Moscow State University completed 4+3 DTM-tablebases (525 endings including KPPPKPP). The next set of 5+2 DTM-tablebases was completed during August 2012 ^[8] . The tablebases, generated on the Lomonosov Supercomputer, are named Lomonosov Tablebases.

2013

Convekta Ltd. announced the release of Lomonosov Tablebases. Until December 31, 2013, the tablebases will be available online through ChessOK products ^[9].

2018

Starting in March 2018, seven-piece Syzygy Bases were generated by Bojun Guo and finished in August 2018; the Syzygy Bases were originally created for up to six pieces by Ronald de Man in 2013 ^[10].

Selected Publications

1965 ...

Richard E. Bellman (1965). On the application of dynamic programming to the determination of optimal play in chess and checkers. Proceedings of the National Academy of Sciences of the United States of America. pdf

1970 ...

Thomas Ströhlein (1970). Untersuchungen über kombinatorische Spiele. Ph.D. Thesis, Technical University of Munich (German)
Edward Komissarchik, Aaron L. Futer (1974). Ob Analize Ferzevogo Endshpilya pri Pomoshchi EVM. (Analysis of a queen endgame using an IBM computer) Problemy Kybernetiki, Vol. 29, pp. 211-220. English translation by C. Posthoff, revised in ICCA Journal, Vol. 9, No. 4 (1986)

1975 ...

Mike Clarke (1977). A quantitative study of KPK. Advances in Computer Chess 1, pp. 108-118
A. G. Alexandrov, A. M. Baraev, Ya. Yu. Gol'fand, Edward Komissarchik, Aaron L. Futer (1977). Computer analysis of rook end game. Avtomatika i Telemekhanika, No. 8, 113–117
A. G. Alexandrov, Vladimir Arlazarov, A. M. Baraev, Ya. Yu. Gol'fand, V. N. Deza, T. P. Il’ina, Edward Komissarchik, Anatoly Uskov, I. A. Faradzhev, Aaron L. Futer (1977). Processing of large files of information on the example of the analysis of the rook’s end game. Programming and Computer Software, No. 3
Thomas Ströhlein, Ludwig Zagler (1977). Analyzing games by Boolean matrix iteration. Discrete Mathematics, Vol. 19, No. 2
Thomas Ströhlein, Ludwig Zagler (1978). Ergebnisse einer vollständigen Analyse von Schachendspielen: König und Turm gegen König, König und Turm gegen König und Läufer. Report, Institut für Informatik, Technical University of Munich (German)
Max Bramer (1978). Computer-Generated Databases for the Endgame in Chess. Technical Report. Milton Keynes, Open University Faculty of Mathematics.
Aaron L. Futer (1978). Programming endgames with few pieces. Soviet Physics. Doklady, No. 23
Vladimir Arlazarov, Aaron L. Futer (1979). Computer Analysis of a Rook End-Game. Machine Intelligence 9 (eds. Jean Hayes Michie, Donald Michie and L.I. Mikulich), pp. 361-371. Ellis Horwood, Chichester. Reprinted in Computer Chess Compendium
Max Bramer, Mike Clarke (1979). A Model for the Representation of Pattern-Knowledge for the Endgame in Chess. International Journal of Man-Machine Studies, Vol. 11, No.5

1985 ...

Jaap van den Herik, Bob Herschberg (1985). The Construction of an Omniscient Endgame Data Base. ICCA Journal, Vol. 8, No. 2
John Roycroft (1985). Chess-Endgame Data-Base 'Oracles': Necessary and Desirable Features. ICCA Journal, Vol. 8, No. 2
Harry Nefkens (1985). Constructing Data Bases to Fit a Microcomputer. ICCA Journal, Vol. 8, No. 4

1986

Jaap van den Herik, Bob Herschberg (1986). A Data Base on Data Bases. ICCA Journal, Vol. 9, No. 1
John Roycroft, Ken Thompson (1986). Queen and Pawn on a2 against Queen. Chess Endgame Consultants and Publishers, London
John Roycroft, Ken Thompson (1986). Queen and Pawn on a6 against Queen. Chess Endgame Consultants and Publishers, London
John Roycroft, Ken Thompson (1986). Queen and Pawn on b7 against Queen. Chess Endgame Consultants and Publishers, London
Jaap van den Herik (1986). Roycroft's 5-Man Chess Endgame Series. ICCA Journal, Vol. 9, No. 3
Ken Thompson (1986). Retrograde Analysis of Certain Endgames. ICCA Journal, Vol. 9, No. 3, pdf
Edward Komissarchik, Aaron L. Futer (1986). Computer Analysis of a Queen Endgame. ICCA Journal, Vol. 9, No. 4 ^[11]
Ken Thompson (1986). An Example of QPvQ. ICCA Journal, Vol. 9, No. 4
Hans Zellner (1986). Compressing Databases Down to Micro Size. ICCA Journal, Vol. 9, No. 4

1987

Hans Zellner (1987). KBBK Squeezed into a Micro. ICCA Journal, Vol. 10, No. 1
Jaap van den Herik, Bob Herschberg (1987). The KBBKN Statistics: New Data from Ken Thompson. ICCA Journal, Vol. 10, No. 1
Bob Herschberg, Jaap van den Herik (1987). More Truth on KBBK Database Results. ICCA Journal, Vol. 10, No. 2
Hans Zellner, Jaap van den Herik, Bob Herschberg (1987). Corrections and Substantiations to KBNK. ICCA Journal, Vol. 10, No. 3
Sito Dekker, Jaap van den Herik, Bob Herschberg (1987). Complexity Starts at Five. ICCA Journal, Vol. 10, No. 3
Lars Rasmussen (1987). Correcting grandmasters' Analyses in Elementary Endgames. ICCA Journal, Vol. 10, No. 4
Jaap van den Herik, Bob Herschberg, Najib Nakad (1987). A Six-Men-Endgame Database: KRP(a2)KbBP(a3). ICCA Journal, Vol. 10, No. 4, pdf
Sito Dekker, Jaap van den Herik, Bob Herschberg (1987). Perfect Knowledge and Beyond. Report 37-87, Delft University of Technology, Delft.
John Roycroft (1987). Expert Against Oracle. Machine Intelligence 11

1988

Jaap van den Herik, Bob Herschberg (1988). Computer Checks on Human Analyses of the KRKB Endgame. ICCA Journal, Vol. 11, No. 1
Lars Rasmussen (1988). Ultimates in KQKR and KRKN. ICCA Journal, Vol. 11, No. 1
Lewis Stiller (1988). Massively Parallel Retrograde Endgame Analysis. BUCS Tech. Report #88-014, Boston University, Computer Science Department.
Jaap van den Herik, Bob Herschberg, Najib Nakad (1988). A Reply to R. Sattler's Remarks on the KRP(a2)KbBP(a3) Database. ICCA Journal, Vol. 11, No. 2/3, pdf
Lars Rasmussen (1988). A Database for KRKP. ICCA Journal, Vol. 11, No. 4

1989

Christopher A. Marris (1989). Compressing a Chess-Endgame Database. ICCA Journal, Vol. 12, No. 1
David Forthoffer, Lars Rasmussen, Sito Dekker (1989). A Correction to some KRKB-Database Results. ICCA Journal, Vol. 12, No. 1
Edmar Mednis (1989). Rook and Bishop versus Rook: The Controversial Endgame. ICCA Journal, Vol. 12, No. 1
Lewis Stiller (1989). Parallel Analysis of Certain Endgames. ICCA Journal, Vol. 12, No. 2
Hans Zellner (1989). The KPK Database Revisited. ICCA Journal, Vol. 12, No. 2
Sito Dekker, Jaap van den Herik, Bob Herschberg (1989). Perfect Knowledge and Beyond. Advances in Computer Chess 5
Bob Herschberg, Jaap van den Herik, Patrick Schoo (1989). Verifying and Codifying Strategies in the KNNKP(h) Endgame. ICCA Journal, Vol. 12, No. 3, pdf
Hans Zellner (1989). KBBKN on a Micro. ICCA Journal, Vol. 12, No. 3
John Roycroft (1989). The Computer is at it again - Interfering with the Endgame! British Chess Magazine, Vol. 109, No. 10

1990 ...

Bob Herschberg, Jaap van den Herik, Patrick Schoo (1990). Verifying and Codifying Strategies in a Chess Endgame. Computers, Chess, and Cognition, pp. 183-196
John Roycroft (1990). How to Treat Endgame Databases. ICCA Journal, Vol. 13, No. 2
Ken Thompson (1990). KQPKQ and KRPKR Endings. ICCA Journal, Vol. 13, No. 4
John Roycroft (1990). Identifying Three Types of Zugzwang. ICCA Journal, Vol. 13, No. 4 » Zugzwang

1991

Ken Thompson (1991). Chess Endgames Vol. 1. ICCA Journal, Vol. 14, No. 1
Ken Thompson (1991). New Results for KNPKB and KNPKN Endgames. ICCA Journal, Vol. 14, No. 1
Edmar Mednis (1991). Database Results for KQKRN and KQKRB annotated. ICCA Journal, Vol. 14, No. 2
John Roycroft (1991). A Postscript to the Computer's Involvement. British Chess Magazine, Vol. 111, No. 2
Lewis Stiller (1991). Some Results from a Massively Parallel Retrograde Analysis. ICCA Journal, Vol. 14, No. 3
John Roycroft (1991). A Side-Effect of new Database Knowledge. ICCA Journal, Vol. 14, No. 3
John Roycroft (1991). A Use for Endgame Databases? ICCA Journal, Vol. 14, No. 4
John Roycroft, Don Beal (1991). To Make Dumb Endgame Databases Speak. Advances in Computer Chess 6

1992

Lewis Stiller (1992). KQNKRR. ICCA Journal, Vol. 15, No. 1
John Nunn (1992). Perfect Prose. ICCA Journal, Vol. 15, No. 2
Lars Rasmussen (1992). Queen versus Rook and Pawn. ICCA Journal, Vol. 15, No. 2

1993

Burton Wendroff, Tony Warnock, Lewis Stiller, Dean Mayer, Ralph Brickner (1993). Bits and pieces: constructing chess endgame databases on parallel and vector architectures. Appl. Num. Math. 12, pp. 285-295, zipped ps
Christian Posthoff, Michael Schlosser, Jens Zeidler (1993). Search vs. Knowledge? - Search and Knowledge! In: Proc. 3rd KADS Meeting, Munich, March 8-9 1993, Siemens AG, Corporate Research and Development, 305-326, 1993.
John Nunn (1993). Extracting Information from Endgame Databases. ICCA Journal, Vol. 16, No. 4

1994

Ingo Althöfer, Bernhard Walter (1994). Weak Zugzwang: Statistics on some Chess Endgames. ICCA Journal, Vol. 17, No. 2
John Nunn (1994). More, and More Perfect Prose. ICCA Journal, Vol. 17, No. 2
John Nunn (1994). Extracting Information from Endgame Databases. Advances in Computer Chess 7
Christian Posthoff, Michael Schlosser, Rainer Staudte, Jens Zeidler (1994). Transformations of Knowledge. Advances in Computer Chess 7
Grit Lachmann (1994). Verarbeitung von Wissen aus Endspieldatenbanken. Master's thesis, Chemnitz University of Technology (German)

1995 ...

Steven Edwards and the Editorial Board (1995). An Examination of the Endgame KBNKN. ICCA Journal, Vol. 18, No. 3
Henri Bal, Victor Allis (1995). Parallel Retrograde Analysis on a Distributed System. Supercomputing ’95, San Diego, CA.

1996

Dietmar Lippold (1996). Legality of Positions of Simple Chess Endgames. zipped pdf ^[12]
Ken Thompson (1996). 6-Piece Endgames. ICCA Journal, Vol. 19, No. 4
Lewis Stiller (1996). Multilinear Algebra and Chess Endgames. Games of No Chance edited by Richard J. Nowakowski, pdf

1997

Dietmar Lippold (1997). The Legitimacy of Positions in Endgame Databases. ICCA Journal, Vol. 20, No. 1
Ken Thompson (1997). 6-Piece Endgames. Advances in Computer Chess 8
Johannes Fürnkranz (1997). Knowledge Discovery in Chess Databases: A Research Proposal. Technical Report OEFAI-TR-97-33, Austrian Research Institute for Artificial Intelligence, zipped ps, pdf ^[13]

1999

Ernst A. Heinz (1999). Endgame Databases and Efficient Index Schemes for Chess. ICCA Journal, Vol. 22, No. 1, ps
Ernst A. Heinz (1999). Knowledgeable Encoding and Querying of Endgame Databases. ICCA Journal, Vol. 22, No. 2, ps
Christoph Wirth, Jürg Nievergelt (1999). Exhaustive and Heuristic Retrograde Analysis of the KPPKP Endgame. ICCA Journal, Vol. 22, No. 2 ^[14]
Eugene Nalimov, Christoph Wirth, Guy Haworth (1999). KQQKQQ and the Kasparov-World Game. ICCA Journal, Vol. 22, No. 4
Ulrich Thiemonds (1999). Ein regelbasiertes Spielprogramm für Schachendspiele. University of Bonn, Diplom thesis, pdf (German)
Chrilly Donninger (1999). Der Fortschritt ist nicht aufzuhalten - Chrilly Donninger über Endspieldatenbanken. CSS 6/99, pdf (German)