Algebraic Chess Notation

Home * Chess * Game * Notation * Algebraic Chess Notation

Algebraic Chess Notation is a chess notation to record and represent moves of a human readable game notation. It is based on a system developed by Philipp Stamma to notate the target square by algebraic coordinates, and various forms to represent the origin square of the move, either by language dependent piece initials and/or file, rank- or square notation of the from- or origin square. Promotions must be disambiguated by including the promoted piece type, and castling usually has their own symbol strings ('O-O', 'O-O-O'). Some format variations also specify a captured piece, if any, for the purpose to make the notation reversible, and to go backward as easy as forward while re-playing a game.

=Sequence of Moves= Chess programs need to convert the list of encoded moves into a sequence of printable strings, or to render them inside a notation window. The move number of the game, starting by '1.' from the initial position, is prefix of the white halfmove, followed by a the black reply, often in a second column of a grid view. If a move notation starts with Black, also after embedded comments or annotations, a trailing ellipsis is used instead of a single dot. ::=  ::= [ ...]{'.' | '...'}

Modern chess programs, and their graphical user interface often support various modes to represent all kinds of game and move notations. Regarding input of moves from a user interface, or reading moves from text files, programs needs to parse strings accordantly. =Pure coordinate notation= Considering the common From-To move encoding inside a chess program, pure coordinate notation is a straight-forward chess notation to use only algebraic From- and To-coordinates. This notation omits any machine redundant piece letters for the moving and/or capturing pieces, and only has to specify the promoted piece as trailing letter in case of promotions.

::= 'q'|'r'|'b'|'n'

Pure algebraic coordinate notation was often used in early microcomputer chess programs, sometimes even with capital file letters. Some programs used a hyphen or 'x' delimiter between the coordinate substrings to distinguish quiet moves from captures in move outputs. While not common in human chess notation, pure coordinate notation has the advantage to avoid any ambiguity and is further easy convertible from a move list without the need of retrieving information from internal board representations, (i.e. which pieces occupy which squares). Even castling was often written as E1G1 or E1C1 instead of 'O-O' or 'O-O-O'.

XBoard
Per default, the Chess Engine Communication Protocol requires pure algebraic coordinate notation without from-to delimiters as input and output move format.

UCI
The UCI communication protocol requires pure algebraic coordinate notation without from-to delimiters as well. Inside its specification the UCI move format is called "long algebraic notation", which might be considered slightly incorrect concerning leading piece character and dash or hyphen between from- and to coordinates, see LAN definition below.

ICCF numeric notation
The language independent ICCF numeric notation as used in Correspondence chess is a numerical pure coordinate notation, giving coordinates by two digits (File '1'-'8) rather than letters ('a'-'h') for the file. In case of promotions a fifth trailing digit is used: '1' for queen, '2' for rook, '3' for bishop and '4' for knight.

Smith notation
The Smith notation designed by Warren D. Smith, as used in the Internet Chess Club chess server, encodes moves with from-square, to-square, and to make it reversible, so it is as easy to go backwards in a game as forwards, what piece was captured, if any:

 ::= [ ][ ] ::= 'P' | 'N' | 'B' | 'R' | 'Q' =Long Algebraic Notation (LAN)= Beside the already sufficient and unambiguous pure origin- and target-coordinates, LAN uses a leading redundant, national dependent uppercase piece letter or figurine piece symbol of the moving piece usually other than a pawn, to represent the move, which makes it more human readable and compatible with SAN and desciptive notation. However, for chess programs using pure from-to move encoding, converting the move list to LAN already requires a board representation in sync with the leading moves already played, to lookup the piece on the board.

 ::=  ['-'|'x']  ::= ['-'|'x'] [ ]  ::= 'N' | 'B' | 'R' | 'Q' | 'K' English symbols Knight, Bishop, Rook, Queen and King.

=Standard Algebraic Notation (SAN)= Standard algebraic notation (SAN) is the official notation of the FIDE which must be used in all recognized international competition involving human players. Concerning computer chess, SAN is a representation standard for chess moves inside the Portable Game Notation standard using the ASCII Latin alphabet, and should be supported as default notation by all modern chess programs and their user interfaces.

While otherwise similar to LAN, SAN suppresses redundant information concerning the from-square, while keeping the descriptive letter or symbol of pieces other than a pawn. SAN further suppresses the from-to hyphen, and in some variations also the capture indicator 'x' (or ':').   ::= [ | | ]['x']  ::= [ ] 'x' [ ]     ::= [ ]

Ambiguities
If the piece is sufficient to unambiguously determine the origin square, the whole from square is omitted. Otherwise, if two (or more) pieces of the same kind can move to the same square, the piece's initial is followed by (in descending order of preference)
 * 1) file of departure if different
 * 2) rank of departure if the files are the same but the ranks differ
 * 3) the complete origin square coordinate otherwise

Captures
Captures are denoted by the lower case letter "x" immediately prior to the destination square. Pawn captures with the omitted piece symbol, include the file letter of the originating square of the capturing pawn prior to the "x" character, even if not required for unambiguousness. Some SAN variations in printed media even omit the target rank if unambiguous, like dxe, which might not be accepted as input format.

En passant
The PGN-Standard does not require En passant captures have any special notation, and is written as if the captured pawn were on the capturing pawn's destination square.FIDE states the redundant move suffix "e.p." optional (after 1 July 2014).

In the case of an ‘en passant’ capture, ‘e.p.’ may be appended to the notation.

Pawn promotion
A pawn promotion requires the information about the chosen piece, appended as trailing Piece letter behind the target square. The SAN PGN-Standard requires an equal sign ('=') immediately following the destination square.

Castling
Castling is indicated by the special notations, "O-O" for kingside castling and "O-O-O" for queenside castling. While the FIDE handbook uses the digit zero, the SAN PGN-Standard requires the capital letter 'O' for its export format.

Converting Moves
Due to the most compact representation, considering ambiguities concerning the origin square, converting moves with pure from- and to-squares to SAN requires not only an underlying board representation to determine piece initials, but also legal move generation for a subset of moves to the destination square. Pseudo legal, but illegal moves for instance with a Pinned piece must not be considered in ambiguous issues in an export format.

XBoard 2
With the Chess Engine Communication Protocol version 2, one can use the feature command to select SAN as move format for both input and output. =Figurine Algebraic Notation (FAN)= Figurine notation uses miniature piece icons instead of single letter piece abbreviations. This enables the moves to be read independent of the language. The Unicode Miscellaneous Symbols set includes all of the symbols necessary for FAN if appropriate fonts were installed on the computer. Chess GUIs may alternatively use Scalable Vector Graphics or other scalable 2D Vector Graphics to render the symbols inside their views.

LaTeX
=Reading Chess= In an attempt of Computer vision, Henry S. Baird and Ken Thompson used optical character recognition along with various heuristics and applying the rules of chess, to "read" the figurine notation from Informant's Encyclopaedia of Chess Openings with high accuracy and a success rate of 99.995% on approximately one million characters (2850 games, 945 pages).

=See also=
 * CPW-Engine_algebraic
 * Encoding Moves
 * Entering Moves
 * Portable Game Notation (PGN)

=Publications=
 * Henry S. Baird, Ken Thompson (1990). Reading Chess. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 12, No. 6, pdf
 * Michel Goossens, Frank Mittelbach, Sebastian Rahtz, Denis Roegel, Herbert Voß (2007). The LATEXGraphics Companion. Second Edition, Addison-Wesley, ISBN-13: 978-0-321-50892-8, sample pdf, 10.1 Chess, 10.2 Xiangqi—Chinese Chess

=Forum Posts=
 * correct SAN notation ? by Andreas Herrmann, CCC, December 19, 2002
 * Crafty (under winboard) question by Andrei Fortuna, CCC, December 21, 2003 » WinBoard
 * SAN Move Disambiguation -- looking for test positition by humble programmer, CCC, April 12, 2010 » PGN
 * Unicode values for chessmen by Steven Edwards, CCC, March 07, 2011
 * Determining From squares by David Whitten, CCC, December 04, 2011
 * What world have I been in - castling notation? by Lonnie Cook, CCC, September 13, 2014
 * SAN test position by Matthew Lai, CCC, September 11, 2016 » SAN
 * WAC using coordinate notation? by Erin Dame, CCC, September 17, 2017 » Pure coordinate notation, Win at Chess

=External Links=
 * Algebraic chess notation from Wikipedia
 * Desciptive notation from Wikipedia
 * Punctuation (chess) from Wikipedia
 * Chess symbols in Unicode from Wikipedia
 * Standard: Portable Game Notation Specification and Implementation Guide by Steven Edwards - 8.2.3: Movetext SAN (Standard Algebraic Notation)
 * SAN Kit: Implemented Standards for Chess Move Notation by Steven Edwards
 * FIDE handbook, Laws of Chess - Appendix C. Algebraic notation
 * Smith notation (Wayback Machine)

=References= Up one Level