

BMI1 is an x86-64 expansion of bit-manipulation instructions by Intel, introduced in conjunction with the Advanced Vector Extensions SIMD instruction set. With the Bulldozer microarchitecture, BMI1 as well as AVX are also available on AMD processors under the initial name BMI, along with their Trailing Bit Manipulation Instructions (TBM) [1]. Most BMI1 instructions (except LZCNT and TZCNT) employ the VEX prefix encoding to support up to three-operand syntax with non-destructive source operands on 32- or 64-bit general-purpose registers. BMI1 (ANDN, BEXTR, BLSI, BLSMSK, BLSR, TZCNT) requires bit 3 set in EBX of CPUID with EAX=07H, ECX=0H. LZCNT, not strictly a member of BMI1, requires bit 5 set in ECX of CPUID with EAX=80000001H. With the advent of AVX2, further bit-twiddling on general-purpose registers is proposed with BMI2.
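The two feature bits named above can be tested with plain masking once the CPUID register values are available. The following is a minimal sketch: the helper names `has_bmi1` and `has_lzcnt` are hypothetical, and reading the registers themselves is compiler-specific and deliberately left out.

```c
#include <stdint.h>

/* Hypothetical helper: ebx_leaf7 is assumed to hold the EBX value
   returned by CPUID with EAX=07H, ECX=0H. BMI1 is reported in bit 3. */
static int has_bmi1(uint32_t ebx_leaf7)
{
    return (int)((ebx_leaf7 >> 3) & 1u);
}

/* Hypothetical helper: ecx_ext1 is assumed to hold the ECX value
   returned by CPUID with EAX=80000001H. LZCNT is reported in bit 5. */
static int has_lzcnt(uint32_t ecx_ext1)
{
    return (int)((ecx_ext1 >> 5) & 1u);
}
```

Keeping the bit tests separate from the CPUID call makes them usable with whatever CPUID wrapper the compiler at hand provides.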


BMI1 instructions may speed up various bitboard operations, such as relative complement, and isolation, reset and separation of the least significant one bit. Each of them combines two instructions into one and reduces register pressure. Leading and trailing zero count are useful for scanning bits of possibly empty sets.


ANDN: Logical And Not, the relative complement. No intrinsic is given here, on the grounds that compilers handle the expression themselves.

dest ::= ~src1 & src2;
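As a small illustration of the relative complement on bitboards, the expression can be wrapped in plain C. This is only a sketch: `andn64` and `legal_targets` are hypothetical names, and whether a compiler actually emits ANDN for it depends on the compiler and its target flags.

```c
#include <stdint.h>

/* Relative complement: bits set in src2 but not in src1. */
static uint64_t andn64(uint64_t src1, uint64_t src2)
{
    return ~src1 & src2;
}

/* Hypothetical usage: attacked squares not occupied by own pieces. */
static uint64_t legal_targets(uint64_t own_pieces, uint64_t attacks)
{
    return andn64(own_pieces, attacks);
}
```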


BEXTR: Bit Field Extract. Handy for extracting some consecutive bits from a (rotated) occupancy bitboard, or, as the name suggests, from bit-field structures.

dest ::= (src >> start) & ((1 << len)-1);

unsigned __int32 _bextr_u32(unsigned __int32 src, unsigned __int32 start, unsigned __int32 len);
unsigned __int64 _bextr_u64(unsigned __int64 src, unsigned __int32 start, unsigned __int32 len);
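A portable C rendering of the formula needs some care, because shifting a 64-bit value by 64 or more is undefined in C. The sketch below uses the hypothetical name `bextr64` and returns zero for `start >= 64` or `len == 0`, and the unmasked shifted value for `len >= 64`. Those edge cases are assumptions meant to mirror the instruction and should be checked against the instruction reference before relying on them.

```c
#include <stdint.h>

/* Extract len consecutive bits of src, beginning at bit index start. */
static uint64_t bextr64(uint64_t src, unsigned start, unsigned len)
{
    uint64_t shifted;

    if (start >= 64 || len == 0)
        return 0;                 /* nothing to extract */
    shifted = src >> start;
    if (len >= 64)
        return shifted;           /* mask would cover all remaining bits */
    return shifted & ((UINT64_C(1) << len) - 1);
}
```

For instance, `bextr64(0xABCD, 4, 8)` picks the eight bits starting at bit 4 and yields `0xBC`.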

A shiftless sign extension of the extracted field might be applied by [2], with signbit = 1 << (len-1):

dest_signextended ::= (dest ^ signbit) - signbit
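The sign extension can be sketched in C as follows. The function name `sign_extend_field` is hypothetical, the field is assumed to be already extracted with all bits above `len` cleared, and `len` is assumed to be in the range 1 to 64. The final conversion to a signed type assumes two's complement.

```c
#include <stdint.h>

/* Sign-extend a len-bit field without shifting it up and back down.
   field must have all bits at index len and above cleared. */
static int64_t sign_extend_field(uint64_t field, unsigned len)
{
    uint64_t signbit = UINT64_C(1) << (len - 1);  /* top bit of the field */
    return (int64_t)((field ^ signbit) - signbit);
}
```

With `len` equal to 4, the field `0xF` becomes -1 and `0x7` stays 7.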


BLSI: Extract Lowest Set Isolated Bit, isolates the least significant one bit.

dest ::= src & -src;

unsigned __int64 _blsi_u64(unsigned __int64 src);
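A portable equivalent of the isolation formula, under the hypothetical name `blsi64`, might look like this. The negation is written as `0 - src` only to sidestep compiler warnings about unary minus on unsigned types.

```c
#include <stdint.h>

/* Isolate the least significant one bit; an empty set stays empty. */
static uint64_t blsi64(uint64_t src)
{
    return src & (0 - src);   /* same as src & -src in two's complement */
}
```

For example, `blsi64(0x58)` (binary 0101 1000) yields `0x08`.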


BLSMSK: Get Mask Up to Lowest Set Bit, sets the least significant one bit and all bits below it, and clears all upper bits.

dest ::= (src-1) ^ src;

unsigned __int64 _blsmsk_u64(unsigned __int64 src);
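The mask formula can be tried out in portable C. In the sketch below, `blsmsk64` is a hypothetical name. Note that, following the formula, an empty input wraps around and gives a mask with all 64 bits set.

```c
#include <stdint.h>

/* Mask covering the least significant one bit and everything below it. */
static uint64_t blsmsk64(uint64_t src)
{
    return (src - 1) ^ src;
}
```

For example, `blsmsk64(0x58)` (binary 0101 1000) yields `0x0F`.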


BLSR: Reset Lowest Set Bit, resets the least significant one bit.

dest ::= (src-1) & src;

unsigned __int64 _blsr_u64(unsigned __int64 src);
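One common use of the reset formula is counting set bits by clearing them one at a time, often attributed to Brian Kernighan. The sketch below is portable C with the hypothetical names `blsr64` and `popcount_by_reset`. It illustrates the reset step and is not meant as the fastest population count.

```c
#include <stdint.h>

/* Clear the least significant one bit; an empty set stays empty. */
static uint64_t blsr64(uint64_t src)
{
    return (src - 1) & src;
}

/* Count set bits: one loop iteration per one bit in the set. */
static unsigned popcount_by_reset(uint64_t bb)
{
    unsigned count = 0;
    while (bb != 0) {
        bb = blsr64(bb);
        count++;
    }
    return count;
}
```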


LZCNT: Count the Number of Leading Zero Bits, Leading Zero Count, initially from AMD's Advanced Bit Manipulation (ABM), introduced alongside SSE4a.

unsigned __int64 _lzcnt_u64(unsigned __int64 src);
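For reference, a slow but portable model of the leading zero count is sketched below under the hypothetical name `lzcnt64`. It returns 64 for an empty input, which is the behavior that makes the count usable on possibly empty sets.

```c
#include <stdint.h>

/* Number of zero bits above the most significant one bit; 64 if src is 0. */
static unsigned lzcnt64(uint64_t src)
{
    unsigned n = 0;

    if (src == 0)
        return 64;
    while ((src & (UINT64_C(1) << 63)) == 0) {
        src <<= 1;   /* move the next lower bit into the top position */
        n++;
    }
    return n;
}
```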


TZCNT: Count the Number of Trailing Zero Bits, Trailing Zero Count [3].

unsigned __int64 _tzcnt_u64(unsigned __int64 src);
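Trailing zero count combined with resetting the least significant one bit gives the usual loop for traversing a bitboard. The following portable sketch uses the hypothetical names `tzcnt64` and `serialize_squares`. The count is modeled with a simple loop and returns 64 for an empty input.

```c
#include <stdint.h>

/* Number of zero bits below the least significant one bit; 64 if src is 0. */
static unsigned tzcnt64(uint64_t src)
{
    unsigned n = 0;

    if (src == 0)
        return 64;
    while ((src & 1) == 0) {
        src >>= 1;
        n++;
    }
    return n;
}

/* Write the index of every set bit to squares[], lowest first.
   Returns the number of indices written (at most 64). */
static int serialize_squares(uint64_t bb, unsigned squares[64])
{
    int count = 0;

    while (bb != 0) {
        squares[count++] = tzcnt64(bb);
        bb &= bb - 1;   /* reset the least significant one bit */
    }
    return count;
}
```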
