Changes

AVX-512

438 bytes added, 10:27, 25 August 2020

no edit summary

Following VPTERNLOGQ intrinsics are declared, where the maskz version sets unmasked destination quad word elements to zero, while the mask version copies unmasked elements from s:

<pre>

__m256i _mm256_ternarylogic_epi64(__m256i a, __m256i b, __m256i c, int imm8);

__m256i _mm256_maskz_ternarylogic_epi64(__mmask8 k, __m256i a, __m256i b, __m256i c, int imm8);

__m256i _mm256_mask_ternarylogic_epi64(__m256i src, __mmask8 k, __m256i a, __m256i b, int imm8);

__m512i _mm512_ternarylogic_epi64(__m512i a, __m512i b, __m512i c, int imm8);

__m512i _mm512_maskz_ternarylogic_epi64( __mmask8 m, __m512i a, __m512i b, __m512i c, int imm8);

AVX-512CD has Vector [[BitScan#LeadingZeroCount|Leading Zero Count]] - VPLZCNTQ counts leading zeroes on a vector of eight bitboards in parallel <ref>[https://www.google.com/patents/US9372692 Patent US9372692 - Methods, apparatus, instructions, and logic to provide permute controls with leading zero count functionality - Google Patent Search]</ref> - using following intrinsics <ref>[https://hjlebbink.github.io/x86doc/html/VPLZCNTD_Q.html VPLZCNTD/Q—Count the Number of Leading Zero Bits for Packed Dword, Packed Qword Values]</ref>, where the maskz version sets unmasked destination elements to zero, while the mask version copies unmasked elements from s:

<pre>

__m256i _mm256_lzcnt_epi64(__m256i a);

__m256i _mm256_maskz_lzcnt_epi64(__mmask8 k, __m256i a);

__m256i _mm256_mask_lzcnt_epi64(__m256i src, __mmask8 k, __m256i a);

__m512i _mm512_lzcnt_epi64(__m512i a);

__m512i _mm512_maskz_lzcnt_epi64(__mmask8 mk, __m512i a);__m512i _mm512_mask_lzcnt_epi64(__m512i ssrc, __mmask8 mk, __m512i a);

</pre>

GerdIsenberg

Bureaucrats, Administrators

25,161

edits

Changes

AVX-512

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools