Changes

Jump to: navigation, search

GPU

No change in size, 12:05, 23 October 2021
m
Integer Instruction Throughput
==Integer Instruction Throughput==
* INT32
: The 32 -bit integer performance can be architecture and operation depended less than 32 -bit FLOP or 24 -bit integer performance.
* INT64
: In general GPU [https://en.wikipedia.org/wiki/Processor_register registers] and Vector-[https://en.wikipedia.org/wiki/Arithmetic_logic_unit ALUs] are 32 -bit wide and have to emulate 64 -bit integer operations.
* INT8
: Some architectures offer higher throughput with lower precision. They quadruple the INT8 or octuple the INT4 throughput.
422
edits

Navigation menu