422
edits
Changes
GPU
,→Floating Point Instruction Throughput
* FP32
: Consumer GPU performance is measured usually in single-precision (32 -bit) floating point FMA, fused-multiply-add, throughput.
* FP64
: Consumer GPUs have in general a lower ratio (FP32:FP64) for double-precision (64 -bit) floating point operations throughput than server brand GPUs.
* FP16
: Some GPGPU architectures offer half-precision (16 -bit) floating point operation throughput with an FP32:FP16 ratio of 1:2.
==Tensors==