Changes

Talk:GPU

946 bytes added, 21:29, 27 April 2021

→‎CPW GPU article: new section

~~Heyho, just a minor thing, the notation of Nvidia should be unified.~~ == AMD architectures ==

~~The official name is Nvidia Corporation, on their webpage they refer as NVIDIA, and in their logo styling nVIDIA, formerly nVidia.~~My own conclusions are:

~~Bests~~* TeraScale has VLIW design.* GCN has 16 wide SIMD,executing a Wavefront of 64 threads over 4 cycles.~~Srdja~~* RDNA has 32 wide SIMD, executing a Wavefront:32 over 1 cycle and Wavefront:64 over two cycles.

~~== SIMT ==~~ ~~SIMT is not only about running multiple threads in a Warp resp. Wavefront~~[[User:Smatovic|Smatovic]] ([[User talk:Smatovic|talk]]) 10:16, ~~it is more about running multiple waves of Warps resp. Wavefronts on the same SIMD unit to hide memory latencies.~~22 April 2021 (CEST)

== Nvidia architectures ==

Nevertheless, my own conclusions are:

* Tesla has 8 wide SIMD, executing a Warp of 32 threads over 4 cycles. * Fermi has 16 wide SIMD, executing a Warp of 32 threads over 2 cycles. * Kepler is somehow odd, not sure how the compute units are partitioned. * Maxwell and Pascal have 32 wide SIMD, executing a Warp of 32 threads over 1 cycle. * Volta and Turing seem to have 16 wide FPU SIMDs, but my own experiments show 32 wide VALU. [[User:Smatovic|Smatovic]] ([[User talk:Smatovic|talk]]) 10:17, 22 April 2021 (CEST) == SIMD + Scalar Unit == It seems every SIMD unit has one scalar unit on GPU architectures, executing things like branch-conditions or special functions the SIMD ALUs are not capable of. [[User:Smatovic|Smatovic]] ([[User talk:Smatovic|talk]]) 20:21, 22 April 2021 (CEST) == embedded CPU controller ==

~~Fermi~~ It is not documented in the whitepapers, but it seems that every discrete GPU has ~~16 wide SIMD, executing a Warp of 32 threads over 2 cycles~~an embedded CPU controller (e.g. Nvidia Falcon) who (speculation) launches the kernels.

~~Kepler is somehow odd~~[[User:Smatovic|Smatovic]] ([[User talk:Smatovic|talk]]) 10:36, ~~not sure how the compute units are partitioned.~~22 April 2021 (CEST)

~~Maxwell and Pascal have 32 wide SIMD, executing a Warp of 32 threads over 1 cycle.~~== CPW GPU article ==

~~Volta~~ A suggestion of mine, keep this GPU article as an generalized overview of GPUs, with incremental updates for different frameworks and architectures. GPUs and ~~Turing seem to have 16 wide FPU SIMDs~~GPGPU is a moving target with different platforms offering new feature sets, ~~but my~~ better open own ~~experiments show 32 wide VALU~~articles for things like GPGPU, SIMT, CUDA, ROCm, oneAPI, Metal or simply link to Wikipedia containing the newest specs and infos.

~~According to AMD papers every SIMD unit has one scalar unit~~[[User:Smatovic|Smatovic]] ([[User talk:Smatovic|talk]]) 21:29, ~~Nvidia seems to have one SFU, special function unit, per SIMD.~~27 April 2021 (CEST)

Smatovic

422

edits

Changes

Talk:GPU

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools