Talk:GPU
Nevertheless, my own conclusions are:
Tesla has 8-wide SIMD, executing a warp of 32 threads over 4 cycles.
Fermi has 16-wide SIMD, executing a warp of 32 threads over 2 cycles.
Kepler is somewhat odd; I am not sure how the compute units are partitioned.
Maxwell and Pascal have 32-wide SIMD, executing a warp of 32 threads in 1 cycle.
Volta and Turing seem to have 16-wide FPU SIMDs, but my own experiments show a 32-wide VALU.
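
The cycle counts above follow from dividing the warp size by the SIMD width. A minimal sketch of that arithmetic, assuming one lane processes one thread per cycle; the widths are the conclusions from this comment, not vendor-confirmed figures, and the names `SIMD_WIDTH` and `cycles_per_warp` are made up for illustration:

```python
WARP_SIZE = 32  # threads per warp, as stated above

# SIMD widths as concluded in this comment (assumptions, not official specs).
# Kepler, Volta and Turing are left out because the comment is unsure about them.
SIMD_WIDTH = {
    "Tesla": 8,
    "Fermi": 16,
    "Maxwell": 32,
    "Pascal": 32,
}


def cycles_per_warp(simd_width, warp_size=WARP_SIZE):
    """Cycles needed to push one warp through a SIMD unit of the given width.

    Assumes each lane handles one thread per cycle; uses ceiling division
    so a width that does not divide the warp size still rounds up.
    """
    return -(-warp_size // simd_width)


for arch, width in SIMD_WIDTH.items():
    print(f"{arch}: {width}-wide SIMD, {cycles_per_warp(width)} cycle(s) per warp")
```

Under the same assumption, a 16-wide FPU SIMD on Volta or Turing would need 2 cycles per warp, and a 32-wide VALU would need 1.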