Performance Optimization PDF - Nvidia NVIDIA 2011. Requirements for Maximum Performance. • Have sufficient parallelism. – At least a few 1,000 threads per function. • Coalesced memory access.