Slide 23 of 41
Notes:
Program for vectorization
Optimize for best cache use to maximize memory bandwidth
Floating performance scales with memory bandwidth