Slide 26 of 41
Notes:
- First, let us revisit the Laplace equation solver. Recall that SSP performance was about 30% of the T90 due to limited cache utilization
- This is the loopmark listing with additional annotations
- 3 additional MSP optimizations
- Note that the reduction operation at line 25 of the listing is not streamed
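
For readers without the listing in front of them, here is a minimal sketch in C of the kind of loop nest these notes discuss. It is not the solver from the slides: the function name `jacobi_sweep`, the flat row-major layout, and the choice of a maximum-change convergence test are all assumptions made for illustration. The stencil update is independent per grid point, while the running maximum is a reduction across all iterations, which is the sort of operation a compiler may handle differently from the rest of the loop.

```c
#include <assert.h>
#include <math.h>
#include <stddef.h>

/*
 * One Jacobi sweep for the Laplace equation on an n x n grid stored in
 * row-major order (hypothetical layout, not taken from the slides).
 * Boundary values are left untouched; only interior points are updated.
 * Returns the largest absolute change over the interior points.
 */
double jacobi_sweep(const double *u, double *unew, size_t n)
{
    assert(n >= 3); /* need at least one interior point */

    double maxdiff = 0.0;
    for (size_t i = 1; i < n - 1; i++) {
        for (size_t j = 1; j < n - 1; j++) {
            /* Independent per-point work: average of the four neighbours. */
            unew[i * n + j] = 0.25 * (u[(i - 1) * n + j] + u[(i + 1) * n + j] +
                                      u[i * n + (j - 1)] + u[i * n + (j + 1)]);

            /* Reduction: every iteration feeds the same scalar. */
            double d = fabs(unew[i * n + j] - u[i * n + j]);
            if (d > maxdiff) {
                maxdiff = d;
            }
        }
    }
    return maxdiff;
}
```

A caller would typically swap `u` and `unew` after each sweep and stop once the returned value drops below a tolerance of its choosing.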