Slide 25 of 41
Notes:
- In its simplest form, MSP code implements a loop-level parallelism similar to what may be accomplished using OpenMP or auto-tasking.
- For example, when a generic loop is encounted the iterations are partitioned over the individual SSPs that make up the MSP.
- The MSP has certain performance advantages relative to traditional auto-tasking (tighter hdw integration, low ovrhd)