Parallelism in Code
The element level approach results in inherent parallelism, over the elements, throughout the code.
PCG section dominates, and within this, the major computation is in the matrix multiplication:
utemp = MATMUL(km,pmul) ! mat. mult.
u(g) = u(g) + utemp ! scatter