Data Distribution
km small (60x60), so replicate
Matrices pmul, utemp large (60xnels), so distribute in blocked column format
Vectors p, u large (neq), so distribute in blocks
(In test programs, nels = 8000, neq = 100840)
=> no communications in local matrix multiplications
But complicated communications for gather and scatter ñ wrote our own gather and scatter routines