Optimized MPI and compute in network implementation

Summary
Report the final optimizations on MPI and on in-network compute prototype.NB: ParTec, the linked third party of FZJ, will lead D4.7.