[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[bgl-discuss] allreduce performance bug in coprocessor mode



While running some benchmarks, we found out that MPI_Allreduce has a serious performance bug in coprocessor mode on our machine (driver 202). Allreduce for virtual node mode is 100x faster (for twice the number of processors). Allreduce for co-processor mode is slow.

The bug is fixed in the next IBM driver release, but in the mean time, realize allreduce will be very slow in coprocessor mode.

Also, shows our need for performance regression testing of MPI...

-Pete

- --------------------------------------------------------------------
To add or remove yourself from this mailing list, use the 'notifyme'
command on any BGL machine. To remove: notifyme -n, to add: notifyme -y.