The recent driver update has fixed the CO-mode scaling problem with POP 1.4.3, as we hoped it would. Which would suggest the Allreduce bug was the culprit. Ray > From: Pete Beckman <beckman@xxxxxxxxxxx> > Subject: [bgl-discuss] allreduce performance bug in coprocessor mode > Date: Thu, 27 Oct 2005 07:50:39 -0500 > > While running some benchmarks, we found out that MPI_Allreduce has a > serious performance bug in coprocessor mode on our machine (driver > 202). Allreduce for virtual node mode is 100x faster (for twice the > number of processors). Allreduce for co-processor mode is slow. > > The bug is fixed in the next IBM driver release, but in the mean > time, realize allreduce will be very slow in coprocessor mode.
Attachment:
bgl.png
Description: Binary data