[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [bgl-discuss] machine locked



the primary issue is that this data isn't only being written to disk;
it is also processed by the queue manager. (this is the mechanism we
will be able to use to determine if the job ran properly or not) The
reason we cut off stdio over a particular limit was because it caused
performance problems for all users on chiba.

Hmmm, a standard pipe should never run out of resources unless something is buffering everything in the middle. In other words, if I do this:


mpirun <....> | check_output_with_queue_manager | cat > foo.out

and the check_output program does not save all the data, but maybe only keeps the last 5 lines of output, then it should not grow without bounds, and everything should be fine.

Maybe there is a different problem?

You should be able to save mpirun output forever, even with an interposing output checker, until all your output is finished.

-Pete

--
---
Pete Beckman                                Phone: 630-252-9020
Argonne National Laboratory                 Email: beckman@xxxxxxxxxxx
MCS-221
9700 South Cass Avenue
Argonne, Illinois 60439-4844, USA
PGP: 12C0 4357 1197 7BC7 8BBB  B38A 869A ECE1 D7F0 6CD5

- --------------------------------------------------------------------
To add or remove yourself from this mailing list, use the 'notifyme'
command on any BGL machine. To remove: notifyme -n, to add: notifyme -y.