[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [bgl-discuss] machine locked
the primary issue is that this data isn't only being written to disk;
it is also processed by the queue manager. (this is the mechanism we
will be able to use to determine if the job ran properly or not) The
reason we cut off stdio over a particular limit was because it caused
performance problems for all users on chiba.
Hmmm, a standard pipe should never run out of resources unless
something is buffering everything in the middle. In other words, if
I do this:
mpirun <....> | check_output_with_queue_manager | cat > foo.out
and the check_output program does not save all the data, but maybe
only keeps the last 5 lines of output, then it should not grow
without bounds, and everything should be fine.
Maybe there is a different problem?
You should be able to save mpirun output forever, even with an
interposing output checker, until all your output is finished.
-Pete
--
---
Pete Beckman Phone: 630-252-9020
Argonne National Laboratory Email: beckman@xxxxxxxxxxx
MCS-221
9700 South Cass Avenue
Argonne, Illinois 60439-4844, USA
PGP: 12C0 4357 1197 7BC7 8BBB B38A 869A ECE1 D7F0 6CD5
- --------------------------------------------------------------------
To add or remove yourself from this mailing list, use the 'notifyme'
command on any BGL machine. To remove: notifyme -n, to add: notifyme -y.