[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [bgl-discuss] machine locked



I understand Narayan's point of view, because I know what is
happening behind the scenes, but I agree that it should work "as
expected".  Narayan and I will discuss this today and figure out an
elegant and efficient solution.

Rusty


From: Pete Beckman <beckman@xxxxxxxxxxx>
Subject: Re: [bgl-discuss] machine locked
Date: Fri, 18 Mar 2005 08:18:32 -0600

> >the primary issue is that this data isn't only being written to disk;
> >it is also processed by the queue manager. (this is the mechanism we
> >will be able to use to determine if the job ran properly or not) The
> >reason we cut off stdio over a particular limit was because it caused
> >performance problems for all users on chiba.
> 
> Hmmm, a standard pipe should never run out of resources unless 
> something is buffering everything in the middle.  In other words, if 
> I do this:
> 
> mpirun <....> | check_output_with_queue_manager | cat > foo.out
> 
> and the check_output program does not save all the data, but maybe 
> only keeps the last 5 lines of output, then it should not grow 
> without bounds, and everything should be fine.
> 
> Maybe there is a different problem?
> 
> You should be able to save mpirun output forever, even with an 
> interposing output checker, until all your output is finished.
> 
> -Pete
> 
> -- 
> ---
> Pete Beckman                                Phone: 630-252-9020
> Argonne National Laboratory                 Email: beckman@xxxxxxxxxxx
> MCS-221
> 9700 South Cass Avenue
> Argonne, Illinois 60439-4844, USA
> PGP: 12C0 4357 1197 7BC7 8BBB  B38A 869A ECE1 D7F0 6CD5
> 
> - --------------------------------------------------------------------
> To add or remove yourself from this mailing list, use the 'notifyme'
> command on any BGL machine. To remove: notifyme -n, to add: notifyme -y.
> 

- --------------------------------------------------------------------
To add or remove yourself from this mailing list, use the 'notifyme'
command on any BGL machine. To remove: notifyme -n, to add: notifyme -y.