Re: [bgl-discuss] big-o-jobs (Mark killed the machine)
Mark nods. The two 32-node jobs you saw were the tail
end of a larger swarm that seemed to clear just fine. One of these
died after two nodes ran out of memory.
At 09:38 AM 3/15/2005, Pete Beckman wrote:
I noticed that too. Last
night Mark had two jobs running in 32-node partitions, and 3 big, 512
node jobs queued. I figured Mark's science was much more important
than mine, so I gave up using the computer until this morning.
Then, to my horror, he still had the same 3 jobs queued up...
But worse yet, it seems we have everyone queuing and nobody doing
anything (Is this Britain?)
login1> qstat
JobID  User    WallTime  Nodes  State
==========================================
120    hereld  00:01:40  512    queued
121    hereld  00:01:40  512    queued
122    hereld  00:01:40  512    queued
130    rloy    00:00:10  32     queued
-Pete
At 9:10 AM -0600 3/15/05, Mark Hereld wrote:
so. last night i queued a few
fairly short jobs (probably only 15m to 30m) on 512 nodes. but they
remained queued all night, despite the likelihood that i was the only
bloke doing bgl bidnis last night. earlier in the day i
successfully ran a 128 node short job, so i'm sure that it works in
principle. a stack of 32 node jobs ran, taking only minutes each,
and cleared the queue early in the night.
what's up: bug, policy, harassment?
-- mark
Mark Hereld
Futures Laboratory
Mathematics & Computer Science
Argonne National Laboratory
9700 S. Cass Ave. #221
Argonne, IL 60439
Voice: 630 252 4170
FAX: 630 252 6424
http://www.mcs.anl.gov/~hereld/
--
Pete Beckman
Argonne National Laboratory
MCS-221
9700 South Cass Avenue
Argonne, Illinois 60439-4844, USA
Phone: 630-252-9020
Email: beckman@xxxxxxxxxxx
PGP: 12C0 4357 1197 7BC7 8BBB B38A 869A ECE1 D7F0 6CD5
Mark Hereld
Futures Laboratory
Mathematics & Computer Science
Argonne National Laboratory
9700 S. Cass Ave. #221
Argonne, IL 60439
Voice: 630 252 4170
FAX: 630 252 6424
http://www.mcs.anl.gov/~hereld/