So I'm having a hard time justifying why I'm concentrating on scheduling BoTs.
Finally found the Minh paper... (Why didn't I come across this earlier? :( :( )
Now I tried to look up a tool developed by Minh which I thought would make my life easier. It supposedly would cough up the BoTs if I fed it any workload trace.
http://www.liacs.nl/home/minhtn/models
'n bad luck... I get "Pagina niet gevonden" (Dutch for "Page not found")...
So now I guess I have to write my own code to detect BoTs.... 'n all I gotta do is follow everything from the Minh paper on "Realistic Integrated Model on Parallel Workloads"
Some Notes from the above mentioned paper...
What are BoTs?
In any parallel system workload, W,
(This parallel concept 'bout BoT is what's utterly confusing and even though I think I've understood it... it's confounding to the person I'm explaining it to...)
(So I'm going to interpret parallel system workload as workload given to a set of machines which can execute various tasks simultaneously...)
A BoT (bag of tasks) contains tasks that arrive at different times (one after the other) within a particular time frame
(they have observed this to be 100 sec). The jobs in a BoT have the following characteristics:
- They originate from the same user.
- They have the same group name.
- They have the same queue name.
- They have the same job name.
- They have (approximately) the same user-estimated runtime.
- They have the same number of requested processors.
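Here's a rough Python sketch of how I might turn the checklist above into a detector. Lots of assumptions here, all mine and not from the paper: the `Job` record and its field names are made up (not any real trace format), I read the 100 sec time frame as "every task arrives within 100 sec of the first task in the batch", "approximately the same" estimate means within a hypothetical 10% tolerance (`est_tol`), and a batch needs at least two tasks (`min_size`) to count as a BoT.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Job:
    """Hypothetical job record; the field names are my own, not a trace format."""
    submit: float    # submission time, in seconds
    user: str
    group: str
    queue: str
    name: str
    estimate: float  # user-estimated runtime, in seconds
    procs: int       # number of requested processors


def fits(first, job, window, est_tol):
    """True if `job` can join the batch that was opened by `first`."""
    in_window = job.submit - first.submit <= window
    close_estimate = abs(job.estimate - first.estimate) <= est_tol * first.estimate
    return in_window and close_estimate


def detect_bots(jobs, window=100.0, est_tol=0.1, min_size=2):
    """Greedily group jobs into BoTs (assumed reading of the criteria)."""
    # Exact-match attributes: user, group, queue, job name, processors.
    by_key = defaultdict(list)
    for job in jobs:
        by_key[(job.user, job.group, job.queue, job.name, job.procs)].append(job)

    bots = []
    for same_key in by_key.values():
        same_key.sort(key=lambda j: j.submit)
        current = []
        for job in same_key:
            if current and fits(current[0], job, window, est_tol):
                current.append(job)
            else:
                # Close the running batch and open a new one with this job.
                if len(current) >= min_size:
                    bots.append(current)
                current = [job]
        if len(current) >= min_size:
            bots.append(current)
    return bots


jobs = [
    Job(0, "u1", "g1", "q1", "sim", 3600, 1),
    Job(10, "u1", "g1", "q1", "sim", 3600, 1),
    Job(50, "u1", "g1", "q1", "sim", 3650, 1),
    Job(500, "u1", "g1", "q1", "sim", 3600, 1),  # outside the 100 sec window
    Job(20, "u2", "g1", "q1", "sim", 3600, 1),   # different user
]
bots = detect_bots(jobs)
print([[j.submit for j in bot] for bot in bots])
```

If the time frame is really meant as the gap between consecutive tasks rather than the span of the whole batch, comparing against `current[-1]` instead of `current[0]` in the window check would be the change to make.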
Around 70% of jobs are part of BoTs.
The submission duration of a BoT is the difference between the latest and the earliest submission time of its tasks. All are less than 100%.
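The two numbers above are easy to check once the BoTs are known, so here's a tiny Python sketch. It assumes my reading of submission duration (latest submit time minus earliest submit time within one BoT); the function names are hypothetical, just for illustration.

```python
def submission_duration(submit_times):
    """Seconds between the earliest and latest submission in one BoT."""
    if not submit_times:
        raise ValueError("a BoT needs at least one task")
    return max(submit_times) - min(submit_times)


def fraction_in_bots(bot_sizes, total_jobs):
    """Share of all jobs in the trace that belong to some BoT."""
    if total_jobs <= 0:
        raise ValueError("total_jobs must be positive")
    return sum(bot_sizes) / total_jobs


# Toy numbers, not taken from any real trace.
print(submission_duration([0, 10, 50]))
print(fraction_in_bots([3, 4], 10))
```

Running this over the detected BoTs of a real trace would show whether the roughly 70% figure holds there too.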
Now there's an algo on how to generate synthetic BoTs... but I hopefully don't have to resort to that if I could just use the above "simple" observations 'n detect BoTs from the LCG trace.
Something tells me I'm not gonna get outta this without digging into the depths of the Zipf and Gaussian distributions.