UnEncrypted
.pdf
Figure 2: Resulting scheduling for the example
3Experimentation
The aim of our experimental study was to determine the e ectiveness of the scheduling solutions provided by METL compared with common scheduling policies from the literature: First Come First Served (FCFS), Short Jobs First (SJF), Big Jobs First (BJF), Fit Processors First Served (FPFS), Short Processing Time (SPT) and Long Processing Time
(LPT). The experimentation was carried out by simulation, using the GridSim framework [11], and by characterizing a heterogeneous multi-cluster system as shown in Table 3.
|
Num.Nodes |
E ective power |
Cluster 1 |
60 |
1.0 |
Cluster 2 |
60 |
1.5 |
Cluster 3 |
60 |
2.0 |
Cluster 4 |
60 |
1.0 |
|
|
|
Table 3: Characterization of the experimental environment
In this experimental study the authors used real traces from the HPC2N logs in [12]. A set of three workloads each made up of 15,000 jobs was selected. The majority of the jobs on the there workloads are computational intensive (σj = 0.7), with an average base time of 15,520 seconds and 8.4 tasks per job.
The main aim of the METL is to reduce the execution time of the jobs. Accordingly, the average job execution time was used as the comparison metric for the three workloads. The results are shown in Figure 3. The x-axis represents each of the three experimental workloads, and the y-axis represents the average execution time expressed in seconds.
As can be seen, METL produces lower average execution times, while the techniques from the literature performed very similarly, producing allocations with higher values.
In order to better analyze the performance of the di erent techniques, and reveal the
c CMMSE |
Page 203 of 1573 |
ISBN:978-84-615-5392-1 |
Figure 3: Average Execution Time comparison of three workloads
real e ect of the scheduling decisions taken, we also evaluate the di erences between the estimated execution time and the base time for all the jobs in the workload. As the results in the previous analysis are similar irrespectively of the workload and the behavior for the classical policies is practically the same, the Workload 3 was taken as a representative sample, and the the METL was compared with the well known SJF policy.
Figure 4 shown the obtained results. In the x-axys we have the job id for all jobs in the workload. The jobs are ordered from less to higher base time. On the primary y-axys (left side) is represented the base time for each job measured in seconds. As can be seen, the majority of jobs have a low base time, and just only a minor part are big jobs. On the secondary y-axys (right side), is denoted in percentage the di erence between the execution time and its base time. Negative values on the right y-axis means that the job was executed faster than in the reference dedicated environment. So the lower the values are, better decisions are taken for the specific technique.
As can be observed, SJF produced solutions in which an important number of jobs have a longer execution time than in the dedicated environment. This is represented by the green points on the figure. On the other hand, METL was able to produce solutions in which the majority of the parallel jobs executed faster than in the dedicated environment. This is due to its ability to determine the job execution order and a fairness allocation in which the loss of the execution time is optimized and the underutilization of resources is avoided. By this, just only a few number of jobs has increased their execution time, however these jobs are those from the workload with the slowest base time, so the final execution time do not a ect significantly on the results.
The degree of complexity of the proposed policy was determined to be O(n2), where n denotes the number of jobs waiting in the system queue when the METL policy is applied. In this experimental study our policy has been treated in average 16 jobs each time it was
c CMMSE |
Page 204 of 1573 |
ISBN:978-84-615-5392-1 |
executed, obtaining the scheduling solution in 60ms in average.
Figure 4: Relation of the Execution Time with the Base Time
4Conclusions
The present work focuses on multiple job scheduling on heterogeneous multi-cluster environments with co-allocation. The authors proposed a new policy that determines the job execution order and its resource allocation trying to minimize the job execution time, preventing resource underutilization and avoiding starvation. The results, compared with common scheduling techniques from the literature, shown that our proposal was able to produce more fairness scheduling decision reducing the jobs execution time for di erent types of real workloads traces.
Acknowledgment
This work was supported by the Ministry of Education and Science of Spain under contract TIN2011- 28689-C02, TIN2010-12011-E and the CUR of DIUE of GENCAT and the European Social Fund.
References
[1]Javadi, B., Akbari, M.K., Abawajy J.H., A performance Model for Analysis of Heterogeneous Multi-Cluster Systems, Parallel Computing, 32(11-12), 831-851, 2006.
[2]Bucur, A.I.D., Epema, D.H.J., Schedulling Policies for Processor Coallocation in Multicluster Systems, IEEE TPDS, 18(7), 958-972, 2007.
c CMMSE |
Page 205 of 1573 |
ISBN:978-84-615-5392-1 |
[3]Jones, W., Ligon, W., Pang, L., Stanzione, D., Characterization of Bandwidth-Aware MetaSchedulers for Co-Allocating Jobs Across Multiple Clusters, Journal of Supercomputing, 2005, 34(2), 135-163, 2005.
[4]Heien, E.M., Fujimoto, N., Hagihara, K., Static Load Distribution for Communicative Intensive Parallel Computing in Multiclusters, IEEE PDP’08, 321-328, 2008.
[5]Naik, V.K., Liu, C., Yang, L., Wagner, J., Online Resource Matching for Heterogeneous Grid Environments, IEEE/ACM Int. Conf. CCGRID’05, 2, 607-614, 2005.
[6]L´erida, J.L., Solsona, F., Gin´e, F., Garc´ıa, J.R., Hern´andez, P., Resource Matching in Nondedicated Multicluster Environments, VECPAR’08, 2008, 160-173, 2008.
[7]Shmueli E., Feitelson D.G., Backfilling with Lookahead to Optimize the Packing of Parallel Jobs, J. J. Parallel Distrib. Comput., vol.65(9), pp.1090-1107, 2005.
[8]Blanco, H., L´erida, J.L., Cores F., Guirado, F., Multiple Job Co-Allocation Strategy for Heterogeneous Multi-Cluster Systems Based on Linear Programming, Journal of Supercomputing, vol.58(3), pp:394-402, 2011.
[9]Skillicorn, D.B., Hill, J. M.D., McColl, W.F., Questions and Answers about BSP, Oxford University Computing Laboratory, 1997.
[10]Blanco, H., Monta˜nola, A., Guirado F., L´erida J.L, Fairness Scheduling for Multi-Cluster Systems Based on Linear Programming, CMMSE’10, vol.1, pp:227-239, 2010
[11]GridSim simulation framework, http://www.buyya.com/gridsim
[12]Parallel Workloads Archive, http://www.cs.huji.ac.il/labs/parallel/workload
c CMMSE |
Page 206 of 1573 |
ISBN:978-84-615-5392-1 |
