Published January 1, 2011
| Version v1
Conference paper
Open
Smart Job Scheduling for High-Performance Cloud Computing Services
Description
In this paper, we describe the challenges faced and lessons learned while establishing a large-scale high performance cloud computing service that enables online mechanical structural analysis and many other scientific applications using the finite element analysis (FEA) technique. The service is intended to process many independent and loosely-dependent (e.g. assembled system) tasks concurrently. Challenges faced include accurate job characterization, handling of many-task mixed jobs, sensitivity of task execution to multi-threading parameters, effective multi-core scheduling in a single node, and achieving seamless scale across multiple nodes. We find that significant performance gains in terms of both job completion latency and throughput are possible via dynamic or "smart" partitioning and resource-aware scheduling compared to shortest first and aggressive job scheduling techniques. We also discuss issues related to secure and private processing of sensitive models in the cloud.
Files
bib-c7c06472-c204-4646-8ad4-6ebeadb19dc7.txt
Files
(218 Bytes)
| Name | Size | Download all |
|---|---|---|
|
md5:1f5f900b389f18328b86973556b4b8cb
|
218 Bytes | Preview Download |