A Resource Management System for Adaptive Parallel Applications in Cluster Environments
Ghafoor, S., Haupt, T., Banicescu, I., & Carino, R.L. (2005). A Resource Management System for Adaptive Parallel Applications in Cluster Environments. 6th International Conference on Linux Clusters: The HPC Revolution 2005. Chapel Hill, USA.
Adaptive parallel applications that can change resources during execution, promise better system utilization, increased application performance, and furthermore, they open the opportunity for developing a new class of parallel applications driven by unpredictable data and events, capable of amassing huge resources on demand. This paper discusses requirements for a resource management system to support such applications including communication and negotiation of resources. To schedule adaptive applications, interaction between the applications and the resource management system is necessary. While managing adaptive applications is a multidimensional complex research problem, this paper focuses only on support that a RMS requires to accommodate adaptive applications. An early prototype implementation shows that scheduling of adaptive applications is possible in a cluster environment and the overhead of management of applications is low compared to the long running time of typical parallel applications. The prototype implementation supports a variety of adaptive parallel applications in addition to rigid parallel applications.