TY - GEN
T1 - Towards an Autonomic Cluster Management System (ACMS) with reflex autonomicity
AU - Truszkowski, Walt
AU - Hinchey, Mike
AU - Sterritt, Roy
PY - 2005
Y1 - 2005
N2 - Cluster computing, whereby a large number of simple processors or nodes are combined together to apparently function as a single powerful computer, has emerged as a research area in its own right. The approach offers a relatively inexpensive means of providing a fault-tolerant environment and achieving significant computational capabilities for high-performance computing applications. However, the task of manually managing and configuring a cluster quickly becomes daunting as the cluster grows in size. Autonomic computing, with its vision to provide self-management, can potentially solve many of the problems inherent in cluster management. We describe the development of a prototype Autonomic Cluster Management System (ACMS) that exploits autonomic properties in automating cluster management and its evolution to include reflex reactions via pulse monitoring.
UR - http://www.scopus.com/inward/record.url?scp=23944458155&partnerID=8YFLogxK
U2 - 10.1109/ICPADS.2005.281
DO - 10.1109/ICPADS.2005.281
M3 - Conference contribution
AN - SCOPUS:23944458155
SN - 0769522815
T3 - Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS
SP - 478
EP - 482
BT - Proceedings - 11th International Conference on Parallel and Distributed Systems Workshops, ICPADS 2005
A2 - Ma, J.
A2 - Yang, L.T.
T2 - 11th International Conference on Parallel and Distributed Systems Workshops, ICPADS 2005
Y2 - 20 July 2005 through 22 July 2005
ER -