From time-critical, real time computational experimentation to applications which process petabytes of data there is a continuing search for faster, more responsive computing platforms capable of supporting computational experimentation. Weather forecast models, for instance, process gigabytes of data to produce regional (mesoscale) predictions on a short turnaround. Through science gateways, these scientific experiments are made accessible to scientists and users worldwide. The introduction of cloud computing increases the options for computational platforms, putting scientists in the position of choosing the best resources to use that maximize their needs. Historical information recorded in job executions can reveal important information about the patterns and experiences of the users to improve the management of scientific jobs in large-scale systems.
This dissertation makes three key contributions to management of scientific job execution in large-scale systems. First, it proposes user patterns, using knowledge-based approaches, to provision for compute resources in such a way that maximizes cost metrics reducing the impact of startup overheads in cloud computing environments. Second, it proposes and through experimental evaluation establishes the viability of user derived reliability, learned by mining historical job execution information, as a new dimension to consider during resource selections. Experimental studies conducted on a prototype implementation of above two contributions validate the applicability of this approach and the accuracy of prediction models. Third, it proposes and prototypes a light-weight and reliable resource abstraction service to hide the complexities of interacting with multiple resources managers in grids and clouds. The framework has been applied in four different domains, including LEAD II cyberinfrastructure, and its performance validated through empirical analysis of such conditions as scalability in comparison to related approaches.
|Commitee:||Fox, Geoffrey, Gannon, Dennis, Leake, David|
|School Location:||United States -- Indiana|
|Source:||DAI-B 73/04, Dissertation Abstracts International|
|Subjects:||Information Technology, Computer science|
|Keywords:||Cloud computing, Grid computing, Knowledge-based computing, Machine learning, Resource abstraction, Scientific computing|
Copyright in each Dissertation and Thesis is retained by the author. All Rights Reserved
The supplemental file or files you are about to download were provided to ProQuest by the author as part of a
dissertation or thesis. The supplemental files are provided "AS IS" without warranty. ProQuest is not responsible for the
content, format or impact on the supplemental file(s) on our system. in some cases, the file type may be unknown or
may be a .exe file. We recommend caution as you open such files.
Copyright of the original materials contained in the supplemental file is retained by the author and your access to the
supplemental files is subject to the ProQuest Terms and Conditions of use.
Depending on the size of the file(s) you are downloading, the system may take some time to download them. Please be