Dissertation/Thesis Abstract

Enabling urgent computing within the existing distributed computing infrastructure
by Trebon, Nicholas, Ph.D., The University of Chicago, 2011, 135; 3472964
Abstract (Summary)

This dissertation defines the domain of urgent computing as the set of deadline-constrained scientific simulations and models that are used to affect some critical decision (e.g., severe weather forecasting, the modeling of infectious diseases, wildfire simulations, patient-specific medical treatment). These computations typically target large, distributed systems (e.g., supercomputers, clusters, grids, clouds) that are shared by many users. This resource contention often leads to nontrivial and highly variable allocation delays that impede the ability of the computations to meet their deadlines. Thus, resources that support urgent computing should provide elevated priority access to these urgent computations. This dissertation focuses on the additional infrastructure necessary to aid these critical computations in meeting their deadlines within the current distributed computing environment. In particular, the combination of management mechanisms, access policies and tools needed to support urgent computing on batch queue resources (e.g., supercomputers and clusters) and computational clouds.

This dissertation describes the implementation of the Special PRiority and Urgent Computing Environment (SPRUCE), a token-based framework that manages urgent computing users, sessions, and resources. This framework has been widely deployed on the TeraGrid. Additionally, this dissertation proposes and evaluates a set of elevated priority policies that different resource types (i.e., batch queue resources, cloud resources) can use to support urgent computing. Included in this evaluation is a discussion of the effect these policies have on nonurgent users of the resource, along with potential compensation schemes for affected users. This dissertation also identifies and evaluates a set of statistical methods and heuristics that can be used to predict probabilistic upper bounds on the total turnaround time of the urgent computation (i.e., file staging, resource allocation, and execution delay). These upper bounds can guide the user in selecting the resource and policy that offers the greatest probability of meeting the given deadline.

Indexing (document details)
Advisor: Foster, Ian
Commitee: Beckman, Pete, Stevens, Rick
School: The University of Chicago
Department: Computer Science
School Location: United States -- Illinois
Source: DAI-B 72/12, Dissertation Abstracts International
Subjects: Computer science
Keywords: Cloud computing, Deadline-constrained computing, Distributed computing, Elevated priority, Spruce, Urgent computing
Publication Number: 3472964
ISBN: 9781124869407
Copyright © 2019 ProQuest LLC. All rights reserved. Terms and Conditions Privacy Policy Cookie Policy