A substantial portion of resource costs incurred by data centers relate to energy costs, with cooling energy and equipment powering energy accounting for a major fraction. Other major costs incurred by data centers, is due to huge data transmission volume and resultant network bandwidth consumption. In this dissertation, we study problems inspired by the needs to reduce energy consumption and network bandwidth billing costs in data centers.
A significant amount of data center cooling energy is wasted due to thermal imbalance and hot spots. In order to prevent this, the workload should be scheduled in a thermally aware manner based on the overall thermal profile, since machine temperatures depend on local load as well as on neighboring machines' load. We define 'effective load' that captures this spatial cross-interference and analyze different models for: 1) maximizing the profit of scheduled jobs under a cooling energy budget, for which we give ½ − O(ε) approximation algorithms; 2) minimizing the maximum temperature when all jobs have to be scheduled, for which we give a 2-approximation algorithm and a 3-competitive online algorithm for a single rack of machines, where the factors approach 4/3 and 2 respectively as the cross-interference decays.
Servers consume energy while running; hence, shutting down some will reduce the costs. We consider two problems which study this in literature: active time and busy time, the goal being minimizing the total 'on' time of machines. In active time, we have access to a single machine whereas in busy time, number of machines allowed is unlimited. Machines have bounded capacity and jobs have release times, deadlines and lengths. For active time, we show a minimal feasible solution is 3-approximate and give a 2-approximation algorithm via LP rounding. For busy time, we give a 3-approximation algorithm which improves the 4-approximation, and analyze the preemptive problem also.
Data centers need to transmit a huge volume of data daily which results in high network bandwidth costs. Frequently, ISP's charge for Internet use either based on the peak bandwidth usage in any slot in the billing cycle, or according to some percentile cost. We provide an optimal offline algorithm for the percentile problem when jobs can have variable delay. For the online problem of minimizing peak bandwidth, we study small values of delay in a discrete time setting and give much better lower and upper bounds than the best known bound of e that holds when delay allowed is arbitrarily large and time is continuous.
|Commitee:||Deshpande, Amol, Gasarch, William, Khuller, Samir, Mount, David, Qu, Gang|
|School:||University of Maryland, College Park|
|School Location:||United States -- Maryland|
|Source:||DAI-B 75/06(E), Dissertation Abstracts International|
|Keywords:||Active time, Busy time, Data centers, Energy, Percentile, Temperature|
Copyright in each Dissertation and Thesis is retained by the author. All Rights Reserved
The supplemental file or files you are about to download were provided to ProQuest by the author as part of a
dissertation or thesis. The supplemental files are provided "AS IS" without warranty. ProQuest is not responsible for the
content, format or impact on the supplemental file(s) on our system. in some cases, the file type may be unknown or
may be a .exe file. We recommend caution as you open such files.
Copyright of the original materials contained in the supplemental file is retained by the author and your access to the
supplemental files is subject to the ProQuest Terms and Conditions of use.
Depending on the size of the file(s) you are downloading, the system may take some time to download them. Please be