Forecasting resource utilization in large cloud computing systems is essential for maintaining high-quality service, fulfilling service level agreements, and ensuring cost-effectiveness and profitability. Accurate long-term forecasting is challenging because resource utilization is dynamic and fluctuates over short time intervals.
A forecasting model was developed using multivariate time series k-nearest neighbor (MTS k-NN) regression to forecast optimal resource utilization for CPU and memory, incorporating additional explanatory (exogenous) variables that contribute to resource utilization. The forecasting effectiveness and accuracy of the MTS k-NN model were evaluated using real-world (Google trace) data. The model was compared with two classical statistical time-series forecasting models, autoregressive integrated moving average (ARIMA) and ARIMA with exogenous variables (ARIMAX), and six other machine learning algorithms: support vector machine (SVM), artificial neural network (ANN), multivariate adaptive regression splines (MARS), classification and regression trees (CART), random forest (RF), and naïve Bayes (NB).
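The general idea of k-NN regression over a lagged multivariate time series can be illustrated with a minimal sketch. This is not the dissertation's implementation: the function names `make_lagged_features` and `knn_forecast`, the lag count, the choice of Euclidean distance with an unweighted neighbor mean, and the synthetic CPU and task-count series are all assumptions made for illustration.

```python
import numpy as np


def make_lagged_features(target, exog, n_lags):
    """Build one row per time step t: the previous n_lags target values
    plus the exogenous values at t-1, paired with the label target[t].
    (Hypothetical helper; the feature layout is an assumption.)"""
    target = np.asarray(target, dtype=float)
    exog = np.asarray(exog, dtype=float)
    if exog.ndim == 1:
        exog = exog[:, None]  # treat a single exogenous series as one column
    rows, labels = [], []
    for t in range(n_lags, len(target)):
        rows.append(np.concatenate([target[t - n_lags:t], exog[t - 1]]))
        labels.append(target[t])
    return np.array(rows), np.array(labels)


def knn_forecast(X_train, y_train, query, k=3):
    """Average the labels of the k training rows closest to the query
    (Euclidean distance, unweighted mean)."""
    dists = np.linalg.norm(X_train - query, axis=1)
    nearest = np.argsort(dists)[:k]
    return float(y_train[nearest].mean())


# Synthetic data only: a task-count series driving a noisy CPU series.
rng = np.random.default_rng(0)
steps = np.arange(200)
tasks = 50 + 10 * np.sin(2 * np.pi * steps / 24) + rng.normal(0, 1, 200)
cpu = 0.01 * tasks + rng.normal(0, 0.02, 200)

X, y = make_lagged_features(cpu, tasks, n_lags=3)

# Standardize features so the exogenous column does not dominate distances.
mu, sigma = X.mean(axis=0), X.std(axis=0)
X_scaled = (X - mu) / sigma

# One-step-ahead query built from the most recent observations.
query = (np.concatenate([cpu[-3:], [tasks[-1]]]) - mu) / sigma
print(knn_forecast(X_scaled, y, query, k=5))
```

Standardizing the features matters in a sketch like this because the lagged utilization values and the exogenous values sit on different scales, and an unscaled Euclidean distance would be driven almost entirely by the larger one.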
Results indicate that CPU and memory utilization are inextricably linked with the exogenous variables, and that adding exogenous variables to the time series is a significant factor in forecasting resource utilization that should not be overlooked. The results also demonstrate the accuracy of the approach. The MTS k-NN model can potentially be applied to real-world scenarios to forecast optimal resource utilization, facilitating efficient management of cloud computing systems and helping to overcome resource utilization challenges, especially capacity planning and performance management, while maintaining quality of service (QoS) and satisfying service level agreements (SLAs).
|Advisor:||Etemadi, Amirhossein, Malalla, Ebrahim|
|Committee:||Blackburn, Timothy, Etemadi, Amirhossein, Malalla, Ebrahim|
|School:||The George Washington University|
|School Location:||United States -- District of Columbia|
|Source:||DAI-B 80/04(E), Dissertation Abstracts International|
|Subjects:||Operations research, Artificial intelligence, Computer science|
|Keywords:||K-nn, Machine learning, Multivariate, Optimization, Time series|
Copyright in each Dissertation and Thesis is retained by the author. All Rights Reserved