Dissertation/Thesis Abstract

Weighted voting ensembles for high dimensional data
by Hordoan, Liliana A., M.S., California State University, Long Beach, 2015, 67; 1584935
Abstract (Summary)

In recent years, the understanding and development of microarray data has grown intensively to the benefit of medical science. One of these areas involves the application of statistical algorithms to categorize disease, treatments, cancers, outcomes, etc., especially for high-dimensional data. This thesis investigates the application of two different types of decision voting schemes for the classification of outcomes using microarray data. A weighted adjusted voting scheme is compared to the standard majority voting scheme for classification algorithms in different type of ensemble models. The investigation starts with decision trees as base classifiers, and then works on the improvement of the ensemble structure to investigate how the weighted adjusted voting scheme performs on actual microarray data. Due to the structure of high-dimensional data, cross-validation is used to evaluate the validity of the statistical analysis. Variable importance is considered in this research to improve model efficiency by selecting top-ranked genes via Random Forest. Then accuracy is assessed on different ensemble methods to draw conclusions on the performance of weighted voting scheme compared to average majority voting.

Indexing (document details)
Advisor: Moon, Hojin
Commitee: Ebneshahrashoob, Morteza, Kim-Park, Yong Hee
School: California State University, Long Beach
Department: Mathematics and Statistics
School Location: United States -- California
Source: MAI 54/03M(E), Masters Abstracts International
Subjects: Statistics
Publication Number: 1584935
ISBN: 978-1-321-60347-7
Copyright © 2021 ProQuest LLC. All rights reserved. Terms and Conditions Privacy Policy Cookie Policy