Glioblastoma multiforme (GBM) is the most common form of brain cancer. The exact cause of GBM is not well understood. In this thesis, we tested whether germline genetic information could predict who will develop GBM and when will they develop it. We first extracted copy number variation (CNV) data from germline DNA in the peripheral blood samples of 8826 patients in the The Cancer Genome Atlas (TCGA) database. We compared that to 8338 patients in the database who did not develop GBM. We used several machine learning algorithms: deep learning, gradient boosting machine and random forest methods to test whether the germ line genetic data could predict who would develop GBM. The gradient boosting machine algorithm achieved the best results with an 0.82 AUC. We then used this gradient boosting method to test whether germ line DNA information could predict the age of diagnosis of GBM patients. We compared the correlation coefficient between the predicted age and actual age for GBM patients to the predicted correlation coefficient measured for randomized control groups and found a significantly better prediction in the GBM patient group (p-value 0.0004). These results suggest that who develops glioblastoma and when they are diagnosed with glioblastoma is influenced by germline genetics.
|Advisor:||Brody, James P.|
|Commitee:||Kruggel, Frithjof, Brewer, Gregory|
|School:||University of California, Irvine|
|Department:||Biomedical Engineering - M.S.|
|School Location:||United States -- California|
|Source:||MAI 81/1(E), Masters Abstracts International|
|Subjects:||Biomedical engineering, Oncology, Genetics|
|Keywords:||Glioblastoma multiforme, Brain cancer|
Copyright in each Dissertation and Thesis is retained by the author. All Rights Reserved
The supplemental file or files you are about to download were provided to ProQuest by the author as part of a
dissertation or thesis. The supplemental files are provided "AS IS" without warranty. ProQuest is not responsible for the
content, format or impact on the supplemental file(s) on our system. in some cases, the file type may be unknown or
may be a .exe file. We recommend caution as you open such files.
Copyright of the original materials contained in the supplemental file is retained by the author and your access to the
supplemental files is subject to the ProQuest Terms and Conditions of use.
Depending on the size of the file(s) you are downloading, the system may take some time to download them. Please be