A major goal of human genetics is to characterize the role of genetic variation on complex, polygenic phenotypes. With the discovery from genome-wide association studies (GWAS) that many associated variants have a small effect size and are located in non-coding regions of the genome, there has been a large effort to collect functional genomics data. The hope is that a better understanding of how the genome functions in diverse developmental states and environments will provide insight into the context-specific activity of associated non-coding variants. My research applies this paradigm to the complex phenotype of susceptibility to develop tuberculosis (TB). It has been estimated that 10% of individuals infected with Mycobacterium tuberculosis (MTB) progress to active disease. Despite being heritable, very few genetic variants have been associated with susceptibility to TB. For my studies, I use RNA sequencing (RNA-seq) to interrogate genome-wide transcript levels in in vitro cellular models. In Chapter 2, I use a joint Bayesian model to identify genes which are differentially expressed in macrophages only after infection with MTB and related mycobacteria, but not other bacterial pathogens. In Chapter 3, I build a support vector machine model to classify individuals as susceptible or resistant to TB based on the gene expression levels in their dendritic cells. In Chapter 4, I characterize the technical variation introduced by batch processing of single-cell RNA-seq (scRNA-seq) and propose an effective study design that accounts for technical variation while minimizing replication. In addition to providing insight into the genes important for the innate immune response to MTB infection, my work is informative for the design and analysis of future functional genomics experiments. (Note: Supplementary tables are provided in a .zip file available online. Captions for the tables are provided within the dissertation.)
Some files may require a special program or browser plug-in. More Information
|Commitee:||Novembre, John, Stephens, Matthew, Thornton, Joseph W.|
|School:||The University of Chicago|
|School Location:||United States -- Illinois|
|Source:||DAI-B 78/05(E), Dissertation Abstracts International|
|Keywords:||Batch effects, Human genetics, Machine learning, RNA-seq, Single-cell, Tuberculosis|
Copyright in each Dissertation and Thesis is retained by the author. All Rights Reserved
The supplemental file or files you are about to download were provided to ProQuest by the author as part of a
dissertation or thesis. The supplemental files are provided "AS IS" without warranty. ProQuest is not responsible for the
content, format or impact on the supplemental file(s) on our system. in some cases, the file type may be unknown or
may be a .exe file. We recommend caution as you open such files.
Copyright of the original materials contained in the supplemental file is retained by the author and your access to the
supplemental files is subject to the ProQuest Terms and Conditions of use.
Depending on the size of the file(s) you are downloading, the system may take some time to download them. Please be