Dissertation/Thesis Abstract

Assembling improved gene annotations in Clostridium acetobutylicum with RNA sequencing
by Ralston, Matthew T., M.S., University of Delaware, 2015, 152; 1585177
Abstract (Summary)

The C. acetobutylicum genome annotation has been markedly improved by integrating bioinformatic predictions with RNA sequencing(RNA-seq) data. Samples were acquired under butanol, butyrate, and unstressed treatments across various growth stages to sample the transcriptome from a range of physiologically relevant conditions. Analysis of an initial assembly revealed errors due to technical and biological background signals, challenges with few solutions. Hurdles for RNA-seq transcriptome mapping research include optimizing library complexity and sequencing depth, yet most studies in bacteria report low depth and ignore the effect of ribosomal RNA abundance and other sources on the effective sequencing depth.

In this work, workflows were established to address type I and II errors associated with these challenges. An integrative analysis method was developed to combine motif predictions, single-nucleotide resolution sequencing depth, and library complexity to resolve these errors during assembly curation. This contextualization minimized false positive error and determined gene boundaries, in some cases, to the exact basepair of prior studies. Curation of the pSOL1 megaplasmid reconciled transcriptome assembly statistics with findings from E. coli.

The resulting annotation can be readily explored and downloaded through a customized genome browser, enabling future genomic and transcriptomic research in this organism. This work demonstrates the first strand-specific transcriptome assembly in a Clostridium organism. This method can improve the precision of transcript boundary estimates in bacterial transcriptome mapping studies.

Indexing (document details)
Advisor: Papoutsakis, Eleftherios T.
Commitee: Polson, Shawn W., Wu, Cathy H.
School: University of Delaware
Department: Department of Computer and Information Sciences
School Location: United States -- Delaware
Source: MAI 54/04M(E), Masters Abstracts International
Source Type: DISSERTATION
Subjects: Microbiology, Bioinformatics
Keywords: Assembly, Genome, Rna-seq, Transcription start sites, Transcriptome, Untranslated region
Publication Number: 1585177
ISBN: 9781321611328
Copyright © 2019 ProQuest LLC. All rights reserved. Terms and Conditions Privacy Policy Cookie Policy
ProQuest