Dissertation/Thesis Abstract

Predicting Switch-Like Behavior in Proteins using Logistic Regression on Sequence-Based Descriptors
by Strauss, Benjamin N., M.S., San Jose State University, 2019, 151; 27672517
Abstract (Summary)

Ligands can bind at specific protein locations, inducing conformational changes such as those involving secondary structure. Identifying these possible switches from sequence, including homology, is an important ongoing area of research. We attempt to predict possible secondary structure switches from sequence in proteins using machine learning, specifically a logistic regression approach with 48 N-acetyltransferases as our learning set and 5 sirtuins as our test set. Validated residue binary assignments of 0 (no change in secondary structure) and 1 (change in secondary structure) were determined (DSSP) from 3D X-ray structures for sets of virtually identical chains crystallized under different conditions. Our sequence descriptors include amino acid type, six and twenty-term sequence entropy, Lobanov-Galzitskaya’s residue disorder propensity, Vkabat (variability with respect to predictions from sequence of helix, sheet and other), and all possible combinations. We find the optimal AUC values approaching 70% for the two models of just residue disorder propensity and separately Vkabat. We hope to follow up with a larger learning set and using residue charge as an additional descriptor.

Supplemental Files

Some files may require a special program or browser plug-in. More Information

Indexing (document details)
Advisor: Lustig, Brooke, Pearce, Jon
Commitee: Pearce, Jon, Lustig, Brooke, Khuri, Sami
School: San Jose State University
Department: Computer Science
School Location: United States -- California
Source: MAI 81/11(E), Masters Abstracts International
Source Type: DISSERTATION
Subjects: Bioinformatics, Computational chemistry, Biostatistics
Keywords: Logistic regression, Protein switch, Qualitative predictors
Publication Number: 27672517
ISBN: 9798645438951
Copyright © 2020 ProQuest LLC. All rights reserved. Terms and Conditions Privacy Policy Cookie Policy
ProQuest