Each year over 880 million doctor visits occur in the United States. Current estimates place the yearly healthcare data records at 2,314 exabytes with projections of reaching zettabytes and yottabytes. For a country with 329 million people that spend $4 trillion on healthcare each year, understanding and uncovering the hidden patterns and trends within this big data is paramount in improving healthcare outcomes through preventative care and early diagnosis, creating a more efficient healthcare system with optimizations, and making clinicians' and patients' lives easier. With the ever increasing availability of big data, these problems are more important now than ever before.
While many event analysis and time series tools have been developed for the purpose of analyzing such datasets, most approaches tend to target clean and evenly spaced data (i.e., with a fixed time interval between observations). When faced with noisy or irregular data, it is typical to use a preprocessing step of transforming the data into being regular. This transformation technique arguably interferes on a fundamental level as to how the data is represented, and may irrevocably bias the way in which results are obtained. Therefore, operating on raw data, in its noisy natural form, is necessary to ensure that the insights gathered through analysis are accurate and valid.
In this dissertation novel approaches are presented for analyzing irregular event sequences using a variety of techniques ranging from deep learning, reinforcement learning, and visualization. We show how common tasks in event analysis can be performed directly on an irregular event dataset without requiring a transformation that alters the natural representation of the process that the data was captured from. We focus our efforts on healthcare specifically, but also evaluate our approaches against other domains to test for generalizability. The three tasks that we showcase include: (i) predicting the probability of a future event occurring, (ii) summarizing large event datasets, and (iii) modeling the processes that create events.
|Commitee:||Caban, Jesus J., Finin, Tim, Banerjee, Nilanjan, Chapman, David|
|School:||University of Maryland, Baltimore County|
|School Location:||United States -- Maryland|
|Source:||DAI-B 82/7(E), Dissertation Abstracts International|
|Subjects:||Computer science, Artificial intelligence, Public health|
|Keywords:||Deep learning, Healthcare, Irregular events, Suicide prediction, User interaction, Visualization|
Copyright in each Dissertation and Thesis is retained by the author. All Rights Reserved
The supplemental file or files you are about to download were provided to ProQuest by the author as part of a
dissertation or thesis. The supplemental files are provided "AS IS" without warranty. ProQuest is not responsible for the
content, format or impact on the supplemental file(s) on our system. in some cases, the file type may be unknown or
may be a .exe file. We recommend caution as you open such files.
Copyright of the original materials contained in the supplemental file is retained by the author and your access to the
supplemental files is subject to the ProQuest Terms and Conditions of use.
Depending on the size of the file(s) you are downloading, the system may take some time to download them. Please be