Methods for clustering health care claims into episodes of care are important tools for data analysis in evaluating health care outcomes and methods of payment. In this research, we implement two statistical methods (1) using an ensemble classifier, Random Forests, to estimate the strength of relationship in pairs of Medicare claims, and (2) using a sequence model and an Expectation Maximization (EM) algorithm.
Previous researchers into episode of care clustering have implemented other methods, some based on decision rules and others based on statistical methods. Other research in natural language processing, particularly conversation disentanglement, has developed methods that were inspirational for the methods that we implemented in this research.
We acquired two sets of Medicare claims data, one containing claims from 2006 and 2007 for 1.9 million patients (a 5 percent sample), and the other containing claims from 2007 and 2008 for 250 thousand patients. We tested our two statistical methods on claims for 50 randomly selected patients who were age 65 and older, using independent annotations by three licensed nurse practitioners. We found that the method using Random Forests outperformed the sequence model and achieved accuracy comparable to the nurse practitioner annotators. Future research into episode of care clustering might incorporate more extensive data on clinical relationships and create a more flexible representation of episode clusters, such as hierarchies and phases of care.
|Commitee:||Klein, Dan, Wainwright, Martin|
|School:||University of California, Berkeley|
|School Location:||United States -- California|
|Source:||DAI-B 72/12, Dissertation Abstracts International|
|Subjects:||Statistics, Health care management|
|Keywords:||Conversation disentanglement, Episode of care, Expectation maximization algorithm, Medicare, Natural language processing, Random forests|
Copyright in each Dissertation and Thesis is retained by the author. All Rights Reserved
The supplemental file or files you are about to download were provided to ProQuest by the author as part of a
dissertation or thesis. The supplemental files are provided "AS IS" without warranty. ProQuest is not responsible for the
content, format or impact on the supplemental file(s) on our system. in some cases, the file type may be unknown or
may be a .exe file. We recommend caution as you open such files.
Copyright of the original materials contained in the supplemental file is retained by the author and your access to the
supplemental files is subject to the ProQuest Terms and Conditions of use.
Depending on the size of the file(s) you are downloading, the system may take some time to download them. Please be