Low substantiation rates are pervasive across the federal Office of Inspector General (OIG) community due to high levels of uncertainty and limited data availability at the time of case selection. OIG management often selects cases based on intuition and past experience. Intuitive project selection has proven unsuccessful because the methods are often subjective, prone to bias, and lead to error. The high uncertainty surrounding case selection and the current selection method employed by OIG management teams results in a significant loss of investigative resources spent on unsubstantiated cases. This research presents a novel approach to predict OIG investigative case substantiation using natural language processing techniques and multinomial naïve Bayes to retrieve information from complaint intakes. It aims to improve OIG substantiation rates and reduce the cost associated with unsubstantiated cases. The model developed in this study significantly outperformed OIG management and was 20% more accurate in the prediction of substantiated and unsubstantiated cases. This model will augment investigative case selection and improve investigative targeting, increase impact of investigative work, and improve OIG investigative resource allocation. Its application will result in a significant savings by reducing the resources dedicated to cases with a low probability of substantiation.
|Commitee:||Fossaceca, John, Etemadi, Amir|
|School:||The George Washington University|
|School Location:||United States -- District of Columbia|
|Source:||DAI-A 82/6(E), Dissertation Abstracts International|
|Subjects:||Applied Mathematics, Engineering, Business administration, Management, Finance|
|Keywords:||Naive bayes, Natural language processing, Substantiation, Natural Language Processing, Low substantiation rates, Office of Inspector General, Limited data availability|
Copyright in each Dissertation and Thesis is retained by the author. All Rights Reserved
The supplemental file or files you are about to download were provided to ProQuest by the author as part of a
dissertation or thesis. The supplemental files are provided "AS IS" without warranty. ProQuest is not responsible for the
content, format or impact on the supplemental file(s) on our system. in some cases, the file type may be unknown or
may be a .exe file. We recommend caution as you open such files.
Copyright of the original materials contained in the supplemental file is retained by the author and your access to the
supplemental files is subject to the ProQuest Terms and Conditions of use.
Depending on the size of the file(s) you are downloading, the system may take some time to download them. Please be