Dissertation/Thesis Abstract

Dynamic Fraud Detection via Sequential Modeling
by Zheng, Panpan, Ph.D., University of Arkansas, 2020, 140; 27960392
Abstract (Summary)

The impacts of information revolution are omnipresent from life to work. The web services have significantly changed our living styles in daily life, such as Facebook for communication and Wikipedia for knowledge acquirement. Besides, varieties of information systems, such as data management system and management information system, make us work more efficiently. However, it is usually a double-edged sword. With the popularity of web services, relevant security issues are arising, such as fake news on Facebook and vandalism on Wikipedia, which definitely impose severe security threats to OSNs and their legitimate participants. Likewise, office automation incurs another challenging security issue, insider threat, which may involve the theft of confidential information, the theft of intellectual property, or the sabotage of computer systems. A recent survey says that 27% of all cyber crime incidents are suspected to be committed by the insiders. As a result, how to flag out these malicious web users or insiders is urgent. The fast development of machine learning (ML) techniques offers an unprecedented opportunity to build some ML models that can assist humans to detect the individuals who conduct misbehaviors automatically. However, unlike some static outlier detection scenarios where ML models have achieved promising performance, the malicious behaviors conducted by humans are often dynamic. Such dynamic behaviors lead to various unique challenges of dynamic fraud detection:

  • Unavailability of sufficient labeled data | traditional machine learning approaches usually require a balanced training dataset consisting of normal and abnormal samples. In practice, however, there are far fewer abnormal labeled samples than normal ones.
  • Lack of high quality labels - the labeled training records often have the time gap between the time that fraudulent users commit fraudulent actions and the time that they are suspended by the platforms.
  • Time-evolving nature - users are always changing their behaviors over time.

To address the aforementioned challenges, in this dissertation, we conduct a systematic study for dynamic fraud detection, with a focus on: (1) Unavailability of labeled data: we present (a) a few-shot learning framework to handle the extremely imbalanced dataset that abnormal samples are far fewer than the normal ones and (b) a one-class fraud detection method using a complementary GAN (Generative Adversarial Network) to adaptively generate potential abnormal samples; (2) Lack of high-quality labels: we develop a neural survival analysis model for fraud early detection to deal with the time gap; (3) Time-evolving nature: we propose (a) a hierarchical neural temporal point process model and (b) a dynamic Dirichlet marked Hawkes process model for fraud detection.

Indexing (document details)
Advisor: Wu, Xintao
Commitee: Li, Qinghua, Yang, Song, Zhang, Lu
School: University of Arkansas
Department: Computer Science
School Location: United States -- Arkansas
Source: DAI-B 82/2(E), Dissertation Abstracts International
Subjects: Computer science
Keywords: Dirichlet process, Fraud detection, Machine learning, Mixture model, Sequential model, Survival analysis
Publication Number: 27960392
ISBN: 9798662569751
Copyright © 2021 ProQuest LLC. All rights reserved. Terms and Conditions Privacy Policy Cookie Policy