Published January 1, 2021 | Version v1
Journal article Open

Online Anomaly Detection With Bandwidth Optimized Hierarchical Kernel Density Estimators

  • 1. Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
  • 2. Sabanci Univ, Fac Engn & Nat Sci, TR-34956 Istanbul, Turkey
  • 3. Bilkent Univ, Dept Elect & Elect Engn, TR-06800 Ankara, Turkey

Description

We propose a novel unsupervised anomaly detection algorithm that can work for sequential data from any complex distribution in a truly online framework with mathematically proven strong performance guarantees. First, a partitioning tree is constructed to generate a doubly exponentially large hierarchical class of observation space partitions, and every partition region trains an online kernel density estimator (KDE) with its own unique dynamical bandwidth. At each time, the proposed algorithm optimally combines the class estimators to sequentially produce the final density estimation. We mathematically prove that the proposed algorithm learns the optimal partition with kernel bandwidths that are optimized in both region-specific and time-varying manner. The estimated density is then compared with a data-adaptive threshold to detect anomalies. Overall, the computational complexity is only linear in both the tree depth and data length. In our experiments, we observe significant improvements in anomaly detection accuracy compared with the state-of-the-art techniques.

Files

bib-bea414ec-52b5-43cf-a78c-efb5b657cc38.txt

Files (210 Bytes)

Name Size Download all
md5:e6152539fe24a0eb332d85a74f1ba1d8
210 Bytes Preview Download