Publications
Publications
- Statistics in Medicine
Fast Subset Scan for Multivariate Spatial Biosurveillance
By: Daniel B. Neill, Edward McFowland III and Huanian Zheng
Abstract
We present new subset scan methods for multivariate event detection in massive space-time datasets. We extend the recently proposed 'fast subset scan' framework from univariate to multivariate data, enabling computationally efficient detection of irregular space-time clusters even when the numbers of spatial locations and data streams are large. For two variants of the multivariate subset scan, we demonstrate that the scan statistic can be efficiently optimized over proximity-constrained subsets of locations and over all subsets of the monitored data streams, enabling timely detection of emerging events and accurate characterization of the affected locations and streams. Using our new fast search algorithms, we perform an empirical comparison of the Subset Aggregation and Kulldorff multivariate subset scans on synthetic data and real-world disease surveillance tasks, demonstrating tradeoffs between the detection and characterization performance of the two methods.
Keywords
Citation
Neill, Daniel B., Edward McFowland III, and Huanian Zheng. "Fast Subset Scan for Multivariate Spatial Biosurveillance." Statistics in Medicine 32, no. 13 (June 15, 2013): 2185–2208.