
arXiv: 2311.01174
Abstract The increasing volume of data streams poses significant computational challenges for detecting changepoints online. Likelihood-based methods are effective, but a naive sequential implementation becomes impractical online due to high computational costs. We develop an online algorithm that exactly calculates the likelihood ratio test for a single changepoint in p-dimensional data streams by leveraging a fascinating connection with computational geometry. This connection straightforwardly allows us to exactly recover sparse likelihood ratio statistics: that is assuming only a subset of the dimensions are changing. Our algorithm is straightforward, fast, and apparently quasi-linear. A dyadic variant of our algorithm is provably quasi-linear, being O(nlog(n)p+1) for n data points and p less than 3, but slower in practice. These algorithms are computationally impractical when p is larger than 5, and we provide an approximate algorithm suitable for such p which is O(nplog(n)p~+1), for some user-specified p~≤5. We derive statistical guarantees for the proposed procedures in the Gaussian case, and confirm the good computational and statistical performance, and usefulness, of the algorithms on both empirical data and NBA data.
dynamic programming, FOS: Computer and information sciences, online changepoint detection, Computation, computational geometry, convex hull, [MATH.MATH-ST] Mathematics [math]/Statistics [math.ST], Computation (stat.CO)
dynamic programming, FOS: Computer and information sciences, online changepoint detection, Computation, computational geometry, convex hull, [MATH.MATH-ST] Mathematics [math]/Statistics [math.ST], Computation (stat.CO)
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
