## MOUSSE: Multiscale Online Union of Subspaces Estimation

## Introduction

This page contains a summary and code associated with the paper "Changepoint detection for high-dimensional time series with missing data", by Y. Xie, J. Huang, and R. Willett, in IEEE Journal of Selected Topics in Signal Processing, vol. 7, no. 1, 2013. arXiv:1208.5062

This page describes a novel approach to change-point detection when the observed high-dimensional data may have missing elements. The performance of classical methods for change-point detection typically scales poorly with the dimensionality of the data, so that a large number of observations must be collected after the true change-point before it can be reliably detected. Furthermore, missing components in the observed data handicap conventional approaches.

The proposed method addresses these challenges by modeling the dynamic distribution underlying the data as lying close to a time-varying low-dimensional submanifold embedded within the ambient observation space. Specifically, streaming data is used to track a submanifold approximation, measure deviations from this approximation, and calculate a series of statistics of the deviations for detecting when the underlying manifold has changed in a sharp or unexpected manner.

The proposed approach leverages several recent results in the field of high-dimensional data analysis, including subspace tracking with missing data, multiscale analysis techniques for point clouds, online optimization, and change-point detection performance analysis. Simulations and experiments highlight the robustness and efficacy of the proposed approach in detecting an abrupt change in an otherwise slowly varying low-dimensional manifold.
## Problem Setup
## Initialization

A batch of training samples is recursively bi-partitioned. For each partition, an eigendecomposition is performed to initialize the corresponding node in the tree. The recursion stops once a partition's error $\delta(D-d)$ is less than a threshold. An extra partition is then used to initialize the virtual nodes.

## Online Update of Tree
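As a rough illustration of the projection step of this section when some entries of the new sample are missing, the sketch below computes the coefficients by least squares over the observed coordinates only. This restriction to observed entries is an assumption borrowed from the general subspace-tracking-with-missing-data setting, not a detail stated on this page, and the names `project_observed`, `observed`, and the synthetic data are hypothetical. With every entry observed and an orthonormal $U$, the result reduces to $U^T(x - c)$.

```python
import numpy as np

def project_observed(U, c, x, observed):
    """Least-squares projection coefficients from the observed entries only.

    U        : (D, d) basis with orthonormal columns (assumed)
    c        : (D,) center of the current subset
    x        : (D,) new sample; unobserved entries are ignored
    observed : (D,) boolean mask, True where x is observed
    """
    U_obs = U[observed]                    # rows of U at observed coordinates
    r_obs = x[observed] - c[observed]      # centered sample, observed part
    beta, *_ = np.linalg.lstsq(U_obs, r_obs, rcond=None)
    residual = r_obs - U_obs @ beta        # deviation from the subspace
    return beta, residual

# Synthetic example: D = 100, d = 1, roughly 40% of entries missing.
rng = np.random.default_rng(0)
D, d = 100, 1
U, _ = np.linalg.qr(rng.standard_normal((D, d)))  # orthonormal basis
c = rng.standard_normal(D)
beta_true = np.array([2.5])
x = c + U @ beta_true                              # sample lying on the subspace
observed = rng.random(D) > 0.4                     # True for observed entries

beta_hat, residual = project_observed(U, c, x, observed)
```

Because the synthetic sample lies exactly on the affine subspace, the recovered coefficients match `beta_true` and the residual is numerically zero. For real data, the residual norm is the kind of deviation one might monitor over time.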
Projection: $\beta=U^T(x_{t+1}-c_t)$

## Tracking Manifold Example

The video shows a manifold with $D=100$ and $d=1$, projected onto a 3D space. Each observation has about 40% missing entries, i.e., about 40 of the 100 dimensions are not observed. The dataset and the code that generates this video can be downloaded here.

## Application in Anomaly Detection

Anomaly detection in a solar flare video. There is an obvious anomaly at about $t=227$. The full dataset and the code that generates this video can be downloaded here.

## References

[1] L. Balzano, R. Nowak, and B. Recht, "Online identification and tracking of subspaces from highly incomplete information," in Proceedings of the Allerton Conference on Communication, Control and Computing, Sept. 2010, pp. 704-711.

[2] Y. Chi, Y. Eldar, and R. Calderbank, "PETRELS: subspace estimation and tracking from partial observations," IEEE Conf. on Acoustics, Speech and Signal Processing, 2012.

[3] K. Abed-Meraim, A. Chkeif, Y. Hua, and S. Attallah, "On a class of orthonormal algorithms for principal and minor subspace tracking," Journal of VLSI Signal Processing, vol. 31, pp. 57-70, 2002.