Spatio-temporal prediction and reconstruction network for video anomaly detection.

Ting Liu, Chengqing Zhang, Xiaodong Niu, Liming Wang
Author Information
  1. Ting Liu: State Key Lab for Electronic Testing Technology, North University of China, Taiyuan, 030051, China.
  2. Chengqing Zhang: State Key Lab for Electronic Testing Technology, North University of China, Taiyuan, 030051, China.
  3. Xiaodong Niu: State Key Lab for Electronic Testing Technology, North University of China, Taiyuan, 030051, China.
  4. Liming Wang: State Key Lab for Electronic Testing Technology, North University of China, Taiyuan, 030051, China. ORCID

Abstract

The existing anomaly detection methods can be divided into two popular models based on reconstruction or future frame prediction. Due to the strong learning capacity, reconstruction approach can hardly generate significant reconstruction errors for anomalies, whereas future frame prediction approach is sensitive to noise in complicated scenarios. Therefore, a solution has been proposed by balancing the merits and demerits of the two models. However, most methods relied on single-scale information to capture spatial features and lacked temporal continuity between the video frames, affecting anomaly detection accuracy. Thus, we propose a novel method to improve anomaly detection performance. Because of the objects of various scales in each video, we select different receptive fields to extract comprehensive spatial features by the hybrid dilated convolution (HDC) module. Meanwhile, the deeper bidirectional convolutional long short-term memory (DB-ConvLSTM) module can remember the temporal information between the consecutive frames. Experiments prove that our method can detect abnormalities in various video scenes more accurately than the state-of-the-art methods in the anomaly-detection task.

References

  1. IEEE Trans Image Process. 2004 Apr;13(4):600-12 [PMID: 15376593]
  2. IEEE Trans Pattern Anal Mach Intell. 2019 Nov;41(11):2740-2755 [PMID: 30183621]
  3. PLoS One. 2020 Mar 5;15(3):e0229951 [PMID: 32134949]
  4. IEEE Trans Neural Netw Learn Syst. 2021 Feb;32(2):675-684 [PMID: 32275608]

MeSH Term

Neural Networks, Computer

Word Cloud

Created with Highcharts 10.0.0anomalydetectioncanreconstructionvideomethodspredictiontwomodelsfutureframeapproachinformationspatialfeaturestemporalframesmethodvariousmoduleexistingdividedpopularbasedDuestronglearningcapacityhardlygeneratesignificanterrorsanomalieswhereassensitivenoisecomplicatedscenariosThereforesolutionproposedbalancingmeritsdemeritsHoweverreliedsingle-scalecapturelackedcontinuityaffectingaccuracyThusproposenovelimproveperformanceobjectsscalesselectdifferentreceptivefieldsextractcomprehensivehybriddilatedconvolutionHDCMeanwhiledeeperbidirectionalconvolutionallongshort-termmemoryDB-ConvLSTMrememberconsecutiveExperimentsprovedetectabnormalitiesscenesaccuratelystate-of-the-artanomaly-detectiontaskSpatio-temporalnetwork

Similar Articles

Cited By

No available data.