Allan Variance-based Granulation Technique for Large Temporal Databases

Sinanaj, Lorina; Haeri, Hossein; Gao, Liming; Maddipatla, Satya Prasad; Chen, Cindy; Jerath, Kshitij; Beal, Craig; Brennan, Sean


To decrease query response time with limited main memory and storage space, data reduction techniques that preserve data quality are needed. Existing data reduction techniques, however, are often computationally expensive and rely on heuristics for deciding how to split or reduce the original dataset. In this paper, we propose an effective granular data reduction technique for temporal databases, based on Allan Variance (AVAR). AVAR is used to systematically determine the temporal window length over which data remains relevant. The entire dataset to be reduced is then separated into granules with size equal to the AVAR-determined window length. Data reduction is achieved by generating aggregated information for each such granule. The proposed method is tested on a large database that contains temporal vehicular data. Comparison experiments against three clustering-based data reduction methods illustrate its favorable runtime performance. The results demonstrate that the proposed Allan Variance-based technique can efficiently generate a reduced representation of the original data without losing data quality, while significantly reducing computation time.
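
The pipeline described in the abstract (compute AVAR, derive a window length, split the data into granules of that length, aggregate each granule) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation. It assumes evenly sampled values, uses the standard non-overlapping Allan variance estimator, and assumes the window is chosen as the candidate with the smallest AVAR; the abstract does not state the selection rule. The function names and the choice of aggregates (mean, min, max) are hypothetical.

```python
from statistics import fmean


def allan_variance(values, m):
    """Non-overlapping Allan variance of `values` for a window of m samples."""
    n_blocks = len(values) // m if m >= 1 else 0
    if n_blocks < 2:
        raise ValueError("need at least two full windows of length m")
    # Mean of each consecutive, non-overlapping block of m samples.
    means = [fmean(values[k * m:(k + 1) * m]) for k in range(n_blocks)]
    # Half the mean squared difference between adjacent block means.
    sq_diffs = [(b - a) ** 2 for a, b in zip(means, means[1:])]
    return sum(sq_diffs) / (2 * (n_blocks - 1))


def pick_window(values, candidates):
    """ASSUMED selection rule: the candidate window with the smallest AVAR."""
    return min(candidates, key=lambda m: allan_variance(values, m))


def granulate(values, m):
    """Split into consecutive granules of m samples and aggregate each one.

    Returns tuples (start_index, count, mean, min, max); the last granule
    may hold fewer than m samples.
    """
    granules = []
    for start in range(0, len(values), m):
        chunk = values[start:start + m]
        granules.append((start, len(chunk), fmean(chunk), min(chunk), max(chunk)))
    return granules
```

Under these assumptions, a call such as `granulate(series, pick_window(series, [1, 2, 4, 8]))` replaces each window of raw samples with one aggregated row, which is where the data reduction comes from.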

Award ID(s): 1932138

PAR ID: 10299310

Author(s) / Creator(s): Sinanaj, Lorina; Haeri, Hossein; Gao, Liming; Maddipatla, Satya Prasad; Chen, Cindy; Jerath, Kshitij; Beal, Craig; Brennan, Sean

Date Published: 2021-10-01

Journal Name: 13th International Conference on Knowledge Management and Information Systems

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Conference Paper: The DOI is not currently available.
