Detecting Temporal Dependencies in Data

Cuomo, Joaquin; Homayouni, Hajar; Ray, Indrakshi; Ghosh, Sudipto

Citation Details

Organizations collect data from various sources, and the characteristics of these datasets may be unknown. Selecting an appropriate statistical or machine learning algorithm for data analysis benefits from understanding these characteristics, such as whether the dataset contains temporal attributes. This paper presents a theoretical basis for automatically determining the presence of temporal data in a dataset given no prior knowledge about its attributes. We use a method to classify an attribute as temporal, non-temporal, or hidden temporal. A hidden (grouping) temporal attribute can only be treated as temporal if its values are categorized in groups. Our method uses a Ljung-Box test for autocorrelation as well as a set of metrics we propose based on the classification statistics. Our approach detects all temporal and hidden temporal attributes in 15 datasets from various domains.
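
The abstract names the Ljung-Box test as one ingredient of the method. The following is a minimal sketch of that test alone, written with the Python standard library; it is not the authors' implementation and does not reproduce their classification metrics or the grouping step for hidden temporal attributes. The function names, the default of 10 lags, and the 0.05 significance level are assumptions made for illustration. The closed-form p-value used here is valid only for an even number of lags.

```python
import math


def ljung_box_q(values, lags):
    """Return the Ljung-Box Q statistic over the first `lags` autocorrelations."""
    n = len(values)
    mean = sum(values) / n
    centered = [v - mean for v in values]
    denom = sum(c * c for c in centered)
    if denom == 0:
        return 0.0  # constant column: no autocorrelation to measure
    acc = 0.0
    for k in range(1, lags + 1):
        # Sample autocorrelation at lag k.
        r_k = sum(centered[t] * centered[t - k] for t in range(k, n)) / denom
        acc += r_k * r_k / (n - k)
    return n * (n + 2) * acc


def chi2_sf_even(x, df):
    """Chi-square survival function; closed form that holds only for even df."""
    if df <= 0 or df % 2 != 0:
        raise ValueError("this closed form requires a positive, even df")
    half = x / 2.0
    term, total = 1.0, 1.0
    for k in range(1, df // 2):
        term *= half / k
        total += term
    return math.exp(-half) * total


def looks_autocorrelated(values, lags=10, alpha=0.05):
    """Hypothetical helper: True if the test rejects 'no autocorrelation'."""
    p_value = chi2_sf_even(ljung_box_q(values, lags), lags)
    return p_value < alpha


# Illustrative columns (assumed data, not from the paper).
trend = [float(i) for i in range(100)]   # values that depend on row order
constant = [3.0] * 100                   # values with no variation at all

print(looks_autocorrelated(trend))
print(looks_autocorrelated(constant))
```

A rejection of the null hypothesis only indicates that values depend on their order in the column; deciding whether an attribute is temporal, non-temporal, or hidden temporal requires the additional metrics described in the paper.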

Award ID(s): 2027750, 1822118

PAR ID: 10340373

Author(s) / Creator(s): Cuomo, Joaquin; Homayouni, Hajar; Ray, Indrakshi; Ghosh, Sudipto

Editor(s): Pirk, Holger; Heinis, Thomas

Date Published: 2022-03-28

Journal Name: Proceedings of the British International Conference on Databases

Page Range / eLocation ID: 29-39

Sponsoring Org: National Science Foundation
