Towards Statistically Significant Taxonomy Aware Co-Location Pattern Detection (Short Paper)

Ghosh, Subhankar; Sharma, Arun; Gupta, Jayant; Shekhar, Shashi

doi:10.4230/LIPIcs.COSIT.2024.25

Citation Details

Towards Statistically Significant Taxonomy Aware Co-Location Pattern Detection (Short Paper)

Given a collection of Boolean spatial feature types, their instances, a neighborhood relation (e.g., proximity), and a hierarchical taxonomy of the feature types, the goal is to find the subsets of feature types or their parents whose spatial interaction is statistically significant. This problem is for taxonomy-reliant applications such as ecology (e.g., finding new symbiotic relationships across the food chain), spatial pathology (e.g., immunotherapy for cancer), retail, etc. The problem is computationally challenging due to the exponential number of candidate co-location patterns generated by the taxonomy. Most approaches for co-location pattern detection overlook the hierarchical relationships among spatial features, and the statistical significance of the detected patterns is not always considered, leading to potential false discoveries. This paper introduces two methods for incorporating taxonomies and assessing the statistical significance of co-location patterns. The baseline approach iteratively checks the significance of co-locations between leaf nodes or their ancestors in the taxonomy. Using the Benjamini-Hochberg procedure, an advanced approach is proposed to control the false discovery rate. This approach effectively reduces the risk of false discoveries while maintaining the power to detect true co-location patterns. Experimental evaluation and case study results show the effectiveness of the approach. more »

Award ID(s):: 1901099

PAR ID:: 10609212

Author(s) / Creator(s):: Ghosh, Subhankar; Sharma, Arun; Gupta, Jayant; Shekhar, Shashi

Editor(s):: Adams, Benjamin; Griffin, Amy L; Scheider, Simon; McKenzie, Grant

Publisher / Repository:: Schloss Dagstuhl – Leibniz-Zentrum für Informatik

Date Published:: 2024-01-01

Volume:: 315

ISSN:: 1868-8969

ISBN:: 978-3-95977-330-0

Page Range / eLocation ID:: 25:1-25:11

Subject(s) / Keyword(s):: Co-location patterns spatial data mining taxonomy hierarchy statistical significance false discovery rate family-wise error rate Information systems → Data mining Computing methodologies → Spatial and physical reasoning

Format(s):: Medium: X Size: 11 pages; 746229 bytes Other: application/pdf

Size(s):: 11 pages 746229 bytes

Right(s):: Creative Commons Attribution 4.0 International license; info:eu-repo/semantics/openAccess

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.4230/LIPIcs.COSIT.2024.25

More Like this