Parallel Mining of Frequent Subtree Patterns

Qu, Wenwen; Yan, Da; Guo, Guimu; Wang, Xiaoling; Zou, Lei; Zhou, Yang

doi:10.1007/978-3-030-61133-0_2

Citation Details

Parallel Mining of Frequent Subtree Patterns

Mining frequent subtree patterns in a tree database (or, forest) is useful in domains such as bioinformatics and mining semi-structured data. We consider the problem of mining embedded subtrees in a database of rooted, labeled, and ordered trees. We compare two existing serial mining algorithms, PrefixTreeSpan and TreeMiner, and adapt them for parallel execution using PrefixFPM, our general-purpose framework for frequent pattern mining that is designed to effectively utilize the CPU cores in a multicore machine. Our experiments show that TreeMiner is faster than its successor PrefixTreeSpan when a limited number of CPU cores are used, as the total mining workloads is smaller; however, PrefixTreeSpan has a much higher speedup ratio and can beat TreeMiner when given enough CPU cores. more »

Award ID(s):: 1755464

PAR ID:: 10221363

Author(s) / Creator(s):: Qu, Wenwen; Yan, Da; Guo, Guimu; Wang, Xiaoling; Zou, Lei; Zhou, Yang

Date Published:: 2020-11-06

Journal Name:: Communications in computer and information science

Volume:: 1281

ISSN:: 1865-0929

Page Range / eLocation ID:: 18 - 32

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1007/978-3-030-61133-0_2

More Like this