Towards Distribution-aware Query Answering in Data Markets

Asudeh, Abolfazl; Nargesian, Fatemeh

doi:10.14778/3551793.3551858

The increasing demand for data exchange has led to the development of data markets that facilitate transactional interactions between data buyers and data sellers. Still, cost-effective and distribution-aware query answering remains a substantial challenge in these environments. In this paper, while distinguishing between different types of data markets, we take initial steps towards addressing this challenge. In particular, we envision a unified query answering framework and discuss its functionalities. Our framework cost-effectively integrates data from different sources in a data market into a dataset that meets user-provided schema and distribution requirements. To facilitate consumers' query answering, our system discovers data views in the form of join-paths on relevant data sources, defines a get-next operation to query views, and estimates the cost of get-next on each view. The query answering engine then sequentially selects the next view to sample in order to collect the output data. Depending on how much the system knows about the underlying data sources, the view selection problem can be modeled as an instance of the multi-armed bandit problem or the coupon collector's problem.
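To make the bandit view of the selection problem concrete, the following is a minimal sketch and not the authors' algorithm. It assumes each view exposes a get-next call with a fixed, known cost, that each returned tuple belongs to exactly one group, and that the user's distribution requirement is a target count per group. The names `View`, `collect`, `group_probs`, and `required` are hypothetical, and the UCB1-style rule is a swapped-in textbook technique chosen only for illustration.

```python
import math
import random
from dataclasses import dataclass, field


@dataclass
class View:
    """Hypothetical stand-in for a join-path view in a data market."""
    name: str
    cost: float        # assumed fixed price of one get-next call
    group_probs: dict  # group -> probability; unknown to the selector
    rng: random.Random = field(default_factory=lambda: random.Random(0))

    def get_next(self):
        """Return the group label of one sampled tuple."""
        groups = list(self.group_probs)
        weights = list(self.group_probs.values())
        return self.rng.choices(groups, weights=weights, k=1)[0]


def collect(views, required, max_pulls=10_000):
    """Sample one tuple at a time until every count in `required` is met
    (or `max_pulls` is reached), choosing views with a UCB1-style rule."""
    remaining = dict(required)
    collected = {g: 0 for g in required}
    pulls = [0] * len(views)   # get-next calls issued per view
    useful = [0] * len(views)  # calls that returned a still-needed tuple
    total_cost = 0.0

    def score(j, t):
        # Optimistic estimate of useful tuples per unit cost for view j.
        mean = useful[j] / pulls[j]
        bonus = math.sqrt(2 * math.log(t) / pulls[j])
        return (mean + bonus) / views[j].cost

    for t in range(1, max_pulls + 1):
        if all(n <= 0 for n in remaining.values()):
            break
        if t <= len(views):
            i = t - 1  # try every view once before comparing scores
        else:
            i = max(range(len(views)), key=lambda j: score(j, t))
        group = views[i].get_next()
        pulls[i] += 1
        total_cost += views[i].cost
        if remaining.get(group, 0) > 0:
            remaining[group] -= 1
            collected[group] += 1
            useful[i] += 1
    return collected, total_cost, pulls
```

Calling `collect([v1, v2], {"a": 20, "b": 20})` returns the per-group counts gathered, the total cost paid, and the number of get-next calls issued to each view. Note that the usefulness of a view changes as groups fill up, so the rewards are not stationary as plain UCB1 assumes; the sketch ignores this. When the group distribution of every view is known in advance, no exploration is needed and the setting is closer to the coupon collector's formulation mentioned in the abstract.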

Award ID(s): 2107290; 2107050

PAR ID: 10355230

Author(s) / Creator(s): Asudeh, Abolfazl; Nargesian, Fatemeh

Date Published: 2022-08-21

Journal Name: Proceedings of the VLDB Endowment

Volume: 15

Issue: 11

ISSN: 2150-8097

Page Range / eLocation ID: 3137-3144

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article: https://doi.org/10.14778/3551793.3551858
