NSF PAR Search | NSF Public Access Repository

Supporting Complex Query Time Enrichment For Analytics

Ghosh D, Gupta P (January 2023, EDBT)

Full Text Available
Sentaur: Sensor Observable Data Model for Smart Spaces

https://doi.org/10.1145/3511808.3557147

Gupta, Peeyush; Mehrotra, Sharad; Sharma, Shantanu; Yus, Roberto; Venkatasubramanian, Nalini (October 2022, 31st ACM Int’l Conference on Information and Knowledge Management)

Full Text Available
The SemIoTic Ecosystem: A Semantic Bridge between IoT Devices and Smart Spaces

https://doi.org/10.1145/3527241

Yus, Roberto; Bouloukakis, Georgios; Mehrotra, Sharad; Venkatasubramanian, Nalini (August 2022, ACM Transactions on Internet Technology)

Smart space administration and application development are challenging in part due to the semantic gap that exists between the high-level requirements of users and the low-level capabilities of IoT devices. The stakeholders in a smart space are required to deal with communicating with specific IoT devices, capturing data, processing it, and abstracting it to generate useful inferences. Additionally, this makes smart space applications difficult to reuse, since they are developed for specific sensor deployments. In this article, we present a holistic approach to IoT smart spaces, the SemIoTic ecosystem, to facilitate application development, space management, and service provision to its inhabitants. The ecosystem is based on a centralized repository, where developers can advertise their space-agnostic applications, and a SemIoTic system deployed in each smart space that interacts with those applications to provide them with the required information. SemIoTic applications are developed using a metamodel that defines high-level concepts abstracted from the smart space about the space itself and the people within it. Application requirements can then be expressed in terms of user-friendly high-level concepts, which are automatically translated by SemIoTic into sensor/actuator commands adapted to the underlying device deployment in each space. We present a reference implementation of the ecosystem that has been deployed at the University of California, Irvine and is abstracting data from hundreds of sensors in the space and providing applications to campus members.
Full Text Available
A Case for Enrichment in Data Management Systems

https://doi.org/10.1145/3552490.3552497

Ghosh, Dhrubajyoti; Gupta, Peeyush; Mehrotra, Sharad; Sharma, Shantanu (July 2022, ACM SIGMOD Record)

We describe ENRICHDB, a new DBMS technology designed for emerging domains (e.g., sensor-driven smart spaces and social media analytics) that require incoming data to be enriched using expensive functions prior to its usage. To support online processing, today, such enrichment is performed outside of DBMSs, as a static data processing workflow prior to its ingestion into a DBMS. Such a strategy could result in a significant delay between the time data arrives and the time it is enriched and ingested into the DBMS, especially when the enrichment complexity is high. Also, enriching at ingestion could waste resources if applications do not use or require all data to be enriched. ENRICHDB's design represents a significant departure from the above, where we explore seamless integration of data enrichment throughout the data processing pipeline - at ingestion, triggered based on events in the background, and progressively during query processing. The cornerstone of ENRICHDB is a powerful enrichment data and query model that encapsulates enrichment as an operator inside a DBMS, enabling it to co-optimize enrichment with query processing. This paper describes this data model and provides a summary of the system implementation.
Full Text Available
Don't be a tattle-tale: preventing leakages through data dependencies on access control protected data

https://doi.org/10.14778/3551793.3551805

Pappachan, Primal; Zhang, Shufan; He, Xi; Mehrotra, Sharad (July 2022, Proceedings of the VLDB Endowment)

We study the problem of answering queries when (part of) the data may be sensitive and should not be leaked to the querier. Simply restricting the computation to the non-sensitive part of the data may leak sensitive data through inference based on data dependencies. While inference control from data dependencies during query processing has been studied in the literature, existing solutions either detect and deny queries causing leakage, or use a weak security model that only protects against exact reconstruction of the sensitive data. In this paper, we adopt a stronger security model based on full deniability that prevents any information about sensitive data from being inferred from query answers. We identify conditions under which full deniability can be achieved and develop an efficient algorithm that minimally hides non-sensitive cells during query processing to achieve full deniability. We experimentally show that our approach is practical and scales with an increasing proportion of sensitive data as well as with increasing database size.
Full Text Available
MIDE: accuracy aware minimally invasive data exploration for decision support

https://doi.org/10.14778/3551793.3551821

Ghayyur, Sameera; Ghosh, Dhrubajyoti; He, Xi; Mehrotra, Sharad (July 2022, Proceedings of the VLDB Endowment)

This paper studies privacy in the context of decision-support queries that classify objects as either true or false based on whether they satisfy the query. Mechanisms to ensure privacy may result in false positives and false negatives. In decision-support applications, false negatives often have to remain bounded. Existing accuracy-aware privacy-preserving techniques cannot directly be used to support such an accuracy requirement, and their naive adaptations to support bounded accuracy of false negatives result in significant privacy loss, depending on the distribution of the data. This paper explores the concept of minimally-invasive data exploration for decision support, which attempts to minimize privacy loss while supporting a bounded guarantee on false negatives by adaptively adjusting privacy based on the data distribution. Our experimental results show that the MIDE algorithms perform well and are robust over variations in data distributions.
Full Text Available
JENNER: just-in-time enrichment in query processing

https://doi.org/10.14778/3551793.3551822

Ghosh, Dhrubajyoti; Gupta, Peeyush; Mehrotra, Sharad; Yus, Roberto; Altowim, Yasser (July 2022, Proceedings of the VLDB Endowment)

Emerging domains, such as sensor-driven smart spaces and social media analytics, require incoming data to be enriched prior to its use. Enrichment often consists of machine learning (ML) functions that are too expensive or infeasible to execute at ingestion. We develop a strategy entitled Just-in-time ENrichmeNt in quERy Processing (JENNER) to support interactive analytics over data as soon as it arrives in such application contexts. JENNER exploits the inherent tradeoffs between cost and quality often displayed by the ML functions to progressively improve query answers during query execution. We describe how JENNER works for a large class of SPJ and aggregation queries that form the bulk of data analytics workloads. Our experimental results on real datasets (IoT and Tweet) show that JENNER produces progressive answers and performs significantly better than naive strategies for progressive computation.
Full Text Available
Quest: Privacy-Preserving Monitoring of Network Data: A System for Organizational Response to Pandemics

https://doi.org/10.1109/TSC.2022.3166802

Sharma, Shantanu; Mehrotra, Sharad; Panwar, Nisha; Venkatasubramanian, Nalini; Gupta, Peeyush; Han, Shanshan; Wang, Guoxi (May 2022, IEEE Transactions on Services Computing)

Full Text Available
SmartSPEC: Customizable Smart Space Datasets via Event-driven Simulations

https://doi.org/10.1109/PerCom53586.2022.9762405

Chio, Andrew; Jiang, Daokun; Gupta, Peeyush; Bouloukakis, Georgios; Yus, Roberto; Mehrotra, Sharad; Venkatasubramanian, Nalini (January 2022, 2022 IEEE International Conference on Pervasive Computing and Communications (PerCom))

Full Text Available
T-cove: an exposure tracing system based on cleaning wi-fi events on organizational premises

https://doi.org/10.14778/3476311.3476344

Lin, Yiming; Khargonekar, Pramod; Mehrotra, Sharad; Venkatasubramanian, Nalini (July 2021, Proceedings of the VLDB Endowment)

Full Text Available
