NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A generic framework for efficient computation of top-k diverse results

https://doi.org/10.1007/s00778-022-00770-0

Islam, Md Mouinul; Asadi, Mahsa; Amer-Yahia, Sihem; Roy, Senjuti Basu (November 2022, The VLDB Journal)

Result diversification is extensively studied in the context of search, recommendation, and data exploration. There are numerous algorithms that return top-k results that are both diverse and relevant. These algorithms typically have computational loops that compare the pairwise diversity of records to decide which ones to retain. We propose an access primitive DivGetBatch() that replaces repeated pairwise comparisons of diversity scores of records by pairwise comparisons of “aggregate” diversity scores of a group of records, thereby improving the running time of these algorithms while preserving the same results. We integrate the access primitive inside three representative diversity algorithms and prove that the augmented algorithms leveraging the access primitive preserve original results. We analyze the worst and expected case running times of these algorithms. We propose a computational framework to design this access primitive that has a pre-computed index structure I-tree that is agnostic to the specific details of diversity algorithms. We develop principled solutions to construct and maintain I-tree. Our experiments on multiple large real-world datasets corroborate our theoretical findings, while ensuring up to a 24× speedup.
more » « less
Full Text Available
Satisfying complex top- k fairness constraints by preference substitutions

https://doi.org/10.14778/3565816.3565832

Islam, Md. Mouinul; Wei, Dong; Schieber, Baruch; Roy, Senjuti Basu (October 2022, Proceedings of the VLDB Endowment)

Given m users (voters), where each user casts her preference for a single item (candidate) over n items (candidates) as a ballot, the preference aggregation problem returns k items (candidates) that have the k highest number of preferences (votes). Our work studies this problem considering complex fairness constraints that have to be satisfied via proportionate representations of different values of the group protected attribute(s) in the top- k results. Precisely, we study the margin finding problem under single ballot substitutions , where a single substitution amounts to removing a vote from candidate i and assigning it to candidate j and the goal is to minimize the number of single ballot substitutions needed to guarantee that the top-k results satisfy the fairness constraints. We study several variants of this problem considering how top- k fairness constraints are defined, (i) MFBinaryS and MFMultiS are defined when the fairness (proportionate representation) is defined over a single, binary or multivalued, protected attribute, respectively; (ii) MF-Multi2 is studied when top- k fairness is defined over two different protected attributes; (iii) MFMulti3+ investigates the margin finding problem, considering 3 or more protected attributes. We study these problems theoretically, and present a suite of algorithms with provable guarantees. We conduct rigorous large scale experiments involving multiple real world datasets by appropriately adapting multiple state-of-the-art solutions to demonstrate the effectiveness and scalability of our proposed methods.
more » « less
Full Text Available
Efficient approximate top-k mutual information based feature selection

https://doi.org/10.1007/s10844-022-00750-4

Salam, Md Abdus; Basu Roy, Senjuti; Das, Gautam (September 2022, Journal of Intelligent Information Systems)

Full Text Available
Cooperative Route Planning Framework for Multiple Distributed Assets in Maritime Applications

https://doi.org/10.1145/3514221.3526131

Nikookar, Sepideh; Sakharkar, Paras; Somasunder, Sathyanarayanan; Basu Roy, Senjuti; Bienkowski, Adam; Macesker, Matthew; Pattipati, Krishna R.; Sidoti, David (June 2022, SIGMOD 2022)

Full Text Available
Guided Task Planning Under Complex Constraints

https://doi.org/10.1109/ICDE53745.2022.00067

Nikookar, Sepideh; Sakharkar, Paras; Smagh, Baljinder; Amer-Yahia, Sihem; Roy, Senjuti Basu (May 2022, ICDE 2022)

Full Text Available
Accepted Tutorials at The Web Conference 2022

https://doi.org/10.1145/3487553.3547182

Tommasini, Riccardo; Basu Roy, Senjuti; Wang, Xuan; Wang, Hongwei; Ji, Heng; Han, Jiawei; Nakov, Preslav; Da San Martino, Giovanni; Alam, Firoj; Schedl, Markus; et al (April 2022, TWC 2022)

Full Text Available
Rank Aggregation with Proportionate Fairness

https://doi.org/10.1145/3514221.3517865

Wei, Dong; Islam, Md Mouinul; Schieber, Baruch; Basu Roy, Senjuti (January 2022, SIGMOD 2022)

Full Text Available
Diversifying recommendations on sequences of sets

https://doi.org/10.1007/s00778-022-00740-6

Nikookar, Sepideh; Esfandiari, Mohammadreza; Borromeo, Ria Mae; Sakharkar, Paras; Amer-Yahia, Sihem; Basu Roy, Senjuti (January 2022, The VLDB Journal)

Full Text Available
An active learning approach for clustering single-cell RNA-seq data

Lin Xiang; Liu Haoran; Wei Zhi; Basu Roy Senjuti; Gao Nan (January 2022, Laboratory investigation)

Full Text Available
Multi-Session Diversity to Improve User Satisfaction in Web Applications

https://doi.org/10.1145/3442381.3450046

Esfandiari, Mohammadreza; Borromeo, Ria Mae; Nikookar, Sepideh; Sakharkar, Paras; Amer-Yahia, Sihem; Basu Roy, Senjuti (April 2021, The Web Conference)

Full Text Available

« Prev Next »

Search for: All records