NSF PAR Search | NSF Public Access Repository


Promoting Fairness and Priority in Selecting k-Winners Using IRV

https://doi.org/10.1145/3637528.3671735

Islam, Md Mouinul; Vahidi, Soroush; Schieber, Baruch; Basu Roy, Senjuti (August 2024, ACM)

Full Text Available
Equitable Top-k Results for Long Tail Data

https://doi.org/10.1145/3626727

Islam, Md Mouinul; Asadi, Mahsa; Basu Roy, Senjuti (December 2023, Proceedings of the ACM on Management of Data)

For datasets exhibiting the long tail phenomenon, we identify a fairness concern in existing top-k algorithms, which return a fixed set of k results for a given query. This causes a handful of popular records (products, items, etc.) to be overexposed and always returned for the user query, even though there exists a long tail of niche records that may be equally desirable (have similar utility). To alleviate this, we propose θ-Equiv-top-k-MMSP inside existing top-k algorithms: instead of returning a fixed top-k set, it generates all (or many) top-k sets that are equivalent in utility and creates a probability distribution over those sets. At query time, the end user is returned one of these sets with its associated probability, such that, after many draws from many end users, each record has as equal exposure as possible (governed by uniform selection probability). θ-Equiv-top-k-MMSP is formalized with two sub-problems: (a) θ-Equiv-top-k-Sets, to produce a set S of sets, each with k records, where the sets are equivalent in utility to the top-k set; (b) MaxMinFair, to produce a probability distribution over S, that is, PDF(S), such that the records in S have uniform selection probability. We formally study the hardness of θ-Equiv-top-k-MMSP. We present multiple algorithmic results: (a) an exact solution for θ-Equiv-top-k-Sets and MaxMinFair; (b) highly scalable algorithms that solve θ-Equiv-top-k-Sets through a random walk and are backed by probability theory, as well as a greedy solution designed for MaxMinFair; (c) an adaptive random walk based algorithm that solves θ-Equiv-top-k-Sets and MaxMinFair at the same time. We empirically study how θ-Equiv-top-k-MMSP can alleviate the equitable exposure concerns that group fairness suffers from. We run extensive experiments using 6 datasets and design intuitive baseline algorithms that corroborate our theoretical analysis.
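The idea of returning one of several equivalent top-k sets according to a probability distribution can be illustrated with a small sketch. This is not the paper's algorithm: the function names `record_exposure` and `draw_set`, the toy records, and the uniform distribution are all assumptions made for illustration. The sketch only shows how a distribution over sets induces a per-record selection probability, which is the quantity the abstract says should be as uniform as possible.

```python
import random
from collections import defaultdict


def record_exposure(sets, probs):
    """Selection probability of each record under a distribution over sets.

    A record's exposure is the total probability of the sets containing it.
    (Hypothetical helper, not taken from the paper.)
    """
    exposure = defaultdict(float)
    for candidate, p in zip(sets, probs):
        for record in candidate:
            exposure[record] += p
    return dict(exposure)


def draw_set(sets, probs, rng=random):
    """Draw one set at query time, with probability proportional to probs."""
    return rng.choices(sets, weights=probs, k=1)[0]


# Toy input (assumed): three size-2 sets treated as equivalent in utility.
candidate_sets = [frozenset({"a", "b"}), frozenset({"a", "c"}), frozenset({"b", "c"})]
uniform = [1 / 3, 1 / 3, 1 / 3]

exposure = record_exposure(candidate_sets, uniform)
min_exposure = min(exposure.values())  # the value a max-min objective would raise

# Always returning the first set leaves record "c" with no exposure at all.
fixed = record_exposure(candidate_sets, [1.0, 0.0, 0.0])
returned = draw_set(candidate_sets, uniform)
```

In this toy case every record appears in two of the three sets, so the uniform distribution gives each record the same exposure, whereas the fixed top-k answer never shows record "c".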
Full Text Available
A generic framework for efficient computation of top-k diverse results

https://doi.org/10.1007/s00778-022-00770-0

Islam, Md Mouinul; Asadi, Mahsa; Amer-Yahia, Sihem; Roy, Senjuti Basu (November 2022, The VLDB Journal)

Result diversification is extensively studied in the context of search, recommendation, and data exploration. There are numerous algorithms that return top-k results that are both diverse and relevant. These algorithms typically have computational loops that compare the pairwise diversity of records to decide which ones to retain. We propose an access primitive DivGetBatch() that replaces repeated pairwise comparisons of diversity scores of records by pairwise comparisons of “aggregate” diversity scores of a group of records, thereby improving the running time of these algorithms while preserving the same results. We integrate the access primitive inside three representative diversity algorithms and prove that the augmented algorithms leveraging the access primitive preserve original results. We analyze the worst and expected case running times of these algorithms. We propose a computational framework to design this access primitive that has a pre-computed index structure I-tree that is agnostic to the specific details of diversity algorithms. We develop principled solutions to construct and maintain I-tree. Our experiments on multiple large real-world datasets corroborate our theoretical findings, while ensuring up to a 24× speedup.
Full Text Available
Satisfying complex top-k fairness constraints by preference substitutions

https://doi.org/10.14778/3565816.3565832

Islam, Md. Mouinul; Wei, Dong; Schieber, Baruch; Roy, Senjuti Basu (October 2022, Proceedings of the VLDB Endowment)

Given m users (voters), where each user casts her preference for a single item (candidate) over n items (candidates) as a ballot, the preference aggregation problem returns k items (candidates) that have the k highest number of preferences (votes). Our work studies this problem considering complex fairness constraints that have to be satisfied via proportionate representations of different values of the group protected attribute(s) in the top-k results. Precisely, we study the margin finding problem under single ballot substitutions, where a single substitution amounts to removing a vote from candidate i and assigning it to candidate j and the goal is to minimize the number of single ballot substitutions needed to guarantee that the top-k results satisfy the fairness constraints. We study several variants of this problem considering how top-k fairness constraints are defined, (i) MFBinaryS and MFMultiS are defined when the fairness (proportionate representation) is defined over a single, binary or multivalued, protected attribute, respectively; (ii) MF-Multi2 is studied when top-k fairness is defined over two different protected attributes; (iii) MFMulti3+ investigates the margin finding problem, considering 3 or more protected attributes. We study these problems theoretically, and present a suite of algorithms with provable guarantees. We conduct rigorous large scale experiments involving multiple real world datasets by appropriately adapting multiple state-of-the-art solutions to demonstrate the effectiveness and scalability of our proposed methods.
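The definitions in this abstract (top-k by vote count, and a single ballot substitution that moves one vote from candidate i to candidate j) can be made concrete with a short sketch. It does not implement any of the paper's margin finding algorithms. The helper names, the toy vote counts, the group labels, and the tie-breaking rule (ties broken by candidate name) are all assumptions made for illustration.

```python
def top_k(votes, k):
    """Return the k candidates with the most votes.

    Ties are broken by candidate name; this rule is an assumption.
    """
    return sorted(votes, key=lambda c: (-votes[c], c))[:k]


def substitute(votes, src, dst):
    """Apply one single ballot substitution: move one vote from src to dst."""
    if votes[src] <= 0:
        raise ValueError(f"candidate {src!r} has no vote to give up")
    updated = dict(votes)
    updated[src] -= 1
    updated[dst] += 1
    return updated


def group_counts(winners, group_of):
    """Count how many winners fall in each protected-attribute value."""
    counts = {}
    for candidate in winners:
        counts[group_of[candidate]] = counts.get(group_of[candidate], 0) + 1
    return counts


# Toy input (assumed): four candidates, one binary protected attribute.
votes = {"A": 5, "B": 4, "C": 3, "D": 1}
group_of = {"A": "x", "B": "x", "C": "y", "D": "y"}

before = top_k(votes, 2)               # both winners come from group "x"
after_votes = substitute(votes, "B", "C")
after = top_k(after_votes, 2)          # one substitution changes the top-2
after_groups = group_counts(after, group_of)
```

In this toy instance, a constraint requiring at least one winner from group "y" in the top-2 is violated by the original ballots and satisfied after one substitution from B to C, so the margin here is 1. Finding that minimum in general is the problem the abstract studies.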
Full Text Available
Efficient approximate top-k mutual information based feature selection

https://doi.org/10.1007/s10844-022-00750-4

Salam, Md Abdus; Basu Roy, Senjuti; Das, Gautam (September 2022, Journal of Intelligent Information Systems)

Full Text Available
Cooperative Route Planning Framework for Multiple Distributed Assets in Maritime Applications

https://doi.org/10.1145/3514221.3526131

Nikookar, Sepideh; Sakharkar, Paras; Somasunder, Sathyanarayanan; Basu Roy, Senjuti; Bienkowski, Adam; Macesker, Matthew; Pattipati, Krishna R.; Sidoti, David (June 2022, SIGMOD 2022)

Full Text Available
Guided Task Planning Under Complex Constraints

https://doi.org/10.1109/ICDE53745.2022.00067

Nikookar, Sepideh; Sakharkar, Paras; Smagh, Baljinder; Amer-Yahia, Sihem; Roy, Senjuti Basu (May 2022, ICDE 2022)

Full Text Available
Accepted Tutorials at The Web Conference 2022

https://doi.org/10.1145/3487553.3547182

Tommasini, Riccardo; Basu Roy, Senjuti; Wang, Xuan; Wang, Hongwei; Ji, Heng; Han, Jiawei; Nakov, Preslav; Da San Martino, Giovanni; Alam, Firoj; Schedl, Markus; et al (April 2022, TWC 2022)

Full Text Available
Rank Aggregation with Proportionate Fairness

https://doi.org/10.1145/3514221.3517865

Wei, Dong; Islam, Md Mouinul; Schieber, Baruch; Basu Roy, Senjuti (January 2022, SIGMOD 2022)

Full Text Available
Diversifying recommendations on sequences of sets

https://doi.org/10.1007/s00778-022-00740-6

Nikookar, Sepideh; Esfandiari, Mohammadreza; Borromeo, Ria Mae; Sakharkar, Paras; Amer-Yahia, Sihem; Basu Roy, Senjuti (January 2022, The VLDB Journal)

Full Text Available
