NSF PAR Search | NSF Public Access Repository

Learning-Augmented Frequency Estimation in Sliding Windows

https://doi.org/10.1109/ICNP61940.2024.10858536

Shahout, Rana; Sabek, Ibrahim; Mitzenmacher, Michael (October 2024, IEEE)

Full Text Available
Pyneapple-L: Scalable Expressive Learning-based Spatial Analysis

https://doi.org/10.1145/3678717.3691228

Liu, Yongyi; Lee, Nicolas; Kang, Yunfan; Shahneh, Mohammad Reza; Mahmood, Ahmed; Chinnam, Vishal Rohith; Sarawadekar, Aparna Vivek; Oymak, Samet; Sabek, Ibrahim; Magdy, Amr (October 2024, ACM)

Full Text Available
Can Learned Models Replace Hash Functions?

https://doi.org/10.14778/3570690.3570702

Sabek, Ibrahim; Vaidya, Kapil; Horn, Dominik; Kipf, Andreas; Mitzenmacher, Michael; Kraska, Tim (November 2022, Proceedings of the VLDB Endowment)

Hashing is a fundamental operation in database management, playing a key role in the implementation of numerous core database data structures and algorithms. Traditional hash functions aim to mimic a function that maps a key to a random value, which can result in collisions, where multiple keys are mapped to the same value. There are many well-known schemes, such as chaining, probing, and cuckoo hashing, to handle collisions. In this work, we study whether using learned models instead of traditional hash functions can reduce collisions, and whether such a reduction translates to improved performance, particularly for indexing and joins. We show that learned models reduce collisions in some cases, depending on how the data is distributed. To evaluate the effectiveness of learned models as hash functions, we test them with bucket chaining, linear probing, and cuckoo hash tables. We find that learned models can (1) yield a 1.4x lower probe latency, and (2) reduce the non-partitioned hash join runtime by 28% over the next best baseline for certain datasets. On the other hand, if the data distribution is not suitable, we either do not see gains or see worse performance. In summary, we find that learned models can indeed outperform hash functions, but only for certain data distributions.
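The dependence on data distribution described in the abstract can be illustrated with a minimal sketch. This is not the paper's method or code: the "learned model" here is assumed to be a simple two-point linear approximation of the key distribution (fitted from the minimum and maximum key), the traditional hash is stood in for by BLAKE2b from Python's standard library, and all function names and both key sets are hypothetical choices made for illustration.

```python
import hashlib

def fit_linear_model(keys):
    # Hypothetical "learned model": remember only the smallest and largest key.
    return min(keys), max(keys)

def learned_bucket(key, model, num_buckets):
    # Map a key to a bucket by its relative position between min and max.
    # Integer arithmetic avoids floating-point rounding surprises.
    lo, hi = model
    return (key - lo) * num_buckets // (hi - lo + 1)

def hashed_bucket(key, num_buckets):
    # Stand-in for a traditional hash function (assumption: BLAKE2b, 8-byte digest).
    digest = hashlib.blake2b(key.to_bytes(8, "little"), digest_size=8).digest()
    return int.from_bytes(digest, "little") % num_buckets

def count_collisions(buckets):
    # A key collides if its bucket is already taken by an earlier key.
    return len(buckets) - len(set(buckets))

def compare(keys):
    # Return (collisions with the linear model, collisions with the hash).
    n = len(keys)
    model = fit_linear_model(keys)
    learned = count_collisions([learned_bucket(k, model, n) for k in keys])
    hashed = count_collisions([hashed_bucket(k, n) for k in keys])
    return learned, hashed

# Evenly spaced keys: a distribution that a linear model captures well.
uniform_keys = [1000 + 10 * i for i in range(1000)]
# Tightly clustered keys plus one far outlier: a poor fit for a linear model.
skewed_keys = list(range(999)) + [10**9]

uniform_learned, uniform_hashed = compare(uniform_keys)
skewed_learned, skewed_hashed = compare(skewed_keys)

print("evenly spaced keys:", uniform_learned, "vs", uniform_hashed)
print("skewed keys:       ", skewed_learned, "vs", skewed_hashed)
```

In this constructed setup the linear model places the evenly spaced keys in distinct buckets, while on the skewed key set it sends all but one key to the same bucket. That mirrors the abstract's qualitative point only; it says nothing about the latency or join-runtime figures reported there.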
Full Text Available
Machine Learning Meets Big Spatial Data (Revised)

https://doi.org/10.1109/MDM52706.2021.00014

Sabek, Ibrahim; Mokbel, Mohamed F. (June 2021, IEEE International Conference on Mobile Data Management (MDM))
Full Text Available
Sya: Enabling Spatial Awareness inside Probabilistic Knowledge Base Construction

https://doi.org/10.1109/ICDE48307.2020.00106

Sabek, Ibrahim; Mokbel, Mohamed F. (April 2020, IEEE International Conference on Data Engineering, ICDE)

Full Text Available
Machine Learning Meets Big Spatial Data

https://doi.org/10.1109/ICDE48307.2020.00169

Sabek, Ibrahim; Mokbel, Mohamed F. (April 2020, IEEE International Conference on Data Engineering, ICDE)
Full Text Available
RegRocket: Scalable Multinomial Autologistic Regression with Unordered Categorical Variables Using Markov Logic Networks

https://doi.org/10.1145/3366459

Sabek, Ibrahim; Musleh, Mashaal; Mokbel, Mohamed F. (December 2019, ACM Transactions on Spatial Algorithms and Systems)

Full Text Available
Flash in action: scalable spatial data analysis using Markov logic networks

https://doi.org/10.14778/3352063.3352078

Sabek, Ibrahim; Musleh, Mashaal; Mokbel, Mohamed F. (August 2019, Proceedings of the VLDB Endowment)

Full Text Available