Search for: All records

Creators/Authors contains: "Shasha, Dennis"


Forgetful Forests: Data Structures for Machine Learning on Streaming Data under Concept Drift

https://doi.org/10.3390/a16060278

Yuan, Zhehu ; Sun, Yinqi ; Shasha, Dennis ( June 2023 , Algorithms)

Database and data structure research can improve machine learning performance in many ways. One way is to design better algorithms on data structures. This paper combines incremental computation with sequential and probabilistic filtering to enable “forgetful” tree-based learning algorithms to cope with streaming data that suffers from concept drift. (Concept drift occurs when the functional mapping from input to classification changes over time.) The forgetful algorithms described in this paper achieve high performance while maintaining high-quality predictions on streaming data. Specifically, the algorithms are up to 24 times faster than state-of-the-art incremental algorithms with, at most, a 2% loss of accuracy, or are at least twice as fast without any loss of accuracy. This makes such structures suitable for high-volume streaming applications.
Free, publicly-accessible full text available June 1, 2024
AutoTag: automated metadata tagging for film post-production

https://doi.org/10.1007/s11042-023-15565-w

Sandoval-Castañeda, Marcelo ; Copti, Scandar ; Shasha, Dennis ( June 2023 , Multimedia Tools and Applications)

Abstract
Film post-production can be time- and money-inefficient. The reason is that a lot of the work involves a person or group of people, called metadata taggers, going through each individual piece of media and marking it up with relevant tags, such as the scene number, transcripts, and the type of shot for video footage. Such a task is particularly time-consuming for films with high shooting ratios (i.e., footage shot/footage shown). AutoTag automates much of the tagging process across 16 languages, saving both time and money. We describe the algorithms and implementation of AutoTag and report on some case studies.

EnsInfer: a simple ensemble approach to network inference outperforms any single method

https://doi.org/10.1186/s12859-023-05231-1

Shen, Bingran ; Coruzzi, Gloria ; Shasha, Dennis ( March 2023 , BMC Bioinformatics)

Abstract
This study evaluates both a variety of existing base causal inference methods and a variety of ensemble methods. We show that: (i) base network inference methods vary in their performance across different datasets, so a method that works poorly on one dataset may work well on another; (ii) a non-homogeneous ensemble method in the form of a Naive Bayes classifier leads overall to as good or better results than using the best single base method or any other ensemble method; (iii) for the best results, the ensemble method should integrate all methods that satisfy a statistical test of normality on training data. The resulting ensemble model EnsInfer easily integrates all kinds of RNA-seq data as well as new and existing inference methods. The paper categorizes and reviews state-of-the-art underlying methods, describes the EnsInfer ensemble approach in detail, and presents experimental results. The source code and data used will be made available to the community upon publication.

Verifying concurrent multicopy search structures

https://doi.org/10.1145/3485490

Patel, Nisarg ; Krishna, Siddharth ; Shasha, Dennis ; Wies, Thomas ( October 2021 , Proceedings of the ACM on Programming Languages)

Multicopy search structures such as log-structured merge (LSM) trees are optimized for high insert/update/delete (collectively known as upsert) performance. In such data structures, an upsert on key k, which adds (k, v) where v can be a value or a tombstone, is added to the root node even if k is already present in other nodes. Thus there may be multiple copies of k in the search structure. A search on k aims to return the value associated with the most recent upsert. We present a general framework for verifying linearizability of concurrent multicopy search structures that abstracts from the underlying representation of the data structure in memory, enabling proof-reuse across diverse implementations. Based on our framework, we propose template algorithms for (a) LSM structures forming arbitrary directed acyclic graphs and (b) differential file structures, and formally verify these templates in the concurrent separation logic Iris. We also instantiate the LSM template to obtain the first verified concurrent in-memory LSM tree implementation.
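
The upsert and search behavior described in this abstract can be illustrated with a minimal, single-threaded Python sketch. It is not the paper's verified concurrent template: the class name MulticopyStore, the TOMBSTONE marker, the root_capacity parameter, and the flush rule are all hypothetical choices made for illustration. The sketch only shows that upserts always go to the root, that older copies of a key may remain in other nodes, and that a search returns the most recent copy.

```python
from typing import Dict, List, Optional

# Hypothetical marker standing in for a tombstone (a deleted key).
TOMBSTONE = object()


class MulticopyStore:
    """Toy multicopy structure: one root node plus older nodes, newest first."""

    def __init__(self, root_capacity: int = 2) -> None:
        self.root_capacity = root_capacity          # assumed flush threshold
        self.root: Dict[str, object] = {}
        self.older: List[Dict[str, object]] = []    # index 0 is the newest

    def upsert(self, key: str, value: object) -> None:
        # Always write to the root, even if the key exists in older nodes.
        self.root[key] = value
        if len(self.root) >= self.root_capacity:
            # Flush: the full root becomes the newest older node.
            self.older.insert(0, self.root)
            self.root = {}

    def delete(self, key: str) -> None:
        # A delete is an upsert of a tombstone.
        self.upsert(key, TOMBSTONE)

    def search(self, key: str) -> Optional[object]:
        # Scan from newest to oldest; the first copy found is the most recent.
        for node in [self.root, *self.older]:
            if key in node:
                value = node[key]
                return None if value is TOMBSTONE else value
        return None

    def copies(self, key: str) -> int:
        # Number of nodes that currently hold a copy of the key.
        return sum(key in node for node in [self.root, *self.older])


store = MulticopyStore(root_capacity=2)
store.upsert("a", 1)
store.upsert("b", 2)    # root is full and gets flushed
store.upsert("a", 3)    # second copy of "a", now in the root
```

After these calls, store.search("a") returns the value from the latest upsert while store.copies("a") reports two copies. A real LSM tree would also compact older nodes and, in the setting of the paper, handle concurrent operations, both of which this sketch omits.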
Full Text Available
Pi-Radio v1: Calibration techniques to enable fully-digital beamforming at 60 GHz

https://doi.org/10.1016/j.comnet.2021.108220

Dhananjay, Aditya ; Zheng, Kai ; Mezzavilla, Marco ; Iotti, Lorenzo ; Shasha, Dennis ; Rangan, Sundeep ( September 2021 , Computer Networks)
Full Text Available
Verifying concurrent search structure templates

https://doi.org/10.1145/3385412.3386029

Krishna, Siddharth ; Patel, Nisarg ; Shasha, Dennis ; Wies, Thomas ( June 2020 , Proceedings of the 41st ACM SIGPLAN Conference on Programming Language Design and Implementation)

Full Text Available
Temporal transcriptional logic of dynamic regulatory networks underlying nitrogen signaling and use in plants

https://doi.org/10.1073/pnas.1721487115

Varala, Kranthi ; Marshall-Colón, Amy ; Cirrone, Jacopo ; Brooks, Matthew D. ; Pasquino, Angelo V. ; Léran, Sophie ; Mittal, Shipra ; Rock, Tara M. ; Edwards, Molly B. ; Kim, Grace J. ; et al ( May 2018 , Proceedings of the National Academy of Sciences)