Search for: All records

Creators/Authors contains: "Azad, Ariful"

« Prev Next »

Total Resources

9

Resource Type
Conference Paper

3

Conference Proceeding

0

Dataset

0

Journal Article

6

Workshop Report

0

Availability
Full Text / Resource Available

8

Citation Only

1

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A Scalable Method for Readable Tree Layouts

https://doi.org/10.1109/TVCG.2023.3274572

Gray, Kathryn ; Li, Mingwei ; Ahmed, Reyan ; Rahman, Md. Khaledur ; Azad, Ariful ; Kobourov, Stephen ; Börner, Katy ( April 2023 , IEEE Transactions on Visualization and Computer Graphics)

Large tree structures are ubiquitous and real-world relational datasets often have information associated with nodes (e.g., labels or other attributes) and edges (e.g., weights or distances) that need to be communicated to the viewers. Yet, scalable, easy to read tree layouts are difficult to achieve. We consider tree layouts to be readable if they meet some basic requirements: node labels should not overlap, edges should not cross, edge lengths should be preserved, and the output should be compact. There are many algorithms for drawing trees, although very few take node labels or edge lengths into account, and none optimizes all requirements above. With this in mind, we propose a new scalable method for readable tree layouts. The algorithm guarantees that the layout has no edge crossings and no label overlaps, and optimizing one of the remaining aspects: desired edge lengths and compactness. We evaluate the performance of the new algorithm by comparison with related earlier approaches using several real-world datasets, ranging from a few thousand nodes to hundreds of thousands of nodes. Tree layout algorithms can be used to visualize large general graphs, by extracting a hierarchy of progressively larger trees. We illustrate this functionality by presenting several map-like visualizations generated by the new tree layout algorithm.
more » « less
Full Text Available
JGCL: Joint Self-Supervised and Supervised Graph Contrastive Learning

https://doi.org/10.1145/3487553.3524722

Akkas, Selahattin ; Azad, Ariful ( April 2022 , WWW '22: Companion Proceedings of the Web Conference 2022)

Full Text Available
Supervised pretraining through contrastive categorical positive samplings to improve COVID-19 mortality prediction

https://doi.org/10.1145/3535508.3545541

Wanyan, Tingyi ; Lin, Mingquan ; Klang, Eyal ; Menon, Kartikeya M. ; Gulamali, Faris F. ; Azad, Ariful ; Zhang, Yiye ; Ding, Ying ; Wang, Zhangyang ; Wang, Fei ; et al ( August 2022 , BCB'22)

Full Text Available
Contrastive learning improves critical event prediction in COVID-19 patients

https://doi.org/10.1016/j.patter.2021.100389

Wanyan, Tingyi ; Honarvar, Hossein ; Jaladanki, Suraj K. ; Zang, Chengxi ; Naik, Nidhi ; Somani, Sulaiman ; De Freitas, Jessica K. ; Paranjpe, Ishan ; Vaid, Akhil ; Zhang, Jing ; et al ( December 2021 , Patterns)

Full Text Available
Unraveling the functional dark matter through global metagenomics

https://doi.org/10.1038/s41586-023-06583-7

Pavlopoulos, Georgios A. ; Baltoumas, Fotis A. ; Liu, Sirui ; Selvitopi, Oguz ; Camargo, Antonio Pedro ; Nayfach, Stephen ; Azad, Ariful ; Roux, Simon ; Call, Lee ; Ivanova, Natalia N. ; et al ( October 2023 , Nature)

Metagenomes encode an enormous diversity of proteins, reflecting a multiplicity of functions and activities. Exploration of this vast sequence space has been limited to a comparative analysis against reference microbial genomes and protein families derived from those genomes. Here, to examine the scale of yet untapped functional diversity beyond what is currently possible through the lens of reference genomes, we develop a computational approach to generate reference-free protein families from the sequence space in metagenomes. We analyze 26,931 metagenomes and identify 1.17 billion protein sequences longer than 35 amino acids with no similarity to any sequences from 102,491 reference genomes or the Pfam database. Using massively parallel graph-based clustering, we group these proteins into 106,198 novel sequence clusters with more than 100 members, doubling the number of protein families obtained from the reference genomes clustered using the same approach. We annotate these families on the basis of their taxonomic, habitat, geographical, and gene neighborhood distributions and, where sufficient sequence diversity is available, predict protein three-dimensional models, revealing novel structures. Overall, our results uncover an enormously diverse functional space, highlighting the importance of further exploring the microbial functional dark matter.
more » « less
Free, publicly-accessible full text available October 19, 2024
The parallelism motifs of genomic data analysis

https://doi.org/10.1098/rsta.2019.0394

Yelick, Katherine ; Buluç, Aydın ; Awan, Muaaz ; Azad, Ariful ; Brock, Benjamin ; Egan, Rob ; Ekanayake, Saliya ; Ellis, Marquita ; Georganas, Evangelos ; Guidi, Giulia ; et al ( March 2020 , Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences)

Genomic datasets are growing dramatically as the cost of sequencing continues to decline and small sequencing devices become available. Enormous community databases store and share these data with the research community, but some of these genomic data analysis problems require large-scale computational platforms to meet both the memory and computational requirements. These applications differ from scientific simulations that dominate the workload on high-end parallel systems today and place different requirements on programming support, software libraries and parallel architectural design. For example, they involve irregular communication patterns such as asynchronous updates to shared data structures. We consider several problems in high-performance genomics analysis, including alignment, profiling, clustering and assembly for both single genomes and metagenomes. We identify some of the common computational patterns or ‘motifs’ that help inform parallelization strategies and compare our motifs to some of the established lists, arguing that at least two key patterns, sorting and hashing, are missing. This article is part of a discussion meeting issue ‘Numerical algorithms for high-performance computational science’.
more » « less
Full Text Available
Evaluation of Graph Analytics Frameworks Using the GAP Benchmark Suite

https://doi.org/10.1109/IISWC50251.2020.00029

Azad, Ariful ; Aznaveh, Mohsen Mahmoudi ; Beamer, Scott ; Blanco, Mark ; Chen, Jinhao ; D'Alessandro, Luke ; Dathathri, Roshan ; Davis, Tim ; Deweese, Kevin ; Firoz, Jesun ; et al ( October 2020 , IEEE International Symposium on Workload Characterization (IISWC 2020),)
null (Ed.)
Full Text Available
Computing Maximum Cardinality Matchings in Parallel on Bipartite Graphs via Tree-Grafting

https://doi.org/10.1109/TPDS.2016.2546258

Azad, Ariful ; Buluc, Aydn ; Pothen, Alex ( January 2017 , IEEE Transactions on Parallel and Distributed Systems)

Full Text Available
flowVS: channel-specific variance stabilization in flow cytometry

https://doi.org/10.1186/s12859-016-1083-9

Azad, Ariful ; Rajwa, Bartek ; Pothen, Alex ( December 2016 , BMC Bioinformatics)

Full Text Available