

Search for: All records

Award ID contains: 1741197


  1. Identifying novel drug-target interactions is a critical and rate-limiting step in drug discovery. While deep learning models have been proposed to accelerate the identification process, we show that state-of-the-art models fail to generalize to novel (i.e., never-before-seen) structures. We unveil the mechanisms responsible for this shortcoming, demonstrating how models rely on shortcuts that leverage the topology of the protein-ligand bipartite network rather than learning the node features (a toy illustration of this shortcut follows this entry). We introduce AI-Bind, a pipeline that combines network-based sampling strategies with unsupervised pre-training to improve binding predictions for novel proteins and ligands. We validate AI-Bind predictions via docking simulations and comparison with recent experimental evidence, and advance the interpretation of machine-learning predictions of protein-ligand binding by identifying potential active binding sites on the amino acid sequence. AI-Bind is a high-throughput approach to identifying drug-target combinations, with the potential to become a powerful tool in drug discovery.
    Free, publicly-accessible full text available December 1, 2024
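
The topological shortcut described in the entry above can be made concrete with a toy example. The sketch below is purely illustrative (the pairs, names, and degree_score heuristic are hypothetical, not AI-Bind's model): it scores a protein-ligand pair from annotation-network degrees alone, which looks predictive for well-annotated nodes but says nothing about a never-before-seen structure.

```python
# Illustrative only: a toy demonstration of the "topological shortcut",
# not the AI-Bind implementation. All names and pairs are hypothetical.
from collections import defaultdict

# Hypothetical protein-ligand annotations: (ligand, protein, binds?)
annotations = [
    ("aspirin", "COX1", 1), ("aspirin", "COX2", 1), ("aspirin", "TP53", 0),
    ("ibuprofen", "COX1", 1), ("heme", "HBB", 1), ("heme", "COX2", 0),
]

# A shortcut model can score pairs purely from node degrees in the
# bipartite annotation network, never looking at chemistry or sequence.
pos_degree = defaultdict(int)
for ligand, protein, binds in annotations:
    if binds:
        pos_degree[ligand] += 1
        pos_degree[protein] += 1

def degree_score(ligand, protein):
    # High score whenever either endpoint is a frequent binder in training.
    return pos_degree[ligand] + pos_degree[protein]

# Such a model looks accurate on seen nodes but is useless for a
# never-before-seen structure, whose degree is zero by definition:
print(degree_score("aspirin", "COX1"))        # inflated by annotation counts
print(degree_score("unseen_ligand", "COX1"))  # falls back to protein degree only
```

Because an unseen ligand has degree zero by construction, any model that implicitly learns this degree signal cannot generalize to novel structures, which is the failure mode the network-based sampling strategies above are meant to counter.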
  2. Free, publicly-accessible full text available August 1, 2024
  3. The mean squared error loss is widely used in many applications, including auto-encoders, multi-target regression, and matrix factorization, to name a few. Despite the computational advantages of its differentiability, it is not robust to outliers. In contrast, $\ell_p$ norms are known to be robust, but cannot be optimized via, e.g., stochastic gradient descent, as they are non-differentiable. We propose an algorithm inspired by so-called model-based optimization (MBO), which replaces a non-convex objective with a convex model function and alternates between optimizing the model function and updating the solution (see the sketch following this entry). We apply this to robust regression, proposing SADM, a stochastic variant of the Online Alternating Direction Method of Multipliers (OADM), to solve the inner optimization in MBO. We show that SADM converges at a rate of $O(\log T / T)$. Finally, we demonstrate experimentally (a) the robustness of $\ell_p$ norms to outliers and (b) the efficiency of our proposed model-based algorithms in comparison with gradient methods on autoencoders and multi-target regression.
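
The MBO loop in the entry above can be sketched on a simple non-convex robust-regression instance. This is a minimal sketch, not the paper's SADM/OADM solver: the tanh model, the choice p = 1.5, the proximal weight rho, and the generic BFGS call for the inner convex problem are all illustrative assumptions.

```python
# A minimal MBO-style sketch for l_p robust regression, written from the
# description above; NOT the paper's SADM/OADM algorithm.
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(0)
n, d, p, rho = 200, 5, 1.5, 1.0

X = rng.normal(size=(n, d))
w_true = rng.normal(size=d)
y = np.tanh(X @ w_true)
y[:10] += 20.0  # gross outliers: MSE would chase these, l_p resists them

def residual(w):
    return np.tanh(X @ w) - y

def mbo_step(w_t):
    # Convex model: linearize the (non-convex) residual around w_t,
    # keep the convex l_p norm, and add a proximal term.
    r_t = residual(w_t)
    J_t = (1.0 - np.tanh(X @ w_t) ** 2)[:, None] * X  # Jacobian of residual
    def model(w):
        lin = r_t + J_t @ (w - w_t)
        return np.linalg.norm(lin, ord=p) + 0.5 * rho * np.sum((w - w_t) ** 2)
    return minimize(model, w_t, method="BFGS").x      # inner convex problem

w = np.zeros(d)
for _ in range(30):  # alternate: fit the convex model, update the iterate
    w = mbo_step(w)

print("l_p loss:", np.linalg.norm(residual(w), ord=p))
```

The structure matches the description: the non-convex $\ell_p$ objective is replaced by a convex model (linearized residual inside the norm, plus a proximal term), and the algorithm alternates between solving that model and re-centering it at the new iterate.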
  4. Graph embeddings have been tremendously successful at producing node representations that are discriminative for downstream tasks. In this paper, we study the problem of graph transfer learning: given two graphs and labels on the nodes of the first, we wish to predict the labels on the nodes of the second. We propose a tractable, non-combinatorial method for solving this problem by combining classification and embedding losses with a continuous, convex penalty motivated by tractable graph distances (an illustrative composite objective appears after this entry). We demonstrate that our method predicts labels across graphs with almost perfect accuracy; in the same scenarios, training embeddings through standard methods leads to predictions that are no better than random.
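
As a rough reading of the entry above, the sketch below jointly optimizes a classification loss on the labeled graph, an embedding (Laplacian smoothness) loss on both graphs, and a convex coupling penalty. Every concrete choice here is an assumption: the smoothness term, the Frobenius-norm coupling (a stand-in for the paper's graph-distance-motivated penalty), and the equal-sized random graphs.

```python
# An illustrative composite objective in the spirit of the entry above;
# all modeling choices are assumptions, not the paper's formulation.
import torch

n, d, k = 50, 8, 3                      # nodes per graph, embed dim, classes
A1 = (torch.rand(n, n) < 0.1).float(); A1 = ((A1 + A1.T) > 0).float()
A2 = (torch.rand(n, n) < 0.1).float(); A2 = ((A2 + A2.T) > 0).float()
A1.fill_diagonal_(0.); A2.fill_diagonal_(0.)
L1 = torch.diag(A1.sum(1)) - A1         # combinatorial Laplacians
L2 = torch.diag(A2.sum(1)) - A2
labels1 = torch.randint(0, k, (n,))     # labels known only on graph 1

Z1 = torch.randn(n, d, requires_grad=True)   # per-graph node embeddings
Z2 = torch.randn(n, d, requires_grad=True)
W = torch.randn(d, k, requires_grad=True)    # shared linear classifier

opt = torch.optim.Adam([Z1, Z2, W], lr=0.05)
lam, mu = 1.0, 1.0
for _ in range(200):
    opt.zero_grad()
    cls = torch.nn.functional.cross_entropy(Z1 @ W, labels1)
    smooth = torch.trace(Z1.T @ L1 @ Z1) + torch.trace(Z2.T @ L2 @ Z2)
    couple = ((Z1 - Z2) ** 2).sum()     # convex penalty tying the embeddings
    (cls + lam * smooth / n + mu * couple / n).backward()
    opt.step()

pred2 = (Z2 @ W).argmax(1)              # transferred labels for graph 2
```

The design point is that every term is continuous and the coupling penalty is convex, so the whole objective can be minimized by standard gradient methods rather than a combinatorial node-matching search.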
  5. Graph embedding seeks to build a low-dimensional representation of a graph $G$. This low-dimensional representation is then used for various downstream tasks. One popular approach is Laplacian Eigenmaps (LE), which constructs a graph embedding based on the spectral properties of the Laplacian matrix of $G$. The intuition behind it, and many other embedding techniques, is that the embedding of a graph must respect node similarity: similar nodes must have embeddings that are close to one another. Here, we dispose of this distance-minimization assumption. Instead, we use the Laplacian matrix to find an embedding with geometric properties rather than spectral ones, by leveraging the so-called simplex geometry of $G$. We introduce a new approach, Geometric Laplacian Eigenmap Embedding (GLEE), and demonstrate that it outperforms various other techniques (including LE) in the tasks of graph reconstruction and link prediction (a sketch of both constructions follows this entry).
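
The contrast between LE and the geometric construction can be sketched as follows. Caveat: the specific GLEE recipe shown (largest Laplacian eigenpairs scaled by the square root of their eigenvalues, so that the embedding's Gram matrix approximates $L$) is one reading of the method and should be treated as an assumption.

```python
# A sketch contrasting Laplacian Eigenmaps with a GLEE-style geometric
# embedding, based on the description above; details are assumptions.
import numpy as np

def laplacian(A):
    return np.diag(A.sum(axis=1)) - A

def le_embedding(A, d):
    # Classic LE: eigenvectors for the d smallest non-zero eigenvalues,
    # i.e., the distance-minimizing, similarity-preserving embedding.
    vals, vecs = np.linalg.eigh(laplacian(A))
    return vecs[:, 1:d + 1]            # skip the trivial constant vector

def glee_embedding(A, d):
    # GLEE-style: top-d eigenpairs, scaled so the Gram matrix of the
    # embedding reproduces L itself (simplex geometry, not distances).
    vals, vecs = np.linalg.eigh(laplacian(A))
    top = np.argsort(vals)[-d:]
    return vecs[:, top] * np.sqrt(vals[top])

# Toy graph: a 6-cycle (adjacency matrix), purely for demonstration.
A = np.zeros((6, 6))
for i in range(6):
    A[i, (i + 1) % 6] = A[(i + 1) % 6, i] = 1.0

S = glee_embedding(A, 3)
print(np.round(S @ S.T - laplacian(A), 2))  # best rank-d PSD approximation of L
```

LE keeps the smallest non-trivial eigenvectors so that similar nodes land close together; the geometric embedding instead reconstructs $L$ itself, so the node vectors inherit the graph's simplex geometry rather than a distance-minimizing layout.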