Search for: All records

Creators/Authors contains: "Aluru, Srinivas"

MCPNet: a parallel maximum capacity-based genome-scale gene network construction framework

https://doi.org/10.1093/bioinformatics/btad373

Pan, Tony C. ; Chockalingam, Sriram P. ; Aluru, Maneesha ; Aluru, Srinivas ; Cowen, Lenore (ed.) ( June 2023 , Bioinformatics)

Abstract Motivation
Gene network reconstruction from gene expression profiles is a compute- and data-intensive problem. Numerous methods based on diverse approaches including mutual information, random forests, Bayesian networks, correlation measures, as well as their transforms and filters such as data processing inequality, have been proposed. However, an effective gene network reconstruction method that performs well in all three aspects of computational efficiency, data size scalability, and output quality remains elusive. Simple techniques such as Pearson correlation are fast to compute but ignore indirect interactions, while more robust methods such as Bayesian networks are prohibitively time consuming to apply to tens of thousands of genes.
Results
We developed the maximum capacity path (MCP) score, a novel maximum-capacity-path-based metric to quantify the relative strengths of direct and indirect gene–gene interactions. We further present MCPNet, an efficient, parallelized gene network reconstruction software based on the MCP score, to reverse engineer networks in unsupervised and ensemble manners. Using synthetic and real Saccharomyces cerevisiae datasets as well as real Arabidopsis thaliana datasets, we demonstrate that MCPNet produces better quality networks as measured by AUPRC, is significantly faster than all other gene network reconstruction software, and also scales well to tens of thousands of genes and hundreds of CPU cores. Thus, MCPNet represents a new gene network reconstruction tool that simultaneously achieves quality, performance, and scalability requirements.
Availability and implementation
Source code freely available for download at https://doi.org/10.5281/zenodo.6499747 and https://github.com/AluruLab/MCPNet, implemented in C++ and supported on Linux.
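The abstract does not spell out how the MCP score is computed, so the following is only a minimal sketch of the general graph notion it builds on: the capacity of a path is its weakest edge, and a maximum-capacity path between two nodes maximizes that bottleneck. The function name `max_capacity`, the dict-of-dicts graph layout, and the example weights are assumptions made for illustration; this is not MCPNet's implementation (which is written in C++).

```python
import heapq


def max_capacity(weights, source, target):
    """Largest bottleneck (minimum edge weight) over all source-target paths.

    `weights` is a hypothetical adjacency map {node: {neighbor: weight}}
    with positive weights. Returns 0.0 if target is unreachable.
    """
    best = {source: float("inf")}       # best bottleneck found per node
    heap = [(-float("inf"), source)]    # max-heap via negated capacities
    while heap:
        neg_cap, u = heapq.heappop(heap)
        cap = -neg_cap
        if u == target:
            return cap
        if cap < best.get(u, 0.0):
            continue                    # stale heap entry
        for v, w in weights.get(u, {}).items():
            new_cap = min(cap, w)       # a path is as strong as its weakest edge
            if new_cap > best.get(v, 0.0):
                best[v] = new_cap
                heapq.heappush(heap, (-new_cap, v))
    return 0.0
```

In a toy graph where A and C share a weak direct edge (0.2) but are both strongly linked to B (0.9 and 0.8), `max_capacity` from A to C returns 0.8, the bottleneck of the indirect route A-B-C. Comparing such indirect capacities against the direct edge weight is one plausible way to contrast direct and indirect interactions.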

On the Hardness of Sequence Alignment on De Bruijn Graphs

https://doi.org/10.1089/cmb.2022.0411

Gibney, Daniel ; Thankachan, Sharma V. ; Aluru, Srinivas ( December 2022 , Journal of Computational Biology)

Full Text Available
Haplotype-aware variant selection for genome graphs

Tavakoli, Neda ; Gibney, Daniel ; Aluru, Srinivas ( August 2022 , Proceedings of the 13th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics)

Graph-based genome representations have proven to be a powerful tool in genomic analysis due to their ability to encode variations found in multiple haplotypes and capture population genetic diversity. Such graphs also unavoidably contain paths which switch between haplotypes (i.e., recombinant paths) and thus do not fully match any of the constituent haplotypes. The number of such recombinant paths increases combinatorially with path length and causes inefficiencies and false positives when mapping reads. In this paper, we study the problem of finding reduced haplotype-aware genome graphs that incorporate only a selected subset of variants, yet contain paths corresponding to all α-long substrings of the input haplotypes (i.e., non-recombinant paths) with at most δ mismatches. Solving this problem optimally, i.e., minimizing the number of variants selected, was previously known to be NP-hard. Here, we first establish several inapproximability results regarding finding haplotype-aware reduced variation graphs of optimal size. We then present an integer linear programming (ILP) formulation for solving the problem, and experimentally demonstrate that this is a computationally feasible approach for real-world problems and provides far superior reduction compared to prior approaches.
Full Text Available
Feasibility of Flow Decomposition with Subpath Constraints in Linear Time

Gibney, Daniel ; Thankachan, Sharma V. ; Aluru, Srinivas ( January 2022 , 22nd International Workshop on Algorithms in Bioinformatics (WABI 2022))

Full Text Available
The Complexity of Approximate Pattern Matching on de Bruijn Graphs

https://doi.org/10.1007/978-3-031-04749-7_16

Gibney, Daniel ; Thankachan, Sharma V. ; Aluru, Srinivas ( January 2022 , Research in Computational Molecular Biology - 26th Annual International Conference, RECOMB 2022)

Full Text Available
GRNUlar: A Deep Learning Framework for Recovering Single-Cell Gene Regulatory Networks

https://doi.org/10.1089/cmb.2021.0437

Shrivastava, Harsh ; Zhang, Xiuwei ; Song, Le ; Aluru, Srinivas ( January 2022 , Journal of Computational Biology)

Full Text Available
Parallel construction of module networks

https://doi.org/10.1145/3458817.3476207

Srivastava, Ankit ; Chockalingam, Sriram P. ; Aluru, Maneesha ; Aluru, Srinivas ( November 2021 , Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC))

Full Text Available
Reply to: “Re-evaluating the evidence for a universal genetic boundary among microbial species”

https://doi.org/10.1038/s41467-021-24129-1

Rodriguez-R, Luis M. ; Jain, Chirag ; Conrad, Roth E. ; Aluru, Srinivas ; Konstantinidis, Konstantinos T. ( December 2021 , Nature Communications)
Full Text Available
A variant selection framework for genome graphs

https://doi.org/10.1093/bioinformatics/btab302

Jain, Chirag ; Tavakoli, Neda ; Aluru, Srinivas ( July 2021 , Bioinformatics)

Abstract Motivation
Variation graph representations are projected to either replace or supplement conventional single genome references due to their ability to capture population genetic diversity and reduce reference bias. Vast catalogues of genetic variants for many species now exist, and it is natural to ask which among these are crucial to circumvent reference bias during read mapping.
Results
In this work, we propose a novel mathematical framework for variant selection, by casting it in terms of minimizing variation graph size subject to preserving paths of length α with at most δ differences. This framework leads to a rich set of problems based on the types of variants [e.g. single nucleotide polymorphisms (SNPs), indels or structural variants (SVs)], and whether the goal is to minimize the number of positions at which variants are listed or to minimize the total number of variants listed. We classify the computational complexity of these problems and provide efficient algorithms along with their software implementation when feasible. We empirically evaluate the magnitude of graph reduction achieved in human chromosome variation graphs using multiple α and δ parameter values corresponding to short and long-read resequencing characteristics. When our algorithm is run with parameter settings amenable to long-read mapping (α = 10 kbp, δ = 1000), 99.99% SNPs and 73% SVs can be safely excluded from human chromosome 1 variation graph. The graph size reduction can benefit downstream pan-genome analysis.
Availability and implementation
https://github.com/AT-CG/VF.
Supplementary information
Supplementary data are available at Bioinformatics online.
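To make the α/δ constraint concrete, here is a minimal sketch under a deliberately simplified reading: variants are SNPs at single coordinates on a linear reference, each excluded SNP can introduce at most one difference, and the requirement is that no window of α consecutive bases contains more than δ excluded SNPs. The function name `select_snps_to_drop` and this left-to-right greedy rule are assumptions for illustration; the abstract does not describe the paper's algorithms, and this is not the VF implementation.

```python
from collections import deque


def select_snps_to_drop(positions, alpha, delta):
    """Greedily exclude SNPs while keeping every alpha-length window
    to at most delta excluded SNPs.

    `positions` must be sorted SNP coordinates (hypothetical input format).
    Returns the list of excluded positions.
    """
    dropped = []
    recent = deque()  # excluded positions within the last alpha bases
    for p in positions:
        # Forget exclusions that can no longer share a window with p.
        while recent and recent[0] <= p - alpha:
            recent.popleft()
        # Exclude p only if the window ending at p stays within budget.
        if len(recent) < delta:
            recent.append(p)
            dropped.append(p)
    return dropped
```

With SNPs at 1, 2, 3, 10, 11 and 20, α = 5 and δ = 1, the sketch excludes positions 1, 10 and 20 and retains the rest; raising δ to 2 lets it also exclude 2 and 11.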
Full Text Available
EnGRaiN: a supervised ensemble learning method for recovery of large-scale gene regulatory networks

https://doi.org/10.1093/bioinformatics/btab829

Aluru, Maneesha ; Shrivastava, Harsh ; Chockalingam, Sriram P. ; Shivakumar, Shruti ; Aluru, Srinivas ; Martelli, Pier Luigi (ed.) ( December 2021 , Bioinformatics)

Abstract Motivation
Reconstruction of genome-scale networks from gene expression data is an actively studied problem. A wide range of methods have been proposed that differ in the types of interactions they uncover, with varying trade-offs between sensitivity and specificity. To leverage the benefits of multiple such methods, ensemble network methods that combine predictions from the resulting networks have been developed, promising results better than or as good as the individual networks. Perhaps owing to the difficulty in obtaining accurate training examples, these ensemble methods have hitherto been unsupervised.
Results
In this article, we introduce EnGRaiN, the first supervised ensemble learning method to construct gene networks. The supervision for training is provided by small training datasets of true edge connections (positives) and edges known to be absent (negatives) among gene pairs. We demonstrate the effectiveness of EnGRaiN using simulated datasets as well as a curated collection of Arabidopsis thaliana datasets we created from microarray datasets available from public repositories. Compared with unsupervised methods for ensemble network construction, EnGRaiN not only shows better receiver operating characteristic (ROC) and precision-recall (PR) characteristics for both real and simulated datasets, but also generates networks that can be mined for elucidating complex biological interactions.
Availability and implementation
EnGRaiN software and the datasets used in the study are publicly available at the github repository: https://github.com/AluruLab/EnGRaiN.
Supplementary information
Supplementary data are available at Bioinformatics online.
