- Award ID(s):
- 1812843
- PAR ID:
- 10434397
- Editor(s):
- Przytycka, Teresa
- Date Published:
- Journal Name:
- Bioinformatics
- Volume:
- 37
- Issue:
- 17
- ISSN:
- 1367-4803
- Page Range / eLocation ID:
- 2787 to 2788
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
Ponty, Yann (Ed.)Abstract Summary Here, we present PhyloWGA, an open source R package for conducting phylogenetic analysis and investigation of whole genome data. Availabilityand implementation Available at Github (https://github.com/radamsRHA/PhyloWGA). Supplementary information Supplementary data are available at Bioinformatics online.more » « less
-
Abstract Summary Despite the availability of existing calculators for statistical power analysis in genetic association studies, there has not been a model-invariant and test-independent tool that allows for both planning of prospective studies and systematic review of reported findings. In this work, we develop a web-based application U-PASS (Unified Power analysis of ASsociation Studies), implementing a unified framework for the analysis of common association tests for binary qualitative traits. The application quantifies the shared asymptotic power limits of the common association tests, and visualizes the fundamental statistical trade-off between risk allele frequency and odds ratio. The application also addresses the applicability of asymptotics-based power calculations in finite samples, and provides guidelines for single-SNP-based association tests. In addition to designing prospective studies, U-PASS enables researchers to retrospectively assess the statistical validity of previously reported associations.
Availability and implementation U-PASS is an open-source R Shiny application. A live instance is hosted at https://power.stat.lsa.umich.edu. Source is available on https://github.com/Pill-GZ/U-PASS.
Supplementary information Supplementary data are available at Bioinformatics online.
-
Russell, Schwartz (Ed.)Abstract Summary We describe eMPRess, a software program for phylogenetic tree reconciliation under the duplication-transfer-loss model that systematically addresses the problems of choosing event costs and selecting representative solutions, enabling users to make more robust inferences. Availability and implementation eMPRess is freely available at http://www.cs.hmc.edu/empress. Supplementary information Supplementary data are available at Bioinformatics online.more » « less
-
Abstract Motivation De novo transcriptome analysis using RNA-seq offers a promising means to study gene expression in non-model organisms. Yet, the difficulty of transcriptome assembly means that the contigs provided by the assembler often represent a fractured and incomplete view of the transcriptome, complicating downstream analysis. We introduce Grouper, a new method for clustering contigs from de novo assemblies that are likely to belong to the same transcripts and genes; these groups can subsequently be analyzed more robustly. When provided with access to the genome of a related organism, Grouper can transfer annotations to the de novo assembly, further improving the clustering.
Results On de novo assemblies from four different species, we show that Grouper is able to accurately cluster a larger number of contigs than the existing state-of-the-art method. The Grouper pipeline is able to map greater than 10% more reads against the contigs, leading to accurate downstream differential expression analyses. The labeling module, in the presence of a closely related annotated genome, can efficiently transfer annotations to the contigs and use this information to further improve clustering. Overall, Grouper provides a complete and efficient pipeline for processing de novo transcriptomic assemblies.
Availability and implementation The Grouper software is freely available at https://github.com/COMBINE-lab/grouper under the 2-clause BSD license.
Supplementary information Supplementary data are available at Bioinformatics online.
-
Abstract Motivation Computational systems biology analyses typically make use of multiple software and their dependencies, which are often run across heterogeneous compute environments. This can introduce differences in performance and reproducibility. Capturing metadata (e.g. package versions, GPU model) currently requires repetitious code and is difficult to store centrally for analysis. Even where virtual environments and containers are used, updates over time mean that versioning metadata should still be captured within analysis pipelines to guarantee reproducibility.
Results Microbench is a simple and extensible Python package to automate metadata capture to a file or Redis database. Captured metadata can include execution time, software package versions, environment variables, hardware information, Python version and more, with plugins. We present three case studies demonstrating Microbench usage to benchmark code execution and examine environment metadata for reproducibility purposes.
Availability and implementation Install from the Python Package Index using pip install microbench. Source code is available from https://github.com/alubbock/microbench.
Supplementary information Supplementary data are available at Bioinformatics online.