Title: Reference-agnostic representation and visualization of pan-genomes
Abstract Background

The pan-genome of a species is the union of the genes and non-coding sequences present in all individuals (cultivars, accessions, or strains) within that species.
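The definition above can be illustrated with a minimal sketch, assuming each individual is described simply by a set of feature identifiers. The accession names, feature labels, and the `pan_genome` helper are hypothetical and are not part of PGV.

```python
def pan_genome(accessions):
    """Return the union of all features found in any accession.

    `accessions` maps an accession name to an iterable of feature
    identifiers (genes or non-coding sequences). Hypothetical input
    format, chosen only for illustration.
    """
    pan = set()
    for features in accessions.values():
        pan |= set(features)
    return pan


# Hypothetical example data: three accessions with overlapping features.
accessions = {
    "acc_A": {"g1", "g2", "g3"},
    "acc_B": {"g2", "g3", "g4"},
    "acc_C": {"g1", "g5"},
}

print(sorted(pan_genome(accessions)))
```

A feature belongs to the pan-genome as soon as one individual carries it, which is why the pan-genome is at least as large as any single individual's feature set.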

Results

Here we introduce PGV, a reference-agnostic representation of the pan-genome of a species based on the notion of consensus ordering. Our experimental results demonstrate that PGV enables an intuitive, effective and interactive visualization of a pan-genome by providing a genome browser that can elucidate complex structural genomic variations.

Conclusions

The PGV software can be installed via conda or downloaded from https://github.com/ucrbioinfo/PGV. The companion PGV browser at http://pgv.cs.ucr.edu can be tested using example bed tracks available from the GitHub page.

 
more » « less
Award ID(s):
1814359
PAR ID:
10306443
Author(s) / Creator(s):
;
Publisher / Repository:
Springer Science + Business Media
Date Published:
Journal Name:
BMC Bioinformatics
Volume:
22
Issue:
1
ISSN:
1471-2105
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Summary

    dadi is a popular software package for inferring models of demographic history and natural selection from population genomic data. However, using dadi requires Python scripting and manual parallelization of optimization jobs. We developed dadi-cli to simplify dadi usage and to enable straightforward distributed computing.

    Availability and Implementation

    dadi-cli is implemented in Python and released under the Apache License 2.0. The source code is available at https://github.com/xin-huang/dadi-cli. dadi-cli can be installed via PyPI and conda, and is also available through Cacao on Jetstream2 at https://cacao.jetstream-cloud.org/.

     
    more » « less
  2. Abstract Background

    Existing software for comparing species delimitation models does not provide a (true) metric or distance function between species delimitation models, nor a way to compare these models in terms of relative clustering differences along a lattice of partitions.

    Results

    Piikun is a Python package for analyzing and visualizing species delimitation models in an information theoretic framework that, in addition to classic measures of information such as the entropy and mutual information [1], provides for the calculation of the Variation of Information (VI) criterion [2], a true metric or distance function for species delimitation models that is aligned with the lattice of partitions.

    Conclusions

    Piikun is available under the MIT license from its public repository (https://github.com/jeetsukumaran/piikun), and can be installed locally using the Python package manager ‘pip’.
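To make the Variation of Information criterion concrete, the sketch below computes it for two partitions of the same items using the standard identity VI(A, B) = H(A) + H(B) − 2 I(A; B). It is a generic, standard-library illustration and does not use or reproduce the package's own API. The function names and the label-list input format are assumptions made for this example.

```python
from collections import Counter
from math import log


def entropy(labels):
    """Shannon entropy (in nats) of a partition given as one label per item."""
    n = len(labels)
    return -sum((c / n) * log(c / n) for c in Counter(labels).values())


def mutual_information(a, b):
    """Mutual information (in nats) between two partitions of the same items."""
    n = len(a)
    count_a, count_b = Counter(a), Counter(b)
    joint = Counter(zip(a, b))
    # p(x, y) * log(p(x, y) / (p(x) p(y))), with counts substituted for probabilities
    return sum(
        (c / n) * log(c * n / (count_a[x] * count_b[y]))
        for (x, y), c in joint.items()
    )


def variation_of_information(a, b):
    """VI(A, B) = H(A) + H(B) - 2 I(A; B); zero when the partitions coincide."""
    if len(a) != len(b):
        raise ValueError("both partitions must label the same items")
    return entropy(a) + entropy(b) - 2 * mutual_information(a, b)


# Hypothetical example: four individuals assigned to species under two models.
model_1 = ["sp1", "sp1", "sp2", "sp2"]  # two species
model_2 = ["spA", "spA", "spA", "spA"]  # everything lumped into one species

print(variation_of_information(model_1, model_1))
print(variation_of_information(model_1, model_2))
```

Only the grouping matters, not the label names, so two models that assign the same individuals to the same clusters under different species names are at distance zero.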

     
    more » « less
  3. Abstract Background

    Differential correlation networks are increasingly used to delineate changes in interactions among biomolecules. They characterize differences between omics networks under two different conditions, and can be used to delineate mechanisms of disease initiation and progression.

    Results

    We present a new R package, CorDiffViz, that facilitates the estimation and visualization of differential correlation networks using multiple correlation measures and inference methods. The software is implemented in , and , and is available at https://github.com/sqyu/CorDiffViz. Visualization has been tested for the Chrome and Firefox web browsers. A demo is available at https://diffcornet.github.io/CorDiffViz/demo.html.

    Conclusions

    Our software offers considerable flexibility by allowing the user to interact with the visualization and choose from different estimation methods and visualizations. It also allows the user to easily toggle between correlation networks for samples under one condition and differential correlations between samples under two conditions. Moreover, the software facilitates integrative analysis of cross-correlation networks between two omics data sets.
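As a rough illustration of what a differential correlation is, the sketch below computes, for every pair of variables, the Pearson correlation under the second condition minus the Pearson correlation under the first. It is a standard-library example with hypothetical data and function names. It does not reproduce the package's estimators or inference methods, and Pearson is only one of several possible correlation measures.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length samples (non-constant inputs)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)  # raises ZeroDivisionError for constant inputs


def differential_correlations(cond1, cond2):
    """Map each variable pair to r(condition 2) - r(condition 1).

    `cond1` and `cond2` map variable names to lists of measurements;
    only variables present under both conditions are compared.
    """
    shared = sorted(set(cond1) & set(cond2))
    return {
        (u, v): pearson(cond2[u], cond2[v]) - pearson(cond1[u], cond1[v])
        for u, v in combinations(shared, 2)
    }


# Hypothetical data: two biomolecules whose relationship flips between conditions.
condition_1 = {"m1": [1, 2, 3, 4], "m2": [2, 4, 6, 8]}
condition_2 = {"m1": [1, 2, 3, 4], "m2": [8, 6, 4, 2]}

print(differential_correlations(condition_1, condition_2))
```

Pairs whose difference is far from zero are the candidate edges of a differential network. Deciding which differences are statistically meaningful requires an inference step that this sketch omits.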

     
    more » « less
  4. A Gram-stain-negative, strictly anaerobic, non-motile, rod-shaped bacterium, designated SFB93T, was isolated from the intertidal sediments of South San Francisco Bay, located near Palo Alto, CA, USA. SFB93T was capable of acetylenotrophic and diazotrophic growth, grew at 22–37 °C, pH 6.3–8.5 and in the presence of 10–45 g l−1 NaCl. Phylogenetic analyses based on 16S rRNA gene sequencing showed that SFB93T represented a member of the genus Syntrophotalea with highest 16S rRNA gene sequence similarities to Syntrophotalea acetylenica DSM 3246T (96.6 %), Syntrophotalea carbinolica DSM 2380T (96.5 %), and Syntrophotalea venetiana DSM 2394T (96.7 %). Genome sequencing revealed a genome size of 3.22 Mbp and a DNA G+C content of 53.4 %. SFB93T had low genome-wide average nucleotide identity (81–87.5 %) and <70 % digital DNA–DNA hybridization value with other members of the genus Syntrophotalea. The phylogenetic position of SFB93T within the family Syntrophotaleaceae and as a novel member of the genus Syntrophotalea was confirmed via phylogenetic reconstruction based on concatenated alignments of 92 bacterial core genes. On the basis of the results of phenotypic, genotypic and phylogenetic analyses, a novel species, Syntrophotalea acetylenivorans sp. nov., is proposed, with SFB93T (=DSM 106009T =JCM 33327T =ATCC TSD-118T) as the type strain.

     
    more » « less
  5. A Gram-stain-negative, rod-shaped bacterial strain, designated Vibrio floridensis IRLE0018 (=NRRL B-65642 =NCTC 14661), was isolated from a cyanobacterial bloom along the Indian River Lagoon (IRL), a large and highly biodiverse estuary in eastern Florida (USA). The results of phylogenetic, biochemical, and phenotypic analyses indicate that this isolate is distinct from species of the genus Vibrio with validly published names and is the closest relative to the emergent human pathogen, Vibrio vulnificus. Here, we present the complete genome sequence of V. floridensis strain IRLE0018 (4 535 135 bp). On the basis of the established average nucleotide identity (ANI) values for the determination of different species (ANI <95 %), strain IRLE0018, with an ANI of approximately 92 % compared with its closest relative, V. vulnificus, represents a novel species within the genus Vibrio. To our knowledge, this represents the first time this species has been described. The results of genomic analyses of V. floridensis IRLE0018 indicate the presence of antibiotic resistance genes and several known virulence factors; however, its pathogenicity profile (e.g. survival in serum, phagocytosis avoidance) reveals limited virulence potential of this species in contrast to V. vulnificus.

     
    more » « less