Abstract BackgroundHistone post-translational modifications (PTMs) play an important role in our system by regulating the structure of chromatin and therefore contribute to the regulation of gene and protein expression. Irregularities in histone PTMs can lead to a variety of different diseases including various forms of cancer. Histone modifications are analyzed using high resolution mass spectrometry, which generate large amounts of data that requires sophisticated bioinformatics tools for analysis and visualization. PTMViz is designed for downstream differential abundance analysis and visualization of both protein and/or histone modifications. ResultsPTMViz provides users with data tables and visualization plots of significantly differentiated proteins and histone PTMs between two sample groups. All the data is packaged into interactive data tables and graphs using the Shiny platform to help the user explore the results in a fast and efficient manner to assess if changes in the system are due to protein abundance changes or epigenetic changes. In the example data provided, we identified several proteins differentially regulated in the dopaminergic pathway between mice treated with methamphetamine compared to a saline control. We also identified histone post-translational modifications including histone H3K9me, H3K27me3, H4K16ac, and that were regulated due to drug exposure. ConclusionsHistone modifications play an integral role in the regulation of gene expression. PTMViz provides an interactive platform for analyzing proteins and histone post-translational modifications from mass spectrometry data in order to quickly identify differentially expressed proteins and PTMs. 
                        more » 
                        « less   
                    
                            
                            C3NA: correlation and consensus-based cross-taxonomy network analysis for compositional microbial data
                        
                    
    
            Abstract BackgroundStudying the co-occurrence network structure of microbial samples is one of the critical approaches to understanding the perplexing and delicate relationship between the microbe, host, and diseases. It is also critical to develop a tool for investigating co-occurrence networks and differential abundance analyses to reveal the disease-related taxa–taxa relationship. In addition, it is also necessary to tighten the co-occurrence network into smaller modules to increase the ability for functional annotation and interpretability of  these taxa-taxa relationships.  Also, it is critical to retain the phylogenetic relationship among the taxa to identify differential abundance patterns, which can be used to resolve contradicting functions reported by different studies. ResultsIn this article, we present Correlation and Consensus-based Cross-taxonomy Network Analysis (C3NA), a user-friendly R package for investigating compositional microbial sequencing data to identify and compare co-occurrence patterns across different taxonomic levels. C3NA contains two interactive graphic user interfaces (Shiny applications), one of them dedicated to the comparison between two diagnoses, e.g., disease versus control. We used C3NA to analyze two well-studied diseases, colorectal cancer, and Crohn’s disease. We discovered clusters of study and disease-dependent taxa that overlap with known functional taxa studied by other discovery studies and differential abundance analyses. ConclusionC3NA offers a new microbial data analyses pipeline for refined and enriched taxa–taxa co-occurrence network analyses, and the usability was further expanded via the built-in Shiny applications for interactive investigation. 
        more » 
        « less   
        
    
                            - Award ID(s):
- 2133504
- PAR ID:
- 10379315
- Publisher / Repository:
- Springer Science + Business Media
- Date Published:
- Journal Name:
- BMC Bioinformatics
- Volume:
- 23
- Issue:
- 1
- ISSN:
- 1471-2105
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
- 
            
- 
            Abstract AimAbundance–occupancy relationships posit that more locally abundant species occupy more sites than less abundant species. Although widely supported, the occurrence and detection of abundance–occupancy relationships is sensitive to sampling and detection processes. Data from large‐scale standardized sampling efforts are key to address abundance–occupancy relationships. We aimed to use such a dataset to evaluate the occurrence of abundance–occupancy relationships across different spatial grains and over time for aquatic and terrestrial taxa. LocationUSA. Time period2014–2019. Major taxa studiedBirds, mammals, beetles, ticks, fishes, macroinvertebrates and zooplankton. MethodsSpecies abundance and occupancy data were obtained from the National Ecological Observatory Network (NEON). Species mean abundance and occupancy (fraction of sampled locations that were occupied) were estimated for three different spatial grains (i.e., plot, site and domain) for all years sampled. Linear models were used to explore the consistency of interspecific abundance–occupancy relationships. The slope coefficients of these models were related to temporal and spatial variables and to species richness while controlling for taxa in a linear mixed‐effects model (LMM) framework. ResultsWe found evidence for positive abundance–occupancy relationships across the three spatial grains and over time for all taxa we studied. However, our linear models had low explanatory power, suggesting that relationships, although general, were weak. Abundance–occupancy relationships were slightly stronger at the smallest spatial grain than at the largest spatial grain, but showed no detectable change over time for any taxa. Finally, species richness was not associated with the strength of these relationships. Main conclusionsTogether, our results suggest that positive interspecific abundance–occupancy relationships are fairly general but are not capable of explaining substantial variation in spatial patterns of abundance, and that other factors, such as species traits and niche, are also likely to influence these relationships.more » « less
- 
            Abstract BackgroundA common task in analyzing metatranscriptomics data is to identify microbial metabolic pathways with differential RNA abundances across multiple sample groups. With information from paired metagenomics data, some differential methods control for either DNA or taxa abundances to address their strong correlation with RNA abundance. However, it remains unknown if both factors need to be controlled for simultaneously. ResultsWe discovered that when either DNA or taxa abundance is controlled for, RNA abundance still has a strong partial correlation with the other factor. In both simulation studies and a real data analysis, we demonstrated that controlling for both DNA and taxa abundances leads to superior performance compared to only controlling for one factor. ConclusionsTo fully address the confounding effects in analyzing metatranscriptomics data, both DNA and taxa abundances need to be controlled for in the differential analysis.more » « less
- 
            Faust, Karoline (Ed.)ABSTRACT Microbes commonly organize into communities consisting of hundreds of species involved in complex interactions with each other. 16S ribosomal RNA (16S rRNA) amplicon profiling provides snapshots that reveal the phylogenies and abundance profiles of these microbial communities. These snapshots, when collected from multiple samples, can reveal the co-occurrence of microbes, providing a glimpse into the network of associations in these communities. However, the inference of networks from 16S data involves numerous steps, each requiring specific tools and parameter choices. Moreover, the extent to which these steps affect the final network is still unclear. In this study, we perform a meticulous analysis of each step of a pipeline that can convert 16S sequencing data into a network of microbial associations. Through this process, we map how different choices of algorithms and parameters affect the co-occurrence network and identify the steps that contribute substantially to the variance. We further determine the tools and parameters that generate robust co-occurrence networks and develop consensus network algorithms based on benchmarks with mock and synthetic data sets. The Microbial Co-occurrence Network Explorer, or MiCoNE (available athttps://github.com/segrelab/MiCoNE) follows these default tools and parameters and can help explore the outcome of these combinations of choices on the inferred networks. We envisage that this pipeline could be used for integrating multiple data sets and generating comparative analyses and consensus networks that can guide our understanding of microbial community assembly in different biomes. IMPORTANCEMapping the interrelationships between different species in a microbial community is important for understanding and controlling their structure and function. The surge in the high-throughput sequencing of microbial communities has led to the creation of thousands of data sets containing information about microbial abundances. These abundances can be transformed into co-occurrence networks, providing a glimpse into the associations within microbiomes. However, processing these data sets to obtain co-occurrence information relies on several complex steps, each of which involves numerous choices of tools and corresponding parameters. These multiple options pose questions about the robustness and uniqueness of the inferred networks. In this study, we address this workflow and provide a systematic analysis of how these choices of tools affect the final network and guidelines on appropriate tool selection for a particular data set. We also develop a consensus network algorithm that helps generate more robust co-occurrence networks based on benchmark synthetic data sets.more » « less
- 
            ABSTRACT AimBeta diversity quantifies the similarity of ecological assemblages. Its increase, known as biotic homogenisation, can be a consequence of biological invasions. However, species occurrence (presence/absence) and abundance‐based analyses can produce contradictory assessments of the magnitude and direction of changes in beta diversity. Previous work indicates these contradictions should be less frequent in nature than in theory, but a growing number of empirical studies report discrepancies between occurrence‐ and abundance‐based approaches. Understanding if these discrepancies represent a few isolated cases or are systematic across a diversity of ecosystems would allow us to better understand the general patterns, mechanisms and impacts of biotic homogenisation. LocationUnited States. Time Period1963–2020. Major Taxa StudiedVascular plants. MethodsWe used a dataset of more than 70,000 vegetation survey plots to assess differences in biotic homogenisation with and without invasion using both occurrence‐ and abundance‐based metrics of beta diversity. We estimated taxonomic biotic homogenisation by comparing beta diversity of invaded and uninvaded plots with both classes of metrics and investigated the characteristics of the non‐native species pool that influenced the likelihood that these metrics disagree. ResultsIn 78% of plot comparisons, occurrence‐ and abundance‐based calculations agreed in direction, and the two metrics were generally well correlated. Our empirical results are consistent with previous theory. Discrepancies between the metrics were more likely when the same non‐native species was at high cover at both plots compared for beta diversity, and when these plots were spatially distant. Main ConclusionsIn about 20% of cases, our calculations revealed differences in direction (homogenisation vs. differentiation) when comparing occurrence‐ and abundance‐based metrics, indicating that the metrics are not interchangeable, especially when distances between plots are high and invader diversity is low. When data permit, combining the two approaches can offer insights into the role of invasions and extirpations in driving biotic homogenisation/differentiation.more » « less
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
