skip to main content

Title: Backbone: An R package for extracting the backbone of bipartite projections
Bipartite projections are used in a wide range of network contexts including politics (bill co-sponsorship), genetics (gene co-expression), economics (executive board co-membership), and innovation (patent co-authorship). However, because bipartite projections are always weighted graphs, which are inherently challenging to analyze and visualize, it is often useful to examine the ‘backbone,’ an unweighted subgraph containing only the most significant edges. In this paper, we introduce the R package backbone for extracting the backbone of weighted bipartite projections, and use bill sponsorship data from the 114 th session of the United States Senate to demonstrate its functionality.  more » « less
Award ID(s):
Author(s) / Creator(s):
; ;
Rozenblat, Celine
Date Published:
Journal Name:
Page Range / eLocation ID:
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Cherifi, Hocine (Ed.)
    Networks are useful for representing phenomena in a broad range of domains. Although their ability to represent complexity can be a virtue, it is sometimes useful to focus on a simplified network that contains only the most important edges: the backbone. This paper introduces and demonstrates a substantially expanded version of the backbone package for R, which now provides methods for extracting backbones from weighted networks, weighted bipartite projections, and unweighted networks. For each type of network, fully replicable code is presented first for small toy examples, then for complete empirical examples using transportation, political, and social networks. The paper also demonstrates the implications of several issues of statistical inference that arise in backbone extraction. It concludes by briefly reviewing existing applications of backbone extraction using the backbone package, and future directions for research on network backbone extraction. 
    more » « less
  2. Abstract Projections of bipartite or two-mode networks capture co-occurrences, and are used in diverse fields (e.g., ecology, economics, bibliometrics, politics) to represent unipartite networks. A key challenge in analyzing such networks is determining whether an observed number of co-occurrences between two nodes is significant, and therefore whether an edge exists between them. One approach, the fixed degree sequence model (FDSM), evaluates the significance of an edge’s weight by comparison to a null model in which the degree sequences of the original bipartite network are fixed. Although the FDSM is an intuitive null model, it is computationally expensive because it requires Monte Carlo simulation to estimate each edge’s p value, and therefore is impractical for large projections. In this paper, we explore four potential alternatives to FDSM: fixed fill model, fixed row model, fixed column model, and stochastic degree sequence model (SDSM). We compare these models to FDSM in terms of accuracy, speed, statistical power, similarity, and ability to recover known communities. We find that the computationally-fast SDSM offers a statistically conservative but close approximation of the computationally-impractical FDSM under a wide range of conditions, and that it correctly recovers a known community structure even when the signal is weak. Therefore, although each backbone model may have particular applications, we recommend SDSM for extracting the backbone of bipartite projections when FDSM is impractical. 
    more » « less
  3. We investigate cost-efficient upgrade strategies for capacity enhancement in optical backbone networks enabled by C+L-band optical line systems. A multi-period strategy for upgrading network links from the C band to the C+L band is proposed, ensuring physical-layer awareness, cost effectiveness, and less than 0.1% blocking. Results indicate that the performance of an upgrade strategy depends on efficient selection of the sequence of links to be upgraded and on the time instant to upgrade, which are either topology or traffic dependent. Given a network topology, a set of traffic demands, and growth projections, our illustrative numerical results show that a well-devised upgrade strategy can achieve superior cost efficiency during the capacity upgrade to C+L enhancement.

    more » « less
  4. Members of Congress represent geographically demarcated districts embedded in subnational policy environments. Drawing on policy feedback literature and literature on congressional representation, I argue that, because of this institutional configuration, subnational policy adoption can affect national representation. More specifically, policy reforms in the states they represent can increase pressures members face from organized groups and individuals in their constituencies to promote aligned federal policies. Empirically, I examine the effects of state marijuana legalization. The inferential design leverages differences across the states in statewide citizen initiative institutions, which provides exogenous variation in legalization. Instrumental variables analysis indicates legalization influenced pro‐marijuana bill sponsorship and roll calls in the 116th Congress. The evidence points to growing influence of industry in legalizing states—including the ability to mobilize employees and customers—as the key mechanism, thus underscoring the importance of a political economy perspective for studying interdependencies in American federalism.

    more » « less
  5. Abstract Due to genome segmentation, rotaviruses must co-package eleven distinct genomic RNAs. The packaging is mediated by virus-encoded RNA chaperones, such as the rotavirus NSP2 protein. While the activities of distinct RNA chaperones are well studied on smaller RNAs, little is known about their global effect on the entire viral transcriptome. Here, we used Selective 2′-hydroxyl Acylation Analyzed by Primer Extension and Mutational Profiling (SHAPE-MaP) to examine the secondary structure of the rotavirus transcriptome in the presence of increasing amounts of NSP2. SHAPE-MaP data reveals that despite the well-documented helix-unwinding activity of NSP2 in vitro, its incubation with cognate rotavirus transcripts does not induce a significant change in the SHAPE reactivities. However, a quantitative analysis of mutation rates measured by mutational profiling reveals a global 5-fold rate increase in the presence of NSP2. We demonstrate that the normalization procedure used in deriving SHAPE reactivities from mutation rates can mask an important global effect of an RNA chaperone. Analysis of the mutation rates reveals a larger effect on stems rather than loops. Together, these data provide the first experimentally derived secondary structure model of the rotavirus transcriptome and reveal that NSP2 acts by globally increasing RNA backbone flexibility in a concentration-dependent manner. 
    more » « less