skip to main content


Search for: All records

Creators/Authors contains: "Arsuaga, Javier"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Abstract

    Establishing the host range for novel viruses remains a challenge. Here, we address the challenge of identifying non-human animal coronaviruses that may infect humans by creating an artificial neural network model that learns from spike protein sequences of alpha and beta coronaviruses and their binding annotation to their host receptor. The proposed method produces a human-Binding Potential (h-BiP) score that distinguishes, with high accuracy, the binding potential among coronaviruses. Three viruses, previously unknown to bind human receptors, were identified: Bat coronavirus BtCoV/133/2005 and Pipistrellus abramus bat coronavirus HKU5-related (both MERS related viruses), andRhinolophus affiniscoronavirus isolate LYRa3 (a SARS related virus). We further analyze the binding properties of BtCoV/133/2005 and LYRa3 using molecular dynamics. To test whether this model can be used for surveillance of novel coronaviruses, we re-trained the model on a set that excludes SARS-CoV-2 and all viral sequences released after the SARS-CoV-2 was published. The results predict the binding of SARS-CoV-2 with a human receptor, indicating that machine learning methods are an excellent tool for the prediction of host expansion events.

     
    more » « less
  2. We study equilibrium configurations of double-stranded DNA in a cylindrical viral capsid. The state of the encapsidated DNA consists of a disordered inner core enclosed by an ordered outer region, next to the capsid wall. The DNA configuration is described by a unit helical vector field, tangent to an associated centre curve, passing through properly selected locations. We postulate an expression for the energy of the encapsulated DNA based on that of columnar chromonic liquid crystals. A thorough analysis of the Euler–Lagrange equations yields multiple solutions. We demonstrate that there is a trivial, non-helical solution, together with two solutions with non-zero helicity of opposite sign. Using bifurcation analysis, we derive the conditions for local stability and determine when the preferred coiling state is helical. The bifurcation parameters are the ratio of the twist versus the bend moduli of DNA and the ratio between the sizes of the ordered and the disordered regions. 
    more » « less
  3. Copy number changes play an important role in the development of cancer and are commonly associated with changes in gene expression. Persistence curves, such as Betti curves, have been used to detect copy number changes; however, it is known these curves are unstable with respect to small perturbations in the data. We address the stability of lifespan and Betti curves by providing bounds on the distance between persistence curves of Vietoris–Rips filtrations built on data and slightly perturbed data in terms of the bottleneck distance. Next, we perform simulations to compare the predictive ability of Betti curves, lifespan curves (conditionally stable) and stable persistent landscapes to detect copy number aberrations. We use these methods to identify significant chromosome regions associated with the four major molecular subtypes of breast cancer: Luminal A, Luminal B, Basal and HER2 positive. Identified segments are then used as predictor variables to build machine learning models which classify patients as one of the four subtypes. We find that no single persistence curve outperforms the others and instead suggest a complementary approach using a suite of persistence curves. In this study, we identified new cytobands associated with three of the subtypes: 1q21.1-q25.2, 2p23.2-p16.3, 23q26.2-q28 with the Basal subtype, 8p22-p11.1 with Luminal B and 2q12.1-q21.1 and 5p14.3-p12 with Luminal A. These segments are validated by the TCGA BRCA cohort dataset except for those found for Luminal A. 
    more » « less
  4. null (Ed.)
    We study equilibrium configurations of hexagonal columnar liquid crystals in the context of characterizing packing structures of bacteriophage viruses in a protein capsid. These are viruses that infect bacteria and are currently the focus of intense research efforts, with the goal of finding new therapies for bacteria-resistant antibiotics. The energy that we propose consists of the Oseen–Frank free energy of nematic liquid crystals that penalizes bending of the columnar directions, in addition to the cross-sectional elastic energy accounting for distortions of the transverse hexagonal structure; we also consider the isotropic contribution of the core and the energy of the unknown interface between the outer ordered region of the capsid and the inner disordered core. The problem becomes of free boundary type, with constraints. We show that the concentric, azimuthal, spool-like configuration is the absolute minimizer. Moreover, we present examples of toroidal structures formed by DNA in free solution and compare them with the analogous ones occurring in experiments with other types of lyotropic liquid crystals, such as food dyes and additives. This article is part of the theme issue ‘Topics in mathematical design of complex materials’. 
    more » « less
  5. Mice are the most commonly used model animals for itch research and for development of anti-itch drugs. Most laboratories manually quantify mouse scratching behavior to assess itch intensity. This process is labor-intensive and limits large-scale genetic or drug screenings. In this study, we developed a new system, Scratch-AID ( A utomatic I tch D etection), which could automatically identify and quantify mouse scratching behavior with high accuracy. Our system included a custom-designed videotaping box to ensure high-quality and replicable mouse behavior recording and a convolutional recurrent neural network trained with frame-labeled mouse scratching behavior videos, induced by nape injection of chloroquine. The best trained network achieved 97.6% recall and 96.9% precision on previously unseen test videos. Remarkably, Scratch-AID could reliably identify scratching behavior in other major mouse itch models, including the acute cheek model, the histaminergic model, and a chronic itch model. Moreover, our system detected significant differences in scratching behavior between control and mice treated with an anti-itch drug. Taken together, we have established a novel deep learning-based system that could replace manual quantification for mouse scratching behavior in different itch models and for drug screening. 
    more » « less
  6. null (Ed.)
  7. Abstract

    The three dimensional organization of genomes remains mostly unknown due to their high degree of condensation. Biophysical studies predict that condensation promotes the topological entanglement of chromatin fibers and the inhibition of function. How organisms balance between functionally active genomes and a high degree of condensation remains to be determined. Here we hypothesize that the Rabl configuration, characterized by the attachment of centromeres and telomeres to the nuclear envelope, helps to reduce the topological entanglement of chromosomes. To test this hypothesis we developed a novel method to quantify chromosome entanglement complexity in 3D reconstructions obtained from Chromosome Conformation Capture (CCC) data. Applying this method to published data of the yeast genome, we show that computational models implementing the attachment of telomeres or centromeres alone are not sufficient to obtain the reduced entanglement complexity observed in 3D reconstructions. It is only when the centromeres and telomeres are attached to the nuclear envelope (i.e. the Rabl configuration) that the complexity of entanglement of the genome is comparable to that of the 3D reconstructions. We therefore suggest that the Rabl configuration is an essential player in the simplification of the entanglement of chromatin fibers.

     
    more » « less