skip to main content

Search for: All records

Creators/Authors contains: "Wang, Pei"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Obtaining solutions to optimal transportation (OT) problems is typically intractable when marginal spaces are continuous. Recent research has focused on approximating continuous solutions with discretization methods based on i.i.d. sampling, and this has shown convergence as the sample size increases. However, obtaining OT solutions with large sample sizes requires intensive computation effort, which can be prohibitive in practice. In this paper, we propose an algorithm for calculating discretizations with a given number of weighted points for marginal distributions by minimizing the (entropy-regularized) Wasserstein distance and providing bounds on the performance. The results suggest that our plans are comparable to those obtained with much larger numbers of i.i.d. samples and are more efficient than existing alternatives. Moreover, we propose a local, parallelizable version of such discretizations for applications, which we demonstrate by approximating adorable images.

    more » « less
    Free, publicly-accessible full text available June 1, 2024
  2. Abstract Background Applying directed acyclic graph (DAG) models to proteogenomic data has been shown effective for detecting causal biomarkers of complex diseases. However, there remain unsolved challenges in DAG learning to jointly model binary clinical outcome variables and continuous biomarker measurements. Results In this paper, we propose a new tool, DAGBagM, to learn DAGs with both continuous and binary nodes. By using appropriate models, DAGBagM allows for either continuous or binary nodes to be parent or child nodes. It employs a bootstrap aggregating strategy to reduce false positives in edge inference. At the same time, the aggregation procedure provides a flexible framework to robustly incorporate prior information on edges. Conclusions Through extensive simulation experiments, we demonstrate that DAGBagM has superior performance compared to alternative strategies for modeling mixed types of nodes. In addition, DAGBagM is computationally more efficient than two competing methods. When applying DAGBagM to proteogenomic datasets from ovarian cancer studies, we identify potential protein biomarkers for platinum refractory/resistant response in ovarian cancer. DAGBagM is made available as a github repository at . 
    more » « less
  3. Abstract High-sensitivity interstellar scintillation and polarization observations of PSR B0656+14 made at three epochs over a year using the Five-hundred-meter Aperture Spherical radio Telescope (FAST) show that the scattering is dominated by two different compact regions. We identify the one nearer to the pulsar with the shell of the Monogem Ring, thereby confirming the association. The other is probably associated with the Local Bubble. We find that the observed position angles of the pulsar spin axis and the spatial velocity are significantly different, with a separation of 19.°3 ± 0.°8, inconsistent with a previously published near-perfect alignment of 1° ± 2°. The two independent scattering regions are clearly defined in the secondary spectra, which show two strong forward parabolic arcs. The arc curvatures imply that the scattering screens corresponding to the outer and inner arcs are located approximately 28 pc from PSR B0656+14 and 185 pc from the Earth, respectively. Comparison of the observed Doppler profiles with electromagnetic simulations shows that both scattering regions are mildly anisotropic. For the outer arc, we estimate the anisotropy A R to be approximately 1.3, with the scattering irregularities aligned parallel to the pulsar velocity. For the outer arc, we compare the observed delay profiles with delay profiles computed from a theoretical strong-scattering model. Our results suggest that the spatial spectrum of the scattering irregularities in the Monogem Ring is flatter than Kolmogorov, but further observations are required to confirm this. 
    more » « less
  4. Abstract Potato ( Solanum tuberosum L.) is the world’s most important non-cereal food crop, and the vast majority of commercially grown cultivars are highly heterozygous tetraploids. Advances in diploid hybrid breeding based on true seeds have the potential to revolutionize future potato breeding and production 1–4 . So far, relatively few studies have examined the genome evolution and diversity of wild and cultivated landrace potatoes, which limits the application of their diversity in potato breeding. Here we assemble 44 high-quality diploid potato genomes from 24 wild and 20 cultivated accessions that are representative of Solanum section Petota , the tuber-bearing clade, as well as 2 genomes from the neighbouring section, Etuberosum . Extensive discordance of phylogenomic relationships suggests the complexity of potato evolution. We find that the potato genome substantially expanded its repertoire of disease-resistance genes when compared with closely related seed-propagated solanaceous crops, indicative of the effect of tuber-based propagation strategies on the evolution of the potato genome. We discover a transcription factor that determines tuber identity and interacts with the mobile tuberization inductive signal SP6A. We also identify 561,433 high-confidence structural variants and construct a map of large inversions, which provides insights for improving inbred lines and precluding potential linkage drag, as exemplified by a 5.8-Mb inversion that is associated with carotenoid content in tubers. This study will accelerate hybrid potato breeding and enrich our understanding of the evolution and biology of potato as a global staple food crop. 
    more » « less
  5. null (Ed.)