Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Abstract Many statistical models for networks overlook the fact that most real-world networks are formed through a growth process. To address this, we introduce the Preferential Attachment Plus Erdős–Rényi model, where we let a random network G be the union of a preferential attachment (PA) tree T and additional Erdős–Rényi (ER) random edges. The PA tree captures the underlying growth process of a network where vertices/edges are added sequentially, while the ER component can be regarded as noise. Given only one snapshot of the final network G, we study the problem of constructing confidence sets for the root node of the unobserved growth process; the root node can be patient zero in an infection network or the source of fake news in a social network. We propose inference algorithms based on Gibbs sampling that scales to networks with millions of nodes and provide theoretical analysis showing that the size of the confidence set is small if the noise level of the ER edges is not too large. We also propose variations of the model in which multiple growth processes occur simultaneously, reflecting the growth of multiple communities; we use these models to provide a new approach to community detection.more » « less
-
Abstract The spread of infectious disease in a human community or the proliferation of fake news on social media can be modelled as a randomly growing tree-shaped graph. The history of the random growth process is often unobserved but contains important information such as the source of the infection. We consider the problem of statistical inference on aspects of the latent history using only a single snapshot of the final tree. Our approach is to apply random labels to the observed unlabelled tree and analyse the resulting distribution of the growth process, conditional on the final outcome. We show that this conditional distribution is tractable under a shape exchangeability condition, which we introduce here, and that this condition is satisfied for many popular models for randomly growing trees such as uniform attachment, linear preferential attachment and uniform attachment on a D-regular tree. For inference of the root under shape exchangeability, we propose O(n log n) time algorithms for constructing confidence sets with valid frequentist coverage as well as bounds on the expected size of the confidence sets. We also provide efficient sampling algorithms which extend our methods to a wide class of inference problems.more » « less
-
The female reproductive lifespan is highly dependent on egg quality, especially the presence of a normal number of chromosomes in an egg, known as euploidy. Mistakes in meiosis leading to egg aneuploidy are frequent in humans. Yet, knowledge of the precise genetic landscape that causes egg aneuploidy in women is limited, as phenotypic data on the frequency of human egg aneuploidy are difficult to obtain and therefore absent in public genetic datasets. Here, we identify genetic determinants of reproductive aging via egg aneuploidy in women using a biobank of individual maternal exomes linked with maternal age and embryonic aneuploidy data. Using the exome data, we identified 404 genes bearing variants enriched in individuals with pathologically elevated egg aneuploidy rates. Analysis of the gene ontology and protein–protein interaction network implicated genes encoding the kinesin protein family in egg aneuploidy. We interrogate the causal relationship of the human variants within candidate kinesin genes via experimental perturbations and demonstrate that motor domain variants increase aneuploidy in mouse oocytes. Finally, using a knock-in mouse model, we validate that a specific variant in kinesin KIF18A accelerates reproductive aging and diminishes fertility. These findings reveal additional functional mechanisms of reproductive aging and shed light on how genetic variation underlies individual heterogeneity in the female reproductive lifespan, which might be leveraged to predict reproductive longevity. Together, these results lay the groundwork for the noninvasive biomarkers for egg quality, a first step toward personalized fertility medicine.more » « lessFree, publicly-accessible full text available November 5, 2025
An official website of the United States government
