Abstract In this work, we address the question of how to enhance signal-agnostic searches by leveraging multiple testing strategies. Specifically, we consider hypothesis tests relying on machine learning, where model selection can introduce a bias towards specific families of new physics signals. Focusing on the New Physics Learning Machine, a methodology to perform a signal-agnostic likelihood-ratio test, we explore a number of approaches to multiple testing, such as combiningp-values and aggregating test statistics. Our findings show that it is beneficial to combine different tests, characterised by distinct choices of hyperparameters, and that performances comparable to the best available test are generally achieved, while also providing a more uniform response to various types of anomalies. This study proposes a methodology that is valid beyond machine learning approaches and could in principle be applied to a larger class model-agnostic analyses based on hypothesis testing. 
                        more » 
                        « less   
                    
                            
                            A generalized hypothesis test for community structure in networks
                        
                    
    
            Abstract Researchers theorize that many real-world networks exhibit community structure where within-community edges are more likely than between-community edges. While numerous methods exist to cluster nodes into different communities, less work has addressed this question: given some network, does it exhibitstatistically meaningfulcommunity structure? We answer this question in a principled manner by framing it as a statistical hypothesis test in terms of a general and model-agnostic community structure parameter. Leveraging this parameter, we propose a simple and interpretable test statistic used to formulate two separate hypothesis testing frameworks. The first is an asymptotic test against a baseline value of the parameter while the second tests against a baseline model using bootstrap-based thresholds. We prove theoretical properties of these tests and demonstrate how the proposed method yields rich insights into real-world datasets. 
        more » 
        « less   
        
    
                            - Award ID(s):
- 2413327
- PAR ID:
- 10590431
- Publisher / Repository:
- Cambridge University Press
- Date Published:
- Journal Name:
- Network Science
- Volume:
- 12
- Issue:
- 2
- ISSN:
- 2050-1242
- Page Range / eLocation ID:
- 122 to 138
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
- 
            
- 
            Abstract We develop alternative families of Bayes factors for use in hypothesis tests as alternatives to the popular default Bayes factors. The alternative Bayes factors are derived for the statistical analyses most commonly used in psychological research – one-sample and two-samplet tests, regression, and ANOVA analyses. They possess the same desirable theoretical and practical properties as the default Bayes factors and satisfy additional theoretical desiderata while mitigating against two features of the default priors that we consider implausible. They can be conveniently computed via an R package that we provide. Furthermore, hypothesis tests based on Bayes factors and those based on significance tests are juxtaposed. This discussion leads to the insight that default Bayes factors as well as the alternative Bayes factors are equivalent to test-statistic-based Bayes factors as proposed by Johnson.Journal of the Royal Statistical Society Series B: Statistical Methodology,67, 689–701. (2005). We highlight test-statistic-based Bayes factors as a general approach to Bayes-factor computation that is applicable to many hypothesis-testing problems for which an effect-size measure has been proposed and for which test power can be computed.more » « less
- 
            Abstract Species around the world are shifting their ranges in response to climate change. To make robust predictions about climate‐related colonizations and extinctions, it is vital to understand the dynamics of range edges. This study is among the first to examine annual dynamics of cold and warm range edges, as most global change studies average observational data over space or over time. We analyzed annual range edge dynamics of marine fishes—both at the individual species level and pooled into cold‐ and warm‐edge assemblages—in a multi‐decade time‐series of trawl surveys conducted on the Northeast US Shelf during a period of rapid warming. We tested whether cold edges show stronger evidence of climate tracking than warm edges (due to non‐climate processes or time lags at the warm edge; thebiogeography hypothesisorextinction debt hypothesis), or whether they tracked temperature change equally (due to the influence of habitat suitability; theecophysiology hypothesis). In addition to exploring correlations with regional temperature change, we calculated species‐ and assemblage‐specific sea bottom and sea surface temperature isotherms and used them to predict range edge position. Cold edges shifted further and tracked sea surface and bottom temperature isotherms to a greater degree than warm edges. Mixed‐effects models revealed that for a one‐degree latitude shift in isotherm position, cold edges shifted 0.47 degrees of latitude, and warm edges shifted only 0.28 degrees. Our results suggest that cold range edges are tracking climate change better than warm range edges, invalidating the ecophysiology hypothesis. We also found that even among highly mobile marine ectotherms in a global warming hotspot, few species are fully keeping pace with climate.more » « less
- 
            Abstract In this paper, we consider data consisting of multiple networks, each composed of a different edge set on a common set of nodes. Many models have been proposed for the analysis of suchmultiviewnetwork data under the assumption that the data views are closely related. In this paper, we provide tools for evaluating this assumption. In particular, we ask: given two networks that each follow a stochastic block model, is there an association between the latent community memberships of the nodes in the two networks? To answer this question, we extend the stochastic block model for a single network view to the two‐view setting, and develop a new hypothesis test for the null hypothesis that the latent community memberships in the two data views are independent. We apply our test to protein–protein interaction data from the HINT database. We find evidence of a weak association between the latent community memberships of proteins defined with respect to binary interaction data and the latent community memberships of proteins defined with respect to cocomplex association data. We also extend this proposal to the setting of a network with node covariates. The proposed methods extend readily to three or more network/multivariate data views.more » « less
- 
            Abstract Backing is a procedure for retouching a stone tool edge to an angle of or near 90°. Archaeologists have recorded backed lithic specimens in the Pleistocene and Holocene around the world. One prominent hypothesis for the occurrence of backing is that it increases a stone tool's adhesion relative to what it would have otherwise been with unmodified, sharp edges. We conducted a highly controlled semi‐static tensile test in which we assessed lithic specimens that possessed both a backed and a non‐backed edge, opposing each other. We hafted each specimen's backed and non‐backed edges to wood, and the bi‐hafted stone implement was then pulled apart using an Universal Instron Materials Tester, allowing for a direct ‘head‐to‐head’ comparison of the two edge types’ adhesive properties. Our tensile test results suggested no significant difference between backed and non‐backed edges in terms of adhesion, which does not support the hypothesis that backing increases a lithic specimen's adhesion.more » « less
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
 
                                    