skip to main content

Title: Multi-scale mining of kinematic distributions with wavelets
Typical LHC analyses search for local features in kinematicdistributions. Assumptions about anomalous patterns limit them to arelatively narrow subset of possible signals. Wavelets extractinformation from an entire distribution and decompose it at all scales,simultaneously searching for features over a wide range of scales. Wepropose a systematic wavelet analysis and show how bumps, bump-dipcombinations, and oscillatory patterns are extracted. Our kinematicwavelet analysis kit KWAK provides a publicly available framework toanalyze and visualize general distributions.
; ; ;
Award ID(s):
Publication Date:
Journal Name:
SciPost Physics
Sponsoring Org:
National Science Foundation
More Like this
  1. We develop Categorical Exploratory Data Analysis (CEDA) with mimicking to explore and exhibit the complexity of information content that is contained within any data matrix: categorical, discrete, or continuous. Such complexity is shown through visible and explainable serial multiscale structural dependency with heterogeneity. CEDA is developed upon all features’ categorical nature via histogram and it is guided by all features’ associative patterns (order-2 dependence) in a mutual conditional entropy matrix. Higher-order structural dependency of k(≥3) features is exhibited through block patterns within heatmaps that are constructed by permuting contingency-kD-lattices of counts. By growing k, the resultant heatmap series contains global and large scales of structural dependency that constitute the data matrix’s information content. When involving continuous features, the principal component analysis (PCA) extracts fine-scale information content from each block in the final heatmap. Our mimicking protocol coherently simulates this heatmap series by preserving global-to-fine scales structural dependency. Upon every step of mimicking process, each accepted simulated heatmap is subject to constraints with respect to all of the reliable observed categorical patterns. For reliability and robustness in sciences, CEDA with mimicking enhances data visualization by revealing deterministic and stochastic structures within each scale-specific structural dependency. For inferences in Machine Learning (ML)more »and Statistics, it clarifies, upon which scales, which covariate feature-groups have major-vs.-minor predictive powers on response features. For the social justice of Artificial Intelligence (AI) products, it checks whether a data matrix incompletely prescribes the targeted system.« less
  2. Self-organized pattern behavior is ubiquitous throughout nature, from fish schooling to collective cell dynamics during organism development. Qualitatively these patterns display impressive consistency, yet variability inevitably exists within pattern-forming systems on both microscopic and macroscopic scales. Quantifying variability and measuring pattern features can inform the underlying agent interactions and allow for predictive analyses. Nevertheless, current methods for analyzing patterns that arise from collective behavior capture only macroscopic features or rely on either manual inspection or smoothing algorithms that lose the underlying agent-based nature of the data. Here we introduce methods based on topological data analysis and interpretable machine learning for quantifying both agent-level features and global pattern attributes on a large scale. Because the zebrafish is a model organism for skin pattern formation, we focus specifically on analyzing its skin patterns as a means of illustrating our approach. Using a recent agent-based model, we simulate thousands of wild-type and mutant zebrafish patterns and apply our methodology to better understand pattern variability in zebrafish. Our methodology is able to quantify the differential impact of stochasticity in cell interactions on wild-type and mutant patterns, and we use our methods to predict stripe and spot statistics as a function of varying cellular communication.more »Our work provides an approach to automatically quantifying biological patterns and analyzing agent-based dynamics so that we can now answer critical questions in pattern formation at a much larger scale.« less
  3. Microbial biogeography in terrestrial and freshwater ecosystems is mainly dominated by community biofilm lifestyles. Here, we describe applications of computer-assisted microscopy using CMEIAS (Center for Microbial Ecology Image Analysis System) bioimage informatics software for a comprehensive analysis of river biofilm architectures and ecology. Natural biofilms were developed for four summer days on microscope slides of plain borosilicate glass and transparent polystyrene submerged in the Red Cedar River that flows through the Michigan State University campus. Images of the biofilm communities were acquired using brightfield and phase-contrast microscopy at spatial resolutions revealing details of microcolonies and individual cells, then digitally segmented to the foreground objects of interest. Phenotypic features of their size, abundance, surface texture, contour morphology, fractal geometry, ecophysiology, and landscape/spatial ecology were digitally extracted and evaluated by many discriminating statistical tests. The results indicate that river biofilm architecture exhibits significant geospatial structure in situ, providing many insights on the strong influence that substratum hydrophobicity–wettability exert on biofilm development and ecology, including their productivity and colonization intensity, morphological diversity/dominance/conditional rarity, nutrient apportionment/uptake efficiency/utilization, allometry/metabolic activity, responses to starvation and bacteriovory stresses, spatial patterns of distribution/dispersion/connectivity, and interpolated autocorrelations of cooperative/conflicting cell–cell interactions at real-world spatial scales directly relevant to theirmore »ecological niches. The significant impact of substratum physicochemistry was revealed for biofilms during their early immature stage of development in the river ecosystem. Bioimage informatics can fill major gaps in understanding the geomicrobiology and microbial ecology of biofilms in situ when examined at spatial scales suitable for phenotypic analysis at microcolony and single-cell resolutions.« less
  4. Abstract Coral reef fishes constitute one of the most diverse assemblages of vertebrates on the planet. Color patterns are known to serve a number of functions including intra- and inter-specific signaling, camouflage, mimicry, and defense. However, the relative importance of these and other factors in shaping color pattern evolution is poorly understood. Here we conduct a comparative phylogenetic analysis of color pattern evolution in the butterflyfishes (Chaetodontidae). Using recently developed tools for quantifying color pattern geometry as well as machine learning approaches, we investigate the tempo of evolution of color pattern elements and test whether ecological variables relating to defense, depth, and social behavior predict color pattern evolution. Butterflyfishes exhibit high diversity in measures of chromatic conspicuousness and the degrees of fine versus gross scale color patterning. Surprisingly, most diversity in color pattern was not predicted by any of the measures of ecology in our study, although we did find a significant but weak relationship between the level of fine scale patterning and some aspects of defensive morphology. We find that the tempo of color pattern diversification in butterflyfishes has increased toward the present and suggest that rapid evolution, presumably in response to evolutionary pressures surrounding speciation and lineage divergence,more »has effectively decoupled color pattern geometry from some aspects of ecology. Machine learning classification of color pattern appears to rely on a set of features that are weakly correlated with current color pattern geometry descriptors, but that may be better suited for the detection of discrete components of color pattern. A key challenge for future studies lies in determining whether rapid evolution has generally decoupled color patterns from ecology, or whether convergence in function produces convergence in color pattern at phylogenetic scales.« less
  5. Abstract

    The extent to which the eddy statistics of the Martian atmosphere can be inferred from the mean state and highly simplified assumptions about diabatic and frictional processes is investigated using an idealized general circulation model (GCM) with Newtonian relaxation thermal forcing. An iterative technique, adapted from previous terrestrial studies, is used to generate radiative equilibrium temperatures such that the three-dimensional time-mean temperature fields of the idealized model match means computed from the Mars Analysis Correction Data Assimilation (MACDA). Focusing on a period of strong Northern Hemisphere eddy activity prior to winter solstice, it is found that the idealized model reproduces some key features of the spatial patterns of the MACDA eddy temperature variance and kinetic energy fields. The idealized model can also simulate aspects of MACDA’s seasonal cycle of spatial patterns of low-level eddy meridional wind and temperature variances. The most notable weakness of the model is its eddy amplitudes—both their absolute values and seasonal variations are quite unrealistic, for reasons unclear. The idealized model was also run with a mean flow based on output from the Geophysical Fluid Dynamics Laboratory (GFDL) full-physics Mars GCM. The idealized model captures the difference in mean flows between MACDA and the GFDLmore »Mars GCM and reproduces a bias in the more complex model’s eddy zonal wavenumber distribution. This implies that the mean flow is an important influence on transient eddy wavenumbers and that improving the GFDL Mars GCM’s mean flow would make its eddy scales more realistic.

    « less