skip to main content


Title: Shape-Based Classification of Partially Observed Curves, With Applications to Anthropology
We consider the problem of classifying curves when they are observed only partially on their parameter domains. We propose computational methods for (i) completion of partially observed curves; (ii) assessment of completion variability through a nonparametric multiple imputation procedure; (iii) development of nearest neighbor classifiers compatible with the completion techniques. Our contributions are founded on exploiting the geometric notion of shape of a curve, defined as those aspects of a curve that remain unchanged under translations, rotations and reparameterizations. Explicit incorporation of shape information into the computational methods plays the dual role of limiting the set of all possible completions of a curve to those with similar shape while simultaneously enabling more efficient use of training data in the classifier through shape-informed neighborhoods. Our methods are then used for taxonomic classification of partially observed curves arising from images of fossilized Bovidae teeth, obtained from a novel anthropological application concerning paleoenvironmental reconstruction.  more » « less
Award ID(s):
1812065 2015226 1839252 1740761
NSF-PAR ID:
10342977
Author(s) / Creator(s):
; ; ; ; ;
Date Published:
Journal Name:
Frontiers in Applied Mathematics and Statistics
Volume:
7
ISSN:
2297-4687
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    Understanding the properties of dust attenuation curves in galaxies and the physical mechanisms that shape them are among the fundamental questions of extragalactic astrophysics, with great practical significance for deriving the physical properties of galaxies. Attenuation curves result from a combination of dust grain properties, dust content, and the spatial arrangement of dust and different populations of stars. In this review, we assess the state of the field, paying particular attention to extinction curves as the building blocks of attenuation laws. We introduce a quantitative framework to characterize extinction and attenuation curves, present a theoretical foundation for interpreting empirical results, overview an array of observational methods, and review observational results at low and high redshifts. Our main conclusions include the following: ▪  Attenuation curves exhibit a wide range of UV-through-optical slopes, from curves with shallow (Milky Way–like) slopes to those exceeding the slope of the Small Magellanic Cloud extinction curve. ▪  The slopes of the curves correlate strongly with the effective optical opacities, in the sense that galaxies with lower dust column density (lower visual attenuation) tend to have steeper slopes, whereas the galaxies with higher dust column density have shallower (grayer) slopes. ▪  Galaxies exhibit a range of 2175-Å UV bump strengths, including no bump, but, on average, are suppressed compared with the average Milky Way extinction curve. ▪  Theoretical studies indicate that both the correlation between the slope and the dust column as well as variations in bump strength may result from geometric and radiative transfer effects. 
    more » « less
  2. Context. Standing and moving shocks in relativistic astrophysical jets are very promising sites for particle acceleration to large Lorentz factors and for the emission from the radio up to the γ -ray band. They are thought to be responsible for at least part of the observed variability in radio-loud active galactic nuclei. Aims. We aim to simulate the interactions of moving shock waves with standing recollimation shocks in structured and magnetized relativistic jets and to characterize the profiles of connected flares in the radio light curve. Methods. Using the relativistic magneto-hydrodynamic code MPI-AMRVAC and a radiative transfer code in post-processing, we explore the influence of the magnetic-field configuration and transverse stratification of an over-pressured jet on its morphology, on the moving shock dynamics, and on the emitted radio light curve. First, we investigate different large-scale magnetic fields with their effects on the standing shocks and on the stratified jet morphology. Secondly, we study the interaction of a moving shock wave with the standing shocks. We calculated the synthetic synchrotron maps and radio light curves and analyze the variability at two frequencies 1 and 15.3 GHz and for several observation angles. Finally, we compare the characteristics of our simulated light curves with radio flares observed from the blazar 3C 273 with the Owens Valley Radio Observatory and Very Long Baseline Array in the MOJAVE survey between 2008 and 2019. Results. We find that in a structured over-pressured relativistic jet, the presence of the large-scale magnetic field structure changes the properties of the standing shock waves and leads to an opening in the jet. The interaction between waves from inner and outer jet components can produce strong standing shocks. When crossing such standing shocks, moving shock waves accompanying overdensities injected in the base of the jet cause very luminous radio flares. The observation of the temporal structure of these flares under different viewing angles probes the jet at different optical depths. At 1 GHz and for small angles, the self-absorption caused by the moving shock wave becomes more important and leads to a drop in the observed flux after it interacts with the brightest standing knot. A weak asymmetry is seen in the shape of the simulated flares, resulting from the remnant emission of the shocked standing shocks. The characteristics of the simulated flares and the correlation of peaks in the light curve with the crossing of moving and standing shocks favor this scenario as an explanation of the observed radio flares of 3C 273. 
    more » « less
  3. Abstract

    Proliferation of high‐resolution imaging data in recent years has led to substantial improvements in the two popular approaches for analyzing shapes of data objects based on landmarks and/or continuous curves. We provide an expository account of elastic shape analysis of parametric planar curves representing shapes of two‐dimensional (2D) objects by discussing its differences, and its commonalities, to the landmark‐based approach. Particular attention is accorded to the role of reparameterization of a curve, which in addition to rotation, scaling and translation, represents an important shape‐preserving transformation of a curve. The transition to the curve‐based approach moves the mathematical setting of shape analysis from finite‐dimensional non‐Euclidean spaces to infinite‐dimensional ones. We discuss some of the challenges associated with the infinite‐dimensionality of the shape space, and illustrate the use of geometry‐based methods in the computation of intrinsic statistical summaries and in the definition of statistical models on a 2D imaging dataset consisting of mouse vertebrae. We conclude with an overview of the current state‐of‐the‐art in the field.

    This article is categorized under:

    Image and Spatial Data < Data: Types and Structure

    Computational Mathematics < Applications of Computational Statistics

     
    more » « less
  4. Gadekallu, Thippa Reddy (Ed.)
    As of March 30 2021, over 5,193 COVID-19 clinical trials have been registered through Clinicaltrial.gov. Among them, 191 trials were terminated, suspended, or withdrawn (indicating the cessation of the study). On the other hand, 909 trials have been completed (indicating the completion of the study). In this study, we propose to study underlying factors of COVID-19 trial completion vs . cessation, and design predictive models to accurately predict whether a COVID-19 trial may complete or cease in the future. We collect 4,441 COVID-19 trials from ClinicalTrial.gov to build a testbed, and design four types of features to characterize clinical trial administration, eligibility, study information, criteria, drug types, study keywords, as well as embedding features commonly used in the state-of-the-art machine learning. Our study shows that drug features and study keywords are most informative features, but all four types of features are essential for accurate trial prediction. By using predictive models, our approach achieves more than 0.87 AUC (Area Under the Curve) score and 0.81 balanced accuracy to correctly predict COVID-19 clinical trial completion vs . cessation. Our research shows that computational methods can deliver effective features to understand difference between completed vs . ceased COVID-19 trials. In addition, such models can also predict COVID-19 trial status with satisfactory accuracy, and help stakeholders better plan trials and minimize costs. 
    more » « less
  5. Registering functions (curves) using time warpings (re-parameterizations) is central to many computer vision and shape analysis solutions. While traditional registration methods minimize penalized-L2 norm, the elastic Riemannian metric and square-root velocity functions (SRVFs) have resulted in significant improvements in terms of theory and practical performance. This solution uses the dynamic programming algorithm to minimize the L2 norm between SRVFs of given functions. However, the computational cost of this elastic dynamic programming framework – O(nT 2 k) – where T is the number of time samples along curves, n is the number of curves, and k < T is a parameter – limits its use in applications involving big data. This paper introduces a deep-learning approach, named SRVF Registration Net or SrvfRegNet to overcome these limitations. SrvfRegNet architecture trains by optimizing the elastic metric-based objective function on the training data and then applies this trained network to the test data to perform fast registration. In case the training and the test data are from different classes, it generalizes to the test data using transfer learning, i.e., retraining of only the last few layers of the network. It achieves the state-of-the-art alignment performance albeit at much reduced computational cost. We demonstrate the efficiency and efficacy of this framework using several standard curve datasets. 
    more » « less