 Award ID(s):
 1854336
 NSFPAR ID:
 10175450
 Date Published:
 Journal Name:
 Leibniz international proceedings in informatics
 Volume:
 164
 Issue:
 36th International Symposium on Computational Geometry (SoCG 2020)
 ISSN:
 18688969
 Page Range / eLocation ID:
 54:154:19
 Format(s):
 Medium: X
 Sponsoring Org:
 National Science Foundation
More Like this

We consider the topological and geometric reconstruction of a geodesic subspace of [Formula: see text] both from the Čech and VietorisRips filtrations on a finite, Hausdorffclose, Euclidean sample. Our reconstruction technique leverages the intrinsic length metric induced by the geodesics on the subspace. We consider the distortion and convexity radius as our sampling parameters for the reconstruction problem. For a geodesic subspace with finite distortion and positive convexity radius, we guarantee a correct computation of its homotopy and homology groups from the sample. This technique provides alternative sampling conditions to the existing and commonly used conditions based on weak feature size and [Formula: see text]–reach, and performs better under certain types of perturbations of the geodesic subspace. For geodesic subspaces of [Formula: see text], we also devise an algorithm to output a homotopy equivalent geometric complex that has a very small Hausdorff distance to the unknown underlying space.more » « less

null (Ed.)Let [Formula: see text] be a group acting properly and by isometries on a metric space [Formula: see text]; it follows that the quotient or orbit space [Formula: see text] is also a metric space. We study the Vietoris–Rips and Čech complexes of [Formula: see text]. Whereas (co)homology theories for metric spaces let the scale parameter of a Vietoris–Rips or Čech complex go to zero, and whereas geometric group theory requires the scale parameter to be sufficiently large, we instead consider intermediate scale parameters (neither tending to zero nor to infinity). As a particular case, we study the Vietoris–Rips and Čech thickenings of projective spaces at the first scale parameter where the homotopy type changes.more » « less

Abstract We study a family of invariants of compact metric spaces that combines the Curvature Sets defined by Gromov in the 1980 s with Vietoris–Rips Persistent Homology. For given integers
and$$k\ge 0$$ $k\ge 0$ we consider the dimension$$n\ge 1$$ $n\ge 1$k Vietoris–Rips persistence diagrams ofall subsets of a given metric space with cardinality at mostn . We call these invariantspersistence sets and denote them as . We first point out that this family encompasses the usual Vietoris–Rips diagrams. We then establish that (1) for certain range of values of the parameters$${\textbf{D}}_{n,k}^{\textrm{VR}}$$ ${D}_{n,k}^{\text{VR}}$n andk , computing these invariants is significantly more efficient than computing the usual Vietoris–Rips persistence diagrams, (2) these invariants have very good discriminating power and, in many cases, capture information that is imperceptible through standard Vietoris–Rips persistence diagrams, and (3) they enjoy stability properties analogous to those of the usual Vietoris–Rips persistence diagrams. We precisely characterize some of them in the case of spheres and surfaces with constant curvature using a generalization of Ptolemy’s inequality. We also identify a rich family of metric graphs for which fully recovers their homotopy type by studying splitmetric decompositions. Along the way we prove some useful properties of Vietoris–Rips persistence diagrams using Mayer–Vietoris sequences. These yield a geometric algorithm for computing the Vietoris–Rips persistence diagram of a space$${\textbf{D}}_{4,1}^{\textrm{VR}}$$ ${D}_{4,1}^{\text{VR}}$X with cardinality with quadratic time complexity as opposed to the much higher cost incurred by the usual algebraic algorithms relying on matrix reduction.$$2k+2$$ $2k+2$ 
Persistent Homology (PH) is a method of Topological Data Analysis that analyzes the topological structure of data to help data scientists infer relationships in the data to assist in informed decision making. A significant component in the computation of PH is the construction and use of a complex that represents the topological structure of the data. Some complex types are fast to construct but space inefficient whereas others are costly to construct and space efficient. Unfortunately, existing complex types are not both fast to construct and compact. This paper works to increase the scope of PH to support the computation of low dimensional homologies (H0 –H10 ) in highdimension, big data. In particular, this paper exploits the desirable properties of the Vietoris–Rips Complex (VRComplex) and the Delaunay Complex in order to construct a sparsified complex. The VRComplex uses a distance matrix to quickly generate a complex up to the desired homology dimension. In contrast, the Delaunay Complex works at the dimensionality of the data to generate a sparsified complex. While construction of the VRComplex is fast, its size grows exponentially by the size and dimension of the data set; in contrast, the Delaunay complex is significantly smaller for any given data dimension. However, its construction requires the computation of a Delaunay Triangulation that has high computational complexity. As a result, it is difficult to construct a Delaunay Complex for data in dimensions d > 6 that contains more than a few hundred points. The techniques in this paper enable the computation of topological preserving sparsification of kSimplices (where k ≪ d) to quickly generate a reduced sparsified complex sufficient to compute homologies up to ksubspace, irrespective of the data dimensionality d.more » « less

Copy number changes play an important role in the development of cancer and are commonly associated with changes in gene expression. Persistence curves, such as Betti curves, have been used to detect copy number changes; however, it is known these curves are unstable with respect to small perturbations in the data. We address the stability of lifespan and Betti curves by providing bounds on the distance between persistence curves of Vietoris–Rips filtrations built on data and slightly perturbed data in terms of the bottleneck distance. Next, we perform simulations to compare the predictive ability of Betti curves, lifespan curves (conditionally stable) and stable persistent landscapes to detect copy number aberrations. We use these methods to identify significant chromosome regions associated with the four major molecular subtypes of breast cancer: Luminal A, Luminal B, Basal and HER2 positive. Identified segments are then used as predictor variables to build machine learning models which classify patients as one of the four subtypes. We find that no single persistence curve outperforms the others and instead suggest a complementary approach using a suite of persistence curves. In this study, we identified new cytobands associated with three of the subtypes: 1q21.1q25.2, 2p23.2p16.3, 23q26.2q28 with the Basal subtype, 8p22p11.1 with Luminal B and 2q12.1q21.1 and 5p14.3p12 with Luminal A. These segments are validated by the TCGA BRCA cohort dataset except for those found for Luminal A.more » « less