skip to main content


Title: The nonsmooth landscape of phase retrieval
Abstract

We consider a popular nonsmooth formulation of the real phase retrieval problem. We show that under standard statistical assumptions a simple subgradient method converges linearly when initialized within a constant relative distance of an optimal solution. Seeking to understand the distribution of the stationary points of the problem, we complete the paper by proving that as the number of Gaussian measurements increases, the stationary points converge to a codimension two set, at a controlled rate. Experiments on image recovery problems illustrate the developed algorithm and theory.

 
more » « less
Award ID(s):
1651851 1740551
NSF-PAR ID:
10131002
Author(s) / Creator(s):
 ;  ;  
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
IMA Journal of Numerical Analysis
ISSN:
0272-4979
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Stationary points embedded in the derivatives are often critical for a model to be interpretable and may be considered as key features of interest in many applications. We propose a semiparametric Bayesian model to efficiently infer the locations of stationary points of a nonparametric function, which also produces an estimate of the function. We use Gaussian processes as a flexible prior for the underlying function and impose derivative constraints to control the function's shape via conditioning. We develop an inferential strategy that intentionally restricts estimation to the case of at least one stationary point, bypassing possible mis-specifications in the number of stationary points and avoiding the varying dimension problem that often brings in computational complexity. We illustrate the proposed methods using simulations and then apply the method to the estimation of event-related potentials derived from electroencephalography (EEG) signals. We show how the proposed method automatically identifies characteristic components and their latencies at the individual level, which avoids the excessive averaging across subjects that is routinely done in the field to obtain smooth curves. By applying this approach to EEG data collected from younger and older adults during a speech perception task, we are able to demonstrate how the time course of speech perception processes changes with age.

     
    more » « less
  2. Abstract

    We study the behavior of solutions to the incompressible 2dEuler equations near two canonical shear flows with critical points, the Kolmogorov and Poiseuille flows, with consequences for the associated Navier–Stokes problems. We exhibit a large family of new, non-trivial stationary states that are arbitrarily close to the Kolmogorov flow on the square torus$$\mathbb {T}^2$$T2in analytic regularity. This situation contrasts strongly with the setting of some monotone shear flows, such as the Couette flow: there the linearized problem exhibits an “inviscid damping” mechanism that leads to relaxation of perturbations of the base flows back to nearby shear flows. Our results show that such a simple description of the long-time behavior is not possible for solutions near the Kolmogorov flow on$$\mathbb {T}^2$$T2. Our construction of the new stationary states builds on a degeneracy in the global structure of the Kolmogorov flow on$$\mathbb {T}^2$$T2, and we also show a lack of correspondence between the linearized description of the set of steady states and its true nonlinear structure. Both the Kolmogorov flow on a rectangular torus and the Poiseuille flow in a channel are very different. We show that the only stationary states near them must indeed be shears, even in relatively low regularity. In addition, we show that this behavior is mirrored closely in the related Navier–Stokes settings: the linearized problems near the Poiseuille and Kolmogorov flows both exhibit an enhanced rate of dissipation. Previous work by us and others shows that this effect survives in the full, nonlinear problem near the Poiseuille flow and near the Kolmogorov flow on rectangular tori, provided that the perturbations lie below a certain threshold. However, we show here that the corresponding result cannot hold near the Kolmogorov flow on$${\mathbb T}^2$$T2.

     
    more » « less
  3. Abstract

    Cumulative sum (CUSUM) statistics are widely used in the change point inference and identification. For the problem of testing for existence of a change point in an independent sample generated from the mean-shift model, we introduce a Gaussian multiplier bootstrap to calibrate critical values of the CUSUM test statistics in high dimensions. The proposed bootstrap CUSUM test is fully data dependent and it has strong theoretical guarantees under arbitrary dependence structures and mild moment conditions. Specifically, we show that with a boundary removal parameter the bootstrap CUSUM test enjoys the uniform validity in size under the null and it achieves the minimax separation rate under the sparse alternatives when the dimension p can be larger than the sample size n.

    Once a change point is detected, we estimate the change point location by maximising the ℓ∞-norm of the generalised CUSUM statistics at two different weighting scales corresponding to covariance stationary and non-stationary CUSUM statistics. For both estimators, we derive their rates of convergence and show that dimension impacts the rates only through logarithmic factors, which implies that consistency of the CUSUM estimators is possible when p is much larger than n. In the presence of multiple change points, we propose a principled bootstrap-assisted binary segmentation (BABS) algorithm to dynamically adjust the change point detection rule and recursively estimate their locations. We derive its rate of convergence under suitable signal separation and strength conditions.

    The results derived in this paper are non-asymptotic and we provide extensive simulation studies to assess the finite sample performance. The empirical evidence shows an encouraging agreement with our theoretical results.

     
    more » « less
  4. Abstract

    Electromechanical impedance-based (EMI) techniques using piezoelectric transducers are promising for structural damage identification. They can be implemented in high frequency range with small characteristic wavelengths, leading to high detection sensitivity. The impedance measured is the outcome of harmonic and stationary excitation, which makes it easier to conduct inverse analysis for damage localization and quantification. Nevertheless, the EMI data measurement points are usually limited, thus oftentimes resulting in an under-determined problem. To address this issue, damage identification process can be converted into a multi-objective optimization formulation which naturally yields multiple solutions. While this setup fits the nature of damage identification that a number of possibilities may exist under given observations/measurements, existing algorithms may suffer from premature convergence and entrapment in local extremes. Consequently, the solutions found may not cover the true damage scenario. To tackle these challenges, in this research, a series of local search strategies are tailored to enhance the global searching ability and incorporated into particle swarm-based optimization. The Q-table is utilized to help the algorithm select proper local search strategy based on the maximum Q-table values. Case studies are carried out for verification, and the results show that the proposed memetic algorithm achieves good performance in damage identification.

     
    more » « less
  5. We study a deep learning inspired formulation for the blind demodulation problem, which is the task of recovering two unknown vectors from their entrywise multiplication. We consider the case where the unknown vectors are in the range of known deep generative models, G(1):R^n→R^l and G(2):R^p→R^l. In the case when the networks corresponding to the generative models are expansive, the weight matrices are random and the dimension of the unknown vectors satisfy l=Omega(n^2+p^2), up to log factors, we show that the empirical risk objective has a favorable landscape for optimization. That is, the objective function has a descent direction at every point outside of a small neighborhood around four hyperbolic curves. We also characterize the local maximizers of the empirical risk objective and, hence, show that there does not exist any other stationary points outside of these neighborhood around four hyperbolic curves and the set of local maximizers. We also implement a gradient descent scheme inspired by the geometry of the landscape of the objective function. In order to converge to a global minimizer, this gradient descent scheme exploits the fact that exactly one of the hyperbolic curve corresponds to the global minimizer, and thus points near this hyperbolic curve have a lower objective value than points close to the other spurious hyperbolic curves. We show that this gradient descent scheme can effectively remove distortions synthetically introduced to the MNIST dataset. 
    more » « less