A<sc>bstract</sc> Inspired by the structure of top-down derived models endowed with modular flavor symmetries, we investigate the yet phenomenologically unexplored binary dihedral group 2D3. After building the vector-valued modular forms in the representations of 2D3with small modular weights, we systematically classify all (Dirac and Majorana) mass textures of fermions with fractional modular weights and all possible 2 + 1-family structures. This allows us to explore the parameter space of fermion models based on 2D3, aiming at a description of both quarks and leptons with a minimal number of parameters and best compatibility with observed data. We consider the separate possibilities of neutrino masses generated by either a type-I seesaw mechanism or the Weinberg operator. We identify a model that, besides fitting all known flavor observables, delivers predictions for six not-yet measured parameters and favors normal-ordered neutrino masses generated by the Weinberg operator. It would be interesting to figure out whether it is possible to embed our model within a top-down scheme, such as$${\mathbb{T}}^{2}/{\mathbb{Z}}_{4}$$heterotic orbifold compactifications.
more »
« less
Identifiability of Product of Experts Models
Product of experts (PoE) are layered networks in which the value at each node is an AND (or product) of the values (possibly negated) at its inputs. These were introduced as a neural network architecture that can efficiently learn to generate high-dimensional data which satisfy many low-dimensional constraints---thereby allowing each individual expert to perform a simple task. PoEs have found a variety of applications in learning. We study the problem of identifiability of a product of experts model having a layer of binary latent variables, and a layer of binary observables that are iid conditional on the latents. The previous best upper bound on the number of observables needed to identify the model was exponential in the number of parameters. We show: (a) When the latents are uniformly distributed, the model is identifiable with a number of observables equal to the number of parameters (and hence best possible). (b) In the more general case of arbitrarily distributed latents, the model is identifiable for a number of observables that is still linear in the number of parameters (and within a factor of two of best-possible). The proofs rely on root interlacing phenomena for some special three-term recurrences.
more »
« less
- Award ID(s):
- 2321079
- PAR ID:
- 10508258
- Publisher / Repository:
- Proceedings of Machine Learning Research
- Date Published:
- Journal Name:
- Proceedings of Machine Learning Research
- Volume:
- 238
- ISSN:
- 2640-3498
- Format(s):
- Medium: X
- Location:
- https://proceedings.mlr.press/v238/kant24a.html
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
An important problem across multiple disciplines is to infer and understand meaningful latent variables. One strategy commonly used is to model the measured variables in terms of the latent variables under suitable assumptions on the connectivity from the latents to the measured (known as measurement model). Furthermore, it might be even more interesting to discover the causal relations among the latent variables (known as structural model). Recently, some methods have been proposed to estimate the structural model by assuming that the noise terms in the measured and latent variables are non-Gaussian. However, they are not suitable when some of the noise terms become Gaussian. To bridge this gap, we investigate the problem of identification of the structural model with arbitrary noise distributions. We provide necessary and sufficient condition under which the structural model is identifiable: it is identifiable iff for each pair of adjacent latent variables Lx, Ly, (1) at least one of Lx and Ly has non-Gaussian noise, or (2) at least one of them has a non-Gaussian ancestor and is not d-separated from the non-Gaussian component of this ancestor by the common causes of Lx and Ly. This identifiability result relaxes the non-Gaussianity requirements to only a (hopefully small) subset of variables, and accordingly elegantly extends the application scope of the structural model. Based on the above identifiability result, we further propose a practical algorithm to learn the structural model. We verify the correctness of the identifiability result and the effectiveness of the proposed method through empirical studies.more » « less
-
Information about the privacy and security of Internet of Things (IoT) devices is not readily available to consumers who want to consider it before making purchase decisions. While legislators have proposed adding succinct, consumer accessible, labels, they do not provide guidance on the content of these labels. In this paper, we report on the results of a series of interviews and surveys with privacy and security experts, as well as consumers, where we explore and test the design space of the content to include on an IoT privacy and security label. We conduct an expert elicitation study by following a three-round Delphi process with 22 privacy and security experts to identify the factors that experts believed are important for consumers when comparing the privacy and security of IoT devices to inform their purchase decisions. Based on how critical experts believed each factor is in conveying risk to consumers, we distributed these factors across two layers—a primary layer to display on the product package itself or prominently on a website, and a secondary layer available online through a web link or a QR code. We report on the experts’ rationale and arguments used to support their choice of factors. Moreover, to study how consumers would perceive the privacy and security information specified by experts, we conducted a series of semi-structured interviews with 15 participants, who had purchased at least one IoT device (smart home device or wearable). Based on the results of our expert elicitation and consumer studies, we propose a prototype privacy and security label to help consumers make more informed IoTrelated purchase decisions.more » « less
-
The simultaneous excitation and measurement of two eigenmodes in bimodal atomic force microscopy (AFM) during sub-micron scale surface imaging augments the number of observables at each pixel of the image compared to the normal tapping mode. However, a comprehensive connection between the bimodal AFM observables and the surface adhesive and viscoelastic properties of polymer samples remains elusive. To address this gap, we first propose an algorithm that systematically accommodates surface forces and linearly viscoelastic three-dimensional deformation computed via Attard's model into the bimodal AFM framework. The proposed algorithm simultaneously satisfies the amplitude reduction formulas for both resonant eigenmodes and enables the rigorous prediction and interpretation of bimodal AFM observables with a first-principles approach. We used the proposed algorithm to predict the dependence of bimodal AFM observables on local adhesion and standard linear solid (SLS) constitutive parameters as well as operating conditions. Secondly, we present an inverse method to quantitatively predict the local adhesion and SLS viscoelastic parameters from bimodal AFM data acquired on a heterogeneous sample. We demonstrate the method experimentally using bimodal AFM on polystyrene-low density polyethylene (PS-LDPE) polymer blend. This inverse method enables the quantitative discrimination of adhesion and viscoelastic properties from bimodal AFM maps of such samples and opens the door for advanced computational interaction models to be used to quantify local nanomechanical properties of adhesive, viscoelastic materials using bimodal AFM.more » « less
-
Parameter estimation from observable or experimental data is a crucial stage in any modeling study. Identifiability refers to one’s ability to uniquely estimate the model parameters from the available data. Structural unidentifiability in dynamic models, the opposite of identifiability, is associated with the notion of degeneracy where multiple parameter sets produce the same pattern. Therefore, the inverse function of determining the model parameters from the data is not well defined. Degeneracy is not only a mathematical property of models, but it has also been reported in biological experiments. Classical studies on structural unidentifiability focused on the notion that one can at most identify combinations of unidentifiable model parameters. We have identified a different type of structural degeneracy/unidentifiability present in a family of models, which we refer to as the Lambda-Omega (Λ-Ω) models. These are an extension of the classical lambda-omega (λ-ω) models that have been used to model biological systems, and display a richer dynamic behavior and waveforms that range from sinusoidal to square wave to spike like. We show that the Λ-Ω models feature infinitely many parameter sets that produce identical stable oscillations, except possible for a phase shift (reflecting the initial phase). These degenerate parameters are not identifiable combinations of unidentifiable parameters as is the case in structural degeneracy. In fact, reducing the number of model parameters in the Λ-Ω models is minimal in the sense that each one controls a different aspect of the model dynamics and the dynamic complexity of the system would be reduced by reducing the number of parameters. We argue that the family of Λ-Ω models serves as a framework for the systematic investigation of degeneracy and identifiability in dynamic models and for the investigation of the interplay between structural and other forms of unidentifiability resulting on the lack of information from the experimental/observational data.more » « less