skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: On Learning Continuous Pairwise Markov Random Fields
We consider learning a sparse pairwise Markov Random Field (MRF) with continuous valued variables from i.i.d samples. We adapt the algorithm of Vuffray et al. (2019) to this setting and provide finite- sample analysis revealing sample complexity scaling logarithmically with the number of variables, as in the discrete and Gaussian settings. Our approach is applicable to a large class of pairwise MRFs with continuous variables and also has desirable asymptotic properties, including consistency and normality under mild conditions. Further, we establish that the population version of the optimization criterion employed by Vuffray et al. (2019) can be interpreted as local maximum likelihood estimation (MLE). As part of our analysis, we introduce a robust variation of sparse linear regression à la Lasso, which may be of interest in its own right.  more » « less
Award ID(s):
1816209
PAR ID:
10301907
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Proc. Int. Conf. Artif. Intell., Stat. (AISTATS-2021), Proc. Mach. Learn. Res. (PMLR)
Volume:
130
Page Range / eLocation ID:
1153 - 1161
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    We introduce the General Pairwise Model (GPM), a general parametric framework for pairwise comparison. Under the umbrella of the exponential family, the GPM unifies many pop- ular models with discrete observations, including the Thurstone (Case V), Berry-Terry-Luce (BTL) and Ordinal Models, along with models with continuous observations, such as the Gaussian Pairwise Cardinal Model. Using information theoretic techniques, we establish minimax lower bounds with tight topological dependence. When applied as a special case to the Ordinal Model, our results uniformly improve upon previously known lower bounds and confirms one direction of a conjecture put forth by Shah et al. (2016). Performance guarantees of the MLE for a broad class of GPMs with subgaussian assumptions are given and compared against our lower bounds, showing that in many natural settings the MLE is optimal up to constants. Matching lower and upper bounds (up to constants) are achieved by the Gaussian Pairwise Cardinal Model, suggesting that our lower bounds are best-possible under the few assumptions we adopt. 
    more » « less
  2. null (Ed.)
    We study how several published baryonic Tully-Fisher relations (BTFRs) fit a large sample of galaxies from the Arecibo legacy fast ALFA (ALFALFA) 21cm survey to determine which BTFR is a better template for calculating the distances and peculiar velocities of the ALFALFA galaxies. In particular, the BTFRs studied were those published by Papastergis et al. (2016) and Lelli et al. (2019). To do so, we first derived the rotational velocities and baryonic masses of a sub sample of galaxies with ”good” data, which make up 68% of the ALFALFA galaxies. We then calculated the best-fit line of the sample using fivedifferent fitting methods: (1) the ordinary least squares (OLS), (2) the maximum likelihood method assuming no intrinsic scatter as defined by Papastergis et al. (2016), (3) the maximum likelihood method with intrinsic scatter along the perpendicular direction (σ⊥,intr) also defined by Papastergis et al. (2016), (4) the maximum likelihood method assuming intrinsic scatter along the vertical direction (σy) as defined by Lelli et al. (2019)., and (5) the maximum likelihood method with intrinsic scatter along the perpendicular direction (σ⊥) also defined by Lelli et al. (2019). We find that fitting method (2) yields the steepest slope, 3.42, which agrees well with the values obtained in previous studies. We use this fit-line to compare with the two published BTFRs and determine that the BTFR derived by Papastergis et al.(2016) is the better template for calculating the distances and pecu-liar velocities of the ALFALFA catalog. This work was supported by NSF/AST-1714828 and by grants from the Brinson Foundation. 
    more » « less
  3. null (Ed.)
    We introduce a continuous analogue of the Learning with Errors (LWE) problem, which we name CLWE. We give a polynomial-time quantum reduction from worst-case lattice problems to CLWE, showing that CLWE enjoys similar hardness guarantees to those of LWE. Alternatively, our result can also be seen as opening new avenues of (quantum) attacks on lattice problems. Our work resolves an open problem regarding the computational complexity of learning mixtures of Gaussians without separability assumptions (Diakonikolas 2016, Moitra 2018). As an additional motivation, (a slight variant of) CLWE was considered in the context of robust machine learning (Diakonikolas et al. FOCS 2017), where hardness in the statistical query (SQ) model was shown; our work addresses the open question regarding its computational hardness (Bubeck et al. ICML 2019). 
    more » « less
  4. Samples for the analysis of dissolved nutrients were collected during the Multidisciplinary drifting Observatory for the Study of Arctic Climate (MOSAiC) from the water column, sea ice cores and from special events/locations (e.g., leads, melt ponds, brine, incubation experiments). Samples for dissolved inorganic nutrients (NO3 +NO2 , NO2 , PO4 , Si(OH)4, NH4 ) were analysed onboard during PS122 legs 1 to 3, with duplicate samples collected from CTD casts for later analysis of total dissolved nitrogen (TDN) and total dissolved phosphorus (TDP). From leg 4, all samples collected were stored frozen at -20°C for later analysis. Analyses of stored samples were carried out at the AWI Nutrient Facility between January and March 2021. Nutrient analyses onboard and on land were carried out using a Seal Analytical AA3 continuous flow autoanalyser, controlled by the AACE software version 7.09. Best practice procedures for the measurement of nutrients were adopted following GO-SHIP recommendations (Hydes et al., 2010; Becker et al., 2019). Descriptions of sample collection and handling can be found in the various cruise reports (Haas & Rabe, 2023; Kanzow & Damm, 2023; Rex & Metfies, 2023; Rex & Nicolaus, 2023; Rex & Shupe, 2023). Here we provide data from the water column, obtained from the analysis of discrete samples collected from CTD-Rosette casts from Polarstern (https://sensor.awi.de/?site=search&q=vessel:polarstern:ctd_sbe9plus_321) and Ocean City (https://sensor.awi.de/?site=search&q=vessel:polarstern:ctd_sbe9plus_935). Data from sea ice cores and special events are presented elsewhere. Data from sea ice cores and special events are presented elsewhere. For reference, here we included data from CTD-BTL files associated with nutrient samples. These data are presented by Tippenhauer et al. (2023) Polarstern CTD and Tippenhauer et al. (2023) Ocean City CTD. 
    more » « less
  5. Samples for the analysis of dissolved nutrients were collected during the Multidisciplinary drifting Observatory for the Study of Arctic Climate (MOSAiC) from the water column, sea ice cores and from special events/locations (e.g., leads, melt ponds, brine, incubation experiments). Samples for dissolved inorganic nutrients (NO3 +NO2 , NO2 , PO4 , Si(OH)4, NH4 ) were analysed onboard during PS122 legs 1 to 3, with duplicate samples collected from CTD casts for later analysis of total dissolved nitrogen (TDN) and total dissolved phosphorus (TDP). From leg 4, all samples collected were stored frozen at -20°C for later analysis. Analyses of stored samples were carried out at the AWI Nutrient Facility between January and March 2021. Nutrient analyses onboard and on land were carried out using a Seal Analytical AA3 continuous flow autoanalyser, controlled by the AACE software version 7.09. Best practice procedures for the measurement of nutrients were adopted following GO-SHIP recommendations (Hydes et al., 2010; Becker et al., 2019). Descriptions of sample collection and handling can be found in the various cruise reports (Haas & Rabe, 2023; Kanzow & Damm, 2023; Rex & Metfies, 2023; Rex & Nicolaus, 2023; Rex & Shupe, 2023). Here we provide data from the water column, obtained from the analysis of discrete samples collected from CTD-Rosette casts from Polarstern (https://sensor.awi.de/?site=search&q=vessel:polarstern:ctd_sbe9plus_321) and Ocean City (https://sensor.awi.de/?site=search&q=vessel:polarstern:ctd_sbe9plus_935). Data from sea ice cores and special events are presented elsewhere. Data from sea ice cores and special events are presented elsewhere. For reference, here we included data from CTD-BTL files associated with nutrient samples. These data are presented by Tippenhauer et al. (2023) Polarstern CTD and Tippenhauer et al. (2023) Ocean City CTD. 
    more » « less