skip to main content


Title: Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data
Award ID(s):
1717916
NSF-PAR ID:
10314807
Author(s) / Creator(s):
; ; ; ;
Date Published:
Journal Name:
Annual Meeting of the Association for Computational Linguistics
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract The world’s coastlines are spatially highly variable, coupled-human-natural systems that comprise a nested hierarchy of component landforms, ecosystems, and human interventions, each interacting over a range of space and time scales. Understanding and predicting coastline dynamics necessitates frequent observation from imaging sensors on remote sensing platforms. Machine Learning models that carry out supervised (i.e., human-guided) pixel-based classification, or image segmentation, have transformative applications in spatio-temporal mapping of dynamic environments, including transient coastal landforms, sediments, habitats, waterbodies, and water flows. However, these models require large and well-documented training and testing datasets consisting of labeled imagery. We describe “Coast Train,” a multi-labeler dataset of orthomosaic and satellite images of coastal environments and corresponding labels. These data include imagery that are diverse in space and time, and contain 1.2 billion labeled pixels, representing over 3.6 million hectares. We use a human-in-the-loop tool especially designed for rapid and reproducible Earth surface image segmentation. Our approach permits image labeling by multiple labelers, in turn enabling quantification of pixel-level agreement over individual and collections of images. 
    more » « less
  2. null (Ed.)
    The synthesis of poly(propylene carbonate) with 100% 13C-labeled carbonate carbons is reported. The 𝑣CO3 vibration is shifted 45 cm−1 lower than that observed in its 12C analog. This lowering of the 𝑣CO3 vibrational mode of the copolymer provides a window for observing νNO stretching motions in incorporated dinitrosyl iron complexes. 
    more » « less
  3. We demonstrate a host-guest molecular recognition approach to advance double electron-electron resonance (DEER) distance measurements of spin-labeled proteins. We synthesized an iodoacetamide (IA) derivative of 2,6-diazaadamantane nitroxide (DZD) spin label that could be doubly incorporated into T4 Lysozyme (T4L) by site-directed spin labeling (SDSL) with efficiency up to 50% per cysteine. The rigidity of the fused ring structure and absence of mobile methyl groups increase the spin echo dephasing time (Tm) at temperatures above 80 K. This enables DEER measurements of distances >4 nm in DZD labeled-T4L in glycerol/water at temperatures up to 150 K, with increased sensitivity compared to common spin label such as MTSL. Addition of β-cyclodextrin (β-CD) reduces the rotational correlation time of the label, slightly increases Tm, and most importantly, narrows (and slightly lengthens) the inter-spin distance distributions. The distance distributions are in good agreement with simulated distance distributions obtained by rotamer libraries. These results provide a foundation for developing supramolecular recognition to facilitate long-distance DEER measurements at near physiological temperatures. 
    more » « less