skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Preparing for the next pandemic via transfer learning from existing diseases with hierarchical multi-modal BERT: a study on COVID-19 outcome prediction
Abstract Developing prediction models for emerging infectious diseases from relatively small numbers of cases is a critical need for improving pandemic preparedness. Using COVID-19 as an exemplar, we propose a transfer learning methodology for developing predictive models from multi-modal electronic healthcare records by leveraging information from more prevalent diseases with shared clinical characteristics. Our novel hierarchical, multi-modal model ($${\textsc {TransMED}}$$ T R A N S MED ) integrates baseline risk factors from the natural language processing of clinical notes at admission, time-series measurements of biomarkers obtained from laboratory tests, and discrete diagnostic, procedure and drug codes. We demonstrate the alignment of$${\textsc {TransMED}}$$ T R A N S MED ’s predictions with well-established clinical knowledge about COVID-19 through univariate and multivariate risk factor driven sub-cohort analysis.$${\textsc {TransMED}}$$ T R A N S MED ’s superior performance over state-of-the-art methods shows that leveraging patient data across modalities and transferring prior knowledge from similar disorders is critical for accurate prediction of patient outcomes, and this approach may serve as an important tool in the early response to future pandemics.  more » « less
Award ID(s):
1838730
PAR ID:
10368315
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ;
Publisher / Repository:
Nature Publishing Group
Date Published:
Journal Name:
Scientific Reports
Volume:
12
Issue:
1
ISSN:
2045-2322
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract We continue the program of proving circuit lower bounds via circuit satisfiability algorithms. So far, this program has yielded several concrete results, proving that functions in$$\mathsf {Quasi}\text {-}\mathsf {NP} = \mathsf {NTIME}[n^{(\log n)^{O(1)}}]$$ Quasi - NP = NTIME [ n ( log n ) O ( 1 ) ] and other complexity classes do not have small circuits (in the worst case and/or on average) from various circuit classes$$\mathcal { C}$$ C , by showing that$$\mathcal { C}$$ C admits non-trivial satisfiability and/or#SAT algorithms which beat exhaustive search by a minor amount. In this paper, we present a new strong lower bound consequence of having a non-trivial#SAT algorithm for a circuit class$${\mathcal C}$$ C . Say that a symmetric Boolean functionf(x1,…,xn) issparseif it outputs 1 onO(1) values of$${\sum }_{i} x_{i}$$ i x i . We show that for every sparsef, and for all “typical”$$\mathcal { C}$$ C , faster#SAT algorithms for$$\mathcal { C}$$ C circuits imply lower bounds against the circuit class$$f \circ \mathcal { C}$$ f C , which may bestrongerthan$$\mathcal { C}$$ C itself. In particular:#SAT algorithms fornk-size$$\mathcal { C}$$ C -circuits running in 2n/nktime (for allk) implyNEXPdoes not have$$(f \circ \mathcal { C})$$ ( f C ) -circuits of polynomial size.#SAT algorithms for$$2^{n^{{\varepsilon }}}$$ 2 n ε -size$$\mathcal { C}$$ C -circuits running in$$2^{n-n^{{\varepsilon }}}$$ 2 n n ε time (for someε> 0) implyQuasi-NPdoes not have$$(f \circ \mathcal { C})$$ ( f C ) -circuits of polynomial size. Applying#SAT algorithms from the literature, one immediate corollary of our results is thatQuasi-NPdoes not haveEMAJ∘ACC0∘THRcircuits of polynomial size, whereEMAJis the “exact majority” function, improving previous lower bounds againstACC0[Williams JACM’14] andACC0∘THR[Williams STOC’14], [Murray-Williams STOC’18]. This is the first nontrivial lower bound against such a circuit class. 
    more » « less
  2. Abstract In this work, we aim to accurately predict the number of hospitalizations during the COVID-19 pandemic by developing a spatiotemporal prediction model. We propose HOIST, an Ising dynamics-based deep learning model for spatiotemporal COVID-19 hospitalization prediction. By drawing the analogy between locations and lattice sites in statistical mechanics, we use the Ising dynamics to guide the model to extract and utilize spatial relationships across locations and model the complex influence of granular information from real-world clinical evidence. By leveraging rich linked databases, including insurance claims, census information, and hospital resource usage data across the U.S., we evaluate the HOIST model on the large-scale spatiotemporal COVID-19 hospitalization prediction task for 2299 counties in the U.S. In the 4-week hospitalization prediction task, HOIST achieves 368.7 mean absolute error, 0.6$${R}^{2}$$ R 2 and 0.89 concordance correlation coefficient score on average. Our detailed number needed to treat (NNT) and cost analysis suggest that future COVID-19 vaccination efforts may be most impactful in rural areas. This model may serve as a resource for future county and state-level vaccination efforts. 
    more » « less
  3. Abstract With the growing use of temporal logics in areas ranging from robot planning to runtime verification, it is critical that users have a clear understanding of what a specification means. Toward this end, we have been developing a catalog of semantic errors and a suite of test instruments targeting various user-groups. The catalog is of interest to educators, to logic designers, to formula authors, and to tool builders, e.g., to identify mistakes. The test instruments are suitable for classroom teaching or self-study. This paper reports on five sets of survey data collected over a three-year span. We study misconceptions about finite-trace$$\textsc {ltl}_{f}$$ L T L f in threeltl-aware audiences, and misconceptions about standardltlin novices. We find several mistakes, even among experts. In addition, the data supports several categories of errors in both$$\textsc {ltl}_{f}$$ L T L f andltlthat have not been identified in prior work. These findings, based on data from actual users, offer insights into whatspecific waystemporal logics are tricky and provide a groundwork for future interventions. 
    more » « less
  4. Measurements are presented of the cross-section for the central exclusive production ofJ/\psi\to\mu^+\mu^- J / ψ μ + μ and\psi(2S)\to\mu^+\mu^- ψ ( 2 S ) μ + μ processes in proton-proton collisions at\sqrt{s} = 13 \ \mathrm{TeV} s = 13 T e V with 2016–2018 data. They are performed by requiring both muons to be in the LHCb acceptance (with pseudorapidity2<\eta_{\mu^±} < 4.5 2 < η μ ± < 4.5 ) and mesons in the rapidity range2.0 < y < 4.5 2.0 < y < 4.5 . The integrated cross-section results are\sigma_{J/\psi\to\mu^+\mu^-}(2.0 σ J / ψ μ + μ ( 2.0 < y J / ψ < 4.5 , 2.0 < η μ ± < 4.5 ) = 400 ± 2 ± 5 ± 12 p b , σ ψ ( 2 S ) μ + μ ( 2.0 < y ψ ( 2 S ) < 4.5 , 2.0 < η μ ± < 4.5 ) = 9.40 ± 0.15 ± 0.13 ± 0.27 p b , where the uncertainties are statistical, systematic and due to the luminosity determination. In addition, a measurement of the ratio of\psi(2S) ψ ( 2 S ) andJ/\psi J / ψ cross-sections, at an average photon-proton centre-of-mass energy of1\ \mathrm{TeV} 1 T e V , is performed, giving$ = 0.1763 ± 0.0029 ± 0.0008 ± 0.0039,$$ where the first uncertainty is statistical, the second systematic and the third due to the knowledge of the involved branching fractions. For the first time, the dependence of theJ/\psi$ J / ψ and\psi(2S) ψ ( 2 S ) cross-sections on the total transverse momentum transfer is determined inpp p p collisions and is found consistent with the behaviour observed in electron-proton collisions. 
    more » « less
  5. Abstract We consider the following stochastic heat equation$$\begin{aligned} \partial _t u(t,x) = \tfrac{1}{2} \partial ^2_x u(t,x) + b(u(t,x)) + \sigma (u(t,x)) {\dot{W}}(t,x), \end{aligned}$$ t u ( t , x ) = 1 2 x 2 u ( t , x ) + b ( u ( t , x ) ) + σ ( u ( t , x ) ) W ˙ ( t , x ) , defined for$$(t,x)\in (0,\infty )\times {\mathbb {R}}$$ ( t , x ) ( 0 , ) × R , where$${\dot{W}}$$ W ˙ denotes space-time white noise. The function$$\sigma $$ σ is assumed to be positive, bounded, globally Lipschitz, and bounded uniformly away from the origin, and the functionbis assumed to be positive, locally Lipschitz and nondecreasing. We prove that the Osgood condition$$\begin{aligned} \int _1^\infty \frac{\textrm{d}y}{b(y)}<\infty \end{aligned}$$ 1 d y b ( y ) < implies that the solution almost surely blows up everywhere and instantaneously, In other words, the Osgood condition ensures that$$\textrm{P}\{ u(t,x)=\infty \quad \hbox { for all } t>0 \hbox { and } x\in {\mathbb {R}}\}=1.$$ P { u ( t , x ) = for all t > 0 and x R } = 1 . The main ingredients of the proof involve a hitting-time bound for a class of differential inequalities (Remark 3.3), and the study of the spatial growth of stochastic convolutions using techniques from the Malliavin calculus and the Poincaré inequalities that were developed in Chen et al. (Electron J Probab 26:1–37, 2021, J Funct Anal 282(2):109290, 2022). 
    more » « less