skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Social Media based Simulation Models for Understanding Disease Dynamics
In this modern era, infectious diseases, such as H1N1, SARS, and Ebola, are spreading much faster than any time in history. Efficient approaches are therefore desired to monitor and track the diffusion of these deadly epidemics. Traditional computational epidemiology models are able to capture the disease spreading trends through contact network, however, one unable to provide timely updates via real-world data. In contrast, techniques focusing on emerging social media platforms can collect and monitor real-time disease data, but do not provide an understanding of the underlying dynamics of ailment propagation. To achieve efficient and accurate real-time disease prediction, the framework proposed in this paper combines the strength of social media mining and computational epidemiology. Specifically, individual health status is first learned from user's online posts through Bayesian inference, disease parameters are then extracted for the computational models at population-level, and the outputs of computational epidemiology model are inversely fed into social media data based models for further performance improvement. In various experiments, our proposed model outperforms current disease forecasting approaches with better accuracy and more stability.  more » « less
Award ID(s):
1646881 1707498 1619028
PAR ID:
10066664
Author(s) / Creator(s):
; ; ; ; ; ;
Date Published:
Journal Name:
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence
Page Range / eLocation ID:
3797 to 3804
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Social media has become an indispensable resource in disaster response, providing real-time crowdsourced data on public experiences, needs, and conditions during crises. This user-generated content enables government agencies and emergency responders to identify emerging threats, prioritize resource allocation, and optimize relief operations through data-driven insights. We present an AI-powered framework that combines natural language processing with geospatial visualization to analyze disaster-related social media content. Our solution features a text analysis model that achieved an 81.4% F1 score in classifying Twitter/X posts, integrated with an interactive web platform that maps emotional trends and crisis situations across geographic regions. The system’s dynamic visualization capabilities allow authorities to monitor situational developments through an interactive map, facilitating targeted response coordination. The experimental results show the model’s effectiveness in extracting actionable intelligence from Twitter/X posts during natural disasters. 
    more » « less
  2. Opioid addiction constitutes a significant contemporary health crisis that is multifarious in its complexity. Modeling the epidemiology of any addiction is challenging in its own right. For opioid addiction, the challenge is exacerbated due to the difficulties in collecting real-time data and the circumscribed nature of information opioid users may disclose owing to stigma associated with prescription misuse. Given this context, identifying the progression of individuals through the stages of (opioid) addiction is one of the more acute problems in epidemiological modeling whose solution is crucial for designing specific interventions at both personal and population levels. We describe a computational approach for determining and characterizing addiction stages of opioid users from their social media posts. The proposed approach combines recurrent neural network learning with information-theoretic analysis of word-associations and context-based word embedding to determine addiction stage-specific language usage. Users who have a high likelihood for relapsing back to drug-use are identified and characterized using propensity score matching and logistic regression. Experimental evaluations indicate that the proposed approach can distinguish between various addiction stages and identify users prone to relapse with high accuracy as evidenced by F1 scores of 0.88 and 0.79 respectively 
    more » « less
  3. Agent-based models provide a flexible framework that is frequently used for modelling many biological systems, including cell migration, molecular dynamics, ecology and epidemiology. Analysis of the model dynamics can be challenging due to their inherent stochasticity and heavy computational requirements. Common approaches to the analysis of agent-based models include extensive Monte Carlo simulation of the model or the derivation of coarse-grained differential equation models to predict the expected or averaged output from the agent-based model. Both of these approaches have limitations, however, as extensive computation of complex agent-based models may be infeasible, and coarse-grained differential equation models can fail to accurately describe model dynamics in certain parameter regimes. We propose that methods from the equation learning field provide a promising, novel and unifying approach for agent-based model analysis. Equation learning is a recent field of research from data science that aims to infer differential equation models directly from data. We use this tutorial to review how methods from equation learning can be used to learn differential equation models from agent-based model simulations. We demonstrate that this framework is easy to use, requires few model simulations, and accurately predicts model dynamics in parameter regions where coarse-grained differential equation models fail to do so. We highlight these advantages through several case studies involving two agent-based models that are broadly applicable to biological phenomena: a birth–death–migration model commonly used to explore cell biology experiments and a susceptible–infected–recovered model of infectious disease spread. 
    more » « less
  4. The unprecedented rise of social media platforms, combined with location-aware technologies, has led to continuously producing a significant amount of geo-social data that flows as a user-generated data stream. This data has been exploited in several important use cases in various application domains. This article supports geo-social personalized queries in streaming data environments. We define temporal geo-social queries that provide users with real-time personalized answers based on their social graph. The new queries allow incorporating keyword search to get personalized results that are relevant to certain topics. To efficiently support these queries, we propose an indexing framework that provides lightweight and effective real-time indexing to digest geo-social data in real time. The framework distinguishes highly dynamic data from relatively stable data and uses appropriate data structures and a storage tier for each. Based on this framework, we propose a novel geo-social index and adopt two baseline indexes to support the addressed queries. The query processor then employs different types of pruning to efficiently access the index content and provide a real-time query response. The extensive experimental evaluation based on real datasets has shown the superiority of our proposed techniques to index real-time data and provide low-latency queries compared to existing competitors. 
    more » « less
  5. Kolawole, Olatunji Matthew (Ed.)
    Persons living with human immunodeficiency virus (HIV) have a disproportionately higher burden of human papillomavirus infection (HPV)-related cancers. Causal factors include both behavioral and biological. While pharmaceutical and care support interventions help address biological risk of coinfection, as social conditions are common drivers of behaviors, structural interventions are key part of behavioral interventions. Our objective is to develop a joint HIV-HPV model to evaluate the contribution of each factor, to subsequently inform intervention analyses. While compartmental modeling is sufficient for faster spreading HPV, network modeling is suitable for slower spreading HIV. However, using network modeling for jointly modeling HIV and HPV can generate computational complexities given their vastly varying disease epidemiology and disease burden across sub-population groups. We applied a recently developed mixed agent-based compartmental (MAC) simulation technique, which simulates persons with at least one slower spreading disease and their immediate contacts as agents in a network, and all other persons including those with faster spreading diseases in a compartmental model, with an evolving contact network algorithm maintaining the dynamics between the two models. We simulated HIV and HPV in the U.S. among heterosexual female, heterosexual male, and men who have sex with men (men only and men and women) (MSM), sub-populations that mix but have varying HIV burden, and cervical cancer among women. We conducted numerical analyses to evaluate the contribution of behavioral and biological factors to risk of cervical cancer among women with HIV. The model outputs for HIV, HPV, and cervical cancer compared well with surveillance estimates. Model estimates for relative prevalence of HPV (1.67 times) and relative incidence of cervical cancer (3.6 times), among women with HIV compared to women without, were also similar to that reported in observational studies in the literature. The fraction attributed to biological factors ranged from 22–38% for increased HPV prevalence and 80% for increased cervical cancer incidence, the remaining attributed to behavioral. The attribution of both behavioral and biological factors to increased HPV prevalence and cervical cancer incidence suggest the need for behavioral, structural, and pharmaceutical interventions. Validity of model results related to both individual and joint disease metrics serves as proof-of-concept of the MAC simulation technique. Understanding the contribution of behavioral and biological factors of risk helps inform interventions. Future work can expand the model to simulate sexual and care behaviors as functions of social conditions to jointly evaluate behavioral, structural, and pharmaceutical interventions for HIV and cervical cancer prevention. 
    more » « less