skip to main content


Title: Similarity analysis of federal reserve statements using document embeddings: the Great Recession vs. COVID-19
Abstract

The coronavirus pandemic has already caused plenty of severe problems for humanity and the economy. The exact impact of the COVID-19 pandemic is still unknown, and economists and financial advisers are exploring all possible scenarios to mitigate the risks arising from the pandemic. An intriguing question is whether this pandemic and its impacts are similar, and to what extent, to any other catastrophic events that occurred in the past, such as the 2009 Great Recession. This paper intends to address this problem by analyzing official public announcements and statements issued by federal authorities such as the Federal Reserve. More specifically, we measure similarities of consecutive statements issued by the Federal Reserve during the 2009 Great Recession and the COVID-19 pandemic using natural language processing techniques. Furthermore, we explore the usage of document embedding representations of the statements in a more complex task: clustering. Our analysis shows that, using an advanced NLP technique in document embedding such as Doc2Vec, we can detect a difference of 10.8% in similarities of Federal Open Market Committee (FOMC) statements issued during the Great Recession (2007–2009) and the COVID-19 pandemic. Finally, the results of our clustering exercise show that the document embeddings representations of the statements are suitable for more complex tasks, which provides a basis for future applications of state-of-the-art natural language processing techniques using the FOMC post-meeting statements as the dataset.

 
more » « less
NSF-PAR ID:
10368506
Author(s) / Creator(s):
; ; ;
Publisher / Repository:
Springer Science + Business Media
Date Published:
Journal Name:
SN Business & Economics
Volume:
2
Issue:
7
ISSN:
2662-9399
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Background

    The COVID-19 pandemic has resulted in heightened levels of depression, anxiety, and other mental health issues due to sudden changes in daily life, such as economic stress, social isolation, and educational irregularity. Accurately assessing emotional and behavioral changes in response to the pandemic can be challenging, but it is essential to understand the evolving emotions, themes, and discussions surrounding the impact of COVID-19 on mental health.

    Objective

    This study aims to understand the evolving emotions and themes associated with the impact of COVID-19 on mental health support groups (eg, r/Depression and r/Anxiety) on Reddit (Reddit Inc) during the initial phase and after the peak of the pandemic using natural language processing techniques and statistical methods.

    Methods

    This study used data from the r/Depression and r/Anxiety Reddit communities, which consisted of posts contributed by 351,409 distinct users over a period spanning from 2019 to 2022. Topic modeling and Word2Vec embedding models were used to identify key terms associated with the targeted themes within the data set. A range of trend and thematic analysis techniques, including time-to-event analysis, heat map analysis, factor analysis, regression analysis, and k-means clustering analysis, were used to analyze the data.

    Results

    The time-to-event analysis revealed that the first 28 days following a major event could be considered a critical window for mental health concerns to become more prominent. The theme trend analysis revealed key themes such as economic stress, social stress, suicide, and substance use, with varying trends and impacts in each community. The factor analysis highlighted pandemic-related stress, economic concerns, and social factors as primary themes during the analyzed period. Regression analysis showed that economic stress consistently demonstrated the strongest association with the suicide theme, whereas the substance theme had a notable association in both data sets. Finally, the k-means clustering analysis showed that in r/Depression, the number of posts related to the “depression, anxiety, and medication” cluster decreased after 2020, whereas the “social relationships and friendship” cluster showed a steady decrease. In r/Anxiety, the “general anxiety and feelings of unease” cluster peaked in April 2020 and remained high, whereas the “physical symptoms of anxiety” cluster showed a slight increase.

    Conclusions

    This study sheds light on the impact of COVID-19 on mental health and the related themes discussed in 2 web-based communities during the pandemic. The results offer valuable insights for developing targeted interventions and policies to support individuals and communities in similar crises.

     
    more » « less
  2. The ability to quickly learn fundamentals about a new infectious disease, such as how it is transmitted, the incubation period, and related symptoms, is crucial in any novel pandemic. For instance, rapid identification of symptoms can enable interventions for dampening the spread of the disease. Traditionally, symptoms are learned from research publications associated with clinical studies. However, clinical studies are often slow and time intensive, and hence delays can have dire consequences in a rapidly spreading pandemic like we have seen with COVID-19. In this article, we introduce SymptomID, a modular artificial intelligence–based framework for rapid identification of symptoms associated with novel pandemics using publicly available news reports. SymptomID is built using the state-of-the-art natural language processing model (Bidirectional Encoder Representations for Transformers) to extract symptoms from publicly available news reports and cluster-related symptoms together to remove redundancy. Our proposed framework requires minimal training data, because it builds on a pre-trained language model. In this study, we present a case study of SymptomID using news articles about the current COVID-19 pandemic. Our COVID-19 symptom extraction module, trained on 225 articles, achieves an F1 score of over 0.8. SymptomID can correctly identify well-established symptoms (e.g., “fever” and “cough”) and less-prevalent symptoms (e.g., “rashes,” “hair loss,” “brain fog”) associated with the novel coronavirus. We believe this framework can be extended and easily adapted in future pandemics to quickly learn relevant insights that are fundamental for understanding and combating a new infectious disease. 
    more » « less
  3. null (Ed.)
    The outbreak and emergence of the novel coronavirus (COVID-19) pandemic affected every aspect of human activity, especially the transportation sector. Many cities adopted unprecedented lockdown strategies that resulted in significant nonessential mobility restrictions; hence, transportation network companies (TNCs) have experienced major shifts in their operation. Millions of people alone in the USA have filed for unemployment in the early stage of the COVID-19 outbreak, many belonging to self-employed groups such as Uber/Lyft drivers. Due to unprecedented scenarios, both drivers and passengers experienced overwhelming challenges that might elongate the recovery process. The goal of this study is to understand the risk, response, and challenges associated with ridesharing (TNCs, drivers, and passengers) during the COVID-19 pandemic situation. As such, large-scale crowdsourced data were collected from online ridesharing forums (i.e., Uber Drivers) since the emergence of COVID-19 (January 25–May 10, 2020). Word bigrams, word frequency heatmaps, and topic models are among the different natural language processing and text-mining techniques used to preprocess the data and classify risk perception, risk-taking, or risk-averting behaviors associated with ridesharing during a major disease outbreak. Results indicate higher levels of concern about economic disruption, availability of stimulus checks, new employment opportunities, hospitalization, pandemic, personal hygiene, and staying at home. In addition, unprecedented challenges due to unemployment and the risk and uncertainties in the required personal protective actions against spreading the disease due to sharing are among the major interactions. The proposed text-based data analytics of the ridesharing risk communication dynamics during this pandemic will help to identify unobserved factors inadvertently affecting the TNCs as well as the users (drivers and passengers) and identify more efficient strategies and alternatives for the forthcoming “new normal” of the current pandemic and the ones in the future. The study will also guide us toward understanding how efficiently online social interaction outlets can be designed and implemented more effectively during a major crisis and how to leverage such platforms for providing guidelines during emergencies to minimize transmission of disease due to shared travel. 
    more » « less
  4. The COVID-19 pandemic represents the most significant public health disaster since the 1918 influenza pandemic. During pandemics such as COVID-19, timely and reliable spatiotemporal forecasting of epidemic dynamics is crucial. Deep learning-based time series models for forecasting have recently gained popularity and have been successfully used for epidemic forecasting. Here we focus on the design and analysis of deep learning-based models for COVID-19 forecasting. We implement multiple recurrent neural network-based deep learning models and combine them using the stacking ensemble technique. In order to incorporate the effects of multiple factors in COVID-19 spread, we consider multiple sources such as COVID-19 confirmed and death case count data and testing data for better predictions. To overcome the sparsity of training data and to address the dynamic correlation of the disease, we propose clustering-based training for high-resolution forecasting. The methods help us to identify the similar trends of certain groups of regions due to various spatio-temporal effects. We examine the proposed method for forecasting weekly COVID-19 new confirmed cases at county-, state-, and country-level. A comprehensive comparison between different time series models in COVID-19 context is conducted and analyzed. The results show that simple deep learning models can achieve comparable or better performance when compared with more complicated models. We are currently integrating our methods as a part of our weekly forecasts that we provide state and federal authorities. 
    more » « less
  5. Abstract Background

    This article addresses the urgent need for more evidence-based research using primary data to document how the COVID-19 pandemic affected the health and social wellbeing of disabled individuals. Our study sought to determine if adults with disabilities, and with specific types of disability, were more likely to suffer adverse health and social impacts related to COVID-19 than nondisabled adults in metropolitan Texas, during the first 18 months of the pandemic.

    Methods

    We collected primary data from randomly selected residents in eight Texas metropolitan areas through a bilingual telephone survey in July 2021. Statistical analysis comprised multivariable generalized estimating equations that control for relevant sociodemographic and COVID-related risk factors, and spatial clustering.

    Results

    Disabled survey respondents had been more adversely affected by COVID-19 than nondisabled respondents, in terms of mental and physical health, health care access, living conditions and social life. Significant disparities were also found for almost all COVID-19 impacts when the disabled category was disaggregated by disability type. Respondents experiencing cognitive and independent living difficulties were negatively impacted in all five areas of life examined.

    Conclusions

    Findings emphasize the need to consider a wide range of impacts associated with the COVID-19 pandemic that negatively affect the health and social wellbeing of disabled persons, as well as develop disability-inclusive policies that provide adequate protections.

     
    more » « less