Research articles published in medical journals often present findings from causal experiments. In this paper, we use this intuition to build a model that leverages causal relations expressed in text to unearth factors related to Sjögren’s syndrome. Sjögren’s syndrome is an auto-immune disease affecting up to 3.1 million Americans. The uncommon nature of the disease, coupled with common symptoms with other autoimmune conditions make the timely diagnosis of this disease very hard. A centralized information system with easy access to common and uncommon factors related to Sjögren’s syndrome may alleviate the problem. We use automatically extracted causal relationships from text related to Sjögren’s syndrome collected from the medical literature to identify a set of factors, such as “signs and symptoms” and “associated conditions”, related to this disease. We show that our approach is capable of retrieving such factors with a high precision and recall values. Comparative experiments show that this approach leads to 25% improvement in retrieval F1-score compared to several state-of-the-art biomedical models, including BioBERT and Gram-CNN.
more »
« less
Mining Latent Disease Factors from Medical Literature using Causality
Understanding causality is a longstanding goal across many different domains. Different articles, such as those published in medical journals, publish newly discovered knowledge, often causal. In this paper, we use this intuition to build a model that leverages causal relations to unearth factors related to Sjögren’s syndrome. Sjögren’s syndrome is an autoimmune disease affecting up to 3.1 million Americans. The uncommon nature of the disease, coupled with common symptoms of other autoimmune conditions such as rheumatoid arthritis, it is difficult for clinicians to timely diagnose the disease. This is further worsened by suboptimal communication between dentists, and physicians, including rheumatologists and ophthalmologists, because clinical manifestations of this disease require the patients to visit physicians with different specialties. A centralized information system with easy access to common and uncommon factors related to Sjögren’s syndrome may alleviate the problem. We use automatically extracted causal relationships from text related to Sjögren’s syndrome collected from the medical literature to identify a set of factors, such as “signs and symptoms” and “associated conditions”, related to this disease. We show that our approach is capable of retrieving such factors with high precision and recall values. Comparative experiments show that this approach leads to 25% improvement in retrieval F1-score compared to several state-of-the-art biomedical models, including BioBERT and Gram-CNN.
more »
« less
- Award ID(s):
- 1948322
- PAR ID:
- 10446657
- Date Published:
- Journal Name:
- 2022 IEEE International Conference on Big Data (Big Data)
- Page Range / eLocation ID:
- 2755 to 2764
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract In response to the COVID-19 outbreak, scientists and medical researchers are capturing a wide range of host responses, symptoms and lingering postrecovery problems within the human population. These variable clinical manifestations suggest differences in influential factors, such as innate and adaptive host immunity, existing or underlying health conditions, comorbidities, genetics and other factors—compounding the complexity of COVID-19 pathobiology and potential biomarkers associated with the disease, as they become available. The heterogeneous data pose challenges for efficient extrapolation of information into clinical applications. We have curated 145 COVID-19 biomarkers by developing a novel cross-cutting disease biomarker data model that allows integration and evaluation of biomarkers in patients with comorbidities. Most biomarkers are related to the immune (SAA, TNF-∝ and IP-10) or coagulation (D-dimer, antithrombin and VWF) cascades, suggesting complex vascular pathobiology of the disease. Furthermore, we observe commonality with established cancer biomarkers (ACE2, IL-6, IL-4 and IL-2) as well as biomarkers for metabolic syndrome and diabetes (CRP, NLR and LDL). We explore these trends as we put forth a COVID-19 biomarker resource (https://data.oncomx.org/covid19) that will help researchers and diagnosticians alike.more » « less
-
Masanori Aikawa (Ed.)As binary switches, RAS proteins switch to an ON/OFF state during signaling and are on a leash under normal conditions. However, in RAS-related diseases such as cancer and RASopathies, mutations in the genes that regulate RAS signaling or the RAS itself permanently activate the RAS protein. The structural basis of this switch is well understood; however, the exact mechanisms by which RAS proteins are regulated are less clear. RAS/MAPK syndromes are multisystem developmental disorders caused by germline mutations in genes associated with the RAS/mitogen-activated protein kinase pathway, impacting 1 in 1,000–2,500 children. These include a variety of disorders such as Noonan syndrome (NS) and NS-related disorders (NSRD), such as cardio facio cutaneous (CFC) syndrome, Costello syndrome (CS), and NS with multiple lentigines (NSML, also known as LEOPARD syndrome). A frequent manifestation of cardiomyopathy (CM) and hypertrophic cardiomyopathy associated with RASopathies suggest that RASopathies could be a potential causative factor for CM. However, the current supporting evidence is sporadic and unclear. RASopathy-patients also display a broad spectrum of congenital heart disease (CHD). More than 15 genes encode components of the RAS/MAPK signaling pathway that are essential for the cell cycle and play regulatory roles in proliferation, differentiation, growth, and metabolism. These genes are linked to the molecular genetic pathogenesis of these syndromes. However, genetic heterogeneity for a given syndrome on the one hand and alleles for multiple syndromes on the other make classification difficult in diagnosing RAS/MAPK-related diseases. Although there is some genetic homogeneity in most RASopathies, several RASopathies are allelic diseases. This allelism points to the role of critical signaling nodes and sheds light on the overlap between these related syndromes. Even though considerable progress has been made in understanding the pathophysiology of RASopathy with the identification of causal mutations and the functional analysis of their pathophysiological consequences, there are still unidentified causal genes for many patients diagnosed with RASopathies.more » « less
-
Molecular mimicry between viral antigens and host proteins can produce cross-reacting antibodies leading to autoimmunity. The coronavirus SARS-CoV-2 causes COVID-19, a disease curiously resulting in varied symptoms and outcomes, ranging from asymptomatic to fatal. Autoimmunity due to cross-reacting antibodies resulting from molecular mimicry between viral antigens and host proteins may provide an explanation. Thus, we computationally investigated molecular mimicry between SARS-CoV-2 Spike and known epitopes. We discovered molecular mimicry hotspots in Spike and highlight two examples with tentative high autoimmune potential and implications for understanding COVID-19 complications. We show that a TQLPP motif in Spike and thrombopoietin shares similar antibody binding properties. Antibodies cross-reacting with thrombopoietin may induce thrombocytopenia, a condition observed in COVID-19 patients. Another motif, ELDKY, is shared in multiple human proteins, such as PRKG1 involved in platelet activation and calcium regulation, and tropomyosin, which is linked to cardiac disease. Antibodies cross-reacting with PRKG1 and tropomyosin may cause known COVID-19 complications such as blood-clotting disorders and cardiac disease, respectively. Our findings illuminate COVID-19 pathogenesis and highlight the importance of considering autoimmune potential when developing therapeutic interventions to reduce adverse reactions.more » « less
-
The COVID-19 pandemic has changed the lives of many people around the world. Based on the available data and published reports, most people diagnosed with COVID-19 exhibit no or mild symptoms and could be discharged home for self-isolation. Considering that a substantial portion of them will progress to a severe disease requiring hospitalization and medical management, including respiratory and circulatory support in the form of supplemental oxygen therapy, mechanical ventilation, vasopressors, etc. The continuous monitoring of patient conditions at home for patients with COVID-19 will allow early determination of disease severity and medical intervention to reduce morbidity and mortality. In addition, this will allow early and safe hospital discharge and free hospital beds for patients who are in need of admission. In this review, we focus on the recent developments in next-generation wearable sensors capable of continuous monitoring of disease symptoms, particularly those associated with COVID-19. These include wearable non/minimally invasive biophysical (temperature, respiratory rate, oxygen saturation, heart rate, and heart rate variability) and biochemical (cytokines, cortisol, and electrolytes) sensors, sensor data analytics, and machine learning-enabled early detection and medical intervention techniques. Together, we aim to inspire the future development of wearable sensors integrated with data analytics, which serve as a foundation for disease diagnostics, health monitoring and predictions, and medical interventions.more » « less
An official website of the United States government

