skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: CovIdentify Dataset
This dataset supports the study "A method for intelligent allocation of diagnostic testing by leveraging data from commercial wearable devices: a case study on COVID-19" which developed an Intelligent Testing Allocation (ITA) method. The study demonstrated the efficacy of using continuous digital biomarkers like resting heart rate and steps to enhance COVID-19 diagnostic testing positivity rates. The findings suggest significant potential for large-scale, symptom-independent surveillance testing to alleviate diagnostic test shortages. The provided data is from the CovIdentify study launched by Duke's BIG IDEAs Lab in the Biomedical Engineering Department. From April 2nd, 2020 to May 25th, 2021, 2,887 participants connected their smartwatches to the CovIdentify platform, including 1,689 Garmin, 1,091 Fitbit, and 107 Apple smartwatches  more » « less
Award ID(s):
2339669
PAR ID:
10587594
Author(s) / Creator(s):
; ; ;
Publisher / Repository:
PhysioNet
Date Published:
Format(s):
Medium: X
Location:
Durham, NC
Institution:
Duke University
Sponsoring Org:
National Science Foundation
More Like this
  1. The screening testing is an effective tool to control the early spread of an infectious disease such as COVID‐19. When the total testing capacity is limited, we aim to optimally allocate testing resources amongncounties. We build a (weighted) commute network on counties, with the weight between two counties a decreasing function of their traffic distance. We introduce a network‐based disease model, in which the number of newly confirmed cases of each county depends on the numbers of hidden cases of all counties on the network. Our proposed testing allocation strategy first uses historical data to learn model parameters and then decides the testing rates for all counties by solving an optimization problem. We apply the method on the commute networks of Massachusetts, USA and Hubei, China, and observe its advantages over testing allocation strategies that ignore the network structure. Our approach can also be extended to study the vaccine allocation problem. 
    more » « less
  2. Abstract Early detection of diseases such as COVID-19 could be a critical tool in reducing disease transmission by helping individuals recognize when they should self-isolate, seek testing, and obtain early medical intervention. Consumer wearable devices that continuously measure physiological metrics hold promise as tools for early illness detection. We gathered daily questionnaire data and physiological data using a consumer wearable (Oura Ring) from 63,153 participants, of whom 704 self-reported possible COVID-19 disease. We selected 73 of these 704 participants with reliable confirmation of COVID-19 by PCR testing and high-quality physiological data for algorithm training to identify onset of COVID-19 using machine learning classification. The algorithm identified COVID-19 an average of 2.75 days before participants sought diagnostic testing with a sensitivity of 82% and specificity of 63%. The receiving operating characteristic (ROC) area under the curve (AUC) was 0.819 (95% CI [0.809, 0.830]). Including continuous temperature yielded an AUC 4.9% higher than without this feature. For further validation, we obtained SARS CoV-2 antibody in a subset of participants and identified 10 additional participants who self-reported COVID-19 disease with antibody confirmation. The algorithm had an overall ROC AUC of 0.819 (95% CI [0.809, 0.830]), with a sensitivity of 90% and specificity of 80% in these additional participants. Finally, we observed substantial variation in accuracy based on age and biological sex. Findings highlight the importance of including temperature assessment, using continuous physiological features for alignment, and including diverse populations in algorithm development to optimize accuracy in COVID-19 detection from wearables. 
    more » « less
  3. Abstract BackgroundThe coronavirus disease 2019 (COVID-19) presents critical diagnostic challenges for managing the pandemic. We investigated the 30-month changes in COVID-19 testing modalities and functional testing sites from the early period of the pandemic to the most recent Omicron surge in 2022 in Kyoto City, Japan. MethodsThis is a retrospective-observational study using a local anonymized population database that included patients' demographic and clinical information, testing methods and facilities from January 2020 to June 2022, a total of 30 months. We computed the distribution of symptomatic presentation, testing methods, and testing facilities among cases. Differences over time were tested using chi-square tests of independence. ResultsDuring the study period, 133,115 confirmed COVID-19 cases were reported, of which 90.9% were symptomatic. Although nucleic acid amplification testing occupied 68.9% of all testing, the ratio of lateral flow devices (LFDs) rapidly increased in 2022. As the pandemic continued, the testing capability was shifted from COVID-19 designated facilities to general practitioners, who became the leading testing providers (57.3% of 99,945 tests in 2022). ConclusionsThere was a dynamic shift in testing modality during the first 30 months of the pandemic in Kyoto City. General practitioners increased their role substantially as the use of LFDs spread dramatically in 2022. By comprehending and documenting the evolution of testing methods and testing locations, it is anticipated that this will contribute to the establishment of an even more efficient testing infrastructure for the next pandemic. 
    more » « less
  4. null (Ed.)
    Unlike pandemics in the past, COVID-19 has hit us in the midst of the information age. We have built vast capabilities to collect and store data of any kind that can be analyzed in myriad ways to help us mitigate the impact of this catastrophic disease. Specifically for COVID-19, data analysis can help local governments to plan the allocation of testing kits, testing stations, and primary care units, and it can help them in setting guidelines for residents, such as the need for social distancing, the use of face masks, and when to open local businesses that enable human contact. Further, it can also lead to a better understanding of pandemics in general and so inform policy makers on the regional and national level. All of this can save both cost and lives. In this article, we show the results of an ongoing study we conducted using a prominent regularly updated dataset. We used a pattern mining engine we developed to find specific characteristics of US counties that appear to expose them to higher COVID-19 mortality. Furthermore, we also show that these characteristics can be used to predict future COVID-19 mortality. 
    more » « less
  5. Pantea, Casian (Ed.)
    Limited testing capacity for COVID-19 has hampered the pandemic response. Pooling is a testing method wherein samples from specimens (e.g., swabs) from multiple subjects are combined into a pool and screened with a single test. If the pool tests positive, then new samples from the collected specimens are individually tested, while if the pool tests negative, the subjects are classified as negative for the disease. Pooling can substantially expand COVID-19 testing capacity and throughput, without requiring additional resources. We develop a mathematical model to determine the best pool size for different risk groups , based on each group’s estimated COVID-19 prevalence. Our approach takes into consideration the sensitivity and specificity of the test, and a dynamic and uncertain prevalence, and provides a robust pool size for each group. For practical relevance, we also develop a companion COVID-19 pooling design tool (through a spread sheet). To demonstrate the potential value of pooling, we study COVID-19 screening using testing data from Iceland for the period, February-28-2020 to June-14-2020, for subjects stratified into high- and low-risk groups. We implement the robust pooling strategy within a sequential framework, which updates pool sizes each week, for each risk group, based on prior week’s testing data. Robust pooling reduces the number of tests, over individual testing, by 88.5% to 90.2%, and 54.2% to 61.9%, respectively, for the low-risk and high-risk groups (based on test sensitivity values in the range [0.71, 0.98] as reported in the literature). This results in much shorter times, on average, to get the test results compared to individual testing (due to the higher testing throughput), and also allows for expanded screening to cover more individuals. Thus, robust pooling can potentially be a valuable strategy for COVID-19 screening. 
    more » « less