skip to main content


Title: Reduced Graphene Oxide-Metalloporphyrin Sensors for Human Breath Screening
The objective of this study is to validate reduced graphene oxide (RGO)-based volatile organic compounds (VOC) sensors, assembled by simple and low-cost manufacturing, for the detection of disease-related VOCs in human breath using machine learning (ML) algorithms. RGO films were functionalized by four different metalloporphryins to assemble cross-sensitive chemiresistive sensors with different sensing properties. This work demonstrated how different ML algorithms affect the discrimination capabilities of RGO–based VOC sensors. In addition, an ML-based disease classifier was derived to discriminate healthy vs. unhealthy individuals based on breath sample data. The results show that our ML models could predict the presence of disease-related VOC compounds of interest with a minimum accuracy and F1-score of 91.7% and 83.3%, respectively, and discriminate chronic kidney disease breath with a high accuracy, 91.7%.  more » « less
Award ID(s):
1934568
NSF-PAR ID:
10349464
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Applied Sciences
Volume:
11
Issue:
23
ISSN:
2076-3417
Page Range / eLocation ID:
11290
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Fast diagnostic results using breath analysis are an anticipated possibility for disease diagnosis or general health screenings. Tests that do not require sending specimens to medical laboratories possess capabilities to speed patient diagnosis and protect both patient and healthcare staff from unnecessary prolonged exposure. The objective of this work was to develop testing procedures on an initial healthy subject cohort in Hawaii to act as a range-finding pilot study for characterizing the baseline of exhaled breath prior to further research. Using comprehensive two-dimensional gas chromatography (GC×GC), this study analyzed exhaled breath from a healthy adult population in Hawaii to profile the range of different volatile organic compounds (VOCs) and survey Hawaii-specific differences. The most consistently reported compounds in the breath profile of individuals were acetic acid, dimethoxymethane, benzoic acid methyl ester, and n-hexane. In comparison to other breathprinting studies, the list of compounds discovered was representative of control cohorts. This must be considered when implementing proposed breath diagnostics in new locations with increased interpersonal variation due to diversity. Further studies on larger numbers of subjects over longer periods of time will provide additional foundational data on baseline breath VOC profiles of control populations for comparison to disease-positive cohorts. 
    more » « less
  2. Abstract

    Cystic fibrosis (CF) is characterized by chronic respiratory infections which progressively decrease lung function over time. Affected individuals experience episodes of intensified respiratory symptoms called pulmonary exacerbations (PEx), which in turn accelerate pulmonary function decline and decrease survival rate. An overarching challenge is that there is no standard classification for PEx, which results in treatments that are heterogeneous. Improving PEx classification and management is a significant research priority for people with CF. Previous studies have shown volatile organic compounds (VOCs) in exhaled breath can be used as biomarkers because they are products of metabolic pathways dysregulated by different diseases. To provide insights on PEx classification and other CF clinical factors, exhaled breath samples were collected from 18 subjects with CF, with some experiencing PEx and others serving as a baseline. Exhaled breath was collected in Tedlar bags during tidal breathing and cryotransferred to headspace vials for VOC analysis by solid phase microextraction coupled to gas chromatography–mass spectrometry. Statistical significance testing between quantitative and categorical clinical variables displayed percent-predicted forced expiratory volume in one second (FEV1pp) was decreased in subjects experiencing PEx. VOCs correlating with other clinical variables (body mass index, age, use of highly effective modulator treatment (HEMT), and the need for inhaled tobramycin) were also explored. Two volatile aldehydes (octanal and nonanal) were upregulated in patients not taking the HEMT. VOCs correlating to potential confounding variables were removed and then analyzed by regression for significant correlations with FEV1pp measurements. Interestingly, the VOC with the highest correlation with FEV1pp (3,7-dimethyldecane) also gave the lowestp-value when comparing subjects at baseline and during PEx. Other VOCs that were differentially expressed due to PEx that were identified in this study include durene, 2,4,4-trimethyl-1,3-pentanediol 1-isobutyrate and 5-methyltridecane. Receiver operator characteristic curves were developed and showed 3,7-dimethyldecane had higher ability to classify PEx (area under the curve (AUC) = 0.91) relative to FEV1pp values at collection (AUC = 0.83). However, normalized ΔFEV1pp values had the highest capability to distinguish PEx (AUC = 0.93). These results show that VOCs in exhaled breath may be a rich source of biomarkers for various clinical traits of CF, including PEx, that should be explored in larger sample cohorts and validation studies.

     
    more » « less
  3. Conventional black box machine learning (ML) algorithms for gas-phase species identification from THz frequency region absorption spectra have been reported in the literature. While the robust classification performance of such ML models is promising, the black box nature of these ML tools limits their interpretability and acceptance in application. Here, a one-dimensional convolutional neural network (CNN), VOC-Net, is developed and demonstrated for the classification of absorption spectra for volatile organic compounds (VOCs) in the THz frequency range, specifically from 220 to 330 GHz where prior experimental data is available. VOC-Net is trained and validated against simulated spectra, and also demonstrated and tested against experimental spectra. The performance of VOC-Net is examined by the consideration of confusion matrices and receiver-operator-characteristic (ROC) curves. The model is shown to be 99+% accurate for the classification of simulated spectra and 97% accurate for the classification of noisy experimental spectra. The model’s internal logic is examined using the Gradient-weighted Class Activation Mapping (Grad-CAM) method, which provides a visual and interpretable explanation of the model’s decision making process with respect to the important distinguishing spectral features. 
    more » « less
  4. Porcine reproductive and respiratory syndrome is an infectious disease of pigs caused by PRRS virus (PRRSV). A modified live-attenuated vaccine has been widely used to control the spread of PRRSV and the classification of field strains is a key for a successful control and prevention. Restriction fragment length polymorphism targeting the Open reading frame 5 (ORF5) genes is widely used to classify PRRSV strains but showed unstable accuracy. Phylogenetic analysis is a powerful tool for PRRSV classification with consistent accuracy but it demands large computational power as the number of sequences gets increased. Our study aimed to apply four machine learning (ML) algorithms, random forest, k-nearest neighbor, support vector machine and multilayer perceptron, to classify field PRRSV strains into four clades using amino acid scores based on ORF5 gene sequence. Our study used amino acid sequences of ORF5 gene in 1931 field PRRSV strains collected in the US from 2012 to 2020. Phylogenetic analysis was used to labels field PRRSV strains into one of four clades: Lineage 5 or three clades in Linage 1. We measured accuracy and time consumption of classification using four ML approaches by different size of gene sequences. We found that all four ML algorithms classify a large number of field strains in a very short time (<2.5 s) with very high accuracy (>0.99 Area under curve of the Receiver of operating characteristics curve). Furthermore, the random forest approach detects a total of 4 key amino acid positions for the classification of field PRRSV strains into four clades. Our finding will provide an insightful idea to develop a rapid and accurate classification model using genetic information, which also enables us to handle large genome datasets in real time or semi-real time for data-driven decision-making and more timely surveillance. 
    more » « less
  5. Raynal, Ann M. ; Ranney, Kenneth I. (Ed.)
    Most research in technologies for the Deaf community have focused on translation using either video or wearable devices. Sensor-augmented gloves have been reported to yield higher gesture recognition rates than camera-based systems; however, they cannot capture information expressed through head and body movement. Gloves are also intrusive and inhibit users in their pursuit of normal daily life, while cameras can raise concerns over privacy and are ineffective in the dark. In contrast, RF sensors are non-contact, non-invasive and do not reveal private information even if hacked. Although RF sensors are unable to measure facial expressions or hand shapes, which would be required for complete translation, this paper aims to exploit near real-time ASL recognition using RF sensors for the design of smart Deaf spaces. In this way, we hope to enable the Deaf community to benefit from advances in technologies that could generate tangible improvements in their quality of life. More specifically, this paper investigates near real-time implementation of machine learning and deep learning architectures for the purpose of sequential ASL signing recognition. We utilize a 60 GHz RF sensor which transmits a frequency modulation continuous wave (FMWC waveform). RF sensors can acquire a unique source of information that is inaccessible to optical or wearable devices: namely, a visual representation of the kinematic patterns of motion via the micro-Doppler signature. Micro-Doppler refers to frequency modulations that appear about the central Doppler shift, which are caused by rotational or vibrational motions that deviate from principle translational motion. In prior work, we showed that fractal complexity computed from RF data could be used to discriminate signing from daily activities and that RF data could reveal linguistic properties, such as coarticulation. We have also shown that machine learning can be used to discriminate with 99% accuracy the signing of native Deaf ASL users from that of copysigning (or imitation signing) by hearing individuals. Therefore, imitation signing data is not effective for directly training deep models. But, adversarial learning can be used to transform imitation signing to resemble native signing, or, alternatively, physics-aware generative models can be used to synthesize ASL micro-Doppler signatures for training deep neural networks. With such approaches, we have achieved over 90% recognition accuracy of 20 ASL signs. In natural environments, however, near real-time implementations of classification algorithms are required, as well as an ability to process data streams in a continuous and sequential fashion. In this work, we focus on extensions of our prior work towards this aim, and compare the efficacy of various approaches for embedding deep neural networks (DNNs) on platforms such as a Raspberry Pi or Jetson board. We examine methods for optimizing the size and computational complexity of DNNs for embedded micro-Doppler analysis, methods for network compression, and their resulting sequential ASL recognition performance. 
    more » « less