skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Moving toward automated µFTIR spectra matching for microplastic identification: addressing false identifications and improving accuracy
Abstract Infrared spectroscopy is a widely used tool for studying microplastics and identifying microparticles. Researchers rely on spectral libraries to differentiate between synthetic and natural materials. Unfortunately, spectral library matching is not perfect, and best practices require researchers to use time consuming, manual peak matching to assess spectral matches. Moving toward automated matching requires increased confidence in the matching process. Using spectra matching software may increase the efficiency of particle identification, however some matching strategies may confuse natural materials such as cotton, silk, and plant matter with common classes of synthetics such as polyesters and polyamides. In this experiment, we prepared 22 pristine sample materials from natural and synthetic sources and measured micro-Fourier transform infrared (µFTIR) spectra in transmission mode for each sample using a Thermo Nicolet iN10 MX instrument. The collected spectra were then input into two spectral library matching systems (Omnic Picta and Open Specy), using a total of five identification routines. Next, we placed a subset of four pristine microplastic materials in a biologically active river system for two weeks to simulate environmental samples. These simulated environmental samples were processed using 10% hydrogen peroxide for 24 h to remove organic contamination and then identified using the strongest performing library. We found that libraries with fewer sample spectra produced lower correlation matches and that using derivative correction greatly reduced the number of inaccuracies in identifying materials as either natural or synthetic. We also found that environmental fouling reduced the correlation value of library matches when compared to pristine particles, however the effect was not consistent across the four materials tested. Overall, we found that the accuracy of automated library matching in the tested systems and processing routines varied from 64.1 to 98.0% for distinguishing between natural and synthetic materials, and that a high Hit Quality Index (HQI) did not always correlate with accuracy. These results are important for the microplastic field, demonstrating a need to rigorously test spectral libraries and processing routines with known materials to ensure identification accuracy.  more » « less
Award ID(s):
2045871 2018848
PAR ID:
10556212
Author(s) / Creator(s):
; ;
Publisher / Repository:
Springer Science + Business Media
Date Published:
Journal Name:
Microplastics and Nanoplastics
Volume:
4
Issue:
1
ISSN:
2662-4966
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Fourier transform infrared (FTIR) and Raman microspectroscopy are methods applied in microplastics research to determine the chemical identity of microplastics. These techniques enable quantification of microplastic particles across various matrices. Previous work has highlighted the benefits and limitations of each method and found these to be complimentary. Within this work, metadata collected within an interlaboratory method validation study was used to determine which variables most influenced successful chemical identification of un-weathered microplastics in simulated drinking water samples using FTIR and Raman microspectroscopy. No variables tested had a strong correlation with the accuracy of chemical identification (r = ≤0.63). The variables most correlated with accuracy differed between the two methods, and include both physical characteristics of particles (color, morphology, size, polymer type), and instrumental parameters (spectral collection mode, spectral range). Based on these results, we provide technical recommendations to improve capabilities of both methods for measuring microplastics in drinking water and highlight priorities for further research. For FTIR microspectroscopy, recommendations include considering the type of particle in question to inform sample presentation and spectral collection mode for sample analysis. Instrumental parameters should be adjusted for certain particle types when using Raman microspectroscopy. For both instruments, the study highlighted the need for harmonization of spectral reference libraries among research groups, including the use of libraries containing reference materials of both weathered plastic and natural materials that are commonly found in environmental samples. 
    more » « less
  2. Accurate spectral identification of weathered plastics and analyses that provide insight into environmental degradation and age are desirable for source tracking and understanding hazards. The objectives of this study were to (1) evaluate the kinetics of spectral changes for lab-weathered polymers and compare to spectra from environmental microplastics (MPs), and (2) assess the accuracy of spectral databases in identifying weathered polymers. For objective 1, polyethylene (PE) and polypropylene (PP) fragments were exposed to simulated solar radiation in water for 90 days. FTIR spectra were collected periodically and degradation was quantified using carbonyl and hydroxyl bond indices. Significant linear increases in carbonyl indices for PP, but not PE, were observed as a function of exposure time. Spectra (via principal component analysis) and bond indices from lab-weathered polymers were then compared to environmental MPs collected from urban stormwater and the Delaware Bay estuary. Estuarine PP carbonyl and hydroxyl indices varied as a function of spectral collection mode (i.e., ATR vs. transmission) and by sampling site, potentially indicating the bond indices provide insight into sources/fate/transport of PP and are worthy of further study. In contrast, no significant differences were observed for PP in stormwater samples, possibly due to the close proximity of collection locations. PE exhibited non-linear trends in bond indices in the laboratory study and showed no significant association with sampling location in environmental samples, suggesting these indices may be less useful for PE degradation analysis. For objective 2, 14 different polymers, eight of which were polymer blends, were exposed to simulated solar radiation for up to 90 days, in dry and wet conditions. FTIR spectra were collected periodically and analyzed with two spectral identification software. OpenSpecy achieved an 88 % true positive rate compared to siMPle's 57 % at a 70 % hit quality threshold. Expanding reference libraries, to include weathered polymers and polymer blends, could improve spectral identification accuracy, and manual interpretation of FTIR spectra is recommended for low-confidence matches. 
    more » « less
  3. Tandem mass spectrometry (MS/MS) is crucial for small-molecule analysis; however, traditional computational methods are limited by incomplete reference libraries and complex data processing. Machine learning (ML) is transforming small-molecule mass spectrometry in three key directions: (a) predicting MS/MS spectra and related physicochemical properties to expand reference libraries, (b) improving spectral matching through automated pattern extraction, and (c) predicting molecular structures of compounds directly from their MS/MS spectra. We review ML approaches for molecular representations [descriptors, simplified molecular-input line-entry (SMILE) strings, and graphs] and MS/MS spectra representations (using binned vectors and peak lists) along with recent advances in spectra prediction, retention time, collision cross sections, and spectral matching. Finally, we discuss ML-integrated workflows for chemical formula identification. By addressing the limitations of current methods for compound identification, these ML approaches can greatly enhance the understanding of biological processes and the development of diagnostic and therapeutic tools. 
    more » « less
  4. Abstract Neuropeptides have tremendous potential for application in modern medicine, including utility as biomarkers and therapeutics. To overcome the inherent challenges associated with neuropeptide identification and characterization, data‐independent acquisition (DIA) is a fitting mass spectrometry (MS) method of choice to achieve sensitive and accurate analysis. It is advantageous for preliminary neuropeptidomic studies to occur in less complex organisms, with crustacean models serving as a popular choice due to their relatively simple nervous system. With spectral libraries serving as a means to interpret DIA‐MS output spectra, andCancer borealisas a model of choice for neuropeptide analysis, we performed the first spectral library mapping of crustacean neuropeptides. Leveraging pre‐existing data‐dependent acquisition (DDA) spectra, a spectral library was built using PEAKS Online. The library is comprised of 333 unique neuropeptides. The identification results obtained through the use of this spectral library were compared with those achieved through library‐free analysis of crustacean brain, pericardial organs (PO), and thoracic ganglia (TG) tissues. A statistically significant increase (Student'st‐test,Pvalue < 0.05) in the number of identifications achieved from the TG data was observed in the spectral library results. Furthermore, in each of the tissues, a distinctly different set of identifications was found in the library search compared to the library‐free search. This work highlights the necessity for the use of spectral libraries in neuropeptide analysis, illustrating the advantage of spectral libraries for interpreting DIA spectra in a reproducible manner with greater neuropeptidomic depth. 
    more » « less
  5. Abstract MotivationDriven by technological advances, the throughput and cost of mass spectrometry (MS) proteomics experiments have improved by orders of magnitude in recent decades. Spectral library searching is a common approach to annotating experimental mass spectra by matching them against large libraries of reference spectra corresponding to known peptides. An important disadvantage, however, is that only peptides included in the spectral library can be found, whereas novel peptides, such as those with unexpected post-translational modifications (PTMs), will remain unknown. Open modification searching (OMS) is an increasingly popular approach to annotate modified peptides based on partial matches against their unmodified counterparts. Unfortunately, this leads to very large search spaces and excessive runtimes, which is especially problematic considering the continuously increasing sizes of MS proteomics datasets. ResultsWe propose an OMS algorithm, called HOMS-TC, that fully exploits parallelism in the entire pipeline of spectral library searching. We designed a new highly parallel encoding method based on the principle of hyperdimensional computing to encode mass spectral data to hypervectors while minimizing information loss. This process can be easily parallelized since each dimension is calculated independently. HOMS-TC processes two stages of existing cascade search in parallel and selects the most similar spectra while considering PTMs. We accelerate HOMS-TC on NVIDIA’s tensor core units, which is emerging and readily available in the recent graphics processing unit (GPU). Our evaluation shows that HOMS-TC is 31× faster on average than alternative search engines and provides comparable accuracy to competing search tools. Availability and implementationHOMS-TC is freely available under the Apache 2.0 license as an open-source software project at https://github.com/tycheyoung/homs-tc. 
    more » « less