Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
                                            Some full text articles may not yet be available without a charge during the embargo (administrative interval).
                                        
                                        
                                        
                                            
                                                
                                             What is a DOI Number?
                                        
                                    
                                
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
- 
            Abstract Analysis of virus-like particles (VLPs) is an essential task in optimizing their implementation as vaccine antigens for virus-initiated diseases. Interrogating VLP collections for elasticity by probing with a rigid atomic force microscopy (AFM) tip is a potential method for determining VLP morphological changes. During VLP morphological change, it is not expected that all VLPs would be in the same state. This leads to the open question of whether VLPs may change in a continuous or stepwise fashion. For continuous change, the statistical distribution of observed VLP properties would be expected as a single distribution, while stepwise change would lead to a multimodal distribution of properties. This study presents the application of a Gaussian mixture model (GMM), fit by the Expectation-Maximization (EM) algorithm, to identify different states of VLP morphological change observed by AFM imaging.more » « lessFree, publicly-accessible full text available December 1, 2025
- 
            ABSTRACT Conformal predictions transform a measurable, heuristic notion of uncertainty into statistically valid confidence intervals such that, for a future sample, the true class prediction will be included in the conformal prediction set at a predetermined confidence. In a Bayesian perspective, common estimates of uncertainty in multivariate classification, namelyp‐values, only provide the probability that the data fits the presumed class model,P(D|M). Conformal predictions, on the other hand, address the more meaningful probability that a model fits the data,P(M|D). Herein, two methods to perform inductive conformal predictions are investigated—the traditional Split Conformal Prediction that uses an external calibration set and a novel Bagged Conformal Prediction, closely related to Cross Conformal Predictions, that utilizes bagging to calibrate the heuristic notions of uncertainty. Methods for preprocessing the conformal prediction scores to improve performance are discussed and investigated. These conformal prediction strategies are applied to identifying four non‐steroidal anti‐inflammatory drugs (NSAIDs) from hyperspectral Raman imaging data. In addition to assigning meaningful confidence intervals on the model results, we herein demonstrate how conformal predictions can add additional diagnostics for model quality and method stability.more » « less
- 
            Abstract The use of network analysis as a means of visualizing the off‐diagonal (misclassified) elements of a confusion matrix is demonstrated, and the potential to use the network graphs as a guide for developing hierarchical classification models is presented. A very brief summary of graph theory is described. This is followed by an explanation and code with examples of how these networks can then be used for visualization of confusion matrices. The use of network graphs to provide insight into differing model performance is also addressed.more » « less
- 
            Abstract The illegal timber trade has significant impact on the survival of endangered tropical hardwood species likeDalbergiaspp. (rosewood), a world‐wide protected genus from the Convention on International Trade in Endangered Species of Wild Fauna and Flora (CITES). Due to increased threat toDalbergiaspp., and lack of action to reduce threats, port of entry analysis methods are required to identifyDalbergiaspp. Handheld laser‐induced breakdown spectroscopy (LIBS) has been shown to be capable of identifying species and establishing provenance ofDalbergiaspp. and other tropical hardwoods, but analysis methods for this work have yet to be investigated in detail. The present work investigates five well‐known algorithms—partial least squares discriminant analysis (PLS‐DA), classification and regression trees (CART),k‐nearest neighbor (k‐NN), random forest (RF), and support vector machine (SVM)—two training/test set sampling regimes, and data collection at two signal‐to‐noise (S/N) ratios to assess the potential for handheld LIBS analyses. Additionally, imbalanced classes are addressed. For this application, SVM and RF yield near identical results (though RF takes nearly 100 longer to compute), while the S/N ratio has a significant effect on model success assuming all else is equal. It was found that forming a training set with replicate low S/N analyses can perform as well as higher precision training sets for true prediction, even if the predicted samples have low signal to noise! This work confirms handheld LIBS analyzers can provide a viable method for classification of hardwood species, even within the same genus.more » « less
- 
            Abstract A potential method to determine whether two varieties of edible oils can be differentiated by Fourier transform infrared (FTIR) spectroscopy is proposed using digitally generated data of adulterated edible oils from an infrared (IR) spectral library. The first step is the evaluation of digitally blended data sets. Specifically, IR spectra of adulterated edible oils are computed from digitally blending experimental data of the IR spectra of an edible oil and the corresponding adulterant using the appropriate mixing coefficients to achieve the desired level of adulteration. To determine whether two edible oils can be differentiated by FTIR spectroscopy, pure IR spectra of the two edible oils are compared with IR spectra of two edible oils digitally mixed using a genetic algorithm for pattern recognition to solve a ternary classification problem. If the IR spectra of the two edible oils and their binary mixtures are differentiable from principal component plots of the spectral data, then differences between the IR spectra of these two edible oils are of sufficient magnitude to ensure that a reliable classification by FTIR spectroscopy can be obtained. Using this approach, the feasibility of authenticating edible oils such as extra virgin olive oil (EVOO) directly from library spectra is demonstrated. For this study, both digital and experimental data are combined to generate training and validation data sets to assess detection limits in FTIR spectroscopy for the adulterants.more » « less
- 
            We present four unique prediction techniques, combined with multiple data pre-processing methods, utilizing a wide range of both oil types and oil peroxide values (PV) as well as incorporating natural aging for peroxide creation. Samples were PV assayed using a standard starch titration method, AOCS Method Cd 8-53, and used as a verified reference method for PV determination. Near-infrared (NIR) spectra were collected from each sample in two unique optical pathlengths (OPLs), 2 and 24 mm, then fused into a third distinct set. All three sets were used in partial least squares (PLS) regression, ridge regression, LASSO regression, and elastic net regression model calculation. While no individual regression model was established as the best, global models for each regression type and pre-processing method show good agreement between all regression types when performed in their optimal scenarios. Furthermore, small spectral window size boxcar averaging shows prediction accuracy improvements for edible oil PVs. Best-performing models for each regression type are: PLS regression, 25 point boxcar window fused OPL spectral information RMSEP = 2.50; ridge regression, 5 point boxcar window, 24 mm OPL, RMSEP = 2.20; LASSO raw spectral information, 24 mm OPL, RMSEP = 1.80; and elastic net, 10 point boxcar window, 24 mm OPL, RMSEP = 1.91. The results show promising advancements in the development of a full global model for PV determination of edible oils.more » « less
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
