Neighborhood models have allowed us to test many hypotheses regarding the drivers of variation in tree growth, but require considerable computation due to the many empirically supported non-linear relationships they include. Regularized regression represents a far more efficient neighborhood modeling method, but it is unclear whether such an ecologically unrealistic model can provide accurate insights on tree growth. Rapid computation is becoming increasingly important as ecological datasets grow in size, and may be essential when using neighborhood models to predict tree growth beyond sample plots or into the future. We built a novel regularized regression model of tree growth and investigated whether it reached the same conclusions as a commonly used neighborhood model, regarding hypotheses of how tree growth is influenced by the species identity of neighboring trees. We also evaluated the ability of both models to interpolate the growth of trees not included in the model fitting dataset. Our regularized regression model replicated most of the classical model’s inferences in a fraction of the time without using high-performance computing resources. We found that both methods could interpolate out-of-sample tree growth, but the method making the most accurate predictions varied among focal species. Regularized regression is particularly efficient for comparing hypotheses because it automates the process of model selection and can handle correlated explanatory variables. This feature means that regularized regression could also be used to select among potential explanatory variables (e.g., climate variables) and thereby streamline the development of a classical neighborhood model. Both regularized regression and classical methods can interpolate out-of-sample tree growth, but future research must determine whether predictions can be extrapolated to trees experiencing novel conditions. Overall, we conclude that regularized regression methods can complement classical methods in the investigation of tree growth drivers and represent a valuable tool for advancing this field toward prediction.
more »
« less
Predicting Mechanical Properties of High-Performance Fiber-Reinforced Cementitious Composites by Integrating Micromechanics and Machine Learning
Current development of high-performance fiber-reinforced cementitious composites (HPFRCC) mainly relies on intensive experiments. The main purpose of this study is to develop a machine learning method for effective and efficient discovery and development of HPFRCC. Specifically, this research develops machine learning models to predict the mechanical properties of HPFRCC through innovative incorporation of micromechanics, aiming to increase the prediction accuracy and generalization performance by enriching and improving the datasets through data cleaning, principal component analysis (PCA), and K-fold cross-validation. This study considers a total of 14 different mix design variables and predicts the ductility of HPFRCC for the first time, in addition to the compressive and tensile strengths. Different types of machine learning methods are investigated and compared, including artificial neural network (ANN), support vector regression (SVR), classification and regression tree (CART), and extreme gradient boosting tree (XGBoost). The results show that the developed machine learning models can reasonably predict the concerned mechanical properties and can be applied to perform parametric studies for the effects of different mix design variables on the mechanical properties. This study is expected to greatly promote efficient discovery and development of HPFRCC.
more »
« less
- Award ID(s):
- 2046407
- PAR ID:
- 10301378
- Date Published:
- Journal Name:
- Materials
- Volume:
- 14
- Issue:
- 12
- ISSN:
- 1996-1944
- Page Range / eLocation ID:
- 3143
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
A Data-Driven Framework to Select a Cost-Efficient Subset of Parameters to Qualify Sourced MaterialsThe quality of powder processed for manufacturing can be certified by hundreds of different variables. Assessing the impact of all these different metrics on the performance of additively manufactured engineered products is an invaluable, but time intensive specification process. In this work, a comprehensive, generalizable, data-driven framework was implemented to select the optimal powder processing and microstructure variables that are required to predict the target property variables. The framework was demonstrated on a high-dimensional dataset collected from selective laser melted, additively manufactured, Inconel 718. One hundred and twenty-nine powder quality variables including particle morphology, rheology, chemical composition, and build composition, were assessed for their impact on eight microstructural features and sixteen mechanical properties. The importance of each powder and microstructure variable was determined by using statistical analysis and machine learning models. The trained models predicted target mechanical properties with an R2 value of 0.9 or higher. The results indicate that the desired mechanical properties can be achieved by controlling only a few critical powder properties and without the need for collecting microstructure data. This framework significantly reduces the time and cost of qualifying source materials for production by determining an optimal subset of experiments needed to predict that a given source material will lead to a desired outcome. This general framework can be easily applied to other material systems.more » « less
-
Abstract Ultrahigh temperature ceramics (UHTCs) have melting points above 3000°C and outstanding strength at high temperatures, thus making them apposite structural materials for high‐temperature applications. Di‐borides, nitride, and carbide compounds—processed via various techniques—have been extensively studied and used in the manufacture of UHTCs. Current analytical models, based on our current but incomplete understanding of the theory, are unable to produce a priori predictions of mechanical properties of UHTCs based on their mixture designs and processing parameters. As a result, researchers have to rely on experiments—which are often costly and time‐consuming—to understand composition–structure–performance links in UHTCs. This study employs machine learning (ML) models (i.e., random forest and artificial neural network models) to predict Young's modulus, flexural strength, and fracture toughness of UHTCs in relation to a wide range of mixture designs, processing parameters, and testing conditions. Outcomes demonstrate that adequately trained ML models can yield reliable predictions, a priori, of the three aforesaid mechanical properties. The prediction performance on Young's modulus is superior to flexural strength and fracture toughness. Next, the ML model with the best prediction performance is utilized to evaluate and rank the impacts of input variables on Young's modulus. Finally, on the basis of such classification of consequential and inconsequential input variables, this study develops an easy‐to‐use, closed‐form analytical model to predict Young's modulus of UHTCs. Overall, this study highlights the ability of data‐driven numerical models to complement, or even replace, time‐consuming experiments, thereby accelerating the development of UHTCs.more » « less
-
Limestone calcined clay cement (LC3) is a sustainable alternative to ordinary Portland cement, capable of reducing the binder’s carbon footprint by 40% while satisfying all key performance metrics. The inherent compositional heterogeneity in select components of LC3, combined with their convoluted chemical interactions, poses challenges to conventional analytical models when predicting mechanical properties. Although some studies have employed machine learning (ML) to predict the mechanical properties of LC3, many have overlooked the pivotal role of feature selection. Proper feature selection not only refines and simplifies the structure of ML models but also enhances these models’ prediction performance and interpretability. This research harnesses the power of the random forest (RF) model to predict the compressive strength of LC3. Three feature reduction methods—Pearson correlation, SHapley Additive exPlanations, and variable importance—are employed to analyze the influence of LC3 components and mixture design on compressive strength. Practical guidelines for utilizing these methods on cementitious materials are elucidated. Through the rigorous screening of insignificant variables from the database, the RF model conserves computational resources while also producing high-fidelity predictions. Additionally, a feature enhancement method is utilized, consolidating numerous input variables into a singular feature while feeding the RF model with richer information, resulting in a substantial improvement in prediction accuracy. Overall, this study provides a novel pathway to apply ML to LC3, emphasizing the need to tailor ML models to cement chemistry rather than employing them generically.more » « less
-
Abstract Singlet fission (SF), the conversion of one singlet exciton into two triplet excitons, could significantly enhance solar cell efficiency. Molecular crystals that undergo SF are scarce. Computational exploration may accelerate the discovery of SF materials. However, many-body perturbation theory (MBPT) calculations of the excitonic properties of molecular crystals are impractical for large-scale materials screening. We use the sure-independence-screening-and-sparsifying-operator (SISSO) machine-learning algorithm to generate computationally efficient models that can predict the MBPT thermodynamic driving force for SF for a dataset of 101 polycyclic aromatic hydrocarbons (PAH101). SISSO generates models by iteratively combining physical primary features. The best models are selected by linear regression with cross-validation. The SISSO models successfully predict the SF driving force with errors below 0.2 eV. Based on the cost, accuracy, and classification performance of SISSO models, we propose a hierarchical materials screening workflow. Three potential SF candidates are found in the PAH101 set.more » « less
An official website of the United States government

