Title: Machine Learning: New Ideas and Tools in Environmental Science and Engineering
The rapid increase in both quantity and complexity of data that are being generated daily in the field of environmental science and engineering (ESE) demands accompanied advancement in data analytics. Advanced data analysis approaches, such as machine learning (ML), have become indispensable tools for revealing hidden patterns or deducing correlations for which conventional analytical methods face limitations or challenges. However, ML concepts and practices have not been widely utilized by researchers in ESE. This feature explores the potential of ML to revolutionize data analysis and modeling in the ESE field, and covers the essential knowledge needed for such applications. First, we use five examples to illustrate how ML addresses complex ESE problems. We then summarize four major types of applications of ML in ESE: making predictions; extracting feature importance; detecting anomalies; and discovering new materials or chemicals. Next, we introduce the essential knowledge required and current shortcomings in ML applications in ESE, with a focus on three important but often overlooked components when applying ML: correct model development; proper model interpretation; and sound applicability analysis. Finally, we discuss challenges and future opportunities in the application of ML tools in ESE to highlight the potential of ML in this field. more »« less
Ravinder, R.; Venugopal, Vineeth; Bishnoi, Suresh; Singh, Sourabh; Zaki, Mohd; Grover, Hargun Singh; Bauchy, Mathieu; Agarwal, Manish; Krishnan, N. M.
(, International journal of applied glass science)
null
(Ed.)
Glasses have been an integral part of human life for more than 2000 years. Despite several years of research and analysis, some fundamental and practical questions on glasses still remain unanswered. While most of the earlier approaches were based on (i) expert knowledge and intuition, (ii) Edisonian trial and error, or (iii) physics-driven modeling and analysis, recent studies suggest that data-driven techniques, such as artificial intelligence (AI) and machine learning (ML), can provide fresh perspectives to tackle some of these questions. In this article, we identify 21 grand challenges in glass science, the solutions of which are either enabling AI and ML or enabled by AI and ML to accelerate the field of glass science. The challenges presented here range from fundamental questions related to glass formation and composition–processing–property relationships to industrial problems such as automated flaw detection in glass manufacturing. We believe that the present article will instill enthusiasm among the readers to explore some of the grand challenges outlined here and to discover many more challenges that can advance the field of glass science, engineering, and technology.
Shi, Zhe; Tsymbalov, Evgenii; Shi, Wencong; Barr, Ariel; Li, Qingjie; Li, Jiangxu; Chen, Xing-Qiu; Dao, Ming; Suresh, Subra; Li, Ju
(, Proceedings of the National Academy of Sciences)
Recent studies have reported the experimental discovery that nanoscale specimens of even a natural material, such as diamond, can be deformed elastically to as much as 10% tensile elastic strain at room temperature without the onset of permanent damage or fracture. Computational work combining ab initio calculations and machine learning (ML) algorithms has further demonstrated that the bandgap of diamond can be altered significantly purely by reversible elastic straining. These findings open up unprecedented possibilities for designing materials and devices with extreme physical properties and performance characteristics for a variety of technological applications. However, a general scientific framework to guide the design of engineering materials through such elastic strain engineering (ESE) has not yet been developed. By combining first-principles calculations with ML, we present here a general approach to map out the entire phonon stability boundary in six-dimensional strain space, which can guide the ESE of a material without phase transitions. We focus on ESE of vibrational properties, including harmonic phonon dispersions, nonlinear phonon scattering, and thermal conductivity. While the framework presented here can be applied to any material, we show as an example demonstration that the room-temperature lattice thermal conductivity of diamond can be increased by more than 100% or reduced by more than 95% purely by ESE, without triggering phonon instabilities. Such a framework opens the door for tailoring of thermal-barrier, thermoelectric, and electro-optical properties of materials and devices through the purposeful design of homogeneous or inhomogeneous strains.
Abstract We discuss the emerging advances and opportunities at the intersection of machine learning (ML) and climate physics, highlighting the use of ML techniques, including supervised, unsupervised, and equation discovery, to accelerate climate knowledge discoveries and simulations. We delineate two distinct yet complementary aspects: (a) ML for climate physics and (b) ML for climate simulations. Although physics-free ML-based models, such as ML-based weather forecasting, have demonstrated success when data are abundant and stationary, the physics knowledge and interpretability of ML models become crucial in the small-data/nonstationary regime to ensure generalizability. Given the absence of observations, the long-term future climate falls into the small-data regime. Therefore, ML for climate physics holds a critical role in addressing the challenges of ML for climate simulations. We emphasize the need for collaboration among climate physics, ML theory, and numerical analysis to achieve reliable ML-based models for climate applications.
Abstract Machine learning (ML) provides a powerful framework for the analysis of high‐dimensional datasets by modelling complex relationships, often encountered in modern data with many variables, cases and potentially non‐linear effects. The impact of ML methods on research and practical applications in the educational sciences is still limited, but continuously grows, as larger and more complex datasets become available through massive open online courses (MOOCs) and large‐scale investigations. The educational sciences are at a crucial pivot point, because of the anticipated impact ML methods hold for the field. To provide educational researchers with an elaborate introduction to the topic, we provide an instructional summary of the opportunities and challenges of ML for the educational sciences, show how a look at related disciplines can help learning from their experiences, and argue for a philosophical shift in model evaluation. We demonstrate how the overall quality of data analysis in educational research can benefit from these methods and show how ML can play a decisive role in the validation of empirical models. Specifically, we (1) provide an overview of the types of data suitable for ML and (2) give practical advice for the application of ML methods. In each section, we provide analytical examples and reproducible R code. Also, we provide an extensive Appendix on ML‐based applications for education. This instructional summary will help educational scientists and practitioners to prepare for the promises and threats that come with the shift towards digitisation and large‐scale assessment in education. Context and implicationsRationale for this studyIn 2020, the worldwide SARS‐COV‐2 pandemic forced the educational sciences to perform a rapid paradigm shift with classrooms going online around the world—a hardly novel but now strongly catalysed development. In the context of data‐driven education, this paper demonstrates that the widespread adoption of machine learning techniques is central for the educational sciences and shows how these methods will become crucial tools in the collection and analysis of data and in concrete educational applications. Helping to leverage the opportunities and to avoid the common pitfalls of machine learning, this paper provides educators with the theoretical, conceptual and practical essentials.Why the new findings matterThe process of teaching and learning is complex, multifaceted and dynamic. This paper contributes a seminal resource to highlight the digitisation of the educational sciences by demonstrating how new machine learning methods can be effectively and reliably used in research, education and practical application.Implications for educational researchers and policy makersThe progressing digitisation of societies around the globe and the impact of the SARS‐COV‐2 pandemic have highlighted the vulnerabilities and shortcomings of educational systems. These developments have shown the necessity to provide effective educational processes that can support sometimes overwhelmed teachers to digitally impart knowledge on the plan of many governments and policy makers. Educational scientists, corporate partners and stakeholders can make use of machine learning techniques to develop advanced, scalable educational processes that account for individual needs of learners and that can complement and support existing learning infrastructure. The proper use of machine learning methods can contribute essential applications to the educational sciences, such as (semi‐)automated assessments, algorithmic‐grading, personalised feedback and adaptive learning approaches. However, these promises are strongly tied to an at least basic understanding of the concepts of machine learning and a degree of data literacy, which has to become the standard in education and the educational sciences.Demonstrating both the promises and the challenges that are inherent to the collection and the analysis of large educational data with machine learning, this paper covers the essential topics that their application requires and provides easy‐to‐follow resources and code to facilitate the process of adoption.
Organic molecules and polymers have a broad range of applications in biomedical, chemical, and materials science fields. Traditional design approaches for organic molecules and polymers are mainly experimentally-driven, guided by experience, intuition, and conceptual insights. Though they have been successfully applied to discover many important materials, these methods are facing significant challenges due to the tremendous demand of new materials and vast design space of organic molecules and polymers. Accelerated and inverse materials design is an ideal solution to these challenges. With advancements in high-throughput computation, artificial intelligence (especially machining learning, ML), and the growth of materials databases, ML-assisted materials design is emerging as a promising tool to flourish breakthroughs in many areas of materials science and engineering. To date, using ML-assisted approaches, the quantitative structure property/activity relation for material property prediction can be established more accurately and efficiently. In addition, materials design can be revolutionized and accelerated much faster than ever, through ML-enabled molecular generation and inverse molecular design. In this perspective, we review the recent progresses in ML-guided design of organic molecules and polymers, highlight several successful examples, and examine future opportunities in biomedical, chemical, and materials science fields. We further discuss the relevant challenges to solve in order to fully realize the potential of ML-assisted materials design for organic molecules and polymers. In particular, this study summarizes publicly available materials databases, feature representations for organic molecules, open-source tools for feature generation, methods for molecular generation, and ML models for prediction of material properties, which serve as a tutorial for researchers who have little experience with ML before and want to apply ML for various applications. Last but not least, it draws insights into the current limitations of ML-guided design of organic molecules and polymers. We anticipate that ML-assisted materials design for organic molecules and polymers will be the driving force in the near future, to meet the tremendous demand of new materials with tailored properties in different fields.
@article{osti_10289361,
place = {Country unknown/Code not available},
title = {Machine Learning: New Ideas and Tools in Environmental Science and Engineering},
url = {https://par.nsf.gov/biblio/10289361},
DOI = {10.1021/acs.est.1c01339},
abstractNote = {The rapid increase in both quantity and complexity of data that are being generated daily in the field of environmental science and engineering (ESE) demands accompanied advancement in data analytics. Advanced data analysis approaches, such as machine learning (ML), have become indispensable tools for revealing hidden patterns or deducing correlations for which conventional analytical methods face limitations or challenges. However, ML concepts and practices have not been widely utilized by researchers in ESE. This feature explores the potential of ML to revolutionize data analysis and modeling in the ESE field, and covers the essential knowledge needed for such applications. First, we use five examples to illustrate how ML addresses complex ESE problems. We then summarize four major types of applications of ML in ESE: making predictions; extracting feature importance; detecting anomalies; and discovering new materials or chemicals. Next, we introduce the essential knowledge required and current shortcomings in ML applications in ESE, with a focus on three important but often overlooked components when applying ML: correct model development; proper model interpretation; and sound applicability analysis. Finally, we discuss challenges and future opportunities in the application of ML tools in ESE to highlight the potential of ML in this field.},
journal = {Environmental Science & Technology},
volume = {ASAP},
author = {Zhong, Shifa and Zhang, Kai and Bagheri, Majid and Burken, Joel G. and Gu, April and Li, Baikun and Ma, Xingmao and Marrone, Babetta L. and Ren, Zhiyong Jason and Schrier, Joshua and Shi, Wei and Tan, Haoyue and Wang, Tianbao and Wang, Xu and Wong, Bryan M. and Xiao, Xusheng and Yu, Xiong and Zhu, Jun-Jie and Zhang, Huichun},
editor = {null}
}
Warning: Leaving National Science Foundation Website
You are now leaving the National Science Foundation website to go to a non-government website.
Website:
NSF takes no responsibility for and exercises no control over the views expressed or the accuracy of
the information contained on this site. Also be aware that NSF's privacy policy does not apply to this site.