Abstract Highly selective C−H functionalization remains an ongoing challenge in organic synthetic methodologies. Biocatalysts are robust tools for achieving these difficult chemical transformations. Biocatalyst engineering has often required directed evolution or structure‐based rational design campaigns to improve their activities. In recent years, machine learning has been integrated into these workflows to improve the discovery of beneficial enzyme variants. In this work, we combine a structure‐based self‐supervised machine learning framework, MutComputeX, with classical molecular dynamics simulations to down select mutations for rational design of a non‐heme iron‐dependent lysine dioxygenase, LDO. This approach consistently resulted in functional LDO mutants and circumvents the need for extensive study of mutational activity before‐hand. Our rationally designed single mutants purified with up to 2‐fold higher expression yields than WT and displayed higher total turnover numbers (TTN). Combining five such single mutations into a pentamutant variant, LPNYI LDO, leads to a 40 % improvement in the TTN (218±3) as compared to WT LDO (TTN=160±2). Overall, this work offers a low‐barrier approach for those seeking to synergize machine learning algorithms with pre‐existing protein engineering strategies.
more »
« less
Tackling Photonic Inverse Design with Machine Learning
Abstract Machine learning, as a study of algorithms that automate prediction and decision‐making based on complex data, has become one of the most effective tools in the study of artificial intelligence. In recent years, scientific communities have been gradually merging data‐driven approaches with research, enabling dramatic progress in revealing underlying mechanisms, predicting essential properties, and discovering unconventional phenomena. It is becoming an indispensable tool in the fields of, for instance, quantum physics, organic chemistry, and medical imaging. Very recently, machine learning has been adopted in the research of photonics and optics as an alternative approach to address the inverse design problem. In this report, the fast advances of machine‐learning‐enabled photonic design strategies in the past few years are summarized. In particular, deep learning methods, a subset of machine learning algorithms, dealing with intractable high degrees‐of‐freedom structure design are focused upon.
more »
« less
- Award ID(s):
- 2004749
- PAR ID:
- 10380611
- Publisher / Repository:
- Wiley Blackwell (John Wiley & Sons)
- Date Published:
- Journal Name:
- Advanced Science
- Volume:
- 8
- Issue:
- 5
- ISSN:
- 2198-3844
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Research into "gaming the system" behavior in intelligent tutoring systems (ITS) has been around for almost two decades, and detection has been developed for many ITSs. Machine learning models can detect this behavior in both real-time and in historical data. However, intelligent tutoring system designs often change over time, in terms of the design of the student interface, assessment models, and data collection log schemas. Can gaming detectors still be trusted, a decade or more after they are developed? In this research, we evaluate the robustness/degradation of gaming detectors when trained on older data logs and evaluated on current data logs. We demonstrate that some machine learning models developed using past data are still able to predict gaming behavior from student data collected 16 years later, but that there is considerable variance in how well different algorithms perform over time. We demonstrate that a classic decision tree algorithm maintained its performance while more contemporary algorithms struggled to transfer to new data, even though they exhibited better performance on unseen students in both New and Old data sets by themselves. Examining the feature importance values provides some explanation for the differences in performance between models, and offers some insight into how we might safeguard against detector rot over time.more » « less
-
Research into "gaming the system" behavior in intelligent tutoring systems (ITS) has been around for almost two decades, and detection has been developed for many ITSs. Machine learning models can detect this behavior in both real-time and in historical data. However, intelligent tutoring system designs often change over time, in terms of the design of the student interface, assessment models, and data collection log schemas. Can gaming detectors still be trusted, a decade or more after they are developed? In this research, we evaluate the robustness/degradation of gaming detectors when trained on old data logs and evaluated on current data logs. We demonstrate that some machine learning models developed using past data are still able to predict gaming behavior from student data collected 16 years later, but that there is considerable variance in how well different algorithms perform over time. We demonstrate that a classic decision tree algorithm maintained its performance while more contemporary algorithms struggled to transfer to new data, even though they exhibited better performance on both new and old data alone. Examining the feature importances provides some explanation for the differences in performance between models, and offers some insight into how we might safeguard against detector rot over time.more » « less
-
Abstract Direct ink writing (DIW) is an extrusion-based additive manufacturing technology. It has gained wide attentions in both industry and research because of its simple design and versatile platform. In electric-field-assisted Direct Ink Writing (eDIW) processes, an external electric field is added between the nozzle and the printing substrate to manipulate the ink-substrate wetting dynamics and therefore optimize the ink printability. eDIW was found effective in printing liquids that are typically difficult to print in the conventional DIW processes. In this paper, an eDIW process modeling system based on machine learning (ML) algorithms is developed. The system is found effective in predicting eDIW printing geometry under varied process parameter settings. Image processing approaches to collect experiment data are developed. Accuracies of different machine learning algorithms for predicting printing results and trace width are compared and discussed. The capabilities, applications and limitations of the presented machine learning-based modeling approach are presented.more » « less
-
Abstract Machine learning (ML) has been broadly applied for vadose zone applications in recent years. This article provides a comprehensive review of such developments. ML applications for variables corresponding to different complex vadose zone processes are summarized mostly in a prediction context. By analyzing and assessing these applications, we discovered extensive usages of classic ML models with relatively limited applications of deep learning (DL) approaches in general. We also recognized a lack of benchmark datasets for soil property research as well as limited integration of physics‐based vadose zone principles into the ML approaches. To facilitate this interdisciplinary research of ML in vadose zone characterization and processes, a paradigm of knowledge‐guided machine learning is suggested along with other data‐driven and ML model‐based research suggestions to advance future research.more » « less
An official website of the United States government
