Accelerated knowledge discovery from omics data by optimal experimental design

Wang, Xiaokang; Rai, Navneet; Merchel Piovesan Pereira, Beatriz; Eetemadi, Ameen; Tagkopoulos, Ilias (ORCID:0000000311047616)

doi:10.1038/s41467-020-18785-y

Citation Details

Accelerated knowledge discovery from omics data by optimal experimental design

Abstract How to design experiments that accelerate knowledge discovery on complex biological landscapes remains a tantalizing question. We present an optimal experimental design method (coined OPEX) to identify informative omics experiments using machine learning models for both experimental space exploration and model training. OPEX-guided exploration ofEscherichia coli’s populations exposed to biocide and antibiotic combinations lead to more accurate predictive models of gene expression with 44% less data. Analysis of the proposed experiments shows that broad exploration of the experimental space followed by fine-tuning emerges as the optimal strategy. Additionally, analysis of the experimental data reveals 29 cases of cross-stress protection and 4 cases of cross-stress vulnerability. Further validation reveals the central role of chaperones, stress response proteins and transport pumps in cross-stress exposure. This work demonstrates how active learning can be used to guide omics data collection for training predictive models, making evidence-driven decisions and accelerating knowledge discovery in life sciences. more »

Award ID(s):: 1934568 1743101

PAR ID:: 10196951

Author(s) / Creator(s):: Wang, Xiaokang; Rai, Navneet; Merchel Piovesan Pereira, Beatriz; Eetemadi, Ameen; Tagkopoulos, Ilias

Publisher / Repository:: Nature Publishing Group

Date Published:: 2020-10-06

Journal Name:: Nature Communications

Volume:: 11

Issue:: 1

ISSN:: 2041-1723

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Journal Article:
https://doi.org/10.1038/s41467-020-18785-y

More Like this