NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Multi‐cell‐line learning for the data‐driven construction of mechanistic metabolic models

https://doi.org/10.1002/bit.28757

Lu, Yen‐An; McCann, Meghan_G; Hu, Wei‐Shou; Zhang, Qi (June 2024, Biotechnology and Bioengineering)

Abstract Mammalian cells are commonly used as hosts in cell culture for biologics production in the pharmaceutical industry. Structured mechanistic models of metabolism have been used to capture complex cellular mechanisms that contribute to varying metabolic shifts in different cell lines. However, little research has focused on the impact of temporal changes in enzyme abundance and activity on the modeling of cell metabolism. In this work, we present a framework for constructing mechanistic models of metabolism that integrate growth‐signaling control of enzyme activity and transcript dynamics. The proposed approach is applied to build models for three Chinese hamster ovary (CHO) cell lines using fed‐batch culture data and time‐series transcript profiles. Leveraging information from the transcriptome data, we develop a parameter estimation approach based on multi‐cell‐line (MCL) learning, which combines data sets from different cell lines and trains the individual cell‐line models jointly to improve model accuracy. The computational results demonstrate the important role of growth signaling and transcript variability in metabolic models as well as the virtue of the MCL approach for constructing cell‐line models with a limited amount of data. The resulting models exhibit a high level of accuracy in predicting distinct metabolic behaviors in the different cell lines; these models can potentially be used to accelerate the process and cell‐line development for the biomanufacturing of new protein therapeutics.
more » « less
Data‐driven decision‐focused surrogate modeling

https://doi.org/10.1002/aic.18338

Gupta, Rishabh; Zhang, Qi (January 2024, AIChE Journal)

Abstract We introduce the concept of decision‐focused surrogate modeling for solving computationally challenging nonlinear optimization problems in real‐time settings. The proposed data‐driven framework seeks to learn a simpler, for example, convex, surrogate optimization model that is trained to minimize thedecision prediction error, which is defined as the difference between the optimal solutions of the original and the surrogate optimization models. The learning problem, formulated as a bilevel program, can be viewed as a data‐driven inverse optimization problem to which we apply a decomposition‐based solution algorithm from previous work. We validate our framework through numerical experiments involving the optimization of common nonlinear chemical processes such as chemical reactors, heat exchanger networks, and material blending systems. We also present a detailed comparison of decision‐focused surrogate modeling with standard data‐driven surrogate modeling methods and demonstrate that our approach is significantly more data‐efficient while producing simple surrogate models with high decision prediction accuracy.
more » « less
Recurrent Neural Networks for Forecasting Time-Varying Process Behavior in Mammalian Cell Culture

https://doi.org/10.1021/acs.iecr.4c03986

Lu, Yen-An; Fukae, Yudai; Hu, Wei-Shou; Zhang, Qi (May 2025, Industrial & Engineering Chemistry Research)

Free, publicly-accessible full text available May 7, 2026
Decision-focused surrogate modeling for mixed-integer linear optimization

Dixit, Shivi; Gupta, Rishabh; Zhang, Qi (March 2025, Transactions on Machine Learning Research)

Free, publicly-accessible full text available March 27, 2026
BO4IO: A Bayesian optimization approach to inverse optimization with uncertainty quantification

https://doi.org/10.1016/j.compchemeng.2024.108859

Lu, Yen-An; Hu, Wei-Shou; Paulson, Joel A; Zhang, Qi (January 2025, Computers & Chemical Engineering)

Full Text Available
Enhancing explainability of stochastic programming solutions via scenario and recourse reduction

https://doi.org/10.1007/s11081-023-09825-9

Rathi, Tushar; Gupta, Rishabh; Pinto, Jose M; Zhang, Qi (February 2024, Optimization and Engineering)

Full Text Available
An inverse optimization approach to decision-focused learning

https://doi.org/10.1016/B978-0-443-15274-0.50216-X

Gupta, R; Zhang, Q (October 2023, Computer Aided Chemical Engineering)

Full Text Available
Efficient learning of decision-making models: A penalty block coordinate descent algorithm for data-driven inverse optimization

https://doi.org/10.1016/j.compchemeng.2022.108123

Gupta, Rishabh; Zhang, Qi (February 2023, Computers & Chemical Engineering)

Full Text Available
Kinetic‐model‐based pathway optimization with application to reverse glycolysis in mammalian cells

https://doi.org/10.1002/bit.28249

Lu, Yen‐An; Brien, Conor M.; Mashek, Douglas G.; Hu, Wei‐Shou; Zhang, Qi (January 2023, Biotechnology and Bioengineering)

Full Text Available
Decomposition and Adaptive Sampling for Data-Driven Inverse Linear Optimization

https://doi.org/10.1287/ijoc.2022.1162

Gupta, Rishabh; Zhang, Qi (September 2022, INFORMS Journal on Computing)

This work addresses inverse linear optimization, where the goal is to infer the unknown cost vector of a linear program. Specifically, we consider the data-driven setting in which the available data are noisy observations of optimal solutions that correspond to different instances of the linear program. We introduce a new formulation of the problem that, compared with other existing methods, allows the recovery of a less restrictive and generally more appropriate admissible set of cost estimates. It can be shown that this inverse optimization problem yields a finite number of solutions, and we develop an exact two-phase algorithm to determine all such solutions. Moreover, we propose an efficient decomposition algorithm to solve large instances of the problem. The algorithm extends naturally to an online learning environment where it can be used to provide quick updates of the cost estimate as new data become available over time. For the online setting, we further develop an effective adaptive sampling strategy that guides the selection of the next samples. The efficacy of the proposed methods is demonstrated in computational experiments involving two applications: customer preference learning and cost estimation for production planning. The results show significant reductions in computation and sampling efforts. Summary of Contribution: Using optimization to facilitate decision making is at the core of operations research. This work addresses the inverse problem (i.e., inverse optimization), which aims to infer unknown optimization models from decision data. It is, conceptually and computationally, a challenging problem. Here, we propose a new formulation of the data-driven inverse linear optimization problem and develop an efficient decomposition algorithm that can solve problem instances up to a scale that has not been addressed previously. The computational performance is further improved by an online adaptive sampling strategy that substantially reduces the number of required data points.
more » « less
Full Text Available

« Prev Next »

Search for: All records