NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Metabolic engineering of Bcat1, Adh5 and Hahdb towards controlling metabolic inhibitors and improving performance in CHO cell-cultures

https://doi.org/10.1016/j.bej.2024.109282

Kuang, Bingyu; Hoang, Duc; Fu, Qiang; Park, SoYoung; Liang, George; Yoon, Seongkyu (June 2024, Biochemical Engineering Journal)

Full Text Available
Don’t Just Prune by Magnitude! Your Mask Topology is Another Secret Weapon

Hoang, Duc; Kundu, Souvik; Liu, Shiwei; Wang, Zhangyang (December 2023, Advances in neural information processing systems)

Recent years have witnessed significant progress in understanding the relationship between the connectivity of a deep network's architecture as a graph, and the network's performance. A few prior arts connected deep architectures to expander graphs or Ramanujan graphs, and particularly,[7] demonstrated the use of such graph connectivity measures with ranking and relative performance of various obtained sparse sub-networks (i.e. models with prune masks) without the need for training. However, no prior work explicitly explores the role of parameters in the graph's connectivity, making the graph-based understanding of prune masks and the magnitude/gradient-based pruning practice isolated from one another. This paper strives to fill in this gap, by analyzing the Weighted Spectral Gap of Ramanujan structures in sparse neural networks and investigates its correlation with final performance. We specifically examine the evolution of sparse structures under a popular dynamic sparse-to-sparse network training scheme, and intriguingly find that the generated random topologies inherently maximize Ramanujan graphs. We also identify a strong correlation between masks, performance, and the weighted spectral gap. Leveraging this observation, we propose to construct a new "full-spectrum coordinate'' aiming to comprehensively characterize a sparse neural network's promise. Concretely, it consists of the classical Ramanujan's gap (structure), our proposed weighted spectral gap (parameters), and the constituent nested regular graphs within. In this new coordinate system, a sparse subnetwork's L2-distance from its original initialization is found to have nearly linear correlated with its performance. Eventually, we apply this unified perspective to develop a new actionable pruning method, by sampling sparse masks to maximize the L2-coordinate distance. Our method can be augmented with the "pruning at initialization" (PaI) method, and significantly outperforms existing PaI methods. With only a few iterations of training (e.g 500 iterations), we can get LTH-comparable performance as that yielded via "pruning after training", significantly saving pre-training costs. Codes can be found at: https://github.com/VITA-Group/FullSpectrum-PAI.
more » « less
Full Text Available
Zero-Shot Neural Architecture Search: Challenges, Solutions, and Opportunities

https://doi.org/10.1109/TPAMI.2024.3395423

Li, Guihong; Hoang, Duc; Bhardwaj, Kartikeya; Lin, Ming; Wang, Zhangyang; Marculescu, Radu (January 2024, IEEE Transactions on Pattern Analysis and Machine Intelligence)

Recently, zero-shot (or training-free) Neural Architecture Search (NAS) approaches have been proposed to liberate NAS from the expensive training process. The key idea behind zero-shot NAS approaches is to design proxies that can predict the accuracy of some given networks without training the network parameters. The proxies proposed so far are usually inspired by recent progress in theoretical understanding of deep learning and have shown great potential on several datasets and NAS benchmarks. This paper aims to comprehensively review and compare the state-of-the-art (SOTA) zero-shot NAS approaches, with an emphasis on their hardware awareness. To this end, we first review the mainstream zero-shot proxies and discuss their theoretical underpinnings. We then compare these zero-shot proxies through large-scale experiments and demonstrate their effectiveness in both hardware-aware and hardware-oblivious NAS scenarios. Finally, we point out several promising ideas to design better proxies.
more » « less
Full Text Available
Revisiting Pruning at Initialization Through the Lens of Ramanujan Graph

Hoang, Duc; Liu, Shiwei; Marculescu, Radu; Wang, Zhangyang (April 2023, International Conference on Learning Representations)

Pruning neural networks at initialization (PaI) has received an upsurge of interest due to its end-to-end saving potential. PaI is able to find sparse subnetworks at initialization that can achieve comparable performance to the full networks. These methods can surpass the trivial baseline of random pruning but suffer from a significant performance gap compared to post-training pruning. Previous approaches firmly rely on weights, gradients, and sanity checks as primary signals when conducting PaI analysis. To better understand the underlying mechanism of PaI, we propose to interpret it through the lens of the Ramanujan Graph - a class of expander graphs that are sparse while being highly connected. It is often believed there should be a strong correlation between the Ramanujan graph and PaI since both are about finding sparse and well-connected neural networks. However, the finer-grained link relating highly sparse and connected networks to their relative performance (i.e., ranking of difference sparse structures at the same specific global sparsity) is still missing. We observe that not only the Ramanujan property for sparse networks shows no significant relationship to PaI’s relative performance, but maximizing it can also lead to the formation of pseudo-random graphs with no structural meanings. We reveal the underlying cause to be Ramanujan Graph’s strong assumption on the upper bound of the largest nontrivial eigenvalue (µˆ) of layers belonging to highly sparse networks. We hence propose Iterative Mean Difference of Bound (IMDB) as a mean to relax the µˆ upper bound. Likewise, we also show there exists a lower bound for µˆ, which we call the Normalized Random Coefficient (NaRC), that gives us an accurate assessment for when sparse but highly connected structure degenerates into naive randomness. Finally, we systematically analyze the behavior of various PaI methods and demonstrate the utility of our proposed metrics in characterizing PaI performance. We show that subnetworks preserving better the IMDB property correlate higher in performance, while NaRC provides us with a possible mean to locate the region where highly connected, highly sparse, and non-trivial Ramanujan expanders exist. Our code is available at: https://github.com/VITA-Group/ramanujan-on-pai.
more » « less
Full Text Available
Modulation of nutrient precursors for controlling metabolic inhibitors by flux balance analysis

https://doi.org/10.1002/btpr.3313

Hoang, Duc; Kuang, Bingyu; Liang, George; Wang, Zhao; Yoon, Seongkyu (October 2022, Biotechnology Progress)

Full Text Available
Characterization of Chinese hamster ovary cell culture feed media precipitate

https://doi.org/10.1002/btpr.3188

Hoang, Duc; Galbraith, Shaun; Kuang, Bingyu; Johnson, Amy; Yoon, Seongkyu (September 2021, Biotechnology Progress)

Full Text Available
Identification of novel inhibitory metabolites and impact verification on growth and protein synthesis in mammalian cells

https://doi.org/10.1016/j.mec.2021.e00182

Kuang, Bingyu; Dhara, Venkata Gayatri; Hoang, Duc; Jenkins, Jack; Ladiwala, Pranay; Tan, Yanglan; Shaffer, Scott A.; Galbraith, Shaun C.; Betenbaugh, Michael J.; Yoon, Seongkyu (December 2021, Metabolic Engineering Communications)

Full Text Available
Fast convolutional neural networks on FPGAs with hls4ml

https://doi.org/10.1088/2632-2153/ac0ea1

Aarrestad, Thea; Loncar, Vladimir; Ghielmetti, Nicolò; Pierini, Maurizio; Summers, Sioni; Ngadiuba, Jennifer; Petersson, Christoffer; Linander, Hampus; Iiyama, Yutaro; Di Guglielmo, Giuseppe; et al (July 2021, Machine Learning: Science and Technology)
null (Ed.)
Full Text Available
Addressing amino acid‐derived inhibitory metabolites and enhancing CHO cell culture performance through DOE‐guided media modifications

https://doi.org/10.1002/bit.28403

Ladiwala, Pranay; Dhara, Venkata Gayatri; Jenkins, Jackson; Kuang, Bingyu; Hoang, Duc; Yoon, Seongkyu; Betenbaugh, Michael J. (April 2023, Biotechnology and Bioengineering)

Abstract Previously, we identified six inhibitory metabolites (IMs) accumulating in Chinese hamster ovary (CHO) cultures using AMBIC 1.0 community reference medium that negatively impacted culture performance. The goal of the current study was to modify the medium to control IM accumulation through design of experiments (DOE). Initial over‐supplementation of precursor amino acids (AAs) by 100% to 200% in the culture medium revealed positive correlations between initial AA concentrations and IM levels. A screening design identified 5 AA targets, Lys, Ile, Trp, Leu, Arg, as key contributors to IMs. Response surface design analysis was used to reduce initial AA levels between 13% and 33%, and these were then evaluated in batch and fed‐batch cultures. Lowering AAs in basal and feed medium and reducing feed rate from 10% to 5% reduced inhibitory metabolites HICA and NAP by up to 50%, MSA by 30%, and CMP by 15%. These reductions were accompanied by a 13% to 40% improvement in peak viable cell densities and 7% to 50% enhancement in IgG production in batch and fed‐batch processes, respectively. This study demonstrates the value of tuning specific AA levels in reference basal and feed media using statistical design methodologies to lower problematic IMs.
more » « less
hls4ml: An Open-Source Codesign Workflow to Empower Scientific Low-Power Machine Learning Devices

Fahim, Farah; Hawks, Benjamin; Herwig, Christian; Hirschauer, James; Jindariani, Serge; Nhan, Trần; Carloni, Luca; DiGuglielmo, Giuseppe; Harris, Phillip; Krupa, Jeffrey; et al (April 2021, ArXivorg)
null (Ed.)
Accessible machine learning algorithms, software, and diagnostic tools for energy-efficient devices and systems are extremely valuable across a broad range of application domains. In scientific domains, real-time near-sensor processing can drastically improve experimental design and accelerate scientific discoveries. To support domain scientists, we have developed hls4ml, an open-source software-hardware codesign workflow to interpret and translate machine learning algorithms for implementation with both FPGA and ASIC technologies. We expand on previous hls4ml work by extending capabilities and techniques towards low-power implementations and increased usability: new Python APIs, quantization-aware pruning, end-to-end FPGA workflows, long pipeline kernels for low power, and new device backends include an ASIC workflow. Taken together, these and continued efforts in hls4ml will arm a new generation of domain scientists with accessible, efficient, and powerful tools for machine-learning-accelerated discovery.
more » « less
Full Text Available

« Prev Next »

Search for: All records