<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Practical Layout-Aware Analog/Mixed-Signal Design Automation with Bayesian Neural Networks</title></titleStmt>
			<publicationStmt>
				<publisher>IEEE</publisher>
				<date>10/28/2023</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10569065</idno>
					<idno type="doi">10.1109/ICCAD57390.2023.10323923</idno>
					
					<author>Ahmet F Budak</author><author>Keren Zhu</author><author>David Z Pan</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[The high simulation cost has been a bottleneck of practical analog/mixed-signal design automation. Many learningbased algorithms require thousands of simulated data points, which is impractical for expensive to simulate circuits. We propose a learning-based algorithm that can be trained using a small amount of data and, therefore, scalable to tasks with expensive simulations. Our efficient algorithm solves the postlayout performance optimization problem where simulations are known to be expensive. Our comprehensive study also solves the schematic-level sizing problem. For efficient optimization, we utilize Bayesian Neural Networks as a regression model to approximate circuit performance. For layout-aware optimization, we handle the problem as a multi-fidelity optimization problem and improve efficiency by exploiting the correlations from cheaper evaluations. We present three test cases to demonstrate the efficiency of our algorithms. Our tests prove that the proposed approach is more efficient than conventional baselines and stateof-the-art algorithms.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>I. INTRODUCTION</head><p>Analog/Mixed-signal (AMS) integrated circuit (IC) design typically follows a process flow visualized in Figure <ref type="figure">1</ref>. A combination of designer experience and computer simulation feedback is iterated to determine the design that meets the performance requirements. A large portion of design time is spent on the sizing and layout phases, where multiple iterations are possible due to potential loop-backs in the design flow. This is a labor-intensive process in industry practice with little to no automation. To address this costly exercise, a considerable effort in academia is focused on introducing automated solutions.</p><p>Analog sizing automation is the task of optimizing AMS design variables, e.g., transistor widths, lengths, resistor, and capacitor values. The aim is to satisfy the performance constraints and optimize the design objective. In general, sizing automation is run through schematic-level simulations. However, AMS IC performance is also sensitive to layout implementation <ref type="bibr">[1]</ref>. Especially in the advanced process nodes, layout-induced parasitics may greatly affect the final design performance. Therefore, sizing the AMS design variables considering the layout effects is also crucial.</p><p>The majority of the recent sizing and post-layout performance optimization algorithms have simulation feedback in</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Simulate performance</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>System Specification</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Choose circuit topology (schematic)</head><p>Device sizing</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Physical layout</head><p>Integration/Fabrication Fig. <ref type="figure">1</ref>: AMS Design Flow the loop. Due to advanced scaling, simulations are required to obtain accurate performance evaluations. Simulation-based AMS automation algorithms adapted various methods from the optimization and Machine Learning (ML) communities. The earlier approaches include population-based methods such as particle swarm optimization <ref type="bibr">[2]</ref> and evolutionary algorithms <ref type="bibr">[3]</ref>. Although these algorithms have good convergence behavior, they are inefficient in sampling since they explore the design space randomly. To mitigate sample inefficiency, model-based methods gained popularity <ref type="bibr">[4]</ref>- <ref type="bibr">[6]</ref>. These methods employ surrogate-models between the solution space and performance space and provide efficiency in exploring the solution space. A typical surrogate model is Gaussian Process Regression (GPR) <ref type="bibr">[7]</ref>, which is a well-studied model in Bayesian Optimization (BO) field <ref type="bibr">[8]</ref> and is adapted by several analog sizing algorithms. The main drawback of GPR modeling is its computational complexity. Recent research trend in analog sizing introduces ML to simulation-based methodology <ref type="bibr">[9]</ref>. However, the literature review reveals that most of these methods require thousands of simulation data to train Deep Neural Network (DNN) models that approximate the relations between the design variables and the performance metrics. Therefore, the practicality of these algorithms is severely reduced when the optimization task has a high simulation cost. For example, drawing/generating the layout, extracting the parasitics, and running post-layout simulations is typically an expensive procedure. Therefore, optimization algorithms designed for schematic-level sizing can not be adapted by simply changing how data is generated. This paper presents a Machine Learning-based simulationin-the-loop automation method for the AMS design problem. Overall, we formalize two stand-alone recipes for schematiclevel sizing and post-layout performance optimization, i.e., layout-aware sizing. We integrate the state-of-the-art analog layout generator, MAGICAL <ref type="bibr">[10]</ref>, into our flow to handle layout-aware sizing. Our algorithms do not assume the preexistence of any dataset, and we generate all training data during the optimization. We employ Bayesian Neural Networks (BNN) for modeling design performances. Bayesian Neural Networks allow error quantification, and compared to Deep Neural Networks, BNN are shown to be effective in handling scarce datasets and preventing overfitting <ref type="bibr">[11]</ref>. Therefore, BNN can be trained on smaller datasets, significantly improving the practicality and scalability. We also introduce a batch-optimization framework and design space sampling strategy that is compatible with BNN modeling. Further, when optimizing the design variables based on post-layout performance, we exploit the correlation between schematiclevel simulations and post-layout simulations. Our algorithm introduces a co-learning scheme that reduces the need for costly post-layout simulations and boosts efficiency even further. We compile our contributions as follows:</p><p>&#8226; We use Bayesian Neural Network-based modeling to obtain performance approximations. Different learning strategies are adapted for schematic-level sizing and postlayout performance optimization. &#8226; We adopt a scalable sampling strategy and query the optimization batches by utilizing a trust region and Thompson sampling.</p><p>&#8226; The post-layout sizing is handled as a multi-fidelity optimization problem, and an efficient co-learning strategy is developed. &#8226; The efficiency of the proposed methods is demonstrated on three circuits by providing comparisons to previous state-of-the-art.</p><p>The rest of the paper is organized as follows. Section II introduces the backgrounds and previous work. Section III describes our algorithms for handling schematic-level sizing and post-layout performance-based sizing problems. Section IV provides the experiments on circuit examples to demonstrate the efficiency of our algorithms. Finally, Section V concludes the paper.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>II. BACKGROUND &amp; RELATED WORK</head><p>In this section, we first formally define the AMS design automation problem. Then we review the recent approaches to schematic-level sizing and layout-aware sizing. We summarize the state-of-the-art algorithms' advantages and shortcomings.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. Problem Formulation</head><p>In this paper, we assume that the existence of post-layout performance implies the existence of schematic-level performance values. However, the reverse implication does not hold.</p><p>We formulate the AMS schematic-level sizing and layoutaware sizing task as a constrained optimization problem succinctly as below.</p><p>where, x &#8712; R d is the parameter vector and d is the number of design variables of sizing task. Thus, R d is the design space. f 0 (x) is the objective performance metric we aim to minimize. Without loss of generality, we denote i th constraint by f i (x).</p><p>Notice that if the problem is schematic-level optimization, the f i values are obtained from schematic simulations. If the problem is post-layout optimization, the f i values are determined by post-layout simulations.</p><p>Through this paper, we will evaluate the quality of a design by defining a Figure of Merit (FoM) in the following form:</p><p>where w i is the weighting factor. Note, a max(&#8226;) clipping is used for equating designs after constraints are met, and min(&#8226;) is used to prevent single constraint violation from dominating FoM value.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Schematic-Level Sizing</head><p>The recent methods for AMS sizing can be collected under two algorithm classes: Bayesian Optimization methods and Deep Learning methods.</p><p>Bayesian Optimization methods are tested on AMS problems and are proven to be sample efficient. For example, GASPAD <ref type="bibr">[4]</ref> is a hybrid algorithm using a combination of evolutionary space exploration and GPR surrogate-based selection. WEIBO <ref type="bibr">[5]</ref> method also employs GPR as a surrogate and introduces a Bayesian Optimization framework where a weighted acquisition function is tailored to comply with the performance-constrained nature of sizing problem. In <ref type="bibr">[6]</ref>, the authors introduced a multi-fidelity GPR algorithm where the fidelity of the performance is varied with the simulation accuracy. However, this work did not address the layout effects. The disadvantage shared by all GPR models is their cubic complexity to the number of samples, O(N 3 ).</p><p>Deep Learning based sizing methods includes supervised learning and reinforcement learning (RL) methods <ref type="bibr">[12]</ref>- <ref type="bibr">[16]</ref>. GCN-RL <ref type="bibr">[13]</ref> is a Graph Neural Network algorithm where state representation is built via device index, type, and selected electrical properties. They also propose methods to transfer the optimization experience between different topologies and processes. However, their training graphs show that they use up to 10 4 simulations for sizing academic circuits. AutoCkt <ref type="bibr">[14]</ref> is a discrete action space policy gradient method. The RL agent is trained on different optimization tasks where the task is randomly sampled from a predefined set. The trained agent is then tested for the particular tests during deployment. We also observe from the training graphs that AutoCkt requires up to 10 5 simulated samples for training. In <ref type="bibr">[17]</ref>, the authors successfully applied BNN on multi-objective analog sizing. Si m ul at e t h e q n e w q u er y p oi nts a n d o bt ai n s p e cs f (X t ) vi a S PI C E si ms </p><p>w h er e &#181; is t h e m e a n a n d &#963; 2 is t h e v ari a n c e a p pr o xi m ati o n.   are typically computationally expensive. Therefore, methods are sought to further increase the efficiency of the (BNNbased) optimization algorithm. As a solution, we treat this problem as a multi-fidelity problem where we have access to two different information sources for calculating circuit performance metrics. Considering that the schematic-level simulations are less accurate approximations of post-layout level simulations, we define these information sources as schematic-level simulations having the lower fidelity and postlayout simulations are the highest fidelity.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>2) Tr ust-R e gi o n S e ar c h E n gi n e: We f oll o w t h e tr ust r e gi o n a p pr o a c h i ntr o d u c e d i n [ 2 2] a n d c o n fi n e t h e c a n di d at e p oi nts l o c all y. T h e tr ust-r e gi o n assi g ns a l o c ali z e d s u bs et of t h e s e ar c h s p a c e a n d pr o c e e ds i n r o u n ds. We d e n ot e t h e tr ust r e gi o n b y &#8486; . I n e a c h r o u n d, a b at c h of q d esi g ns i n &#8486; ar e s el e ct e d b y t h e B N N al g orit h m a n d t h e n si m ul at e d i n p ar all el. N ot e t h at t his pr o c e d ur e is e asil y e xt e n d e d t o as y n c hr o n o us b at c h e v al u ati o ns, a n d w e a d a pt as y n c hr o n o us e v al u ati o n f or t h e m ulti-fi d elit y B N N al g orit h m ( will b e dis c uss e d), w h er e e v al u ati o n ti m es s h o w si g ni fi c a nt diff er e n c es. T h e tr ust-r e gi o n is c e nt er e d ar o u n d t h e b est d esi g n e x pl or e d, i. e., t h e d esi g n wit h mi ni m u m F o M w h er e t h e ti es ar e h a n dl e d a c c or di n g t o t h e d esi g n o bj e cti v e. T his a p pr o a c h miti g at es c o m m o n iss u es of B a y esi a n o pti mi z ati o n i n hi g h-di m e nsi o n al s etti n gs, w h er e p o p ul ar a c q uisiti o n f u n cti o ns f ail t o f o c us o n pr o misi n g r e gi o ns a n d s pr e a d o ut s a m pl es d u e t o l ar g e pr e di cti o n u n c ert ai nt y. T h o m ps o n S a m pli n g-b as e d E x pl o r ati o n : We e m pl o y T h o m ps o n s a m pli n g t o o bt ai n p erf or m a n c e a p pr o xi m ati o ns f or u nt est e d d esi g n c a n di d at es. T h o m ps o n s a m pli n g s c al es t o l ar g e b at c h es at l o w c o m p ut ati o n al c ost a n d h as s h o w n t o b e as eff e cti v e as t h e e x p e ct e d i m pr o v e m e nt a c q uisiti o n f u n cti o n [ 2 2]. F urt h er, t h e T h o m ps o n s a m pli n g n at ur all y e xt e n ds t o c o nstr ai n e d s etti n gs w hi c h is us u all y t h e c as e f or A M S a ut o m ati o n. T o s el e ct a p oi nt f or t h e n e xt b at</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>., f m (x i )) T f or all x i wit h 1 &#8804; i &#8804; r fr o m t h e r es p e cti v e p ost eri or distri b uti o ns o n t h e f u n cti o ns f</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>c e t h e l ast ti m e t h e tr ustr e gi o n is u p d at e d. A s u c c ess is w h e n t h e al g orit h m i m pr o v es t h e s ol uti o n q u alit y, a n d b y c o nstr u cti o n, t his p oi nt m ust b e i nsi d e t h e tr ust r e gi o n. We c all it a f ail ur e w h e n t h e l ast b at c h of si m ul at e d d esi g ns is w ors e t h a n t h e c urr e nt b est s ol uti o n. T h e c e nt er C of t h e tr ust r e gi o n is u p d at e d as f oll o ws. If t h er e e xist f e asi bl e d esi g ns, t h e o n e wit h t h e mi ni m u m o bj e cti v e is assi g n e d as t h e c e nt er. Ot h er wis e, t h e d esi g n wit h mi ni m u m F o M, i. e., mi ni m u m s c al e d c o nstr ai nt vi ol ati o n, is c h os e n as t h e c e nt er. T h er ef or e, t h e c e nt er of t h e tr ust-r e gi o n is u p d at e d e v er y ti m e t h e d esi g n p erf or m a n c e is i m pr o v e d. T h e si d e l e n gt h of t h e tr ust r e gi o n is u p d at e d as f oll o ws: if n s = &#964; s t h e n t h e si d e l e n gt h is</head><p>We modify the BNN architecture to capture two levels of fidelities (Figure <ref type="figure">2</ref>) at the output and propose a co-learning scheme similar to multi-task BNN learning <ref type="bibr">[23]</ref>. The multifidelity BNN model has two output nodes where &#981;(x) <ref type="bibr">[1]</ref> models the lower fidelity prediction, i.e., schematic-level performance prediction, and &#981;(x) <ref type="bibr">[2]</ref> models the high fidelity prediction, i.e., post-layout performance prediction. Under the assumption that we have access to two levels of information sources, we denote the new dataset by D = D 1 &#8746; D 2 , where</p><p>n=1 and the joint probability of the updated BNN model is given by:</p><p>where p(&#952;) = N (vec(&#952;) | 0, I) and N k is the number training points in given fidelity level k. The joint probability expression is a combination of the data sourced from both types of simulations; therefore, we utilize the full dataset to train multifidelity BNN. In this way, both fidelities are learned together, and the correlations between them are captured due to shared BNN parameters.</p><p>To handle the multi-fidelity problem, we adopt the following modifications to Algorithm 1: 1) We train multi-fidelity BNN models using the whole history of simulations, D.</p><p>2) The trust-region centering and length updates are based on the post-layout simulation results, i.e., highest-level fidelity results. 3) We determine the candidate selection by modifying the work of <ref type="bibr">[24]</ref> where they propose an upper-confidence-bound selection criteria for a single objective BO. We obtain Thompson sampling-based realizations for each fidelity, i.e., {</p><p>i (x), for i = 0, 1, . . . , m} where {1, 2} indicate the fidelity level (schematic-level simulations and postlayout level simulations) and then calculate the low fidelity and high fidelity FoM approximations, F oM ( f L (x)) and F oM ( f H (x)) using the corresponding realizations. The candidate selection is queried according to the following utility expression:</p><p>where &#8710; is the FoM difference between the samples with the best utility at each fidelity. In this step, we take a practical approach to convert two fidelities to each other by defining a reduction term and assign the conservative prediction as the utility value. Finally, the argmin selection is conducted on the candidate utility values to determine the next batch.</p><p>4) The current literature on multi-fidelity Bayesian optimization lacks in handling large number of constraints. Therefore, we randomly assign the fidelity (simulation type) for selected candidates and leave the fidelity selection as future work. Note that this action does not prevent us from studying the benefits of multi-fidelity handling of layout-aware sizing. However, we sacrifice potential cost-aware improvements through intelligent fidelity selection.</p><p>IV. EXPERIMENTS Experiment Setup and Algorithm Settings: We run our tests using 3 different AMS circuits designed with different technologies. A Two-Stage Folded Cascode Operational Transconductance Amplifier (OTA), and a Strong-Arm Latch Comparator are designed with TSMC 180nm process and used to test schematic-level sizing algorithms. Then, we demonstrate the results for layout-aware algorithms on a Two-Stage Miller OTA. This circuit is designed in TSMC 40nm technology since the layout generator used in this work, MAGICAL, is crafted for TSMC 40nm. The schematic designs for these circuits are included in Figure <ref type="figure">4</ref>.</p><p>We run experiments to study the effectiveness of both of the proposed algorithms. First, we test for the schematic-level sizing algorithm, which is given by Algorithm 1, and we refer to our Bayesian Neural Network Based Bayesian Optimization algorithm as "BNN-BO". Then, we run tests for our postlayout performance-based sizing algorithm. Since we utilize a multi-fidelity BNN for this task, we will refer to this algorithm as "MF-BNN-BO".</p><p>We implemented several state-of-the-art baseline algorithms to compare and quantify the quality of our proposed algorithms. We selected the baseline algorithms to cover the different categories of approaches. We list the compared baseline algorithms as follows: 1) A differential evolution global optimization algorithm (DE), 2) Bayesian Optimization with weighted expected improvement (BO) <ref type="bibr">[5]</ref>, and, 3) RLbased sizing algorithm, DNN-Opt <ref type="bibr">[15]</ref>. All algorithms are implemented using Python. We implemented DNN-Opt via PyTorch <ref type="bibr">[25]</ref>, Bayesian Optimization algorithm is implemented using BoTorch <ref type="bibr">[26]</ref> package and BNN-BO and MF-BNN-BO are implemented using PyTorch and Hamiltorch <ref type="bibr">[27]</ref> packages.</p><p>We configured BNN-BO and MF-BNN-BO to evaluate a batch of q = 8 designs in parallel. For fairness, DNN-Opt and BO are also configured to do parallel evaluations. Both our algorithms use 200 HMC samples to train BNN models. All BNN models are feedforward neural networks with 2 hidden layers and 100 nodes at each hidden layer. Trust-region is initiated with L = 0.8 and L min and L max are chosen to be 0. TABLE I: Schematic-Level Sizing Optimization Statistics Circuit Name Folded Cascode OTA Strong-Arm Latch Comparator Algorithm DE BO DNN-Opt BNN-BO DE BO DNN-Opt BNN-BO success rate 10/10 7/10 10/10 10/10 7/10 1/10 9/10 10/10 # of simulations 3200 340 151 82 2800 &gt;500 154 68 Min power (mW ) 0.75 0.88 0.64 0.60 3.02 3.67 2.45 2.5 Max power (mW ) 1.53 1.43 0.8 0.75 4.1 3.67 2.66 2.55 Mean pow. (mW ) 1.14 1.19 0.72 0.69 3.44 3.67 2.54 2.52 Modeling time (h) NA 30 0.6 1.5 NA 17 0.3 0.7 Simulation time (h) 54 2.7 2.7 2.7 72 3.6 3.6 3.6 Total runtime (h) 54 32.7 3.3 4.2 72 20.6 3.9 4.3 Fig. 7: Strong-Arm Latch Comparator Optimization circuits suggests that BNN-BO can achieve feasible solutions in all and it uses the smallest number of simulations to achieve this. Compared to Differential Evolution (DE), BNN-BO can find feasible solutions using up to 40x less number of simulations. Compared to the closest baseline algorithm, DNN-Opt, BNN-BO reduces the simulation time for finding similar results by up to 55%, proving its high efficiency. It also demonstrated in Table I that, on average, the final design proposed by BNN-BO draws up to 40% less power. The only disadvantage of BNN-BO to DNN-Opt is the modeling time as DNN-Opt maintains a single DNN model to approximate all performance metrics. Note that all reported times consider the full simulation budget (500 new samples). Therefore, although it takes longer time for BNN-BO to do a single iteration, the required real time for BNN-BO to find a feasible solution is still smaller than other approaches.</p><p>In addition to experiment statistics, we further include the FoM convergence curves of both tests in Figure <ref type="figure">6</ref> and Figure <ref type="figure">7</ref>. The y-axis in the graphs represents the total constraint violation; therefore, FoM=0 represents a feasible solution. We observe that, compared to DNN-Opt, BNN-BO has 65% and 33% smaller area under the curve for Folded Cascode OTA and SA Latch Comparator, respectively. Layout-Aware Design Automation: In order to demonstrate the importance of layout effects on the final performance, we perform experiments on a Miller OTA circuit designed in 40nm technology (Fig. 4). The optimization problem has 17 independent design variables and the optimization problem is defined as follows: minimize Power s.t. DC Gain &gt; 45 dB Settling Time &lt; 100 ns CMRR &gt; 55 dB Saturation Margins &gt; 50 mV PSRR &gt; 55 dB Unity Gain BW. &gt; 40 MHz Out. Swing &gt; 1 V RMS Noise &lt; 400 uV rms Static error&lt; %2 Phase Margin &gt; 60 deg.</p><p>Obtaining the post-layout performance of the Miller OTA is around 9 times more expensive than obtaining the schematiclevel performance. Therefore this experiment is to prove the efficiency by utilizing multiple information sources. We initialize all algorithms with 50 high-fidelity random samples, and MF-BNN-BO has additional 50 samples from low-fidelity source (schematic-level simulations). We demonstrate the FoM evolution for the rest of the optimization steps in Figure <ref type="figure">8</ref>. We observe that our Multi-Fidelity BNN algorithm provides even more efficiency compared to already efficient BNN-BO. Our analysis shows that the area under the curve is 45% smaller for MF-BNN-BO compared to BNN-BO. Further, we observe that BNN-BO's average best solution after 150 high-fidelity iterations is surpassed by MF-BNN-BO only using 84 simulations. This also implies close to 45% improved efficiency due to utilizing correlations between the schematic-level evaluations and post-layout level evaluations. Note that there is an equal number of schematic-level simulations while running MF-BNN-BO that are not reflected in Figure <ref type="figure">8</ref>. Considering these simulations, the time efficiency is slightly reduced to around 38%. This efficiency figure serves as a lower-bound since we leave the improvements on fidelity selection as a future work.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>V. CONCLUSION</head><p>In this work, we presented Bayesian Neural Networkbased solutions for schematic-level analog sizing automation and post-layout performance optimization. We targeted the scalability issue of the learning-based automation methods and provided a sample efficient optimization flow. We demonstrated the efficiency of the proposed approaches on academic benchmarks. Compared to the state-of-the-art, we improved the sizing automation efficiency by up to 45%. The Multi Fidelity BNN algorithm analysis proved that utilizing cheaper (schematic-level) simulations reduces the need for expensive (post-layout) simulations considerably, further boosting the efficiency.</p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0"><p>Authorized licensed use limited to: University of Texas at Austin. Downloaded on January 30,2025 at 20:23:44 UTC from IEEE Xplore. Restrictions apply.</p></note>
		</body>
		</text>
</TEI>
