<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Data-Driven Flow and Injection Estimation in PMU-Unobservable Transmission Systems</title></titleStmt>
			<publicationStmt>
				<publisher>IEEE</publisher>
				<date>07/16/2023</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10535021</idno>
					<idno type="doi">10.1109/PESGM52003.2023.10252719</idno>
					
					<author>Satyaprajna Sahoo</author><author>Anwarul Islam Sifat</author><author>Anamitra Pal</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[Fast and accurate knowledge of power flows and power injections is needed for a variety of applications in the electric grid. Phasor measurement units (PMUs) can be used to directly compute them at high speeds; however, a large number of PMUs will be needed for computing all the flows and injections. Similarly, if they are calculated from the outputs of a linearstate estimator, then their accuracy will deteriorate due to the quadratic relationship between voltage and power. This paperemploys machine learning to perform fast and accurate flow and injection estimation in power systems that are sparsely observed by PMUs. We train a deep neural network (DNN) to learn the mapping function between PMU measurements and power flows/injections. The relation between power flows and injections is incorporated into the DNN by adding a linear constraint to its loss function. The results obtained using the IEEE 118-bus system indicate that the proposed approach performs more accurate flow/injection estimation in severely unobservable power systems compared to other data-driven methods.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>I. INTRODUCTION</head><p>Knowledge of active power flows and power injections is fundamental for the reliable, resilient, and economic operation of the electric grid. Traditionally, their knowledge had been used to determine the cost for buying/selling energy as well as for performing vulnerability/security assessment <ref type="bibr">[1]</ref>, <ref type="bibr">[2]</ref>. More recently, with the growing frequency and intensity of extreme weather events, such as wildfires, as well as increasing penetration of renewable energy resources, a need has been felt for high-speed tracking of the power flowing through critical equipment, such as transmission lines and transformers <ref type="bibr">[3]</ref>, <ref type="bibr">[4]</ref>. Power flows and injections can be directly computed at high speeds from the outputs of a phasor measurement unit (PMU). However, a large number of PMUs will be needed to estimate all of them. They can be calculated indirectly from the outputs of a linear state estimator. However, the quadratic relationship between voltage and power deteriorates the quality of flow/injection estimates considerably (see Table <ref type="table">I</ref> in Section II for a sample illustration).</p><p>This paper presents a physics-inspired machine learning (ML) formulation to perform data-driven power flow and injection estimation in PMU-unobservable power systems. We focus on the transmission system since most of them already have some PMUs installed. However, the proposed methodology is generic enough that it can be applied to distribution systems as well.</p><p>Literature Review: In <ref type="bibr">[5]</ref>, a minimum variance unbiased estimator was developed to calculate the power flows using This work was supported in part by the National Science Foundation (NSF) grant under Award ECCS-2132904. The authors are with the School of Electrical, Computer, and Energy Engineering (ECEE) at Arizona State University (ASU).</p><p>the state estimation approach. However, the analysis was restricted to DC power flows. A linearly constrained leastsquares optimization problem was formulated in <ref type="bibr">[6]</ref> to estimate nodal power injections that did not violate power flow constraints. However, no information was provided regarding how the formulation could perform high speed estimation in unobservable power systems. A methodology to learn the topology and estimate the injection statistics in distribution systems with unobservable nodes at PMU timescales was developed in <ref type="bibr">[7]</ref>. However, it required the unobservable nodes to be non-adjacent, which may not always be the case.</p><p>Main Contributions: This paper proposes the use of ML to simultaneously estimate all power flows and injections in a system that is sparsely observed by PMUs. The ML model is built using deep neural networks (DNNs). The flows and injections in a power system are related by the law of conservation of energy. This knowledge is embedded into the DNNs as a linear constraint; the resulting ML model is referred to as a physics-inspired constrained-DNN (PIC-DNN). The offline training is performed using slow timescale historical data obtained from the supervisory control and data acquisition (SCADA) system. The online implementation only uses PMU data, ensuring high speed of estimation. The superior performance of the proposed model over classical as well as other ML models is demonstrated for different numbers of PMUs placed in the IEEE 118-bus system.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>II. SOURCES OF ERROR IN POWER CALCULATED FROM STATE ESTIMATES</head><p>The most traditional way of determining electrical quantities in a power system is through state estimation. The widely used static state estimator provides an estimate of the voltage phasor (magnitude and angle) of every bus of the system. When a system is completely observed by PMUs, then by only employing PMU data one can perform linear state estimation (LSE); LSE is faster and more accurate than SCADA-based state estimation <ref type="bibr">[8]</ref>. However, there are two issues with computing power flows/injections from the outputs of LSE: (a) quadratic relationship between voltage and power, and (b) non-Gaussian noise in PMU measurements. These two issues are elaborated below.</p><p>Quadratic Relationship: Power flowing through a branch located between bus i and bus j, denoted by p ij , is related to the voltages of the two buses by the equation shown below:</p><p>In <ref type="bibr">(1)</ref>, v i and v j denote the voltage magnitudes of buses i and j, &#948; ij denotes the voltage angle difference between the two buses, and g ij and b ij are the real and imaginary components of the admittance matrix, respectively. Similarly, the power injection at bus i, denoted by p i , can be expressed as,</p><p>where br i denotes the total number of branches incident on bus i, and bu denotes the total number of buses in the system. It is clear from (1) and ( <ref type="formula">2</ref>) that power is proportional to the square of the voltage. Therefore, even a small error in the voltage estimates will get amplified when used for calculating the power flows and power injections.</p><p>Non-Gaussian Noise: LSE is most commonly performed using the least squares method. However, least squares is the solution to the maximum likelihood estimation problem under Gaussian noise environments <ref type="bibr">[9]</ref>. Recently, it has been demonstrated that PMU data has non-Gaussian noise <ref type="bibr">[10]</ref>. This, in combination with the previously-mentioned source of error, implies that an indirect estimation of power flows/injections (by finding the voltages first using state estimation) can result in lower accuracy. This implication is further reinforced through a numerical analysis performed on the IEEE 118-bus system. The results are provided in Table <ref type="table">I</ref> below.</p><p>Table <ref type="table">I</ref> shows the estimation performance (in terms of root mean square error) when the output of a linear state estimator is used to calculate the power flows and injections in the presence of no noise, 1% total vector error (TVE) Gaussian noise, and 1% TVE non-Gaussian noise. The non-Gaussian noise is described by a two-component Gaussian mixture model (GMM); its parameters are given in the last paragraph of Section IV-A. It is observed from the table that the estimation error increased by at least 60% when the distribution of the noise changed from Gaussian to non-Gaussian. Furthermore, a purely PMU-based LSE requires complete observability of the system by PMUs (for obtaining the results shown in Table <ref type="table">I</ref>, PMUs were placed at 32 optimal locations that completely observed the system <ref type="bibr">[11]</ref>). This requirement of complete observability is another concern of the LSE-based approach.</p><p>TABLE I LSE RESULTS FOR VOLTAGE AND POWER ESTIMATES IN IEEE 118-BUS SYSTEM No noise Gaussian noise (1% TVE) Non-Gaussian noise (1% TVE) Voltage magnitude (pu) 4.36 e-13 0.0036 0.0060 Voltage angle (degrees) 1.88 e-11 0.1764 0.2890 Power flow (MW) 1.46 e-10 1.3382 2.2273 Power injection (MW) 2.69 e-10 1.3613 2.2912</p><p>Note that a PMU measures voltage and current phasors at the location where it is placed (subject to its measurement channel limitations <ref type="bibr">[12]</ref>). This means that the outputs of a PMU can be used to directly estimate all the power flows (and subsequently the power injections using (2)). However, this translates to the well-known minimum vertex cover problem <ref type="bibr">[13]</ref>, which requires placing even more PMUs than those required for complete observability for LSE (which is the solution to the minimum dominating set problem). For example, PMUs must be placed at 61 locations in the IEEE 118-bus system to estimate all the power flows using only PMU data. In this paper, we employ ML to directly estimate all the power flows and injections at PMU timescales and with reasonable accuracy while placing significantly fewer PMUs.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>III. PHYSICS-INSPIRED ML FOR POWER ESTIMATION IN</head><p>PMU-UNOBSERVABLE SYSTEMS Data-driven approaches have been shown to attain considerable success for a variety of power system estimation problems <ref type="bibr">[14]</ref>. Some popular data-driven approaches include linear regression (LR), support vector regression (SVR), and DNN-based regression. As the linear relation between the input (PMU measurements) and output (power flows and power injections) for transmission systems that are sparsely observed by PMUs is not guaranteed, LR may not be a good fit for the problem considered in this paper. Similarly, the time complexity of SVR is quadratic w.r.t. the number of training samples <ref type="bibr">[15]</ref>, which makes it challenging to implement it in a large network with wide variations in features. Therefore, we employ DNNs to perform direct estimation of power flows and injections from PMU measurements.</p><p>In Section III-A, we show how DNN-based regression can be used to overcome the unobservability problem associated with purely PMU-based estimation. In Section III-B, we embed the physical law that relates flows and injections into the DNN framework to create the proposed PIC-DNN.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. DNN Regression to Overcome Unobservability</head><p>Recently, we have demonstrated the ability of DNNs to perform high-speed time-synchronized static state estimation in distribution systems that are sparingly observed by micro-PMUs <ref type="bibr">[16]</ref>, <ref type="bibr">[17]</ref>. We did this by using the slow timescale historical smart meter data to create a mapping function between the fast timescale micro-PMU measurements and the states. The mapping function was learned using a DNN because it has excellent approximation capabilities. Here, we leverage this concept to perform power flow and injection estimation from sparsely-placed PMUs in transmission systems.</p><p>We start by making two assumptions: (1) we have access to historical SCADA data and system information (e.g., topology) for a sufficient time-period (say, a few months), and (2) PMUs are already placed on select buses of the system (say, highest voltage buses). Using historical SCADA data and system information, we solve the power flow problem to produce voltage and current phasor measurements corresponding to the locations where PMUs are placed. The power flow problem also generates the flow and injection information that matches the PMU measurements. Then, we train a DNN whose inputs are the PMU measurements and outputs are the flows and injections. Finally, during online implementation, streaming data from a select few PMUs is fed into the trained DNN to estimate all the flows and injections at PMU timescales. In this way, a DNN can perform purely PMU-based estimation without needing PMUs to completely observe the system. This DNN model is henceforth referred to as a Direct DNN.</p><p>978-1-6654-6441-3/23/$31.00 &#169;2023 IEEE Authorized licensed use limited to: Arizona State University. Downloaded on August 22,2024 at 21:24:23 UTC from IEEE Xplore. Restrictions apply.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Incorporating Physics-based Constraints into DNN</head><p>The Direct DNN model developed in Section III-A can estimate the power flows and power injections independently from the phasor measurements coming from a select few PMUs. However, as seen in ( <ref type="formula">2</ref>), the flows and injections are related by the law of conservation of energy. To account for this physical law, we modify the DNN architecture by appending a linear constraint to its loss function; the resulting model, called PIC-DNN, is described below. In the following, small-case letters indicate vectors, upper-case letters indicate matrices, and the symbol&#710;indicates estimates.</p><p>For an input, z, and output, y, a regular DNN with weights, W , and biases, b, tries to find a function, F , that minimizes the difference between y and &#375;, where &#375; = F (z, W, b). The Direct DNN model described in Section III-A performs this minimization using a mean squared error (MSE) loss function, where z = [V ; I] n&#215;1 and y = [P f ; P in ] m&#215;1 . To incorporate the law of conservation of energy into the Direct DNN model, we start by defining the following loss function:</p><p>In (3), A is a sparse matrix (a ij &#8712; {0, 1} (m-m b )&#215;m b ) which denotes the power flows that must be added to calculate the power injection of a bus, and m b and m -m b denote the number of branch power flows and bus power injections in the system, respectively. Now, power injections can be removed from the output features of the DNN and calculated from the power flow estimates in the following way:</p><p>The conversion matrix, B, is the vertical concatenation of an appropriately sized identity matrix with A, and enables us to get both flow and injection estimates. Thus, by modifying the loss function of the conventional DNN in the manner shown in (3) and ( <ref type="formula">4</ref>), we are able to simultaneously minimize the error in the estimates of power flows and power injections while also accounting for the law that relates the two.</p><p>There are several ways to implement the above-mentioned constrained formulation inside a DNN model. A simple strategy would be to add a convex optimization layer to the model output. However, the inclusion of such a layer will result in solving two different optimization problems simultaneously during training, which can become computationally burdensome. A more intuitive strategy is to add a static layer after the DNN output layer that calculates the combined &#375; from the estimated Pf and backpropagates the net error. The updated loss function is given by,</p><p>The static weight, (B T B) -1 B T , is the Moore-Penrose inverse of the conversion matrix, B. This weight is not necessary for optimization as it is static, but to ensure backpropagation errors are dimensionally consistent with the output features. Using (5), we are not only able to reduce the number of DNN output variables to just the power flows but also implicitly include the linear constraints into the loss function without needing a separate convex optimization layer.</p><p>Next, the training dataset, y train , which is only composed of the branch power flows, is split into multiple bins in accordance with the variations that occur in the flows. This improves the accuracy of the DNN because it now has to estimate features that have similar variations. The selection of the optimal number of bins, n bin , involves a trade-off. Increasing the number of bins homogenizes the output data, thereby improving the accuracy of the model. However, unrestricted increment in bin count can increase computational burden while decreasing performance utility.</p><p>In summary, the two ways in which we modify the Direct DNN model to create the PIC-DNN model are: (1) modifying the loss function to include a linear constraint, and (2) binning y train in accordance with the observed variations in the power flows in the training database. The overall implementation of PIC-DNN is shown in Algorithm 1. The results obtained when the PIC-DNN as well as other ML models are used to estimate power flows and injections in the IEEE 118-bus system is provided in the next section. for i=1 to epoch do</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>5:</head><p>for j=1 to n bin do 6:</p><p>end for 8:</p><p>Pf &#8592; [p 1 , p2 , ..., pn bin ] 9:</p><p>end for 13: end procedure</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>IV. RESULTS</head><p>The proposed PIC-DNN model for power flow and injection estimation was applied to the IEEE 118-bus system. This system has 99 loads, 54 generators, 11 high voltage (HV) buses, and 186 branches. PMUs were assumed to be placed by default on the 11 HV buses such that all the branches coming out of these buses was directly monitored by them. To obtain branch power flows and bus power injections for this system, we solved an AC optimal power flow (ACOPF) using MATPOWER <ref type="bibr">[18]</ref>. The process is similar to the approach in <ref type="bibr">[17]</ref>, where a distribution kernel was fit over historical slow timescale data, and then multiple samples were drawn</p><p>978-1-6654-6441-3/23/$31.00 &#169;2023 IEEE Authorized licensed use limited to: Arizona State University. Downloaded on August 22,2024 at 21:24:23 UTC from IEEE Xplore. Restrictions apply.</p><p>from it to generate realistic load variation data. For the given application, the source of slow timescale data was the SCADA system. However, SCADA data is not available for the IEEE 118-bus system. Therefore, we superimposed the variations of similar loads found in the publicly available 2000-bus Synthetic Texas system <ref type="bibr">[19]</ref> onto loads of the IEEE 118bus system. Doing so ensured that our load variations were realistic. Afterward, the outputs of the ACOPF were used to train the ML models. The training and validation database had a size of 20, 000 &#215; n and 2, 000 &#215; n, respectively, while the test database had a size of 6, 000 &#215; n, where n is the number of phasor measurements.</p><p>The performance of five ML models was investigated in this study. The first two models are LR and SVR, with the latter implemented using a radial basis function kernel. The third model was an Indirect DNN. This model first performs state estimation from sparsely-placed PMUs using the methodology proposed in <ref type="bibr">[17]</ref>, and then computes the flows and injections from the outputs of the state estimator. The fourth model was the Direct DNN described in Section III-A, while the last model was the PIC-DNN developed in Section III-B. The hyperparameters used for the three DNN models are given in Table <ref type="table">II</ref>. The loss function for the Direct and Indirect DNN models is the MSE, while for the PIC-DNN model we used <ref type="bibr">(5)</ref>, with n bin = 5. Note that the number of neurons in each hidden layer is kept flexible (using the scaling factor, &#951;) to account for the increase in number of input features due to increase in number of PMUs (see Section IV-B).</p><p>TABLE II HYPERPARAMETERS OF DNNS CREATED IN THIS STUDY Parameter Value Layers 3 Neurons in each hidden layer Number of input features &#215;&#951; Learning rate 1e-3 Number of epochs 200 Activation function Rectified Linear Unit Optimizer Adam Batch size 64</p><p>The simulations done to compare the performance of the five ML models were conducted on a computer with an Intel Core (TM) i7-11800H CPU @2.3GHz with 16GB of RAM and an RTX 3070Ti GPU. The following two case-studies were designed. The first case-study is a comparative analysis between PIC-DNN and other ML models when PMUs are placed on only the 11 HV buses of the test system. In the second case-study, the performance of all five ML models is investigated as more PMUs are added to the system.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. Case-Study I: Performance Comparison with PMUs placed only on HV buses</head><p>In this case-study, the inputs to the ML models are the voltage and currents obtained from PMUs placed on the 11 HV buses of the IEEE 118-bus system. Therefore, the number of phasor measurements is 41 (= 11 voltage phasors + 30 current phasors), i.e., n = 82, since one phasor has two components, while the number of active power flows and injections are 490 (= 372 power flows + 118 power injections), i.e., m = 490.</p><p>To ensure the consistency of estimates obtained from the five ML models, the analysis was repeated 100 times with different random subsets of the test dataset.</p><p>The statistics of the error metric (RMSE) for the five ML models are recorded in Table <ref type="table">III</ref>. The results show that PIC-DNN does better than the other ML models in both the mean value of RMSE across all the flows and injections as well as the variation of RMSE over the 100 trials. For example, the proposed PIC-DNN outperforms others in terms of the mean by at least 15% and in terms of standard deviation by at least 40%; this denotes a significant improvement in both accuracy and consistency of power flow and injection estimates. The poor performance of the Indirect DNN model can be attributed to the quadratic relationship between voltage and power, as highlighted in Section II. The relatively poor performance of SVR is due to the input dataset being restricted to 5, 000 samples. This was done to compensate for its (longer) training time, indicating the handicap of using SVR for large datasets as its scalability is a concern <ref type="bibr">[15]</ref>.</p><p>Lastly, note that the results shown in Table <ref type="table">III</ref> (as well as the last column of Table <ref type="table">I</ref> and Fig. <ref type="figure">1</ref>) were obtained when a two-component GMM noise was added to the voltage and current phasors. The mean, standard deviation, and weights of this noise for magnitudes and angles are (-0.4%, 0.6%) and (-0.2&#176;, 0.3&#176;), (0.25%, 0.25%) and (0.12&#176;, 0.12&#176;), and (0.4, 0.6), respectively. These values corresponded to a 1% TVE error for voltage and current phasors, as mandated in the IEEE/IEC Standard for PMUs <ref type="bibr">[20]</ref>.</p><p>TABLE III PERFORMANCE COMPARISON OF ML MODELS FOR POWER ESTIMATION IN TERMS OF THEIR ROOT MEAN SQUARE ERROR (RMSE) ML Model Mean of RMSE Standard deviation of RMSE LR 5.80 0.020 SVR 7.75 0.120 Indirect DNN 8.37 0.020 Direct DNN 5.91 0.024 PIC-DNN 4.92 0.012</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Case-Study II: Impact of Increase in Number of PMUs</head><p>In this case-study, the number of locations where PMUs must be placed is increased one at a time for each of the five ML models to determine how their performance changes with an increase in the number of PMUs in the IEEE 118-bus system. This is a very practical scenario since, with additional investment in grid modernization, power utilities will add more PMUs not only in the lower voltages of the transmission system but also in the distribution system. Therefore, this casestudy indicates the improvements in ML-based power flow and injection estimation with an increase in PMU coverage.</p><p>The results obtained as the number of locations where PMUs must be placed increased from 11 to 32 is shown in Fig. <ref type="figure">1</ref>. To ensure fairness in comparison, the next optimal bus location for placing PMUs is determined independently for each ML model. This means that the horizontal axis of Fig. <ref type="figure">1</ref> denotes the total number of buses where PMUs are placed; however, the locations of PMU buses are different for each model since this ensures best performance for the different 978-1-6654-6441-3/23/$31.00 &#169;2023 IEEE Authorized licensed use limited to: Arizona State University. Downloaded on August 22,2024 at 21:24:23 UTC from IEEE Xplore. Restrictions apply.</p><p>models. We stopped at 32 because by placing PMUs on that many buses, we could directly compare the performance of the different ML models with the conventional LSE results (shown in Table <ref type="table">I</ref>).</p><p>From Fig. <ref type="figure">1</ref>, it can be realized that the proposed PIC-DNN performs better than the other four ML models for higher degrees of unobservability (fewer PMUs). With an increase in the number of PMUs, the performance of LR becomes comparable with the proposed approach (and even better than PIC-DNN for PMUs placed at 30 or more bus locations). This is expected because linear models can successfully express the relations between the inputs and the outputs as the number of sensors increase. Lastly, the superiority of data-driven approaches is realized from the fact that when the number of bus locations where PMUs are placed is 32, both LR and PIC-DNN outperform LSE results with two-component GMM measurement noise by 35% and 28%, respectively. This paper proposes a physics-inspired ML approach that uses DNNs to quickly and accurately estimate all the power flows and injections directly from PMUs placed in the transmission system. The proposed PIC-DNN model not only performs consistent estimation when there are very few PMUs in the system but also ensures that the law of conservation of energy is always satisfied. The former is established by intelligently combining inferences drawn from historical SCADA data with real-time PMU data, while the latter is secured by adding a linear constraint to the loss function of the DNN. The results indicate that by placing PMUs on only 11 buses of the IEEE 118-bus system, PIC-DNN can estimate the flows and injections to within 5 MW of the actual value, even in the presence of non-Gaussian noise in the PMU measurements. When the PMU locations increase to 32, the estimation error of the proposed approach becomes lower than 2 MW.</p><p>The focus of this paper has been on the transmission system. In the future, we plan to extend our analysis to distribution systems where PMUs are gradually being added and for which fast detection of reverse power flows is crucial. We will also combine the outcomes of this research to perform a better security assessment of the power system (e.g., by extending the research of <ref type="bibr">[21]</ref>). The DNN models analyzed here had fully-connected feed-forward architectures. We are currently investigating the ability of graph neural networks to better incorporate the physical properties of the power system into ML training and execution (e.g., for ensuring robustness during topology changes).</p></div></body>
		</text>
</TEI>
