<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Stealthy DGoS Attack under Passive and Active Measurements</title></titleStmt>
			<publicationStmt>
				<publisher></publisher>
				<date>12/07/2020</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10195980</idno>
					<idno type="doi"></idno>
					<title level='j'>IEEE Global Communications Conference</title>
<idno>2576-6813</idno>
<biblScope unit="volume"></biblScope>
<biblScope unit="issue"></biblScope>					

					<author>Cho-Chun Chiu</author><author>Ting He</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[As a tool to infer the internal state of a network that cannot be measured directly (e.g., the Internet and all-optical networks), network tomography has been extensively studied under the assumption that the measurements truthfully reflect the end-to-end performance of measurement paths, which makes the resulting solutions vulnerable to manipulated measurements. In this work, we investigate the impact of manipulated measurements via a recently proposed attack model called the \emph{stealthy DeGrading of Service (DGoS) attack}, which aims at maximally degrading path performances without exposing the manipulated links to network tomography. While existing studies on this attack assume that network tomography only measures the paths actively used for data transfer (by passively recording the performance of data packets), our model allows network tomography to measure a larger set of paths, e.g., by sending probes on some paths not carrying data flows. By developing and analyzing the optimal attack strategy, we quantify the maximum damage of such an attack and shed light on possible defenses.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>I. INTRODUCTION</head><p>Network tomography <ref type="bibr">[1]</ref> is a family of inference-based techniques to monitor the internal state (e.g., link delays or loss rates) of a network from external measurements. The need of such techniques arises in many networks where the internal network elements are accessible in the data plane but inaccessible in the control plane, e.g., the public Internet and all-optical networks.</p><p>Theoretically, network tomography works by inverting a given observation model that captures the relationship between the unknown link states and the observed path states, where specific solutions differ in the observation models they assume, e.g., a linear model for inferring additive link metrics such as delays <ref type="bibr">[2]</ref>, <ref type="bibr">[3]</ref>, <ref type="bibr">[4]</ref>, a Boolean model for localizing failures <ref type="bibr">[5]</ref>, <ref type="bibr">[6]</ref>, or various probabilistic models for accommodating performance fluctuations (see <ref type="bibr">[1]</ref> and references therein). However, most of the existing works assumed that the measurements truthfully reflect the performance of measurement paths, leaving open what will happen when measurements can be manipulated by an attacker.</p><p>Manipulated measurements fundamentally change the problem of network tomography, because instead of only changing This work was supported by the National Science Foundation under award CCF-1813219. the link states (e.g., by delaying all the packets traversing a link), the attacker may manipulate different packets traversing the same link differently (e.g., by adding delays for packets with one source-destination pair but not adding delays for packets with another source-destination pair), thus changing the observation model. For example, a link showing two different behaviors for two groups of flows is effectively two different links, each traversed by one group of flows. For linear observation models, recent studies <ref type="bibr">[7]</ref>, <ref type="bibr">[8]</ref> revealed novel attacks that can substantially degrade path performances while misleading network tomography to believe that the manipulated links perform well. However, these studies implicitly assumed that network tomography only collects passive measurements, i.e., the performances of data packets, and thus only measures the paths used for data transfer.</p><p>In practice, however, network tomography can monitor a larger set of paths via active measurements obtained from probes. Intuitively, augmenting passive measurements with active measurements exposes the performances of a larger set of paths and thus should help defending against attacks. In this work, we harden this intuition by quantifying the damage a stealthy attacker can inflict on a network monitored by network tomography with both passive and active measurements.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. Related Work</head><p>Network tomography is a rich family of network monitoring techniques that infer network internal characteristics from external measurements <ref type="bibr">[1]</ref>, <ref type="bibr">[9]</ref>. Early works focused on besteffort solutions, which tried to find the most likely network state from given measurements, obtained by unicast <ref type="bibr">[10]</ref>, multicast <ref type="bibr">[11]</ref>, and their variations (e.g., back-to-back unicast <ref type="bibr">[12]</ref>). After observing that an arbitrary set of measurements is frequently insufficient for identifying all the link metrics <ref type="bibr">[13]</ref>, <ref type="bibr">[14]</ref>, later works aimed at either reducing the ambiguity with given measurements (e.g., <ref type="bibr">[10]</ref>, <ref type="bibr">[13]</ref>), or ensuring identifiability by carefully designing the monitor locations and the measurement paths (e.g., <ref type="bibr">[2]</ref>, <ref type="bibr">[3]</ref>, <ref type="bibr">[4]</ref>). All these works assume a benign setting, where the links behave consistently and the measurements are truthful.</p><p>Very few works have considered network tomography in an adversarial setting. In <ref type="bibr">[7]</ref>, optimizations were formulated to design the manipulations at compromised nodes in order to cause the maximum degradation while scapegoating certain benign links as the cause of poor performance; however, the set of compromised nodes is not optimized. In <ref type="bibr">[8]</ref>, a similar but more sophisticated attack model, called the stealthy DeGrading of Service (DGoS) attack, was proposed, where the attacker jointly optimizes where to attack (in terms of compromised links) and how to attack (in terms of the manipulation at each path traversing at least one compromised link). However, both works assumed that only the data paths are monitored by network tomography, and thus the manipulation of any measurement path will affect the end user's performance.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Summary of Contributions</head><p>Our goal is to quantify the impact of DGoS attack in networks monitored by network tomography based on both passive and active measurements.</p><p>1) We extend the attack model in <ref type="bibr">[8]</ref> to include both passive measurement paths (for data packets) and active measurement paths (for probes), where only the performance degradation on passive measurement paths counts towards the damage caused by an attack.</p><p>2) We derive sufficient/necessary conditions for an attack strategy to be optimal under the above attack model. Based on these conditions, we establish the hardness of designing the optimal attack strategy and develop efficient approximations.</p><p>3) We evaluate the proposed attack strategies on real Internet topologies. Our evaluations show that: (i) the proposed strategies achieve more severe performance degradation than intuitive alternatives and thus better reveal the potential damage of DGoS attack, (ii) even a few compromised links can cause significant damage to end-to-end communications, and (iii) adding active measurements provides little protection if these measurements are only between the communicating terminals.</p><p>Roadmap. Section II formulates the generalized DGoS attack. Section III presents the optimality conditions and the associated algorithms for attack design. Section IV evaluates the proposed algorithms. Finally, Section V concludes the paper.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>II. PROBLEM FORMULATION A. Network Model</head><p>We model the network as an undirected graph G = (N, L), where N is the set of nodes and L the set of links. Each link l j &#8712; L is associated with an unknown metric x j that describes its performance (the smaller, the better). We assume that these link metrics are additive, i.e., a path metric equals the sum of its link metrics. This is a canonical assumption satisfied by several important performance metrics including delays, jitters, log-success rates, and their statistics.</p><p>We assume that this network is monitored by a tomographybased detection system that measures the end-to-end metrics on a set P of paths to detect anomalies on link metrics. Let R = (r ij ) pi&#8712;P,lj &#8712;L be the matrix representation of P , called the measurement matrix, where r ij &#8712; {0, 1} indicates if path p i traverses link l j . Let r i = (r ij ) lj &#8712;L be the i-th row in R. Given the measured path metrics y = (y i ) pi&#8712;P , network tomography detects link anomalies by finding a solution x to R x = y and then comparing each inferred link metric x j with the maximum normal delay &#964; : l j is considered "normal" if x j &#8804; &#964; and "abnormal" otherwise. To focus on anomalies caused by the attack, we assume that the before-attack link metrics are all normal, i.e., x j &#8804; &#964; (&#8704;l j &#8712; L). Let &#964; max denote the maximum possible link metric, which can be infinity; assume that &#964; &#8804; &#964; max .</p><p>We note that the solution to R x = y may not be unique as R may not have a full column rank <ref type="bibr">[13]</ref>, <ref type="bibr">[14]</ref>. In this case, we consider a powerful network tomography solver that can compute all the possible solutions to the link metrics.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Attack Model</head><p>Suppose that an attacker wants to degrade the performance of a subset of paths P d &#8838; P . Paths in P \ P d are monitored by network tomography but not carrying data flows of interest. For example, paths in P \ P d may be only used by probes or non-performance-sensitive packets.</p><p>The attack is mounted by first controlling a subset L m &#8838; L of links and then modifying the path metrics by z = (z i ) pi&#8712;P through these links. Let c j (l j &#8712; L) denote the cost of compromising link l j , and k denote the budget of the attacker. We call L m the compromised links and L n := L \ L m the uncompromised links. We call the paths P m &#8838; P traversing at least one link in L m the compromised paths, and the rest P n := P \ P m the uncompromised paths. To ensure that the attack is feasible, we adopt the following assumptions from <ref type="bibr">[7]</ref>, <ref type="bibr">[8]</ref>:</p><p>1) Only the metrics of compromised paths can be manipulated, i.e., z i = 0 for any p i &#8712; P n .</p><p>2) The manipulation can only degrade (not improve) path performance, i.e., z i &#8805; 0 for any p i &#8712; P m .</p><p>Moreover, the attacker wants to stay stealthy to the detection system by ensuring that (i) the network tomography problem remains feasible under the manipulation, i.e., R x = Rx + z is feasible, and (ii) there exists a feasible solution x according to which all the compromised links appear normal.</p><p>Using a change of variable z = R( xx), we formulate the problem of optimal attack design as follows:</p><p>In words, (1) designs "where to attack" (represented by L m ) and "how to attack" (represented by x) to maximize the total degradation on the paths of interest (1a), subject to feasibility (1b), stealthiness (1c)(1d), and budget constraints (1e). The above formulation generalizes the stealthy DGoS attack proposed in <ref type="bibr">[8, (1)</ref>] in that: (i) only degradation on the paths in P d is included in the objective, which allows us to model network tomography based on both passive and active measurements, and (ii) a budget constraint (1e) is added to capture the resource constraint faced by a realistic attacker. As shown later, these differences lead to subtle but critical changes in the solution.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>III. OPTIMAL ATTACK STRATEGY</head><p>Given the set of compromised links L m , ( <ref type="formula">1</ref>) is a linear program (LP) in x that can be solved in polynomial time by standard LP solvers. Meanwhile, optimizing L m is a combinatorial optimization problem, with an objective F (L m ) that denotes the optimal value of (1a) under a given L m . The main challenge is that the objective function F (L m ) is not an explicit function of the decision variable L m . Below, we propose two approaches to turn F (L m ) into an explicit function of L m , which then lead to efficient algorithms.</p><p>A. The Case of k = &#8734; First, consider the case that the attacker has an unlimited budget, i.e. the constraint (1e) is removed.</p><p>1) Property of the Optimal L m : In the unbudgeted case, we establish the sufficient and the necessary conditions for a given L m to be optimal for (1). To this end, we introduce the following definitions.</p><p>Definition 1. Given P and P d , we define:</p><p>1) the traversal number w j := pi&#8712;P d r ij for link l j as the number of paths in P d that traverse l j , 2) T (L ) := lj &#8712;L w j as the total traversal number of a set of links L , 3) a set of links L as a cut of a set of paths P if every p i &#8712; P traverses at least one link in L , 4) C P as the collection of all the cuts of P , and 5) C * P as the collection of all the cuts of P with the minimal total traversal number, i.e., C *</p><p>Based on these definitions, we can state the optimality conditions as follows.</p><p>Theorem III.1. A set of compromised links L m is optimal if it is a cut of P with the minimal total traversal number, i.e., L m &#8712; C * P . Proof. see <ref type="bibr">[15]</ref>.</p><p>Remark: Theorem III.1 generalizes [8, Theorem III.1], which states that in the case of P d = P , the minimal traversal cut of P achieves optimality, where the traversal number of a link is defined as the total number of paths in P that traverse it. Theorem III.1 extends this statement to the case of P d &#8838; P by redefining the traversal number for a link to only count the paths in P d that traverse this link. While Theorem III.1 gives a sufficient condition to achieve optimality, it does not rule out other possibilities. We show that L m &#8712; C * P is not necessary by a simple example. In the example shown in Fig. <ref type="figure">1</ref>, suppose that n k=2 x k &#8805; &#964; max . It is easy to see that the optimal solution can be</p><p>The optimal solution {l 1 } / &#8712; C * P shows that L m &#8712; C * P is not a necessary condition. Nevertheless, we will show that forming a cut of P d is necessary under mild conditions.</p><p>Proof. See <ref type="bibr">[15]</ref>.</p><p>Theorems III.1 and III.2 imply the following condition.</p><p>Corollary III.3. If P d = P and &#964; &gt; x j (&#8704;l j &#8712; L), then a set of compromised links L m is optimal if and only if L m &#8712; C * P . Proof. See <ref type="bibr">[15]</ref>.</p><p>2) Hardness and Algorithm Design: Theorem III.1 implies that finding a minimum-traversal cut L m &#8712; C * P will give an optimal solution to (1). This reduces (1) to the following combinatorial optimization problem. Definition 2. Given a set of paths P and a subset P d &#8838; P , the generalized adversarial link selection (GALS) problem aims at finding the cut of P with the minimum total traversal number by P d : min</p><p>GALS is similar to the adversarial link selection (ALS) problem in <ref type="bibr">[8]</ref>, except that the traversal number w j only counts the traversals by paths in P d . Nevertheless, given P d , the traversal number of each link is a constant, and thus the solutions for ALS and GALS are the same.</p><p>Specifically, since ALS is NP-hard <ref type="bibr">[8]</ref>, GALS is also NPhard. Moreover, similarly to the reduction of ALS to the weighted set cover (WSC) problem <ref type="bibr">[8]</ref>, GALS can also be Algorithm 1: Greedy GALS reduced to WSC, and can thus leverage existing algorithms designed for WSC. One such algorithm is the greedy algorithm, shown in Algorithm 1. The algorithm iterates until all the paths are compromised (line 4), where in each iteration, it picks a link with the smallest cost-value ratio (line 5) and adds it to the set of compromised links (lines 6-7). Here, we define the cost-value ratio of link l j by w j /|P j \ P m |, where P j is the set of paths traversing link l j . It is known <ref type="bibr">[16]</ref> that this greedy algorithm has the best approximation factor for WSC, which is &#920;(log |P |) in our case.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. The Case of k &lt; &#8734;</head><p>In the general case, the attacker may not have sufficient budget to compromise the minimum-traversal cut, and thus the optimal strategy needs to be adapted.</p><p>1) Property of the Asymptotically Optimal L m : For a general L m , it is difficult to write the optimal value of (1a) wrt x as an explicit function of L m . Nevertheless, we find the following approximation to be asymptotically accurate. Definition 3. Given a set of paths P , a subset P d &#8838; P , and the cost c j for each link l j , the generalized constrained adversarial link selection (GCALS) problem aims at:</p><p>where L n := L n \ &#8746; p&#8712;Pn p is the set of uncompromised links that are only traversed by compromised paths.</p><p>We show that when &#964; max is large, GCALS is asymptotically equivalent to the original optimization <ref type="bibr">(1)</ref>.</p><p>m is optimal for (1) if and only if L * m is an optimal solution to GCALS. Proof. See <ref type="bibr">[15]</ref>.</p><p>2) Hardness and Algorithm Design: First, we will show that GCALS problem is NP-hard.</p><p>Theorem III.5. The GCALS problem (3) is NP-hard. Proof. See <ref type="bibr">[15]</ref>.</p><p>Next, we will develop a solution by formulating this problem as an integer linear programming (ILP) problem (w j is defined as in Definition 1, the other parameters defined as in ( <ref type="formula">1</ref>)):</p><p>Lemma III.6. The optimization ( <ref type="formula">4</ref>) is equivalent to the optimization (3), where l j &#8712; L m if and only if &#945; j = 1.</p><p>Proof. see <ref type="bibr">[15]</ref>.</p><p>The ILP formulation (4) allows us to leverage techniques for solving ILP to solve the GCALS problem <ref type="bibr">(3)</ref>. In particular, one commonly-used approach is to relax the ILP into an LP by relaxing the integer constraint (4g) into &#945; j , &#946; j , &#947; j &#8712; [0, 1]. After solving this LP relaxation for a fractional solution, we can use various rounding techniques to convert it into a feasible solution to the original problem. A rounding scheme we find to be particularly effective is as follows. For a link l j , we define the value-cost ratio as</p><p>, where P j is the set of paths traversing link l j and &#945; j is the fractional solution of &#945; j from the LP relaxation. We then iteratively select links into L m until reaching the budget, where in each iteration, we select the link l j with the largest value-cost ratio. We refer to this algorithm as "LP relaxation with rounding (LP-R)", for which the pseudo code is given in Algorithm 2.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>IV. PERFORMANCE EVALUATION</head><p>In order to understand the potential damage of the generalized DGoS attack, we evaluate the proposed algorithms as well as benchmarks on real network topologies.</p><p>A. Experiment Setup 1) Network topology: We use real network topologies from public datasets, whose parameters are shown in the TABLE I. The first four topologies are Point of Presence (PoP)-level topologies from the Internet Topology Zoo <ref type="bibr">[17]</ref>, and the last two topologies are router-level topologies from the CAIDA project <ref type="bibr">[18]</ref>.</p><p>2) Paths: For each topology, we select a given number of terminals from low-degree nodes (degree &#8804; 2), and compute P as the shortest paths (in hop count) between terminals, with ties broken arbitrarily. We then select a subset of paths in P as P d uniformly at random.</p><p>3) Other parameters: Before the attack, each link has a delay sampled from the interval of [0, 20) (ms) uniformly at random. The cost 1 of compromising each link is drawn uniformly at random from the interval of [0, 2). A link is considered "normal" if its delay is within 150 ms, i.e., &#964; = 150.</p><p>The maximum delay at a link is 2000 ms, i.e., &#964; max = 2000.</p><p>4) Benchmarks: We compare the proposed algorithms, Greedy GALS (Algorithm 1) and LP-R (Algorithm 2), with three heuristics and an optimal solution: i) "Random selection" ('random'): This algorithm selects links uniformly at random within the given budget. ii) "Top traversal" ('top traversal'): Based on the intuition that compromising the most traversed links will allow the attacker to control the most paths, this algorithm sorts the links by their traversal numbers in descending order, and then selects links in this order under the budget. iii) "LP relaxation with Randomized Rounding" ('LP-RR'):</p><p>To benchmark the proposed rounding scheme in Algorithm 2, we evaluate a randomized rounding scheme, where the fractional solution (&#945; j ) lj &#8712;L to the LP relaxation of ( <ref type="formula">4</ref>) is used as probabilities for selecting links. iv) "ILP" ('ILP'): This solution directly solves the ILP (4) by a commercial optimizer called Gurobi, which performs an exhaustive search in the worst case.</p><p>Under each selection of the compromised links L m , we solve the optimization (1) in x to compute the total performance degradation under the optimal manipulations (measured by the total delay injected by the attacker over all the paths in P d ). 1 The cost is relative to the budget and is thus unitless. We set the average cost to 1 so that a budget of k will allow the attacker to compromise k randomly selected links on the average, while the optimal strategy may compromise more or fewer. 2 For Bics, these are all the nodes with degree &#8804; 2; for the other networks, these are all the nodes with degree one.  </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Experiment Results</head><p>We evaluate the average performance degradation over the paths in P d (plus/minus one standard deviation) under each attack design, computed over 100 Monte Carlo runs.</p><p>First, we increase the budget k as in Table <ref type="table">II</ref> to evaluate the impact of a stronger attacker. Fig. <ref type="figure">2</ref> shows that the proposed algorithm for the budgeted case (LP-R) achieves much bigger damage than the benchmark heuristics for a wide range of k, whereas the proposed algorithm for the unbudgeted case (Greedy GALS) is only effective when k is large. Moreover,   <ref type="table">III</ref> to evaluate the impact of monitoring more paths by network tomography. Fig. <ref type="figure">3</ref> shows that the damage achieved by all the attack strategies, especially the optimal strategy (ILP), decreases very slowly in |P |.</p><p>Discussion: The above results provide a number of insights for defending against DGoS attacks: (i) it is important to defend against intelligent attack strategies as they can achieve substantially more damage than simplistic ones, (ii) it is important to guard against compromised network elements (nodes/links) from the bottom up as even a few compromised elements can cause a big damage, and (iii) simply monitoring more paths is not sufficient to make network tomography robust against DGoS attacks. For (iii), however, the added paths in our experiments are only between the terminals, and it remains open how much protection can be achieved by carefully designing the paths, which is left for future work.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>V. CONCLUSION</head><p>We have quantified the impact of DGoS attack by formulating and computing the maximum damage that an attacker can inflict on end-to-end communications through compromised links, without exposing these links to a tomography-based detection system based on both passive and active measurements. By establishing optimality conditions, we convert the attack design problem into novel combinatorial optimization problems and develop efficient algorithms. Our evaluations on real network topologies reveal significant damage of the DGoS attack and provide insights for future defenses.</p></div></body>
		</text>
</TEI>
