<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>When Does the Gittins Policy Have Asymptotically Optimal Response Time Tail in the M/G/1?</title></titleStmt>
			<publicationStmt>
				<publisher>INFORMS</publisher>
				<date>02/19/2024</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10532338</idno>
					<idno type="doi">10.1287/opre.2022.0038</idno>
					<title level='j'>Operations Research</title>
<idno>0030-364X</idno>
<biblScope unit="volume"></biblScope>
<biblScope unit="issue"></biblScope>					

					<author>Ziv Scully</author><author>Lucas van_Kreveld</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[We consider scheduling in the M/G/1 queue with unknown job sizes. It is known that the Gittins policy minimizes mean response time in this setting. However, the behavior of the tail of response time under Gittins is poorly understood, even in the large-response-time limit. Characterizing Gittins’s asymptotic tail behavior is important because if Gittins has optimal tail asymptotics, then it simultaneously provides optimal mean response time and good tail performance. In this work, we give the first comprehensive account of Gittins’s asymptotic tail behavior. For heavy-tailed job sizes, we find that Gittins always has asymptotically optimal tail. The story for light-tailed job sizes is less clear-cut: Gittins’s tail can be optimal, pessimal, or in between. To remedy this, we show that a modification of Gittins avoids pessimal tail behavior, while achieving near-optimal mean response time.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Scheduling to minimize response time (also known as sojourn time) of single-server queueing models is an important problem in queueing theory, with applications in computer systems, service systems, and beyond. In general, a queueing system will have a response time distribution, denoted T, and there are a variety of metrics that one might hope to minimize. There is significant work on minimizing mean response time E[T], which is the average response time of all jobs in a long arrival sequence <ref type="bibr">(Schrage 1968</ref><ref type="bibr">, Gittins 1989</ref><ref type="bibr">, Aalto et al. 2009</ref><ref type="bibr">, Gittins et al. 2011)</ref>.</p><p>Much less is known about minimizing the tail of response time P[T &gt; t], which is the probability that a job has response time greater than a parameter t 0. In light of the difficulty of studying the tail directly, theorists have studied the asymptotic tail of response time, which is the asymptotic decay of P[T &gt; t] in the t ! 1 limit <ref type="bibr">(Stolyar and Ramanan 2001</ref><ref type="bibr">, N &#250; &#241;ez-Queija 2002</ref><ref type="bibr">, Borst et al. 2003</ref><ref type="bibr">, Boxma and Zwart 2007</ref><ref type="bibr">, Scully et al. 2020b)</ref>. In this work, we consider the preemptive M/G/1 queue and ask the following question.</p><p>Question 1.1. Does any scheduling policy simultaneously optimize the mean and asymptotic tail of response time in the M/G/1? Our focus on the M/G/1, a classic, single-server queueing model <ref type="bibr">(Kendall 1953</ref><ref type="bibr">, Cox and Smith 1961</ref><ref type="bibr">, Kleinrock 1976</ref><ref type="bibr">, Harchol-Balter 2013)</ref>, is motivated by its balance between modeling flexibility and analytical tractability. The fact that the distribution of job sizes (also known as service times) may be general is particularly important for modeling computer systems, where the distribution can be far from exponential <ref type="bibr">(Peterson 1996</ref><ref type="bibr">, Crovella and Bestavros 1997</ref><ref type="bibr">, Harchol-Balter and Downey 1997)</ref>.</p><p>Prior work answers Question 1.1 when job sizes are known to the scheduler. In this setting, the Shortest Remaining Processing Time (SRPT) policy, which preemptively serves the job of least remaining size, always minimizes mean response time <ref type="bibr">(Schrage 1968</ref>).</p><p>However, SRPT's tail performance depends on the job size distribution. 1  &#8226; If the job size distribution is heavy-tailed (roughly, power-law; see Definition 4.1), then SRPT is tail-optimal, meaning that it has the best possible asymptotic tail decay (Definition 4.2).</p><p>&#8226; If the job size distribution is light-tailed (roughly, subexponential; see Definition 5.2), then SRPT is tailpessimal, meaning that it has the worst possible asymptotic tail decay <ref type="bibr">(Definition 5.3)</ref>.</p><p>This answers Question 1.1 for known job sizes: "yes, namely SRPT" in the heavy-tailed case, "no" in the light-tailed case.</p><p>Unfortunately, in practice, the scheduler often does not know job sizes, and, thus, one cannot implement SRPT. Instead, the scheduler often only knows the job size distribution. We study Question 1.1 in this unknownsize setting.</p><p>The question of minimizing mean response time with unknown job sizes was settled by <ref type="bibr">Gittins (1989)</ref>. He introduced a policy, now known as the Gittins policy, which leverages the job size distribution to minimize mean response time. Roughly speaking, Gittins uses each job's age-namely, the amount of time each job has been served so far-to figure out which job is most likely to complete after a small amount of service, then serves that job. For some job size distributions, Gittins reduces to a simpler policy, such as First-Come, <ref type="bibr">First-Served (FCFS)</ref> or Foreground-Background (FB) <ref type="bibr">(Aalto et al. 2009</ref><ref type="bibr">(Aalto et al. , 2011))</ref>.</p><p>In the unknown-size setting, given that Gittins minimizes mean response time, Question 1.1 reduces to the following.</p><p>Question 1.2. For which job size distributions is Gittins tail-optimal for response time?</p><p>Unfortunately, the asymptotic tail behavior of Gittins is understood in only a few special cases.</p><p>&#8226; In the heavy-tailed case, Gittins has been shown to be tail-optimal, but only under an assumption on the job size distribution's hazard rate <ref type="bibr">(Scully et al. 2020b, corollary 3.5)</ref>.</p><p>&#8226; In the light-tailed case, Gittins sometimes reduces to FCFS or FB <ref type="bibr">(Aalto et al. 2009</ref><ref type="bibr">(Aalto et al. , 2011))</ref>. For light-tailed job sizes, FCFS is tail-optimal <ref type="bibr">(Stolyar and</ref><ref type="bibr">Ramanan 2001, Boxma and</ref><ref type="bibr">Zwart 2007)</ref>, but FB is tail-pessimal <ref type="bibr">(Mandjes and Nuyens 2005)</ref>.</p><p>This prior work leaves Question 1.2 largely open. We do not know whether Gittins is always tail-optimal in the heavy-tailed case, or whether it is sometimes suboptimal, or even tail-pessimal. Moreover, we do not understand Gittins's asymptotic tail at all in the lighttailed case, aside from when Gittins happens to reduce to a simpler policy.</p><p>The prior work above does tell us an important fact: Gittins can be tail-pessimal. This prompts another question.</p><p>Question 1.3. For job size distributions for which Gittins is tail-pessimal, is there another policy that has near-optimal mean response time while not being tailpessimal?</p><p>In this work, we answer Questions 1.1-1.3 for the M/G/1 with unknown job sizes, covering wide classes of heavy-and light-tailed job size distributions.</p><p>The key tool we use to analyze Gittins's asymptotic response time tail is the SOAP (Schedule Ordered by Age-based Priority) framework <ref type="bibr">(Scully and</ref><ref type="bibr">Harchol-Balter 2018, Scully et al. 2020b)</ref>. SOAP gives a universal M/G/1 response time analysis of all SOAP policies, which are scheduling policies where a job's priority level is a function of its age (Definition 3.1). Underlying our Gittins results is a general tail analysis of SOAP policies.</p><p>Our main contributions, which we describe in more detail later (Sections 4 and 5), are as follows:</p><p>&#8226; Heavy-tailed case: We give a sufficient condition under which an arbitrary SOAP policy is tail-optimal (Section 7).</p><p>&#8226; Heavy-tailed case: We show that the above condition always applies to Gittins, implying that it is always tail-optimal (Section 8).</p><p>&#8226; Light-tailed case: We characterize when an arbitrary SOAP policy is tail-optimal, tail-pessimal, or in between (Section 9).</p><p>&#8226; Light-tailed case: We spell out how the above characterization applies to Gittins and show how to modify Gittins to avoid tail pessimality (Section 10).</p><p>&#8226; General case: At the core of our modification of Gittins which avoids tail pessimality is a general result, which states that slightly perturbing the Gittins rank function only slightly affects its mean response time (Theorem 5.10 and Online Appendix EC.2). 2  The rest of the paper introduces definitions and notation (Sections 3 and 6) and concludes with some remarks about our motivating questions (Section 11).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Prior Work</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Asymptotic Tail Analysis of Classic</head><p>Scheduling Policies Because of their frequent occurrence, light-tailed job size distributions have received a great amount of attention by queueing theorists. The performance of policies under light-tailed job sizes is generally measured in terms of the decay rate of the response time tail. In this sense, FCFS has proven to be optimal among all service policies <ref type="bibr">(Stolyar and Ramanan 2001)</ref>. Conversely, Foreground-Background Processor Sharing has the worst possible decay rate of the response time tail <ref type="bibr">(Mandjes and Nuyens 2005)</ref>.</p><p>On the other hand, it is shown that heavy-tailed job sizes can have a large impact on the performance characteristics of the queue. For this reason also, heavy-tailed job sizes have been thoroughly investigated in the literature. For example, researchers have made the striking observation that, contrary to the light-tailed case, FB is optimal where FCFS has the worst possible response time tail <ref type="bibr">(Borst et al. 2003)</ref>. This dichotomy between light and heavy tails is not limited to FCFS and FB <ref type="bibr">(Boxma and Zwart 2007)</ref>.</p><p>Other noteworthy literature highlighting both light and heavy tails includes delicate asymptotic results for a two-class priority policy <ref type="bibr">(Abate and Whitt 1997)</ref> and robust optimization using a limited PS policy <ref type="bibr">(Nair et al. 2010)</ref>.</p><p>With one exception, discussed in Section 2.3, the literature on this subject concerns only a few relatively simple policies. This paper considers policies in which the priority of a job can vary essentially arbitrarily with its age. This generality is needed to analyze the Gittins policy, where a job's priority can be nonmonotonic <ref type="bibr">(Aalto et al. 2011)</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Impossibility of Universal Tail-</head><p>Optimal Scheduling In light of the fact that both FCFS and FB can vary between tail-optimal and tail-pessimal for different job sizes, it is natural to ask whether there is a single policy that is always tail-optimal. <ref type="bibr">Wierman and Zwart (2012)</ref> answer this question with an impossibility result, showing that no policy is tail-optimal for both heavyand light-tailed job sizes, unless the policy has knowledge of the size distribution X or learns X over time.</p><p>One might worry that this impossibility result contradicts our results for Gittins, given that we show Gittins is always tail-optimal in the heavy-tailed case and sometimes tail-optimal in the light-tailed case. The reason there is no contradiction is that the Gittins policy changes based on the size distribution. For instance, there are some light-tailed distributions where Gittins reduces to FCFS and some heavy-tailed distributions where Gittins reduces to FB <ref type="bibr">(Aalto et al. 2009)</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">Tail Optimality of Certain SOAP Policies in</head><p>the Heavy-Tailed Case We mention particularly the relation between this paper and the work of <ref type="bibr">Scully et al. (2020b)</ref>. Both this paper and the prior work study the response-time tail behavior of arbitrary SOAP policies, including the Gittins policy. There are two main factors that distinguish this paper from the prior work.</p><p>&#8226; <ref type="bibr">Scully et al. (2020b)</ref> only study heavy-tailed job size distributions. In contrast, we study both the heavyand light-tailed cases.</p><p>&#8226; <ref type="bibr">Scully et al. (2020b)</ref> show that Gittins is tail-optimal, subject to a condition on the job size distribution's hazard rate <ref type="bibr">(Scully et al. 2020b, corollary 3.5</ref>). However, their analysis is not sharp enough to completely characterize under which (heavy-tailed) job size distributions Gittins is tail-optimal. In contrast, our analysis is sharper, allowing us to identify Gittins's tail performance under any job size distribution.</p><p>With this said, <ref type="bibr">Scully et al. (2020b)</ref> lay an important technical foundation upon which we build to derive our heavy-tailed results. See Section 4.3 for a more technical discussion of what aspects of their work we use and what aspects we improve upon.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4.">Beyond Asymptotic Tail Optimality</head><p>It is well known that FCFS has optimal tail decay rate under light-tailed job sizes. However, decay rate is a relatively crude tail performance measure, as it does not take into account the constant (or nonexponential term) in front of the exponent. Although this paper focuses just on decay rates, we mention that, very recently, a policy was introduced that has a better leading constant than FCFS <ref type="bibr">(Grosof et al. 2021</ref>). An open question remains of what is the best possible leading constant in the response time tail. A by-product of our resultsnamely, that FCFS is the only SOAP policy with optimal decay rate-partially answers this question. Specifically, it follows that no SOAP policy is tail-optimal up to the leading constant.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.5.">Mean Response Time of Modified</head><p>Gittins Policies A recent study <ref type="bibr">(Scully et al. 2022, theorem 7.2)</ref> shows that if one slightly modifies the prioritization rules of SRPT, then the mean response time of the resulting policy is only slightly worse than that of unmodified SRPT (which is optimal in case job sizes are known). It turns out, as shown in this paper, that essentially the same result holds for an approximate version of the Gittins policy, which can thus be seen as the unknown-job-sizes counterpart of <ref type="bibr">Scully et al. (2022, theorem 7.2</ref>). 3</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Model, SOAP Policies, and the Gittins Policy</head><p>We consider an M/G/1 queue with arrival rate &#955;, job size distribution X, and load &#961; &#224; &#955;E[X]. For the tail of the job size distribution, we write F(t) &#224; P[X &gt; t]. We denote the maximum job size by x max &#224; inf{t 0 |F(t) &#224; 0}, allowing x max &#224; 1. We write T &#960; for the M/G/1's response time distribution under policy &#960;. We allow policies to preempt jobs or share the processor without any overhead or loss of work (i.e., the model is preempt-resume). Special attention is given in this paper to the Gittins policy. It assigns each job a rank-namely, a prioritybased on the job's age-namely, the amount of time the job has been served so far. To analyze the Gittins policy, we make use of the SOAP framework <ref type="bibr">(Scully and Harchol-Balter 2018</ref><ref type="bibr">, Scully et al. 2018</ref><ref type="bibr">, 2020b)</ref>, which gives a response time analysis of the following broad class of policies. Definition 3.1. A SOAP policy is a policy &#960; specified by a rank function r &#960; : [0, x max ) ! R. Policy &#960; assigns rank r &#960; (a) to a job at age a. 4 When the policy being discussed is clear from context, we often omit the subscript and simply write r(a). At every moment in time, a SOAP policy serves the job of minimum rank, breaking ties in FCFS order. 5 Definition 3.2. The Gittins policy, denoted "Gtn" in subscripts for brevity, is the SOAP policy with rank function</p><p>Note that the Gittins rank function depends on the job size distribution X by way of F.</p><p>As is standard <ref type="bibr">(Scully and</ref><ref type="bibr">Harchol-Balter 2018, Scully et al. 2018 appendix B)</ref>, we assume that rank functions are piecewise-continuous and piecewise-monotonic, with finitely many pieces in any compact interval for both properties. This holds for Gittins under very mild conditions on the job size distribution X. For example, <ref type="bibr">Aalto et al. (2011)</ref> show that the Gittins rank function is continuous and piecewise-monotonic, provided that X is a continuous distribution with continuous and piecewisemonotonic hazard rate. However, our results are not restricted to continuous job size distributions. Our generic SOAP results require no additional assumptions on X, and our Gittins results require only that X induces a piecewise-continuous and piecewise-monotonic Gittins rank function, which can occur even if X is not continuous.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Heavy-Tailed Job Sizes</head><p>In Section 4.1, we define which job size distributions are heavy-tailed, and we give our criterion for tail optimality in this scenario. The two main results in the heavy-tailed case are presented in Section 4.2:</p><p>&#8226; Theorem 4.6 gives a sufficient condition for a SOAP policy to be tail-optimal for heavy-tailed job sizes.</p><p>&#8226; Theorem 4.7 shows that for heavy-tailed job sizes, Gittins always satisfies this sufficient condition and is thus always tail-optimal.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">Background on Heavy-Tailed Job Sizes</head><p>Roughly speaking, the heavy-tailed job size distributions we study are those that are asymptotically Pareto. The specific class that we study, described below, is slightly more general in that it also includes distributions whose tails oscillate between Pareto tails of different shape parameters.</p><p>Definition 4.1 (Heavy-Tailed Job Size Distribution). We say a job size distribution X is nicely heavy-tailed if x max &#224; 1 and both of the following hold:</p><p>i. The tail F(&#8226;) is of intermediate regular variation <ref type="bibr">(Cline 1994)</ref>, meaning lim inf</p><p>ii. There exist &#946; &#945; &gt; 1 such that the upper and lower Matuszewska indices of F(&#8226;) are in ( &#946;, &#945;) <ref type="bibr">(Bingham et al. 1987, section 2.1)</ref>. This implies that for all sufficiently large x 2 x 1 , 6</p><p>In informal discussion, we omit "nicely." Although the above definition includes all distributions with power-law-like tail decay, it is noted that we do not consider tails of a less heavy order, such as those of the lognormal and Weibull distributions. Consider an M/G/1 with nicely heavy-tailed job size distribution X. We call a scheduling policy &#960; tail-optimal among preemptive work-conserving policies if</p><p>That is, tail optimality holds if large jobs have a response time of approximately 1=(1 &#961;) times their size, which is the best possible asymptotic tail decay in the heavytailed case <ref type="bibr">(Boxma and</ref><ref type="bibr">Zwart 2007, Wierman and</ref><ref type="bibr">Zwart 2012)</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">Results for Heavy-Tailed Case</head><p>Let us focus on a tagged job of size x. For determining whether it will be delayed by other jobs, it is important to know the worst (highest) ever rank that it will ever have, as well as the ages at which other jobs will have rank lower than that worst ever rank.</p><p>Definition 4.3. The worst ever rank of a job of size x is defined by w x &#224; sup 0 &#63743; a &lt; x r(a). Definition 4.4. i. A w-interval is an interval (b, c) with 0 &#63743; b &lt; c &#63743; x max such that r(a) &#63743; w for all a 2 (b, c).</p><p>ii. A w-interval (b, c) is right-maximal if for all c 0 c, the interval (b, c 0 ) is not a w-interval. This is equivalent to c satisfying either r(c) w or c &#224; x max . We define leftmaximal similarly, and we call a w-interval maximal if it is both left-and right-maximal.</p><p>Operations Research, Articles in Advance, pp. 1-18, &#169; 2024 INFORMS Downloaded from informs.org by [128.84.127.144] on 26 February 2024, at 08:14 . For personal use only, all rights reserved.</p><p>Note that the tagged job of size x, no matter its age, always has priority over jobs that have rank higher than w x . Therefore, it can only be delayed by another job if that other job's age is in a w x -interval. See Figure <ref type="figure">1</ref> for an illustration. To ensure that the tagged job does not wait too long behind other jobs, the w x -intervals must be relatively short. We use the following condition to characterize the length of w x -intervals. ii. The w x -interval's endpoint is bounded by c &#63743; O(x &#951; ). 7</p><p>Note that to check that (i) and (ii) hold, it suffices to consider only right-maximal w x -intervals.</p><p>At an intuitive level, we can think of Condition 4.5 as saying the following about how much other jobs delay the tagged job of size x:</p><p>&#8226; If &#950; and &#952; are small enough, then w x -intervals are relatively short, so jobs of age greater than x cannot delay the tagged job for too long. Figure <ref type="figure">2</ref> illustrates the difference between the roles of &#950; and &#952; (see also Section 4.3).</p><p>&#8226; If &#951; is small enough, then there are no w x -intervals at sufficiently large ages, so jobs of sufficiently large age cannot delay the tagged job at all. Rank functions that satisfy Condition 4.5 do so for many possible parameter values. For instance, if Condition 4.5 is satisfied with parameters (&#950;, &#952;, &#951;), then it is also satisfied for (&#950; + &#948;, &#952; &#948; + &#9999;, &#951; + &#9999;) for all &#948;, &#9999; &gt; 0. But the idea is to find parameters that characterize a SOAP policy's rank function as tightly as possible because, as our main result below shows, if the parameters are small enough, then the policy is tail-optimal.</p><p>Theorem 4.6. Consider an M/G/1 with any nicely heavy-tailed job size distribution under a SOAP policy. Condition 4.5 implies the policy is tail-optimal if its parameters satisfy</p><p>We can apply Theorem 4.6 to show that Gittins is tailoptimal. Specifically, we will show that Gittins satisfies Condition 4.5 with &#950; &#224; 0, &#952; &#224; 1, and &#951; &#224; 1. As such, it achieves a value of zero on the left-hand side of (4.1), so Gittins is tail-optimal, regardless of the values of &#946; &#945; &gt; 1.</p><p>Theorem 4.7. The Gittins policy is tail-optimal for any nicely heavy-tailed job size distribution.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.3.">Comparing Our Tail Optimality Condition to That of Scully et al. (2020c)</head><p>Having formally stated our sufficient condition for tail optimality in the heavy-tailed case, we may now compare it in more detail to that of <ref type="bibr">Scully et al. (2020b)</ref>. Their condition <ref type="bibr">(Scully et al. 2020b</ref>, assumption 3.2) and corresponding result <ref type="bibr">(Scully et al. 2020b, theorem 3.3</ref>) are the same as our Condition 4.5 and Theorem 4.6, respectively, but restricted to the &#952; &#224; 0 case. This means, roughly speaking, that <ref type="bibr">Scully et al. (2020b)</ref> only look at the lengths between "peaks" of the rank</p><p>Figure 1. (Color online) Illustration of Worst Ever Rank w x (Purple Dotted Line) and Maximal w x -Intervals (Shaded Orange Regions) for a SOAP Policy Given by Rank Function r (Solid Cyan Curve) Note. A tagged job of size x always has rank w x or better, so if another job has priority over the tagged job, that other job's age must be in a w x -interval. Notes. (a) Rank function with &#950; &#224; 1 and &#952; &#224; 0. (b) Rank function with &#950; &#224; 0 and &#952; &#224; 1. Roughly speaking, one should think of &#950; + &#952; as characterizing how far apart different "peaks" of the rank function are, and one should think of &#950;=(&#950; + &#952;) as characterizing how "steep" the rank function is between peaks. Both (a) and (b) have the same peaks, as reflected by (a) and (b) having the same value of &#950; + &#952;. But the slopes are much steeper in (a) than in (b), as reflected by the larger value of &#950;=(&#950; + &#952;) in (a) than in (b). Unfortunately, looking only at distances between peaks of the rank function is not enough to prove Gittins's tail optimality in the heavy-tailed case. For Gittins, it turns out that we cannot, in general, do better than setting &#951; &#224; 1. If we had to set &#952; &#224; 0, then to make Gittins satisfy Condition 4.5, it turns out that we would need to set &#950; &#224; 1, which is too large to satisfy (4.1). But the Gittins rank function looks less like Figure <ref type="figure">2</ref>(a) and more like Figure <ref type="figure">2(b)</ref>: even though the peaks can be far apart, there are "gentle slopes" between them. Setting &#952; &#224; 1 and &#950; &#224; 0 captures this behavior, and this satisfies (4.1). Thus, our refining of the sufficient condition of <ref type="bibr">(Scully et al. 2020b</ref>, assumption 3.2) is necessary to prove Gittins's tail optimality in the heavy-tailed case, at least for this SOAP-based approach.</p><p>Underlying the tail optimality results of Scully et al. ( <ref type="formula">2020b</ref>) is a busy period analysis combined with asymptotic response time bounds. We make use of their busy period analysis, which we distill into a simple statement (Lemma 7.4), but we replace their asymptotic response time bounds with a sharper analysis that accounts for the &#952; &gt; 0 possibility (Sections 7.1 and 7.2).</p><p>Finally, we reiterate that whereas Scully et al. (2020b) study only heavy-tailed size distributions, we also study light-tailed job size distributions. Our results for the light-tailed case are based on very different techniques, reflecting fundamental differences between the heavy-and light-tailed settings.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Light-Tailed Job Sizes</head><p>Similarly to the previous section, we first define the class of light-tailed distributions and state the corresponding tail-optimality criterion in Section 5.1. The main results in the light-tailed case, presented in Section 5.2, are summarized as follows:</p><p>&#8226; Theorem 5.5 classifies SOAP policies into tailoptimal, tail-intermediate, and tail-pessimal for lighttailed job sizes.</p><p>&#8226; Theorem 5.8 shows that for light-tailed job sizes, Gittins can be any of tail-optimal, tail-intermediate, or tail-pessimal.</p><p>&#8226; Theorem 5.10 shows that making a small change to the Gittins rank function results in only a small change to mean response time.</p><p>&#8226; Theorem 5.11 shows that for a wide class of lighttailed job size distributions for which Gittins is tailpessimal, making a small change to Gittins's rank function results in a tail-optimal or -intermediate policy with mean response time arbitrarily close to Gittins's.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.">Background on Light-Tailed Job Sizes</head><p>Definition 5.1. The decay rate of random variable V,</p><p>Higher decay rates thus correspond to asymptotically lighter tails.</p><p>Roughly speaking, the light-tailed job size distributions we study are those with positive decay rate. Our main tool for investigating the decay rate of a random variable V is via its Laplace-Stieltjes transform L[V], defined as</p><p>Under mild conditions on V <ref type="bibr">(Nakagawa 2005</ref><ref type="bibr">(Nakagawa , 2007</ref>; Mimica 2016), we can determine its decay rate in terms of the convergence of its Laplace-Stieltjes transform:</p><p>(5.1)</p><p>The specific class of light-tailed job size distributions we consider, described below, are those that allow us to use (5.1) throughout this work (Online Appendix EC.3). The class includes essentially all light-tailed distributions of practical interest, such as finite-support, phase-type, and Gaussian-tailed distributions. In the terminology of <ref type="bibr">Abate and Whitt (1997)</ref>, we consider all "Class I" distributions. 8</p><p>Definition 5.2 (Light-Tailed Job Size Distribution). Given a job size distribution X, let</p><p>In informal discussion, we omit "nicely."</p><p>Definition 5.3 (Tail Optimality in Light-Tailed Case). Consider an M/G/1 with nicely light-tailed job size distribution X. We say a scheduling policy &#960; is</p><p>In each case, we mean minimizing or maximizing over preemptive work-conserving policies. In informal discussion, we omit "log-."</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2.">Results for Light-Tailed Case</head><p>We have seen that a job's worst ever rank plays an important role in the heavy-tailed setting. When the job sizes are light-tailed, we are interested in the age at which the rank function's global maximum occurs.</p><p>Definition 5.4. The worst age, denoted a &#8676; , is the earliest age at which a job has the global maximum rank:</p><p>If the rank function has no maximum, we define a &#8676; &#224; x max .</p><p>As an example, FCFS has a &#8676; &#224; 0 because a job's priority is the worst before it starts service. In contrast, FB has a &#8676; &#224; x max because a job's priority gets strictly worse with age.</p><p>We already know that FCFS and FB are tail-optimal and tail-pessimal, respectively. The theorem below fills in the gaps for all other SOAP policies, showing that the performance of the response time tail is completely determined by the worst age a &#8676; . Theorem 5.5. Consider an M/G/1 with any nicely lighttailed job size distribution under a SOAP policy. Let</p><p>To apply Theorem 5.5 to the Gittins policy, we need to characterize how the job size distribution X affects Gittins's worst age a &#8676; . Definition 5.6. We define two classes of distributions: NBUE and ENBUE.</p><p>&#8226; We say X is New Better than Used in Expectation, writing X 2 NBUE, if for all ages a 2 [0, x max ),</p><p>Remark 5.7. It is well known that the NBUE class includes all distributions with (weakly) increasing hazard rate. Similarly, one can show that the ENBUE class includes all distributions with "eventually increasing" hazard rate, meaning that the hazard rate is increasing at all ages greater than some threshold a 0 &lt; x max .</p><p>Results of <ref type="bibr">Aalto et al. (2009</ref><ref type="bibr">Aalto et al. ( , 2011) )</ref> connect the classes NBUE and ENBUE to Gittins's worst age a &#8676; , implying the following characterization.</p><p>Theorem 5.8. Consider an M/G/1 with any nicely lighttailed job size distribution X. Gittins is</p><p>More generally, Theorem 5.5 can imply an analogue of Theorem 5.8 for other SOAP policies, whose rank at age a is related to the expected remaining size E[X a| X &gt; a], such as the SERPT policy (Remark 10.1).</p><p>The fact that Gittins can be log-tail-pessimal is intriguing, considering that it is optimal for mean response time and tail-optimal under heavy-tailed job sizes. Fortunately, in most cases where Gittins is logtail-pessimal, slightly tweaking Gittins yields a log-tailintermediate policy without sacrificing much mean response time performance. Definition 5.9. A SOAP policy &#960; is a q-approximate Gittins policy if there exists a constant m &gt; 0 such that for all ages a 2 [0, x max ),</p><p>We may assume without loss of generality that m &#224; 1 because the policy &#960; 0 with rank function r &#960; 0 (a) &#224; r &#960; (a)=m has identical behavior to policy &#960;.</p><p>Theorem 5.10. Consider an M/G/1 with any job size distribution. For any q 1 and any q-approximate Gittins policy &#960;, 9</p><p>An important observation is that a q-approximate Gittins policy has near-optimal mean response time for q close to one. At the same time, changing the Gittins rank function even within a small factor q can decrease the worst age, and therefore improve the tail performance.</p><p>Theorem 5.11. Consider an M/G/1 with nicely lighttailed job size distribution X &#8713; ENBUE. Suppose that the expected remaining size of a job at all ages is uniformly bounded, meaning</p><p>Then, for all &#9999; &gt; 0, there exists a (1 + &#9999;)-approximate Gittins policy that is log-tail-optimal or log-tail-intermediate.</p><p>As an example in which log-tail-pessimality may be avoided, consider a hyperexponential job size X with two different rates. That is, for i &#224; 1, 2, with probability p i , the job size is sampled from an exponential with rate &#181; i . Here, p 1 , p 2 &gt; 0 such that p 1 + p 2 &#224; 1, and we assume without loss of generality that &#181; 1 &gt; &#181; 2 .</p><p>Because the hazard rate of a hyperexponential distribution is decreasing, r Gtn is increasing. Therefore, Gittins reduces to FB, so it is log-tail-pessimal. Fortunately, though, E[X a| X &gt; a] &lt; 1=&#181; 2 for all ages a; hence, Theorem 5.11 can be applied: for any q &gt; 1, the policy with rank function</p><p>where &#227; is such that qr Gtn (&#227;) &#224; 1=&#181; 2 is a q-approximate Gittins policy with a &#8676; &#224; &#227; &lt; 1. See Figure <ref type="figure">3</ref>. Although Theorem 5.11 implies that any approximation factor q &gt; 1 suffices to prevent log-tail-pessimality, there is a tradeoff to be made when choosing q. Higher values of q result in worse guarantees for the mean (Theorem 5.10). But higher values of q lead to lower values of a &#8676; , which, in turn, result in better tail decay (Lemma 9.8). We leave exploring this tradeoff quantitatively to future work.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.3.">Proof Organization</head><p>The remainder of this paper is organized as follows. Necessary background and notation on the SOAP framework is given in Section 6. Then, we prove our results for heavy tails, Theorems 4.6 and 4.7, respectively, in Sections 7 and 8. Similarly, proofs of our results for the light-tailed case are given in Sections 9 (Theorem 5.5) and 10 (Theorems 5.8 and 5.11). The remaining main result, Theorem 5.10, requires substantially more technical machinery for its proof, which is why we defer it to Online Appendix EC.2. Finally, Section 11 describes how our results answer the questions posed in Section 1.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">SOAP Notation</head><p>We use the following notations related to SOAP policies, which are standard in the literature <ref type="bibr">(Scully and Harchol-Balter 2018</ref><ref type="bibr">, Scully et al. 2018</ref><ref type="bibr">, 2020a,b)</ref>. These definitions are necessary for writing down and working with the response time formulas of SOAP policies. All of these definitions are given in terms of a SOAP policy with rank function r. Throughout, it will always be clear from context which rank function is being referred to.</p><p>One may easily check that</p><p>) is indeed a w-interval, with only one exception: it may be that b 0 [w] &#224; c 0 [w] &#224; 0, in which case the interval is empty and, thus, not a w-interval. See Figure <ref type="figure">1</ref>, whose shaded orange regions are specifically the zeroth, first, and second maximal w x -intervals for the pictured size x. Definition 6.2. The kth w-relevant job segment is the random variable</p><p>The following lemma gives a convenient formula for moments of relevant job segments. Lemma 6.3 <ref type="bibr">(Scully et al. 2020b, lemma 6.16</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>E[X</head><p>Definition 6.4. For a job of size x, we define 11</p><p>See Figure <ref type="figure">4</ref>. Intuitively, y x is the earliest age at which a job of size x attains its worst ever rank w x , and z x is the earliest age at which the rank function exceeds w x . Note that it may be that y x &#224; x &#224; z x . This occurs if the rank function is strictly increasing at x, and x is a "running maximum," meaning r(a) &lt; r(x) for all ages a 2 [0, x).</p><p>Our final piece of notation is a generalization of a job's worst ever rank. Definition 6.5. The worst future rank of a job of size x at age a, written w x (a), is Notes. By Theorem 5.5, these have different tail asymptotics. The Gittins rank function attains its supremum of 1=&#181; 2 in the a ! 1 limit, so it is tail-pessimal. But the q-approximate Gittins rank function, described in (5.2), attains its supremum at a finite age &#227;, so it is tailintermediate. Note that the worst ever rank is simply the worst future rank at age zero-that is, w x &#224; w x (0).</p><p>One can write a formula characterizing T(x), the response time distribution of jobs of size x, in terms of the notation from Definitions 6.2 and 6.5. We refer the reader to <ref type="bibr">Scully and Harchol-Balter (2018)</ref> and <ref type="bibr">Scully et al. (2018)</ref> for details, noting here one simple consequence of that formula. Lemma 6.6. Under any SOAP policy, the response time of jobs of size x is stochastically increasing in x. That is, for all x 1 x 0 &gt; 0, we have T(x 0 ) &#63743; st T(x 1 ).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.">Heavy-Tailed Job Sizes: Tail Asymptotics of SOAP Policies</head><p>This section is devoted to proving Theorem 4.6, which gives a sufficient condition for a SOAP policy to be tailoptimal. Our first step is to invoke a result of <ref type="bibr">Scully et al. (2020b)</ref>. 12 Their result reduces the task of proving tail optimality to a much simpler task-namely, bounding various functions of moments of w x -relevant job segments. To state their result, we use a "polynomially strict" version of little-o notation, which we write as &#466;(&#8226;).</p><p>Definition 7.1. For p &gt; 0, the notation &#466;(x p ) stands for O(x p &#9999; ) for some unspecified &#9999; &gt; 0.</p><p>Condition 7.2. There exists q &gt; &#946; such that for all p 2 (0, q),</p><p>Lemma 7.4 <ref type="bibr">(Scully et al. 2020c)</ref>. Consider an M/G/1 with nicely heavy-tailed job size under a SOAP policy. Conditions 7.2 and 7.3 together imply tail optimality.</p><p>Our proof of Theorem 4.6 is thus based on verifying Conditions 7.2 and 7.3. That is, we want to show (7.1) under certain conditions. The first step is to compute the left-hand side of (7.1), while assuming only Condition 4.5. The following lemma does so, separating out the k &#224; 0 term because it has a slightly different form.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Proof of</head><p>Lemma 7.6. Suppose Condition 4.5 holds.</p><p>i. For all p 0,</p><p>ii. For all p 0,</p><p>The proof of Lemma 7.6 is largely computational, so we defer it to Online Appendix EC.1.1.</p><p>Proof of Lemma 7.5. Our goal is to choose q &gt; &#946; such that for all p 2 (0, q), (7.1) holds. Specifically, we choose We have q &gt; &#946; by (4.1), and an analogue of (4.1) holds for all p 2 (0, q)-namely,</p><p>By Lemma 7.6, it suffices to show that for all p 2 (0, q), both of the following hold:</p><p>Under the assumptions of Definition 4.1 and Condition 4.5, namely,</p><p>We show below that both of these are implied by (7.2). To reduce clutter, let &#957; &#224; &#945; 1 p :</p><p>For (7.4), we compute</p><p>and for (7.5), we compute</p><p>w (7:2): w Remark 7.7. Note that in addition to (7.2) implying (7.5), the reverse implication also holds. This suggests that the precondition of Theorem 4.6, and in particular (4.1), cannot be easily relaxed.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.2.">Showing</head><p>Condition 7.3 Our goal in this section is to prove Proposition 7.8 below. It is applicable to proving Theorem 4.6 because (4.1) implies its precondition. Proposition 7.8. If &#950; &lt; 1 or &#951; &lt; 1, then Condition 4.5 implies Condition 7.3.</p><p>Proof. The &#951; &lt; 1 case follows from a result of <ref type="bibr">(Scully et al. 2020b, lemma 7.</ref>3), so we address only the &#950; &lt; 1 case. We first observe that for all &#961; 0 2 [0, &#961;], we have</p><p>We can rewrite the integrand as</p><p>Of course, the integrand is also bounded above by</p><p>A job of size x attains its worst ever rank w x at age y x . This means that for all a &lt; y x , we have w x (a) &#224; w x , which by Definition 6.4 implies c 0 [w x (a) ] &#224; y x . Splitting the integral in (7.7) at a &#224; y x yields</p><p>Because y x &#63743; x and &#945; &gt; 1, it suffices to show that the integral in (7.8) is &#466;(x).</p><p>The main remaining obstacle is bounding c 0 [w x (a) ]. We do so in Lemma 7.9, which states</p><p>Operations Research, Articles in Advance, pp. 1-18, &#169; 2024 INFORMS Downloaded from informs.org by [128.84.127.144] on 26 February 2024, at 08:14 . For personal use only, all rights reserved.</p><p>for some &#954; 2(&#945; 1). Plugging this into (7.8) and substituting u &#224;</p><p>Because &#950; &lt; 1 and (&#945; 1)=&#954; 2 (0, 1=2], this is &#466;(x), as desired. w</p><p>The following lemma bounds c 0 [w x (a) ]. We defer its proof to Online Appendix EC.1.1. Lemma 7.9. Suppose Condition 4.5 holds, and let &#954; &#224; 2 max{&#945; 1, &#952;}. For all x 0 and a 2 (y x , x),</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="8.">Heavy-Tailed Job Sizes: Gittins Is Tail-Optimal</head><p>In this section, we prove Theorem 4.7-namely, that Gittins is tail-optimal for heavy-tailed job sizes. Specifically, we will show that Gittins satisfies Condition 4.5 with &#950; &#224; 0, &#952; &#224; 1, and &#951; &#224; 1:</p><p>This suffices because with the above values, (4.1) holds for all &#946; &#945; &gt; 1, so Theorem 4.6 implies tail optimality.</p><p>In fact, Condition 4.5 holds trivially when &#951; &#224; 1, so only Condition 4.5 remains.</p><p>Our goal is thus show that for the Gittins rank function, 13   any right-maximal w x -interval (b, c) with b x has length c b &#63743; O(x):</p><p>(8.1)</p><p>In words, (8.1) says that whenever the Gittins rank function dips below the worst rank a job of size x ever has, it does so for an interval of length at most O(x). See Figure <ref type="figure">5</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="8.1.">The Time-Per-Completion Function</head><p>To make further progress, we need to consider the specific form of the Gittins rank function. One useful way of thinking about the Gittins rank function uses the following definition <ref type="bibr">(Aalto et al. 2009</ref><ref type="bibr">(Aalto et al. , 2011))</ref>.</p><p>Definition 8.1. The time-per-completion function for a given job size distribution S is defined for 0</p><p>The intuition is that if we serve a job from age b until either age c or its completion, whichever comes first, then we can interpret &#966;(b, c) as That is, under Gittins, a job's rank at age a is the best possible time-per-completion ratio achievable on any interval starting at age a.</p><p>How does using the time-per-completion function help us show (8.1)? The key observation is that for the Gittins rank function to be low over an interval (b, c), a job must be relatively likely to complete during (b, c), which would imply that &#966;(b, c) is also low. The following lemma, which we prove in Online Appendix EC.2.2, formalizes this intuition. Lemma 8.2. Under Gittins, for any right-maximal w-interval (b, c), we have &#966;(b, c) &#63743; w.</p><p>We note that although many results similar to Lemma 8.2 have been shown in prior work <ref type="bibr">(Aalto et al. 2009</ref><ref type="bibr">(Aalto et al. , 2011;;</ref><ref type="bibr">Scully et al. 2020b)</ref>, to the best of our knowledge, Lemma 8.2 itself is new.</p><p>With Lemma 8.2 in hand, showing (8.1) amounts to</p><p>&#8226; Proving an upper bound on w x , and &#8226; Proving a lower bound on &#966;(b, c) for all rightmaximal w x -intervals (b, c).</p><p>The first of these follows simply from prior work: Scully et al. (2020b, section 3.2) show that the Gittins rank function is bounded by r Gtn (a) &#63743; O(a), which implies</p><p>(8.3) Note. Any w x -interval (shaded orange regions)-namely, any interval where the Gittins rank function (solid cyan curve) is better than the worst ever rank w x of a job of size x-has length O(x). </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="8.2.">Bounding Lengths of w x -Intervals</head><p>There is one more fact from prior work that we need to prove (8.4).</p><p>Lemma 8.4 <ref type="bibr">(Scully et al. 2021, theorem 6.4)</ref>. Under Gittins, y x &#224; &#920;(x) and z x &#224; &#920;(x).</p><p>In general, (y x , z x ) is a right-maximal w x -interval, but there can be plenty of other w x -intervals starting at values greater than z x . How can we use Lemma 8.4 to study such intervals? The key is to look not at y x and z x , but at y u and z u , where u is some point inside the w x -interval whose length we wish to bound.</p><p>Specifically, consider a right-maximal w x -interval (b, c) with b x and let u 2 (b, c) be an arbitrary size in the interval. Figure <ref type="figure">6</ref> illustrates the relationship between the interval (b, c), the worst ever rank w u of size u, and the ages y u and z u (Definition 6.4). The figure suggests the following lemma, which is a slight generalization of a result of <ref type="bibr">Scully et al. (2020b, lemma 6.18</ref>). Lemma 8.5. Under any SOAP policy, for any w x -interval (b, c) with b x and any u 2 (b, c),</p><p>Proof. It is clear from Definition 6.4 that y u &#63743; u &#63743; z u . It thus suffices to show that y u &#8713; (b, c) and z u &#8713; (b, c). Both steps use the fact that x &#63743; b &lt; u implies w x &#63743; w u (Definition 4.3).</p><p>We first show that z u &#8713; (b, c). By Definition 4.4, the rank function does not exceed w x over the interval (b, c). But Definition 6.4 implies that every neighborhood of z u contains a point whose rank is greater than w u . Because w u w x , it must be that z u &#8713; (b, c).</p><p>If w u &gt; w x , then the argument that y u &#8713; (b, c) is analogous to that for z u . The difference is that this time, Definition 6.4 implies that for all &#9999; &gt; 0, every neighborhood of y u contains a point whose rank is greater than w u &#9999;. Because w u &#9999; &gt; w x for small enough &#9999;, it must be that y u &#8713; (b, c).</p><p>If, instead, w u &#224; w x , then y u &#224; y x , and we can reason more simply: Definition 6.4 implies y x &#63743; x, and we have assumed x &#63743; b, so y u &#63743; b. w Lemmas 8.4 and 8.5 combine to prove (8.4), from which Theorem 4.7 follows. The fact that c b &#63743; O(x) means that Gittins satisfies Condition 4.5 with &#950; &#224; 0, &#952; &#224; 1, and &#951; &#224; 1. These obey (4.1), so by Theorem 4.6, Gittins is tail-optimal. w</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Proof of</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="9.">Light-Tailed Job Sizes: Tail Asymptotics of SOAP Policies</head><p>In this section, we prove our main theorem for lighttailed job sizes.</p><p>Theorem 5.5. Consider an M/G/1 with any nicely lighttailed job size distribution under a SOAP policy. Let Operations Research, Articles in Advance, pp. 1-18, &#169; 2024 INFORMS Downloaded from informs.org by [128.84.127.144] on 26 February 2024, at 08:14 . For personal use only, all rights reserved.</p><p>&#8226; Log-tail-intermediate if 0 &lt; a &#8676; &lt; x max , and &#8226; Log-tail-pessimal if a &#8676; &#224; x max .</p><p>Proof. The result follows from Propositions 9.1, 9.2, and 9.9, which we prove in the rest of this section. w 9.1. Tail-Optimal Case Proposition 9.1. Consider an M/G/1 with any nicely light-tailed job size distribution under a SOAP policy. The policy is log-tail-optimal if a &#8676; &#224; 0.</p><p>Proof. Suppose that a &#8676; &#224; 0. In this case, because of the FCFS tiebreaking (Definition 3.1), the oldest job in the system always has priority over all other jobs. Therefore, the SOAP policy is exactly FCFS, which is known to be log-tail-optimal <ref type="bibr">(Stolyar and</ref><ref type="bibr">Ramanan 2001, Boxma and</ref><ref type="bibr">Zwart 2007</ref>). w 9.2. Tail-Pessimal Case <ref type="bibr">Mandjes and Nuyens (2005)</ref> show that the FB policy, which has rank function r(a) &#224; a, is log-tail-pessimal.</p><p>Their argument focuses on analyzing the response time of very large jobs, showing that such jobs' response time tails have small decay rate. The fact that their argument focuses on just the very large jobs suggests that the essential property of FB is that it assigns the largest rank at the largest ages. Our tail pessimality result below shows this is indeed the case.</p><p>Proposition 9.2. Consider an M/G/1 with any nicely light-tailed job size distribution under a SOAP policy. The policy is log-tail-pessimal if a &#8676; &#224; x max .</p><p>We prove Proposition 9.2 in Online Appendix EC.1.2 by generalizing the proof that <ref type="bibr">Mandjes and Nuyens (2005)</ref> give for FB, making use of several of their intermediate results along the way. We describe the approach below, after which we present the proof.</p><p>Consider first the case where x max &lt; 1, meaning we could have a job of size x max . Because a &#8676; &#224; x max , jobs of size x max complete at the end of the busy period during which they arrive. This is the latest time a job can complete under a work-conserving policy, so d(T(x max )) &#224; d min , where d min is the minimal, and thus pessimal, response time tail decay rate. If X has an atom at x max , meaning that X &#224; x max occurs with positive probability, then d(T) &#224; d(T(x max )) &#224; d min .</p><p>Of course, for general job size distributions X, it may be that X &#224; x max occurs with probability zero. This is certainly the case if x max &#224; 1. Therefore, to generalize the argument above, instead of considering T(x max ), we consider T(x) in the x ! x max limit. Although d(T(x)) &gt; d min for any fixed x &lt; x max , we show that lim x!x max d (T(x)) &#224; d min . This means that for any &#9999; &gt; 0, a positive fraction of jobs experience decay rate less than d min + &#9999;, implying d(T) &#224; d min , as desired.</p><p>The last ingredient we need is notation for discussing the M/G/1 and, in particular, busy periods. This is because d min turns out to be a busy period's decay rate <ref type="bibr">(Mandjes and Nuyens 2005, corollary 6</ref>). Definition 9.3.</p><p>i. We denote by B the distribution of an M/G/1 busy period length with arrival rate &#955; and job size distribution X. More generally, we write B(u) for a busy period with initial work u.</p><p>ii. We denote by B a the distribution of an M/G/1 busy period length with arrival rate &#955; and truncated job size distribution min{X, a}. More generally, we write B a (u) for a such busy period with initial work u.</p><p>iii. We denote by W the distribution of the total amount of work in an M/G/1.</p><p>It is known that d min &#224; d(B) (Mandjes and Nuyens 2005, corollary 6), so proving Proposition 9.2 amounts to showing lim x!x max d(T(x)) &#224; d(B). To show this, we first prove d(T(x)) &#63743; d(B y x ), so it suffices to show lim x!x max d(B y x ) &#224; d(B). This follows from the known fact that lim y!x max d(B y ) &#224; d(B) (Mandjes and Nuyens 2005, proposition 8) and additional computation. See Online Appendix EC.1.2 for details.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="9.3.">Tail-Intermediate Case</head><p>We finally turn to the case where 0 &lt; a &#8676; &lt; x max , where we will show that the corresponding SOAP policy is log-tail-intermediate. We first simplify the problem of analyzing an arbitrary SOAP policy with 0 &lt; a &#8676; &lt; x max to the problem of analyzing two policies, called Step and Spike (Definition 9.4 and Figure <ref type="figure">7</ref>), with similar rank functions. We then bound the decay rates of Step and Spike.</p><p>For brevity, we give only the key definitions and lemma structure of the proof, building up to Proposition 9.9, which states the main result for the tailintermediate case. The proofs of the individual steps are either largely computational or follow easily from prior work, so we defer the proofs to Online Appendix EC.1.2. Definition 9.4. For a given value of a &#8676; 2 (0, x max ) and r &#8676; &gt; 0, the Step and Spike policies are the SOAP policies given by the following rank functions, which are illustrated in Figure <ref type="figure">7</ref>:</p><p>r spike (a) &#224; r &#8676; 1(a &#224; a &#8676; ):</p><p>We will compare the response time tail of a SOAP policy with 0 &lt; a &#8676; &lt; x max and r(a &#8676; ) &#224; r &#8676; to the response time tails of Step and Spike for those values of a &#8676; and r &#8676; . This comparison is possible because, as illustrated in Figure <ref type="figure">7</ref>, all three policies have qualitatively similar rank functions. 14  For the remainder of this section, we consider SOAP policies &#960; with 0 &lt; a &#8676; &lt; x max and r(a &#8676; ) &#224; r &#8676; . We divide jobs in two classes:</p><p>&#8226; Class 1, jobs of size at most a &#8676; ; and</p><p>&#8226; Class 2, jobs of size greater than a &#8676; . For each class i 2 {1, 2}, let &#955; (i) be the arrival rate of class i jobs, and let T (i)  &#960; be the response time of class i jobs under policy &#960;.</p><p>It turns out that only class 2 jobs affect the asymptotic decay rate of response time. Moreover, it turns out that the Step and Spike policies represent the worst-case and best-case scenarios for class 2 jobs. These facts, expressed in the following lemma, imply that it suffices to analyze the response time decay rate for class 2 jobs under Step and Spike. Lemma 9.5. Let &#960; be a SOAP policy with 0 &lt; a &#8676; &lt; x max . We have</p><p>With Lemma 9.5 in hand, to show that &#960; is tailintermediate, it suffices to show that both Step and Spike are tail-intermediate by bounding d(T (2) step ) and d(T (2)  spike ). We begin by characterizing T (2) step and T (2) spike in terms of the M/G/1 concepts defined in Definition 9.3.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Lemma 9.6. The response time distributions of class 2 jobs under</head><p>Step and Spike are</p><p>) is the size distribution of class 2 jobs, and the random variables in each sum are mutually independent.</p><p>Combining Lemmas 9.5 and 9.6 reduces the question of analyzing the decay rate of &#960; to analyzing the decay rate of the busy periods in Lemma 9.6. We will see soon that B a &#8676; (W) is the dominant term, so both Step and Spike have decay rate d(B a &#8676; (W)). In order to prove that B a &#8676; (W) is indeed the dominant term, and in order to bound its decay rate, we make heavy use of Laplace-Stieltjes transforms (Section 5.1).</p><p>Recall from (5.1) that one can determine a random variable's decay rate by determining when its Laplace-Stieltjes transform converges. 15 We therefore introduce notation to describe when a transform converges. For functions f : R ! R [ { 1, 1}, which diverge below a certain value and converge above it, define</p><p>to be the value at which f switches from diverging to converging. We can thus rewrite (5.1) as</p><p>Because we will be working with Laplace-Stieltjes transforms, we begin by recalling standard results for the transforms of the quantities defined in Definition 9.3. Let</p><p>letting &#963;(s) &#224; 1 if there is no real solution. We define &#963; a similarly.</p><p>Standard M/G/1 results (Harchol-Balter 2013) express the Laplace-Stieltjes transforms of random variables in Definition 9.3 using &#963; and &#963; a . Specifically, for any nonnegative random variable U,</p><p>Therefore, to understand the decay rates of random variables in Definition 9.3, we need to understand &#963; 1 , &#963;, and &#963; a . Figure <ref type="figure">8</ref> illustrates &#963; and &#963; 1 and some key Notes. (a) Rank of Step. (b) Rank of Spike. (c) Rank of generic policy. As shown in Lemma 9.5, for a fixed job size distribution and value of a &#8676; , (a) the Step policy gives the worst possible tail decay, whereas (b) the Spike policy gives the best possible tail decay. This is because in (a), jobs of age a &#8676; or greater have the worst possible rank r &#8676; , whereas in (b), jobs of age a &#8676; or greater have the best possible rank zero. Under (c), a generic SOAP policy &#960;, jobs of age a &#8676; or greater have rank somewhere in between.</p><p>values associated with them. We make frequent use of relationships between these values and other properties of &#963; and &#963; 1 , which are summarized below.</p><p>Lemma 9.7. Consider an M/G/1 with nicely light-tailed job size distribution X, and define</p><p>Then, as illustrated in Figure <ref type="figure">8</ref>, the following hold: i. &#963; 1 is convex on (s 0 , 1), decreasing on (s 0 , s 2 ), and increasing on (s 2 , 1);</p><p>ii. s 0 &lt; s 1 &lt; s 2 &lt; s 3 &lt; 0.</p><p>Analogous statements hold for &#963; a for all a 2 (0, x max ].</p><p>Together, (9.2) and Lemma 9.7 give us the last ingredients we need to compute the decay rate of T &#960; . We then show that this decay rate is neither optimal nor pessimal.</p><p>Lemma 9.8. Let &#960; be a SOAP policy with 0 &lt; a &#8676; &lt; x max . Then, the decay rate of its response time is</p><p>Proposition 9.9. Consider an M/G/1 with any nicely light-tailed job size distribution under a SOAP policy. The policy is log-tail-intermediate if 0 &lt; a &#8676; &lt; x max .</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="10.">Light-Tailed Job Sizes: Gittins Can Be</head><p>Tail-Optimal, Tail-Pessimal, or in Between Theorem 5.8. Consider an M/G/1 with any nicely lighttailed job size distribution X. Gittins is</p><p>Proof. By Theorem 5.5, it suffices to determine the worst age a &#8676; . Combining the following two prior results characterizes a &#8676; in terms of whether X is in each of NBUE and ENBUE.</p><p>&#8226; A result of <ref type="bibr">Aalto et al. (2009, proposition 7</ref>) implies a &#8676; &#224; 0 if and only if X 2 NBUE.</p><p>&#8226; A result of <ref type="bibr">Aalto et al. (2011, proposition 9</ref>) implies a &#8676; &lt; x max if and only if X 2 ENBUE w Theorem 5.11. Consider an M/G/1 with nicely lighttailed job size distribution X &#8713; ENBUE. Suppose that the expected remaining size of a job at all ages is uniformly bounded, meaning</p><p>Then, for all &#9999; &gt; 0, there exists a (1 + &#9999;)-approximate Gittins policy that is log-tail-optimal or log-tail-intermediate.</p><p>so r Gtn (a) is also uniformly bounded. This means that for any &#9999; &gt; 0, there exists some sufficiently large age a(&#9999;) such that increasing the rank at age a(&#9999;) &lt; x max from r Gtn (a(&#9999;)) to (1 + &#9999;)r Gtn (a(&#9999;)) and leaving all other ranks unchanged yields a new SOAP policy with worst age a &#8676; &#224; a(&#9999;). By construction, the new policy is a (1 + &#9999;)-approximate Gittins policy, and because its worst age is a &#8676; &lt; x max , Theorem 5.5 implies it is logtail-optimal or log-tail-intermediate. w</p><p>Recall from Theorem 5.10 that a (1 + &#9999;)-approximate Gittins policy achieves mean response time within a factor of 1 + &#9999; of optimal. We defer its proof to Online Appendix EC.2. This means that Theorem 5.11, whose precondition applies to nonpathological light-tailed job size distributions, gives a non-tail-pessimal policy with near-optimal mean response time. Notes. The partial inverse of &#963; 1 -namely, &#963;-corresponds to the branch going from the minimum of &#963; 1 to the right (orange highlight). As shown in Lemma 9.7, we have s 0 &lt; s 1 &lt; s 2 &lt; s 3 .</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Scully and van Kreveld: The Gittins Policy's Response Time Tail</head><p>Operations Research, Articles in Advance, pp. 1-18, &#169; 2024 INFORMS Remark 10.1. The Shortest Expected Remaining Processing Time (SERPT) policy, which has rank function r SERPT (a) &#224; E[X a| X &gt; a], is sometimes considered as a simpler alternative to Gittins <ref type="bibr">(Scully and Harchol-Balter 2018</ref><ref type="bibr">, Scully et al. 2018</ref><ref type="bibr">, 2020a)</ref>. Our results imply that SERPT has the same M/G/1 tail optimality properties as Gittins for the class of job size distributions we consider.</p><p>&#8226; <ref type="bibr">Scully et al. (2020c)</ref> show that SERPT is always tail-optimal in the heavy-tailed case, which matches what we show for Gittins.</p><p>&#8226; Theorem 5.5 and Definition 5.6 imply that in the light-tailed case, SERPT is tail-optimal, tail-intermediate, and tail-pessimal under the same conditions as we show for Gittins in Theorem 5.8. In fact, one can show a stronger property: SERPT's and Gittins's response time distributions have the same decay rate. This follows from the fact that SERPT and Gittins have the same worst age a &#8676; , as argued in the proof of Theorem 5.8.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="11.">Conclusion</head><p>In this paper, we have characterized the asymptotic tail performance of the response time in an M/G/1 queue under very broad conditions-namely, for every SOAP policy and for both heavy-and light-tailed job size distributions. In the heavy-tailed case, we characterize tailoptimal policies by a sufficient condition on the rank function <ref type="bibr">(Theorem 4.6)</ref>. This condition holds for a wide range of SOAP policies, and specifically for the Gittins policy (Theorem 4.7), providing the first proof of its tail optimality under general conditions. In the light-tailed case, we classify policies' performance as tail-optimal, tail-pessimal, or tail-intermediate. We show that the performance of a SOAP policy depends on the age at which the maximal rank is attained (Theorem 5.5). It turns out that the Gittins policy may belong to any of the three categories, depending on the job size distribution (Theorem 5.8). Finally, when Gittins has pessimal tail performance, boundedness of the expected remaining job size implies that there exists a slight modification of Gittins that has optimal or intermediate tail, while maintaining nearoptimal mean response time (Theorem 5.11).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="11.1.">Returning to the Motivating Questions</head><p>We conclude by returning to Questions 1.1-1.3, restated below for convenience.</p><p>Question 1.1. Does any scheduling policy simultaneously optimize the mean and asymptotic tail of response time in the M/G/1? Question 1.2. For which job size distributions is Gittins tail-optimal for response time? Question 1.3. For job size distributions for which Gittins is tail-pessimal, is there another policy that has near-optimal mean response time while not being tailpessimal?</p><p>Our characterization of Gittins's tail asymptotics (Theorems 4.7 and 5.8) answers Question 1.2, and our modification in the case where Gittins is tail-pessimal (Theorem 5.11) answers Question 1.3 affirmatively. This leaves only Question 1.1. In cases where we have shown that Gittins is tail-optimal, the answer is clearly affirmative. We might hope to conclude that the answer is negative in cases where we have shown that Gittins is tail-pessimal or tail-intermediate, but the situation is still slightly unclear. The remaining ambiguity is due to the fact that we have only considered FCFS tiebreaking when two jobs have the same rank (Definition 3.1), as we explain in more detail below.</p><p>The Gittins policy minimizes mean response time with arbitrary tiebreaking between jobs of the same rank <ref type="bibr">(Gittins 1989</ref><ref type="bibr">, Gittins et al. 2011</ref><ref type="bibr">, Scully and Harchol-Balter 2021)</ref>. Moreover, these proofs can be extended to show that any "non-Gittins" policy is strictly suboptimal for mean response time, where a "non-Gittins" policy is one that for a nonvanishing fraction of time serves a job other than one of minimal Gittins rank. Therefore, to fully answer Question 1.1, one would have to consider Gittins under arbitrary tiebreaking rules. We conjecture that using a different tiebreaking rule cannot improve the asymptotic decay rate of Gittins's response time in the light-tailed case.</p><p>Finally, we note that we have, of course, only answered Questions 1.1-1.3 for the classes of heavyand light-tailed job size distributions that we consider in this work (Definitions 4.1 and 5.2). Practically speaking, we believe the classes of distributions we consider are likely broad enough to draw useful conclusions. But it remains an open question whether we may extend our theory to broader classes of distributions. In particular, it seems likely that our proofs may hold mostly unchanged for additional light-tailed job size distributions, as discussed in Online Appendix EC.3.3.</p><p>Operations Research, Articles in Advance, pp. 1-18, &#169; 2024 INFORMS Downloaded from informs.org by [128.84.127.144] on 26 February 2024, at 08:14 . For personal use only, all rights reserved. (i) For all p 0,</p><p>(ii) For all p 0,</p><p>Proof. We first show (i). Because (x, c 0 [w x ]) is a w x -interval, Condition 4.5 implies</p><p>We compute</p><p>[by Lemma 6.3] &#63743; Z O(x max{1,&#8675;+&#10003;} ) 0 O(t p &#8629; ) dt [by Definition 4.1 and (EC.1.1)] = 8 &gt; &lt; &gt; :</p><p>thus proving (i).</p><p>We now show (ii), following a similar argument but with a more involved computation. Note that Definitions 6.1 and 6.5 together imply b k [w x ] x for all k 1. (EC.1.2)</p><p>We compute</p><p>O(t &#8629; ) dt</p><p>[by Definition 4.1 and Condition 4.5] &#63743;</p><p>thus proving (ii).</p><p>LEMMA 7.9. Suppose Condition 4.5 holds, and let &#63743; = 2 max{&#8629; 1, &#10003;}. For all x 0 and a 2 (y x , x),</p><p>Proof. Because &#63743; &gt; &#10003; 0, by Condition 4.5 and (EC.1.2), for all u 0 and k 1,</p><p>We now plug in u = c 0 [w x (a) ] and make the following observations.</p><p>&#8226; By Definition 6.1, we know u = c 0 [w x (a) ] is the earliest age at which a job has rank at least w x (a), so w u = w x (a).</p><p>&#8226; By Definition 6.5, a job's rank is at most w x (a) between ages a and x, so there exists k 1 such that</p><p>In Recall that y x denotes the (first) age of the maximum rank in the interval [0, x]. Since T (x) is stochastically increasing in x (Lemma 6.6), it holds that P[T (x) &gt; t] P[T (y x ) &gt; t] for all t 0. Additionally we have that P[T (y x ) &gt; t] P[T FB (y x ) &gt; t] for all t, x 0, where T FB (x) is the response time for a job of size x under FB. The reason for this last inequality is that a job of size y x must wait for all other jobs to receive up to y x units of service before completing.<ref type="foot">foot_4</ref> As a result, <ref type="bibr">(Mandjes and Nuyens, 2005</ref>, Proposition 8) implies</p><p>Additionally, up to its last line the proof of <ref type="bibr">(Mandjes and Nuyens, 2005, Lemma 9</ref>) is valid for arbitrary service policies. If x 0 &gt; 0 is such that P[X x 0 ] &gt; 0, we thus find  ), so our task is to show that the limit still holds with y x instead of x.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Consider arbitrary</head><p>for all x &gt; x 0 . Because a &#8676; = x max , there exists x 1 &gt; x 0 such that y x 1 = x 1 , and thus |d(B yx 1 ) d(B)| &lt; ".</p><p>But d(B yx ) is decreasing in x, because y x is increasing in x, and B x is stochastically increasing in x.</p><p>We conclude that for all x &gt; x 1 , we have |d(B yx ) d(B)| &lt; ". Our choice of " &gt; 0 was arbitrary, so ), as desired.</p><p>LEMMA 9.5. Let &#8673; be a SOAP policy with 0 &lt; a &#8676; &lt; x max . We have</p><p>Proof. Clearly, T &#8673; is a mixture of T (1) &#8673; and T (2) &#8673; . Lemma 6.6 implies T (2)</p><p>. The same reasoning applies to Step and Spike. The lemma thus follows if we can show</p><p>step .</p><p>(EC.1.5)</p><p>The comparison in (EC.1.5) follows from a key fact from the SOAP analysis <ref type="bibr">(Scully and Harchol-Balter, 2018</ref>) called the Pessimism Principle, which states that the response time of a particular job J is unaffected if, instead of following the usual rank function, job J follows its worst future rank function (Definition 6.5). The intuition is that any jobs that will get served ahead of job J in the future may as well be served ahead of it right now.  Worst future rank functions (Definition 6.5, abbreviated w.f.r., dotted magenta curves) of the policies shown in Figure <ref type="figure">9</ref>.1, with the original rank functions (translucent cyan curves) for reference. We show the worst future rank functions for a class 2 job of size x &gt; a &#8676; .</p><p>We illustrate in Figure EC.1 the worst future rank under Step, Spike, and &#8673;. Notice that, for any size</p><p>The Pessimism Principle says that we can compute a particular job J's response time by imagining that it always has its worst future rank. Increasing a job's rank can only increase its response time, so the above worst future rank comparisons imply that for all x &gt; a &#8676; , T spike (x) &#63743; st T &#8673; (x) &#63743; st T step (x).</p><p>The desired (EC.1.5) follows because class 2 jobs are those of size greater than a &#8676; . LEMMA 9.6. The response time distributions of class 2 jobs under Step and Spike are</p><p>where X (2) = (X | X &gt; a &#8676; ) is the size distribution of class 2 jobs, and the random variables in each sum are mutually independent.</p><p>Proof. This result follows easily from the SOAP analysis <ref type="bibr">(Scully and Harchol-Balter, 2018)</ref>. For completeness, we sketch the main ideas of how the SOAP analysis applies to Step and Spike. Consider a class 2 job J.</p><p>&#8226; Under Step, job J always has worst future rank r &#8676; (Figure EC.1(a)). Job J is thus delayed by any jobs present when it arrives, plus the pre-age-a &#8676; portion of any jobs that arrive while it is in the system.</p><p>&#8226; Under Spike, job J has worst future rank r &#8676; only until age a &#8676; (Figure EC.1(a)). Job J is thus delayed by any jobs present when it arrives, plus the pre-age-a &#8676; portion of any jobs that arrive before it reaches age a &#8676; . Once job J reaches age a &#8676; , its worst future rank is 0, so no further arrivals delay it.</p><p>The reason in both cases for looking at the pre-age-a &#8676; portion of new arrivals is because at age a &#8676; , those new arrivals reach rank r &#8676; , and thus job J has priority over them due to FCFS tiebreaking (Definition 3.1).</p><p>The delay due to jobs present when job J arrives corresponds to the W in each formula, and the delay due to new arrivals corresponds to the B a &#8676; (&#8226;) uses. The difference between the formulas is due to the fact that under Step, new arrivals delay job J until it completes, whereas under Spike, new arrivals delay job J only if they arrive before it reaches age a &#8676; , with the last X (2) a &#8676; portion of job J's service occurring uninterrupted.</p><p>LEMMA 9.7. Consider an M/G/1 with nicely light-tailed job size distribution X, and define</p><p>Then, as illustrated in Figure <ref type="figure">9</ref>.2, the following hold:</p><p>(i) 1 is convex on (s 0 , 1), decreasing on (s 0 , s 2 ), and increasing on (s 2 , 1);</p><p>Analogous statements hold for a for all a 2 (0, x max ].</p><p>Proof. We prove the statement just for , as the argument for a is analogous. The illustration in Figure <ref type="figure">9</ref>.2 may provide helpful intuition for the arguments that follow.</p><p>We begin by observing some general properties of 1 . Because L[X] is convex on (s 0 , 1), so is 1 .</p><p>This, along with the definition of s 2 , implies (a). The slope of 1 at zero is</p><p>and by Definition 5.2, we have 1 (s 0 ) = 1 &gt; 0. Additionally, Definition 5.2 implies s 0 &lt; 0. This means 1 is negative on a finite nonempty interval, namely (s 1 , 0), and nonnegative outside that interval.</p><p>We can now show the inequalities in (b).</p><p>&#8226; s 3 &lt; 0: Because 1 is negative on some interval, its global minimum is negative.</p><p>&#8226; s 1 &lt; s 2 : Because s 3 = 1 (s 2 ) &lt; 0, we must have s 2 2 (s 1 , 0).</p><p>&#8226; s 2 &lt; s 3 : Because 1 is convex with 1 (0) = 0 and ( 1 ) 0 (0) 2 (0, 1), we have s 2 &lt; 1 (s 2 ).</p><p>In some of the proofs below, we use the fact that for sums of independent random variables U, V 0, (9.1) implies</p><p>There are two important implications of (EC.1.7). The first implication is that the global minimum of 1 a is less than that of 1 b . But these global minima are ( a ) and ( b ), respectively (see Figure <ref type="figure">9</ref>.2), so</p><p>This means a (s) is finite whenever b (s) is. This contributes to the second implication of (EC.1.7): by Lemma 9.7(i),</p><p>Therefore, (EC.1.8) and (EC.1.9) together imply that there exists a value of s such that f ( b (s)) diverges while f ( a (s)) does not.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>EC.2. Properties of the Gittins Policy via the "Gittins Game"</head><p>The goal of this section is to prove two key remaining properties of the Gittins policy, Theorem 5.10 and Lemma 8.2. To prove both of these properties, we will use a different perspective on the Gittins policy called the "Gittins game" <ref type="bibr">(Scully et al., 2020a)</ref>. The Gittins game gives an alternative way to define the Gittins rank function. While it is less direct than the definitions we have used so far (Definitions 3.2 and 8.1), the intermediate steps it introduces turn out to be crucial for proving Theorem 5.10 and Lemma 8.2.</p><p>Aside from Theorem 5.10 and Lemma 8.2, most of the definitions and results in this section are due to <ref type="bibr">Scully et al. (2020a)</ref>, who actually study a much more general job model than ours. For simplicity, we restate the key definitions and results in our setting. However, the statements and proofs of Theorem 5.10 and Lemma 8.2 are straightforward to translate to the more general job model of <ref type="bibr">Scully et al. (2020a)</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>EC.2.1. The Gittins Game</head><p>The Gittins game is an optimization problem. Its inputs are a job at some age b and a penalty w. During the game, we serve the job for as long as we like. If the job completes, the game ends. At any moment before the job completes, we may choose to give up, in which case we pay the penalty w and the game immediately ends. The goal of the game is to minimize the expected sum of the time spent serving the job plus the penalty paid.</p><p>We can think of the Gittins game with penalty w as an optimal stopping problem whose state is the age b of the job. Standard optimal stopping theory <ref type="bibr">(Peskir and Shiryaev, 2006;</ref><ref type="bibr">Shiryaev, 2008)</ref> implies that the optimal strategy thus has the following form: serve the job until it reaches some age c b, then give up. A possible policy here is never giving up, which is represented by c = 1. The (r Gtn , w)-relevant work of a job is related to the Gittins game via Lemma EC.2.3: it is the amount of time we would serve the job when optimally playing the Gittins game with penalty w. It turns out that mean (r Gtn , w)-relevant work directly translates into mean response time.</p><p>LEMMA EC.2.5 <ref type="bibr">(Scully et al. (2020a, Theorem 6.3)</ref>). Under any nonclairvoyant scheduling policy &#8673;, the mean response time can be written in terms of (r Gtn , w)-relevant work as</p><p>With Lemma EC.2.5 in hand, the proof of Theorem 5.10, restated below, reduces to bounding the mean amount of (r Gtn , w)-relevant work under q-approximate Gittins policies.</p><p>THEOREM 5.10. Consider an M/G/1 with any job size distribution. For any q 1 and any q-approximate Gittins policy &#8673;,<ref type="foot">foot_10</ref> </p><p>Proof. Recall from Definition 5.9 that we may assume r &#8673; (a)/r Gtn (a) 2 [1, q] for all ages a without loss of generality. We will prove [by Lemma EC.2.5]</p><p>To show the left-hand inequality of (EC.2.2), it suffices to show that an arbitrary job's (r Gtn , w)-relevant work is upper bounded by its (r &#8673; , qw)-relevant work (Definition EC.2.4). This is indeed the case: r Gtn (a) &#63743; w implies r &#8673; (a) &#63743; qr Gtn (a) &#63743; qw, so the job will reach rank w under Gittins after at most as much service as it needs to reach rank qw under &#8673;.</p><p>To show the right-hand inequality of (EC. We note that one can use the techniques of <ref type="bibr">Scully et al. (2018b)</ref> to generalize the statement and proof of Theorem 5.10 beyond SOAP policies. It turns out that Theorem 5.10 still holds even if we allow q-approximate Gittins policies to adversarially assign ranks to jobs, provided that the assigned ranks are still within a factor-q window around the rank Gittins would assign.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>EC.3. Relationship Between Decay Rate and Laplace-Stieltjes Transform</head><p>The goal of this appendix is to justify our computation of decay rates (Definition 5.1) by means of Laplace-Stieltjes transform convergence (Section 9.3). Our specific goal is to justify our use of (9.1), which states</p><p>). As a reminder, In order to prove Propositions EC.3.4 and EC.3.5 for Class II job size distributions, one would need to assume a regularity condition. We believe it would suffice to assume that L[X] 0 is regularly varying at (L[X]). The main change to the proofs would be additional casework. For example, it may be that L[W X ] still has a first-order pole, or it may be that it diverges without a pole because L[X] does. See <ref type="bibr">Abate and Whitt (1997)</ref> and references therein for additional discussion.</p><p>More generally, it likely suffices to assume that some higher-order derivative L[X] (n) is regularly varying at (L[X]), as the result of Mimica (2016, Corollary 1.3) we use applies to higher derivatives as well. Other results of <ref type="bibr">Mimica (2016)</ref> may allow one to relax the assumption even further.</p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0"><p>Downloaded from informs.org by [128.84.127.144] on 26 February 2024, at 08:14 . For personal use only, all rights reserved.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_1"><p>Operations Research, Articles in Advance, pp. 1-18, &#169; 2024 INFORMS Downloaded from informs.org by [128.84.127.144] on 26 February 2024, at 08:14 . For personal use only, all rights reserved.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_2"><p>Operations Research, Articles inAdvance, pp. 1-18, &#169; 2024 INFORMS   </p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_3"><p>Although we are referring to the job size distribution, we should clarify that here we are still discussing scheduling with known job sizes, specifically SRPT. But, starting shortly, we will shift attention to the case where job sizes are unknown, but we still know the job size distribution from which job sizes are drawn.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_4"><p>One can give a more formal proof of the inequality using the SOAP analysis(Scully and Harchol-Balter,  </p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_5"><p>2018).</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_6"><p>Two clarifications about the computation below. First, the notation U &lt;st V means that P[U &gt; t] &#63743; P[V &gt; t] for all t 2 R, and the set of t 2 R such that P[U &gt; t] &lt; P[V &gt; t] has positive Lebesgue measure. Second, because a &lt; 1, the left-hand sides of the last two steps are always finite for all s &lt; 0.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_7"><p>Recall that done(b, c) 2 [0, 1] and that if done(b, c) = 0, then '(b, c) = 1 (Definition 8.1).</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_8"><p>Strictly speaking, it is optimal to continue serving the job if and only if the rank is upper bounded in a "forward neighborhood" of a, meaning there exists " &gt; 0 such that for all 2 [0, "), we have r(a + ) &#63743; w. For non-pathological job size distributions, this holds in the r(a) &lt; w case<ref type="bibr">(Aalto et al., 2011)</ref>, so it only needs to be checked when r(a) = w.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_9"><p>The c &gt; a restriction is why we need the rank to be bounded not just at a but in a forward neighborhood of a.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_10"><p>This is a special case of a more general result(Scully, 2022, Chapter 16), which appeared while this work was in revision.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="7" xml:id="foot_11"><p>Scully and Harchol-Balter (2018)  actually focus on E[W&#8673;(r&#8673;, w+)] = lim w 0 #w E[W&#8673;(r&#8673;, w 0 )] as opposed to E[W&#8673;(r&#8673;, w)], but the same reasoning applies to E[W&#8673;(r&#8673;, w+)].</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="8" xml:id="foot_12"><p>While Definition EC.3.3 assumes a single arrival rate , Proposition EC.3.5 easily generalizes to the case where L[WX ] and Y are defined using different arrival rates.</p></note>
		</body>
		</text>
</TEI>
