<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>DIFFERENTIALLY PRIVATE MECHANISM DESIGN VIAQUANTILE ESTIMATION</title></titleStmt>
			<publicationStmt>
				<publisher>ICLR</publisher>
				<date>02/02/2025</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10570667</idno>
					<idno type="doi"></idno>
					
					<author>Yuanyuan Yang</author><author>Bhuvesh Kumar</author><author>Jamie Morgenstern</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[We investigate the problem of designing differentially private (DP), revenue-maximizing single item auction. Specifically, we consider broadly applicablesettings in mechanism design where agents’ valuation distributions are indepen-dent, non-identical, and can be either bounded or unbounded. Our goal is to designsuch auctions with pure, i.e., (ω, 0) privacy in polynomial time.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">INTRODUCTION</head><p>Though prior-dependent auctions, which adjust parameters based on samples of value distributions, often yield better revenue than prior-independent auctions, they risk leaking information about the bids they were trained upon. To address this issue, differential privacy (DP) offers a promising solution <ref type="bibr">(Dwork, 2006;</ref><ref type="bibr">2008;</ref><ref type="bibr">McSherry and Talwar, 2007;</ref><ref type="bibr">Pai and Roth, 2013)</ref>, ensuring that a single data point minimally affects the algorithm's output, thus preventing inference of a specific data point.</p><p>We study the problem of learning a single-item auction with near-optimal revenue from samples of independent and non-identical value distributions. In this context, the optimal auction (i.e., Myerson's auction <ref type="bibr">(Myerson, 1979)</ref>), which relies on value distributions (i.e., prior-dependent), achieves optimal revenue. However, releasing the learned Myerson's auction raises privacy concerns, as the output mechanism may inadvertently reveal sensitive information about the distributions. To provably mitigate this risk, our goal is to integrate pure DP into the learning process of such auction.</p><p>Pure Differential Privacy. Given two datasets that differ in one data point, i.e., D, D &#8594; , we say an algorithm A satisfies (&#969;, &#949;)-approximate DP if for any given output s: Pr[A(D) = s] &#8594; e &#969; [A(D &#8594; ) = s] + &#949;. We say A satisfies pure DP if &#949; = 0. Pure DP allows no slack in privacy protection, and hence is more challenging to achieve than approximate DP. Previous attempts <ref type="bibr">(McSherry and Talwar, 2007;</ref><ref type="bibr">Nissim et al., 2012)</ref> to integrate DP with prior-dependent auctions have been computationally inefficient or guaranteed approximate rather than pure DP. To our knowledge, no algorithm guarantees pure DP for Myerson's auction in polynomial time.</p><p>Efficiency. Incorporating DP into the mechanism often sacrifices efficiency, as achieving privacy guarantees typically incurs additional computational overhead (e.g., random noise addition or extra sampling procedure). This issue has been observed in similar contexts, such as online learning <ref type="bibr">(Jain et al., 2012)</ref>, federated learning <ref type="bibr">(Zhang et al., 2023)</ref> and deep learning <ref type="bibr">(Abadi et al., 2016)</ref>. In our context, to achieve pure DP, implementing exponential mechanism <ref type="bibr">(McSherry and Talwar, 2007</ref>) over all possible mechanisms would incur exponential time (See Appendix D). To obtain pure DP more efficiently, we apply recent advances <ref type="bibr">(Durfee, 2023;</ref><ref type="bibr">Kaplan et al., 2022)</ref> in private quantile estimation. Our algorithm's running time increases by only polylog factors compared to the non-private version.</p><p>Under review as a conference paper at ICLR 2025 Notations We use M A to denote the optimal mechanism of distribution A, and we use Rev(M, A) to denote the revenue of deploying mechanism M to distribution A. We restricted ourselves to single item auctions; hence, M A denotes the Myerson auction fitted on distribution A, and we denote OPT(A) := Rev(M A , A) as the optimal revenue one could get from a distribution A. We use 1 k to denote a k-dimensional vector with all entries equal to 1. We use O and ! to hide polylog factors.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.1">RESULTS</head><p>Formally, we define the problem of learning a near-optimal auction with a pure DP: Problem 1.1 (Optimal Auction with (&#969; p , 0)-DP). Given n samples of k-dimensional distribution D, the goal is to learn a single item auction M with (&#969; p , 0)-DP, whose expected revenue on D is close to the optimal revenue, i.e., with prob.</p><p>Insight. To address this problem, we leverage the insight that, the expected optimal revenue from value distribution is insensitive to small statistical shifts and discretization in the quantile and value space. Additionally, we observe that the accuracy of the points returned by private quantile estimation (QE), assuming the data points follow a distribution, directly correlates with the statistical distance between the distribution formed by the returned points and the true distribution. Thus, we can reduce private Myerson fitting from samples to private quantile estimation of pre-specified quantiles.</p><p>Achieving pure DP while maintaining meaningful revenue guarantees is challenging. A crucial aspect is to ensure that the values (hence distribution) returned by DP Quantile Estimation (QE) possess meaningful and provable accuracy guarantees. To obtain such accuracy, our algorithm (Alg. 1) first additively discretize the empirical distribution in the value space to distribution D &#969; , then estimate the pre-specificed quantiles with DPQE. We improved the accuracy bound of DPQE (DPQUANT, <ref type="bibr">Kaplan et al. (2022)</ref>) to accommodate cases with duplicate values. This improved bound allows us to upper bound the statistical distance between the output distribution and D &#969; , thus upper bounding the revenue loss incurred from fitting a Myerson on the output distribution.</p><p>Theorem 1.2 briefly presents the near-optimal revenue of our proposed mechanism. The final privacy parameter has a dependency on k since the output of mechanism M is of dimension 2k. We present complete details in Section 3 and the complete theorem statement in Theorems 3.2 and 3.3.</p><p>Theorem 1.2 (Revenue Guarantee of Private Myerson, Bounded). Given n = !(&#969; &#8593;2 ) samples V of the joint distribution D &#8595; [0, h] k , there exist a mechanism M that is 2k&#969; p differentially private with running time !(kn) and takes !(1) pass of the distribution. With probability 1 &#8593; &#949;, this mechanism</p><p>The prior algorithm does not work for unbounded distributions. Our second algorithm (Alg. 9) addresses the case for &#977;-strongly regular value distributions by efficiently truncating them to bounded distributions with small expected revenue loss. This approach enables the application of our previous mechanism (Alg. 1) designed for the bounded distribution case. Since the truncation point is a function of the optimal revenue, we develop Alg. 7 to approximate this point by achieving a !(k)approximation of the optimal revenue, where k denotes the dimension of the product distribution.</p><p>Theorem 1.3 outlines the accuracy of our proposed mechanism for certain parameter settings. Since this truncation point depends adaptively on the desired accuracy, the revenue gap exceeds that for the bounded case, and the tradeoff between privacy and revenue are more pronounced. We present more details in Section 4, and the complete theorem statement is in Theorems 4.1 and I.13. Theorem 1.3 (Revenue Guarantee of Private Myerson, Unbounded). Given n = !(&#969; &#8593;2 ) samples V of &#977;-strongly regular joint distribution D &#8595; R k , there exist a mechanism M for unbounded distribution that is 2k&#969; p differentially private with running time !(kn) and takes O(n) passes. With probability</p><p>Application: Online auction with nonmyopic bidders. We now describe how our mechanisms incentivize truthful bidding from nonmyopic bidders under practical online auction settings. <ref type="foot">2</ref> In the online setting, auctions are deployed iteratively and later auctions are informed by previous bids.</p><p>Since future auctions can be affected by earlier bids, nonmyopic bidders may strategically bid in earlier rounds to increase winning chances and/or secure lower prices, increasing their utility.</p><p>To prevent from strategic bidding, we integrate our previous solutions (Alg. 1, Alg. 9) with a commitment mechanism. Our DP Myerson naturally upper bound the utility gain (of future rounds) by definition, in that the change of one bid affect the outcome's probability by privacy parameter &#969; p . Our algorithm operates in two stages. In the first stage, it employs a commitment mechanism that penalizes strategic bids. In the second stage, the algorithm fits a DP Myerson auction from the collected bids and generates revenue in the remaining rounds. This approach ensures that strategic bids only lies in a small neighbor of the true value; otherwise, the bidder's utility becomes negative.</p><p>We present the regret (i.e., the time-averaged revenue of the proposed mechanism compared to the optimal one) of our proposed mechanism (Alg. 3) in Theorem 1.4, which shows the accuracy of our algorithm in terms of regret. We defer readers to Section 5 and Theorem 5.4 for further details. Theorem 1.4 (Revenue Guarantee of Online Mechanism). Given &#969; &#8595; [0, 1/4], under the online auction setting described in Section 5.1), there exists an algorithm (Alg. 3) run with parameter T = !(&#969; &#8593;2 ) that, with probability 1 &#8593; &#949;, achieves diminishing regret, i.e.,</p><p>where &#977; is a constant specific to bidders' utility model.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.2">PRIOR WORK</head><p>DP Mechanism Design. Emerging from <ref type="bibr">McSherry and Talwar (2007)</ref>, there has been interest in delivering mechanisms with DP guarantees <ref type="bibr">(Nissim et al., 2012;</ref><ref type="bibr">Huang et al., 2018a;</ref><ref type="bibr">Zhang and Zhong, 2022;</ref><ref type="bibr">Huh and Kandasamy, 2024)</ref>. These mechanism are either no longer optimal in our setting, or doen't generalize to unbounded distribution setting.</p><p>Online Learning in Repeated Auction. Regarding the single item online auction setting, <ref type="bibr">Kanoria and Nazerzadeh (2014)</ref>; <ref type="bibr">Huang et al. (2018a)</ref> established near-optimal solutions when bidders' utility is discounted and valuations are i.i.d.. <ref type="bibr">Deng et al. (2020)</ref>; <ref type="bibr">Abernethy et al. (2019)</ref> introduced specific incentive metrics to quantify bidders' willingness to bid other than their true values and developed mechanisms that minimize incentives for strategic bidding under these metrics in large markets.</p><p>For a detailed, complete list of related work topics, please see Appendix C.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.3">CONTRIBUTIONS</head><p>Revenue Maximizing Auctions with Pure Privacy Guarantee. Our work is the first to develop a mechanism with pure DP that obtains near optimal revenue for single item auction with independent and non-identical bidders, and for both bounded and unbounded &#977;-strongly regular distributions. For bounded distributions, our mechanism achieves optimal time complexity within polylog factors.</p><p>Application to Online Auction Setting. We apply our mechanism into the online auction setting with nonmyopic, independent and non-identical bidders. Combined with our designed commitment strategy, the integrated solution restricts the bids to a small neighbor around the corresponding value. Consequently, these approximately truthful bids enables our solution to generate revenue guarantee that converges to the optimal revenue over time, for time-discounted, or large market bidders. We generalize the i.i.d bidder setting in <ref type="bibr">Huang et al. (2018a)</ref> and solve the open problem they proposed.</p><p>Extended Analysis of Private Quantile Algorithm. We extend the analysis of the quantile estimation oracles employed in this paper. For quantile estimation on bounded datasets <ref type="bibr">(Kaplan et al., 2022)</ref>, the paper assumes that all data points are distinct and derive accuracy bounds dependent on the dataset's range. We generalize their analysis to accommodate cases where multiple data points may share identical values. Additionally, for quantile estimation of unbounded distributions <ref type="bibr">(Durfee, 2023)</ref>, we provide theoretical accuracy guarantees, complementing the paper's focus on empirical performance.</p><p>In this section, we outline the preliminaries on mechanism design, differential privacy, and quantile estimation. Additional information can be found in Appendix E.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">MECHANISM DESIGN BASICS</head><p>We now formally define the allocation rule and payment rule of a single item auction. </p><p>, where x j denotes the probability that the j-th bidder gets the item, and p j denotes her payment.</p><p>Under truthful sample access, the Myerson's auction maximizes the expected revenue. Definition 2.2 (Myerson's Single Item Auction <ref type="bibr">(Myerson, 1981)</ref>). For a discrete product distribution <ref type="bibr">(Elkind, 2007)</ref>, the virtual value for D j at value v j i with support</p><p>, where v j i s are ordered in increasing order of i,</p><p>and</p><p>For these distributions D with nondecreasing virtual value, Myerson's allocation rule A key property we leverage from differential privacy is its immunity to post-processing. Postprocessing refers to any computation or transformation applied to the output of a DP algorithm after the data has been privatized. In our context, Myerson's auction can be seen as a post-processing step. Therefore, applying Myerson's auction to a differentially private release of the empirical distribution preserves the original privacy guarantees of the input distribution. Lemma 2.4 (Immunity to Post-Processing). Let A : R n + &#8771; R be an (&#969;, &#949;)-DP algorithm, and let f : R &#8771; R be a random function. Then, f &#8656; A : R n + &#8771; R is also (&#969;, &#949;)-DP.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3">QUANTILE ESTIMATION</head><p>Quantile estimation (QE) is used for estimating a value of specified quantiles from samples. Given samples from a distribution, an accurate QE from samples directly translates to an accurate CDF estimation of the underlying distribution. Below, we formally introduce the definition of QE. Definition 2.5 (Quantile Estimation). Given a range of the data as H, a dataset X &#8658; H n containing n points from range H, and a set of m quantiles 0 &#8594; q 1 , . . . , q m &lt; 1, identify quantile estimations v 1 , . . . v m such that for every j &#8595;</p><p>We now present the definition of statistical dominance and KS-distance below. (2) For some x, F D(x) &lt; F D &#8594; (x).</p><p>The KS distance between D and</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">PRIVATE MYERSON'S AUCTION FOR BOUNDED DISTRIBUTIONS</head><p>In this section, we introduce the algorithm for fitting a Myerson's auction with a pure privacy guarantee. To ensure pure privacy, since DP is immune to postprocessing, it is sufficient to input a private distribution estimated from samples to the Myerson. The challenge lies in finding such distributions that still yield near-optimal revenue.</p><p>Our approach leverages private quantile estimation (QE) over samples to achieve the desired guarantee. However, the standard guarantees of DPQE collapse when the dataset contains points that are extremely close. This is a critical issue in our setting, as increasing the sample size n from continuous value distributions inherently causes the minimum distance between samples to approach zero. To address this, we introduce additional discretization steps to prevent non-identical points from being too close together, and we develop new DPQE guarantees specifically tailored to handle samples with identical values.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">PRIVATE MYERSON FOR BOUNDED DISTRIBUTIONS</head><p>Next, we present DPMYER algorithm (Alg. 1). The algorithm first value-discretize the samples of the distribution additively by &#969; a , then quantile-discretize these samples by &#969; q with pure privacy guarantee. Specifically, the quantile discretization estimates the values of the quantile set [&#969; q , 2&#969; q , . . . , 1] with pure privacy. Next, DPMYER use the estimated quantile values and the quantile set to construct a distribution, then perturb it to a final distribution that is stochastically dominated by the ground truth. Finally, the final distribution is then used to implement Myerson's mechanism.</p><p>Algorithm 1 DP Myerson, Bounded Distribution DPMYER(V, &#969; q , &#969; a , h, &#969; p )</p><p>Input: n samples V &#8595; R k&#8599;n + , discretization parameter &#969; q , &#969; a , upper bound h, privacy parameter &#969; p 1: Discretize all values into multiples of &#969; a ; let the resulting samples be V . 2: Prepare the quantile to be estimated: Q &#8660; {&#969; q , 2&#969; q , . . . , . . . , &#8598;(1/&#969; q )&#8601; &#8226; &#969; q , 1}. 3: For each dimension i &#8595; [k], decide the prices S [i,:] &#8660; QESTIMATE(Q, V [i,:] , &#969; p ).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>4:</head><p>&#1009; Estimate the quantiles by DPQUANT (Alg. 4) 5: Construct distribution D based on S, treating the valuations in S as if each has probability &#969; q . 6: For each i &#8595; [k], shift the top &#969; q quantile of D i to the bottom, fit Myerson on this distribution.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">REVENUE OPTIMALITY AND RUNNING TIME</head><p>Next, we show the revenue optimality and the efficiency of our algorithm. To upper bound the reveue loss, we derive the revenue shift theorem, which upper bounds the revenue difference between two distributions by a linear function of their statistical distance. Theorem 3.1 (Revenue Shift Theorem). Given two product distribution D &#8659; D &#8594; whose valuations are bounded by h, with d ks (D i , D &#8594; i ) &#8594; &#962; i for any bidder i, the optimal revenue of these distribution satisfies:</p><p>We apply this theorem to upper bound the revenue loss between 1) the quantile-discretized distribution and its pre-quantized counterpart, and 2) the distribution obtained from private quantile estimation and that from the groundtruth quantile estimation. The first one is evident, while the second arises from DPQUANTILE's ability to control the KS-distance between the estimation and the ground truth.</p><p>We now present the accuracy guarantee of the private Myerson algorithm. Provided the privacy parameter is not too small (i.e, &#969; p = "(&#969; &#8593;1 )), our guarantee implies that the optimal revenue of the distribution does not exceed the revenue of our algorithm on its samples by more than !(&#969;kh).</p><p>Theorem 3.2 (Revenue Guarantee of Private Myerson (Alg. 1)). Given n samples V &#8595; [0, h] k&#8599;n of the joint distribution D, DPMYER (Alg. 1) is (2k&#969; p , 0)-DP, and the expected revenue of this mechanism is close to the optimal revenue of distribution D, i.e., with probability 1 &#8593; &#949;:</p><p>under parameter &#969; a = &#969; q = &#969; and n = !(&#969; &#8593;2 ), where we hide the polylog factors in ! and O .</p><p>Proof Sketch. We begin by deriving the privacy guarantee of our algorithm. Next, we establish an upper bound on the distance between the private distribution D p and the additively discretized distribution D &#969; . This enables us to apply the revenue shift theorem (Thm.3.1) to upper bound the revenue loss from private quantile estimation. By aggregating this loss with the revenue loss due to value discretization, we arrive at the final result. In this proof sketch, we omit the polylog factors that depends on k, n, &#949;, &#969; a , &#969; p , &#969; q for a clear presentation. Further details are provided in Appendix H.2.</p><p>Privacy Guarantee. We know that the quantile estimates from DPQE is (&#969; p , 0) private (Lem. H.2). Since DP is immune to post-processing (Lem. E.4), and that the output of allocation and payment combination is 2k dimensional, by composition theorem (Lem. E.5), our algorithm is (2k&#969; p , 0)-DP.</p><p>Upper Bounding the Statistical Distance The distribution D p is obtained by changing from distribution D through distribution D, the distribution D &#969; and D q (Figure <ref type="figure">1</ref>). We upper bound the statistical KS distance of these distributions: 1) By DKW inequality, we upper bound the KS-distance between D and D by !(1/ &#8596; n) for each coordinate i (with probability 1 &#8593; &#949;/2). 2) By definition, we upper bound the KS-distance between D &#969; and D q by k&#969; q . 3) By developing and converting the bound of the DP quantile algorithm (Lem. H.3) into a bound on the CDF, we upper bound the KS-distance between D q and D p by k &#969; for &#969; := !(1/(&#969; p n)) (with probability 1 &#8593; &#949;/2).</p><p>Upper Bounding the Revenue Loss. We then upper bound optimal revenue loss from D to D p . This upper bound can be obtained by combining the revenue loss from the aforementioned distributions (by revenue shift theorem), with an additive &#969; a revenue loss from discretization (by Lem. F.1). The revenue loss from statistical shift aggregates to !((1/ &#8596; n + &#969; q + &#969;)kh) with probability 1 &#8593; &#949;.</p><p>Putting it all together. Finally, condition on the DPQUANT proceeds successfully and the samples are close to the underlying distribution (with probability 1 &#8593; &#949;), we get that the expected revenue of DPQUANT on the underlying distribution is at least the optimal revenue from this distribution minus the revenue difference between D and D p by the following inequality:</p><p>where the first inequality follows from the optimality of M D on D and the second inequality follows from adding OPT( D p ). By our construction of D p , this distribution is stochastically dominated by D, thus from the strong revenue monotonicity (Lem. F.3), we get that E[Rev(M D p , D)&#8593;OPT( D p )] &#8600; 0. Thus, we concluded that the revenue gap is upper bounded by !((1/ &#8596; n + &#969; q + &#969;)kh + &#969; a ). We set &#949; in the statement as 1/k of the &#949; we used in this proof to generate the final revenue guarantee. We establish connections between the accuracy/revenue guarantee of the original distribution D with the empirical distribution D, the valuediscretized D &#969; , the quantile-discretized D q and the distribution D p returned by DPQUANT(Alg. 4).</p><p>Next, we demonstrate the efficiency of our algorithm, which is achieved through a organized implementation of the DP Quantile algorithm. Intuitively, given m ordered quantiles, the algorithm iteratively identifies and estimates the median (the m/2-th), followed by the m/4 and the 3m/4 quantiles, and so on. This hierarchical structure ensures that each data point is used in at most log m quantile estimates (of a single quantile). For more details, we refer readers to Appendix H.1. Theorem 3.3 (Time Complexity for Private Myerson, Bounded). Given the same parameters as stated in Theorem 3.2, DPMYER (Alg.1) runs in !(kn) time and requires !(1) passes of the samples.</p><p>Proof Sketch. The time dominant step is quantile estimation, which requires log(&#8598;1/&#969; q &#8601; + 1) passes of the dataset. It takes O(k log(&#8598;1/&#969; q &#8601; + 1)/(&#969; a &#969; q )) = !(kn) time, since n = !(&#969; &#8593;2 ). This step calculates the utility of k&#8598;h/&#969; a &#8601; over &#8598;1/&#969; q &#8601; quantiles for at most !(1) time. For full version of this proof, please refer to Appendix H.3</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">GENERALIZATION TO UNBOUNDED DISTRIBUTIONS</head><p>Generalizing the DP Myerson mechanism to unbounded distributions introduces new challenges. The revenue loss upper bound produced by previously introduced quantile estimation algorithm and revenue shift theorem both depends (positively) on the range of the distribution. Without a finite range, these upper bound becomes infinite and fail to effectively control the revenue loss.</p><p>We consider the widely accepted &#977;-strongly regular distributions, which decays at least as fast as exponential distributions. A key element of our approach is appropriately truncating the distribution, which enables us to extend the discretize-then-DP-quantile method to the unbounded setting. Specifically, we apply the property of the regular distribution that <ref type="bibr">(Devanur et al., 2016)</ref>, truncating the distribution by 1 &#969; OPT(D) costs at most 2&#969; fraction of the optimal revenue (Lem. I.1). Hence, for the truncation to work, it is essential to approximate the optimal revenue based on sample data. Meanwhile, incorporating the truncation with pure DP introduces additional complexities.</p><p>We are now ready to present our approach for a k-approximation of the optimal revenue with pure DP for &#977;-strongly regular product distributions. Our DPKOPT (Alg. 2) algorithm approximates the optimal revenue by running a empirical reserve(ER) over each bidder's distribution truncated at the top &#977; 1/(1&#8593;&#949;) /4 quantile. <ref type="foot">5</ref> Summing up these estimates gives us a !(k)-approximation of the optimal revenue, by the fact that kOPT(D)</p><p>Algorithm 2 DP Estimation for Optimal Revenue DPKOPT(V, &#969; q , &#969; a , &#969; p , &#977;) Input: n samples V = {v 1 , . . . , v n }, quantile discretization &#969; q , additive discetization &#969; a , privacy parameter &#969; p , regularity parameter &#977;.</p><p>&#1009; Estimate the truncation point of D d . 5:</p><p>Prepare the quantile to be estimated,</p><p>&#1009; Apply DP quantile estimate (Alg. 4).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>7:</head><p>Let F d be the distribution generated by value profile S [d,:] and quantile set Q.</p><p>8:</p><p>&#1009; Estimate the optimal revenue from F d (Alg. 6). 9: end for 10: KREV &#8660; d&#8596;[k] SREV 11: return KREV To guarantee pure privacy, our algorithm estimates the optimal revenue using a DP-estimated proxy F [k] derived from the sample data. This proxy is obtained from truncating the distribution by DPQUANTU (Alg. 7) and quantile-discretizing the distribution by DPQUANT. During this process, the truncation by DPQUANTU cost at most a constant fraction of the optimal revenue, and DPQUANT cost at most an additional !( 1 &#969;pn k + &#969; a ). Aggregating these revenue loss concludes that the output is a !(k)-approximation of the optimal revenue. See Appendix I.4 for more details.</p><p>Our private Myerson algorithm for the unbounded distribution (Alg. 9) integrate DPKOPT and yields the following accuracy bound. See Appendix I.5 for formal statements and more details. Theorem 4.1 (Revenue Guarantee of Private Myerson, Unbounded). Given &#969; &#8595; [0, 1/4], n samples V of the joint distribution D &#8595; [0, h] k , the output of Myerson fitted under DPMYERU (Alg. 9) is (2k&#969; p , 0)-DP, and under</p><p>Under review as a conference paper at ICLR 2025</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">APPLICATION: ONLINE MECHANISM DESIGN FROM BIDS</head><p>We now study how to integrate our previous solutions into the online auction setting, such that, the algorithm produces time-averaged revenue guarantee that converges to the optimal. The auction now spans multiple rounds, where each auction is informed by the bids from previous rounds. We consider the setting where bidders are non-myopic bidders, and have incentives to bid strategically in the current round to increase their utilities over future auctions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1">APPLICATION BACKGROUND</head><p>Before presenting our algorithm, we first provide the formal problem definition of the online auction setting. We study online mechanism design over a time horizon of T , where an identical item is sold at each iteration. Each bidder has a publicly observable attribute. Bidders with the same attribute have the same valuation distribution.</p><p>We are now ready to describe interactions between bidders and the auctioneer over time horizon T , as shown in Figure <ref type="figure">2</ref>. We defer to Appendix J.2 for more details how bidder generates the samples.</p><p>For each time t &#8595; [T ]:</p><p>&#8226; The learner/auctioneer sells a fresh copy of the item.</p><p>&#8226; The learner collects the bids in the form of (b j , a j ), where b j and a j denote the bid and the attribute of the j &#8595; [d t ]-th bidder, respectively.</p><p>&#8226; The learner decides the allocation rule x t and payment p t accordingly. Each item the auctioneer sells is identical, and each bidder has an additive (discounted) utility of the items across rounds. We consider the bidders either have discounted utility or are in a large market. Definition 5.1 (Bidder's Utility). Each bidder j has a quasi-linear utility function at time t: u t j = x t j (v t j &#8593; p t j ), where x t j , v t j , p t j are the allocation, value, price for bidder j at time t, respectively. We consider two nonmypoic bidders' utility models: Discounted Utility: For discount factor &#966; &#8595; [0, 1], the bidders seek to maximize the sum of utilities discounted by &#966;. At the t-th iteration, the discounted utility is u t j = T r=t u r j &#966; r&#8593;t . Large Market: <ref type="bibr">(Anari et al., 2014;</ref><ref type="bibr">Jalaly Khalilabadi and Tardos, 2018;</ref><ref type="bibr">Chen et al., 2016)</ref>: The bidder only participates in a subset S j of auctions, i.e., for each u 1:T = t&#8596;Sj u t , with subset |S j | &lt; l.</p><p>Ideally, the learner's objective is maximize time-averaged revenue with high probability. Our regret compare this revenue against the optimal revenue of the (unobservable) value history. Definition 5.2 (Learner's Objective). Given &#949;, the learner's objective is to decide an allocation x 1:T and a payment p 1:T that achieves sublinear regret, i.e., with probability 1 &#8593; &#949;,</p><p>with the expectation taken over the value distribution.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2">TWO-STAGE MECHANISM FOR BOUNDED DISTRIBUTION</head><p>This two-stage algorithm (Alg. 3) consists of repeated auctions over T rounds, and the participating bidders' values in each round are upper bounded by a known constant h. The algorithm first collects the samples for the first T 1 rounds, by running a commitment algorithm (Alg. 10) that punishes nontruthful bids. Then, the algorithm deploys our previously developed DP Myerson's Algorithm (Alg. 1, Alg. 9) for the remaining rounds to obtain near optimal revenue. In addition to these two steps, our algorithm includes a step where all samples are reduced by &#8636; (line 4 of Alg. 3) and projected onto nonnegative value spaces. This step is designed to offset the impact of strategic bidding.</p><p>Under review as a conference paper at ICLR 2025</p><p>Algorithm 3 Two-Stage Algorithm A BOUNDED Input: Rounds T , learning rounds T 1 , parameter &#969; a , &#969; q , &#969; p , &#8636;, upper bound h. 1: for t &#8660; 1, . . . T 1 do &#1009; Collection Stage 2: Receive bids b t , and attributes a t . 3: Return (x t , p t ) &#8660; COMMIT(b t ). &#1009; Commitment Algorithm(Alg. 10) 4:</p><p>b t &#8660; P [0,h] [b t &#8593; &#8636;1 k ] 5: end for 6: ( x(&#8226;), p(&#8226;)) &#8660; DPMYER( b 1:T1 , a 1:T1 , &#969; q , &#969; a , h, &#969; p ) &#1009; Fit Myerson's auction (Alg. 1, or Alg. 9) 7: for t &#8660; T 1 + 1, . . . T do &#1009; Revenue Stage 8:</p><p>Receive bids b t , and attributes a t . 9:</p><p>(x t , p t ) &#8660; MYERSON( x(&#8226;), p(&#8226;)); 10: end for Specifically, the parameter &#8636; is carefully calibrated to ensure that the bid distribution fed into the private Myerson mechanism is stochastically dominated by the empirical distribution. Our algorithm provides an incentive guarantee that bids lie within a small, controllable neighborhood of the true values. The range of this neighborhood is determined by the privacy parameter &#969; p (hence is controlled by our algorithm), and the bidders' utility functions. By setting &#8636; to match the range of this neighborhood, the resulting distribution is dominated by the empirical distribution.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.3">REVENUE GUARANTEE OF THE ALGORITHM</head><p>Before presenting the revenue guarantee of our main algorithm, we first introduce a lemma that upper bounds how a bidder's bid deviates from its true value during the collection stage. Intuitively, by the design of our commitment algorithm the bidder will incurs a loss that scales (positively) with the bid deviation, compared to truthful bidding. Furthermore, our private Myerson ensures that the bidder's future utility gain is upper bounded (Lem. J.5). Thus, bidders are incentivized to report bids within a certain range of their true values to optimize their overall utility. More details in Appendix J.4. Lemma 5.3 (Bid Deviation). For any t &#8595; [0, T 1 ], the bidder will bid only b t such that |b t &#8593; v t | &#8594; 2&#962;, where &#962; = 2(l &#8593; 1)&#969; p hk for bidders in a large market; and &#962; = 2&#977;&#969;p 1&#8593;&#977; kh for discounting bidder.</p><p>From this lemma, we get that selecting a small &#969; p would incentivize bid distributions that are close to the ground-truth. Let &#8636; = 2&#962; in our algorithm (line 4, Alg. 3) would yield a distribution that is stochatically dominated by, yet close in revenue guarantee to, the true distribution. Run our DP Myerson algorithm on this distribution would give us sublinear regret, as stated below. Theorem 5.4 (Accuracy Guarantee of Two-stage Mechanism). Given &#969; &#8595; [0, 1/4], n samples of the joint distribution D &#8595; [0, h] k , and T 1 = !(&#969; &#8593;2 log(k/&#949;)), T = "(T 1 ), &#969; a = &#969; q = &#969; p = &#969;, with probability 1 &#8593; &#949;, Alg. need not be optimal over D, we have that:</p><p>where in the last inequality we apply revenue shift theorem (Thm. 3.1) to upper bound the first term and apply Lemma J.9 to upper bound the second term. Please refer to Appendix J.3 for more details.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">EXPERIMENTS</head><p>In this section, we present the experimental results for the Differentially Private (DP) Myerson mechanism, comparing its performance against two standard mechanism design baselines: the Myerson (optimal) auction and the Vickrey (second-price) auction. The former is designed to achieve near-optimal revenue for a given value distribution, whereas the latter, while strategy-proof, offers no revenue guarantees in settings with independent and non identical value distributions.</p><p>Our experiments are conducted on normal and lognormal distributions truncated to positive domains. The lognormal distribution is widely considered a representative or "groundtruth" model in many auction settings, thanks to its capacity to capture a broad spectrum of value distributions commonly observed in economic and market contexts <ref type="bibr">(Gorbenko and Malenko, 2014)</ref>. A random variable V is said to be lognormal distributed with parameter (&#181;, &#8637;), if ln(V ) follows normal distribution N (&#181;, &#8637;).</p><p>For each value profile, we test various hyperparameters-additive discretization (&#969; a ), quantile discretization (&#969; q ), and the privacy parameter (&#969; p )-and select the configuration with the best performance. For details on DP Myerson's sensitivity to hyperparameters, see Appendix A.</p><p>Bidder Profile DP Myerson Second Price Myerson Ref. Normal N (0.3, 0.5) 0.25272 0.15154 (66.7 %) 0.32598 Table 2 Lognormal (&#181;, &#8637;) = (&#8593;1.87, 1.15) Normal N (0.3, 0.5) 0.37691 0.33741 (11.7 %) 0.50204 Table 3 Normal N (0.5, 0.7) Lognormal (&#181;, &#8637;) = (&#8593;1.87, 1.15) 0.13912 0.11578 (20.2 %) 0.21292 Table 4 Lognormal (&#181;, &#8637;) = (&#8593;1.24, 1.04) Table <ref type="table">1</ref>: Empirical Revenue of DPMyerson (Alg. 1) under 2-dimensional non-identical value distributions. Each DPMyerson configuration is averaged over 50 draws, with revenue evaluated on 10, 000 samples. Percentages in parentheses represent the improvement over the second-price mechanism.</p><p>In Table <ref type="table">1</ref>, under non i.i.d distribution settings where there is a significant revenue gap between the Vickrey auction and the Myerson auction, DPMyerson achieves a notable revenue increase (at least 11% ) over the second-price mechanism.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7">CONCLUSION</head><p>We investigate the problem of learning a single-item auction (i.e., Myerson) from samples with pure DP. We consider the broader setting where the agents' valuations are independent, non-identical, and can either be bounded or unbounded. By recognizing that the optimal auction mechanism exhibits robustness to small statistical perturbations in the underlying distribution, we reduce the challenge of privately learning an optimal auction from sample data to the task of privately approximating pre-specified quantiles. Specifically, our approach ensures pure privacy while generating a distribution that is closely aligned with the underlying distribution in terms of expected revenue.</p><p>We then extend this framework to the online auction setting, where later auctions are fitted on bids from previous auctions. In this setting, non-myopic bidders reasons about their utility accross rounds, and can bid strategically under (one-shot) truthful auctions. By leveraging our private Myerson mechanisms with an extra commitment mechanism, we achieve near-optimal revenue outcomes over the bidders' (unobservable) value samples, despite the strategic complexity introduced by non-myopic behavior (i.e., time discounting bidder and/or non-discounting bidders in a large market). This result highlights the robustness of our approach in both protecting privacy and maintaining near optimal expected revenue in dynamic, strategic environments.</p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0"><p>This failure probability &#969; is inevitable due to the inherent uncertainty in learning from a finite sample set, see Chapter 1<ref type="bibr">Kearns and Vazirani (1994)</ref> </p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1"><p>In practice, recognizable non-i.i.d. value distributions are common, e.g., Meta Ad platform (met) requires that each advertiser selects one of six objectives, corresponding to different distributions based on the industry or advertisement topic.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2"><p>We define the virtual value inverse &#949; &#8594;1 i (&#949;) as arg minv&#8593;V &#949;i(v) &#8594; &#949;.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_3"><p>More formally, vj &#8593; X is the minimum value such that this quantity exceeds qjn.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_4"><p>Without privacy constraints, truncating at the top &#977; 1/(1&#8594;&#969;) -suffices by Lem. I.4. Our algorithm adopt a looser truncation since the DPQUANTU algorithm only return the value of given quantiles approximately.</p></note>
		</body>
		</text>
</TEI>
