<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>On Permutation Selectors andtheir Applications inAd-Hoc Radio Networks Protocols</title></titleStmt>
			<publicationStmt>
				<publisher>Springer Nature Switzerland</publisher>
				<date>12/27/2024</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10598869</idno>
					<idno type="doi">10.1007/978-3-031-74580-5_8</idno>
					
					<author>Jordan Kuschner</author><author>Yugarshi Shashwat</author><author>Sarthak Yadav</author><author>Marek Chrobak</author><author>Quentin Bramas</author><author>Arnaud Casteigts</author><author>Kitty Meeks</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[Selective families of sets, or selectors, are combinatorial tools used to “isolate” individual members of sets from some set family. Given a set X and an element x ∈ X, to isolate x from X, at least one of the sets in the selector must intersect X on exactly x. We study (k,N)-permutation selectors which have the property that they can isolate each element of each k-element subset of {0, 1, ..., N − 1} in each possible order. These selectors can be used in protocols for ad-hoc radio networks to more efficiently disseminate informa- tion along multiple hops. In 2004, Gasieniec, Radzik and Xin gave a construc- tion of a (k, N )-permutation selector of size O(k2 log3 N ). This paper improves this by providing a probabilistic construction of a (k, N )-permutation selector of size O(k2 log N ). Remarkably, this matches the asymptotic bound for standard strong (k,N)-selectors, that isolate each element of each set of size k, but with no restriction on the order. We then show that the use of our (k, N )-permutation selector improves the best running time for gossiping in ad-hoc radio networks by a poly-logarithmic factor.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Selective families of sets, or selectors, are combinatorial tools used to "isolate" individual members of sets belonging to a given collection of sets. Given a set X and some element x &#8712; X, to isolate x from X, at least one of the sets in the selector must intersect X on exactly x. Various types of selectors have been constructed in the literature and used in applications ranging from group testing to coding, bioinformatics, multipleaccess channel protocols, and information dissemination in radio networks.</p><p>To illustrate this concept on a concrete example, consider the contention resolution problem in multiple-access channels (MACs) without feedback: N devices are connected to a shared broadcast channel (say, a radio frequency or ethernet), and each has a unique identifier from the set U = {0, 1, ..., N -1}. If exactly one device transmits in some time slot, then its message will be delivered through the channel to all other devices. However, if two or more devices transmit simultaneously, then a collision on the broadcast channel occurs. Channel collisions are indistinguishable from background noise, and the sender does not receive any feedback about the fate of its transmission.</p><p>Suppose that some unknown set of k devices initially activate and have messages to be transmitted. (See Fig. <ref type="figure">1</ref>.) We seek a protocol that would allow these k active devices to transmit successfully, providing that the other devices remain idle. Without any feedback, each protocol for this model is non-adaptive, that is, it can be uniquely identified with the sequence of transmission sets S = S 0 , S 1 , ..., S m-1 , where each S t is the set of devices allowed to transmit at time slots t, if active. A trivial protocol would avoid collisions altogether by using singleton transmission sets {0}, {1}, ..., {N -1}, but this requires m = N steps to complete k transmissions, which seems wasteful if k is small.</p><p>For an active device x to transmit successfully at a slot t of a transmission sequence S, x must be the only active device in set S t . Therefore S must satisfy the following property: for any set X &#8838; U of cardinality k and any x &#8712; X there is t &#8712; {0, 1, ..., m -1} such that S t &#8745; X = {x}. A family S that satisfies this condition is called a strong (k, N )-selector. In other words, a strong (k, N )-selector isolates each element of each subset of U of cardinality k. Optimizing the time to complete all transmissions in the above MAC model is equivalent to constructing a strong (k, N )selector of minimum size m.</p><p>Very similar challenges arise in information dissemination protocols for ad-hoc radio networks. A radio network is a directed graph whose nodes represent radio transmitters/receivers and edges represent their transmission ranges. There are n nodes, each assigned a unique label from U = {0, 1, ..., N -1}. When a node transmits, its message is sent to all its out-neighbors; however, the possibility of collisions at the outneighbors can result in the loss of transmitted messages. The ad-hoc property dictates that the nodes are oblivious to the network's topology at the beginning. In particular, in the scenario with a node v having k in-neighbors that attempt to transmit to v, the challenge is equivalent to the MAC contention resolution. (See Fig. <ref type="figure">1</ref>.) A protocol that applies a strong (k, N )-selector will guarantee that in at most m steps all in-neighbors of v will successfully transmit their messages to it. This property is particularly useful in the problem of gossiping (full information exchange), where all nodes in the network have some information that needs to be delivered to all other nodes in the network. (The formal definition of the gossiping problem is given in Sect. 3.)</p><p>Strong (k, N )-selectors have been well studied in the literature (see the discussion at the end of this section). It is known that there are strong (k, N )-selectors of size m = O(k 2 log N ), beating the earlier-mentioned trivial bound of N if k is small. Our Contribution. We study a new type of selectors called (k, N )-permutation selectors. We earlier saw that a strong (k, N )-selector isolates all elements of each k-element set X &#8838; U . The (k, N )-permutation selector guarantees an extra property that for each possible permutation of X, it isolates all x &#8712; X in the order of this permutation.</p><p>The motivation comes from radio networks, where, unlike in the MAC contention resolution problem, a message may need to travel along a path of multiple nodes. At each node v along the path, the in-neighbors of v need to overcome contention to successfully transmit to v. Consider such a path P = v 0 v 1 ...v s , and suppose that the inneighborhood X of P (the set of nodes with edges going to P ) has at most k nodes. (See Fig. <ref type="figure">2</ref> for illustration.) An s-fold repetition of a strong (k, N )-selector will deliver a message from v 0 to v s in time O(sk 2 log N ) = O(k 3 log N ). A (k, N )-permutation selector of size m can achieve this in time m, because it guarantees that in m steps the nodes v 0 , v 1 , ..., v s-1 will successfully transmit, one by one, in this order.</p><p>37 22 12 11 42 30 26 20 41 19</p><p>Fig. <ref type="figure">2</ref>. An example of a path and its in-neighborhood, with nodes identified by their labels. The path is P = 30, 11, 22, 42 and its neighborhood (shaded) has 9 nodes.</p><p>The concept of (k, N )-permutation selectors is implicit in the work by Gasieniec, Radzik and Xin <ref type="bibr">[10]</ref>. They show that a (k, N )-permutation selector can be constructed by interleaving other known types of selectors<ref type="foot">foot_0</ref> . Their construction is called a path selector and it has size O(k 2 log N log 2 k). Thus, assuming that N is polynomial in the network size n, a protocol based on their path selector will deliver a message along a path with in-neighborhood of size at most k in time O(k 2 log 3 n). They used this idea to give an O(n 4/3 log 10/3 n)-time protocol for gossiping in ad-hoc radio networks. This is the best currently known upper bound for this problem.</p><p>The main result of our paper is an improved upper bound on the size of (k, N )permutation selectors. Let 2 &#8804; k &#8804; N . Using a probabilistic construction, we prove the following theorem in Sect. 2.</p><p>This bound matches the best bound on strong (k, N )-selectors. This seems surprising, as (k, N )-permutation selectors offer more capability: they can isolate any k elements in any given order, while strong selectors will only do so in some unknown order. Theorem 1 leads to a poly-logarithmic improvement of the time complexity of gossiping in ad-hoc radio networks. A further minor improvement can be achieved by using a faster procedure for broadcasting <ref type="bibr">[6]</ref> inside the gossiping protocol. This leads to the following result: Theorem 2. If N is polynomial in n, then the gossiping problem in ad-hoc radio networks can be solved in time O(n 4/3 log 2 n(loglog n) 2/3 ).</p><p>In Sect. 4 we consider even more general structures that we named (k, q, N )permutation selectors. These are defined by the following property: for each permutation of each k-element set X, the selector isolates some q-elements of X in the order of this permutation. This naturally extends the concept of (k, q, N )-selectors, which isolate q elements of a k-element set without restricting the order. (See the discussion below.) Extending the proof of Theorem 1, we show that there exists a (k, q, N )permutation selector of size O(kq log N ).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Related Work.</head><p>Combinatorial structures closely related to selectors have been studied in different settings and under different terminology, with connections between these concepts not always obvious. Some of these structures are equivalent to strong (k, N )selectors, while others are related to the concept of weak (k, N )-selectors, which only isolate any one element of each k-element set. Examples include superimposed codes used in information retrieval <ref type="bibr">[12]</ref>, cover-free set families <ref type="bibr">[9]</ref>, as well as protocols for non-adaptive group testing <ref type="bibr">[11]</ref> and for MAC contention resolution <ref type="bibr">[13]</ref>. The use of selectors in protocols for ad-hoc radio networks was initiated in <ref type="bibr">[1,</ref><ref type="bibr">3]</ref>, and various forms of selectors have been used in essentially all deterministic protocols for information dissemination in such networks.</p><p>One classical example of applications of selectors outside of networking is group testing, a method used in fields such as medical diagnostics, quality control, or bioinformatics. The goal in group testing is to efficiently identify individuals who test positive for a specific trait, such as a disease. Instead of testing these individuals separately, group testing works by pooling individuals into groups for collective testing. With this approach, selectors can be employed to minimize the number of required tests. For more thorough discussion of applications of selectors, see <ref type="bibr">[7]</ref> and the references therein.</p><p>For strong (k, N )-selectors, the upper bound of O(k 2 log N ) can be established by a probablistic construction <ref type="bibr">[9,</ref><ref type="bibr">12,</ref><ref type="bibr">13</ref>]. An explicit construction matching this bound can be found in <ref type="bibr">[16]</ref>. Note that such bounds are of interest only for k = O( N/ log N ), because for larger values of k a trivial construction using N singletons is better. This <ref type="bibr">[2,</ref><ref type="bibr">5,</ref><ref type="bibr">8]</ref>. De Bonis et al. <ref type="bibr">[7]</ref> introduced a more general model of selectors called (k, q, N )selectors. A (k, q, N )-selector has the property that it can isolate at least q elements from each k-element subset of U . A strong (k, N )-selector is a special case when q = k, and a weak (k, N )-selector is a special case when q = 1. They proved that there are (k, q, N )-selectors of size O(k 2 /(kq + 1) log N ).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Permutation Selectors</head><p>The objective in this section is to prove Theorem 1, namely to show that there is a</p><p>We start with a more formal definition of our permutation selectors. Recall that U = {0, 1, ..., N -1}, and let 2 &#8804; k &#8804; N . Consider a sequence S = S 0 , S 1 , ..., S m-1 of subsets of U . We call S a (k, N )-permutation selector if it has the following property:</p><p>(PS) For every X &#8838; U with |X| = k and for each permutation &#960; = x 1 , x 2 , ..., x k of X, there exists an increasing sequence of indices</p><p>If S satisfies this property for a set X and a permutation &#960;, we will say that it isolates X in order &#960;, or simply that S isolates &#960;. Thus S is a (k, N )-permutation selector if it isolates each k-permutation of U .</p><p>The proof idea is to choose each set S i randomly, by having each label in U , independently, add itself to S i with probability 1 k . This way, for any fixed set X of cardinality k, each S i will isolate some element of X with probability &#947; = (1</p><p>We then show that the probability that X is not isolated in order &#960; is exponentially small with respect to m/k. This, using the union bound, will give us that the probability that S is not a (k, N )-permutation selector is less than 1 if m = &#920;(k 2 log N ), with a sufficiently large constant hidden behind the big-Theta. Therefore, some sequence</p><p>We now proceed with the details. As in the first part of the proof we fix X and &#960;, we can as well assume that X = {0, 1, ..., k -1} and that the desired permutation of X is &#960; = 0, 1, ..., k -1. Our first goal is to estimate the probability that S does not isolate X in order &#960;.</p><p>To this end, we start by considering an auxiliary problem, which is basically a variant of coupon collection: Suppose that we generate uniformly a random sequence r = r 0 , r 1 , ..., r &#8467;-1 of elements of {0, 1, ..., k -1}, where &#8467; &#8805; k &#8805; 2. We want to compute the probability p &#8467;,k of the event "r does not contain &#960; as a subsequence". To compute p &#8467;,k , we reason as follows. For a given j &#8712; {0, 1, ..., k -1}, if r contains 0, 1, .., j -1 as a sub-sequence, associate with r the unique lexicographically-first appearance of 0, 1, .., j -1, namely the increasing sequence i 0 , i 1 , ..., i j-1 of positions in r, where i 0 is the position of the first 0, i 1 is the first position of 1 after i 0 , and so on. The number of r's that contain 0, 1, 2, ..., j -1 but not 0, 1, 2, ..., j can be computed by multiplying the number of choices, &#8467; j , for the lexicographically-first appearance of 0, 1, ..., j -1, and the number of ways for filling the remaining &#8467;j positions, which is (k -1) &#8467;-j . This yields the formula for p &#8467;,k , given below, for which we then derive an upper bound estimate.</p><p>where the last step follows from &#8467;/(k -1) &#8804; 2&#8467;/k and</p><p>Next, still with X and &#960; fixed as above, we estimate the probability that S does not isolate X in order &#960;. Let h be the random variable representing the number of sets S i that isolate some element of X (with repetitions counted). As stated earlier, the probability that some element of X is isolated by a given S i is &#947; &#8712; ( 1 e , 1  2 ]. So h is a binomial random variable with success probability &#947; and mean &#956; = &#947;m. Therefore, letting &#948; = 1 -1/(4&#947;) (note that &#948; &#8712; [ 1 2 , 1e/4)) and using the Chernoff bound for the lower tail of h's distribution, we get</p><p>where &#945; = e -&#948; 2 &#947;/2 &#8712; (0, 1).</p><p>Consider now the length-h sequence consisting of elements of X (with repetitions) that are isolated by S, in order in which they are isolated. For an integer &#8467; &#8805; k, let E &#8467; be the event that the first min(&#8467;, h) elements of this sequence do not contain our permutation &#960; as a subsequence. Then Pr[E &#8467; |h &#8805; &#8467;] = p &#8467;,k , so</p><p>for &#946; = max(&#945;, e -1/4 ). (We can assume that m &#8805; 2k.) Clearly, &#946; &#8712; (0, 1). Let m = ck 2 log N , for some constant c that will be specified later. To upper bound the probability that S is not a (k, N )-permutation selector, we apply the union bound. We have N k choices of k-element sets X, and each can be permuted in k! ways. So, using the bound (3), we obtain</p><p>where the last inequality holds if c is large enough so that c&#946; c &lt; 1  16 . This proves that for this c and m = ck 2 log N there exists a (k, N )-permutation selector S of length m.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Gossiping in Ad-Hoc Radio Networks</head><p>In this section we prove Theorem 2. We start by giving a formal description of the adhoc radio network model. The gossiping protocol we use is essentially identical to the one in <ref type="bibr">[10]</ref>, but we include its high-level description and sketch the analysis, for the sake of completeness.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Ad-Hoc Radio Network Model.</head><p>A radio network can be naturally modeled as a directed graph G = (V, E) whose nodes represent processing elements equipped with radio transmitters/receivers. Each node has a unique label from the set U = {0, 1, . . . , N -1}. The directed edges represent the nodes' transmission ranges, that is (u, v) &#8712; E iff v is in the range of the transmissions from u. If (u, v) &#8712; E then v is called an out-neighbor of u and u is an in-neighbor of v.</p><p>Initially, all nodes know only their own label and the upper bound N on the number of labels. They do not have any information about the network's topology. The time is discrete, divided into equal-length time steps. If a node u transmits a message, this message is sent to all its out-neighbors at the same time step. If v is one of these outneighbors, and u is the only in-neighbor of v transmitting at this step, then v will receive u's message. But if some other in-neighbor of v transmits at the same step, a collision occurs. The model does not assume any collision detection capability, so u will not receive any collision notification and v will not know that u transmitted. There are no restrictions on message size or local computation.</p><p>The two most basic information dissemination primitives in this model are broadcasting and gossiping. In broadcasting (or one-to-all communication) the goal is to deliver a message from a designated source node to all other nodes. In gossiping (or all-to-all communication) each node starts with its own piece of information that we call a rumor, and the rumors from each node must be delivered to all other nodes. For these problems to be well defined, G must satisfy appropriate connectivity assumptions: in broadcasting all nodes must be reachable from the source node, and in gossiping the network must be strongly connected.</p><p>The Gossiping Protocol. We describe the protocol under the assumption that N = n, and later we will show how to extend it to the general case, when N is polynomial in n. With this assumption, a trivial gossiping protocol would broadcast the rumors from nodes labeled 0, 1, . . . , n -1 one by one. Denoting by B(n) the running time of a broadcasting protocol, this would take time O(nB(n)). To speed this up, the gossiping protocols in <ref type="bibr">[4,</ref><ref type="bibr">10,</ref><ref type="bibr">14]</ref> work by grouping rumors in some nodes so that these nodes can broadcast the collected rumors in a single message.</p><p>To reduce the number of such broadcasts, the idea is to broadcast from nodes that have many rumors. We call a rumor active if it has not been broadcast yet, and a node is active if its rumor is active and dormant otherwise. We use procedure DISPERSE(&#956;), which repeatedly chooses a node with at least &#956; active rumors and then broadcasts from this node. Choosing such a node can be accomplished with broadcasting and binary search <ref type="bibr">[4]</ref>. The number of chosen nodes will be O(n/&#956;), so the total running time of</p><p>One clever observation in <ref type="bibr">[10]</ref> is that it is sufficient to give a protocol for a task called quasi-gossiping, where each node needs to either become dormant itself or have its rumor delivered to a dormant node. This is because a single repetition of the transmission sequence from the quasi-gossiping protocol will in fact complete full gossiping. The pseudo-code for the quasi-gossiping protocol is given in Algorithm 1. It uses a parameter &#954; whose value will be determined later.</p><p>The correctness of Protocol QUASIGOSSIP is justified by focussing on active paths, which are paths consisting only of active nodes. For any active path, define its active in-</p><p>transmit from node v 3: DISPERSE(&#954;) 4: repeat log &#954; + 1 times 5:</p><p>the active nodes transmit according to a (&#954;, n)-permutation selector 6: DISPERSE(&#954;/2) neighborhood to be the set of active in-neighbors of the nodes on this path. Let &#8467; be the largest number such that each active path with &#8467; nodes has its active in-neighborhood size (strictly) smaller than &#954;. After the execution of DISPERSE(&#954;) in Line 3, each node will be left with fewer than &#954; active in-neighbors, so at this point we have &#8467; &#8805; 1.</p><p>Each iteration of the repeat loop at least doubles the value of &#8467;. Since all nodes (except possibly the last) on an active path belong to the active in-neighborhood, the value of &#8467; cannot exceed &#954;. So after log &#954; iterations there cannot be any active paths left of length at least &#954;, and then the last iteration will complete the quasi-gossiping task.</p><p>In line 5 we use the (&#954;, n)-permutation selector from Theorem 1, so the total cost of these selectors will be O(&#954; 2 log 2 n). The cost of all calls to DISPERSE() is O((n/&#954; + log n)B(n) log n). Thus the overall running time of Protocol QUASIGOSSIP is asymptotically bounded by:</p><p>Letting &#954; = (nB(n)/ log n) 1/3 , and using the bound B(n) = O(n log n loglog n) from <ref type="bibr">[6,</ref><ref type="bibr">15]</ref> on the complexity of broadcasting, we obtain a gossiping protocol with running time O(n 4/3 log 2 n(loglog n) 2/3 ).</p><p>To extend Protocol QUASIGOSSIP to the case when N is polynomial in n, we first replace n by N in all invocations of selectors. After this, the only problematic part of Protocol QUASIGOSSIP is the for loop in Lines 1-2 that reduces the active in-neighborhoods of the individual nodes. Instead of this loop, the protocol in <ref type="bibr">[10]</ref> uses (s, s/4, N)-selectors for geometrically decreasing values of s, combined with operations DISPERSE(s/4), to gradually reduce these in-neighborhoods. (This is similar to the method in <ref type="bibr">[13]</ref>.) The running time of this process amortizes to O((n/&#954;)B(n) log n)-same time as for the case of small labels. So the overall running time also remains the same, completing the proof of Theorem 2.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">(k, q, N )-Permutation Selectors</head><p>In Sect. 2 we defined (N, k)-permutation selectors, a family of sets that isolates all elements of any k-permutation &#960; of the label set U = {0, 1, . . . , N -1}, meaning that all elements in &#960; are isolated in their listed order in &#960;.</p><p>We now generalize this concept analogously to the way (k, q, N )-selectors generalize standard selectors, by requiring that some q elements of each k-permutation &#960; are isolated in the order of &#960;. Formally, let S = S 0 , S 1 , . . . , S m-1 be a sequence of subsets of U . S is called a (k, q, N )-permutation selector if the following property holds: (PS') For each X &#8838; U with |X| = k, and for each permutation &#960; = x 1 , x 2 , ..., x k of X, there are increasing sequences of indices 0 &#8804; i 1 &lt; i 2 &lt; ... &lt; i q &#8804; m -1 and d 1 &lt; d 2 &lt; ... &lt; d q such that, for all l = 1, 2, ..., q, set S i l isolates x d l from X.</p><p>This section provides a probabilistic construction of a (k, q, N )-permutation selector, proving the following theorem. Let 2 &#8804; k &#8804; N and 1 &#8804; q &#8804; k.</p><p>Theorem 3. There exists a (k, q, N )-permutation selector S = S 0 , S 1 ,</p><p>Proof. As in the proof of Theorem 1, we use a probabilistic argument. The construction is the same: for each j = 0, 1, ..., m -1, we let S j be a random subset of U obtained by each element x &#8712; U adding itself to S j , independently, with probability 1 k . We then need to prove that if c is a sufficiently large constant and m = c &#8226; qk log N , then Pr[ S is not a (k, q, N )-permutation selector] &lt; 1.</p><p>(</p><p>The proof's high-level strategy is the same as in the proof of Theorem 1. We use the Chernoff bound to reduce the problem to a version of the coupon collection problem. We consider the following variant of the coupon collection problem: For a random sequence r = r 0 , r 1 , ..., r &#8467;-1 of coupons from {0, 1, ..., k -1}, where &#8467; &#8805; k &#8805; 2, let p &#8467;,k,q be the probability of the event "r does not contain an increasing subsequence of length q". To show <ref type="bibr">(5)</ref>, it is sufficient to prove that p &#8467;,k,q &#8804; &#947; &#8467;/q (2&#8467;/q) q , (</p><p>for some constant &#947; &#8712; (0, 1). Indeed, this is analogous to <ref type="bibr">(1)</ref>. We can then use the Chernoff-based bound (2), to obtain a bound of &#948; m/q (m/q) q , for some 0 &lt; &#948; &lt; 1, on the probability that S does not isolate q elements of &#960; in order, analogously to (3).</p><p>For the union-bound estimate, we then take m = ckq log N , with large enough c. Since then &#948; m/q (m/q) q = &#948; ck log N (ck log N ) q &#8804; &#948; ck log N (ck log N ) k , the derivation for the union bound (4) will be essentially the same, with &#948; instead of &#946;. It remains to estimate p &#8467;,k,q . We reason as follows. First, to avoid clutter in the calculations below, we will assume that q is a divisor of k. (If it is not, the values of k/q in the calculations below need to be rounded up or down. This does not affect our asymptotic estimate.)</p><p>We now consider a special type of increasing sub-sequences that we refer to as q-jump sub-sequences. Partition the set of coupons {0, 1, ..., k -1} into q equal size blocks B 0 , B 1 , ..., B q-1 , each having k/q consecutive coupons. That is, the h-th block is B h = h k q , h k q + 1, ..., (h + 1) k q -1 , for h = 0, 1, ..., q -1. For j &#8804; q, a jump sub-sequence is a sequence of j coupons a 0 , a 1 , ..., a j-1 such that a h &#8712; B h for each h = 0, ..., j -1.</p><p>Define now p &#8467;,k,q to be the probability of the event "r does not contain a jump sub-sequence of length q". Since p &#8467;,k,q &#8805; p &#8467;,k,q , it is sufficient to prove that the same inequality (6) holds for p &#8467;,k,q instead of p &#8467;,k,q . We do this by refining the argument in the proof of Theorem 1. For a given j &#8712; {0, 1, ..., q -1}, if r contains a length-j jump sub-sequence then associate with r the unique lexicographically-first appearance of a length-j jump sub-sequence. The number of r's that contain a length-j jump subsequence but not a length-(j + 1) jump sub-sequence can be obtained by choosing the lexicographically first length-j jump sub-sequence in &#8467; j &#8226; (k/q) j ways and multiplying it by the number of ways of filling the remaining &#8467;j positions, which is (kk/q) &#8467;-j . We thus have p &#8467;,k,q &#8804; p &#8467;,k,q = 1 k &#8467; q-1 j=0 &#8467; j &#8226; (k/q) j &#8226; (kk/q) &#8467;-j = (1 -1/q) &#8467; q-1 j=0 &#8467; j (q -1) -j &#8804; (1 -1/q) &#8467; q-1 j=0 (2&#8467;/q) j &#8804; e -&#8467;/q (2&#8467;/q) q .</p><p>This proves (6) (with &#947; = 1/e), completing the proof.</p><p>For q = k, the bound in Theorem 3 matches the bound from Theorem 1 and the best bound for strong (k, N )-selectors. For q = 1, it also matches the O(k log N ) bound for weak (k, N )-selectors. We are not sure about the optimum size of (k, q, N )-permutation selectors for intermediate values of q. The case when q = &#8730; k is particularly interesting. From the bound in <ref type="bibr">[7]</ref>, (k, &#8730; k, N )-selectors have size O(k log N ). Can some probabilistic construction produce a (k, &#8730; k, N )-permutation selector of size m = &#213;(k)? For such m, a random sequence of coupons from {0, 1, ..., k -1} can be thought of as a "noisy" permutation. We would need to show that the probability that this random sequence does not contain an increasing sub-sequence of length &#8730; k is exponentially small. This question is closely related to Ulam's Problem about the distribution of the longest increasing subsequence (LIS) in a random permutation. Ulam's Problem has been extensively studied, and it is known that for permutations of 0, 1, ..., k -1 the expected length of LIS is 2 &#8730; k + O(k 1/6 ). (See <ref type="bibr">[17]</ref>, for example.) To our knowledge, however, the published concentration bounds are not sufficient for refining the probabilistic construction in the proof of Theorem 3 to yield a better bound.</p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0"><p>We remark that in<ref type="bibr">[10]</ref> the authors use a different style for specifying the parameters of selectors. The notation in our paper follows the convention from<ref type="bibr">[7]</ref>.</p></note>
		</body>
		</text>
</TEI>
