<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Efficient and low-overhead uplink scheduling for large-scale wireless Internet-of-Things</title></titleStmt>
			<publicationStmt>
				<publisher></publisher>
				<date>05/07/2018</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10073227</idno>
					<idno type="doi">10.23919/WIOPT.2018.8362813</idno>
					<title level='j'>16th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOpt)</title>
<idno></idno>
<biblScope unit="volume"></biblScope>
<biblScope unit="issue"></biblScope>					

					<author>Bin Li</author><author>Bo Ji</author><author>Jia Liu</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[With the rapid growth of Internet of Things (IoT) applications in recent years, there is a strong need for wireless uplink scheduling algorithms that determine when and which subset of a large number of users should transmit to the central controller. Different from the downlink case, the central controller in the uplink scenario typically has very limited information about the users. On the other hand, collecting all such information from a large number of users typically incurs a prohibitively high communication overhead. This motivates us to investigate the development of an efficient and low-overhead uplink scheduling algorithm that is suitable for large-scale IoT applications with limited amount of coordination from the central controller. Specifically, we first characterize a capacity outer bound subject to the sampling constraint where only a small subset of users are allowed to use control channels for system state reporting and wireless channel probing. Next, we relax the sampling constraint and propose a joint sampling and transmission algorithm, which utilizes full knowledge of channel state distributions and instantaneous queue lengths to achieve the capacity outer bound. The insights obtained from this capacity-achieving algorithm allow us to develop an efficient and low-overhead scheduling algorithm that can strictly satisfy the sampling constraint with asymptotically diminishing throughput loss. Moreover, the throughput performance of our proposed algorithm is independent of the number of users, a highly desirable property in large-scale IoT systems. Finally, we perform extensive simulations to validate our theoretical results.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>I. INTRODUCTION</head><p>Internet of Things (IoT) refers to the internetworking of a large number of heterogenous smart devices (e.g., smart phones, tablets, sensors and actuators) to meet the demands of the ever-increasing applications in personalized health care, smart home, ubiquitous environmental monitoring, smart manufacturing, etc. It is estimated that the global market for IoT will reach 20.4 billion devices by 2020 <ref type="bibr">[1]</ref>. However, unlike traditional data communication networks where the predominant amount of data is transmitted in the downlink, a distinct feature of IoT applications is that a significant portion of the IoT data traffic is carried in the uplink (i.e., from user devices to the central controller). In most IoT applications, each device generates sparse or intermittent data traffic and transmits them to a central controller or access point (AP) for data processing, typically through wireless connections This work is supported in part by NSF grants: CNS-1717108, ECCS-1818791, CCF-1758736, CNS-1758757, CNS-1446582, CNS-1651947 and CCF-1657162; and ONR grant: N00014-17-1-2417.</p><p>(referred to as wireless IoT uplink systems in this paper). As a result, to support the enormous amount of devices given limited spectral resources, there is a compelling need for efficient and low-overhead wireless IoT uplink scheduling algorithms that determine when and which devices (referred to as users in the rest of the paper) are allowed to transmit, with the goal of supporting as many users as possible, or equivalently, throughput-optimal scheduling.</p><p>Over the years, throughput-optimal scheduling has been extensively studied in the networking research community and many results are available. However, due to its unique features and applications, IoT uplink scheduling design is far more challenging compared to its counterparts in traditional wireless networks (see Section II for a detailed discussion). In particular, due to the sheer size of IoT systems, a welldesigned uplink scheduling policy not only has to be near throughput-optimal, it also needs to be low-overhead. To this end, in this paper, we propose an efficient and simple design based on the key idea termed "pick and compare (PC)" (e.g., <ref type="bibr">[22]</ref>, <ref type="bibr">[13]</ref>, <ref type="bibr">[4]</ref>, <ref type="bibr">[20]</ref>). Simply speaking, a PC scheme always stores the most congested user and compares its queue-length with a randomly selected user. It has been shown that the class of PC algorithms achieves the maximum throughput through gradually improved quality of transmission decisions over time. The simplicity, low-complexity, and scalability of PC algorithms motivate us to consider their deployments in large-scale IoT uplink systems, where extensive coordination from the AP is typically infeasible.</p><p>However, due to several technical challenges, developing a PC-based scheduling scheme for IoT uplink systems and conducting rigorous performance analysis is highly non-trivial. Traditionally, PC algorithms were developed for systems where the queue-length evolution processes are smooth. Unfortunately, in the presence of wireless channel fading, the time-varying channel rates could change rather abruptly. Consequently, most of the proof techniques used for establishing the throughput-optimality of PC-based algorithms fail under wireless channel fading settings. In fact, it remains an open question whether it is possible to design an efficient and low-overhead PC-based scheduling algorithm for IoT uplink systems with wireless channel fading. A key contribution of this paper is that we show that the answer is "yes." Our main results in this paper are summarized as follows:</p><p>&#8226; First, we characterize a capacity outer bound for a wireless IoT uplink system subject to a sampling constraint, i.e., we limit the number of users that can use control channels for reporting system state information and probing uplink channels. This capacity outer bound lays the foundation for the performance analysis of our proposed algorithms.</p><p>&#8226; Next, we consider an efficient and low-overhead uplink scheduling design. In particular, we relax the sampling constraint and propose a two-stage MaxWeight-type scheduling algorithm that utilizes the full knowledge of channel state distributions and instantaneous queue lengths to achieve the capacity outer bound.</p><p>&#8226; Finally, building upon this capacity-achieving algorithm, we develop an iterative pick-and-compare (IPC) algorithm that strictly satisfies the sampling constraint with a vanishing throughput loss. More precisely, let &#923;(M, K) denote the capacity outer bound for an N -user M -channel uplink system with K-user sampling constraint. We show that our IPC algorithm can stabilize any arrival rates within the region &#923;(M, K -1). Further, the gap between &#923;(M, K) and &#923;(M, K -1) is vanishing as K &#8593; N , and thus proving the asymptotic throughput-optimality of our IPC algorithm. Moreover, the achievable rate region of our IPC algorithm is independent of the number of users, a highly desirable property in large-scale IoT systems.</p><p>To our knowledge, our work is the first to offer asymptotic throughput-optimality for IoT uplink scheduling. The remainder of this paper is organized as follows. Section II presents related work, putting this paper into perspective. Section III introduces the network model and problem formulation. Section IV provides a capacity outer bound characterization. Section V covers the key elements of our low-overhead uplink scheduling design. Section VI presents numerical results, and Section VII concludes this paper.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>II. RELATED WORK</head><p>In this section, we provide a quick overview of throughputoptimal scheduling to put our work into perspective. In the literature, a well-known class of throughput-optimal scheduling algorithms is the family of MaxWeight policies, which date back to the seminal work by Tassiulas and Ephremides (e.g., <ref type="bibr">[23]</ref>, <ref type="bibr">[24]</ref>) and consist of many follow-ups (e.g., <ref type="bibr">[5]</ref>, <ref type="bibr">[14]</ref>, <ref type="bibr">[21]</ref>, <ref type="bibr">[15]</ref>, <ref type="bibr">[6]</ref>). The basic idea underpinning MaxWeight policies is to schedule users who have both good channel qualities and high congestion levels (evaluated by queue lengths or delays, etc.). However, it is exactly the requirement of both congestion and channel state information that renders MaxWeight policies unsuitable for IoT uplink scheduling. In many IoT applications, due to the large number of devices, congestion and channel state information is prohibitively expensive to collect. Exacerbating this problem is the fact that even those reduced channel-probing-overhead MaxWeight variants designed for large-size downlink systems (e.g., for energy minimization <ref type="bibr">[11]</ref> or limited channel quality feedback <ref type="bibr">[17]</ref>) cannot be directly applied in IoT uplink systems. The reason is that these MaxWeight variants still require queue-length information of all users, while the AP in IoT uplink systems usually has limited queueing information about their associated users. As a result, the assumption of accurate congestion level information for all users, which is usually valid for downlink scheduling, no longer holds in the uplink counterpart.</p><p>To overcome the limitations of MaxWeight policies, one solution is to adopt the carrier sense multiple access (CSMA) design (e.g., <ref type="bibr">[8]</ref>, <ref type="bibr">[18]</ref>, <ref type="bibr">[16]</ref>), where each user judiciously selects its CSMA parameters to adapt to its congestion levels (e.g., the queue-lengths). However, most existing CSMA-based schemes assume that the weight of each user (typically some function of the queue-length) changes slowly over time, which makes their extensions to settings with wireless channel fading quite difficult. Indeed, in the presence of fading, the weight of each user (determined by the product of the queue-length and the current feasible service rate) is stochastic and can change rather abruptly. Although this restriction on slow timevarying weight has been relaxed in recent works (e.g., <ref type="bibr">[26]</ref>, <ref type="bibr">[9]</ref>), a more fatal limitation of CSMA-based policies is that the average time required for successfully acquiring the channel in the channel contention grows exponentially with the number of users. This limitation renders CSMA-type schemes impractical for IoT applications that typically involve a huge number of users. As will be seen shortly, our proposed PC-based scheme avoids both the slow time-varying restriction and the scalability pitfall of CSMA.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>III. SYSTEM MODEL</head><p>We consider an IoT uplink system with one access point (AP) and N users, where each user transmits data to the AP through M orthogonal channels. We assume that the system operates in slotted time with normalized slots t &#8712;{ 1, 2,...}. We use C[t]=( C i,j [t],i =1 , 2,...,N,j =1 , 2,...,M) to capture the wireless channel fading, where C i,j [t] denotes the maximum amount of service available for user i to transmit packets on channel j in slot t. Due to the finite number of modulation and coding schemes, we assume that each channel for each user has R possible channel rates c 1 ,c 2 , ..., c R with 0=c 1 &lt;c 2 &lt;. . .&lt;c R = c max , which measure the maximum number of packets that can be delivered in one time slot. We assume that C[t] are independently distributed across both users and channels, and independently and identically distributed (i.i.d.) over time.</p><p>In the IoT uplink system, each user needs to send a control message to the AP in order for the AP to obtain its state information (e.g., congestion levels and channel rates). In particular, the AP fetches the congestion level of a user by decoding its control message, and obtains its channel rate by measuring the signal-to-noise ratio of the received control message. However, due to the restrictive amount of uplink resources, the AP usually has very limited information about the state of each user in a large-scale IoT uplink system. Therefore, we assume that the AP can at most sample K (K N ) users to acquire their channel states and congestion levels on each channel at the beginning of each time slot.</p><p>We denote the sampling schedule as X[t]=( X i,j [t],i = 1, 2,...,N,j =1 , 2,...,M), where X i,j [t]=1if user i is sampled to send a control message on channel j such that the AP acquires its congestion level and the state of channel j in time slot t, and X i,j [t]=0otherwise. Let X (K) be the collection of all sampling schedules where K users are sampled to send their control messages on each channel. To avoid interference, at most one user is allowed to transmit over each channel in each time slot. We call a schedule where at most one user is active on each channel in each time slot as a feasible schedule and denote it as S[t]=( S i,j [t],i =1 , 2,...,N,j =1 , 2,...,M), where S i,j [t]=1if user i is scheduled on channel j in time slot t and S i,j [t]=0otherwise. We use S to denote the collection of all feasible schedules. Here, we assume that only the sampled users are allowed to transmit in each time slot.</p><p>We assume that each user i serves its own data traffic and maintains packets in a queue with Q i [t] denoting its queue length at the beginning of time slot t, which reflects its congestion level. The larger the Q i [t], the more congested the user i. Let A i [t] denote the number of packets arriving at user i in time slot t, which is independently distributed across users and i.i.d. over time with mean &#955; i &gt; 0. We assume</p><p>This is a reasonable assumption since, as mentioned earlier, most IoT data traffic are low-rate and intermittent. Then, the evolution of user i's queue can be described as follows:</p><p>+ for i =1 , 2,...,N, where (x) + max{x, 0}. In this paper, we are interested in developing an efficient uplink scheduling algorithm with limited coordination from the AP. In particular, our goal is to find a joint sampling and transmission schedule {X[t], S[t]} t&#8805;1 such that, in each time slot and for each channel, (i) K users are allowed to send their control messages in order for the AP to acquire their queue-lengths and channel rates; and (ii) at most one user can be scheduled among these K sampled users. A key difficulty of this joint sampling and scheduling problem is that the information available for making transmission scheduling decision S[t] heavily relies on the sampling decision X[t].</p><p>We consider the class P of stationary sampling and transmission policies that first decide the sampling schedule X[t] at slot t based on the available information</p><p>) and channel state distributions, and then determine the transmission schedule</p><p>), where &#8855; stands for component-wise multiplication. In other words, a joint sampling and transmission policy in P is a two-stage mapping where it first maps from the space of Q[t] to the space of sampling schedules X (K) during the sampling stage and then maps from the space of (Q</p><p>) to the space of feasible schedules S during the transmission stage. Under any policy in P, the queue length process {Q[t]} t&#8805;1 forms a Markov Chain.</p><p>We say that queue i is strongly stable</p><p>The system is stable if all queues in the system are strongly stable. We let &#920;(M, K) denote the capacity region for an IoT uplink system with M orthogonal channels and K allowed users in sampling for each channel in each time slot, which represents the maximum set of arrival rate vectors &#955; =( &#955; i ) N i=1 for which the system is stable under some policy. We call an algorithm optimal if it keeps the system stable for any arrival rate vector that lies strictly inside &#920;(M, K). Note that it is in general difficult to directly characterize &#920;(M, K). To that end, we first derive an outer bound for the capacity region in the next section.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>IV. CAPACITY OUTER BOUND CHARACTERIZATION</head><p>In this section, we characterize an outer bound of the capacity region &#920;(M, K) for a wireless IoT uplink system with M orthogonal channels under the sampling constraint that K users are allowed to send their control messages on each channel in order for the AP to acquire their channel rates and queue-lengths and the scheduling constraint that at most one user is allowed to transmit on each channel in each time slot.</p><p>Proposition 1 (Capacity Outer Bound):</p><p>, where &#923;(M, K) is defined as the set of arrival rate vectors &#955; =( &#955; i ) N i=1 for which there exist non-negative numbers &#945;(x) and &#946;(x, c; s) such that the following expressions are satisfied:</p><p>x i,j c i,j s i,j , &#8704;i, (1) s&#8712;S &#946;(x, c; s)=1, &#8704;x, c, and</p><p>where p(c) Pr{C[t]=c} denotes the probability that the channel state is c, and &#945;(x) and &#946;(x, c; s) denote the probabilities of selecting the sampling schedule x &#8712;X (K) and selecting the transmission schedule s &#8712;Sgiven the sampling schedule x and channel state c, respectively.</p><p>Remark: In (1), the right-hand-side is the total average service rate provided for each user, and the left-hand-side is the user's average arrival rate. Thus, in order to ensure that the system is stable under some policy, (1) must be satisfied.</p><p>Proof: The proof follows a similar line of analysis as that in <ref type="bibr">[14]</ref> and is omitted due to space limitation. We refer readers to our technical report <ref type="bibr">[10]</ref> for proof details.</p><p>Having established the capacity outer bound &#923;(M, K),w e are now in a position to develop an efficient and low-overhead uplink scheduling algorithm.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>V. L OW-OVERHEAD UPLINK SCHEDULING DESIGN</head><p>In this section, we consider an efficient and low-overhead uplink scheduling design based on the idea of pick-andcompare (PC). In particular, we first develop an iterative sampling and transmission algorithm with full information to achieve the capacity outer bound &#923;(M, K), which motivates the development of an efficient and low-overhead uplink scheduling design. Then, we discuss the throughput deficiency of a natural variant of the proposed algorithm in the singlechannel system under the stringent constraint that K users are allowed to send their control messages. Finally, we propose a PC-based efficient low-complexity iterative sampling and transmission algorithms under the strict sampling constraint.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. Iterative Sampling and Transmission Algorithm Design</head><p>In this subsection, we develop an efficient joint sampling and transmission algorithm that can achieve the capacity outer bound &#923;(M, K). Although this algorithm requires full knowledge of channel state distributions and instantaneous queue lengths of all users, it provides a guideline for our design with the desired sampling constraint. We consider M +1 rounds in each time slot. By slightly abusing the notations, we use Q i,j [t] to denote the (virtual) queue-length of user i at the end of round j, where j =0, 1, 2,...,M, and</p><p>Iterative Joint Sampling and Transmission (IJST) Algorithm: In each time slot t, all users report their queue-lengths, i.e.,</p><p>Then, for each round j =1, 2,...,M, perform the following:</p><p>(1) Sampling Decision: Set the sampling vector X * j [t] as:</p><p>where X j is the j-th column of a N &#215;M matrix X and X j (K) denotes the collection of sampling schedules on channel j under the constraint that K users send their control messages on channel j. Users with X * i,j [t]=1are also required to send their control messages on channel j.</p><p>(2) Transmission Scheduling Decision: Schedule the transmission of user i * j [t] on channel j that satisfies:</p><p>(3) (Virtual) Queue-length Update:</p><p>After M -round decision making, users {i * j [t]} M j=1 transmit on their corresponding channels in the rest of time slot t.</p><p>Here, the IJST Algorithm uses the idea of iterative scheduling that is similar to that of <ref type="bibr">[3]</ref>, <ref type="bibr">[7]</ref> in order to improve delay performance. This is due to the fact that users with the larger queue-lengths may have priority over multiple channels, and thus users with slightly smaller queue-lengths suffer from poor delay performance (see <ref type="bibr">[2]</ref>, <ref type="bibr">[7]</ref>), especially when the number of channels is large. In the IJST Algorithm, all users need to report their queue-length information at the beginning of each time slot. Then, in the j-th round of the IJST Algorithm, (i) we first solve the optimization problem (3) to get the optimal sampling schedule X * j [t]; (ii) users with X * i,j [t]=1send their control message in order for the AP to acquire their channel state information; (iii) After collecting both queuelength and channel state information from users, user with the maximum product of queue-length and channel rate is selected for data transmission on channel j, and then the AP virtually updates the queue-length of the selected user. After M -round decision making, the selected users {i * j [t]} M j=1 are allowed for data transmission in the rest of the time slot t. The next proposition shows that the proposed IJST Algorithm can stabilize the system for any arrival rate vector strictly within the capacity outer bound &#923;(M, K).</p><p>Proposition 2: The IJST Algorithm achieves the capacity outer bound &#923;(M, K), i.e., for any arrival rate vector &#955; that is strictly inside &#923;(M, K), the IJST Algorithm stabilizes the system subject to the constraints of K allowed sampling users on each channel.</p><p>Proof: Select the Lyapunov function</p><p>and follows the standard Lyapunov arguments. The details can be found in our technical report <ref type="bibr">[10]</ref>. Note that the IJST Algorithm incurs a large amount of communication overhead that is linearly increasing with the number of users N before each data transmission. This is because the AP needs to know queue-length information of all N users to solve the optimization problem in (3) to obtain the optimal sampling schedule X * [t]. This motivates us to investigate whether there exist efficient policies that only allow K users to send their control messages on each channel, which significantly reduces the amount of communication overhead. Next, we provide an example to illustrate a non-trivial design of such policies starting from the single channel setting for the ease of exposition.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. A Motivating Example of Low-Overhead Uplink Scheduling: From "Power-of-K-Choices" to "Pick-and-Compare"</head><p>One way to reduce the amount of coordination by the AP in a single-channel wireless IoT uplink system works as follows: the AP randomly samples K users and asks them to send their control messages at the beginning of each time slot, and then selects the user with the maximum product of queue-length and channel rate for data transmission in the rest of the time slot. This algorithm is called Power-of-K-Choices. Similar ideas have been explored in the context of load-balancing algorithms (e.g., <ref type="bibr">[25]</ref>, <ref type="bibr">[12]</ref>) in data centers that distribute arriving jobs across servers with the goal of minimizing job waiting time. However, this algorithm suffers from a large throughput performance loss in wireless IoT uplink systems even in the single-channel setting.</p><p>To see the throughput inefficiency of the Power-of-K-Choices policy, we consider a single-channel uplink example with two groups of users without channel fading, where the first group has &#966;N users with the same mean arrival rate of 0.5/ &#966;N and the other has N -&#966;N users with the same mean arrival rate of &#955;, where &#966; &#8712; (0, 1) and x denotes the minimum integer no smaller than x. Here, it is easy to see that the capacity region is {&#955; :( N -&#966;N )&#955;&lt;0.5}. For the Power-of-Two-Choices policy (i.e., when K =2 ), the probability that at least one user sampled from the second group is 1 -&#966;N 2 / N 2 . Therefore, the Power-of-Two-Choices policy can at most support the throughput region:</p><p>. Thus, the second group of users suffer throughput loss by at least:</p><p>which amounts to 61.82% when N = 100 and &#966; =0.9. This simple example shows that the Power-of-K-Choices policy suffers from large throughput degradation even in the singlechannel and non-fading case, let alone in general settings with multiple channels and wireless channel fading. This is because the congested or heavily loaded users may not have an opportunity to be sampled and hence are not able to obtain service under the Power-of-K-Choices policy. Interestingly, in the single-channel non-fading case, there is a variant of the Power-of-Two-Choices policy, known as the Pick-and-Compare (PC) algorithm (e.g., <ref type="bibr">[22]</ref>, <ref type="bibr">[13]</ref>, <ref type="bibr">[4]</ref>, <ref type="bibr">[20]</ref>), which is known to be throughput-optimal. A PC-based scheme keeps track of the most congested user in the memory and compares its weight with a randomly selected user. The PC algorithm achieves the maximum throughput by gradually improving the scheduling decisions over time. However, we note that the PC algorithm in the literature only works under non-fading setting, while fading is the one of the key features in wireless communication channels. So far, it remains unclear how to generalize the PC algorithm to the fading settings and still achieve throughput performance guarantee. The main challenge in developing the PC algorithm for fading settings lies in the fact that the channel rates are time-varying and can change abruptly. This is very different from the smooth evolution of the queue-length process. In the next subsection, we will address this challenge and propose an efficient and low-overhead uplink scheduling algorithm. Moreover, this algorithm works for general multi-channel settings with fading.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>C. Iterative Pick-and-Compare Algorithm Design</head><p>In this subsection, we focus on the efficient uplink scheduling design under the stringent constraints that K users are allowed to transmit their control messages on each channel. The key element in our approach is to decouple the optimization problem (3) such that it can be solved by only considering a small subset of users. To that end, we assume that the wireless fading channels satisfy the following assumption.</p><p>Assumption 1: For any given non-negative numbers n 1 ,n 2 ,...,n N , there exists a stochastic order among random variables n 1 C 1,j ,n 2 C 2,j ,...,n N C N,j , i.e., there exists a permutation (m 1 ,m 2 ,...,m N ) of (1, 2,...,N) such that</p><p>where j =1 , 2,...,M. Here, Z 1 &#8805; st Z 2 means that random variable Z 1 is stochastically greater than random variable Z 2 (see <ref type="bibr">[19]</ref>), i.e., Pr{Z 1 &gt;z}&#8805;Pr{Z 2 &gt;z}, &#8704;z &#8712; R.</p><p>Remark: If channel states are i.i.d., then <ref type="bibr">(7)</ref> trivially holds.</p><p>Assumption 1 provides an opportunity for decoupling the optimization problem (3) by only allowing a small portion of users to be sampled in order for the AP to obtain system state information of sampled users. Indeed, if Assumption 1 does not hold, it is almost impossible to obtain the optimal value of (3) by only collecting information from a small subset of users due to the abrupt changes of channel rates. Next, we incorporate wireless channel fading to generalize the traditional PC algorithmic design in the general multichannel systems. Similar to the IJST Algorithm, we use Q i,j [t] to denote the queue-length of user i at the end of round j, j =0 , 1, 2,...,M, and</p><p>to denote the j-th channel rate of user i in time slot t, while C i,j without time index t denotes a random variable with the same distribution as the j-th channel rate of user i.</p><p>Iterative Pick and Compare (IPC) Algorithm: In each time slot t, given users ( &#238;k,j [t -1],k =1 , 2,...,K -1,j = 1, 2,...,M) selected by the IPC Algorithm in time slot t -1, perform the following: For each channel j =1, 2,...,M,</p><p>(1) Pick Phase: Randomly pick one user r j [t], and ask it to report its current queue-length Q rj [t] [t] and channel state C rj [t],j [t] on channel j to the AP.</p><p>(2) Report Phase: Ask users ( &#238;k,j [t -1]) K-1 k=1 to report their queue-lengths and channel states of channel j to the AP.</p><p>(3) Compare Phase (Transmission Scheduling): Determine the transmission schedule of user i on channel j that satisfies:</p><p>where Xi,j [t]=1 if i &#8712; r j [t], &#238;k,j [t-1], &#8704;k =1, 2,...,K-1 , and Xi,j [t]=0otherwise.</p><p>(4) Update Phase: Select users ( &#238;k,j [t]) K-1 k=1 that achieve the</p><p>k=1 and the newly reporting user r j [t] in the stochastic ordering sense, i.e.,</p><p>where</p><p>After M -round decision making, users { i j [t]} M j=1 transmit on their corresponding channels in the rest of the time slot t.</p><p>In the IPC Algorithm, the AP exactly requires K sampling users ( &#238;k [t-1]) K-1 k=1 ,r(t) on each channel. This significantly reduces the amount of coordination from the AP compared to the IJST Algorithm. Next, we will show that the IPC Algorithm still possesses excellent throughput performance.</p><p>Proposition 3: Suppose that the channel state C j = (C i,j ) N i=1 on each channel j satisfies Assumption 1. Then, for any arrival rate vector &#955; =( &#955; i ) N i=1 that is strictly inside the rate region &#923;(M, K -1), the IPC Algorithm stabilizes the system subject to the constraints that K users are allowed to send control messages on each channel.</p><p>Proof: The key step is to establish that the IPC Algorithm performs similarly as its centralized counterpart (i.e., the IJST Algorithm) does, i.e.,</p><p>holds for any &#947; &#8712; (0, 1), where G &#947;,j &gt; 0 is some constant. The rest of the proof follows a Lyapunov analysis for which the detailed proof can be found in Appendix A.</p><p>Note that under the constraint of K sampling users for each channel, the maximum throughput region that can be achieved by any feasible algorithm is at most &#923;(M, K), while our IPC Algorithm can achieve the throughput region &#923;(M, K -1) with the same amount of communication overhead. Here, it is worth pointing out that the throughput region &#923;(M, K -1) is independent of the number of users in the system. This yields a significant advantage over the IJST Algorithm especially for a large number of users, which is the typical case in most IoT applications.</p><p>Moreover, the achieved throughput region &#923;(M, K -1) is close to the capacity outer bound &#923;(M, K) even for a small K when the number of channels M is small. For example, in a single-channel case of N users with i.i.d. ON-OFF fading with p =P r {C <ref type="figure">1</ref> shows the throughput performance loss percentage under the IPC Algorithm when p =0 .8. We can observe from Fig. <ref type="figure">1</ref> that the throughput loss decays exponentially fast with the increase of the number of allowed sampling users K, and is at most 3.23% even when K =3 . Therefore, the throughput performance loss under the IPC Algorithm is small. Throughput Loss (%) 16.67% 3.23% 0.64% 0.13% 0.03% 0.01% 0.00% 0.00% 0.00% Fig. <ref type="figure">1</ref>: Throughput loss in a single-channel system VI. NUMERICAL RESULTS In this section, we perform simulations for our proposed low-overhead IPC Algorithm and compare it to the IJST Algorithm in both single-channel and multi-channel cases. In the simulations, we consider N =2 0users. We assume that the number of arrivals occurring at each user in each time slot follows a Bernoulli distribution with mean &#955;. In order to capture the burstiness feature of IoT traffic, we assume that each incoming arrival brings F packets, where F is equal to 20 with probability 4/19 and 1 otherwise. Therefore, the expected number of packets that each arrival carries is equal to 5, i.e., E[F]=5. We consider ON-OFF channel fading models that are independently distributed over users and i.i.d. over time, where the first ten users have channel availability probability of 0.9 and all others have probability of 0.5. We assume that all M channels have the same channel fading model. Fig. <ref type="figure">3</ref>: Performance comparison: single channel case Fig. <ref type="figure">2</ref> shows the impact of the number of sampling users K on the system performance of the IJST Algorithm and the IPC Algorithm in the single-channel case. From Figs. <ref type="figure">2a</ref> and<ref type="figure">2b</ref>, we can observe that as K increases, both throughput and delay performance of these algorithms improve. Especially, we can see that K =4sampling users are sufficient for both algorithms to almost achieve the maximum throughput (i.e., when K =20). Moreover, the delay performance under the IPC Algorithm is only slightly worse than that under the IJST Algorithm, and their gap becomes smaller as K increases, as shown in Fig. <ref type="figure">3b</ref>. This indicates that in the single channel case, the IPC Algorithm with only four sampling users can achieve almost the same throughput and delay performance as the IJST Algorithm, which requires all queuelength information available before each data transmission and thus requires a significant amount of communication overhead. Hence, our proposed IPC Algorithm dramatically reduces the communication overhead with a negligible performance loss.</p><p>In Fig. <ref type="figure">4</ref>, we study the performance of our proposed IPC Algorithm in a multi-channel case and compare it to the IJST Algorithm. From Figs. <ref type="figure">4a</ref> and<ref type="figure">4b</ref>, we can observe that our proposed IPC Algorithm still performs well in both three and five channel cases compared to the IJST Algorithm when the number of allowable sampling users is four. This indicates that the IPC Algorithm is quite robust to the number of channels, which is significantly more advantageous in large-scale multichannel IoT uplink systems. In this paper, we considered the design of efficient and low-overhead uplink scheduling algorithms for large-scale IoT applications, where the central controller has a limited amount of information about the users. We first derived a capacity outer bound under the sampling constraint, where only a small subset of users are allowed to use control channels for system state reporting and channel probing. Then, we proposed a joint sampling and transmission algorithm with full information before each transmission and show that it achieves the capacity outer bound. However, this algorithm incurs a huge amount of communication overhead before each data transmission. To that end, we developed an efficient and low-overhead uplink scheduling algorithm that is suitable for large-scale IoT applications. Finally, we validated our theoretical results through extensive simulations.</p><p>APPENDIX A PROOF OF <ref type="bibr">PROPOSITION 3</ref> Since the channel states on each channel satisfy Assumption 1, the IJST Algorithm selects the K largest Q i,j [t]C i,j among N users on channel j+1 (j =0, 1,...,M-1) in the stochastic order sense given Q</p><p>where {i * k,j [t]} K k=1 are the users selected by the IJST Algorithm for reporting their channel states and i/ &#8712;{i * k,j [t]} K k=1 . Since the IPC Algorithm independently picks a user at uniformly random on each channel in each time slot, for any given &#947;&gt;0, there exists a D &#947;,j &gt; 0 such that</p><p>Under the IPC Algorithm, we have max k=1,2,...,K-1</p><p>&#8805; st max k=1,2,...,K-1</p><p>&#8805; st max k=1,2,...,K-1</p><p>where step (a) follows the definition of the IPC Algorithm, and (b) uses the fact that at most c max packets can be served on each channel in each time slot.</p><p>Without loss of generality, we assume that &#964; m1 &gt;&#964; m2 &gt; ...&gt;&#964; m K-1 , where (m 1 ,m 2 ,...,m K-1 ) is a permutation of (1, 2,...,K -1). Hence, we have  </p><p>where step (a) uses the definition of &#964; k , &#8704;k =1, 2,.. </p><p>which implies that  </p><p>where step (a) is true for B 1 (1 -&#947;) M j=1 G &#947;,j and follows from the fact that</p><p>i,j =1if i &#8712;{ i * k,j ,k = 1, 2,...,K-1}. This indicates that the selected weight by the IPC Algorithm is very close to the IJST Algorithm. The rest of the proof follows the standard Lyapunov arguments. The detailed proof is available in our technical report <ref type="bibr">[10]</ref>.</p></div></body>
		</text>
</TEI>
