<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Keyless Covert Communication in the Presence of Non-causal Channel State Information</title></titleStmt>
			<publicationStmt>
				<publisher></publisher>
				<date>2019 August</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10112769</idno>
					<idno type="doi"></idno>
					<title level='j'>IEEE Information Theory Workshop</title>
<idno></idno>
<biblScope unit="volume"></biblScope>
<biblScope unit="issue"></biblScope>					

					<author>H. Zivari-Fard</author><author>M. Bloch</author><author>A. Nosratinia</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[We consider the problem of covert communicationover a state-dependent channel, for which the transmitter andthe legitimate receiver have non-causal access to the channel stateinformation. Covert communication with respect to an adversary,referred to as the “warden,” is one in which the distributioninduced during communication at the channel output observed bythe warden is identical to the output distribution conditioned onan inactive channel-input symbol. Covert communication involvesfooling an adversary in part by a proliferation of codebooks;for reliable decoding at the legitimate receiver the codebookuncertainty is removed via a shared secret key that is unavailableto the warden. Unlike earlier work in state-dependent covertcommunication, we do not assume the availability of a sharedkey at the transmitter and legitimate receiver. Rather, a sharedrandomness is extracted at the transmitter and the receiver fromthe channel state, in a manner that keeps the shared randomnesssecret from the warden despite the influence of the channel stateon the warden’s output. An inner bound on the covert capacity,in the absence of an externally provided secret key, is derived.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>I. INTRODUCTION</head><p>Covert communication refers to scenarios in which reliable communication over a channel must occur while simultaneously ensuring that a separate channel output at a node called the warden has a distribution identical to that induced by an inactive channel symbol <ref type="bibr">[1]</ref>- <ref type="bibr">[4]</ref>. For discrete memoryless channels (DMC), the inactive input symbol is denoted x 0 . It is known that in a point-to-point DMC without state, if the output distribution (at the warden) induced by x 0 is a convex combination of the output distributions generated by the other input symbols, then it is possible to achieve a positive rate; otherwise the number of possible bits that can be reliably and covertly communicated over n channel transmissions scales at most as O( &#8730; n). This result has motivated the study of other models in which positive rates are achievable.</p><p>Of particular relevance to the present work, Lee et al. <ref type="bibr">[5]</ref> have considered the problem of covert communication over a state dependent channel in which the channel state is known to the transmitter but unknown to the receiver and the warden. The authors derived the covert capacity when the transmitter and the receiver share a sufficiently long secret key and also derived a lower bound on the minimum length of the secret key needed to achieve the covert capacity. Given the presence of a channel state, one can wonder if covert communication</p><p>The work of H. ZivariFard and A. Nosratinia is supported by National Science Foundation (NSF) grant 1514050. The work of M. R. Bloch is supported by National Science Foundation (NSF) grant 1527387.</p><p>with positive rate is still possible without requiring an external secret key. In particular, several works demonstrate the benefits of exploiting common randomness and channel states to generate secret keys. For instance, the problem of stealth secret key generation from correlated sources has been studied in <ref type="bibr">[6]</ref>, <ref type="bibr">[7]</ref> and covert secret key generation has been studied in <ref type="bibr">[8]</ref>, <ref type="bibr">[9]</ref>. Most importantly, the usefulness of exploiting states for secrecy has been extensively investigated. The discrete memoryless wiretap channel with random states was first studied by Chen and Han Vinck <ref type="bibr">[10]</ref>, who studied a scenario in which the CSI is available only at the encoder. They established a lower bound on the secrecy-capacity based on a combination of wiretap coding with Gelfand-Pinsker coding. Generally speaking, the coding scheme with Channel State Information (CSI) outperforms the one without CSI, because perfect knowledge of the CSI not only enables the transmitter to beamform its signal toward the legitimate receiver but also provides a source of common randomness from which to generate a common secret key and enhance the secrecy capacity. Khisti et al. <ref type="bibr">[11]</ref> studied the problem of secret key generation from the channel state and established the secret key capacity. Chia and El Gamal studied the state-dependent wiretap channel with weak secrecy, proposing a scheme in which the transmitter and the receiver extract a key from the state, and protect the confidential message via a onetime-pad driven with the extracted key <ref type="bibr">[12]</ref> (see also <ref type="bibr">[13]</ref> and <ref type="bibr">[14]</ref>). Han and Sasaki <ref type="bibr">[15]</ref> subsequently extended the coding scheme to achieve strong secrecy. Goldfeld et al. <ref type="bibr">[16]</ref> proposed a superposition coding scheme for the problem of transmitting a semantically secure message over a state dependent channel while the channel state is available noncausally at the transmitter.</p><p>We study here the problem of covert communications over a state dependent discrete memoryless channel with channel state available non-causally at both the transmitter and the receiver (see Fig. <ref type="figure">1</ref>). The main innovation of this work is to show that the channel state can be used to simultaneously and efficiently accomplish two necessary tasks: using the channel state for a Gelfand-Pinsker style encoding to assist covert communication while also extracting shared randomness at the two legitimate terminals that is kept secret from the warden, to resolve the multiple codebooks that are necessary for covert communication. The shared randomness extracted from the state effectively takes the place of the external secret key in other models, thus generalizes and expands the applicability of covert communication. We derive an inner bound on the covert capacity of this problem, as well as a condition that shows when this scheme results in a positive rate.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>II. PRELIMINARIES</head><p>Throughout this paper, random variables are denoted by capital letters and their realizations by lower case letters. The set of -strongly jointly typical sequences of length n, according to P X,Y , is denoted by T</p><p>(n) (P X,Y ). For convenience, whenever there is no danger of confusion, typicality will reference the random variables rather than the distribution, e.g., we write T</p><p>denote the dimension of a vector, e.g., X n . The integer set {1, . . . , M } is denoted by 1, M , and</p><p>The total variation between probability mass functions (pmfs) is defined as ||q-p|| 1 = 1 2</p><p>x |p-q| and the Kullback-Leibler (KL) divergence between pmfs is defined as D(P ||Q) =</p><p>x p(x) log p(x) q(x) . The support of a probability distribution P is denoted by supp(P ). The n-fold product distribution constructed from the same distribution P is denoted</p><p>Consider a discrete memoryless state-dependent channel as shown in Fig. <ref type="figure">1</ref>. The finite sets (X , S, Y, Z) and the transition probability distribution W Y,Z|X,S are the constitutive components of this channel. Here, X is the channel input, Y and Z are the channel outputs at the legitimate receiver and the warden, respectively. All of these alphabets are finite. Let x 0 &#8712; X be a symbol corresponding to the case in which the transmitter is not communicating with the receiver. We assume that the channel state is independent and identically distributed (i.i.d.) and drawn according to</p><p>We define a non-negative cost g(x) for every channel input x &#8712; X . The average cost of an input sequence</p><p>The channel state is available non-causally at both the transmitter and the receiver (see Fig. <ref type="figure">1</ref>) while the warden does not have access to it. An (|M|, n) code consists of an encoder that maps (M, S n ) to X n &#8712; X n , and a decoder at the receiver that maps Y n to M &#8712; M. The transmitter and the receiver want to design a code that is both reliable and covert. The code is defined to be reliable if the probability of error P (n) e = P ( M = M ) goes to zero when n &#8594; &#8734;. The code is covert if the warden cannot determine whether communication is happening (hypothesis H 1 ) or not (hypothesis H 0 ). The probabilities of false alarm (accepting H 1 when the transmitter is not sending a meaningful information) and missed detection (accepting H 0 when the transmitter is sending a meaningful information), are denoted by &#945; and &#946;, respectively. We know that a blind test without looking at the channel output satisfies &#945; + &#946; = 1. If P Z n denotes the distribution induced at the warden's output when the transmitter sends a message, the optimal hypothesis test by the warden satisfies &#945; + &#946; &#8805; 1 -D(P Z n ||Q n 0 ) <ref type="bibr">[17]</ref>. Therefore, to show that the communication is covert it is sufficient to show that D(P</p><p>Consequently, our goal is to design a sequence of (2 nR , n) codes such that</p><p>We define the covert capacity as the supremum of all achievable covert rates and denote it by C NC .</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>III. MAIN RESULT</head><p>Our main result is a lower bound for the covert capacity, obtained by designing a coding scheme in which the transmitter not only generates a key from S but also selects its codeword according to S by using a likelihood encoder <ref type="bibr">[18]</ref>.</p><p>Theorem 1. The covert capacity of DMC W Y,Z|S,X with noncausal CSI at the transmitter and the receiver is lowerbounded as</p><p>where the maximum is distributions of the form</p><p>such that</p><p>Remark 1. Theorem 1 suggests that the key rate that we extract from channel state (i.e. H(S|Z)) should be at least as large as the Right Hand Side (RHS) of the (6).</p><p>Proof. We adapt a block-Markov encoding scheme over B &gt; 0 consecutive and dependent blocks. This scheme involves the transmission of B -1 independent messages over B channel block transmissions each of length r, indexed by j = 1, 2, ..., B, such that n = rB. Here, we assume B and B -1 are fixed and sufficiently large positive integers. Therefore, the warden's channel output observation Z n can be described as Z n = (Z r 1 , . . . , Z r B ) in which each Z r j for j &#8712; 1, B indicates the channel output in block j. Hence, the induced distribution by the coding scheme at output of the</p><p>Fig. <ref type="figure">2</ref>. Functional dependence graph for the block-Markov encoding scheme warden is P Z n P Z r 1 ,...,Z r B and the target output distribution is</p><p>where Z B,r j+1 = {Z r j+1 , . . . Z r B }. Therefore, to show</p><p>As shown in the following, we achieve this by constructing a code that approximates Q r Z in each block and show that the dependencies across blocks, created by block-Markov coding, can be eliminated. The random code generation is as follows:</p><p>Fix P U (u), P U |S (u|s), x(u, s), P S,U,X,Y,Z (s, u, x, y, z) = Q S (s)P U |S (u|s)1 {x(u,s)=x} W Y,Z|X,S , and</p><p>Codebook for Key Generation: For each block j &#8712; 1, B , to generate an efficient key K j of rate R K by using the channel state S r j , we create a function &#934;(S r j ) &#8712; 1, 2 rR K through random binning. The key K j = &#934;(S r j ) obtained in block j &#8712; 1, B is used to assist the encoder in the next block.</p><p>Codebook for Message Transmission: For each block j &#8712; 1, B and for each k j-1 &#8712; 1, 2 rR k , m j &#8712; 1, 2 rR , and j &#8712; 1, 2 rR , randomly and independently generate 2 rR codewords u r (k j-1 , m j , j ) according to r i=1 p(u i ). These constitute the codebook C n . The indices (k j-1 , m j , j ) can be seen as a two layer binning. We define an ideal Probability Mass Function (PMF) for codebook C n as</p><p>) where W Z|U,S is the marginal distribution of W Y,Z|U,S in <ref type="bibr">(5)</ref>.</p><p>Encoding: In the first block, we choose a codeword randomly and transmit it over the channel and use the CSI to generate a key k 1 for the next block. In block j &#8712; 2, B , the codeword U r will be chosen based on message, key and the current CSI; simultaneously we generate a key from CSI for the next block.</p><p>For the first block, the transmitter generates a key from CSI and the encoder chooses (k 0 , m 1 , 1 ) uniformly at random and finds codeword u r (k 0 , m 1 , 1 ) according to these indices. It then transmits a codeword x r where</p><p>For block j &#8712; 2, B , to send message m j according to the generated key k j-1 from the previous block and the CSI of the current block, the encoder selects index j from bin (k j-1 , m j ) according to the distribution</p><p>where p r S|U is defined from the ideal PMF in <ref type="bibr">(8)</ref>. Each coordinate of the transmitted signal is a function of the state, as well as the corresponding sample of the transmitter codeword u i , i.e., x i = x(u i (k j-1 , m j , j ), s i ).</p><p>For a fixed codebook C n , the induced joint distribution is</p><p>Covertness Analysis: Now, we show that this coding scheme guarantees that <ref type="bibr">19, eq. (323)</ref>]. From the expansion in <ref type="bibr">(7)</ref>, note that for every j &#8712; 2, B ,</p><p>where (a) is because Z r j -K j -Z B,r j+1 forms a Markov chain as seen in the functional dependence graph depicted in Fig. <ref type="figure">2</ref>. Next, note that</p><p>where Q Kj is the uniform distribution on 1, 2 rR K and (b) follows from</p><p>Therefore by combining ( <ref type="formula">12</ref>) and ( <ref type="formula">7</ref>), we have</p><p>By using the triangle inequality we have:</p><p>To bound the first term on the RHS of (15) note that</p><p>where ( <ref type="formula">18</ref>) is due to <ref type="bibr">(9)</ref>. Hence,</p><p>where (c) follows from the symmetry of the codebook construction with respect to M and K j-1 . Based on the soft covering lemma <ref type="bibr">[20,</ref><ref type="bibr">Corollary VII.5</ref>] the RHS of (21) vanishes if</p><p>To bound the second term on the RHS of (15) note that</p><p>where W r S,Z|U = P r S|U W r Z|U,S . Using Pinsker's inequality, it is sufficient to bound </p><p>where ( <ref type="formula">28</ref>) is redundant because of (27).</p><p>Decoding and Error Probability Analysis: By following the same steps as in <ref type="bibr">[5]</ref>, the probability of error vanishes when n grows if</p><p>Input Cost Analysis: The proof follows similar lines to [5, Sec V].</p><p>The region in Theorem 1 is derived by remarking that the scheme requires R K &#8805; R k and applying Fourier-Motzkin to (22), (27), and (29).</p><p>S r ,Z r ,U r (s r j , z r j , u r (k j-1 , m j , j )) &#215; 1 2 rR K &#215; log W r S,Z|U (s r j , z r j |u r (k j-1 , m j , j )) 2 r(R k +R+R -R K ) Q r Z (z r j ) + (k j-1 ,m j , j ) =(kj-1,mj , j )</p><p>Q r S,Z (s r j , z r j ) 2 r(R k +R+R -R K ) Q r Z (z r j ) + sr j =s r j W r S,Z|U (s r j , z r j |u r (k j-1 , m j , j )) 2 r(R k +R+R ) Q r Z (z r j )</p><p>&#936; 1 = kj kj-1 mj j 1 2 r(R k +R+R +R K ) (s r j ,z r j ,u r (kj-1,mj , j ))&#8712;T S r ,Z r ,U r (s r j , z r j , u r (k j-1 , m j , j ))</p><p>&#215; log W r S,Z|U (s r j , z r j |u r (k j-1 , m j , j )) 2 r(R k +R+R -R K ) Q r Z (z r j ) + (k j-1 ,m j , j ) =(kj-1,mj , j )</p><p>Q r S,Z (s r j , z r j ) 2 r(R k +R+R -R K ) Q r Z (z r j ) + sr j =s r j W r S,Z|U (s r j , z r j |u r (k j-1 , m j , j )) 2 r(R k +R+R ) Q r Z (z r j ) + 1</p><p>&#8804; log 2 rR K 2 -r(1-)H(S,Z|U ) 2 r(R k +R+R ) 2 -r(1+ )H(Z) + 2 rR K 2 -r(1-)H(S,Z) 2 -r(1+ )H(Z) + 2 -r(1-)H(Z|U ) 2 r(R k +R+R ) 2 -r(1+ )H(Z) + 1 (25)</p><p>S r ,Z r ,U r (s r j , z r j , u r (k j-1 , m j , j ))</p><p>&#215; log W r S,Z|U (s r j , z r j |u r (k j-1 , m j , j )) 2 r(R k +R+R -R K ) Q r Z (z r j ) + (k j-1 ,m j , j ) =(kj-1,mj , j )</p><p>Q r S,Z (s r j , z r j ) 2 r(R k +R+R -R K ) Q r Z (z r j ) + sr j =s r j W r S,Z|U (s r j , z r j |u r (k j-1 , m j , j )) 2 r(R k +R+R ) Q r Z (z r j ) + 1</p></div></body>
		</text>
</TEI>
