Lecture 11. Information theoretical argument

An interesting technique for proving lower bounds is the information theoretical argument. Since its introduction by [CSWY01], information complexity has been studied extensively and applied to prove strong lower bounds in communication complexity. In this lecture, we first introduce/review some basic notions such as entropy and mutual information, and then show how to use them to prove a linear lower bound on the randomized communication complexity of the Disjointness function [BYJKS04], one of the most important functions in communication complexity.

1. Review of basic notions in information theory

Let's first review some basic concepts in information theory. All of them can be found in a standard textbook such as [CT06] (Chapter 1). Suppose that X is a discrete random variable with alphabet 𝒳 and probability mass function p(x) = Pr[X = x]. Following the standard notation in information theory, we sometimes also use the capital letter X to denote the distribution. By writing x ← X, we mean drawing a sample x from the distribution X. The entropy H(X) of a discrete random variable X is defined by

    H(X) = − ∑_{x∈𝒳} p(x) log₂ p(x).

(Here we use the convention that 0 log₂ 0 = 0.) One intuition is that entropy measures the amount of uncertainty of a random variable: the more entropy X has, the more uncertain it is. In particular, if H(X) = 0 then X is fixed/deterministic. The maximum entropy of any random variable on 𝒳 is log₂|𝒳|:

    0 ≤ H(X) ≤ log₂|𝒳|,

with the upper bound achieved by the uniform distribution.
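To make the definition concrete, here is a minimal numerical sketch (my own illustration, not part of the lecture) in plain Python: it computes H(X) for a probability mass function given as a dictionary and checks the extreme cases of the bound 0 ≤ H(X) ≤ log₂|𝒳|.

```python
from math import log2

def entropy(p):
    """Shannon entropy H(X) = -sum_x p(x) log2 p(x), with the convention 0*log2(0) = 0."""
    return -sum(px * log2(px) for px in p.values() if px > 0)

# A deterministic variable has zero entropy.
print(entropy({'a': 1.0, 'b': 0.0, 'c': 0.0}))   # 0.0

# A biased coin has entropy strictly between 0 and 1.
print(entropy({'heads': 0.9, 'tails': 0.1}))     # ~0.469

# The uniform distribution on an alphabet of size 8 achieves the maximum log2(8) = 3.
uniform = {x: 1 / 8 for x in range(8)}
print(entropy(uniform), log2(len(uniform)))      # 3.0  3.0
```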
Sometimes we have more than one random variable. If (X, Y) is a pair of random variables on 𝒳 × 𝒴, distributed according to a joint distribution p(x, y), then the joint entropy is simply

    H(X, Y) = − ∑_{x∈𝒳, y∈𝒴} p(x, y) log₂ p(x, y).

We can also define the conditional entropy H(Y|X) = E_{x←X}[H(Y|X = x)], namely the expected entropy of the conditional distribution Y|X = x. Thus H(Y|X) ≥ 0. Another basic fact is the chain rule:

    H(X, Y) = H(X) + H(Y|X).

The chain rule also works with conditioning:

    H(X, Y|Z) = H(X|Z) + H(Y|X, Z).

And it extends to more variables:

    H(X_1, …, X_n|Z) = H(X_1|Z) + H(X_2|X_1, Z) + ⋯ + H(X_n|X_1, …, X_{n−1}, Z)
                     ≤ H(X_1|Z) + H(X_2|Z) + ⋯ + H(X_n|Z),

where the inequality is usually referred to as the subadditivity of entropy. The relative entropy of two distributions p and q is

    D(p||q) = ∑_x p(x) log₂ (p(x)/q(x)).

For a joint distribution (X, Y) = p(x, y), the mutual information between X and Y is

    I(X; Y) = D(p(x, y) || p(x)p(y)) = H(X) − H(X|Y).

It is a good exercise to verify that the above equality holds. The latter expression has a clear interpretation: the mutual information is the amount of uncertainty of X minus the amount remaining when Y is known; in this way, it measures how much information Y contains about X. It is not hard to verify that mutual information is symmetric, I(X; Y) = H(X) − H(X|Y) = H(Y) − H(Y|X), and nonnegative, I(X; Y) ≥ 0. Since conditional entropies are also nonnegative, I(X; Y) ≤ min{H(X), H(Y)}.

The conditional mutual information is defined by

    I(X; Y|D) = E_d[I(X; Y|D = d)].
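The identities above (the chain rule and the two expressions for mutual information) are easy to verify numerically. The following sketch (again my own illustration; the small joint pmf is arbitrary) computes H(X, Y), the marginals, H(Y|X), and D(p(x, y)||p(x)p(y)), and checks that the chain rule and the two expressions for I(X; Y) agree.

```python
from math import log2
from collections import defaultdict

def H(p):
    """Entropy of a pmf given as {outcome: probability}."""
    return -sum(v * log2(v) for v in p.values() if v > 0)

# A small joint distribution p(x, y) on {0,1} x {0,1,2}.
pxy = {(0, 0): 0.30, (0, 1): 0.10, (0, 2): 0.10,
       (1, 0): 0.05, (1, 1): 0.25, (1, 2): 0.20}

# Marginals p(x) and p(y).
px, py = defaultdict(float), defaultdict(float)
for (x, y), v in pxy.items():
    px[x] += v
    py[y] += v

# Conditional entropy H(Y|X) = sum_x p(x) H(Y | X = x).
HYgivenX = sum(px[x] * H({y: pxy[(x, y)] / px[x] for y in py}) for x in px)

# Relative entropy D(p(x,y) || p(x)p(y)).
D = sum(v * log2(v / (px[x] * py[y])) for (x, y), v in pxy.items() if v > 0)

print(H(pxy), H(px) + HYgivenX)        # chain rule: the two values agree
print(D, H(px) - (H(pxy) - H(py)))     # I(X;Y) = D(p(x,y)||p(x)p(y)) = H(X) - H(X|Y)
```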
It is not hard to see that

    I(X; Y|D) ≤ min{H(X|D), H(Y|D)} ≤ log₂|𝒳|

and

    I(X; Y|D) = H(X|D) − H(X|YD).

Lemma 1.1. If X = X_1…X_n, Y = Y_1…Y_n, and D = D_1…D_n, and the triples (X_i, Y_i, D_i) are independent across i, then

    I(X; Y|D) ≥ ∑_{i=1}^n I(X_i; Y|D).

Proof.

    I(X; Y|D) = H(X|D) − H(X|YD)
              = ∑_i H(X_i|D) − H(X|YD)         // H(X|D) = ∑_i H(X_i|D): independence of the (X_i, D_i)'s
              ≥ ∑_i H(X_i|D) − ∑_i H(X_i|YD)   // H(X|YD) ≤ ∑_i H(X_i|YD): subadditivity of entropy
              = ∑_{i=1}^n I(X_i; Y|D).                                                              □
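As a sanity check on Lemma 1.1, the sketch below (my own, under the assumption that the exact pmfs fit in a dictionary) builds the joint distribution of X = X₁X₂, Y = Y₁Y₂, D = D₁D₂ from two independent copies of a deliberately correlated pmf on (X_i, Y_i, D_i), and verifies that I(X; Y|D) ≥ I(X₁; Y|D) + I(X₂; Y|D).

```python
from math import log2
from collections import defaultdict
from itertools import product

def H(p):
    return -sum(v * log2(v) for v in p.values() if v > 0)

def marginal(p, keep):
    """Marginalize a pmf over tuples, keeping the listed component indices."""
    q = defaultdict(float)
    for outcome, v in p.items():
        q[tuple(outcome[i] for i in keep)] += v
    return q

def cond_mi(p, A, B, C):
    """I(A;B|C) = H(A,C) + H(B,C) - H(A,B,C) - H(C); A, B, C are lists of component indices."""
    return H(marginal(p, A + C)) + H(marginal(p, B + C)) - H(marginal(p, A + B + C)) - H(marginal(p, C))

# One coordinate: a correlated pmf on (x_i, y_i, d_i); the two coordinates are independent copies.
mu = {(0, 0, 0): 0.2, (1, 1, 0): 0.3, (0, 1, 1): 0.25, (1, 0, 1): 0.25}
joint = {}   # outcome = (x1, x2, y1, y2, d1, d2)
for (x1, y1, d1), (x2, y2, d2) in product(mu, mu):
    joint[(x1, x2, y1, y2, d1, d2)] = mu[(x1, y1, d1)] * mu[(x2, y2, d2)]

X, Y, D = [0, 1], [2, 3], [4, 5]
lhs = cond_mi(joint, X, Y, D)
rhs = cond_mi(joint, [0], Y, D) + cond_mi(joint, [1], Y, D)
print(lhs, rhs, lhs >= rhs - 1e-12)   # the superadditivity of Lemma 1.1
```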
2. Linear lower bound for the Disjointness function

Information complexity is an approach to proving lower bounds on communication complexity. The basic idea is that for some functions, any small-error protocol has to contain enough information about the input; the communication cost, namely the length of the protocol's transcript, must therefore also be large.

We now focus on the Disjointness function, where Alice and Bob hold inputs x, y ∈ {0,1}^n respectively and f(x, y) = 1 iff ∃ i ∈ [n], x_i = y_i = 1. We sometimes use the subscript −i to denote the set [n] − {i}. Denote by R_ε(f) the ε-error private-coin communication complexity of f. We want to prove the following.

Theorem 2.1. R_ε(Disj_n) = Ω(n).

First, we define a distribution on inputs. Again, we use capital letters (X, Y) to denote both the random variables and the distribution. We also introduce a third random variable D with sample space {0,1}^n. The joint distribution of (X, Y, D) is μ^n, where μ puts probability 1/4 on each of the triples

    (x_i, y_i, d_i) ∈ {(0,0,0), (0,1,0), (0,0,1), (1,0,1)}.

That is, the triples (X_i, Y_i, D_i) for different i are independent, and each (X_i, Y_i, D_i) is distributed according to μ.

Now take a private-coin protocol with minimum worst-case communication cost among all protocols that have ε-error in the worst case. Consider the messages over all rounds, namely the entire transcript of the conversation; denote it by Γ, and its length (i.e. the number of bits) by |Γ|. Since the input (X, Y) is a random variable and the protocol is randomized, the induced transcript Γ is also a random variable. By the relation among mutual information, entropy, and the size of the sample space, we have

    R_ε(Disj_n) = |Γ| ≥ H(Γ|D) ≥ I(Γ; XY|D).

Note that both X and Y can be decomposed into n bits: X = X_1…X_n and Y = Y_1…Y_n. Each X_i and Y_i is also a random variable, and the triples (X_i, Y_i, D_i) for different i are independent. Thus we can apply Lemma 1.1 (with X_iY_i playing the role of the i-th coordinate and Γ playing the role of Y there) and get

    I(Γ; XY|D) ≥ ∑_{i=1}^n I(Γ; X_iY_i|D).

Since I(Γ; X_iY_i|D) = E_{d_{−i}}[I(Γ; X_iY_i|D_i, D_{−i} = d_{−i})], we have

    I(Γ; XY|D) ≥ ∑_{i=1}^n E_{d_{−i}}[I(Γ; X_iY_i|D_i, D_{−i} = d_{−i})].

For each fixed d_{−i} ∈ {0,1}^{n−1}, we design a worst-case ε-error private-coin protocol Γ_i(d_{−i}) for the function g(x_i, y_i) = x_i ∧ y_i, as follows. Note that the protocol should work for all possible inputs (x_i, y_i), not only those in the support of (X_i, Y_i).

Γ_i(d_{−i}): On input (x_i, y_i),
1. Generate (X'_{−i}, Y'_{−i}) from μ^{n−1} | D_{−i} = d_{−i}, with Alice sampling X'_{−i} and Bob sampling Y'_{−i} using their private coins (see the sketch below).
2. Run protocol Γ on (x_i X'_{−i}, y_i Y'_{−i}) and output its answer.
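Step 1 deserves a remark: since μ conditioned on d_j is a product distribution over (x_j, y_j), Alice can sample X'_{−i} and Bob can sample Y'_{−i} separately with private coins, without any communication. Here is a minimal sketch of the reduction (my own illustration; `Gamma` stands for a hypothetical black-box implementation of the original protocol Γ and is not defined here).

```python
import random

def sample_alice_bits(d_rest, rng):
    """Alice's private sampling of X'_{-i}: if d_j = 0 then x_j = 0; if d_j = 1 then x_j is a uniform bit."""
    return [rng.randint(0, 1) if d == 1 else 0 for d in d_rest]

def sample_bob_bits(d_rest, rng):
    """Bob's private sampling of Y'_{-i}: if d_j = 0 then y_j is a uniform bit; if d_j = 1 then y_j = 0."""
    return [rng.randint(0, 1) if d == 0 else 0 for d in d_rest]

def Gamma_i(x_i, y_i, i, d_rest, Gamma):
    """The protocol Gamma_i(d_{-i}) for g(x_i, y_i) = x_i AND y_i.

    `Gamma(x, y)` is assumed to be the original eps-error private-coin protocol for Disj_n,
    run here as a black box on the embedded n-bit inputs."""
    ra, rb = random.Random(), random.Random()   # the two players' private coins
    x_rest = sample_alice_bits(d_rest, ra)      # Alice's part of the embedding
    y_rest = sample_bob_bits(d_rest, rb)        # Bob's part of the embedding
    x = x_rest[:i] + [x_i] + x_rest[i:]         # insert the real inputs at coordinate i
    y = y_rest[:i] + [y_i] + y_rest[i:]
    return Gamma(x, y)                          # output whatever Gamma outputs
```

Because each player's samples depend only on d_{−i} and that player's own coins, Γ_i(d_{−i}) uses no shared randomness; this is exactly item 1 of Lemma 2.2 below.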
Lemma 2.2.
1. Γ_i(d_{−i}) is private-coin.
2. Γ_i(d_{−i}) has ε-error in the worst case.
3. I(Γ; X_iY_i|D_i, D_{−i} = d_{−i}) = I(Γ_i(d_{−i}); X_iY_i|D_i).

Proof.
1. Note that once d_{−i} is fixed, each (X_j, Y_j) (j ∈ [n] − {i}) is a product distribution (over Alice's and Bob's spaces). Indeed, if d_j = 0 then (X_j, Y_j) ∼ 0 × U, where U is the uniform distribution on {0,1}; if d_j = 1 then (X_j, Y_j) ∼ U × 0. So Alice can sample X'_{−i} and Bob can sample Y'_{−i} with their own private coins.
2. Since μ only puts weight on pairs with x_j ∧ y_j = 0, we have Disj_n(x_i X'_{−i}, y_i Y'_{−i}) = x_i ∧ y_i, and thus Γ_i(d_{−i}) is correct on (x_i, y_i) iff Γ is correct on (x_i X'_{−i}, y_i Y'_{−i}). Hence the error probability of Γ_i(d_{−i}) on (x_i, y_i) is the average (over X'_{−i} and Y'_{−i}) of the error probability of Γ on (x_i X'_{−i}, y_i Y'_{−i}). Since Γ has error probability at most ε on all inputs, so does Γ_i(d_{−i}) on all its possible inputs (x_i, y_i).
3. Γ_i(d_{−i}) does nothing but invoke Γ; when (x_i, y_i) is distributed according to (X_i, Y_i)|D_i, the input fed to Γ is distributed exactly as (X, Y) conditioned on D_{−i} = d_{−i}, so the two mutual information quantities coincide. □

Now we have

    I(Γ; XY|D) ≥ ∑_{i=1}^n E_{d_{−i}}[I(Γ_i(d_{−i}); X_iY_i|D_i)].

Next we show that any worst-case ε-error private-coin protocol for g has to contain a constant amount of information about (X_i, Y_i) (conditioned on D_i).

Lemma 2.3. For any worst-case ε-error private-coin protocol for g with transcript Π,

    I(Π; X_iY_i|D_i) = Ω(1).

Proof. Denote the distribution of the transcript on input (a, b) by Π_{ab}. First, we show that if the protocol doesn't contain enough information, then Π_{00} is close to both Π_{01} and Π_{10}, and thus Π_{01} is close to Π_{10}. Closeness is measured by the Hellinger distance:

    h(p, q)² = 1 − ∑_i √(p_i q_i) = ½ ∑_i (√p_i − √q_i)².

We have

    I(Π; X_iY_i|D_i) = ½ I(Π; X_iY_i|D_i = 0) + ½ I(Π; X_iY_i|D_i = 1)
                     ≥ ½ h(Π_{00}, Π_{01})² + ½ h(Π_{00}, Π_{10})²   // for a uniform bit B and a sample Z ∼ p_B, I(B; Z) ≥ h(p_0, p_1)²
                     ≥ ¼ h(Π_{10}, Π_{01})²                          // Cauchy-Schwarz and the triangle inequality for h

(For the first inequality: conditioned on D_i = 0 we have X_i = 0 and Y_i uniform, so the transcript is distributed as Π_{0B} for a uniform bit B = Y_i; the case D_i = 1 is symmetric.)
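To get a feel for these quantities, the following sketch (my own, not from the lecture) checks numerically, on random distributions, that the two expressions for h(p, q)² agree, that the averaging step ½h(p, q)² + ½h(p, r)² ≥ ¼h(q, r)² holds, and also the fact relating total variation and Hellinger distance that is used at the end of the proof.

```python
import random
from math import sqrt

def hellinger_sq(p, q):
    """h(p, q)^2 = 1 - sum_i sqrt(p_i q_i)."""
    return 1 - sum(sqrt(pi * qi) for pi, qi in zip(p, q))

def random_dist(k, rng):
    w = [rng.random() for _ in range(k)]
    s = sum(w)
    return [x / s for x in w]

rng = random.Random(0)
for _ in range(1000):
    p, q, r = (random_dist(5, rng) for _ in range(3))
    # The two formulas for h^2 agree: 1 - sum sqrt(p q) = (1/2) sum (sqrt(p) - sqrt(q))^2.
    alt = 0.5 * sum((sqrt(pi) - sqrt(qi)) ** 2 for pi, qi in zip(p, q))
    assert abs(hellinger_sq(p, q) - alt) < 1e-9
    # The step used above: 1/2 h(p,q)^2 + 1/2 h(p,r)^2 >= 1/4 h(q,r)^2.
    assert 0.5 * hellinger_sq(p, q) + 0.5 * hellinger_sq(p, r) >= 0.25 * hellinger_sq(q, r) - 1e-9
    # The fact used at the end of the proof: (1/2)||p - q||_1 <= h(p,q) sqrt(2 - h(p,q)^2).
    l1 = sum(abs(pi - qi) for pi, qi in zip(p, q))
    h2 = hellinger_sq(p, q)
    assert 0.5 * l1 <= sqrt(h2) * sqrt(2 - h2) + 1e-9
print("all checks passed")
```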
Then we show that any communication protocol enjoys the following "cut-and-paste" property: h(Π_{xy}, Π_{x'y'}) = h(Π_{xy'}, Π_{x'y}) for any x, y, x', y'. In particular, taking (x, y) = (1, 0) and (x', y') = (0, 1) gives h(Π_{10}, Π_{01}) = h(Π_{11}, Π_{00}), so Π_{00} is close to Π_{11}.

The property is proven by writing down Pr[Π_{xy} = γ] and Pr[Π_{x'y'} = γ] for any fixed transcript γ. It is not hard to see that Pr[Π_{xy} = γ] is a product of some function of (x, γ) and some function of (y, γ). So by regrouping the factors, we have

    Pr[Π_{xy} = γ] Pr[Π_{x'y'} = γ] = Pr[Π_{x'y} = γ] Pr[Π_{xy'} = γ].

Then, by the multiplicative nature of the definition of h(p, q), one gets h(Π_{xy}, Π_{x'y'}) = h(Π_{xy'}, Π_{x'y}).

Now the contradiction comes: g(0,0) ≠ g(1,1), so Π_{00} should be far from Π_{11}. Indeed, since the output is determined by the transcript, the best probability gap with which one can distinguish Π_{00} from Π_{11} is their total variation distance ½‖Π_{00} − Π_{11}‖_1, which must be at least 1 − 2ε for a worst-case ε-error protocol; on the other hand, it is upper bounded in terms of the Hellinger distance by the following fact:

    ½‖p − q‖_1 ≤ h(p, q) √(2 − h(p, q)²).

Combining the two bounds shows that h(Π_{00}, Π_{11})², and hence I(Π; X_iY_i|D_i), is bounded below by a positive constant depending only on ε. □

Putting everything together, we get

    R_ε(Disj_n) = |Γ| ≥ I(Γ; XY|D)
                ≥ ∑_{i=1}^n E_{d_{−i}}[I(Γ; X_iY_i|D_i, D_{−i} = d_{−i})]
                = ∑_{i=1}^n E_{d_{−i}}[I(Γ_i(d_{−i}); X_iY_i|D_i)]
                ≥ ∑_{i=1}^n Ω(1) = Ω(n). □

References

[BYJKS04] Ziv Bar-Yossef, T. S. Jayram, Ravi Kumar, and D. Sivakumar. An information statistics approach to data stream and communication complexity. Journal of Computer and System Sciences, 68(4), pp. 702-732, 2004.

[CSWY01] Amit Chakrabarti, Yaoyun Shi, Anthony Wirth, and Andrew Yao. Informational complexity and the direct sum problem for simultaneous message complexity. In Proceedings of the 42nd Annual Symposium on Foundations of Computer Science, pp. 270-278, 2001.

[CT06] Thomas Cover and Joy Thomas. Elements of Information Theory, Second Edition. Wiley-Interscience, 2006.