The Chinese University of Hong Kong: Theory of Computational Complexity, course teaching resources (lecture notes), Lecture 4: Multiparty Communication Complexity



2. Number in Hand, with a blackboard

2.1 From rectangle to tensor

The counterpart of a rectangle in the NiH case is a tensor. A tensor in $X = X_1 \times \dots \times X_k$ is a set $S = S_1 \times \dots \times S_k$ where $S_i \subseteq X_i$. The following is a straightforward generalization of the rectangle property.

Proposition 2.1. Any $c$-bit deterministic protocol for a function $f$ in the NiH model partitions the set $X$ into at most $2^c$ monochromatic tensors.

We sometimes consider partial functions, which are functions whose domain is not the entire $X$ but only a subset of it. Equivalently, we can imagine a promise that the input comes from a fixed subset of $X$. For inputs that fall outside the subset, we don't care. A protocol is correct for a partial function if it outputs the correct value on inputs coming from the subset; it is allowed to output anything on other inputs. For partial functions, a tensor is monochromatic if it doesn't contain both 1-inputs and 0-inputs. Indeed, once a protocol reaches a monochromatic tensor, it can output a value that is right for all inputs on which the function is defined.

2.2 Disjointness

Denote the input by $(x_1, \dots, x_k)$ where each $x_i = x_{i1} \dots x_{in}$ is an $n$-bit string. Recall that $\mathrm{Disj}_n$ in the two-party case is defined as, in our current notation, $\mathrm{Disj}_n(x_1, x_2) = 1$ iff $\exists j$ s.t. $x_{1j} = x_{2j} = 1$. When generalized to the $k$-party case, different requirements can be imposed: we can require $\exists j$ s.t. $x_{1j} = \dots = x_{kj} = 1$, or only require that the sets $x_1, \dots, x_k$ are not pairwise disjoint. Let's consider the partial function

$$\mathrm{Disj}_{n,k}(x_1, \dots, x_k) = \begin{cases} 1 & \text{if } x_1, \dots, x_k \text{ are pairwise disjoint,} \\ 0 & \text{if } \exists j \text{ s.t. } x_{1j} = \dots = x_{kj} = 1. \end{cases}$$

Theorem 2.2. $D(\mathrm{Disj}_{n,k}) = \Omega(n/k)$.

Proof. We show the following two properties.

1. There are $(k+1)^n$ 1-inputs. A 1-input is basically a partition of $[n]$ into $k+1$ parts, with the $i$-th part given to player $i$ and the last part not assigned to any player.

2. Any monochromatic tensor without a 0-input contains at most $k^n$ 1-inputs. Fix such a tensor $S_1 \times \dots \times S_k$. For any element $j \in [n]$, there is a player $i$ such that no input in $S_i$ contains $j$; otherwise the tensor would contain a 0-input. So the number of 1-inputs in the tensor is at most the number of assignments of elements in $[n]$ to $[k]$, namely $k^n$.
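Property 1 can be sanity-checked by brute force for small parameters. The sketch below is illustrative only: the function names are not from the lecture, and elements are numbered from 0 rather than 1. It enumerates all $k$-tuples of subsets of an $n$-element set and counts those that are pairwise disjoint.

```python
from itertools import combinations, product


def is_one_input(sets):
    """True when the k subsets are pairwise disjoint (a 1-input of Disj_{n,k})."""
    return all(a.isdisjoint(b) for a, b in combinations(sets, 2))


def count_one_inputs(n, k):
    """Brute-force count of k-tuples of pairwise disjoint subsets of {0, ..., n-1}."""
    # Enumerate all 2^n subsets via bitmasks (hypothetical helper encoding).
    subsets = [
        frozenset(j for j in range(n) if (mask >> j) & 1)
        for mask in range(2 ** n)
    ]
    return sum(is_one_input(t) for t in product(subsets, repeat=k))


if __name__ == "__main__":
    for n, k in [(3, 2), (2, 3), (4, 2)]:
        # The count should agree with (k+1)^n from Property 1.
        print(n, k, count_one_inputs(n, k), (k + 1) ** n)
```

The enumeration takes $2^{nk}$ steps, so it is only meant for very small $n$ and $k$.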


Then, there should be at least $\frac{(k+1)^n}{k^n} = \left(1 + \frac{1}{k}\right)^n \approx e^{n/k}$ monochromatic tensors. Therefore, the theorem follows from the previous proposition. □

2.3 Computing frequency moments in data stream

Suppose that there is a stream of data $x = (x_1, \dots, x_m) \in [n]^m$. We hope to compute some statistical value of the data using a small amount of space, ideally $O(\log n + \log m)$ bits. The requirement for the algorithms is that we only see the data pass once in front of us. So we can process $x_1$ in whatever way we like, but then it's gone and never comes back, and we see $x_2$ and process $x_2$, and so on. Since it was formally introduced in [AMS99], the model has developed rapidly over the last decade or so.

To see why techniques can sometimes be used to achieve low space complexity, let's consider some interesting puzzles first.

Example. Suppose $m = n-1$, and all $x_i$'s are different. Then there is exactly one number in $[n]$ that is missing from the data. We want to find the missing number. What's the lowest space your algorithm can achieve?

Example. What if $m = n-2$ and we want to find the two missing numbers?

From the above examples, you can see that algorithms can be crafty, which also raises the issue of proving lower bounds. Communication complexity is one powerful tool for proving lower bounds on the space complexity of streaming algorithms. Here we illustrate the main idea using one classic example.

In the data $x = (x_1, \dots, x_m) \in [n]^m$, suppose the number $j$ appears $r_j$ times. Define the $d$-th frequency moment of $x$ to be $f_d(x) = r_1^d + \dots + r_n^d$. In particular, $f_0(x)$ is the number of distinct elements in $x$, and $f_1(x) = m$. For $d \ge 2$, $f_d(x)$ gives useful statistical information about the string.

Theorem 2.3. For $d \ge 3$, $D(f_d) = \Omega(n^{1-2/d})$.

Proof. The idea is, as always, to design a protocol that simulates an algorithm. Suppose that there is an algorithm $A$ computing $f_d$ using only $c$ bits of space. Let $k = (n+1)^{1/d}$. We want to solve the $\mathrm{Disj}_{n,k}$ problem on input $(y_1, \dots, y_k)$. Note that each $y_i$ is a subset of $[n]$. Player 1 gives the elements in $y_1$ to $A$ as a stream of data items and runs $A$. When $A$ finishes reading these elements, Player 1 passes the contents of $A$'s memory to Player 2, who takes over and continues to run $A$ on the elements in $y_2$, and so on. Since $A$ always uses $c$ bits of space, the communication is at most $(k-1)c$ bits. Since the items $y_i$ come in a
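For reference, the definitions in Section 2.3 can be made concrete with a short sketch. It shows one possible answer to the first puzzle (a single running sum, an approach assumed here rather than stated in the notes) and a direct, non-space-efficient evaluation of $f_d$ that only serves to pin down the definition. The function names are illustrative.

```python
from collections import Counter


def missing_number(stream, n):
    """One-pass solution sketch for the first puzzle.

    Assumes the stream holds n-1 distinct values from {1, ..., n}.
    Only one running integer is stored, bounded by n(n+1)/2,
    so the state fits in O(log n) bits.
    """
    remaining = n * (n + 1) // 2
    for x in stream:
        remaining -= x
    return remaining


def frequency_moment(stream, d):
    """Exact f_d(x) = sum over j of r_j^d, where r_j counts occurrences of j.

    This stores all counts, so it is NOT a small-space streaming algorithm;
    it is only a reference implementation of the definition.
    """
    return sum(r ** d for r in Counter(stream).values())


if __name__ == "__main__":
    print(missing_number([1, 2, 4, 5], 5))
    print([frequency_moment([1, 1, 2, 3], d) for d in range(3)])
```

Only elements that actually occur contribute to the sum, which matches the convention that $f_0$ counts distinct elements.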

where the maximum is taken over all cylinder intersections $S$.

Theorem 3.2. $D(f) \ge \log_2 \frac{1}{\mathrm{disc}(f)}$.

Proof. For any monochromatic cylinder intersection $C$, we have $\mathrm{disc}(f, C) = |C|/|X|$. Therefore $|C| = \mathrm{disc}(f, C)\,|X| \le \mathrm{disc}(f)\,|X|$, and thus we need at least $1/\mathrm{disc}(f)$ monochromatic cylinder intersections to cover the entire $X$. Applying Theorem 3.1 gives the claimed bound. □

The discrepancy bound is not easy to use, since one needs to argue about all cylinder intersections. An easier bound is the binary cube bound. For any $a_i, b_i \in X_i$, define a binary cube $D = (a_1, b_1) \times \dots \times (a_k, b_k)$. Note that here $a_i$ and $b_i$ may be the same, so $D$ is a multi-set. Define $f(D) = \prod_{x \in D} f(x)$. Choose $a_i$ and $b_i$ uniformly at random from $X_i$, and define $B(f) = \mathbb{E}[f(D)]$. The following theorem was first given by Chung [Chu90]; see [Raz00] for a simplified proof.

Theorem 3.3. $B(f) \ge \mathrm{disc}(f)^{2^k}$, thus $D(f) \ge \frac{1}{2^k} \log \frac{1}{B(f)}$.

3.3 A specific function: GIP

The Generalized Inner Product (GIP) function is defined as follows.

$$\mathrm{GIP}_{n,k}(x) = \bigoplus_{i=1}^{n} \bigwedge_{j=1}^{k} x_{ij}.$$

Namely, the input $x$ is an $n \times k$ matrix, and each of the $k$ players gets all but one column.

Theorem 3.4. $D(\mathrm{GIP}_{n,k}) = \Omega(n 4^{-k})$.

Proof. We'll show $B(f) = (1 - 2^{1-k})^n \approx e^{-n/2^{k-1}}$, which, combined with Theorem 3.3, gives the claimed bound. Since the binary cube bound works for the $\{+1, -1\}$ range, let's first change $f$ to $g = (-1)^f$. Note that the change doesn't affect the communication complexity. We need to compute $B(g) = \mathbb{E}_{a_i, b_i \in X_i}[g(D)]$ where $D = (a_1, b_1) \times \dots \times (a_k, b_k)$. Below, $a_{ij}$ and $b_{ij}$ denote the $i$-th bits of $a_j$ and $b_j$.

$$\begin{aligned}
g(D) &= \prod_{x \in D} g(x) \\
&= \prod_{x \in D} (-1)^{\sum_{i} x_{i1} \cdots x_{ik}} \\
&= \prod_{x \in D} \prod_{i=1}^{n} (-1)^{x_{i1} \cdots x_{ik}} \\
&= \prod_{i=1}^{n} \prod_{x \in D} (-1)^{x_{i1} \cdots x_{ik}} \\
&= \prod_{i=1}^{n} (-1)^{\sum_{x \in D} x_{i1} \cdots x_{ik}} \\
&= \prod_{i=1}^{n} (-1)^{(a_{i1} + b_{i1}) \cdots (a_{ik} + b_{ik})} \\
&= \prod_{i=1}^{n} (-1)^{\mathbf{1}[a_{ij} \ne b_{ij},\ \forall j]}
\end{aligned}$$
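The value $B(g) = (1 - 2^{1-k})^n$ claimed in the proof of Theorem 3.4 can be checked exhaustively for tiny parameters. The following is a minimal sketch, not part of the lecture: the function names are illustrative, and each $X_j$ is assumed to be $\{0,1\}^n$ (column $j$ of the input matrix). It averages $g(D)$ over all choices of $a_j, b_j$ with exact rational arithmetic.

```python
from fractions import Fraction
from itertools import product


def gip(columns):
    """GIP on k columns, each an n-bit tuple: XOR over rows of the AND across columns."""
    n = len(columns[0])
    return sum(all(col[i] for col in columns) for i in range(n)) % 2


def binary_cube_value(n, k):
    """Exact B(g) for g = (-1)^GIP, averaged over all a_j, b_j in {0,1}^n."""
    strings = list(product((0, 1), repeat=n))
    total = 0
    count = 0
    # pairs[j] = (a_j, b_j) for column j.
    for pairs in product(product(strings, repeat=2), repeat=k):
        g_of_d = 1
        # D is the multiset of 2^k inputs that pick a_j or b_j in each column.
        for x in product(*pairs):
            g_of_d *= -1 if gip(x) else 1
        total += g_of_d
        count += 1
    return Fraction(total, count)


if __name__ == "__main__":
    for n, k in [(1, 2), (2, 2), (2, 3)]:
        # Compare the exhaustive average with the closed form (1 - 2^(1-k))^n.
        print(n, k, binary_cube_value(n, k), (1 - Fraction(2, 2 ** k)) ** n)
```

The loop visits $4^{nk}$ pairs of columns, so it is feasible only for very small $n$ and $k$.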
