Reduction Program

// Assume the array has already been loaded into
// __shared__ float partialSum[]
unsigned int t = threadIdx.x;
for (unsigned int stride = 1; stride < blockDim.x; stride *= 2)
{
    __syncthreads();
    if (t % (2 * stride) == 0)
        partialSum[t] += partialSum[t + stride];
}
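To see the loop run end to end, here is a minimal sketch that wraps it in a complete kernel with a small host driver. Everything outside the loop itself is an assumption made for illustration: the names `blockSum`, `sumOnGpu`, and `BLOCK_SIZE`, the single-block launch, the fixed power-of-two block size of 8, and the omission of CUDA error checking.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Assumed for this sketch: one block whose size is a power of two,
// so that t + stride never indexes past the end of partialSum.
#define BLOCK_SIZE 8

// Hypothetical kernel name; the loop body is the reduction from the slide.
__global__ void blockSum(const float *in, float *out)
{
    __shared__ float partialSum[BLOCK_SIZE];

    unsigned int t = threadIdx.x;
    // Each thread loads one element into shared memory.
    partialSum[t] = in[blockIdx.x * blockDim.x + t];

    for (unsigned int stride = 1; stride < blockDim.x; stride *= 2) {
        // Wait until all loads / previous partial sums are visible.
        __syncthreads();
        // Only threads whose index is a multiple of 2*stride stay active.
        if (t % (2 * stride) == 0)
            partialSum[t] += partialSum[t + stride];
    }

    // After the last step, thread 0 holds the block's total.
    if (t == 0)
        out[blockIdx.x] = partialSum[0];
}

// Hypothetical host helper: sums exactly BLOCK_SIZE floats on the GPU.
// Error checking is omitted to keep the sketch short.
float sumOnGpu(const float *hostIn)
{
    float *dIn = nullptr, *dOut = nullptr, result = 0.0f;
    cudaMalloc(&dIn, BLOCK_SIZE * sizeof(float));
    cudaMalloc(&dOut, sizeof(float));
    cudaMemcpy(dIn, hostIn, BLOCK_SIZE * sizeof(float), cudaMemcpyHostToDevice);

    blockSum<<<1, BLOCK_SIZE>>>(dIn, dOut);

    // This copy waits for the kernel to finish before reading the result.
    cudaMemcpy(&result, dOut, sizeof(float), cudaMemcpyDeviceToHost);
    cudaFree(dIn);
    cudaFree(dOut);
    return result;
}

int main()
{
    float data[BLOCK_SIZE] = {1, 2, 3, 4, 5, 6, 7, 8};
    printf("sum = %f\n", sumOnGpu(data));
    return 0;
}
```

With 8 threads the loop runs for stride = 1, 2, 4. Threads 0, 2, 4, 6 add in the first step, threads 0 and 4 in the second, and only thread 0 in the third, so the number of active threads halves each iteration while they remain spread across the whole block.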