当前位置：和泉文库 > 计算机 > 浏览文档

南京大学：《高级算法 Advanced Algorithms》课程教学资源（课件讲稿）Hashing and Sketching

文件格式：PDF，文件大小：1.45MB，售价：12.96元

文档详细内容（约46页）

The Mean Trick (for Variance Reduction) Variance and covariance: Var[X]=E[(X-E[X])2]=E[X2]-(EX])2 Cov(X,Y)=E (-E[X])(Y-E[Y]) ·Useful properties: Var[X a]Var[X] Var[ax]=a-Var[X] var[2冈2vami图+cmxx刘诗判 For pairwise independent identically distributed X's: v2-立m灯-ow1

The Mean Trick (for Variance Reduction) • Variance and covariance: • Useful properties: Var[X] = 𝔼[(X − 𝔼[X]) 2 ] = 𝔼[X2 ] − (𝔼[X]) 2 Cov(X, Y) = 𝔼 [(X − 𝔼[X])(Y − 𝔼[Y])] Var[X + a] = Var[X] Var[aX] = a2 Var[X] Var [∑ i Xi ] = ∑ i Var[Xi ] + ∑ i≠j Cov(Xi , Xj ) • For pairwise independent identically distributed X ’s: i Var [ 1 k k ∑ i=1 Xi ] = 1 k2 k ∑ i=1 Var[Xi ] = 1 k Var[X1]

Input:a sequencex,...,U=[N] Output:an estimation of z= uniform independent hash functions ,...,[0,1] Min Sketch: for each1≤j≤k,lety=minh,(x: 1≤isn n2=方-1wne7= j=1 ·For every1≤j≤k: linearity of 以本 expectation 网=本 1 Var[Y≤ independence (z+1)2 Var[]≤ k(2+1)2

• uniform & independent hash functions h1,…, hk : U → [0,1] Min Sketch: for each , let ; return where ; 1 ≤ j ≤ k Yj = min 1≤i≤n hj (xi ) Z ̂ = 1 Y − 1 Y = 1 k k ∑ j=1 Yj 𝔼 [Yj] = 1 z + 1 Var[Yj ] ≤ 1 (z + 1)2 • For every 1 ≤ j ≤ k: 𝔼 [Y] = 1 z + 1 Var [Y] ≤ 1 k(z + 1)2 linearity of expectation independence Input: a sequence Output: an estimation of x1, x2,…, xn ∈ U = [N] z = {x1, x2, …, xn}

Input:a sequencex,..,U=[N] Output:an estimation of= {1,,x uniform independent hash functions ,...,U[0,1] Min Sketch: for each1≤j≤k,lety=minh,(x E[]= l≤isn z+1 k j=1 Var[冈≤ka+1可。Goa:Pr E<(1-ekor2>1+e3 ≤6 assuming e≤1/2 4 Set k e26 -p> Pr 4 ≤6 (Chebyshev)

Y − 𝔼 [Y] > ϵ/2 z + 1 Pr [ Y − 𝔼 [Y] > ϵ/2 z + 1 ] ≤ 4 kϵ2 • Goal: Pr [ Z ̂ < (1 − ϵ)z or Z ̂ > (1 + ϵ)z ] ≤ δ assuming ϵ ≤ 1/2 (Chebyshev) k = ⌈ 4 ϵ2δ ⌉ Set ≤ δ • uniform & independent hash functions h1,…, hk : U → [0,1] Min Sketch: for each , let ; return where ; 1 ≤ j ≤ k Yj = min 1≤i≤n hj (xi ) Z ̂ = 1 Y − 1 Y = 1 k k ∑ j=1 Yj 𝔼 [Y] = 1 z + 1 Var [Y] ≤ 1 k(z + 1)2 Input: a sequence Output: an estimation of x1, x2,…, xn ∈ U = [N] z = {x1, x2, …, xn}

Input:a sequencex,..,=[N] Output:an estimation of z= ·uniform&independent hash functions h1,,hk:U→[0，l] Min Sketch:set k =[4/(e25)] for each1≤j≤k,lety=minh(x) 1≤i≤n wm=方-1wne-=2 1 Pr[1-ez≤2≤1+ek]≥1-6 .Space cost:) real numbers in [0,1] Storing k idealized hash functions

Min Sketch: for each , let ; return where ; 1 ≤ j ≤ k Yj = min 1≤i≤n hj (xi ) Z ̂ = 1 Y − 1 Y = 1 k k ∑ j=1 Yj • Space cost: real numbers in • Storing idealized hash functions. k = O ( 1 ϵ2δ) [0,1] k Pr [ (1 − ϵ)z ≤ Z ̂ ≤ (1 + ϵ)z ] ≥ 1 − δ set k = ⌈4/(ϵ2 δ)⌉ • uniform & independent hash functions h1,…, hk : U → [0,1] Input: a sequence Output: an estimation of x1, x2,…, xn ∈ U = [N] z = {x1, x2, …, xn}

Universal Hashing Universal Hash Family(Carter and Wegman 1979): A family of hash functions in -[m]is k-universal if for any distinct,...,U, Pr[h(c)=…=h()]≤ h∈t Moreover,is strongly k-universal(k-wise independent) if for any distinctx,..,Uand any y1,...,y[m], 八=小 1

Universal Hashing Universal Hash Family (Carter and Wegman 1979): A family of hash functions in is -universal if for any distinct , . Moreover, is strongly -universal ( -wise independent) if for any distinct and any , . ℋ U → [m] k x1, …, xk ∈ U Pr h∈ℋ [ h(x1) = ⋯ = h(xk)] ≤ 1 mk−1 ℋ k k x1, …, xk ∈ U y1, …, yk ∈ [m] Pr h∈ℋ [ k ⋀ i=1 h(xi ) = yi ] = 1 mk

点击进入文档下载页（PDF格式）

共46页，可试读17页，点击继续阅读 ↓↓

您可能感兴趣的文档

南京大学：《高级算法 Advanced Algorithms》课程教学资源（课件讲稿）Greedy and Local Search
南京大学：《高级算法 Advanced Algorithms》课程教学资源（课件讲稿）Fingerprinting
南京大学：《高级算法 Advanced Algorithms》课程教学资源（课件讲稿）Introduction（Min-Cut and Max-Cut，尹⼀通）
南京大学：《高级算法 Advanced Algorithms》课程教学资源（课件讲稿）Concentration of Measure
南京大学：《高级算法 Advanced Algorithms》课程教学资源（课件讲稿）Balls into Bins
南京大学：《高级算法 Advanced Algorithms》课程教学资源（课件讲稿）Greedy and Local Search
南京大学：《高级算法 Advanced Algorithms》课程教学资源（课件讲稿）Fingerprinting
电子科技大学：《有限元理论与建模方法 Finite Element Analysis and Modeling》研究生课程教学资源（课件讲稿）第二篇有限元建模方法第十八章边界条件的建立 Creation of Boundary Condition
电子科技大学：《有限元理论与建模方法 Finite Element Analysis and Modeling》研究生课程教学资源（课件讲稿）第二篇有限元建模方法第十七章模型检查与处理 Model Checking and Processing
电子科技大学：《有限元理论与建模方法 Finite Element Analysis and Modeling》研究生课程教学资源（课件讲稿）第二篇有限元建模方法第十六章网格划分方法
电子科技大学：《有限元理论与建模方法 Finite Element Analysis and Modeling》研究生课程教学资源（课件讲稿）第二篇有限元建模方法第十五章单元类型及特性定义
电子科技大学：《有限元理论与建模方法 Finite Element Analysis and Modeling》研究生课程教学资源（课件讲稿）第二篇有限元建模方法第十四章几何模型的建立
南京大学：《高级算法 Advanced Algorithms》课程教学资源（课件讲稿）Lovász Local Lemma
南京大学：《高级算法 Advanced Algorithms》课程教学资源（课件讲稿）Rounding Data
南京大学：《高级算法 Advanced Algorithms》课程教学资源（课件讲稿）Dimension Reduction
南京大学：《高级算法 Advanced Algorithms》课程教学资源（课件讲稿）LP Duality
南京大学：《高级算法 Advanced Algorithms》课程教学资源（课件讲稿）Rounding Linear Program
南京大学：《高级算法 Advanced Algorithms》课程教学资源（课件讲稿）SDP-Based Algorithms
南京大学：《高级算法 Advanced Algorithms》课程教学资源（课件讲稿）Exercise Lecture For Advanced Algorithms（2022 Fall）
南京大学：《组合数学 Combinatorics》课程教学资源（课件讲稿）Basic Enumeration（主讲：尹一通）
南京大学：《组合数学 Combinatorics》课程教学资源（课件讲稿）Cayley
南京大学：《组合数学 Combinatorics》课程教学资源（课件讲稿）Existence
南京大学：《组合数学 Combinatorics》课程教学资源（课件讲稿）Extremal Combinatorics
南京大学：《组合数学 Combinatorics》课程教学资源（课件讲稿）Extremal Sets

点击购买下载（PDF）

下载及服务说明

购买前请先查看本文档预览页，确认内容后再进行支付；
如遇文件无法下载、无法访问或其它任何问题，可发送电子邮件反馈，核实后将进行文件补发或退款等其它相关操作；
邮箱：

文档浏览记录