Outline

1 Introduction
2 Unsupervised Hashing
3 Supervised Hashing
4 Ranking-based Hashing
5 Multimodal Hashing
6 Deep Hashing
7 Quantization
8 Conclusion
9 Reference
Unsupervised Hashing

Problem Definition

Input: feature vectors $\{x_i\}$ (compact matrix form for all training points: $X$).
Output: binary codes $\{b_i\}$ (compact matrix form for all training points: $B$).

Instances similar in the original feature space should have similar binary codes. In other words:

When $x_i$ is close to $x_j$, the Hamming distance between $b_i$ and $b_j$ should be low.
When $x_i$ is far away from $x_j$, the Hamming distance between $b_i$ and $b_j$ should be high.
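To make the objective concrete, here is a minimal NumPy sketch of the Hamming distance between two codes; the $\{-1, +1\}$ code convention matches the sgn-based hash functions on the following slides, and the function name is just for illustration:

```python
import numpy as np

def hamming_distance(b_i, b_j):
    """Number of positions where two binary codes (in {-1, +1}) disagree."""
    return int(np.sum(b_i != b_j))

# Two 8-bit codes that differ in two positions: distance 2.
b1 = np.array([1, -1,  1, 1, -1, -1,  1, -1])
b2 = np.array([1, -1, -1, 1, -1, -1, -1, -1])
print(hamming_distance(b1, b2))  # -> 2
```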
PCA Hashing (PCAH)

To generate a code of $m$ bits, PCAH performs PCA on $X$ and then uses the top $m$ eigenvectors of the matrix $XX^T$ as the columns of the projection matrix $W \in \mathbb{R}^{d \times m}$. Here, the top $m$ eigenvectors are those corresponding to the $m$ largest eigenvalues $\{\lambda_k\}_{k=1}^{m}$, generally arranged in non-increasing order $\lambda_1 \geq \lambda_2 \geq \cdots \geq \lambda_m$. Let $\lambda = [\lambda_1, \lambda_2, \cdots, \lambda_m]^T$. Then

$$\Lambda = W^T X X^T W = \mathrm{diag}(\lambda)$$

Define the hash function $h(x) = \mathrm{sgn}(W^T x)$.
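A minimal NumPy sketch of PCAH under these definitions; the function names are hypothetical, and it assumes $X$ is stored as a $d \times n$ matrix that has been zero-centered (as PCA requires), with $\mathrm{sgn}(0)$ mapped to $+1$:

```python
import numpy as np

def pcah_train(X, m):
    """Learn the PCAH projection W (d x m) from data X (d x n, zero-centered):
    the top-m eigenvectors of X X^T, i.e., those with the m largest eigenvalues."""
    cov = X @ X.T                           # d x d
    eigvals, eigvecs = np.linalg.eigh(cov)  # eigenvalues in ascending order
    return eigvecs[:, ::-1][:, :m]          # take the m largest

def pcah_hash(W, x):
    """h(x) = sgn(W^T x), mapped to {-1, +1}."""
    return np.where(W.T @ x >= 0, 1, -1)

# Toy usage: 5-dimensional data, 100 points, 3-bit codes.
rng = np.random.default_rng(0)
X = rng.standard_normal((5, 100))
X -= X.mean(axis=1, keepdims=True)          # center the data
W = pcah_train(X, m=3)
print(pcah_hash(W, X[:, 0]))
```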
Spectral Hashing (Weiss et al., 2008)

$$\min_{\{y_i\}} \sum_{ij} W_{ij} \|y_i - y_j\|^2$$

subject to:

$$y_i \in \{-1, 1\}^k, \quad \sum_i y_i = 0, \quad \frac{1}{n} \sum_i y_i y_i^T = I$$

where $W_{ij}$ is the similarity between $x_i$ and $x_j$; the constraint $\sum_i y_i = 0$ requires each bit to fire 50% of the time, and the constraint $\frac{1}{n} \sum_i y_i y_i^T = I$ requires the bits to be uncorrelated.

This is an NP-hard problem!
Spectral Hashing (Weiss et al., 2008)

In matrix form, and by relaxation:

$$\min \ \mathrm{tr}(Y^T L Y)$$

subject to:

$$Y^T \mathbf{1} = 0, \quad \frac{1}{n} Y^T Y = I$$

where $Y$ is a real-valued $n \times k$ matrix whose $j$th row is $y_j^T$, $L = D - W$ is the Laplacian matrix, and $D$ is a diagonal matrix with $D(i, i) = \sum_j W(i, j)$.

Solution: simply the $k$ eigenvectors of $L$ with minimal eigenvalues, after excluding the trivial eigenvector $\mathbf{1}$, which has eigenvalue 0.

Apply $\mathrm{sign}(Y)$ to get the binary codes (quantization stage).

Out-of-sample extension with eigenfunctions, obtained by simply fitting a multidimensional rectangular distribution to the data (using PCA to align the axes, and then assuming a uniform distribution on each axis).
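A minimal NumPy sketch of this relaxed solution for the training points only (it does not implement the out-of-sample eigenfunction extension); the function name and the RBF similarity used in the toy example are assumptions for illustration:

```python
import numpy as np

def spectral_hashing_codes(W, k):
    """Relaxed spectral hashing on a precomputed similarity matrix W (n x n):
    take the k eigenvectors of the Laplacian L = D - W with the smallest
    eigenvalues (skipping the trivial constant eigenvector), then threshold
    at zero to obtain binary codes."""
    D = np.diag(W.sum(axis=1))
    L = D - W
    eigvals, eigvecs = np.linalg.eigh(L)   # eigenvalues in ascending order
    Y = eigvecs[:, 1:k + 1]                # skip the trivial eigenvector 1
    return np.where(Y >= 0, 1, -1)         # quantization stage: sign(Y)

# Toy usage: RBF similarities among 6 random 2-D points, 2-bit codes.
rng = np.random.default_rng(0)
X = rng.standard_normal((6, 2))
sq_dists = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
W = np.exp(-sq_dists)                      # similarity matrix
print(spectral_hashing_codes(W, k=2))
```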