Scalable Graph Hashing with Feature Transformation

Scalable Graph Hashing (SGH)

Problem
- The memory cost and time complexity of graph hashing are at least $O(n^2)$ if all pairwise similarities are explicitly computed.
- How can we utilize the whole graph while avoiding $O(n^2)$ complexity?

Scalable Graph Hashing (SGH)
- A feature transformation method (Shrivastava and Li, NIPS 2014) to effectively approximate the whole graph without explicitly computing it.
- A sequential method for bit-wise complementary learning.
- Linear complexity.
Scalable Graph Hashing (SGH)

Objective function:
$$\min_{W} \|c\tilde{S} - \mathrm{sgn}(K(X)W^{T})\,\mathrm{sgn}(K(X)W^{T})^{T}\|_F^2 \quad \text{s.t. } W K(X)^{T} K(X) W^{T} = I,$$
where $c$ is the code length.
- $\forall x$, define $K(x) = \big[\phi(x, x_1) - \sum_{i=1}^{n}\phi(x_i, x_1)/n,\ \ldots,\ \phi(x, x_m) - \sum_{i=1}^{n}\phi(x_i, x_m)/n\big]$, i.e., kernel values between $x$ and $m$ base points, centered by their means over the $n$ training points.
- $\tilde{S}_{ij} = 2S_{ij} - 1 \in (-1, 1]$.

Notation
- $X = \{x_1, \ldots, x_n\}^{T} \in \mathbb{R}^{n \times d}$: $n$ data points.
- Pairwise similarity metric: $S_{ij} = e^{-\|x_i - x_j\|_F^2/\rho} \in (0, 1]$.
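To make the notation concrete, here is a minimal NumPy sketch of $\tilde{S}$ and the centered kernel features $K(\cdot)$ (a sketch under assumptions: the Gaussian choice of $\phi$ and all function names are illustrative, not the authors' code):

```python
import numpy as np

def s_tilde(X, rho):
    """S_ij = exp(-||x_i - x_j||^2 / rho) in (0,1]; returns S_tilde = 2S - 1."""
    sq = np.sum(X**2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T   # pairwise squared distances
    return 2.0 * np.exp(-d2 / rho) - 1.0             # entries in (-1, 1]

def kernel_features(X, anchors, rho):
    """K(x): phi(x, x_j) for the m base points x_j, centered column-wise."""
    d2 = (np.sum(X**2, axis=1)[:, None]
          + np.sum(anchors**2, axis=1)[None, :] - 2.0 * X @ anchors.T)
    phi = np.exp(-d2 / rho)                          # assumed Gaussian kernel phi
    return phi - phi.mean(axis=0, keepdims=True)     # subtract the column means
```

Note that `s_tilde` materializes the full $n \times n$ matrix; it is exactly the $O(n^2)$ object SGH avoids, and it appears here only to define the target.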
Scalable Graph Hashing (SGH)

$\forall x$, define $P(x)$ and $Q(x)$:
$$P(x) = \Big[\sqrt{\tfrac{2(e^2-1)}{e\rho}}\, e^{-\frac{\|x\|_F^2}{\rho}}\, x;\ \sqrt{\tfrac{e^2+1}{e}}\, e^{-\frac{\|x\|_F^2}{\rho}};\ 1\Big]$$
$$Q(x) = \Big[\sqrt{\tfrac{2(e^2-1)}{e\rho}}\, e^{-\frac{\|x\|_F^2}{\rho}}\, x;\ \sqrt{\tfrac{e^2+1}{e}}\, e^{-\frac{\|x\|_F^2}{\rho}};\ -1\Big]$$

$\forall x_i, x_j \in X$:
$$P(x_i)^{T} Q(x_j) = 2\Big[\frac{e^2-1}{2e} \cdot \frac{2x_i^{T}x_j}{\rho} + \frac{e^2+1}{2e}\Big] e^{-\frac{\|x_i\|_F^2 + \|x_j\|_F^2}{\rho}} - 1 \approx 2e^{-\frac{\|x_i\|_F^2 + \|x_j\|_F^2 - 2x_i^{T}x_j}{\rho}} - 1 = \tilde{S}_{ij}$$
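A small sketch of $P$ and $Q$, with a numeric check that $P(x_i)^{T} Q(x_j)$ tracks $\tilde{S}_{ij}$ (the function names and random test data are assumptions for illustration):

```python
import numpy as np

def P(x, rho):
    """P(x) = [sqrt(2(e^2-1)/(e*rho)) * g * x ; sqrt((e^2+1)/e) * g ; 1], g = exp(-||x||^2/rho)."""
    g = np.exp(-np.dot(x, x) / rho)
    return np.concatenate([
        np.sqrt(2.0 * (np.e**2 - 1.0) / (np.e * rho)) * g * x,
        [np.sqrt((np.e**2 + 1.0) / np.e) * g, 1.0],
    ])

def Q(x, rho):
    """Q(x) matches P(x) except the trailing entry is -1."""
    q = P(x, rho)
    q[-1] = -1.0
    return q

rng = np.random.default_rng(0)
xi, xj = rng.normal(size=4), rng.normal(size=4)
rho = 2.0 * max(np.dot(xi, xi), np.dot(xj, xj))  # keeps 2 x_i^T x_j / rho in [-1, 1]
approx = P(xi, rho) @ Q(xj, rho)
exact = 2.0 * np.exp(-np.sum((xi - xj)**2) / rho) - 1.0   # S_tilde_ij
print(approx, exact)                             # close, but not identical
```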
Feature Transformation

- Here, we use the approximation $\frac{e^2-1}{2e}x + \frac{e^2+1}{2e} \approx e^{x}$, i.e., the line that interpolates $e^{x}$ at $x = \pm 1$.
  (Figure: $e^{x}$ and its linear approximation plotted over $[-1, 1]$.)
- We assume $-1 \le \frac{2}{\rho}x_i^{T}x_j \le 1$. By the Cauchy–Schwarz inequality, it is easy to prove that $\rho = 2\max\{\|x_i\|_F^2\}_{i=1}^{n}$ makes $-1 \le \frac{2}{\rho}x_i^{T}x_j \le 1$.
- Then we have $\tilde{S} \approx P(X)^{T} Q(X)$.
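The point of the factorization is that $\tilde{S}$ never has to be materialized: any product with $\tilde{S}$ can be grouped through the $(d+2) \times n$ matrices $P(X)$ and $Q(X)$. A sketch of this grouping for the quadratic form $K(X)^{T}\tilde{S}K(X)$ needed later (function names are assumptions):

```python
import numpy as np

def transform(X):
    """Columns are P(x_i) and Q(x_i); rho = 2 max_i ||x_i||^2 as on the slide."""
    rho = 2.0 * np.max(np.sum(X**2, axis=1))
    g = np.exp(-np.sum(X**2, axis=1) / rho)           # e^{-||x_i||^2 / rho}
    top = np.sqrt(2.0 * (np.e**2 - 1.0) / (np.e * rho)) * g * X.T
    mid = np.sqrt((np.e**2 + 1.0) / np.e) * g
    ones = np.ones(X.shape[0])
    return np.vstack([top, mid, ones]), np.vstack([top, mid, -ones])  # (d+2) x n

def quadratic_form(K, Pm, Qm):
    """K^T S_tilde K ~= (Pm @ K)^T (Qm @ K): O(n*m*d) instead of O(n^2)."""
    M = (Pm @ K).T @ (Qm @ K)
    return (M + M.T) / 2.0                            # symmetrize the approximation
```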
Scalable Graph Hashing (SGH)

- Direct relaxation may lead to poor performance, so we adopt a sequential learning strategy in a bit-wise complementary manner.
- Residual definition:
$$R_t = c\tilde{S} - \sum_{i=1}^{t-1} \mathrm{sgn}(K(X)w_i)\,\mathrm{sgn}(K(X)w_i)^{T}, \qquad R_1 = c\tilde{S}$$
- Objective function:
$$\min_{w_t} \|R_t - \mathrm{sgn}(K(X)w_t)\,\mathrm{sgn}(K(X)w_t)^{T}\|_F^2 \quad \text{s.t. } w_t^{T} K(X)^{T} K(X) w_t = 1$$
- By relaxation, we can get:
$$\max_{w_t} \mathrm{tr}\big(w_t^{T} K(X)^{T} R_t K(X) w_t\big) \quad \text{s.t. } w_t^{T} K(X)^{T} K(X) w_t = 1$$
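The relaxed problem is a generalized eigenproblem: with $A_t = K(X)^{T} R_t K(X)$ and $B = K(X)^{T} K(X)$, the maximizer $w_t$ is the top generalized eigenvector of $(A_t, B)$. A minimal sketch of the bit-wise loop (it forms $\tilde{S}$ explicitly for readability, which the real method avoids via $P(X)^{T}Q(X)$; the SciPy solver and the small regularizer are assumptions):

```python
import numpy as np
from scipy.linalg import eigh

def sgh_sequential(K, S_tilde, c):
    """Learn c projections w_1..w_c, each fitted to the residual R_t."""
    m = K.shape[1]
    B = K.T @ K + 1e-6 * np.eye(m)    # constraint matrix, regularized to stay PD
    R = c * S_tilde                   # R_1 = c * S_tilde
    W = []
    for _ in range(c):
        A = K.T @ R @ K               # symmetric, since R is symmetric
        _, vecs = eigh(A, B)          # generalized eigenproblem A w = lambda B w
        w = vecs[:, -1]               # eigenvector of the largest eigenvalue
        h = np.sign(K @ w)            # one bit for every point
        h[h == 0] = 1.0               # break sgn(0) ties
        R = R - np.outer(h, h)        # peel off what this bit already explains
        W.append(w)
    return np.stack(W)                # c x m; codes are sgn(K(X) @ W.T)
```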