当前位置：和泉文库 > 数学 > 浏览文档

《模式识别》课程教学资源（书籍文献）Data Clustering - 50 Years Beyond K-means

文件格式：PDF，文件大小：2.71MB，售价：11.13元

文档详细内容（约39页）

Ground Truth 整盛烟 666666665-- Khmer Cultural Center

Ground Truth Khmer Cultural Center

Data Explosion The digital universe was ~281 exabytes (281 billion gigabytes)in 2007;it would grow 10 times by 2011 Images and video,captured by over one billion devices (camera phones),are the major source To archive and effectively use this data,we need tools for data categorization http://eon.businesswire.com/releases/information/digital/prweb509640.htm http://www.emc.com/collateral/analyst-reports/diverse-exploding-digital-universe.pdf 尚

Data Explosion • Th di it l i 281 b t The digital universe was ~281 exabytes (281 billion gigabytes) in 2007; it would grow 10 times by 2011 • Images and video, captured by over one billion d i ( h ) th j devices (camera phones), are the major source • To archive and effectively use this data, we need tools for data categorization http://eon.businesswire.com/releases/information/digital/prweb509640.htm http://www.emc.com/collateral/analyst-reports/diverse-exploding-digital-universe.pdf

Data Clustering Grouping of objects into meaningful categories Classification vs.clustering Unsupervised learning,exploratory data analysis, grouping,clumping,taxonomy,typology,Q-analysis Given a representation of n objects,find K clusters based on a measure of similarity Partitional vs.hierarchical A.K.Jain and R.C.Dubes.Algorithms for Clustering Data,Prentice Hall,1988.(available for download at:http://dataclustering.cse.msu.edu/)

Data Clustering • Grouping of objects into meaningful categories • Classification vs. clustering • Unsupervised learning, exploratory data analysis, grouping clumping taxonomy typology Q grouping, clumping, taxonomy, typology, Q-analysis analysis • Given a representation of n objects, find K clusters based on a measure of based on a measure of similarity similarity • Partitional vs. hierarchical A. K. Jain and R. C. Dubes. Algorithms for Clustering Data, Prentice Hall, 1988. (available for download at: http g) ://dataclustering.cse.msu.edu/)

Why Clustering? Natural classification:degree of similarity among forms (phylogenetic relationship or taxonomy) Data exploration:discover underlying structure, generate hypotheses,detect anomalies Compression:method for organizing data Applications:any scientific field that collects data! Astronomy,biology,marketing,engineering,..... Google Scholar:~1500 clustering papers in 2007 alone!

Why Clustering? • Natural classification: degree of similarity among forms (phylogenetic relationship or taxonomy) • Data exploration: discover underlying structure, generate hypotheses, detect anomalies • Compression: method for organizing data • Applications: any scientific field that collects data! Astronomy, biology, marketing, engineering,….. Google Scholar: ~1500 clustering papers in 2007 alone!

Historical Developments Cluster analysis first appeared in the title of a 1954 article analyzing anthropological data (STOR) Hierarchical Clustering:Sneath (1957),Sorensen (1957) K-Means:independently discovered Steinhaus1(1956),Lloyd2 (1957),Cox3(1957),Bal∥&Hal(1967),MacQueen5(1967) Mixture models (Wolfe,1970) Graph-theoretic methods (Zahn,1971) .K Nearest neighbors (Jarvis Patrick,1973) Fuzzy clustering (Bezdek,1973) Self Organizing Map(Kohonen,1982) Vector Quantization (Gersho and Gray,1992) 1Acad.Polon.Sci.,2Bell Tel.Report,3JASA,4Behavioral Sci.,5Berkeley Symp.Math Stat Prob. 合口

Historical Developments • Cluster analysis first appeared in the title of a 1954 article analyzing anthropological data (JSTOR) • Hierarchical Clustering: Sneath (1957) Sorensen (1957) Sneath (1957) , Sorensen (1957) • K-Means: independently discovered Steinhaus 1 (1956), Lloyd2 (1957), Cox3 (1957), Ball & Hall4 (1967), MacQueen 5 (1967) • Mixture models (Wolfe, 1970 ) • Graph-theoretic methods (Zahn, 1971) • K Nearest neighbors (Jarvis & Patrick, 1973) • Fuzzy clustering (Bezdek, 1973) • Self Organizing Map (Kohonen, 1982) • Vector Quantization (Gersho and Gray, 1992) 1Acad. Polon. Sci., 2Bell Tel. Report, 3JASA, 4Behavioral Sci., 5Berkeley Symp. Math Stat & Prob

点击进入文档下载页（PDF格式）

共39页，可试读13页，点击继续阅读 ↓↓

您可能感兴趣的文档

《模式识别》课程教学资源（书籍文献）Data Clustering - A Review（A.K. JAIN、M.N. MURTY、P.J. FLYNN）
《模式识别》课程教学资源（书籍文献）A tutorial on Principal Components Analysis（Lindsay I Smith）
《模式识别》课程教学资源（书籍文献）A Tutorial on Principal Component Analysis（Jonathon Shlens）
《模式识别》课程教学资源（书籍文献）Sequential Minimal Optimization - A Fast Algorithm for Training Support Vector Machines（John C. Platt）
《模式识别》课程教学资源（书籍文献）A Tutorial on Support Vector Machines for Pattern Recognition（CHRISTOPHER J.C. BURGES）
《模式识别》课程教学资源（书籍文献）Introduction to Support Vector Learning
《模式识别》课程教学资源（书籍文献）TRENDS & CONTROVERSIES TRENDS & CONTROVERSIES - Support vector machines
《模式识别》课程教学资源（书籍文献）Background and Foreground Modeling Using Nonparametric Kernel Density Estimation for Visual Surveillance
《模式识别》课程教学资源（书籍文献）Tutorial on maximum likelihood estimation
《模式识别》课程教学资源（书籍文献）Statistical Pattern Recognition - A Review
《模式识别》课程教学资源（书籍文献）Digital Image Processing（Second Edition，Review Material，Rafael C. Gonzalez、Richard E. Woods）
北京大学：《模式识别》课程教学资源（课件讲稿）人工神经网络简介
《模式识别》课程教学资源（书籍文献）Artificial neural networks - a tutorial（Anil K. Jain、Jianchang Mao）
《模式识别》课程教学资源（书籍文献）Learning in Linear Neural Networks - A Survey
《模式识别》课程教学资源（书籍文献）Neural Networks for Classification - A Survey
电子科技大学：《随机过程及应用 Stochastic Processes and Applications》课程教学资源（课件讲稿）第0章序言（覃思义）
电子科技大学：《随机过程及应用 Stochastic Processes and Applications》课程教学资源（课件讲稿）第1章预备知识第1节概率空间
电子科技大学：《随机过程及应用 Stochastic Processes and Applications》课程教学资源（课件讲稿）第1章预备知识第2节随机变量及其分布
电子科技大学：《随机过程及应用 Stochastic Processes and Applications》课程教学资源（课件讲稿）第1章预备知识第3节随机变量的函数
电子科技大学：《随机过程及应用 Stochastic Processes and Applications》课程教学资源（课件讲稿）第1章预备知识第4节随机变量的数字特征
电子科技大学：《随机过程及应用 Stochastic Processes and Applications》课程教学资源（课件讲稿）第1章预备知识第5节特征函数
电子科技大学：《随机过程及应用 Stochastic Processes and Applications》课程教学资源（课件讲稿）第2章随机过程的基本概念第1节随机过程的定义及分类
电子科技大学：《随机过程及应用 Stochastic Processes and Applications》课程教学资源（课件讲稿）第2章随机过程的基本概念第2节随机过程的分布
电子科技大学：《随机过程及应用 Stochastic Processes and Applications》课程教学资源（课件讲稿）第2章随机过程的基本概念第3节随机过程的数字特征

点击购买下载（PDF）

下载及服务说明

购买前请先查看本文档预览页，确认内容后再进行支付；
如遇文件无法下载、无法访问或其它任何问题，可发送电子邮件反馈，核实后将进行文件补发或退款等其它相关操作；
邮箱：

文档浏览记录