当前位置：和泉文库 > 计算机 > 浏览文档

《知识发现和数据挖掘 Knowledge Discovery and Data Mining》课程教学课件（PPT讲稿）Chapter 10. Cluster Analysis：Basic Concepts and Methods

◼ Cluster Analysis: Basic Concepts ◼ Partitioning Methods ◼ Hierarchical Methods ◼ Density-Based Methods ◼ Grid-Based Methods ◼ Evaluation of Clustering ◼ Summary

文件格式：PPTX，文件大小：1.69MB，售价：25.5元

共100页，可试读20页，点击往前阅读 ↑↑

文档详细内容（约100页）

The K-Means clustering Method EXample Assign Update ch the objects cluste 56 10 to most means center reassign reassign Arbitrarily choose K oject as initia cluster center pdate the cluster means 012345678910 16

16 The K-Means Clustering Method ◼ Example 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 K=2 Arbitrarily choose K object as initial cluster center Assign each objects to most similar center Update the cluster means Update the cluster means reassign reassign

R-Means Consider the following 6 two-dimensional data points x1:(0,0)2x2(1,0),x3(1,1,x4(2,1),x5(3,1),x6(3, If k-2, and the initial means are(0, 0)and(2 1) (using Euclidean distance) a Use K-means to cluster the points 17

K-Means ◼ Consider the following 6 two-dimensional data points: ◼ x1: (0, 0), x2:(1, 0), x3(1, 1), x4(2, 1), x5(3, 1), x6(3, 0) ◼ If k=2, and the initial means are (0, 0) and (2, 1), (using Euclidean Distance) ◼ Use K-means to cluster the points. 17

R-Means Now we know the initial means Mean one(0, 0)and mean two(2, 1), We are going to use euclidean distance to calculate the distance between each point and each mean For example Point xl(0,0), point xl is exactly the initial mean one, so we can directly put xl into cluster one

18 K-Means ◼ Now we know the initial means: ◼ Mean_one(0, 0) and mean_two(2, 1), ◼ We are going to use Euclidean Distance to calculate the distance between each point and each mean. ◼ For example: ◼ Point x1 (0, 0), point x1 is exactly the initial mean_one, so we can directly put x1 into cluster one

R-Means Next we check Point x2 (1,0) Distance a and mean one s? (1-0)2+(0-0)2=1 Distance? x2 and mean two=v(1-2)2+(0-1)2=2 Distancel<Distance2, so point x2 is closer to mean one Thus x2 belongs to cluster one 19

19 K-Means ◼ Next we check Point x2 (1, 0) Distance1: x2_and_mean_one = 2 (1 − 0) 2+(0 − 0) 2 = 1 Distance2: x2_and_mean_two = 2 (1 − 2) 2+(0 − 1) 2 = 2 2 Distance1<Distance2, so point x2 is closer to mean_one, Thus x2 belongs to cluster one

R-Means Similarly for point x3(1,1), Distance x3 and mean one=y(1-0)2+(1-0)2=32 Distance X3 and mean two=V(1-2)2+(1-1) Distancel>Distance, so point x3 is closer to mean two Thus x3 belongs to cluster two

20 K-Means Similarly for point x3 (1, 1), Distance1: x3_and_mean_one = 2 (1 − 0) 2+(1 − 0) 2 = 2 2 Distance2: x3_and_mean_two = 2 (1 − 2) 2+(1 − 1) 2 = 1 Distance1>Distance2, so point x3 is closer to mean_two, Thus x3 belongs to cluster two

点击进入文档下载页（PPTX格式）

共100页，试读已结束，阅读完整版请下载

您可能感兴趣的文档

《人工智能原理及应用》课程教学大纲 Artificial Intelligence Principles and Applications
西安电子科技大学：《接入网技术及其应用》课程教学资源（PPT课件讲稿）第6章接入网应用（徐展琦）
《管理信息系统原理及开发》课程教学资源（PPT课件讲稿）第3、4讲管理信息系统的系统设计
西安电子科技大学：《现代密码学》课程教学资源（PPT课件讲稿）第四章公钥密码（主讲：董庆宽）
河南中医药大学（河南中医学院）：《计算机文化》课程教学资源（PPT课件讲稿）第二章计算机的前世今生（主讲：许成刚）
《计算机软件及应用》课程教学资源（PPT课件讲稿）第2章 Photoshop CS入门基础
《大型机高级系统管理技术》课程教学资源（PPT课件讲稿）第4章作业控制子系统
上海交通大学：《软件工程 Software Engineering》课程教学资源（PPT课件讲稿）软件开发过程 Software Development Processes
中国水利水电出版社：《计算机组装与维护实训教程》课程教学资源（PPT课件讲稿，共九章）
《大学生计算机基础》课程教学资源（PPT讲稿）第三章字处理软件（Word 2003）
北京大学：《高级软件工程》课程教学资源（PPT课件讲稿）第六讲网络环境中的软件质量
《计算机数据恢复技术》课程教学资源（PPT课件讲稿）第1章数据恢复技术概述
中国科学技术大学：《信号与图像处理基础 Signal and Image Processing》课程教学资源（PPT课件讲稿）小波分析 Wavelet Analysis（主讲：曹洋）
《计算机网络 Computer Networking》课程教学资源（PPT课件讲稿）Chapter 6 无线和移动网络 Wireless and Mobile Networks
《UNIX操作系统基础》课程教学资源（PPT课件讲稿）第三章 UNIX的文件与目录
上海交通大学：并发理论（PPT课件诗篇）Concurrency Theory
南京大学：《Java语言程序设计》课程教学资源（PPT课件讲稿）第2章 Java语言语法基础
南京大学：使用失效数据来引导决定（PPT讲稿，计算机系：赵建华）
南京航空航天大学：《C++》课程电子教案（PPT课件讲稿）第3章类的基础部分（主讲：陈哲）
《软件工程导论》课程教学资源（PPT课件讲稿）第9章面向对象方法学
河南中医药大学（河南中医学院）：《计算机文化》课程教学资源（PPT课件讲稿）第一章计算机网络概述（主讲：阮晓龙）
《数据库原理》课程教学资源（PPT课件讲稿）第三章关系数据库标准查询语言SQL
Excel 2010高级使用技巧（PPT讲稿）
电子工业出版社：《计算机网络》课程教学资源（第五版，PPT课件讲稿）第二章物理层

点击购买下载（PPTX）

下载及服务说明

购买前请先查看本文档预览页，确认内容后再进行支付；
如遇文件无法下载、无法访问或其它任何问题，可发送电子邮件反馈，核实后将进行文件补发或退款等其它相关操作；
邮箱：

文档浏览记录