当前位置：和泉文库 > 计算机 > 浏览文档

《知识发现和数据挖掘 Knowledge Discovery and Data Mining》课程教学课件（PPT讲稿）Chapter 10. Cluster Analysis：Basic Concepts and Methods

◼ Cluster Analysis: Basic Concepts ◼ Partitioning Methods ◼ Hierarchical Methods ◼ Density-Based Methods ◼ Grid-Based Methods ◼ Evaluation of Clustering ◼ Summary

文件格式：PPTX，文件大小：1.69MB，售价：25.5元

文档详细内容（约100页）

COMP5331: Knowledge Discovery and Data Mining Acknowledgement: Slides modified by Dr. Lei Chen based on the slides provided by Jiawei Han, Micheline Kamber, and Jian Pei C2012 Han Kamber pei. All rights reserved

1 COMP5331: Knowledge Discovery and Data Mining Acknowledgement: Slides modified by Dr. Lei Chen based on the slides provided by Jiawei Han, Micheline Kamber, and Jian Pei ©2012 Han, Kamber & Pei. All rights reserved

Chapter 10. Cluster Analysis: Basic Concepts and Methods Cluster Analysis: Basic Concepts Partitioning Methods Hierarchical methods Density-Based Methods Grid-Based methods Evaluation of clustering Summary

2 Chapter 10. Cluster Analysis: Basic Concepts and Methods ◼ Cluster Analysis: Basic Concepts ◼ Partitioning Methods ◼ Hierarchical Methods ◼ Density-Based Methods ◼ Grid-Based Methods ◼ Evaluation of Clustering ◼ Summary 2

What is Cluster Analysis? Cluster: a collection of data objects similar(or related) to one another within the same group dissimilar (or unrelated) to the objects in other groups Cluster analysis(or clustering, data segmentation,. Finding similarities between data according to the characteristics found in the data and grouping similar data objects into clusters Unsupervised learning: no predefined classes (i.e, learning by observations vs learning by examples: supervised) Typical applications As a stand-alone tool to get insight into data distribution As a preprocessing step for other algorithms

3 What is Cluster Analysis? ◼ Cluster: A collection of data objects ◼ similar (or related) to one another within the same group ◼ dissimilar (or unrelated) to the objects in other groups ◼ Cluster analysis (or clustering, data segmentation, …) ◼ Finding similarities between data according to the characteristics found in the data and grouping similar data objects into clusters ◼ Unsupervised learning: no predefined classes (i.e., learning by observations vs. learning by examples: supervised) ◼ Typical applications ◼ As a stand-alone tool to get insight into data distribution ◼ As a preprocessing step for other algorithms

Clustering for Data Understanding and Applications Biology: taxonomy of living things: kingdom, phylum, class, order, family, genus and species Information retrieval: document clustering Land use: ldentification of areas of similar land use in an earth observation database Marketing Help marketers discover distinct groups in their customer bases, and then use this knowledge to develop targeted marketing programs City-planning Identifying groups of houses according to their house type, value, and geographical location Earth-quake studies: Observed earth quake epicenters should be clustered along continent faults Climate: understanding earth climate, find patterns of atmospheric and ocean Economic Science: market resarch

4 Clustering for Data Understanding and Applications ◼ Biology: taxonomy of living things: kingdom, phylum, class, order, family, genus and species ◼ Information retrieval: document clustering ◼ Land use: Identification of areas of similar land use in an earth observation database ◼ Marketing: Help marketers discover distinct groups in their customer bases, and then use this knowledge to develop targeted marketing programs ◼ City-planning: Identifying groups of houses according to their house type, value, and geographical location ◼ Earth-quake studies: Observed earth quake epicenters should be clustered along continent faults ◼ Climate: understanding earth climate, find patterns of atmospheric and ocean ◼ Economic Science: market resarch

Clustering as a Preprocessing Tool ( Utility) Summarization Preprocessing for regression, PCA, classification, and association analysis Compression Image processing: vector quantization Finding K-nearest Neighbors Localizing search to one or a small number of clusters Outlier detection Outliers are often viewed as those far away' from any cluster

5 Clustering as a Preprocessing Tool (Utility) ◼ Summarization: ◼ Preprocessing for regression, PCA, classification, and association analysis ◼ Compression: ◼ Image processing: vector quantization ◼ Finding K-nearest Neighbors ◼ Localizing search to one or a small number of clusters ◼ Outlier detection ◼ Outliers are often viewed as those “far away” from any cluster

点击进入文档下载页（PPTX格式）

共100页，可试读20页，点击继续阅读 ↓↓

您可能感兴趣的文档

《人工智能原理及应用》课程教学大纲 Artificial Intelligence Principles and Applications
西安电子科技大学：《接入网技术及其应用》课程教学资源（PPT课件讲稿）第6章接入网应用（徐展琦）
《管理信息系统原理及开发》课程教学资源（PPT课件讲稿）第3、4讲管理信息系统的系统设计
西安电子科技大学：《现代密码学》课程教学资源（PPT课件讲稿）第四章公钥密码（主讲：董庆宽）
河南中医药大学（河南中医学院）：《计算机文化》课程教学资源（PPT课件讲稿）第二章计算机的前世今生（主讲：许成刚）
《计算机软件及应用》课程教学资源（PPT课件讲稿）第2章 Photoshop CS入门基础
《大型机高级系统管理技术》课程教学资源（PPT课件讲稿）第4章作业控制子系统
上海交通大学：《软件工程 Software Engineering》课程教学资源（PPT课件讲稿）软件开发过程 Software Development Processes
中国水利水电出版社：《计算机组装与维护实训教程》课程教学资源（PPT课件讲稿，共九章）
《大学生计算机基础》课程教学资源（PPT讲稿）第三章字处理软件（Word 2003）
北京大学：《高级软件工程》课程教学资源（PPT课件讲稿）第六讲网络环境中的软件质量
《计算机数据恢复技术》课程教学资源（PPT课件讲稿）第1章数据恢复技术概述
中国科学技术大学：《信号与图像处理基础 Signal and Image Processing》课程教学资源（PPT课件讲稿）小波分析 Wavelet Analysis（主讲：曹洋）
《计算机网络 Computer Networking》课程教学资源（PPT课件讲稿）Chapter 6 无线和移动网络 Wireless and Mobile Networks
《UNIX操作系统基础》课程教学资源（PPT课件讲稿）第三章 UNIX的文件与目录
上海交通大学：并发理论（PPT课件诗篇）Concurrency Theory
南京大学：《Java语言程序设计》课程教学资源（PPT课件讲稿）第2章 Java语言语法基础
南京大学：使用失效数据来引导决定（PPT讲稿，计算机系：赵建华）
南京航空航天大学：《C++》课程电子教案（PPT课件讲稿）第3章类的基础部分（主讲：陈哲）
《软件工程导论》课程教学资源（PPT课件讲稿）第9章面向对象方法学
河南中医药大学（河南中医学院）：《计算机文化》课程教学资源（PPT课件讲稿）第一章计算机网络概述（主讲：阮晓龙）
《数据库原理》课程教学资源（PPT课件讲稿）第三章关系数据库标准查询语言SQL
Excel 2010高级使用技巧（PPT讲稿）
电子工业出版社：《计算机网络》课程教学资源（第五版，PPT课件讲稿）第二章物理层

点击购买下载（PPTX）

下载及服务说明

购买前请先查看本文档预览页，确认内容后再进行支付；
如遇文件无法下载、无法访问或其它任何问题，可发送电子邮件反馈，核实后将进行文件补发或退款等其它相关操作；
邮箱：

文档浏览记录