1 What's Cluster Analysis?(3) Clustering is an unsupervised learning: No predefined class number Data mining function of clustering analysis .As an independent tool to obtain data distribution .As other algorithms'preprocessing steps (e.g. feature and classification) DATA 6 Copyright 2019 by Xiaoyu Li
Copyright © 2019 by Xiaoyu Li. 6 Clustering is an unsupervised learning: No predefined class number Data mining function of clustering analysis As an independent tool to obtain data distribution As other algorithms’ preprocessing steps (e.g. feature and classification) 1 What’s Cluster Analysis?(3)
2 Typical Applications(1) 。Pattern recognition Spatial data analysis Cluster the similar regions and generate topic map in the GIS system. Exam spatial clustering and give its explanation in spatial data mining Image processing .Economics Especially in Market Research ATA 7 Copyright 2019 by Xiaoyu Li
Copyright © 2019 by Xiaoyu Li. 7 2 Typical Applications(1) Pattern recognition Spatial data analysis Cluster the similar regions and generate topic map in the GIS system. Exam spatial clustering and give its explanation in spatial data mining Image processing Economics( Especially in Market Research )
2 Typical Applications(2) Web Classify the documents on WEB Cluster the Web log data to find the same user access mode 。Marketing Help market analysts to find a different customer base from the customer base,so different customers can use different marketing strategy ●Earthquake research Cluster the observed epicenter points along the plate fault zone,and get the seismic risk zone ATA 8 Copyright 2019 by Xiaoyu Li
Copyright © 2019 by Xiaoyu Li. 8 2 Typical Applications(2) Web Classify the documents on WEB Cluster the Web log data to find the same user access mode Marketing Help market analysts to find a different customer base from the customer base, so different customers can use different marketing strategy Earthquake research Cluster the observed epicenter points along the plate fault zone, and get the seismic risk zone
2 Typical Applications(3) ●Land use In the database of earth monitoring,the same land use region is found ·Insurance industry The customer base of the higher claim rate in automobile insurance is found 。Urban planning Group it according to the type of house,value and location ATA 9 Copyright 2019 by Xiaoyu Li
Copyright © 2019 by Xiaoyu Li. 9 2 Typical Applications(3) Land use In the database of earth monitoring, the same land use region is found Insurance industry The customer base of the higher claim rate in automobile insurance is found Urban planning Group it according to the type of house, value and location
3 What's Good Clustering Analysis? High class internal similarity; Low class similarity; As a branch of statistics,the clustering analysis researchi theme is mainly based 0n distance- clustering a high-quality clustering analysis result will be decided on the used clustering method; The implementation of the similarity measure and the method used in clustering method; Ability to discover hidden patterns. ATA 10 Copyright 2019 by Xiaoyu Li
Copyright © 2019 by Xiaoyu Li. 10 3 What’s Good Clustering Analysis? High class internal similarity; Low class similarity; As a branch of statistics, the clustering analysis research theme is mainly based on distanceclustering ; a high-quality clustering analysis result will be decided on the used clustering method; The implementation of the similarity measure and the method used in clustering method; Ability to discover hidden patterns