当前位置：和泉文库 > 计算机 > 浏览文档

电子科技大学：《大数据分析与挖掘 Big Data Analysis and Mining》课程教学资源（课件讲稿）Lecture 2 BasicConcepts（Foundations of Data Mining）

文件格式：PDF，文件大小：4.03MB，售价：22.91元

文档详细内容（约116页）

Lecture 2 Foundations of Data Mining

Example 1 Payment prediction Can we predict the salary of one man according to his age,education year and working hours per week? Age Edu.year HoursPerWeek Pay 25 7 40 <50k 38 9 50 ≥50k 28 12 40 ≥50k 24 10 40 <50k 55 4 10 ? Classification

Example 1 Age Edu. year HoursPerWeek Pay 25 7 40 <50k 38 9 50 ≥50k 28 12 40 ≥50k 24 10 40 <50k 55 4 10 ? • Payment prediction – Can we predict the salary of one man according to his age, education year and working hours per week? Classification

Example 2 。Items Clustering Color based Shape based Clustering

Example 2 • Items Clustering Color based Shape based Clustering

What's tasks in ML? -Supervised learning:targets to learn the mapping function or relationship between the features and the labels based on the labeled data.Namely,Y F(X).(e.g.Classification, Prediction) -Unsupervised learning:aims at learning the intrinsic structure from unlabeled data.(e.g.Clustering,Latent Factor Learning and Frequent Items Mining) -Semi-supervised learning:can be regarded as the unsupervised learning with some constraints on labels,or the supervised learning with additional information on the distribution of data. Classification-Clustering-Association Rule Mining-Outlier Detection

What’s tasks in ML? – Supervised learning: targets to learn the mapping function or relationship between the features and the labels based on the labeled data. Namely, 𝑌 = 𝐹(𝑋|𝜃). (e.g. Classification, Prediction) – Unsupervised learning: aims at learning the intrinsic structure from unlabeled data. (e.g. Clustering, Latent Factor Learning and Frequent Items Mining) – Semi-supervised learning: can be regarded as the unsupervised learning with some constraints on labels, or the supervised learning with additional information on the distribution of data. Classification-Clustering- Association Rule Mining- Outlier Detection

Supervised Learning Given training data ={(x1,y1),(x2,y2),..,(XN,yN)}where yi is the corresponding label of data xi,supervised learning learns the mapping function Y F(X|0),or the posterior distribution P(Y X). Dependent variable:PLAY ·Supervised problems Play Don't Play 5 -Classification OUTLOOK Regression sunny overcast rain Learn to Rank Play 2 Play Play 3 Tagging Don't Play 3 Don't Play 0 Don't Play 2 HUMIDITY WINDY <=70 >70 TRUE FALSE Play 2 Play 0 Play 0 Play 3 Don't Play 0 Don't Play 3 Don't Play 2 Don't Play 0

Given training data 𝑋 = x1, y1 , x2, y2 , … , xN, yN where 𝑦𝑖 is the corresponding label of data 𝑥𝑖 , supervised learning learns the mapping function 𝑌 = 𝐹(𝑋|𝜃), or the posterior distribution 𝑃 𝑌 𝑋 . • Supervised problems – Classification – Regression – Learn to Rank – Tagging – …… Supervised Learning

点击进入文档下载页（PDF格式）

共116页，可试读30页，点击继续阅读 ↓↓

您可能感兴趣的文档

电子科技大学：《大数据分析与挖掘 Big Data Analysis and Mining》课程教学资源（课件讲稿）Lecture 1 Intro（主讲：邵俊明）
计算机科学与技术（PPT讲稿）Unlock with Your Heart - Heartbeat-based Authentication on Commercial Mobile Phones
计算机科学与技术（参考文献）VECTOR - Velocity Based Temperature-field Monitoring with Distributed Acoustic Devices
计算机科学与技术（参考文献）VSkin - Sensing Touch Gestures on Surfaces of Mobile Devices Using Acoustic Signals
计算机科学与技术（参考文献）RespTracker - Multi-user Room-scale Respiration Tracking with Commercial Acoustic Devices
计算机科学与技术（参考文献）Dynamic Speed Warping - Similarity-Based One-shot Learning for Device-free Gesture Signals
计算机科学与技术（参考文献）SpiderMon - Towards Using Cell Towers as Illuminating Sources for Keystroke Monitoring
计算机科学与技术（参考文献）Unlock with Your Heart：Heartbeat-based Authentication on Commercial Mobile Phones
计算机科学与技术（参考文献）QGesture - Quantifying Gesture Distance and Direction with WiFi Signals
计算机科学与技术（PPT讲稿）QGesture - Quantifying Gesture Distance and Direction with WiFi Signals
计算机科学与技术（参考文献）Gait Recognition Using WiFi Signals
计算机科学与技术（参考文献）Gait Recognition Using WiFi Signals
电子科技大学：《大数据分析与挖掘 Big Data Analysis and Mining》课程教学资源（课件讲稿）Lecture 3 Hashing
电子科技大学：《大数据分析与挖掘 Big Data Analysis and Mining》课程教学资源（课件讲稿）Lecture 4 Sampling for Big Data
电子科技大学：《大数据分析与挖掘 Big Data Analysis and Mining》课程教学资源（课件讲稿）Lecture 5 Data Stream Mining
电子科技大学：《大数据分析与挖掘 Big Data Analysis and Mining》课程教学资源（课件讲稿）Lecture 6 Graph Mining
电子科技大学：《大数据分析与挖掘 Big Data Analysis and Mining》课程教学资源（课件讲稿）Lecture 7 Hadoop-Spark
电子科技大学：《先进计算机网络技术》课程教学资源（课件讲稿）Introduction（冯钢）
电子科技大学：《先进计算机网络技术》课程教学资源（课件讲稿）Unit 1 Overview - A big Picture on Traffic Control and QoS in IP networks
电子科技大学：《先进计算机网络技术》课程教学资源（课件讲稿）Unit 2 Call-level Models and Admission Control
电子科技大学：《先进计算机网络技术》课程教学资源（课件讲稿）Unit 3 Traffic Policing and Shaping
电子科技大学：《先进计算机网络技术》课程教学资源（课件讲稿）Unit 4 TCP Traffic Control
电子科技大学：《先进计算机网络技术》课程教学资源（课件讲稿）Unit 5 Buffer Management
电子科技大学：《先进计算机网络技术》课程教学资源（课件讲稿）Unit 6 Packet Scheduling

点击购买下载（PDF）

下载及服务说明

购买前请先查看本文档预览页，确认内容后再进行支付；
如遇文件无法下载、无法访问或其它任何问题，可发送电子邮件反馈，核实后将进行文件补发或退款等其它相关操作；
邮箱：

文档浏览记录