当前位置:高等教育资讯网  >  中国高校课件下载中心  >  大学文库  >  浏览文档

电子科技大学:《大数据分析与挖掘 Big Data Analysis and Mining》课程教学资源(课件讲稿)Lecture 5 Data Stream Mining

资源类别:文库,文档格式:PDF,文档页数:45,文件大小:3.28MB,团购合买
 What is data stream?  What is Concept Drift?  Data stream classification  Data stream clustering
点击下载完整版文档(PDF)

Lecture 5 Data Stream Mining

Lecture 5 Data Stream Mining

Outline ▣What is data stream? What is Concept Drift? Data stream classification Data stream clustering

 What is data stream?  What is Concept Drift?  Data stream classification  Data stream clustering Outline

Internet Surveillance SPAM SPAM FILTER Spam Filtering DATA Network Intrusion Industry STREAM Mobile Smart Phone Sensor *Note:some pictures derived from internet

DATA STREAM Internet Industry Surveillance *Note: some pictures derived from internet Sensor Network Intrusion Smart Phone Spam Filtering Mobile

Potential Applications Telecommunication calling records Business:credit card transaction flows Network monitoring and traffic engineering Financial market:stock exchange Engineering industrial processes:power supply manufacturing Sensor,monitoring surveillance:video streams,RFIDs ·Security monitoring Web logs and Web page click streams

Potential Applications • Telecommunication calling records • Business: credit card transaction flows • Network monitoring and traffic engineering • Financial market: stock exchange • Engineering & industrial processes: power supply & manufacturing • Sensor, monitoring & surveillance: video streams, RFIDs • Security monitoring • Web logs and Web page click streams

What is data stream? A data stream is a massive sequence of data objects which have some unique features: >One by One >Potentially Unbounded >Concept Drift data4 data3 data2 datal Data mining system Data stream

What is data stream? A data stream is a massive sequence of data objects which have some unique features:  One by One  Potentially Unbounded  Concept Drift data1 Data stream data4 data3 data2 Data mining system

Challenges Data Stream:(a)Infinite Length (b)Evolving Nature ◆Single Pass Handling ◆Memory Limitation ◆Low Time Complexity ◆Concept Drift

Challenges Data Stream: (a) Infinite Length (b) Evolving Nature  Single Pass Handling  Memory Limitation  Low Time Complexity  Concept Drift

What is concept drift? In predictive analytics and machine learning,the concept drift means that the statistical properties of the target variable, which the model is trying to predict,change over time in unforeseen ways. In a word,the probability distribution changes. ·Change in P(c) ·Change in P(X) ·Change in P(ClX)

What is concept drift? In predictive analytics and machine learning, the concept drift means that the statistical properties of the target variable, which the model is trying to predict, change over time in unforeseen ways. In a word, the probability distribution changes. • Change in P(C) • Change in P(X) • Change in P(C|X)

Real concept drift vs.Virtual concept drift Original data Real concept drift Virtual drift ● p(yX)changes p(X)changes,but not p(ylX) P(C,IX)=P(C)P(XIC,) P(X)

Real concept drift vs. Virtual concept drift P(C ) P(X | C ) (C | X) P(X) i i P i 

Example:Concept-Drift Current hyperplane 0 O 0 0 0 O 0 6 0 00 0 00 8 000 8 000 0 O Previous hyperplane A data chunk Negative instance● Instances victim of concept-drift Positive instance o

Example: Concept-Drift Negative instance Positive instance A data chunk Current hyperplane Previous hyperplane Instances victim of concept-drift

1,Concept Drift Detection

1、 Concept Drift Detection

点击下载完整版文档(PDF)VIP每日下载上限内不扣除下载券和下载次数;
按次数下载不扣除下载券;
24小时内重复下载只扣除一次;
顺序:VIP每日次数-->可用次数-->下载券;
共45页,可试读15页,点击继续阅读 ↓↓
相关文档

关于我们|帮助中心|下载说明|相关软件|意见反馈|联系我们

Copyright © 2008-现在 cucdc.com 高等教育资讯网 版权所有