sztuka czytania między wierszami - r i data mining
DESCRIPTION
Slajdy stanowią ramy warsztatu z R i data miningu (poziom podstawowy). Materiały przykładowe z komentarzami w języku polskim: https://gist.github.com/kmrowca/publicTRANSCRIPT
![Page 1: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/1.jpg)
Sztuka czytania między wierszami
czyli język R i Data Mining w akcji
![Page 2: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/2.jpg)
Katarzyna Mrowca
<me>
</me>
![Page 3: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/3.jpg)
![Page 4: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/4.jpg)
The deal
![Page 5: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/5.jpg)
Agenda
• Quick glance on theory - Data mining• Exercises on… paper• Quick glance on tool – R console• Exercises – became friend with R• …
![Page 6: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/6.jpg)
Agenda
• Quick glance on theory - Data mining• Exercises on… paper• Quick glance on tool – R console• Exercises – became friend with R• …
Exercise
Theory
![Page 7: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/7.jpg)
Agenda
• Quick glance on theory - Data preparation• Exercises • Decision trees• Cluser analysis• Text mining• …
Exercise
Theory
![Page 8: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/8.jpg)
Agile is everywhere!
![Page 9: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/9.jpg)
Agile is everywhere!
• Retro after second break
![Page 10: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/10.jpg)
Quick glance on theory!
![Page 11: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/11.jpg)
What data mining is?
![Page 12: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/12.jpg)
What „google” says?
![Page 13: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/13.jpg)
What „google” says?
Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD), [1] an interdisciplinary subfield of computer science,
![Page 14: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/14.jpg)
What „google” says?
Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD), an interdisciplinary subfield of computer science, is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics.
![Page 15: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/15.jpg)
What „google” says?
Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD), an interdisciplinary subfield of computer science, is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics.
![Page 16: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/16.jpg)
What „google” says?
Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD), an interdisciplinary subfield of computer science, is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics.
![Page 17: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/17.jpg)
What „google” says?
Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD), an interdisciplinary subfield of computer science, is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics.
![Page 18: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/18.jpg)
What „google” says?
Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD), an interdisciplinary subfield of computer science, is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics.
![Page 19: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/19.jpg)
What „google” says?
The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use.
![Page 20: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/20.jpg)
What „google” says?
The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use.
![Page 21: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/21.jpg)
What „google” says?
The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use.
![Page 22: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/22.jpg)
What „google” says?
Aside from the raw analysis step, it involves database and data management aspects, data pre-processing, model and inference considerations, interestingness metrics, complexity considerations, post-processing of discovered structures, visualization, and online updating.
Source: wikipedia
![Page 23: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/23.jpg)
Data mining – what is „inside”
• Predictive• Regression• Classification• Collaborative Filtering
• Descriptive• Clustering / similarity matching• Association rules and variants• Deviation detection
![Page 24: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/24.jpg)
Data mining – what is „inside”
• Predictive:• Regression• Classification• Collaborative Filtering
• Descriptive:• Clustering / similarity matching• Association rules and variants• Deviation detection
![Page 25: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/25.jpg)
Data mining – what is „inside”
• Predictive:• Regression• Classification• Collaborative Filtering
• Descriptive:• Clustering / similarity matching• Association rules and variants• Deviation detection
![Page 26: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/26.jpg)
What data mining is not?
![Page 27: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/27.jpg)
Why Data Mining is so popular?
![Page 28: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/28.jpg)
What is a difference between statistics and data mining?
![Page 29: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/29.jpg)
Exercise
![Page 30: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/30.jpg)
Data preparation
![Page 31: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/31.jpg)
Variables
![Page 32: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/32.jpg)
Qualitative & Quantitative
![Page 33: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/33.jpg)
Tame R console!
![Page 34: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/34.jpg)
Take a break
![Page 35: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/35.jpg)
Regression
![Page 36: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/36.jpg)
Time series
![Page 37: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/37.jpg)
Decision trees
![Page 38: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/38.jpg)
Regression trees
![Page 39: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/39.jpg)
Classification trees
![Page 40: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/40.jpg)
K means
![Page 41: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/41.jpg)
Text mining
![Page 42: Sztuka czytania między wierszami - R i Data mining](https://reader033.vdocuments.site/reader033/viewer/2022042814/555cb5c0d8b42aad358b5718/html5/thumbnails/42.jpg)
Thank you!