cs2220 introduction to computational biology

32
CS2220 Introduction to Computational Biology Weka Introduction Xiaoli Li

Upload: others

Post on 09-Apr-2022

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: CS2220 Introduction to Computational Biology

CS2220 Introduction to Computational Biology 

Weka Introduction

Xiaoli Li

Page 2: CS2220 Introduction to Computational Biology

http://www.kdnuggets.com/polls/2012/analytics-data-mining-big-data-software.html

Most Popular Tools for Data Mining

Page 3: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in Java

http://www.cs.waikato.ac.nz/~ml/weka/

Page 4: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in JavaSelect the Data for Exploring

Page 5: CS2220 Introduction to Computational Biology

We can see and edit our data

Page 6: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in Java

Actual Weka data format

Example: Predict if we want to play

Page 7: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in JavaClass label: To Play or Not to Play?

Page 8: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in JavaOutlook: To Play or Not to Play?

Page 9: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in Java

http://www.cs.waikato.ac.nz/~ml/weka/

Temperature: To Play or Not to Play?

Page 10: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in Java

http://www.cs.waikato.ac.nz/~ml/weka/

Humidity: To Play or Not to Play?

Page 11: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in JavaWindy: To Play or Not to Play?

Page 12: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in JavaClassification: To Play or Not to Play?

Page 13: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in JavaJ48 Decision Tree: To Play or Not to Play?

Actual tree structure

Page 14: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in JavaDecision Tree: To Play or Not to Play?

Training set has been used as test set

Page 15: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in JavaDecision Tree: To Play or Not to Play?

Leave one out CVTree structure

Page 16: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in JavaDecision Tree: To Play or Not to Play?

Change the parameter of J48

Page 17: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in Java

Page 18: CS2220 Introduction to Computational Biology

Pattern Discovery: ClusteringFind “natural” grouping of instances

given un-labeled data

Page 19: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in JavaExample: 3 Types of IRIS Plant{Setosa, Versicolor & Virginica}

Page 20: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in JavaExample: 3 Types of IRIS Plant{Setosa, Versicolor & Virginica}

Page 21: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in JavaClustering: 3 Types of IRIS Plant{Setosa, Versicolor & Virginica}

Page 22: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in Java

http://www.cs.waikato.ac.nz/~ml/weka/

Clustering: 3 Types of IRIS Plant{Setosa, Versicolor & Virginica}

Page 23: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in Java

http://www.cs.waikato.ac.nz/~ml/weka/

Clustering: 3 Types of IRIS Plant{Setosa, Versicolor & Virginica}

Page 24: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in Java

http://www.cs.waikato.ac.nz/~ml/weka/

Clustering: 3 Types of IRIS Plant{Setosa, Versicolor & Virginica}

Page 25: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in Java

http://www.cs.waikato.ac.nz/~ml/weka/

Clustering: 3 Types of IRIS Plant{Setosa, Versicolor & Virginica}

Page 26: CS2220 Introduction to Computational Biology

Association Rules (Unsupervised Learning)Finding groups of items that tend to

occur together

Page 27: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in Java

Page 28: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in JavaExample: Supermarket Purchases

Page 29: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in JavaExample: Supermarket Purchases

Page 30: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in Java

http://www.cs.waikato.ac.nz/~ml/weka/

Association Rules: Supermarket Purchases

Page 31: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in JavaAssociation Rules: Supermarket Purchases

Page 32: CS2220 Introduction to Computational Biology

Contact: [email protected] if you have questions