machine learning

33
Introduction to Introduction to Machine Learning Machine Learning Algorithms Algorithms

Upload: butest

Post on 24-Dec-2014

390 views

Category:

Documents


0 download

DESCRIPTION

 

TRANSCRIPT

Page 1: Machine Learning

Introduction toIntroduction toMachine Learning AlgorithmsMachine Learning Algorithms

Page 2: Machine Learning

2

What is Artificial Intelligence (AI)?What is Artificial Intelligence (AI)?

Design and study of computer programs that behave intelligently.

Designing computer programs to make computers smarter.

Study of how to make computers do things at which, at the moment, people are better.

Page 3: Machine Learning

3

Research Areas and ApproachesResearch Areas and Approaches

ArtificialIntelligence

Research

Rationalism (Logical)Empiricism (Statistical)Connectionism (Neural)Evolutionary (Genetic)Biological (Molecular)

Paradigm

Application

Intelligent AgentsInformation RetrievalElectronic CommerceData MiningBioinformaticsNatural Language Proc.Expert Systems

Learning AlgorithmsInference MechanismsKnowledge RepresentationIntelligent System Architecture

Page 4: Machine Learning

4

Concept of Machine LearningConcept of Machine Learning

Page 5: Machine Learning

5

Page 6: Machine Learning

6

InformationTheory

ContextContext

ComputerScience

(AI)

CognitiveScience

Statistics

MachineLearning

Page 7: Machine Learning

7

Why Machine Learning?Why Machine Learning?

Recent progress in algorithms and theory Growing flood of online data Computational power is available Budding industry

Three niches for machine learning Data mining: using historical data to improve decisions

Medical records --> medical knowledge Software applications we can’t program by hand

Autonomous driving Speech recognition

Self-customizing programs Newsreader that learns user interests

Page 8: Machine Learning

8

Learning: DefinitionLearning: Definition

Definition Learning is the improvement of performance in some

environment through the acquisition of knowledge resulting from experience in that environment.

the improvementof behavior

the improvementof behavior

on someperformance task

on someperformance task

through acquisitionof knowledge

through acquisitionof knowledge

based on partial task experience

based on partial task experience

Page 9: Machine Learning

9

A Learning Problem: A Learning Problem: EnjoySportEnjoySport

Sky

What is the general concept?

Temp Humid Wind WaterForecast EnjoySports

Sunny Warm Normal Strong Warm Same Yes

Sunny Warm High Strong Warm Same Yes Rainy Cold High Strong Warm Change No

Sunny Warm High Strong Cool Change Yes

Page 10: Machine Learning

10

Metaphors and MethodsMetaphors and Methods

Neurobiology

BiologicalEvolution

HeuristicSearch

StatisticalInference

Memory andRetrieval

ConnectionistLearning

Genetic Learning Tree / RuleInduction

Case-BasedLearning

ProbabilisticInduction

Page 11: Machine Learning

11

What is the Learning Problem?What is the Learning Problem?

Learning = improving with experience at some task Improve over task T, With respect to performance measure P, Based on experience E.

E.g., Learn to play checkers T: Play checkers P: % of games won in world tournament E: opportunity to play against self

Page 12: Machine Learning

12

Machine Learning: TasksMachine Learning: Tasks

Supervised Learning Estimate an unknown mapping from known input- output pairs Learn fw from training set D={(x,y)} s.t.

Classification: y is discrete Regression: y is continuous

Unsupervised Learning Only input values are provided Learn fw from D={(x)} s.t.

Compression Clustering

Reinforcement Learning

)()( xxw fyf

xxw )(f

Page 13: Machine Learning

13

Machine Learning: StrategiesMachine Learning: Strategies

Rote learning Concept learning Learning from examples Learning by instruction Inductive learning Deductive learning Explanation-based learning (EBL) Learning by analogy Learning by observation

Page 14: Machine Learning

14

Supervised LearningSupervised Learning

Given a sequence of input/output pairs of the form <xi, yi>, where xi is a possible input and yi is the output associated with xi.

Learn a function f that accounts for the examples seen so far, f(xi) = yi for all i, and that makes a good guess for the outputs of the inputs that it has not seen.

Page 15: Machine Learning

15

Examples of Input-Output PairsExamples of Input-Output Pairs

Task Inputs Outputs

Recognition

Action

Janitor robot

problem

Descriptions of

objects

Classes that the

objects belong to

Actions or predictionsDescriptions of

situations

Descriptions of

offices (floor, prof’s office)

Yes or No (indicating

whether or not the

office contains a

recycling bin)

Page 16: Machine Learning

16

Unsupervised LearningUnsupervised Learning

Clustering A clustering algorithm partitions the inputs into a fixed

number of subsets or clusters so that inputs in the same cluster are close to one another.

Discovery learning The objective is to uncover new relations in the data.

Page 17: Machine Learning

17

Online and Batch LearningOnline and Batch Learning

Batch methods Process large sets of examples all at once.

Online (incremental) methods Process examples one at a time.

Page 18: Machine Learning

18

Machine Learning Algorithms and Machine Learning Algorithms and ApplicationsApplications

Page 19: Machine Learning

19

Machine Learning AlgorithmsMachine Learning Algorithms

Neural Learning Multilayer Perceptrons (MLPs) Self-Organizing Maps (SOMs)

Evolutionary Learning Genetic Algorithms

Probabilistic Learning Bayesian Networks (BNs)

Other Machine Learning Methods Decision Trees (DTs)

Page 20: Machine Learning

20

Neural Nets for Handwritten Digit Neural Nets for Handwritten Digit RecognitionRecognition

Pre-processing

… Input units

Hidden units

Output units0 1 2 3 9

Training Test

0 1 2 3 9

?

Page 21: Machine Learning

21

ALVINN System: ALVINN System: Neural Network Learning to Steer Neural Network Learning to Steer

an Autonomous Vehiclean Autonomous Vehicle

Page 22: Machine Learning

22

Learning to Navigate a Vehicle by Learning to Navigate a Vehicle by Observing an Human Expert (1/2)Observing an Human Expert (1/2)Inputs

The images produces by a camera mounted on the vehicle

Outputs The actions taken by the human driver to steer

the vehicle or adjust its speed.

Result of learning A function mapping images to control actions

Page 23: Machine Learning

23

Learning to Navigate a Vehicle by Learning to Navigate a Vehicle by Observing an Human Expert (2/2)Observing an Human Expert (2/2)

Page 24: Machine Learning

24

Data Recorrection by a Hopfield NetData Recorrection by a Hopfield Networkwork

original target data

corrupted input data

Recorrected data after

10 iterations

Recorrected data after

20 iterations

Fullyrecorrected data after

35 iterations

Page 25: Machine Learning

25

ANN for Face Recognition

960 x 3 x 4 network is trained on gray-level images of faces to predict whether a person is looking to their left, right, ahead, or up.

Page 26: Machine Learning

26

Data MiningData Mining

-- -- ---- -- ---- -- --

-- -- ---- -- ---- -- --

Target data

Cleaned data

Transformed data

Patterns/ model

KnowledgeDatabase/data warehouse

Selection& Sampling

Selection& Sampling

Preprocessing& Cleaning

Preprocessing& Cleaning

Transformation& reduction

Transformation& reduction

Interpretation/Evaluation

Interpretation/EvaluationData MiningData Mining

Performance system

Page 27: Machine Learning

27

Hot Water Flashing Nozzle with Hot Water Flashing Nozzle with Evolutionary AlgorithmsEvolutionary Algorithms

Start

Hot water entering Steam and droplet at exit

At throat: Mach 1 and onset of flashing

Hans-Paul Schwefel performed the original experiments

Page 28: Machine Learning

28

Machine Learning Applications in Machine Learning Applications in BioinformaticsBioinformatics

Page 29: Machine Learning

29

Bayesian NetworksBayesian Networksfor Gene Expression Analysisfor Gene Expression Analysis

Processed

dataData

Preprocessing

Learningalgorithm

Gene C Gene B

Gene A

Target

Gene D

Gene C Gene B

Gene A

Target

Gene D

Gene C Gene B

Gene A

Target

Gene D

Gene C Gene B

Gene A

Target

Gene D

The values of Gene C and Gene B are given.

Belief propagation Probability for the target is computed.

Learning

Inference

Page 30: Machine Learning

30

Multilayer Perceptrons for Gene Multilayer Perceptrons for Gene Finding and PredictionFinding and Prediction

Coding potential valueCoding potential value

GC CompositionGC Composition

LengthLength

DonorDonor

AcceptorAcceptor

Intron vocabularyIntron vocabulary

basesDiscrete

exon score

0

1

sequence

score

Page 31: Machine Learning

31

Self-Organizing Maps for DNA MiSelf-Organizing Maps for DNA Microarray Data Analysiscroarray Data Analysis

Two-dimensional arrayof postsynaptic neurons

Bundle of synapticconnections

Winning neurons

Input

Page 32: Machine Learning

32

Biological Information ExtractionBiological Information ExtractionText Data

DB

LocationDate

DB Record

Database TemplateFilling

Data Analysis &Field Identification

Data Classification &Field Extraction

Information Extraction

Field PropertyIdentification & Learning

Page 33: Machine Learning

33

Biomolecular ComputingBiomolecular Computing

011001101010001 ATGCTCGAAGCT