plotcon 2016 visualization talk by alexandra johnson

15
Visualizing Abstract Concepts in Machine Learning PIC Alexandra Johnson ___________ Software Engineer @ SigOpt #MachineLearning #MLViz Visualizing Abstract Concepts in Machine Learning | 1

Upload: sigopt

Post on 16-Apr-2017

1.005 views

Category:

Software


0 download

TRANSCRIPT

Page 1: Plotcon 2016 Visualization Talk  by Alexandra Johnson

Visualizing Abstract Concepts inMachine Learning 

PICAlexandra Johnson

___________Software Engineer @ SigOpt

#MachineLearning #MLViz

Visualizing Abstract Concepts in Machine Learning | 1

Page 2: Plotcon 2016 Visualization Talk  by Alexandra Johnson

Visualizing Abstract Concepts in Machine Learning | 2

What is Machine Learning?

Versicolor

Setosa

Virginica

Training Data + Model -> Labels (Classification)or Numbers (Regression)

Page 3: Plotcon 2016 Visualization Talk  by Alexandra Johnson

Why is this so Intimidating?

Visualizing Abstract Concepts in Machine Learning | 3

In-brower deep neural net from playground.tensorflow.org

Hyperparameters = yourmodel's magic numbers Examples: learning rate, ratioof train to test data, numberof hidden layers, neurons perhidden layerHyperparameter values mustbe set before training

Page 4: Plotcon 2016 Visualization Talk  by Alexandra Johnson

Solution: Hyperparameter OptimizationAnd four visualization challenges

Visualizing Abstract Concepts in Machine Learning | 4

Page 5: Plotcon 2016 Visualization Talk  by Alexandra Johnson

Values you choose for yourhyperparameters have adirect effect on theperformance of your modelHard to capture interactionsof 20 hyperparameters

20 Dimensional Math is Hard

Visualizing Abstract Concepts in Machine Learning | 5

Page 6: Plotcon 2016 Visualization Talk  by Alexandra Johnson

−15 −10 −5 0 5

0.2

0.4

0.6

0.8

1

log_C

Accuracy

Visualizing Abstract Concepts in Machine Learning | 6

20 Dimensional Math is Hard

First try: graph modelperformance vshyperparameter value For every hyperparameterGood for understandingindivudal hyperparameters,bad for understandinginteractions

Page 7: Plotcon 2016 Visualization Talk  by Alexandra Johnson

0.3

0.4

0.5

0.6

0.7

0.8

0.9

Accuracy

Visualizing Abstract Concepts in Machine Learning | 7

20 Dimensional Math is Hard

Graph up to 4 dimensions atonce: x, y, z axis + colorHard to visualize 4dimensions at once, imagine20!Maybe you want to use analgorithm to handlehyperparameter optimization

Page 8: Plotcon 2016 Visualization Talk  by Alexandra Johnson

Visualizing Abstract Concepts in Machine Learning | 8

Hyperparameter OptimizationStrategies are Different

Grid Search Random Search Bayesian Optimization

Page 9: Plotcon 2016 Visualization Talk  by Alexandra Johnson

Some Strategies ProduceBetter Results

0.96 0.97 0.98 0.990

5

10

15

20

25

Distribution of Best Found Values over Experiments of 25 Iterations

Maximum Accuracy

Ex

pe

rim

en

ts

Visualizing Abstract Concepts in Machine Learning | 9

Experiment = optimizinghyperparameters of yourmodel, results in somemaximum performanceSome hyperparameteroptimization strategies arestochastic, can't just look atone experimentLook at distribution ofmaximum performance overmany experiments optimizinghyperparameters of the samemodel

Page 10: Plotcon 2016 Visualization Talk  by Alexandra Johnson

Some Strategies ProduceBetter Results

0.92 0.93 0.94 0.95 0.96 0.97 0.98 0.99 10

5

10

15

20

25

Distribution of Best Found Values over Experiments of 25 Iterations

Maximum Accuracy

Ex

pe

rim

en

ts

Random Search

Grid Search

Bayesian Optimization

Visualizing Abstract Concepts in Machine Learning | 10

Use the Mann-Whitney U Test to compare distributions ofmaximum performance

Page 11: Plotcon 2016 Visualization Talk  by Alexandra Johnson

Some Strategies ProduceBetter Results, Faster

0 5 10 15 20

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

Best Seen Trace

Timestep

Be

st

Se

en

Accu

ra

cy

Visualizing Abstract Concepts in Machine Learning | 11

How much time do you havefor optimization?Strategies that reliablyproduce better results fastercan optimize thehyperparameters of yourmodel in less time

Page 12: Plotcon 2016 Visualization Talk  by Alexandra Johnson

Some Strategies ProduceBetter Results, Faster

0 5 10 15 20

0.4

0.5

0.6

0.7

0.8

0.9

1

Interquartile Range of Best Seen Traces

Timestep

Be

st

Se

en

Accu

ra

cy

Visualizing Abstract Concepts in Machine Learning | 12

Again, consider a distributionof optimization experiments25th - 75th percentile ofperformance our modelcould acheive if we stoppedearly

Page 13: Plotcon 2016 Visualization Talk  by Alexandra Johnson

Some Strategies ProduceBetter Results, Faster

0 5 10 15 20

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

Interquartile Ranges of Best Seen Traces

Timestep

Be

st

Se

en

Accu

ra

cy

Grid Search

Random Search

Bayesian Optimization

Visualizing Abstract Concepts in Machine Learning | 13

Compare the area under thecurve of different strategies Further reading atsigopt.com/research

Page 14: Plotcon 2016 Visualization Talk  by Alexandra Johnson

Takeaways

Visualizing Abstract Concepts in Machine Learning | 14

Hyperparameter optimization is an invaluable part of any modernmachine learning pipeline

Concepts like comparing hyperparameter optimization strategiesare extremely abstract and difficult to understand 

Visualizations are in their infancy, but are an important part ofexplaining these ideas

Page 15: Plotcon 2016 Visualization Talk  by Alexandra Johnson

Thank You!

Visualizing Abstract Concepts in Machine Learning | 14

Email: [email protected]: @alexandraj777

www.sigopt.com