(bdt311) deep learning: going beyond machine learning

Post on 13-Apr-2017

3.291 Views

Category:

Technology

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

© 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

Chida Chidambaram

Vishal Deshpande

BDT311

Deep Learning

Going Beyond Machine Learning

October 2015

What to Expect from the Session

Data analytics options on AWS

Machine learning (ML) – high level

Amazon ML from AWS

ML sample use case

Deep learning (DL) – high level

DL sample use cases

AWS GPU/HPCC server family

Q&A

Data Analytics Options on AWS

Amazon EMR

AnalyzeStoreIngest

Amazon

Kinesis DynamoDBAmazon Redshift

RDSS3 Amazon Kinesis

ConsumerMachine Learning

Amazon Kinesis

Producer

Traditional Server Mobile Clients

EC2 Machines

Machine Learning

Machine Learning

How can a machine identify Bruce Willis vs Jason

Statham?

Bruce Willis ???

Machine Learning

Machine Learning

Artificial Intelligence

Optimization & Control

Neuroscience and Neural Networks

Statistical Modeling

Information Theory

Machine Learning

Bear

Eagle

People

Sunset

Machine Learning

• Using machines to discover trends and patterns and compute

mathematical predictive models based on factual past data

• ML models provide insights into likely outcomes based on the past –

machine learning helps uncover the probability of an outcome in the

future rather than merely state what has already happened in the past

• Past data and statistical modeling is used to make predictions based

on probability

Where traditional business analytics aims at answering questions about

past events, machine learning aims at answering questions about the

possibilities of future events

Machine Learning

Supervised learning

Human intervention and validation required

Photo classification and tagging

Unsupervised learning

No human intervention required

Auto-classification of documents based on context

Machine Learning

Collect

Validation data Test dataTraining data

Model training Model validation Final predictions

Machine Learning – Process

• Input feature selection – what are my predictions going

to be based on

• Target – what you want to predict

• Prediction function – regression, classification,

dimensionality reduction

Xn -> F(xn) -> T(x)

Machine Learning – Process

X1 X2 X3 X4 X5 Y

0.3 0.25 0.4 0.34 0.2 1

0.14 0.17 0.2 0.3 0.2 0

0.24 0.21 0.19 0.15 0.35 1

0.3 0.25 0.35 0.4 0.45 1

𝜒𝑛𝜖𝐹(𝑥𝑛) ; Target: y

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

0.45

0.5

0 1 2 3 4 5

X1 X2 X3 X4 X5

Machine Learning – Process

How can a machine identify Bruce Willis vs Jason

Statham?

Image analysis –

Input feature set for image 1 -> bald, black suit

Bruce Willis ???

Machine Learning – Process• Start with data for which the answer is already known

• Identify the target – what you want to predict from the data

• Pick the variables/features that can be used to identify the patterns

to predict the target

• Train the ML model with the dataset for which you already know the

target answer

• Use the trained model to predict the target on the data for which the

answer is not known

• Evaluate the model for accuracy

• Improve the model accuracy as needed

Machine Learning – When to Use It

You need ML if

• Simple classification rules are inadequate

• Scalability is an issue with large number of datasets

You do not need ML if

• You can predict the answers by using simple rules and computations

• You can program predetermined steps without needing any data

driven learning

Machine Learning from AWS

Amazon Machine Learning is a service that makes it easy

for developers of all skill levels to use machine learning

technology.

Machine Learning from AWS

Machine Learning from AWS

Machine Learning from AWS

• AWS ecosystem integration

• Pre-built ML algorithms

• Batch and real-time prediction

• Faster models to predictions

• Data visualizations and exploration

• Data transformations

• Fully-managed

• Pay as you go

Machine Learning – Uses

Predictive analytics

• What is the likelihood that a customer visiting my e-commerce site will buy my product

• What is the probability of a congressional bill being passed

Classification / grouping

• Auto classification and tagging of images

• Video classification

• Auto categorization of raw text data based on predefined ontologies

Machine Learning Use Cases

• Personalization – present personalized e-commerce

experience

• Document classification – auto classify documents

based on the context

• Fraud detection – discover anomalies to regular

behavior to identify and flag fraudulent transactions

• Recommendation engines

• Customer churn prediction

Deep Learning – Advanced ML

Deep Learning – Going Beyond ML

ML algorithms that are either supervised or unsupervised

and

• Use many layers of nonlinear processing units for

feature extraction and transformation

• Are based on learning multiple levels of features or

representation in each layer, with the layers forming a

hierarchy of low-level to high-level features

Where traditional machine learning focuses on feature

engineering, deep learning focuses on end-to-end

learning based on raw features

Deep Learning

Bear

Eagle

People

Sunset

Object: Bear

Location: Yellowstone Park

Action: Looking for food

Object: Eagle

Location: Wakula Springs, FL

Action: Resting

Object: Multiple – people, ball

Location: Montana

Action: Playing

Object: ?

Location: Montana

Action: ?

Deep Learning – Neural Networks

A collection of simple, trainable mathematical units that

collectively learn complex functions

Output

Neural network

Input

Hidden layers

Deep Learning – Train

X Bear

Grizzly Bear

Polar Bear

Dog

Fox

Feedback

Neural network

Deep Learning – Deploy

Grizzly Bear

Neural network

Deep Learning – Flow

Train

DeployModel

Classification

Detection

Segmentation

Feedback

Training dataset

Solver

Neural network

Train

Solver

Network

Dashboard

Deep Learning – Data Representation

Hierarchy of representations

• Image – vectors of pixel, motif, part, contour, edge, etc.

• Videos – Image frames, pixels per frame, deltas per

frame, etc.

• Text – characters, words, clauses, sentences, etc.

• Speech – audio, band, frequency, wavelengths,

modulations, phonetics, etc.

Deep Learning – Advantages

• Features automatically deduced and optimally tuned for

the desired outcome

• Robustness to variations automatically learned

• Reusability – same neural network approach can be

used for many applications and data types

• Massively parallel computations through use of GPUs –

scalable for large volumes of data

Deep Learning – Traction

• Cloud and big data eco-system – cost reduction in

computation and storage capacity for huge volumes of

data

• New advancements in deep learning toolkit with better

GPU computation tools and libraries

• Advancements in GPU acceleration and availability of

GPU clusters through the cloud infrastructure

What is driving deep learning…

Deep Learning on AWS - GPU Servers

• Family of servers for DL/HPCC

• C4 instances – for high performance computing

• G2 instances – for additional CUDA processing used in

deep learning

• Four NVIDIA GRID GPUs, each with 1,536 CUDA cores and

4 GB of video

• 32 vCPUs

• 60 GB of memory

• 240 GB (2 x 120) of SSD storage

Application Code

CPUGPU

Compute

Intensive

Code

Rest of

Code

AWS GPU Servers

Deep Learning – GPU Acceleration

Batch size Training time

(CPU)

Training time

(GPU)

64 images 64s 7.5s

128 images 124s 14.5s

256 images 257s 28.5s

Training a deep neural network for image processing

CPU : Dual 10-core Ivy Bridge CPUs

GPU : 1 Tesla K40 GPU

Implemented with Caffe

* nVidia

Deep Learning – Software Tools and Libraries

• Theano (Python)

• Blocks (Python/Theano)

• Lasange (Python/Theano)

• Pylearn2 (Python)

• Torch (Lua)

• Deeplearning4J (Java)

• Caffe

• CUDA-convent

Deep Learning – Uses

• Automatic speech recognition

• Image recognition

• Natural language processing

• Drug discovery and toxicology

• CRM and e-commerce

• Human behavior analysis

• Driverless cars

• Search and advertising

Deep Learning – Research

And more…

Image Recognition /

Computer Vision

DL – Image Recognition / Computer Vision

• Visual searches for retail

Industries

• Self-driving cars

• Home security

• Wearables

Natural Language Processing and

Speech Recognition

DL - Natural Language Processing and Speech

Recognition

• Understanding the meaning

• Similar or dissimilar words

• Contextual meaning

• Language modeling

• Language neural network

Restaurants near me

ML to DL – From Siri/Cortana to J.A.R.V.I.S

Restaurants near me

Good morning, sir. Would you like a cup

of coffee or a shot of vodka? Probably

the vodka would be a better choice for

you today.

DL Implementation –

Driverless Cars

Driverless Cars

• Google, Baidu, Mercedes Benz , Audi,

Tesla

• Deep neural network (DNN) models

• Real-time pedestrian detection

algorithms

• Processes TBs of data in real-time

• Keep the car moving!

• In addition to basic functions

Eurocars.com

Demo

Demo

Useful Resources

• Bring Your Own Data (BYOData) campaign from Day1

http://day1solutions.com/byo-data

• Amazon Machine Learning

http://aws.amazon.com/machine-learning

• Deep-Learning lab and courses

https://developer.nvidia.com/deep-learning-courses

• Deep-Learning resources

http://deeplearning.net

• Public data sets for Deep-Learning research

http://deeplearning.net/datasets/

Remember to complete

your evaluations!

Thank you!

top related