applications of deep learning to...

32
APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect August 2015

Upload: dinhtram

Post on 11-Nov-2018

222 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

APPLICATIONS OF DEEP

LEARNING TO GEOINT

Jon Barker Solutions Architect

August 2015

Overview

Motivation

Introduction to Deep Learning

GEOINT applications

Deep Learning deployment

Questions

Motivation

350 Million Images Uploaded a Day

100 Hours Video Uploaded Every Minute Rapid growth in remote sensing

numbers and capability

There is not enough time or expertise to write algorithms for each

individual information extraction task that needs to be performed

Deep Learning provides general algorithms that identify mission-

relevant content and patterns in raw data at machine speed

Tens of thousands of social and political

events indexed daily

Motivation Multi-INT analysis workflow

Metadata

filters

Human

perception Mission focused

analysis

Semantic

content

based filters

Mission focused

analysis

Automated

machine

perception

Noisy

content

Near-perfect

perception

Near-human

level

perception

Mission

relevant

content

BOTTLENECK TODAY

VISION

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

BIG DATA

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

BIG DATA

Deep Learning has become the most popular approach to developing Artificial Intelligence (AI) ndash machines that perceive and understand the world

The focus is currently on specific perceptual tasks and there are many successes

Today some of the worldrsquos largest internet companies as well as the foremost research institutions are using GPUs for deep learning in research and production

What is Deep Learning

CUDA for Deep Learning

Practical Deep Learning Examples

Image Classification Object Detection Localization Action Recognition Scene Understanding

Speech Recognition Speech Translation Natural Language Processing

Pedestrian Detection Traffic Sign Recognition Breast Cancer Cell Mitosis Detection Volumetric Brain Image Segmentation

Traditional Machine Perception ndash hand crafted features

Speaker ID

speech transcription hellip

Topic classification

machine translation

sentiment analysishellip

Raw data Feature extraction Result (Linear) Classifier

eg SVM

eg HMM

eg LSA

Deep Neural Network (DNN)

Modern reincarnation of Artificial Neural Networks

A very large collection of simple trainable mathematical units

Collectively they can learn very complex functions mapping raw

data to decisions

Loosely inspired by biological brains

ldquodogrdquo

Raw data Output

decision

Deep Learning approach

Train Dog

Cat

Honey badger

Dog

Cat

Raccoon Feature extraction

(Linear) Classifier

Deep Learning approach

Train Dog

Cat

Honey badger

Errors

Dog

Cat

Raccoon

Deep Learning approach

Train

Deploy

Dog

Cat

Honey badger

Errors

Dog

Cat

Raccoon

Dog

Deep Learning for Visual Perception

Input pixels

Output image class prediction

Biologically inspired Convolutional Neural Network (CNN)

Application components

bull Task objective

bull eg Identify face

bull eg Classify age

bull Training data

bull Typically 10K ndash

100M samples

bull Network architecture

bull Learning algorithm

Local receptive field

Visual Perception DL State of the Art

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

NORB dataset (2004)

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

5 object classes

Multiple views and illuminations

291600 training images

58230 test images [2]

lt6 classification error on test set with

cluttered backgrounds (NYU)

Deep Learning Dominates at Visual Perception

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

4

60

110

0

20

40

60

80

100

120

2010 2011 2012 2013 2014

GPU Entries

Remote Sensing Imagery Exploitation

Object detection and classification

Scene segmentation

Land usage classification

Geologic feature classification

Change detection

Crop yield prediction

Surface water estimation

Population density estimation

Super-resolution

Photogrammetry

[3] Keio University Japan ndash SPIE EI 2015

[4] University of Arizona

Deep Learning supports the analyst

NVIDIA 2015

Advanced Imaging Modalities

D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone

Detection from LiDAR In ICRA 2015

CNN architecture supports

MSIHSI data cubes

SAR imagery

Volumetric data eg LIDAR

Low-TRL research topics

Open-source Imagery Exploitation

Object detection

Scene labeling

Face recognition

Image geo-location estimation

Text extraction from images

Geographic property estimation

Image de-noising [6] Stanford University NLP group

Deep Learning Dominates at Visual Perception

NVIDIA 2014

Deep Learning supports the analyst

NVIDIA 2015

x1

x2

x3

xN

Deep Learning generalizes across problems

Classification

Regression

Unsupervised learning

bull Clustering

bull Topic extraction

bull Anomaly detection

Real-valued feature vector

Varied data types

(and multi-source) Varied tasks

Sequence prediction

Control policy learning

Constants Big (high dimensional) Data + a complex function to learn

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

Unstructured

Structured

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 2: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Overview

Motivation

Introduction to Deep Learning

GEOINT applications

Deep Learning deployment

Questions

Motivation

350 Million Images Uploaded a Day

100 Hours Video Uploaded Every Minute Rapid growth in remote sensing

numbers and capability

There is not enough time or expertise to write algorithms for each

individual information extraction task that needs to be performed

Deep Learning provides general algorithms that identify mission-

relevant content and patterns in raw data at machine speed

Tens of thousands of social and political

events indexed daily

Motivation Multi-INT analysis workflow

Metadata

filters

Human

perception Mission focused

analysis

Semantic

content

based filters

Mission focused

analysis

Automated

machine

perception

Noisy

content

Near-perfect

perception

Near-human

level

perception

Mission

relevant

content

BOTTLENECK TODAY

VISION

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

BIG DATA

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

BIG DATA

Deep Learning has become the most popular approach to developing Artificial Intelligence (AI) ndash machines that perceive and understand the world

The focus is currently on specific perceptual tasks and there are many successes

Today some of the worldrsquos largest internet companies as well as the foremost research institutions are using GPUs for deep learning in research and production

What is Deep Learning

CUDA for Deep Learning

Practical Deep Learning Examples

Image Classification Object Detection Localization Action Recognition Scene Understanding

Speech Recognition Speech Translation Natural Language Processing

Pedestrian Detection Traffic Sign Recognition Breast Cancer Cell Mitosis Detection Volumetric Brain Image Segmentation

Traditional Machine Perception ndash hand crafted features

Speaker ID

speech transcription hellip

Topic classification

machine translation

sentiment analysishellip

Raw data Feature extraction Result (Linear) Classifier

eg SVM

eg HMM

eg LSA

Deep Neural Network (DNN)

Modern reincarnation of Artificial Neural Networks

A very large collection of simple trainable mathematical units

Collectively they can learn very complex functions mapping raw

data to decisions

Loosely inspired by biological brains

ldquodogrdquo

Raw data Output

decision

Deep Learning approach

Train Dog

Cat

Honey badger

Dog

Cat

Raccoon Feature extraction

(Linear) Classifier

Deep Learning approach

Train Dog

Cat

Honey badger

Errors

Dog

Cat

Raccoon

Deep Learning approach

Train

Deploy

Dog

Cat

Honey badger

Errors

Dog

Cat

Raccoon

Dog

Deep Learning for Visual Perception

Input pixels

Output image class prediction

Biologically inspired Convolutional Neural Network (CNN)

Application components

bull Task objective

bull eg Identify face

bull eg Classify age

bull Training data

bull Typically 10K ndash

100M samples

bull Network architecture

bull Learning algorithm

Local receptive field

Visual Perception DL State of the Art

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

NORB dataset (2004)

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

5 object classes

Multiple views and illuminations

291600 training images

58230 test images [2]

lt6 classification error on test set with

cluttered backgrounds (NYU)

Deep Learning Dominates at Visual Perception

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

4

60

110

0

20

40

60

80

100

120

2010 2011 2012 2013 2014

GPU Entries

Remote Sensing Imagery Exploitation

Object detection and classification

Scene segmentation

Land usage classification

Geologic feature classification

Change detection

Crop yield prediction

Surface water estimation

Population density estimation

Super-resolution

Photogrammetry

[3] Keio University Japan ndash SPIE EI 2015

[4] University of Arizona

Deep Learning supports the analyst

NVIDIA 2015

Advanced Imaging Modalities

D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone

Detection from LiDAR In ICRA 2015

CNN architecture supports

MSIHSI data cubes

SAR imagery

Volumetric data eg LIDAR

Low-TRL research topics

Open-source Imagery Exploitation

Object detection

Scene labeling

Face recognition

Image geo-location estimation

Text extraction from images

Geographic property estimation

Image de-noising [6] Stanford University NLP group

Deep Learning Dominates at Visual Perception

NVIDIA 2014

Deep Learning supports the analyst

NVIDIA 2015

x1

x2

x3

xN

Deep Learning generalizes across problems

Classification

Regression

Unsupervised learning

bull Clustering

bull Topic extraction

bull Anomaly detection

Real-valued feature vector

Varied data types

(and multi-source) Varied tasks

Sequence prediction

Control policy learning

Constants Big (high dimensional) Data + a complex function to learn

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

Unstructured

Structured

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 3: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Motivation

350 Million Images Uploaded a Day

100 Hours Video Uploaded Every Minute Rapid growth in remote sensing

numbers and capability

There is not enough time or expertise to write algorithms for each

individual information extraction task that needs to be performed

Deep Learning provides general algorithms that identify mission-

relevant content and patterns in raw data at machine speed

Tens of thousands of social and political

events indexed daily

Motivation Multi-INT analysis workflow

Metadata

filters

Human

perception Mission focused

analysis

Semantic

content

based filters

Mission focused

analysis

Automated

machine

perception

Noisy

content

Near-perfect

perception

Near-human

level

perception

Mission

relevant

content

BOTTLENECK TODAY

VISION

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

BIG DATA

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

BIG DATA

Deep Learning has become the most popular approach to developing Artificial Intelligence (AI) ndash machines that perceive and understand the world

The focus is currently on specific perceptual tasks and there are many successes

Today some of the worldrsquos largest internet companies as well as the foremost research institutions are using GPUs for deep learning in research and production

What is Deep Learning

CUDA for Deep Learning

Practical Deep Learning Examples

Image Classification Object Detection Localization Action Recognition Scene Understanding

Speech Recognition Speech Translation Natural Language Processing

Pedestrian Detection Traffic Sign Recognition Breast Cancer Cell Mitosis Detection Volumetric Brain Image Segmentation

Traditional Machine Perception ndash hand crafted features

Speaker ID

speech transcription hellip

Topic classification

machine translation

sentiment analysishellip

Raw data Feature extraction Result (Linear) Classifier

eg SVM

eg HMM

eg LSA

Deep Neural Network (DNN)

Modern reincarnation of Artificial Neural Networks

A very large collection of simple trainable mathematical units

Collectively they can learn very complex functions mapping raw

data to decisions

Loosely inspired by biological brains

ldquodogrdquo

Raw data Output

decision

Deep Learning approach

Train Dog

Cat

Honey badger

Dog

Cat

Raccoon Feature extraction

(Linear) Classifier

Deep Learning approach

Train Dog

Cat

Honey badger

Errors

Dog

Cat

Raccoon

Deep Learning approach

Train

Deploy

Dog

Cat

Honey badger

Errors

Dog

Cat

Raccoon

Dog

Deep Learning for Visual Perception

Input pixels

Output image class prediction

Biologically inspired Convolutional Neural Network (CNN)

Application components

bull Task objective

bull eg Identify face

bull eg Classify age

bull Training data

bull Typically 10K ndash

100M samples

bull Network architecture

bull Learning algorithm

Local receptive field

Visual Perception DL State of the Art

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

NORB dataset (2004)

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

5 object classes

Multiple views and illuminations

291600 training images

58230 test images [2]

lt6 classification error on test set with

cluttered backgrounds (NYU)

Deep Learning Dominates at Visual Perception

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

4

60

110

0

20

40

60

80

100

120

2010 2011 2012 2013 2014

GPU Entries

Remote Sensing Imagery Exploitation

Object detection and classification

Scene segmentation

Land usage classification

Geologic feature classification

Change detection

Crop yield prediction

Surface water estimation

Population density estimation

Super-resolution

Photogrammetry

[3] Keio University Japan ndash SPIE EI 2015

[4] University of Arizona

Deep Learning supports the analyst

NVIDIA 2015

Advanced Imaging Modalities

D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone

Detection from LiDAR In ICRA 2015

CNN architecture supports

MSIHSI data cubes

SAR imagery

Volumetric data eg LIDAR

Low-TRL research topics

Open-source Imagery Exploitation

Object detection

Scene labeling

Face recognition

Image geo-location estimation

Text extraction from images

Geographic property estimation

Image de-noising [6] Stanford University NLP group

Deep Learning Dominates at Visual Perception

NVIDIA 2014

Deep Learning supports the analyst

NVIDIA 2015

x1

x2

x3

xN

Deep Learning generalizes across problems

Classification

Regression

Unsupervised learning

bull Clustering

bull Topic extraction

bull Anomaly detection

Real-valued feature vector

Varied data types

(and multi-source) Varied tasks

Sequence prediction

Control policy learning

Constants Big (high dimensional) Data + a complex function to learn

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

Unstructured

Structured

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 4: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Motivation Multi-INT analysis workflow

Metadata

filters

Human

perception Mission focused

analysis

Semantic

content

based filters

Mission focused

analysis

Automated

machine

perception

Noisy

content

Near-perfect

perception

Near-human

level

perception

Mission

relevant

content

BOTTLENECK TODAY

VISION

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

BIG DATA

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

BIG DATA

Deep Learning has become the most popular approach to developing Artificial Intelligence (AI) ndash machines that perceive and understand the world

The focus is currently on specific perceptual tasks and there are many successes

Today some of the worldrsquos largest internet companies as well as the foremost research institutions are using GPUs for deep learning in research and production

What is Deep Learning

CUDA for Deep Learning

Practical Deep Learning Examples

Image Classification Object Detection Localization Action Recognition Scene Understanding

Speech Recognition Speech Translation Natural Language Processing

Pedestrian Detection Traffic Sign Recognition Breast Cancer Cell Mitosis Detection Volumetric Brain Image Segmentation

Traditional Machine Perception ndash hand crafted features

Speaker ID

speech transcription hellip

Topic classification

machine translation

sentiment analysishellip

Raw data Feature extraction Result (Linear) Classifier

eg SVM

eg HMM

eg LSA

Deep Neural Network (DNN)

Modern reincarnation of Artificial Neural Networks

A very large collection of simple trainable mathematical units

Collectively they can learn very complex functions mapping raw

data to decisions

Loosely inspired by biological brains

ldquodogrdquo

Raw data Output

decision

Deep Learning approach

Train Dog

Cat

Honey badger

Dog

Cat

Raccoon Feature extraction

(Linear) Classifier

Deep Learning approach

Train Dog

Cat

Honey badger

Errors

Dog

Cat

Raccoon

Deep Learning approach

Train

Deploy

Dog

Cat

Honey badger

Errors

Dog

Cat

Raccoon

Dog

Deep Learning for Visual Perception

Input pixels

Output image class prediction

Biologically inspired Convolutional Neural Network (CNN)

Application components

bull Task objective

bull eg Identify face

bull eg Classify age

bull Training data

bull Typically 10K ndash

100M samples

bull Network architecture

bull Learning algorithm

Local receptive field

Visual Perception DL State of the Art

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

NORB dataset (2004)

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

5 object classes

Multiple views and illuminations

291600 training images

58230 test images [2]

lt6 classification error on test set with

cluttered backgrounds (NYU)

Deep Learning Dominates at Visual Perception

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

4

60

110

0

20

40

60

80

100

120

2010 2011 2012 2013 2014

GPU Entries

Remote Sensing Imagery Exploitation

Object detection and classification

Scene segmentation

Land usage classification

Geologic feature classification

Change detection

Crop yield prediction

Surface water estimation

Population density estimation

Super-resolution

Photogrammetry

[3] Keio University Japan ndash SPIE EI 2015

[4] University of Arizona

Deep Learning supports the analyst

NVIDIA 2015

Advanced Imaging Modalities

D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone

Detection from LiDAR In ICRA 2015

CNN architecture supports

MSIHSI data cubes

SAR imagery

Volumetric data eg LIDAR

Low-TRL research topics

Open-source Imagery Exploitation

Object detection

Scene labeling

Face recognition

Image geo-location estimation

Text extraction from images

Geographic property estimation

Image de-noising [6] Stanford University NLP group

Deep Learning Dominates at Visual Perception

NVIDIA 2014

Deep Learning supports the analyst

NVIDIA 2015

x1

x2

x3

xN

Deep Learning generalizes across problems

Classification

Regression

Unsupervised learning

bull Clustering

bull Topic extraction

bull Anomaly detection

Real-valued feature vector

Varied data types

(and multi-source) Varied tasks

Sequence prediction

Control policy learning

Constants Big (high dimensional) Data + a complex function to learn

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

Unstructured

Structured

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 5: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Deep Learning has become the most popular approach to developing Artificial Intelligence (AI) ndash machines that perceive and understand the world

The focus is currently on specific perceptual tasks and there are many successes

Today some of the worldrsquos largest internet companies as well as the foremost research institutions are using GPUs for deep learning in research and production

What is Deep Learning

CUDA for Deep Learning

Practical Deep Learning Examples

Image Classification Object Detection Localization Action Recognition Scene Understanding

Speech Recognition Speech Translation Natural Language Processing

Pedestrian Detection Traffic Sign Recognition Breast Cancer Cell Mitosis Detection Volumetric Brain Image Segmentation

Traditional Machine Perception ndash hand crafted features

Speaker ID

speech transcription hellip

Topic classification

machine translation

sentiment analysishellip

Raw data Feature extraction Result (Linear) Classifier

eg SVM

eg HMM

eg LSA

Deep Neural Network (DNN)

Modern reincarnation of Artificial Neural Networks

A very large collection of simple trainable mathematical units

Collectively they can learn very complex functions mapping raw

data to decisions

Loosely inspired by biological brains

ldquodogrdquo

Raw data Output

decision

Deep Learning approach

Train Dog

Cat

Honey badger

Dog

Cat

Raccoon Feature extraction

(Linear) Classifier

Deep Learning approach

Train Dog

Cat

Honey badger

Errors

Dog

Cat

Raccoon

Deep Learning approach

Train

Deploy

Dog

Cat

Honey badger

Errors

Dog

Cat

Raccoon

Dog

Deep Learning for Visual Perception

Input pixels

Output image class prediction

Biologically inspired Convolutional Neural Network (CNN)

Application components

bull Task objective

bull eg Identify face

bull eg Classify age

bull Training data

bull Typically 10K ndash

100M samples

bull Network architecture

bull Learning algorithm

Local receptive field

Visual Perception DL State of the Art

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

NORB dataset (2004)

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

5 object classes

Multiple views and illuminations

291600 training images

58230 test images [2]

lt6 classification error on test set with

cluttered backgrounds (NYU)

Deep Learning Dominates at Visual Perception

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

4

60

110

0

20

40

60

80

100

120

2010 2011 2012 2013 2014

GPU Entries

Remote Sensing Imagery Exploitation

Object detection and classification

Scene segmentation

Land usage classification

Geologic feature classification

Change detection

Crop yield prediction

Surface water estimation

Population density estimation

Super-resolution

Photogrammetry

[3] Keio University Japan ndash SPIE EI 2015

[4] University of Arizona

Deep Learning supports the analyst

NVIDIA 2015

Advanced Imaging Modalities

D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone

Detection from LiDAR In ICRA 2015

CNN architecture supports

MSIHSI data cubes

SAR imagery

Volumetric data eg LIDAR

Low-TRL research topics

Open-source Imagery Exploitation

Object detection

Scene labeling

Face recognition

Image geo-location estimation

Text extraction from images

Geographic property estimation

Image de-noising [6] Stanford University NLP group

Deep Learning Dominates at Visual Perception

NVIDIA 2014

Deep Learning supports the analyst

NVIDIA 2015

x1

x2

x3

xN

Deep Learning generalizes across problems

Classification

Regression

Unsupervised learning

bull Clustering

bull Topic extraction

bull Anomaly detection

Real-valued feature vector

Varied data types

(and multi-source) Varied tasks

Sequence prediction

Control policy learning

Constants Big (high dimensional) Data + a complex function to learn

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

Unstructured

Structured

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 6: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Practical Deep Learning Examples

Image Classification Object Detection Localization Action Recognition Scene Understanding

Speech Recognition Speech Translation Natural Language Processing

Pedestrian Detection Traffic Sign Recognition Breast Cancer Cell Mitosis Detection Volumetric Brain Image Segmentation

Traditional Machine Perception ndash hand crafted features

Speaker ID

speech transcription hellip

Topic classification

machine translation

sentiment analysishellip

Raw data Feature extraction Result (Linear) Classifier

eg SVM

eg HMM

eg LSA

Deep Neural Network (DNN)

Modern reincarnation of Artificial Neural Networks

A very large collection of simple trainable mathematical units

Collectively they can learn very complex functions mapping raw

data to decisions

Loosely inspired by biological brains

ldquodogrdquo

Raw data Output

decision

Deep Learning approach

Train Dog

Cat

Honey badger

Dog

Cat

Raccoon Feature extraction

(Linear) Classifier

Deep Learning approach

Train Dog

Cat

Honey badger

Errors

Dog

Cat

Raccoon

Deep Learning approach

Train

Deploy

Dog

Cat

Honey badger

Errors

Dog

Cat

Raccoon

Dog

Deep Learning for Visual Perception

Input pixels

Output image class prediction

Biologically inspired Convolutional Neural Network (CNN)

Application components

bull Task objective

bull eg Identify face

bull eg Classify age

bull Training data

bull Typically 10K ndash

100M samples

bull Network architecture

bull Learning algorithm

Local receptive field

Visual Perception DL State of the Art

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

NORB dataset (2004)

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

5 object classes

Multiple views and illuminations

291600 training images

58230 test images [2]

lt6 classification error on test set with

cluttered backgrounds (NYU)

Deep Learning Dominates at Visual Perception

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

4

60

110

0

20

40

60

80

100

120

2010 2011 2012 2013 2014

GPU Entries

Remote Sensing Imagery Exploitation

Object detection and classification

Scene segmentation

Land usage classification

Geologic feature classification

Change detection

Crop yield prediction

Surface water estimation

Population density estimation

Super-resolution

Photogrammetry

[3] Keio University Japan ndash SPIE EI 2015

[4] University of Arizona

Deep Learning supports the analyst

NVIDIA 2015

Advanced Imaging Modalities

D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone

Detection from LiDAR In ICRA 2015

CNN architecture supports

MSIHSI data cubes

SAR imagery

Volumetric data eg LIDAR

Low-TRL research topics

Open-source Imagery Exploitation

Object detection

Scene labeling

Face recognition

Image geo-location estimation

Text extraction from images

Geographic property estimation

Image de-noising [6] Stanford University NLP group

Deep Learning Dominates at Visual Perception

NVIDIA 2014

Deep Learning supports the analyst

NVIDIA 2015

x1

x2

x3

xN

Deep Learning generalizes across problems

Classification

Regression

Unsupervised learning

bull Clustering

bull Topic extraction

bull Anomaly detection

Real-valued feature vector

Varied data types

(and multi-source) Varied tasks

Sequence prediction

Control policy learning

Constants Big (high dimensional) Data + a complex function to learn

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

Unstructured

Structured

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 7: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Traditional Machine Perception ndash hand crafted features

Speaker ID

speech transcription hellip

Topic classification

machine translation

sentiment analysishellip

Raw data Feature extraction Result (Linear) Classifier

eg SVM

eg HMM

eg LSA

Deep Neural Network (DNN)

Modern reincarnation of Artificial Neural Networks

A very large collection of simple trainable mathematical units

Collectively they can learn very complex functions mapping raw

data to decisions

Loosely inspired by biological brains

ldquodogrdquo

Raw data Output

decision

Deep Learning approach

Train Dog

Cat

Honey badger

Dog

Cat

Raccoon Feature extraction

(Linear) Classifier

Deep Learning approach

Train Dog

Cat

Honey badger

Errors

Dog

Cat

Raccoon

Deep Learning approach

Train

Deploy

Dog

Cat

Honey badger

Errors

Dog

Cat

Raccoon

Dog

Deep Learning for Visual Perception

Input pixels

Output image class prediction

Biologically inspired Convolutional Neural Network (CNN)

Application components

bull Task objective

bull eg Identify face

bull eg Classify age

bull Training data

bull Typically 10K ndash

100M samples

bull Network architecture

bull Learning algorithm

Local receptive field

Visual Perception DL State of the Art

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

NORB dataset (2004)

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

5 object classes

Multiple views and illuminations

291600 training images

58230 test images [2]

lt6 classification error on test set with

cluttered backgrounds (NYU)

Deep Learning Dominates at Visual Perception

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

4

60

110

0

20

40

60

80

100

120

2010 2011 2012 2013 2014

GPU Entries

Remote Sensing Imagery Exploitation

Object detection and classification

Scene segmentation

Land usage classification

Geologic feature classification

Change detection

Crop yield prediction

Surface water estimation

Population density estimation

Super-resolution

Photogrammetry

[3] Keio University Japan ndash SPIE EI 2015

[4] University of Arizona

Deep Learning supports the analyst

NVIDIA 2015

Advanced Imaging Modalities

D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone

Detection from LiDAR In ICRA 2015

CNN architecture supports

MSIHSI data cubes

SAR imagery

Volumetric data eg LIDAR

Low-TRL research topics

Open-source Imagery Exploitation

Object detection

Scene labeling

Face recognition

Image geo-location estimation

Text extraction from images

Geographic property estimation

Image de-noising [6] Stanford University NLP group

Deep Learning Dominates at Visual Perception

NVIDIA 2014

Deep Learning supports the analyst

NVIDIA 2015

x1

x2

x3

xN

Deep Learning generalizes across problems

Classification

Regression

Unsupervised learning

bull Clustering

bull Topic extraction

bull Anomaly detection

Real-valued feature vector

Varied data types

(and multi-source) Varied tasks

Sequence prediction

Control policy learning

Constants Big (high dimensional) Data + a complex function to learn

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

Unstructured

Structured

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 8: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Deep Neural Network (DNN)

Modern reincarnation of Artificial Neural Networks

A very large collection of simple trainable mathematical units

Collectively they can learn very complex functions mapping raw

data to decisions

Loosely inspired by biological brains

ldquodogrdquo

Raw data Output

decision

Deep Learning approach

Train Dog

Cat

Honey badger

Dog

Cat

Raccoon Feature extraction

(Linear) Classifier

Deep Learning approach

Train Dog

Cat

Honey badger

Errors

Dog

Cat

Raccoon

Deep Learning approach

Train

Deploy

Dog

Cat

Honey badger

Errors

Dog

Cat

Raccoon

Dog

Deep Learning for Visual Perception

Input pixels

Output image class prediction

Biologically inspired Convolutional Neural Network (CNN)

Application components

bull Task objective

bull eg Identify face

bull eg Classify age

bull Training data

bull Typically 10K ndash

100M samples

bull Network architecture

bull Learning algorithm

Local receptive field

Visual Perception DL State of the Art

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

NORB dataset (2004)

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

5 object classes

Multiple views and illuminations

291600 training images

58230 test images [2]

lt6 classification error on test set with

cluttered backgrounds (NYU)

Deep Learning Dominates at Visual Perception

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

4

60

110

0

20

40

60

80

100

120

2010 2011 2012 2013 2014

GPU Entries

Remote Sensing Imagery Exploitation

Object detection and classification

Scene segmentation

Land usage classification

Geologic feature classification

Change detection

Crop yield prediction

Surface water estimation

Population density estimation

Super-resolution

Photogrammetry

[3] Keio University Japan ndash SPIE EI 2015

[4] University of Arizona

Deep Learning supports the analyst

NVIDIA 2015

Advanced Imaging Modalities

D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone

Detection from LiDAR In ICRA 2015

CNN architecture supports

MSIHSI data cubes

SAR imagery

Volumetric data eg LIDAR

Low-TRL research topics

Open-source Imagery Exploitation

Object detection

Scene labeling

Face recognition

Image geo-location estimation

Text extraction from images

Geographic property estimation

Image de-noising [6] Stanford University NLP group

Deep Learning Dominates at Visual Perception

NVIDIA 2014

Deep Learning supports the analyst

NVIDIA 2015

x1

x2

x3

xN

Deep Learning generalizes across problems

Classification

Regression

Unsupervised learning

bull Clustering

bull Topic extraction

bull Anomaly detection

Real-valued feature vector

Varied data types

(and multi-source) Varied tasks

Sequence prediction

Control policy learning

Constants Big (high dimensional) Data + a complex function to learn

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

Unstructured

Structured

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 9: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Deep Learning approach

Train Dog

Cat

Honey badger

Dog

Cat

Raccoon Feature extraction

(Linear) Classifier

Deep Learning approach

Train Dog

Cat

Honey badger

Errors

Dog

Cat

Raccoon

Deep Learning approach

Train

Deploy

Dog

Cat

Honey badger

Errors

Dog

Cat

Raccoon

Dog

Deep Learning for Visual Perception

Input pixels

Output image class prediction

Biologically inspired Convolutional Neural Network (CNN)

Application components

bull Task objective

bull eg Identify face

bull eg Classify age

bull Training data

bull Typically 10K ndash

100M samples

bull Network architecture

bull Learning algorithm

Local receptive field

Visual Perception DL State of the Art

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

NORB dataset (2004)

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

5 object classes

Multiple views and illuminations

291600 training images

58230 test images [2]

lt6 classification error on test set with

cluttered backgrounds (NYU)

Deep Learning Dominates at Visual Perception

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

4

60

110

0

20

40

60

80

100

120

2010 2011 2012 2013 2014

GPU Entries

Remote Sensing Imagery Exploitation

Object detection and classification

Scene segmentation

Land usage classification

Geologic feature classification

Change detection

Crop yield prediction

Surface water estimation

Population density estimation

Super-resolution

Photogrammetry

[3] Keio University Japan ndash SPIE EI 2015

[4] University of Arizona

Deep Learning supports the analyst

NVIDIA 2015

Advanced Imaging Modalities

D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone

Detection from LiDAR In ICRA 2015

CNN architecture supports

MSIHSI data cubes

SAR imagery

Volumetric data eg LIDAR

Low-TRL research topics

Open-source Imagery Exploitation

Object detection

Scene labeling

Face recognition

Image geo-location estimation

Text extraction from images

Geographic property estimation

Image de-noising [6] Stanford University NLP group

Deep Learning Dominates at Visual Perception

NVIDIA 2014

Deep Learning supports the analyst

NVIDIA 2015

x1

x2

x3

xN

Deep Learning generalizes across problems

Classification

Regression

Unsupervised learning

bull Clustering

bull Topic extraction

bull Anomaly detection

Real-valued feature vector

Varied data types

(and multi-source) Varied tasks

Sequence prediction

Control policy learning

Constants Big (high dimensional) Data + a complex function to learn

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

Unstructured

Structured

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 10: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Deep Learning approach

Train Dog

Cat

Honey badger

Errors

Dog

Cat

Raccoon

Deep Learning approach

Train

Deploy

Dog

Cat

Honey badger

Errors

Dog

Cat

Raccoon

Dog

Deep Learning for Visual Perception

Input pixels

Output image class prediction

Biologically inspired Convolutional Neural Network (CNN)

Application components

bull Task objective

bull eg Identify face

bull eg Classify age

bull Training data

bull Typically 10K ndash

100M samples

bull Network architecture

bull Learning algorithm

Local receptive field

Visual Perception DL State of the Art

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

NORB dataset (2004)

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

5 object classes

Multiple views and illuminations

291600 training images

58230 test images [2]

lt6 classification error on test set with

cluttered backgrounds (NYU)

Deep Learning Dominates at Visual Perception

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

4

60

110

0

20

40

60

80

100

120

2010 2011 2012 2013 2014

GPU Entries

Remote Sensing Imagery Exploitation

Object detection and classification

Scene segmentation

Land usage classification

Geologic feature classification

Change detection

Crop yield prediction

Surface water estimation

Population density estimation

Super-resolution

Photogrammetry

[3] Keio University Japan ndash SPIE EI 2015

[4] University of Arizona

Deep Learning supports the analyst

NVIDIA 2015

Advanced Imaging Modalities

D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone

Detection from LiDAR In ICRA 2015

CNN architecture supports

MSIHSI data cubes

SAR imagery

Volumetric data eg LIDAR

Low-TRL research topics

Open-source Imagery Exploitation

Object detection

Scene labeling

Face recognition

Image geo-location estimation

Text extraction from images

Geographic property estimation

Image de-noising [6] Stanford University NLP group

Deep Learning Dominates at Visual Perception

NVIDIA 2014

Deep Learning supports the analyst

NVIDIA 2015

x1

x2

x3

xN

Deep Learning generalizes across problems

Classification

Regression

Unsupervised learning

bull Clustering

bull Topic extraction

bull Anomaly detection

Real-valued feature vector

Varied data types

(and multi-source) Varied tasks

Sequence prediction

Control policy learning

Constants Big (high dimensional) Data + a complex function to learn

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

Unstructured

Structured

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 11: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Deep Learning approach

Train

Deploy

Dog

Cat

Honey badger

Errors

Dog

Cat

Raccoon

Dog

Deep Learning for Visual Perception

Input pixels

Output image class prediction

Biologically inspired Convolutional Neural Network (CNN)

Application components

bull Task objective

bull eg Identify face

bull eg Classify age

bull Training data

bull Typically 10K ndash

100M samples

bull Network architecture

bull Learning algorithm

Local receptive field

Visual Perception DL State of the Art

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

NORB dataset (2004)

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

5 object classes

Multiple views and illuminations

291600 training images

58230 test images [2]

lt6 classification error on test set with

cluttered backgrounds (NYU)

Deep Learning Dominates at Visual Perception

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

4

60

110

0

20

40

60

80

100

120

2010 2011 2012 2013 2014

GPU Entries

Remote Sensing Imagery Exploitation

Object detection and classification

Scene segmentation

Land usage classification

Geologic feature classification

Change detection

Crop yield prediction

Surface water estimation

Population density estimation

Super-resolution

Photogrammetry

[3] Keio University Japan ndash SPIE EI 2015

[4] University of Arizona

Deep Learning supports the analyst

NVIDIA 2015

Advanced Imaging Modalities

D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone

Detection from LiDAR In ICRA 2015

CNN architecture supports

MSIHSI data cubes

SAR imagery

Volumetric data eg LIDAR

Low-TRL research topics

Open-source Imagery Exploitation

Object detection

Scene labeling

Face recognition

Image geo-location estimation

Text extraction from images

Geographic property estimation

Image de-noising [6] Stanford University NLP group

Deep Learning Dominates at Visual Perception

NVIDIA 2014

Deep Learning supports the analyst

NVIDIA 2015

x1

x2

x3

xN

Deep Learning generalizes across problems

Classification

Regression

Unsupervised learning

bull Clustering

bull Topic extraction

bull Anomaly detection

Real-valued feature vector

Varied data types

(and multi-source) Varied tasks

Sequence prediction

Control policy learning

Constants Big (high dimensional) Data + a complex function to learn

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

Unstructured

Structured

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 12: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Deep Learning for Visual Perception

Input pixels

Output image class prediction

Biologically inspired Convolutional Neural Network (CNN)

Application components

bull Task objective

bull eg Identify face

bull eg Classify age

bull Training data

bull Typically 10K ndash

100M samples

bull Network architecture

bull Learning algorithm

Local receptive field

Visual Perception DL State of the Art

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

NORB dataset (2004)

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

5 object classes

Multiple views and illuminations

291600 training images

58230 test images [2]

lt6 classification error on test set with

cluttered backgrounds (NYU)

Deep Learning Dominates at Visual Perception

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

4

60

110

0

20

40

60

80

100

120

2010 2011 2012 2013 2014

GPU Entries

Remote Sensing Imagery Exploitation

Object detection and classification

Scene segmentation

Land usage classification

Geologic feature classification

Change detection

Crop yield prediction

Surface water estimation

Population density estimation

Super-resolution

Photogrammetry

[3] Keio University Japan ndash SPIE EI 2015

[4] University of Arizona

Deep Learning supports the analyst

NVIDIA 2015

Advanced Imaging Modalities

D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone

Detection from LiDAR In ICRA 2015

CNN architecture supports

MSIHSI data cubes

SAR imagery

Volumetric data eg LIDAR

Low-TRL research topics

Open-source Imagery Exploitation

Object detection

Scene labeling

Face recognition

Image geo-location estimation

Text extraction from images

Geographic property estimation

Image de-noising [6] Stanford University NLP group

Deep Learning Dominates at Visual Perception

NVIDIA 2014

Deep Learning supports the analyst

NVIDIA 2015

x1

x2

x3

xN

Deep Learning generalizes across problems

Classification

Regression

Unsupervised learning

bull Clustering

bull Topic extraction

bull Anomaly detection

Real-valued feature vector

Varied data types

(and multi-source) Varied tasks

Sequence prediction

Control policy learning

Constants Big (high dimensional) Data + a complex function to learn

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

Unstructured

Structured

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 13: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Visual Perception DL State of the Art

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

NORB dataset (2004)

Y LeCun

NORB Dataset (2004) 5 categories multiple views and illuminations

Training instances Test instances

291600 training samples

58320 test samples

Less than 6 error on

test set with cluttered

backgrounds

5 object classes

Multiple views and illuminations

291600 training images

58230 test images [2]

lt6 classification error on test set with

cluttered backgrounds (NYU)

Deep Learning Dominates at Visual Perception

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

4

60

110

0

20

40

60

80

100

120

2010 2011 2012 2013 2014

GPU Entries

Remote Sensing Imagery Exploitation

Object detection and classification

Scene segmentation

Land usage classification

Geologic feature classification

Change detection

Crop yield prediction

Surface water estimation

Population density estimation

Super-resolution

Photogrammetry

[3] Keio University Japan ndash SPIE EI 2015

[4] University of Arizona

Deep Learning supports the analyst

NVIDIA 2015

Advanced Imaging Modalities

D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone

Detection from LiDAR In ICRA 2015

CNN architecture supports

MSIHSI data cubes

SAR imagery

Volumetric data eg LIDAR

Low-TRL research topics

Open-source Imagery Exploitation

Object detection

Scene labeling

Face recognition

Image geo-location estimation

Text extraction from images

Geographic property estimation

Image de-noising [6] Stanford University NLP group

Deep Learning Dominates at Visual Perception

NVIDIA 2014

Deep Learning supports the analyst

NVIDIA 2015

x1

x2

x3

xN

Deep Learning generalizes across problems

Classification

Regression

Unsupervised learning

bull Clustering

bull Topic extraction

bull Anomaly detection

Real-valued feature vector

Varied data types

(and multi-source) Varied tasks

Sequence prediction

Control policy learning

Constants Big (high dimensional) Data + a complex function to learn

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

Unstructured

Structured

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 14: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Deep Learning Dominates at Visual Perception

person

car

helmet

motorcycle

bird

frog

person

dog

chair

person

hammer

flower pot

power drill

1000 object classes

12 million training images [1]

Top-5 error (Google) 48

Top-5 error (Human) 51

4

60

110

0

20

40

60

80

100

120

2010 2011 2012 2013 2014

GPU Entries

Remote Sensing Imagery Exploitation

Object detection and classification

Scene segmentation

Land usage classification

Geologic feature classification

Change detection

Crop yield prediction

Surface water estimation

Population density estimation

Super-resolution

Photogrammetry

[3] Keio University Japan ndash SPIE EI 2015

[4] University of Arizona

Deep Learning supports the analyst

NVIDIA 2015

Advanced Imaging Modalities

D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone

Detection from LiDAR In ICRA 2015

CNN architecture supports

MSIHSI data cubes

SAR imagery

Volumetric data eg LIDAR

Low-TRL research topics

Open-source Imagery Exploitation

Object detection

Scene labeling

Face recognition

Image geo-location estimation

Text extraction from images

Geographic property estimation

Image de-noising [6] Stanford University NLP group

Deep Learning Dominates at Visual Perception

NVIDIA 2014

Deep Learning supports the analyst

NVIDIA 2015

x1

x2

x3

xN

Deep Learning generalizes across problems

Classification

Regression

Unsupervised learning

bull Clustering

bull Topic extraction

bull Anomaly detection

Real-valued feature vector

Varied data types

(and multi-source) Varied tasks

Sequence prediction

Control policy learning

Constants Big (high dimensional) Data + a complex function to learn

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

Unstructured

Structured

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 15: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Remote Sensing Imagery Exploitation

Object detection and classification

Scene segmentation

Land usage classification

Geologic feature classification

Change detection

Crop yield prediction

Surface water estimation

Population density estimation

Super-resolution

Photogrammetry

[3] Keio University Japan ndash SPIE EI 2015

[4] University of Arizona

Deep Learning supports the analyst

NVIDIA 2015

Advanced Imaging Modalities

D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone

Detection from LiDAR In ICRA 2015

CNN architecture supports

MSIHSI data cubes

SAR imagery

Volumetric data eg LIDAR

Low-TRL research topics

Open-source Imagery Exploitation

Object detection

Scene labeling

Face recognition

Image geo-location estimation

Text extraction from images

Geographic property estimation

Image de-noising [6] Stanford University NLP group

Deep Learning Dominates at Visual Perception

NVIDIA 2014

Deep Learning supports the analyst

NVIDIA 2015

x1

x2

x3

xN

Deep Learning generalizes across problems

Classification

Regression

Unsupervised learning

bull Clustering

bull Topic extraction

bull Anomaly detection

Real-valued feature vector

Varied data types

(and multi-source) Varied tasks

Sequence prediction

Control policy learning

Constants Big (high dimensional) Data + a complex function to learn

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

Unstructured

Structured

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 16: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Deep Learning supports the analyst

NVIDIA 2015

Advanced Imaging Modalities

D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone

Detection from LiDAR In ICRA 2015

CNN architecture supports

MSIHSI data cubes

SAR imagery

Volumetric data eg LIDAR

Low-TRL research topics

Open-source Imagery Exploitation

Object detection

Scene labeling

Face recognition

Image geo-location estimation

Text extraction from images

Geographic property estimation

Image de-noising [6] Stanford University NLP group

Deep Learning Dominates at Visual Perception

NVIDIA 2014

Deep Learning supports the analyst

NVIDIA 2015

x1

x2

x3

xN

Deep Learning generalizes across problems

Classification

Regression

Unsupervised learning

bull Clustering

bull Topic extraction

bull Anomaly detection

Real-valued feature vector

Varied data types

(and multi-source) Varied tasks

Sequence prediction

Control policy learning

Constants Big (high dimensional) Data + a complex function to learn

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

Unstructured

Structured

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 17: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Advanced Imaging Modalities

D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone

Detection from LiDAR In ICRA 2015

CNN architecture supports

MSIHSI data cubes

SAR imagery

Volumetric data eg LIDAR

Low-TRL research topics

Open-source Imagery Exploitation

Object detection

Scene labeling

Face recognition

Image geo-location estimation

Text extraction from images

Geographic property estimation

Image de-noising [6] Stanford University NLP group

Deep Learning Dominates at Visual Perception

NVIDIA 2014

Deep Learning supports the analyst

NVIDIA 2015

x1

x2

x3

xN

Deep Learning generalizes across problems

Classification

Regression

Unsupervised learning

bull Clustering

bull Topic extraction

bull Anomaly detection

Real-valued feature vector

Varied data types

(and multi-source) Varied tasks

Sequence prediction

Control policy learning

Constants Big (high dimensional) Data + a complex function to learn

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

Unstructured

Structured

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 18: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Open-source Imagery Exploitation

Object detection

Scene labeling

Face recognition

Image geo-location estimation

Text extraction from images

Geographic property estimation

Image de-noising [6] Stanford University NLP group

Deep Learning Dominates at Visual Perception

NVIDIA 2014

Deep Learning supports the analyst

NVIDIA 2015

x1

x2

x3

xN

Deep Learning generalizes across problems

Classification

Regression

Unsupervised learning

bull Clustering

bull Topic extraction

bull Anomaly detection

Real-valued feature vector

Varied data types

(and multi-source) Varied tasks

Sequence prediction

Control policy learning

Constants Big (high dimensional) Data + a complex function to learn

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

Unstructured

Structured

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 19: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Deep Learning Dominates at Visual Perception

NVIDIA 2014

Deep Learning supports the analyst

NVIDIA 2015

x1

x2

x3

xN

Deep Learning generalizes across problems

Classification

Regression

Unsupervised learning

bull Clustering

bull Topic extraction

bull Anomaly detection

Real-valued feature vector

Varied data types

(and multi-source) Varied tasks

Sequence prediction

Control policy learning

Constants Big (high dimensional) Data + a complex function to learn

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

Unstructured

Structured

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 20: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Deep Learning supports the analyst

NVIDIA 2015

x1

x2

x3

xN

Deep Learning generalizes across problems

Classification

Regression

Unsupervised learning

bull Clustering

bull Topic extraction

bull Anomaly detection

Real-valued feature vector

Varied data types

(and multi-source) Varied tasks

Sequence prediction

Control policy learning

Constants Big (high dimensional) Data + a complex function to learn

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

Unstructured

Structured

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 21: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

x1

x2

x3

xN

Deep Learning generalizes across problems

Classification

Regression

Unsupervised learning

bull Clustering

bull Topic extraction

bull Anomaly detection

Real-valued feature vector

Varied data types

(and multi-source) Varied tasks

Sequence prediction

Control policy learning

Constants Big (high dimensional) Data + a complex function to learn

NUMBERS

IMAGES

SOUNDS

VIDEOS

TEXT

Unstructured

Structured

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 22: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Geospatial Analytics

12 years of San

Francisco crime

reports

Given date time and

location DL model

predicts crime

Top-5 error 59

~4 hours work

(including training)

using open source

tools [10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 23: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Geospatial activity data

Deep Neural

Networks (DNNs)

naturally ingest

structured data

Modern networks

can learn complex

predictive patterns

including temporal

sequences

Real-time destination prediction for taxis using DNN

Montreal Institute for Learning Algorithms (MILA) 2015

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 24: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

SensorPlatform Control

Data sequence Planning +

Control

policy

Δ(predicted future reward actual reward)

Applications

Sensor tasking

Autonomous

vehicle navigation

Reinforcement learning

[11] Google DeepMind in Nature

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 25: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Why is Deep learning hot now

Big Data Availability New DL Techniques GPU acceleration

350 millions images uploaded per day

25 Petabytes of customer data hourly

100 hours of video uploaded every minute

Three Driving Factorshellip

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 26: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Why are GPUs good for deep learning

GPUs deliver -- - same or better prediction accuracy - faster results - smaller footprint - lower power - lower cost

Neural

Networks GPUs

Inherently

Parallel Matrix

Operations

FLOPS

Bandwidth

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 27: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Deep learning with COTS HPC systems

A Coates B Huval T Wang D Wu A Ng B Catanzaro

ICML 2013

GOOGLE DATACENTER

1000 CPU Servers 2000 CPUs bull 16000 cores

600 kWatts

$5000000

STANFORD AI LAB

3 GPU-Accelerated Servers 12 GPUs bull 18432 cores

4 kWatts

$33000

Now You Can Build Googlersquos

$1M Artificial Brain on the Cheap

ldquo ldquo

GPUs make deep learning accessible

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 28: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Deep Learning deployment options

Training

data Classifier

Enterprise

desktop

(virtual or local)

Stream

processor

Embedded

mobile systems

HPC Data Center or Cloud Train

Deploy

Long training (hours

to days)

batch updates

leverage GPU

acceleration

lt100ms response

for new data sample

model interactivity

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 29: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Deep Learning is a GEOINT force multiplier

Managing Big Data

Real-time near-human level perception at web-scale

Integrates into analytical workflows

Semantic content based filtering and search

Drives data exploration and visualization

Models improve based on analyst feedback

Scales across problems

Models improve with more varied data

Models from one dataset can be leveraged in new problems

Compact models can be easily shared and deployed

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 30: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Summary

GPU accelerated Deep Learning is

Revolutionizing machine perception accuracy

Adaptable to many varied GEOINT workflows and

deployments scenarios

Scalable ndash thrives on complex raw data

Available to apply in production and RampD today

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 31: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

THANK YOU

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article

Page 32: APPLICATIONS OF DEEP LEARNING TO GEOINTon-demand.gputechconf.com/gtc/2015/webinar/deep-learning-geoint.pdf · APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect

Resources

Popular DL frameworks Caffe (UC Berkeley)

Theano (U Montreal)

Torch

DIGITS

Examples from talk [1] Imagenet Large Scale Visual Recognition Challenge

[2] NORB dataset

[3] Keio University Japan - Aerial image segmentation

[4] University of Arizona - Geographic feature detection [5] D Maturana and S Scherer 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR In ICRA 2015

[6] [8] Stanford NLP group Deep Learning research

[9] Kaggle Taxi Trajectory Prediction Competition

[10] Kaggle San Francisco Crime Classification Competition

[11] Google DeepMind Nature article