cloud foundry for data science

19
1 © Copyright 2014 Pivotal. All rights reserved. Cloud Foundry for Data Science Ian Huston, Data Scientist

Upload: ian-huston

Post on 15-Jul-2015

1.020 views

Category:

Technology


2 download

TRANSCRIPT

Page 1: Cloud Foundry for Data Science

1 © Copyright 2015 Pivotal. All rights reserved.

@ianhuston 1 © Copyright 2014 Pivotal. All rights reserved. 1 © Copyright 2015 Pivotal. All rights reserved. 1 © Copyright 2015 Pivotal. All rights reserved. 1 © Copyright 2014 Pivotal. All rights reserved.

Cloud Foundry for Data Science

Ian Huston, Data Scientist

Page 2: Cloud Foundry for Data Science

2 © Copyright 2015 Pivotal. All rights reserved.

@ianhuston

Who am I?

�  Ian Huston

� @ianhuston

� github.com/ihuston

� www.ianhuston.net

� Talk resources: http://tinyurl.com/cf4ds-talk

� Data Scientist

� Previously a theoretical physicist using Python for numerical simulations & HPC

� Use Python/R/SQL/… for predictive analytics and machine learning

Page 3: Cloud Foundry for Data Science

3 © Copyright 2015 Pivotal. All rights reserved.

@ianhuston

What is Cloud Foundry?

http://cloudfoundry.org

Open Source Cloud Platform

Simple App Deployment, Scaling & Availability

No Cloud Provider Lock In

Page 4: Cloud Foundry for Data Science

4 © Copyright 2015 Pivotal. All rights reserved.

@ianhuston

How can CF help data scientists? �  Jamie is a data scientist who has just finished some

analysis. They want to put up a simple internal web app with Javascript visualisations connected to internal data stores.

� Sam is a data engineer who wants to set up a REST API to expose a production machine learning model as a service.

� Alex is a data scientist who has an existing RShiny or Python app that they want to make available with multiple instances.

Page 5: Cloud Foundry for Data Science

5 © Copyright 2015 Pivotal. All rights reserved.

@ianhuston

Cloud Foundry is a Platform

You bring the apps, the rest is taken care of!

Source: Albert Barron (IBM), https://www.linkedin.com/pulse/20140730172610-9679881-pizza-as-a-service

Page 6: Cloud Foundry for Data Science

6 © Copyright 2015 Pivotal. All rights reserved.

@ianhuston

Cloud Foundry Foundation: Industry Standard

Gold

Silver

Page 7: Cloud Foundry for Data Science

7 © Copyright 2015 Pivotal. All rights reserved.

@ianhuston

CF for data scientists & developers Easily deploy your web app

cf  push  myapp  Scale up and out quickly

cf  scale  myapp  –i  5  –m  1G  Create and bind services

cf  bind-­‐service  myapp  redis  

 

Page 8: Cloud Foundry for Data Science

8 © Copyright 2015 Pivotal. All rights reserved.

@ianhuston

Simple Python Flask App Demo � Simple one page “Hello World” web app

� Video: https://www.youtube.com/watch?v=QOfD6tnoAB8

� Demonstrates: –  Installation of requirements –  Scaling properties

� Need to Provide: –  App files –  Dependencies listed in requirements.txt file –  Optional manifest.yml file with configuration for deployment

Page 9: Cloud Foundry for Data Science

9 © Copyright 2015 Pivotal. All rights reserved.

@ianhuston

Official Language Support on Cloud Foundry

Go

Ruby

Page 10: Cloud Foundry for Data Science

10 © Copyright 2015 Pivotal. All rights reserved.

@ianhuston

Buildpacks – How to Customise CF

Use Heroku-like buildpacks to add additional runtimes:

� Community Buildpacks for Clojure, Haskell, .NET, Erlang,…

� R Buildpack –  https://github.com/alexkago/cf-buildpack-r –  Works with RShiny apps

� PyData Buildpack –  https://github.com/ihuston/python-conda-buildpack –  Uses conda for hard to install dependencies

Page 11: Cloud Foundry for Data Science

11 © Copyright 2015 Pivotal. All rights reserved.

@ianhuston

Comparison with Docker

runtime layer

OS image

application layer

Docker

system brings fixed host OS Kernel

* Devs may bring a custom buildpack

runtime layer*

OS image

application layer

CF Buildpack

system brings fixed host OS Kernel

App container

System Provides

Dev Provides

Page 12: Cloud Foundry for Data Science

12 © Copyright 2015 Pivotal. All rights reserved.

@ianhuston

Services now available These are available on Pivotal CF (Pivotal’s packaged Cloud Foundry offering).

See http://run.pivotal.io for the services available on Pivotal Web Services.

Page 13: Cloud Foundry for Data Science

13 © Copyright 2015 Pivotal. All rights reserved.

@ianhuston

Data Services

� Easy to bind and scale services –  Databases, NoSQL, message queues etc. $  cf  create-­‐service  rediscloud  PLAN_NAME  INSTANCE_NAME  $  cf  bind-­‐service  APP_NAME  INSTANCE_NAME  

� Details provided in VCAP_APP_SERVICES env variable: {  "rediscloud":  [          {  "name":  "rediscloud-­‐42",  "label":  "rediscloud",  "plan":  "20mb",              "credentials":  {  "port":  "6379",    

 "hostname":  "pub-­‐redis-­‐6379.us-­‐east-­‐1-­‐2.3.ec2.redislabs.com",                  "password":  "your_redis_password”    }    }  ]  }  

Page 14: Cloud Foundry for Data Science

14 © Copyright 2015 Pivotal. All rights reserved.

@ianhuston

What can you build with CF?

Some ideas:

�  Interactive data exploration apps

� APIs for a machine learning model

�  Internet of Things data collection and modelling tools

� Microservices architecture for data analysis

�  Interface to a natural language system like IBM’s Watson

Page 15: Cloud Foundry for Data Science

15 © Copyright 2015 Pivotal. All rights reserved.

@ianhuston

Examples

� Simple HTML + JS app

� Uses data in S3

� Scale out when heavy load expected with auto load balancing

http://ds-demo-transport.cfapps.io

Page 16: Cloud Foundry for Data Science

16 © Copyright 2015 Pivotal. All rights reserved.

@ianhuston

Examples

� RShiny app

�  Interactive controls with very little additional coding.

� Runs RandomForest model in background

https://ak-insurance-demo.cfapps.io:4443/

Page 17: Cloud Foundry for Data Science

17 © Copyright 2015 Pivotal. All rights reserved.

@ianhuston

Send me your examples!

Show off your data science related Cloud Foundry apps:

Twitter: @dsoncf http://dsoncf.com

http://DataScienceOnCloudFoundry.com

Page 18: Cloud Foundry for Data Science

18 © Copyright 2015 Pivotal. All rights reserved.

@ianhuston

Try Cloud Foundry today!

Pivotal Web Services: http://run.pivotal.io IBM Bluemix: http://bluemix.net Anynines: http://anynines.com

HP Helion: http://horizon.hpcloud.com

Page 19: Cloud Foundry for Data Science

19 © Copyright 2015 Pivotal. All rights reserved.

@ianhuston

More info and resources

� Material for this talk: http://tinyurl.com/cf4ds-talk

� Cloud Foundry: http://cloudfoundry.org

� Python Flask Mega Meta Tutorial for DS: http://tinyurl.com/flaskDS

� Twelve Factor Apps: http://12factor.net/

� Meetups: http://cloud-foundry.meetup.com/