tour anaconda enterprise without leaving your desk
Post on 10-Feb-2017
342 Views
Preview:
TRANSCRIPT
© 2016 Continuum Analytics - Confidential & Proprietary© 2016 Continuum Analytics - Confidential & Proprietary
Tour Anaconda Enterprise Without Leaving Your DeskAccelerate. Connect. Empower.
Ian Stokes-ReesComputational Scientist
December 15, 2016
© 2016 Continuum Analytics - Confidential & Proprietary 2
Join us for the inaugural AnacondaCONDiscover What #OpenDataScienceMeans
http://anacondacon17.io
Speakers from industry, government, academia
Demos, BoFs, Panels, Exhibits, Partner Showcase
© 2016 Continuum Analytics - Confidential & Proprietary 3
• Ph.D. at the University of Oxford, working on the CERN LHCb particle physics experiment
• Harvard University, working on computational techniques for protein structure determination
• Joined Continuum Analytics in 2013• Greatest interest: enabling communication,
collaboration and discovery using high performance computing infrastructure
Ian Stokes-Rees @ijstokesComputational Scientist, Continuum Analytics
© 2016 Continuum Analytics - Confidential & Proprietary 44
• Intro to Anaconda Enterprise Notebooks for effortless collaboration
• Publish analytics using Anaconda Enterprise
• Discover deep insights with interactive visualizations
• Communicate work to all levels through interactive visualizations
• Deploy data science with Anaconda and engage your team with intuitive and
relevant data science narratives
• Q&A
Agenda
5
Anaconda Distribution
© 2016 Continuum Analytics - Confidential & Proprietary 6
ANACONDA Accelerates Adoption of Open Data Science for Enterprises
• Easy to install
• Agile data exploration
• Powerful data analysis
• Simple to collaborate
• Accessible to everyone
PYTHON & R OPEN SOURCE ANALYTICSNumPy SciPy Pandas Scikit-learn Jupyter/IPython
Numba Matplotlib Spyder Numexpr Cython Theano
Scikit-image NLTK NetworkX IRKernel dplyr shiny
ggplot2 tidyr caret PySpark & 720+ packages
© 2016 Continuum Analytics - Confidential & Proprietary
221 W. 6th StreetSuite #1550Austin, TX 78701+1 512.222.5440
info@continuum.io
@ContinuumIO
© 2016 Continuum Analytics - Confidential & Proprietary
Full Featured Analytics Platform
© 2016 Continuum Analytics - Confidential & Proprietary
221 W. 6th StreetSuite #1550Austin, TX 78701+1 512.222.5440
info@continuum.io
@ContinuumIO
© 2016 Continuum Analytics - Confidential & Proprietary
Hundreds of Analytics Tools - Integrated
© 2016 Continuum Analytics - Confidential & Proprietary 9
Anaconda Distribution Promise• Individuals• Government• Commercial• Students
• Educational• Research• Application embedding• Commercial
• No time limits• No trials• No license files• No expiry
Free for everyone
Free forany useFreeforever
© 2016 Continuum Analytics - Confidential & Proprietary
221 W. 6th StreetSuite #1550Austin, TX 78701+1 512.222.5440
info@continuum.io
@ContinuumIO
© 2016 Continuum Analytics - Confidential & Proprietary
Anaconda Solves Many Analytics Problems• Deployment: Windows, Mac, Linux• Reproducibility• Extensibility and Flexibility
• Over 100,000 Conda packages available today• Multi-language: Python, R, Scala, Julia and more• Widely used:
• Hundreds of companies• Millions of users• Millions of annual downloads
• Analytics sandboxes without VMs or containers
© 2016 Continuum Analytics - Confidential & Proprietary
221 W. 6th StreetSuite #1550Austin, TX 78701+1 512.222.5440
info@continuum.io
@ContinuumIO
© 2016 Continuum Analytics - Confidential & Proprietary
Anaconda DistributionWho is it for?
• Single user• Single system• Unrestricted access to public Internet• Access to Anaconda Cloud
When do you need Anaconda Enterprise?• Multiple users• Collaboration• Compute clusters• Hadoop• On-premesis package mirror• Private package repository
12
Anaconda Enterprise
© 2016 Continuum Analytics - Confidential & Proprietary
221 W. 6th StreetSuite #1550Austin, TX 78701+1 512.222.5440
info@continuum.io
@ContinuumIO
© 2016 Continuum Analytics - Confidential & Proprietary
Anaconda Platform
© 2016 Continuum Analytics - Confidential & Proprietary 14
Data Lab: shared analytics cluster
Package Control
Internal Anaconda Repository
Authentication
Anaconda Enterprise Notebook Server
Computation
Web Interface
Active Directory/ LDAPOptional
Anaconda Enterprise Architecture
Data Scientist (Mac)
Business Analyst (Win)
DevOps Engineer (Linux)
Publish
Fetch
Productionanalytics cluster
© 2016 Continuum Analytics - Confidential & Proprietary 15
© 2016 Continuum Analytics - Confidential & Proprietary 16
Data lineage
Interactive Visualizations
Advanced notebook extensions
Anaconda Enhanced Jupyter Notebooks
17
Excel + Python + Jupyter
© 2016 Continuum Analytics - Confidential & Proprietary 18
Anaconda Fusion Excel Integration
BRING interactive visualizations, machine learning and ETL to Excel
BRIDGE Excel Data to Python & R through notebooks
ACCESS all the power of Python and Big Data, natively embedded inside Excel
Anaconda Fusion brings Open Data Science to Microsoft Excel
19
Parallel Data Processing
© 2016 Continuum Analytics - Confidential & Proprietary 20
• Parallel and Distributed Pandas and Numpy
• Low latency workflow manager• Graphical tools• Simple APIs• Extensible and generalizable to
other data structures
Dask: Parallel Data Processing
© 2016 Continuum Analytics - Confidential & Proprietary 21
Dask: Parallel Data Processing
Synthetic views of Numpy ndarrays
Synthetic views of Pandas DataFrameswith HDFS support
DAG construction and workflow manager
22
Interactive Data Vizualization Apps
© 2016 Continuum Analytics - Confidential & Proprietary 23
Interactive Data Visualization
• Interactive viz, widgets, and tools• Versatile high level graphics• Streaming, dynamic, large data• Optimized for the browser• No Javascript• With or without a server
© 2016 Continuum Analytics - Confidential & Proprietary 24
Rapid Prototyping Visual Apps
• Python interface• R interface• Smart plotting
25
Geoviews and Datashader
© 2016 Continuum Analytics - Confidential & Proprietary 26
Datashader: Rendering a Billion Points of Data• datashader provides a fast,
configurable visualization pipeline for faithfully revealing even very large datasets
• Each of these visualizations requires just a few lines of code and no magic numbers to adjust by trial and error.
© 2016 Continuum Analytics - Confidential & Proprietary 27
Datashader
28
Anaconda Accelerate
© 2016 Continuum Analytics - Confidential & Proprietary 29
GPU Acceleration in Python
Linear algebra, FFTs, sorting, random number generation
Fast algorithms for nVidia GPUs
Data profiling in Jupyter Notebooks
Works with Numba
Track what size and type of data is beingpassed through your algorithm for better optimization decision-making
Designed to be used in conjunction with the Numba Python compiler for CPUs and GPUs
© 2016 Continuum Analytics - Confidential & Proprietary
221 W. 6th StreetSuite #1550Austin, TX 78701+1 512.222.5440
info@continuum.io
@ContinuumIO
© 2016 Continuum Analytics - Confidential & Proprietary
Linear Algebra on the GPU with Anaconda Accelerate
Double precision matrix-matrix multiplicationIntel Core i7-4820K 3.70GHz CPU vs. NVIDIA Tesla K20c
© 2016 Continuum Analytics - Confidential & Proprietary
221 W. 6th StreetSuite #1550Austin, TX 78701+1 512.222.5440
info@continuum.io
@ContinuumIO
© 2016 Continuum Analytics - Confidential & Proprietary
Data Profiling
32
Data Science Workflows
© 2016 Continuum Analytics - Confidential & Proprietary
221 W. 6th StreetSuite #1550Austin, TX 78701+1 512.222.5440
info@continuum.io
@ContinuumIO
© 2016 Continuum Analytics - Confidential & Proprietary
Laptop to Cluster
© 2016 Continuum Analytics - Confidential & Proprietary
221 W. 6th StreetSuite #1550Austin, TX 78701+1 512.222.5440
info@continuum.io
@ContinuumIO
© 2016 Continuum Analytics - Confidential & Proprietary
Analytics artifact repository
© 2016 Continuum Analytics - Confidential & Proprietary
221 W. 6th StreetSuite #1550Austin, TX 78701+1 512.222.5440
info@continuum.io
@ContinuumIO
© 2016 Continuum Analytics - Confidential & Proprietary
From Data Lab to Production Analytics
© 2016 Continuum Analytics - Confidential & Proprietary 3636
Questions?
© 2016 Continuum Analytics - Confidential & Proprietary 3737
• Try it out yourselfSign up for an Anaconda Enterprise Test Drive:
know.continuum.io/Anaconda-Enterprise-Test-Drive.html
• Meet up with other thought leaders like youRegister for AnacondaCON – February 7-9, 2017: anacondacon17.io
• Learn more about the Anaconda PlatformCheck out the “Resources” tab for webinars, whitepapers and more: continuum.io/
Next Steps
© 2016 Continuum Analytics - Confidential & Proprietary© 2016 Continuum Analytics - Confidential & Proprietary
Continuum AnalyticsWe empower data science teamsto make the world a better placeWe Empower Data Science Teams to Make the World Better221 W. 6th StreetSuite #1550Austin, TX 78701+1 512.222.5400
top related