data management and analytics update - sas groups/sunz...rdbms ingestion •data prep...
TRANSCRIPT
Copyright © SAS Inst itute Inc. A l l r ights reserved.
Data Management and Analytics updateWessel de Meyer
Copyright © SAS Inst itute Inc. A l l r ights reserved.
Extracting Business Value from the data
Copyright © SAS Inst itute Inc. A l l r ights reserved.
Data Management
Copyright © SAS Inst itute Inc. A l l r ights reserved.
The challenge to Timely business insight
BUSINESS PROBLEM
BUSINESS DECISION
20%80%Preparing to
solve the problemSolving
the problem
Copyright © SAS Inst itute Inc. A l l r ights reserved.
SAS Data Management – A Unified Data Platform
MQ
XML
Cloud
CONSUMERS
SOURCES
DATA INTEGRATION
DATA ACCESS
EVENT STREAM PROCESSING
DATA VIRTUALIZATION
DATA QUALITY
MASTER DATA MGMT
SAS® DATA MANAGEMENT
DATA GOVERNANCE
RDBMS
Ingestion • Data Prep • Self-Service • Hadoop • Right-Time DI • New Data • Integrated Platform
Copyright © SAS Inst itute Inc. A l l r ights reserved.
Hadoop Ecosystem
Copyright © SAS Inst itute Inc. A l l r ights reserved.
SAS Data Loader for Hadoop
Simple UI designed for Self-Service Data
Preparation
Market-Leading Data Integration and Data
Quality
Purpose-Built to run in parallel on Hadoop
Load Data In-Memory for Visualisation and
Advanced Analytics
Copyright © SAS Inst itute Inc. A l l r ights reserved.
20%80%Preparing to
solve the problem
Solving the
problem
BUSINESS
PROBLEM
BUSINESS
DECISIONPreparing to
solve the problem
Solving the
problemInnovate
30%20% 50%
Changing the Equation
Copyright © SAS Inst itute Inc. A l l r ights reserved.
A New Data Management Paradigm
ActivatedData
Automated
Cross-OrganizationAlways-On
Ad Hoc Data
Data Scientist or Analyst
IndividualSelf-Service
in-SAS/CAS · in-Hadoop · in-Memory · in-Database · in-Cloud · in-Stream
Code Generation, Optimization, Process Distribution, & Monitoring
Governance
Governed Data
DataStewards
Cross-DepartmentAdministered Use
Monitored Data
IT Staff
Cross-TeamMeasured Use
Managed Data
ETL Developeror Coder
Team/DepartmentRepeated Use
in-SAS/CAS · in-Hadoop · in-Memory · in-Database · in-Cloud · in-Stream
Code Generation, Optimization, Process Distribution, & Monitoring
Governance
Copyright © SAS Inst itute Inc. A l l r ights reserved.
Planned offerings – SAS Viya
Proposed Target Dates (SAS Viya Schedule Dependent) 2017 2018 2018 2018+
SAS Viya Offering“Data Preparation” “Data Discovery” “Data Integration” “Data Management”
Capabilities
Foundation (data prep, data access/load/merge, projects)
Data Quality (DQ, profile, enrichment, more)
Intelligence (suggestions, data discovery, more)
Stewardship (workflow, remediation, biz terms, more)
Integration (process flow design, transforms, more)
Enterprise (MDM, reporting, policies, more)
Copyright © SAS Inst itute Inc. A l l r ights reserved.
Copyright © SAS Inst itute Inc. A l l r ights reserved.
Analytics
Copyright © SAS Inst itute Inc. A l l r ights reserved.
SAS® Viya Design Principles for Analytics Product Suite
IntegratedDiscovery &
AnalyticsModernOpen & Scalable
Architecture
InteractiveVisual &
Collaborative ConnectedOutput predictions
& Operationalize
Models
Copyright © SAS Inst itute Inc. A l l r ights reserved.
SAS® Viya™ Based Product offerings
Interfaces
SAS Studio
SAS Visual Analytics
Visualapproach
Programmaticapproach
SAS Visual Analytics
SAS Visual Statistics
SAS Visual Data Mining and
Machine Learning
CAS actions, PROCS related to VA capabilities
Visual Analytics (VA) Interface
CAS actions, PROCS related to VS capabilities
Visual Statistics (VS) add-on to VA Interface
CAS actions, PROCS related to VDMML
capabilities
Visual Data Mining and Machine Learning
(VDMML) add-on to VA Interface
Copyright © SAS Inst itute Inc. A l l r ights reserved.
SAS Visual Analytics
Copyright © SAS Inst itute Inc. A l l r ights reserved.
SAS® Visual Analytics Roadmap
SAS 9 SAS VIYA
Copyright © SAS Inst itute Inc. A l l r ights reserved.
Visual Analytics 8.1 Key Themes
Usability
Single HTML5 Interface
Change Visualization Type
Undo/Redo with History
Auto Controls
Self-Service Data Prep
Data Profiling
Column Transformations
Color Coded Visual Joining
CAS Action Oriented
Interface Diversity
Coding via SAS Studio, Python, Lua
or Java
Browser Support: Chrome, Firefox,
Safari, IE
Custom Applications via
REST API
Mobile Access (iOS, Android &
Windows)
Enterprise Readiness
Self Healing
New Data Types
Effective Memory Management
Enhanced Cloud Support
Copyright © SAS Inst itute Inc. A l l r ights reserved.
VA 8.1 Demo – 2-3 minutes
Copyright © SAS Inst itute Inc. A l l r ights reserved.
SAS Visual Statistics
Copyright © SAS Inst itute Inc. A l l r ights reserved.
Copyright © SAS Inst itute Inc. A l l r ights reserved.
SAS® Visual Statistics 8.1Key Features
• Modeling Techniques • Linear Regression
• Logistic Regression
• GLM Regression
• Clustering (k-means)
• Decision Tree
• Common Features• Training-validation partitioning
• Variable Importance / Profile
• Model Assessment
• Score Code
• Model comparison
• Derivation of predictive outputs
• Ability to export model statistics into Excel
Copyright © SAS Inst itute Inc. A l l r ights reserved.
SAS Visual Data Mining and Machine Learning
Copyright © SAS Inst itute Inc. A l l r ights reserved.
SAS® Visual Data Mining and Machine Learning 8.1New Machine Learning Visualizations in Visual Analytics
• Machine Learning Techniques• Factorization Machine
• Random Forest
• Gradient Boosting
• Neural Network
• Support Vector Machine
• Common Features• Training-validation
• Auto-tuning
• Model Assessment
• Score Code or Analytic Store
• Model comparison
• Ability to export model statistics into Excel
Copyright © SAS Inst itute Inc. A l l r ights reserved.
• Logistic Regression• Linear Regression• Generalized Linear Models • Nonlinear Regression • Ordinary Least Squares Regression • Decision Trees• Partial Least Squares Regression• Quantile Regression• K-means and K-modes Clustering• Principal Component Analysis • Random Forest• Gradient Boosting• Neural Networks• Support Vector Machines• Factorization Machines• Network Analytics/Community Detection• Text Mining• Boolean Rules• Auto-tuned Hyper-parameters
Data
DeploymentDiscovery
• Assess Supervised Models• Analytic Item Store
• In-Memory Data Step• Transpose• DS2• SQL• Variable Binning • Variable Cardinality Analysis• Sampling and Partitioning• Missing Value Imputation• Variable Selection
SAS VISUAL DATA MINING AND MACHINE LEARNING
Key:
• Visual Statistics• Visual Data Mining Machine Learning
Copyright © SAS Inst itute Inc. A l l r ights reserved.
SAS® Studio Programmatic Interface
Copyright © SAS Inst itute Inc. A l l r ights reserved.
Future Roadmap - Viya
H1 2017 H2 2017 2018SAS Viya 3.2
• SAS Visual Analytics 8.1• SAS Visual Statistics 8.1• SAS Visual Data Mining & Machine Learning 8.1• SAS Visual Forecasting• SAS Econometrics• SAS Optimisation• SAS Visual Investigator 10.2• SAS Visual Scenario Designer 10.2• SAS Event Stream Processing• In-Database Technologies• SAS Studio
SAS Viya 3.3
• SAS Data Preparation(new)• SAS Data Quality• SAS Visual Analytics 8.2• SAS Visual Statistics 8.2• SAS Visual Data Mining & Machine Learning 8.2• SAS Office Analytics• SAS Visual Forecasting• SAS Econometrics• SAS Optimisation• SAS Visual Text Analytics (new)• SAS Model Manager• SAS Visual Investigator 10.3• SAS Visual Scenario Designer 10.3• SAS Event Stream Processing• In-Database Technologies• SAS Studio
SAS Viya 3.4
• SAS Data Preparation• SAS Data Quality• SAS Data Discovery(new)• SAS Data Integration• SAS Visual Analytics 8.3• SAS Visual Statistics 8.3• SAS Visual Data Mining & Machine Learning 8.3• SAS Office Analytics• SAS Visual Forecasting• SAS Econometrics• SAS Optimisation• SAS Visual Text Analytics• SAS Model Manager• SAS Visual Investigator 10.4• SAS Visual Scenario Designer 10.4• SAS Event Stream Processing• In-Database Technologies• SAS Studio
Copyright © SAS Inst itute Inc. A l l r ights reserved.