Hadoop Architecture Options for Existing Enterprise Data Warehouse
1
Hadoop Integration Architecture Options
2
Various Options for Hadoop Integration with an Existing EDW
o Teradata Unified Data Architecture
o Existing EDW with new Hadoop cluster using Apache
o Existing EDW with new Hadoop cluster using Cloudera
o Existing EDW with new Hadoop cluster using Hortonworks
o IBM PureData
o Oracle Big Data Appliance
o EMC Greenplum
o Vertica
o SAP HANA & SAP Suite
3
Option 1: Teradata Unified Data Architecture
[Diagram labels]
Data sources: Audio, Video, Images; Text; Web & Social; Application Input; Machine Logs; ERP; CRM
Functions: Capture | Store | Refine
Platforms: Integrated Data Warehouse; Discovery Platform
Analytics and tools: Geospatial Analytics; Predictive & Real-time Analytics; Data Mining; Business Intelligence; Applications
Users: Data Scientists; Engineers; Business Analysts; Customers / Partners; Marketing; Executives; Frontline Workers; Operational Systems
Data layers: Big Data Analytics; Big Data Management; Transactional Data
4
Option 1: Teradata Unified Data Architecture (cont.)
[Diagram labels]
Layers: Data Sources | Data Hub | Presentation Layer
Data sources: Flat files; RDBMS
Data types: Structured Data; Un/Semi-Structured Data
Data hub: Integrated Data Warehouse; Discovery Platform
Reporting/Application Layer: Predictive Analytics; Reports / Dashboards; Geospatial Analytics
5
Option 2: Existing EDW with new Hadoop Clusters (Apache)
[Diagram labels]
Layers: Data Sources | Data Hub | Presentation Layer
Data sources: Flat files; RDBMS
Data types: Structured Data; Un/Semi-Structured Data
Data hub: Existing EDW (Integrated Data Warehouse); Apache Hadoop Cluster
Reporting/Application Layer: Predictive Analytics; Reports / Dashboards; Geospatial Analytics; Analytics
6
Option 3: Existing EDW with new Hadoop Clusters (Cloudera)
[Diagram labels]
Layers: Data Sources | Data Hub | Presentation Layer
Data sources: Flat files; RDBMS
Data types: Structured Data; Un/Semi-Structured Data
Data hub: Existing EDW (Integrated Data Warehouse)
Reporting/Application Layer: Predictive Analytics; Reports / Dashboards; Geospatial Analytics; Analytics
7
Option 4: Existing EDW with new Hadoop Clusters (Hortonworks)
[Diagram labels]
Layers: Data Sources | Data Hub | Presentation Layer
Data sources: Flat files; RDBMS
Data types: Structured Data; Un/Semi-Structured Data
Data hub: Existing EDW (Integrated Data Warehouse)
Reporting/Application Layer: Predictive Analytics; Reports / Dashboards; Geospatial Analytics; Analytics
8
Option 5: IBM PureData
9
Option 6: Oracle Big Data Appliance
10
Option 6: Oracle Big Data Appliance (cont.)
11
Option 7: SAP Suite for Hadoop Integration
12
[Diagram labels]
Layers: Data Sources | Data Hub | Presentation Layer
Data sources: Flat files; RDBMS
Data types: Structured Data; Un/Semi-Structured Data
Data hub: Existing EDW (Integrated Data Warehouse)
Reporting/Application Layer: Predictive Analytics; Reports / Dashboards; Geospatial Analytics; Analytics
Data flow: All data goes to Hadoop, and from Hadoop to the EDW
13
Thank You
Asis Mohanty, CBIP, CDMP
** Note: A few images are taken from Oracle, IBM & SAP.