dmdw 7. student presentation - pentaho data integration (kettle)

10
ETL Himanshu Joshi Greta Alvarez Sandhya Narayan

Upload: johannes-hoppe

Post on 07-Dec-2014

2.678 views

Category:

Technology


3 download

DESCRIPTION

7. ETL Project by Himanshu Joshi, Sandhya Narayan and Greta Alvarez

TRANSCRIPT

Page 1: DMDW 7. Student Presentation - Pentaho Data Integration (Kettle)

ETL

Himanshu JoshiGreta Alvarez

Sandhya Narayan

Page 2: DMDW 7. Student Presentation - Pentaho Data Integration (Kettle)

What is ETL?

Extracting data from outside sourcesTransforming it to fit operational needsLoading it into the end target (DB)

Page 3: DMDW 7. Student Presentation - Pentaho Data Integration (Kettle)

Extracting the Excel

Errors: Nulll entry

Spelling errorsSpace at the end of entriesStart at the end of entriesDuplicity

Page 4: DMDW 7. Student Presentation - Pentaho Data Integration (Kettle)

Normalize/Transform??

GOAL:

Decompose relations with anomalies in order to produce smaller, well-structured relations. Involves dividing large tables into smaller (and less redundant) tables and defining relationships between them.

Page 5: DMDW 7. Student Presentation - Pentaho Data Integration (Kettle)

Tools tried & USED

MS SQL SERVER MY SQL

Clover ETL DesignerAdvanced ETL Processor

Pentaho DI

Page 6: DMDW 7. Student Presentation - Pentaho Data Integration (Kettle)

Pentaho Data Integration

Power Extraction, Transformation and Loading (ETL) capabilities using an innovative, metadata-driven approach. With an intuitive, graphical, drag and drop design environment, and a proven, scalable, standards-based architecture.

http://kettle.pentaho.com/

Page 7: DMDW 7. Student Presentation - Pentaho Data Integration (Kettle)

Why Pentaho Data Integration?

Open SourceETL supportedUser FriendlyEasy to use

Page 8: DMDW 7. Student Presentation - Pentaho Data Integration (Kettle)
Page 9: DMDW 7. Student Presentation - Pentaho Data Integration (Kettle)

DEMO

Page 10: DMDW 7. Student Presentation - Pentaho Data Integration (Kettle)

Don‘t try to reinvent the wheel!!

Just Use it ;)