big data refinery: distilling value for user-driven analytics
TRANSCRIPT
Twitter Tag: #briefr The Briefing Room
Reveal the essential characteristics of enterprise software, good and bad
Provide a forum for detailed analysis of today’s innovative technologies
Give vendors a chance to explain their product to savvy analysts
Allow audience members to pose serious questions... and get answers!
Mission
Twitter Tag: #briefr The Briefing Room
Refinery is the Perfect Term
Ø Data Quality is a byproduct Ø Master Data Management is an enabler Ø Data Integration is changing
Twitter Tag: #briefr The Briefing Room
Analyst: John Myers
John Myers is Managing Research Director of
Business Intelligence at Enterprise Management
Associates
Twitter Tag: #briefr The Briefing Room
Pentaho
Pentaho offers a variety of business intelligence and analytics products
Pentaho’s platform includes the Streamlined Data Refinery, which provides access to any data source, and includes data integration, governance, discovery, analysis and visualization
Pentaho’s solution is designed to be user-driven for ease of access and self-service
Twitter Tag: #briefr The Briefing Room
Guest: Chuck Yarbrough
Chuck is the Director of Big Data Product Marketing at Pentaho, a leading big data analytics company that helps organizations engineer big data connections, blend data and report and visualize all of their data. Much of Chuck's focus at Pentaho is in educating organizations on how big data can help win, serve and retain customers, lower costs and grow revenue through the proper use of big data. A life-long participant in the data game, Chuck has held leadership roles at Deloitte Consulting, SAP Business Objects, Hyperion and National Semiconductor.
Customizable Applications: Best of Both Worlds
Slide 13 © 2015 Enterprise Management Associates, Inc.
Discussion Questions
• Why not just use the data integration tools that exist within the Hadoop “stack”? For example, sqoop and flume are both provided by Hadoop
• There are differences between sandbox environments and “operationalizating” data integration for on-going operations. How can Pentaho’s blueprints make those tasks easier?
© 2015 Enterprise Management Associates, Inc. Slide 17
Discussion Questions
• “Data Refinery” brings up images of a one-way process from a crude state to a finished product – much a like crude oil being “cracked” into various products like heating oil, motor oil, gasoline and jet fuel. Does Pentaho view the “refining” of data as a one way proposition? Or a more bi/multi-directional approach?
Slide 18 © 2015 Enterprise Management Associates, Inc.
Discussion Questions
• Data governance is a wide ranging practice in the work of data management. How does Pentaho position itself within the breadth of the concept of data governance?
• Some have described “data wrangling” via the Stanford wrangle project as self-service data integration. How does Pentaho compare/contrast with wrangle’s approach?
Slide 19 © 2015 Enterprise Management Associates, Inc.
Twitter Tag: #briefr The Briefing Room
Upcoming Topics
www.insideanalysis.com
April: BIG DATA
May: CLOUD
June: INNOVATORS
Twitter Tag: #briefr The Briefing Room
THANK YOU for your
ATTENTION! Some images provided courtesy of Wikimedia Commons and "Anacortes Refinery
31911" by Walter Siegmund (talk) - Own work. Licensed under CC BY 2.5 via Wikimedia Commons - http://commons.wikimedia.org/wiki/
File:Anacortes_Refinery_31911.JPG#/media/File:Anacortes_Refinery_31911.JPG