learn how to build a real- time analytics solution · learn how to build a real-time analytics...

24
Learn how to build a real- time analytics solution | @ZeMiguelMendes

Upload: others

Post on 22-May-2020

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

Learn how to build a real-time analytics solution

| @ZeMiguelMendes

Page 2: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

AgendaLearn how to build a real-time analytics solution

Senior Data Analytics Consultant

blogs.adatis.co.uk/josemendes/

@ZeMiguelMendes

Page 3: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

FRIDAY NIGHT, SQLBITS

Learn how to build a real-time analytics solution

PROHIBITION PARTY

Page 4: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

FEEDBACK FORMS

Learn how to build a real-time analytics solution

PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE

SESSION

Page 5: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

Agenda

Lambda

Architecture

Demo Databricks Delta

Architecture

Real-time

analytics solutions

Agenda

Demo

Page 6: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

Agenda

Lambda

Architecture

Demo Databricks Delta

Architecture

Real-time

analytics solutions

Agenda

Demo

Page 7: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

Over 2.5 quintillion bytes of data are created every single day, and it’s only going

to grow from there. By 2020, it’s estimated that 1.7MB of data will be created

every second for every person on earth

Real-time analytics solutions

Page 8: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

A growing demand for faster access to data for analysis

Retail giants making real-time price changes

Manufacturing plants using anomaly detection to determine potential problems on

the assembly line before they happen

Mine and gas companies using hi-tech drill sensor readings to know precisely what

is happening hundreds of feet below the earth's surface as it's happening

Real-time analytics solutions

Page 9: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

Agenda

Lambda

Architecture

Demo Databricks Delta

Architecture

Real-time

analytics solutions

Agenda

Demo

Page 10: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

AgendaLambda Architecture

A big data processing architecture that combines both

batch and real time processing methods

Page 11: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

Speed

Batch

Serving

Lambda Architecture

Query

Page 12: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

AgendaReal Time Analytics Architecture

*credit Microsoft

Page 13: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

AgendaApache Kafka for HDInsight (Ingest)

Apache Kafka is an open source, distributed, scalable, high-performance,

publish-subscribe message broker, originally built at LinkedIn and now part of

the Apache Software Foundation

Azure HDInsight is a Hadoop service that enables clusters of managed Hadoop

instances

Apache Kafka for HDInsight is a managed service that provides a simplified

configuration process

Page 14: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

AgendaKafka Architecture - Components

Page 15: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

AgendaAzure Data Lake Storage Gen2 (Store)

*credit Microsoft

Page 16: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

AgendaAzure Databricks (Prep & Train)

Azure Databricks is an Apache Spark-based analytics platform optimized for the

Microsoft Azure cloud services platform

*credit Microsoft

Page 17: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

AgendaAzure SQL Data Warehouse (Model & Serve)

A cloud-based Enterprise Data Warehouse (EDW) that leverages Massively

Parallel Processing (MPP) to quickly run complex queries across petabytes of

data

Uses PolyBase to query the big data stores

Once data is stored in SQL Data Warehouse, it can run analytics at massive

scale

Page 18: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

AgendaPower BI

Power BI is a collection of software services, apps, and connectors that work

together to turn your unrelated sources of data into coherent, visually

immersive, and interactive insights

*credit Microsoft

Page 19: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

Agenda

Lambda

Architecture

Demo Databricks Delta

Architecture

Real-time

analytics solutions

Agenda

Demo

Page 20: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

Agenda

Lambda

Architecture

Demo Databricks Delta

Architecture

Real-time

analytics solutions

Agenda

Demo

Page 21: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

AgendaDatabricks Delta Architecture

Databricks Delta is a transactional storage layer designed specifically to harness

the power of Apache Spark and Databricks DBFS

ACID transactions

Deletes/Updates/Upserts

Automatic file management

Statistics and data skipping

Page 22: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

*credit Databricks

Databricks Delta Architecture

Page 23: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION

Agenda

Lambda

Architecture

Demo Databricks Delta

Architecture

Real-time

analytics solutions

Agenda

Demo

Page 24: Learn how to build a real- time analytics solution · Learn how to build a real-time analytics solution PLEASE FILL OUT AND PASS TO YOUR ROOM HELPER BEFORE YOU LEAVE THE SESSION