big data overwiew, Татьяна Матвиенко/Александр Павленко, senior...

18
Big Data Overview

Upload: alina-vilk

Post on 24-Jan-2017

122 views

Category:

Education


3 download

TRANSCRIPT

Page 1: Big data overwiew, Татьяна Матвиенко/Александр Павленко, Senior Java/BigData Developer, DataArt

Big Data Overview

Page 2: Big data overwiew, Татьяна Матвиенко/Александр Павленко, Senior Java/BigData Developer, DataArt

○ What is Big Data?○ Big Data in a real life.○ Big Data problems & challenges?○ Ways to solve these problems.○ Big Data tools and technologies.

Crucial Questions

Page 3: Big data overwiew, Татьяна Матвиенко/Александр Павленко, Senior Java/BigData Developer, DataArt

What is Big Data?○ Too big to fit in

memory○ Too fast data

acquisition requirements

○ Too fast data processing

○ Too complex for traditional processing

Page 4: Big data overwiew, Татьяна Матвиенко/Александр Павленко, Senior Java/BigData Developer, DataArt

Big Data: numbers & facts701,389 Facebook logins

38,194 posts to instagram

2.4 Million search queries

2.78 Million video views

Page 5: Big data overwiew, Татьяна Матвиенко/Александр Павленко, Senior Java/BigData Developer, DataArt

Example: car fleet management○ 1M car profiles

○ Daily reports○ Track position

by request○ Keep history in

database

Page 6: Big data overwiew, Татьяна Матвиенко/Александр Павленко, Senior Java/BigData Developer, DataArt

Real-time car fleet management○ 1K cars connected in real time

○ Gather data via OBD2 scanners in real-time○ Gather data from cars’ GPS sensors in real-

time○ Store the data for future processing○ Real-time calculation to predict traffic,

engine problems, accidents

Page 7: Big data overwiew, Татьяна Матвиенко/Александр Павленко, Senior Java/BigData Developer, DataArt

What can we do with (Big)Data?○ Data ingestion & acquisition

○ Data storage (search, transfer, sharing)○ Data processing & analysis○ Data visualization

Page 8: Big data overwiew, Татьяна Матвиенко/Александр Павленко, Senior Java/BigData Developer, DataArt

Data Ingestion & Acquisition○ Extract:

RDBMS, file systems, messaging systems, sensors, log files○ Transform:

Filter, encode/decode, aggregate, validate○ Load:

Data warehouse, messaging system

Page 9: Big data overwiew, Татьяна Матвиенко/Александр Павленко, Senior Java/BigData Developer, DataArt

Data StorageBig Data storage challenges:○ Size (keep and search

huge amount of data)○ Speed (data acquisition,

data search)○ Availability (fault

tolerance, partition tolerance)

Page 10: Big data overwiew, Татьяна Матвиенко/Александр Павленко, Senior Java/BigData Developer, DataArt

○ Consistency: all nodes see the same data at the same time

○ Availability: every request gets a response (success or failure)

○ Partition tolerance: system works despite of network failures

CAP Theorem

Page 11: Big data overwiew, Татьяна Матвиенко/Александр Павленко, Senior Java/BigData Developer, DataArt

Streaming vs Batch processing

Batch Batch

Stream

Data

Page 12: Big data overwiew, Татьяна Матвиенко/Александр Павленко, Senior Java/BigData Developer, DataArt

Data processing: Lambda Architecture

Page 13: Big data overwiew, Татьяна Матвиенко/Александр Павленко, Senior Java/BigData Developer, DataArt

Data processing: Kappa Architecture

Page 14: Big data overwiew, Татьяна Матвиенко/Александр Павленко, Senior Java/BigData Developer, DataArt

Data processing: MapReduce

Page 15: Big data overwiew, Татьяна Матвиенко/Александр Павленко, Senior Java/BigData Developer, DataArt

Data Visualization

Page 16: Big data overwiew, Татьяна Матвиенко/Александр Павленко, Senior Java/BigData Developer, DataArt

Everything as a Service

Page 17: Big data overwiew, Татьяна Матвиенко/Александр Павленко, Senior Java/BigData Developer, DataArt

Example: Amazon Web Services

Page 18: Big data overwiew, Татьяна Матвиенко/Александр Павленко, Senior Java/BigData Developer, DataArt

Q & A ?○ What is Big Data?○ Big Data in a real life.○ Big Data problems & challenges?○ Ways to solve these problems.○ Big Data tools and technologies.