What Is Big Data and What Is Hadoop?

Post on 21-Jul-2015

Category:

Education

Big Data And Hadoop

By easydata

easydata - Online Training

• Big Data is an asset, often a complex and ambiguous one.

• Hadoop is a software framework built to store and process that asset.

• Big data refers to large data sets that businesses and other parties assemble for specific goals and operations.

• Businesses and companies collect this data over a period of time.

• This data may include customer identifiers such as name, Social Security number, age group, or location.

• It may include product information in the form of model numbers, sales figures, inventory counts, and complaint counts.

• It may include customer feedback, from angry and happy customers alike.

• All of this can be called big data.

• But all of it is raw, unsorted data.

• Hadoop is one of the tools designed to handle this raw and unsorted big data.

• Hadoop helps parse and interpret big data so that it can be queried and analyzed.

• It does this by spreading both the data and the processing work across a cluster of machines.

• Hadoop is an open-source project under the Apache License, maintained by a global community of contributors.

• To understand Hadoop, you have to understand two fundamental things:

• How Hadoop stores files, and how it processes data.
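
On the storage side, the core idea is that a large file is cut into fixed-size blocks and each block is copied to several machines. The sketch below is only an illustration of that idea, not how Hadoop itself is implemented: the function names, the node names, the tiny block size, and the round-robin placement are all assumptions made for the example. Recent Hadoop releases default to 128 MB blocks and a replication factor of 3, and real placement takes rack layout into account.

```python
def split_into_blocks(data: bytes, block_size: int) -> list[bytes]:
    """Cut a byte string into fixed-size blocks; the last block may be shorter."""
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


def place_replicas(num_blocks: int, nodes: list[str], replication: int) -> dict[int, list[str]]:
    """Assign every block to `replication` distinct nodes.

    Round-robin placement is a simplification chosen for this sketch.
    """
    if replication > len(nodes):
        raise ValueError("replication factor cannot exceed the number of nodes")
    return {
        block_id: [nodes[(block_id + r) % len(nodes)] for r in range(replication)]
        for block_id in range(num_blocks)
    }


# A 10-byte "file" with 4-byte blocks stands in for a multi-gigabyte file.
blocks = split_into_blocks(b"x" * 10, block_size=4)
placement = place_replicas(
    len(blocks),
    nodes=["node-a", "node-b", "node-c", "node-d"],  # hypothetical node names
    replication=3,
)

for block_id, holders in placement.items():
    print(block_id, len(blocks[block_id]), holders)
```

Because every block lives on more than one node, losing a single machine does not lose any part of the file.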

• Hadoop's main components include HDFS and MapReduce.

• HDFS (Hadoop Distributed File System): stores the raw, unsorted data.

• MapReduce: processes that data, or more precisely, provides a framework for processing it.

• HDFS == Storage

• MapReduce == Processing
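
To make the processing half concrete, here is a minimal single-machine sketch of the MapReduce pattern in plain Python. It does not use Hadoop's API; the function names and the sample feedback records are invented for illustration. On a real cluster, the map and reduce steps run in parallel on many machines, and the framework performs the grouping step between them.

```python
from collections import defaultdict


def map_phase(record: str):
    """Emit a (word, 1) pair for every word in one raw feedback record."""
    for token in record.lower().split():
        word = token.strip(".,!?")
        if word:
            yield word, 1


def shuffle(pairs):
    """Group emitted values by key, the step between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key: str, values: list[int]):
    """Combine all values for one key into a single count."""
    return key, sum(values)


def run_job(records):
    """Run map, shuffle, and reduce in sequence over the input records."""
    pairs = (pair for record in records for pair in map_phase(record))
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


# Hypothetical raw customer feedback, standing in for data stored in HDFS.
feedback = ["Great product, great price!", "Late delivery.", "Great support"]
counts = run_job(feedback)
print(counts)
```

The map step looks at one record at a time and the reduce step looks at one key at a time, which is what lets the work be divided among many machines.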

Thank You
