What is Hadoop & Its Use Cases - PromptCloud
HADOOP: THE SIGNIFICANT BIG DATA SOFTWARE
The Apache™ Hadoop® project
• Open-source software for reliable, scalable, distributed computing
• Allows distributed processing of large data sets across clusters of computers
Designed to:
• Scale up from single servers to thousands of machines, each offering local computation and storage
• Detect and handle failures at the application layer
• Deliver a highly available service on top of a cluster of computers
The project includes these modules:
• Hadoop Common (utilities that support the other Hadoop modules)
• Hadoop Distributed File System (HDFS) for high-throughput access to application data
• Hadoop YARN for job scheduling and cluster resource management
• Hadoop MapReduce for parallel processing of large data-sets
All modules are designed on the assumption that hardware failures are common and should be handled automatically by the framework
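The MapReduce module listed above can be illustrated with a minimal single-process sketch. This is plain Python rather than the Hadoop API: the function names (map_phase, shuffle_phase, reduce_phase, word_count) and the sample input are hypothetical, and on a real cluster the map and reduce steps would run in parallel on different machines.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    # Here every line is mapped in one process; a cluster would spread
    # the lines (or blocks of lines) across many nodes.
    mapped = chain.from_iterable(map_phase(line) for line in lines)
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


# Hypothetical sample input, for illustration only.
sample_lines = ["big data big insights", "data at scale"]
counts = word_count(sample_lines)
print(counts)
```

Because each map call looks at one line only and each reduce call looks at one key only, the work can be divided into independent parts, which is what allows it to be distributed.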
RELATED PROJECTS
• Ambari™
A tool for provisioning, managing, and monitoring Apache Hadoop clusters
• Avro™
A data serialization system
• Cassandra™
A scalable multi-master database with no single points of failure
• Chukwa™
A data collection system for managing large distributed systems
• HBase™
A scalable, distributed database supporting structured data storage for large tables
• Hive™
A data warehouse infrastructure for data summarization and ad hoc querying
• Mahout™
A scalable machine learning and data mining library
• Pig™
A high-level data-flow language and execution framework for parallel computation
• Spark™
A fast and general compute engine for Hadoop data
• Tez™
A generalized data-flow programming framework providing a powerful and flexible engine to execute data processing for both batch and interactive use cases
• ZooKeeper™
A high-performance coordination service for distributed applications
Hadoop is useful because…
BIG DATA STORAGE
FAST PROCESSING
BETTER RESULTS & INSIGHTS
Hadoop is Big Data software that…
best meets industry needs
allows movement of large volumes of complex and relational data into a single repository
offers affordable storage and retrieval for analytic applications
makes raw data always available
simultaneously processes Big Data divided into multiple parts
Hadoop Uses
PUBLIC HEALTH
PRODUCT DEVELOPMENT
R&D
STOCK & COMMODITIES TRADING
SALES & MARKETING
The Hadoop Advantage…
Insights from everywhere, anywhere
• Hadoop can handle all types of data:
structured | unstructured | log files | pictures | audio files | communications records | email
• No prior need for a schema
• Lets you decide on the query later
• Makes all data usable, not just the data held in a database
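The "no prior schema, decide the query later" idea above is often called schema-on-read. A minimal sketch in plain Python, not Hadoop code, might look like this; the store contents, field names, and the read_with_schema helper are all hypothetical and exist only for illustration.

```python
import json

# Raw records are stored exactly as they arrive; nothing is validated on write.
raw_store = [
    '{"user": "alice", "action": "login", "ms": 120}',
    '{"user": "bob", "action": "purchase", "amount": 19.99}',
    "free-form log line that is not JSON at all",
]


def read_with_schema(raw_records, fields):
    """Apply a schema at read time by projecting each record onto `fields`.

    Records that are not JSON objects are skipped for this query but stay
    in storage. Fields missing from a record are returned as None.
    """
    rows = []
    for record in raw_records:
        try:
            parsed = json.loads(record)
        except json.JSONDecodeError:
            continue  # not usable for this query, still kept in raw_store
        if not isinstance(parsed, dict):
            continue
        rows.append({field: parsed.get(field) for field in fields})
    return rows


# Two different "schemas" chosen after the data was stored.
logins = read_with_schema(raw_store, ["user", "action"])
spend = read_with_schema(raw_store, ["user", "amount"])
print(logins)
print(spend)
```

The same stored records answer two different questions, and neither question had to be known when the data was written.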
Economics of everything online
• Legacy systems are far too expensive for general use with large data-sets
• Hadoop relies on internal data redundancy, which makes it viable to store data that was previously too costly to keep
• Keep data for real-time interactive querying, business intelligence, analysis and visualization
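The cost of the redundancy mentioned above can be estimated with simple arithmetic. The sketch below assumes whole-block replication with a factor of 3, which is the HDFS default for dfs.replication; the function names and the 100 TB figure are hypothetical, and the survival check is a worst-case simplification in which every failed node held one copy of the same block.

```python
def raw_capacity_needed(logical_tb, replication_factor=3):
    """Raw disk needed to hold `logical_tb` of data when every block
    is stored `replication_factor` times."""
    return logical_tb * replication_factor


def copies_surviving(replication_factor, failed_nodes):
    """Worst case: each failed node held one copy of the same block."""
    return max(replication_factor - failed_nodes, 0)


# Hypothetical example: 100 TB of logical data at the default factor of 3.
needed = raw_capacity_needed(100)
still_readable = copies_surviving(3, failed_nodes=2) > 0
print(needed, still_readable)
```

Under these assumptions the cluster needs three times the logical data size in raw disk, and a block remains readable after two of the nodes holding it fail.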
Streamline Data Usage
• Unstructured data accounts for 90% of the data
• Data storage, management and analytics must be rethought
• Legacy systems will complement Hadoop-optimized data management
• Hadoop is cost-effective and scalable, and provides a streamlined architecture
USE CASES
FOR BUSINESS ENTERPRISES
Hadoop helps
DATA PROCESSING
• extract, transform, and load (ETL) data from source systems
• transfer data between Hadoop and a database management system
• batch process large quantities of unstructured and semi-structured data
NETWORK MANAGEMENT
• capture, analyze, and display data collected from servers, storage devices, and other IT hardware
• monitor network activity and diagnose bottlenecks and other issues
RETAIL FRAUD
• monitor, model, and analyze high volumes of data from transactions
• extract features and patterns so that retailers can help prevent credit card account fraud
RECOMMENDATION TOOL
• match and recommend users to one another
• compare products and services based on analysis of user profiles and behavioral data
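Matching users to one another from profile data, as described above, can be sketched with a simple set-overlap measure. Jaccard similarity is used here as an illustrative choice and is not named in the original slides; the profiles, names, and helper functions are hypothetical.

```python
def jaccard(a, b):
    """Share of interests two users have in common (0.0 to 1.0)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Hypothetical user profiles: each user maps to a set of interests.
profiles = {
    "ann": {"hiking", "jazz", "cooking"},
    "ben": {"jazz", "cooking", "chess"},
    "cat": {"football", "chess"},
}


def best_match(user, all_profiles):
    """Return the other user whose interests overlap most with `user`."""
    others = (name for name in all_profiles if name != user)
    return max(
        others,
        key=lambda name: jaccard(all_profiles[user], all_profiles[name]),
    )


print(best_match("ann", profiles))
```

At production scale the pairwise comparisons would be divided across a cluster, but the per-pair scoring logic can stay this simple.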
SENTIMENT ANALYSIS
• apply advanced text analytics tools to unstructured social media text, such as tweets and Facebook posts
• determine user sentiment related to particular companies, brands, or products
FINANCIAL RISK MODELING
• analyze large volumes of transactional data to determine the risk and exposure of financial assets
• prepare for potential "what-if" scenarios based on simulated market behavior
• carry out due diligence tasks
• rate potential clients for risk
MARKETING CAMPAIGN ANALYSIS
• monitor and determine the effectiveness of marketing campaigns
• increase the accuracy of analysis by incorporating higher volumes of detailed data
CUSTOMER INFLUENCER ANALYSIS
• mine social networking data to map the influence customers have over others
• help enterprises identify the most important and influential customers for focused marketing
CUSTOMER EXPERIENCE ANALYSIS
• integrate data from previously siloed customer interaction channels
• understand the impact of customer interactions to optimize the customer lifecycle experience
RESEARCH & DEVELOPMENT
• comb through volumes of text-based research and historical data to support development of new products
Hadoop provides a solid foundation on which to build critical big data solutions.
As with any tool, using it the right way from the very beginning can help ensure success.