big data drupal
TRANSCRIPT
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 1/25
Big Data DrupalDEMOCRATIZING BIG DATA PROCESSES
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 2/25
Elements
Bonita
Cloudera
NutchSolr
Drupal
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 3/25
BonitaJAVA/ECLIPSE-BASED COMMERCIAL OPEN-SOURCE BUSINESPROCESS AUTOMATION & MODELLING
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 4/25
Bonita StudioDesign business process models
Human or Service Tasks
Human Tasks have Forms
Service Tasks have Connectors
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 5/25
BonitaExperienceWeb-based admin & workflow
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 6/25
Bonita Forms
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 7/25
Shell Script Task
sudo -u hdfs hadoop jar/opt/nutch/basil-apache-nutc1.6/build/apache-nutch-1.6.jo
org.apache.nutch.crawl.Crawl/user/nutch/demo-crawl/urls${dir} -depth ${depth} -topN 1threads 50
Runs Nutch job for Hadoop
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 8/25
ClouderaBIG DATA COMMERCIAL OPEN S OURCE
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 9/25
ClouderaCloudera Manager 4 (Free Edition)
Hbase
HDFS
Hive
Hue
Impala
Mapreduce
Oozie
Zookeeper
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 10/25
Nutch JobHadoop job started by Bonita Shellconnector
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 11/25
ApacheFoundation
Nutch
Solr
Hbase
HDFS
Hive
Impala
Mapreduce
Home to many of these projects
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 12/25
NutchIndustrial strength general purposeweb-crawler
http://blog.csdn.net/hadoopstudy/article/details/15
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 13/25
Nutch
http://blog.csdn.net/hadoopstudy/article/detai
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 14/25
SolrSearch & indexing
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 15/25
DrupalPHP WEB APPLICATION FRAMEWORK
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 16/25
Aegir BOA
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 17/25
DrupalNutch & Solr modules
Integrate with search & views
Created at IAS
Sponsored by Acquia
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 18/25
Apache SolrModule
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 19/25
Apache Solr
ExamplesModule
http://drupal.org/project/apachesolr_examples
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 20/25
Nutch Mulisite
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 21/25
Drupal SearchNutch crawl
Solr indexed
Drupal search & views
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 22/25
Nutch SolrSandbox
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 23/25
Big Data DrupalDEMOCRATIZING BIG DATA PROCESSES
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 24/25
Big Data DrupalAuthor
7/30/2019 Big Data Drupal
http://slidepdf.com/reader/full/big-data-drupal 25/25
Big Data Drupal
Web www.BigDataDrupal.com
Email [email protected]
Contact Nicholas Roberts