big data workshop
TRANSCRIPT
© 2009 IBM Corporation
Big Data Workshop
20th November 2011
© 2011 IBM Corporation
© 2011 IBM Corporation
Agenda Theory sessions
Timings : 9.00 AM to 11.00 AM
1What is Big Data? – Arun Balasubramanyan
2Apache Hadoop Platform. – Anbumunee Ponniah
3HDFS Architecture & Commands. – Suresh Annavarapu
4 MapReduce Framework. – Arvind Rengarajan
5 Big Data Toolset. – Vibhaw P Rajan
TII
Day 1 – 2nd February, 2012
© 2011 IBM Corporation
Timings : 11.10 AM TO 12.30 PM
1Big Data and RDBMS – Vibhaw P Rajan
2Big Data Problems - Amit K Raja
3Case Studies - Arvind Rengarajan
4 Impact on everyday problems - Dinesh P Pandy
5 Big Data Visualization – Arun Balasubramanyan
Agenda Brainstorming
Day 1 – 2nd February, 2012
© 2011 IBM Corporation
Timings : 1.30 PM to 4.30 PM
1HDFS Tour and Commands – Dinesh P Pandy
2Search index using mapreduce – Vibhaw P Rajan
3Analyze twitter data – Dinesh P Pandy
4 Analyze a Weather dataset. – Arun Balasubramanyan
Agenda Lab Sessions
Day 1 – 2nd February, 2012
© 2011 IBM Corporation
Timings : 09.00 AM to 11.00 AM
AgendaGuest Address
Day 2 – 3rd February, 2012
© 2011 IBM Corporation
Timings : 12.30 PM TO 5.30 PM
AgendaContest
Day 2 – 3rd February, 2012
© 2011 IBM Corporation
Analyze Twitter data
Mine data from tweets– Extract data from twitter – Prepare Dataset – Process the data. – Get insights.
© 2011 IBM Corporation
Analyze Weather dataset
Large Weather Dataset.– Prepare dataset for analysis. – Use Hadoop Streaming.– Use Pig/Hive for analysis.