Big Data
Yasin Zamani
PhD Candidate of Artificial Intelligence
Image Processing Lab (Prof. S. Kasaei)
Sharif University of Technology
"The number of transistors incorporated in a chip will approximately double every 24 months."
--Gordon Moore, Intel co-founder
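The doubling rule in the quote can be written as a short calculation. This is a minimal sketch, assuming an idealized, exact doubling every 24 months; the function name and the starting count are hypothetical and chosen only for illustration.

```python
def projected_transistors(initial_count, months, doubling_period=24.0):
    """Project a transistor count, assuming one doubling per `doubling_period` months."""
    return initial_count * 2 ** (months / doubling_period)


# Ten years is 120 months, i.e. five doublings of 24 months each.
growth_factor = projected_transistors(1.0, 120)
print(growth_factor)  # 32.0
```

Under this assumption, a decade corresponds to a 32-fold increase, which is why storage and processing capacity are usually discussed on an exponential scale.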
Excel’s Limit

Excel 2003
Feature          Maximum limit
Open workbooks   Limited by available memory and system resources
Worksheet size   65,536 rows by 256 columns
Column width     255 characters
Row height       409 points

Excel 2007
Feature          Maximum limit
Open workbooks   Limited by available memory and system resources
Worksheet size   1,048,576 rows by 16,384 columns
Column width     255 characters
Row height       409 points
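To make the worksheet limits above concrete, the following sketch checks whether a dataset of a given shape would fit in a single worksheet. The limits are the ones quoted in the slide; the function name and the example dataset size are hypothetical.

```python
# Worksheet limits (rows, columns) as quoted in the slide.
EXCEL_LIMITS = {
    "Excel 2003": (65_536, 256),
    "Excel 2007": (1_048_576, 16_384),
}


def fits_in_worksheet(rows, columns, version):
    """Return True if a rows x columns table fits in one worksheet of `version`."""
    max_rows, max_columns = EXCEL_LIMITS[version]
    return rows <= max_rows and columns <= max_columns


# Hypothetical dataset: 500,000 rows and 40 columns.
print(fits_in_worksheet(500_000, 40, "Excel 2003"))  # False
print(fits_in_worksheet(500_000, 40, "Excel 2007"))  # True
```

Even the larger limit is reached quickly: a dataset with a few million rows no longer fits in one worksheet of either version.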
• 6,000 tweets per second
• 500 million tweets per day
• 200 billion tweets per year
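The three rates above are consistent with each other up to rounding, as a quick calculation shows. This is a sketch that assumes a constant rate of 6,000 tweets per second and a 365-day year.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds
DAYS_PER_YEAR = 365             # assumption: non-leap year

tweets_per_second = 6_000
tweets_per_day = tweets_per_second * SECONDS_PER_DAY
tweets_per_year = tweets_per_day * DAYS_PER_YEAR

print(f"{tweets_per_day:,} tweets per day")    # 518,400,000 tweets per day
print(f"{tweets_per_year:,} tweets per year")  # 189,216,000,000 tweets per year
```

The computed values round to the slide's figures of about 500 million per day and about 200 billion per year.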
Amazon
IMDB
Predictive Marketing
Predicts major life events
Looks at consumer behavior
Uses demographic info
Can purchase more data
Sources
Cell phones connecting to towers
Satellite radio and GPS connecting
RFID readings
Readings from medical devices
Possible Errors
Incomplete or corrupted data
Duplicate records
Typographical errors
Data that is missing context
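Two of the error types listed above, incomplete records and duplicate records, can be detected mechanically. The following is a minimal sketch, assuming each record is a flat dictionary with hashable values; the function name, field names, and sample records are hypothetical and not taken from the slides.

```python
def find_problem_records(records, required_fields):
    """Return (incomplete, duplicates) as lists of record indices."""
    incomplete, duplicates = [], []
    seen = set()
    for index, record in enumerate(records):
        # Incomplete: a required field is missing or empty.
        if any(record.get(field) in (None, "") for field in required_fields):
            incomplete.append(index)
        # Duplicate: identical field values to an earlier record.
        key = tuple(sorted(record.items()))
        if key in seen:
            duplicates.append(index)
        else:
            seen.add(key)
    return incomplete, duplicates


# Hypothetical sample data for illustration only.
records = [
    {"id": 1, "city": "Tehran"},
    {"id": 2, "city": ""},
    {"id": 1, "city": "Tehran"},
]
print(find_problem_records(records, ["id", "city"]))  # ([1], [2])
```

Typographical errors and missing context are harder to catch this way and usually need domain-specific rules or human review.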
Humans as Visual Animals
Computers excel at predictive models.
Computers excel at data mining.
Humans perceive and interpret better.
Human vision still plays an important role.
What Humans Do Well
Identifying visual patterns
Identifying anomalies
Seeing patterns across groups
Interpreting content of images