the expanding digital universe
TRANSCRIPT
Copyright 2007 IDC. Reproduction is forbidden unless authorized. All rights reserved.
The Expanding Digital Universe:Can we Contain it?The Expanding Digital Universe:Can we Contain it?
July 16, 2007
Dave ReinselVice President, Storage and Semiconductor Research
Oct-07© 2007 IDC
AgendaAgenda
The Expanding Digital UniverseThe StudyThe ResultsThe Implications
2
Oct-07© 2007 IDC
Research ObjectiveResearch Objective
Measure the total amount of digital information created in the worldForecast the the digital universe by region – NA, WE, AP, ROW – to 2010Analyze the character of the information in the digital universe based on information source and typeMeasure and forecast the available storage capacity through 2010Derive implications for society, enterprises, and governmentsDraw conclusions for enterprise IT organizations and vendors
3
Oct-07© 2007 IDC
Key FindingsKey Findings
The digital universe will grow six-fold, from 161 exabytes in 2006 to 988 exabytes in 2010
Digital^
4
Oct-07© 2007 IDC
Key FindingsKey Findings
The digital universe will grow six-fold, from 161 exabytes in 2006 to 988 exabytes in 2010
Digital^
Roughly 3.6 TB/s
About 25 GB/person in the world
If we put all of this information in typical books, we could have 12 separate stacks that each could reach the sun
In 2010?
31 TB/s
A stack of books that could extend from the sun to pluto….and back
When do we surpass a Yottabyte? 5
Oct-07© 2007 IDC
Regional InformationRegional Information
Information Creation, Capture, & Replication
NA41%
WE32%
AP22%
ROW5%
2006161 Exabytes
30%
40%
50%
60%
70%
NA WE AP ROW
Regional Share Regional Growth Per Year, 2005-2010
Oct-07© 2007 IDC
Behind The Explosion: The InternetBehind The Explosion: The Internet
0
200
400
600
800
1,000
1,200
1,400
1,600
1,800
1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010
Total Users (M)
Users with Broadband (M)
7
Oct-07© 2007 IDC
Behind The Explosion: eMailBehind The Explosion: eMail
0
5
10
15
20
25
30
1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010
Person-to-Person eMails Spam and Alerts
Trillions WW per Year
8
Oct-07© 2007 IDC
Behind The Explosion: ImagesBehind The Explosion: Images
Images Captured by Digital Cameras and Camera Phones WW
0 B
100 B
200 B
300 B
400 B
500 B
600 B
2002 2003 2004 2005 2006 2007 2008 2009 2010
Digital Still Cameras
Camera Phones
9
Oct-07© 2007 IDC
Behind the Explosion: DTVBehind the Explosion: DTV
And it’s not ‘just’ digital TV – it’s also HIGH DEFAnd it’s not ‘just’ digital TV – it’s also HIGH DEFHigh Definition:
• Increases storage requirements
• 1 hour of HD video is roughly 10GBs of storage (depends on encoding rates)
• Requires more bandwidth and special equipment
• But is it worth it….?
High Definition:
• Increases storage requirements
• 1 hour of HD video is roughly 10GBs of storage (depends on encoding rates)
• Requires more bandwidth and special equipment
• But is it worth it….?10
Oct-07© 2007 IDC
Behind the Explosion: DTVBehind the Explosion: DTV
HIGH DEFHIGH DEF
0 M
50 M
100 M
150 M
200 M
250 M
300 M
350 M
2005 2006 2007 2008 2009 2010
Installed base of LCDs
Std. DEFStd. DEF
11
Oct-07© 2007 IDC
Key FindingsKey Findings
The digital universe will grow six-fold, from 161 exabytes in 2006 to 988 exabytes in 2010> 90% of the information in the digital universe is unstructured
Digital^
12
Oct-07© 2007 IDC
What Are We Keeping:Changing Enterprise Data ProfileWhat Are We Keeping:Changing Enterprise Data Profile
Structured, transaction-oriented data accounts for declining portion of storage needs
Explosion in file-based, unstructured data is changing storage needs in the data center
0.0
2.0
4.0
6.0
8.0
10.0
12.0
14.0
2006 2007 2008 2009 2010 2011Unstructured data
Traditional Structured data
Consumption of EnterpriseDisk Capacity by Type CAGR
32.3%
63.7%
(EB)
Secondary storage is the new primary storage
Compliance and security drive new round of file server consolidation
Long term archiving becomes a critical issue
13
Oct-07© 2007 IDC
The Danger of Data IrrelevanceThe Danger of Data Irrelevance
A Caveat to Metcalfe’s Law?In general, the value of a network increases as the number of active nodes increaseIDC research consistently reveals that datacenter storage requirements are increasing 50–60% per year for the next several yearsGoogle has leveraged this network quite successfully!
Can too much of a good thing turn sour?Telephone and telemarketers (caller ID)Email and spam (filters and blockers)Data growth and data irrelevance
Goal: Find the RIGHT data in the Quickest amount of time.
©2007 IDC 14
14
Oct-07© 2007 IDC
Key FindingsKey Findings
The digital universe will grow six-fold, from 161 exabytes in 2006 to 988 exabytes in 2010> 90% of the information in the digital universe is unstructuredBy 2010 ~ 70% of the digital universe will be created by individuals
Digital^
15
Oct-07© 2007 IDC
Implications for the ConsumerImplications for the Consumer
Consumers often have the same
needs as corporate, but have
no expertise to manage their many
digital devices
16
Oct-07© 2007 IDC
Can We Read the Data? Interpreting the 1s and 0sCan We Read the Data? Interpreting the 1s and 0s
dc1995.jpgdc3295.jpgdc2509.jpgdc2500.jpgdc5113.jpgdc9113.jpgdc0013.jpgdc1033.jpgdc3928.jpgdc8977.jpgdc5689.jpgdc2555.jpgdc8833.jpgdc4733.jpgdc3728.jpgdc0903.jpgdc9113.jpgdc1669.jpgdc3416.jpg
17
Oct-07© 2007 IDC
The Content Disruption 2.0The Content Disruption 2.0
Content and the advertising surrounding it are the major if not only sources of revenue
Reliable content access and protection are mission critical
Manage large, fast growing pools of content
1PB to >40PB and growing at >100% a year
©2007 IDC 18
The success of these online companies are
dependent on fast, reliable, always available,
secure storage!
18
Oct-07© 2007 IDC
What About Online Storage Services?What About Online Storage Services?
Is there an opportunity for storage service providers?
Storage
52%
48%
Worldwide
(n=1263)
19
Oct-07© 2007 IDC
Key FindingsKey Findings
The digital universe will grow six-fold, from 161 exabytes in 2006 to 988 exabytes in 2010> 90% of the information in the digital universe is unstructuredBy 2010 ~ 70% of the digital universe will be created by individualsOrganizations will have responsibility or liability for 85% of the Digital Universe
Digital^
20
Oct-07© 2007 IDC
Organizational Touch** Content
859 Exabytes WW
2010988 Exabytes
User* Generated
Content
692 Exabytes
WW* Consumers and Workers
Creating, Capturing, or Replicating Personal
Information
** Transported, Hosted,
Managed, or Secured
User Creation; Enterprise WorriesUser Creation; Enterprise Worries
21
Oct-07© 2007 IDC 22
Content Creation VS DuplicationContent Creation VS Duplication
75% of thedigital universe
75% of thedigital universe
25% of digital universe is created or generated
25% of digital universe is created or generated
We are ‘Pacmen’ and ‘Pacwomen’We are ‘Pacmen’ and ‘Pacwomen’• “might need it some
day” mentality
• Inexpensive storageremoves concernsabout storing
• “Digital” enables multiple copies
• Digital images, audio, video, and higher definition all increase needs
• Can’t possibly store everything, can we?
• “might need it some day” mentality
• Inexpensive storageremoves concernsabout storing
• “Digital” enables multiple copies
• Digital images, audio, video, and higher definition all increase needs
• Can’t possibly store everything, can we?
is a copyis a copya replicatea replicate
a duplicatea duplicate
a mirror imagea mirror imagea backupa backup
is redundantis redundant
Oct-07© 2007 IDC
Key FindingsKey Findings
Digital^
The digital universe will grow six-fold, from 161 exabytes in 2006 to 988 exabytes in 2010> 90% of the information in the digital universe is unstructuredBy 2010 ~ 70% of the digital universe will be created by individualsOrganizations will have responsibility or liability for 85% of the Digital UniverseInformation created and replicated (the DU) will exceed the available storage capacity this year
Can we Containerize
it….ALL?23
Oct-07© 2007 IDC
Information And StorageInformation And Storage
0
100,000
200,000
300,000
400,000
500,000
600,000
700,000
800,000
900,000
1,000,000
2005 2006 2007 2008 2009 2010
Pet
abyt
es
InformationAvailable Storage
Transient information, unused streams, temporary data
24
Oct-07© 2007 IDC
“Available” Storage Worldwide“Available” Storage Worldwide
Hard Disk Drives
Optical
Tape
Flash/Memory
0
100
200
300
400
500
600
2005 2006 2007 2008 2009 2010
Exabytes
Inform
ation
Crea
tion
Bottom Line, we will store what we want to store (probably more)…but we won’t save every BIT
The key is to ensure that the data that is saved remains
valuable?
25
Oct-07© 2007 IDC
WW HDD Shipments in Terms of CapacityWW HDD Shipments in Terms of Capacity
0EB
50EB
100EB
150EB
200EB
250EB
2006 2007 2008 2009 2010 2011
Personal StorageConsumer ElectronicsPCsData Center
436MWW Unit Ship 483M 529M 579M 623M 674M ~9% 2006-2011 CAGR
26
Oct-07© 2007 IDC
Key Takeaways for the Digital UniverseKey Takeaways for the Digital Universe
The Digital Universe is Growing Around 55% AnnuallyWe will save as much as we want to save, but won’t save everythingVast majority is unstructured and duplicate informationMajority of content creation driven by consumersResults in Creators and Custodians
The Creators will be increasingly enabled and mobile
The Custodians will be increasingly challengedWhat to keep and How to keep it (media life, efficiency, power)How to keep data relevant (metadata, response times, availability, the ‘right’ data)How to keep data safe from datajacking (security, encryption)
27