  • Permission to reprint or distribute any content from this presentation requires the prior written approval of S&P Capital IQ. Not for distribution to the public.

    Copyright 2012 Standard & Poor's Financial Services LLC, a subsidiary of The McGraw-Hill Companies, Inc. All rights reserved.

    Big Data: Wall Street Style

    Jeff Sternberg and Jen Zeralli, S&P Capital IQ

    February 29, 2012

  • Boring Financial Chart

  • Boring Financial Chart: less boring with labels

    As of 2/24/2012.

  • Boring Financial Chart = kind of interesting, actually

    More than $2.35 trillion invested in Information Technology over the last 10 years.

    Source: S&P Capital IQ Transaction Screening. As of 2/24/2012.

  • How Does That Compare?

    Total Investment over the last 10 years:

    Industrials = $3.49 trillion

    Energy = $2.61 trillion

    Healthcare = $2.47 trillion

    Information Technology = $2.35 trillion

    Telecom = $2.13 trillion

    Source: S&P Capital IQ Transaction Screening. As of 2/24/2012.

  • So Is Big Data Big Money?

  • Big Money?

    Total Investment over the last three years:

    Information Technology = $774.4 billion

    Big Data = $32.4 billion

    So, 4.2%

    Hey, at least we're not just the 1%

    Source: S&P Capital IQ Transaction Screening. As of 2/24/2012.
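
    The 4.2% figure follows directly from the two totals on the slide. A minimal sketch of the arithmetic, with variable and function names chosen here purely for illustration:

```python
# Totals from the slide (S&P Capital IQ Transaction Screening, as of 2/24/2012),
# both in billions of US dollars over the last three years.
it_investment_bn = 774.4
big_data_investment_bn = 32.4


def share_percent(part, whole):
    """Return part as a percentage of whole."""
    return 100.0 * part / whole


big_data_share = share_percent(big_data_investment_bn, it_investment_bn)
print(f"Big Data share of IT investment: {big_data_share:.1f}%")
```

    Rounded to one decimal place, the share comes out at the 4.2% quoted on the slide.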

  • But What We Really Wanted To Talk About

    Strata: Making Data Work

    February 29, 2012

  • But What We Really Wanted To Talk About

    S&P Capital IQ: Data Is Our Product

    About Data Collection

    Standardization

    Linking: The Curious, Special Case of Entities

    Suggesting Data

    Projections

  • S&P Capital IQ: Data Is Our Product

  • Data Is Our Product

    Capital IQ started as an investment bank in 1999*

    Data = competitive advantage over other banks

    Built a database of financial investments, relationships and transactions

    *Acquired by Standard and Poor's in 2004, now part of S&P Capital IQ.

  • Hey, Let's Sell That!

    For illustrative purposes only. Source: S&P Capital IQ as of 2/2012.

  • Data Is Our Product: What We Offer

    Datasets: Financials and Valuation, Qualitative Data, Global Market Data, Sell-Side Research, Earnings Estimates, News and Events, Fixed Income, Alpha and Risk Models

    Use Cases: Research Companies, Generate Ideas, Build Models, Monitor Markets, Analyze Performance, Quantitative Research

    Tools: Web Portal, Real-Time, Workstation, ClariFi, Mobile, Data Feeds, Web Services, Office Plug-Ins

  • Data Is Our Product: Who We Help

    Investment Bankers

    Asset Managers

    Private Equity Firms

    Venture Capital Firms

    Credit/Equity Analysts

    Corporations

    Consultants and Advisors

    Academia & Government

  • Data Is Our Product: Some Stats

    Company and Person Profiles

    Companies with full quantitative data: 100,000

    Private company profiles: 2.7 million

    Professionals and board members: 4.2 million

    Quantitative data points per company: 5,000

    Qualitative data points per company: 1,500

    Transactions

    M&A transactions: 425,000

    Private placements: 190,000

    Public offerings: 138,000

    News and Key Developments

    Daily news articles across 184 countries: 16,000

    Key Developments (curated news): 9.7 million

    As of 2/2/2012.

  • Data Is Our Product

    DEMO

  • About Data Collection

  • About Data Collection

    To Have A Data Product, One Must First Acquire Data.

  • About Data Collection

    Data Collection Goals

    Coverage

    Quality

    Timeliness

    Auditability

  • About Data Collection

    It starts with documents: 67,000 per day

    Sources

    Company filings (SEC)

    News feeds (press releases)

    Web crawling

    We store these in our document repository

  • About Data Collection

    Document repository

    SQL for metadata

    Regular file storage for docs

    Solr/Lucene indexing for fast search

    99.3 million documents

    240.3 million versions (files)

    As of 2/24/2012.
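
    The three-part layout on this slide (SQL for metadata, plain files for document content, a search index on top) can be sketched in a few lines. This is only an illustration under stated assumptions, not the S&P Capital IQ implementation: the class, table, and column names are hypothetical, SQLite stands in for the unnamed SQL database, and a toy in-memory inverted index stands in for Solr/Lucene.

```python
import sqlite3
import tempfile
from collections import defaultdict
from pathlib import Path


class DocumentRepository:
    """Toy repository: SQL metadata, one file per version, inverted index for search."""

    def __init__(self, root):
        self.root = Path(root)
        self.root.mkdir(parents=True, exist_ok=True)
        # Metadata lives in SQL (hypothetical schema; SQLite used for the sketch).
        self.db = sqlite3.connect(":memory:")
        self.db.executescript(
            """
            CREATE TABLE documents (id INTEGER PRIMARY KEY, source TEXT, title TEXT);
            CREATE TABLE versions (
                id INTEGER PRIMARY KEY,
                document_id INTEGER REFERENCES documents(id),
                version_no INTEGER,
                path TEXT
            );
            """
        )
        # Stand-in for a Solr/Lucene index: term -> set of document ids.
        self.index = defaultdict(set)

    def add_document(self, source, title):
        cur = self.db.execute(
            "INSERT INTO documents (source, title) VALUES (?, ?)", (source, title)
        )
        return cur.lastrowid

    def add_version(self, document_id, text):
        (count,) = self.db.execute(
            "SELECT COUNT(*) FROM versions WHERE document_id = ?", (document_id,)
        ).fetchone()
        version_no = count + 1
        # Content goes to regular file storage; SQL only records where it is.
        path = self.root / f"doc{document_id}_v{version_no}.txt"
        path.write_text(text, encoding="utf-8")
        self.db.execute(
            "INSERT INTO versions (document_id, version_no, path) VALUES (?, ?, ?)",
            (document_id, version_no, str(path)),
        )
        for term in text.lower().split():
            self.index[term].add(document_id)
        return version_no

    def search(self, term):
        return sorted(self.index.get(term.lower(), set()))

    def counts(self):
        (docs,) = self.db.execute("SELECT COUNT(*) FROM documents").fetchone()
        (versions,) = self.db.execute("SELECT COUNT(*) FROM versions").fetchone()
        return docs, versions


with tempfile.TemporaryDirectory() as tmp:
    repo = DocumentRepository(tmp)
    filing = repo.add_document("SEC filing", "Example annual report")
    release = repo.add_document("press release", "Example earnings release")
    repo.add_version(filing, "annual report draft")
    repo.add_version(filing, "annual report amended")
    repo.add_version(release, "quarterly earnings announced")
    hits = repo.search("annual")
    doc_count, version_count = repo.counts()
    print(hits, doc_count, version_count)
```

    Keeping documents and versions in separate tables mirrors the two counts on the slide: 240.3 million versions across 99.3 million documents is roughly 2.4 versions (files) per document.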

  • About Data Collection

    D