Next Generation Analytics & Big Data (A Reference Model for Big Data) Jangwon Gim Sungjoon Lim Hanmin Jung ISO/IEC JTC1 SC32 Ad-hoc meeting May 29, 2013, Gyeongju Korea 32N2386



Next Generation Analytics & Big Data

(A Reference Model for Big Data)

Jangwon Gim, Sungjoon Lim, Hanmin Jung

ISO/IEC JTC1 SC32 Ad-hoc meeting
May 29, 2013, Gyeongju, Korea

32N2386


Contents

• Background
• Brief history of discussions
• Case study
• Procedure for developing standardizations for Big Data
• Reference model for Big Data
• Conclusions


Discussion of Big Data

Data analytics
Data analysis
Baba: Vocabulary, use cases, and so on

• Stabilize architecture
• Define interfaces
• Standardization opportunities

Jim: The aspect of Big Data is that “there are many different forms”
Krishna: Refers to the Wikipedia definition
Keith Gorden: Volume, Complex, Velocity
Keith W. Hare: Open Big Data

Volume, Variety, Velocity, Value, Veracity

Any combination is OK.


Background

Emerging technologies for Big Data
Gartner's hype cycle, 2012

Diverse definitions of technologies and services, having different views of data


Background

Big Data on hype cycle

A general and common reference model for Big Data is needed


Brief history of discussions


Issue date and summary:

• 16 November 2011: [SC32N2181] ISO/IEC JTC 1/SC 32 N2181, “Resolutions and topics from the recent JTC 1 meeting of particular interest to SC 32 participants”, SC32 Chair: Jim Melton

• 12 January 2012: [SC32N2198] ISO/IEC JTC 1/SC 32 N 2198, “Analysis of 2012 Gartner Technology Trends”, JTC1 SWG-P, Convener: Mario Wendt
    • SC 6 Telecommunications and information exchange between systems
    • SC 32 Data management and interchange
    • SC 39 Sustainability for and by Information Technology

• 19 March 2012: [SC32N2199] ISO/IEC JTC 1/SC 32 N 2199, “Discussion: SC 32 Response to 2011 JTC 1 Resolution 33”, SC32 Chair: Jim Melton

• 6 June 2012: [SC32N2241] Ad-hoc on “Next gen analytics”, Chair: Keith Hare


The view of Next-Generation Analytics of SC32

Referenced from [SC32N2241]

Need a reference model for Big Data to enhance interoperability


Next-Generation Analytics

Social Analytics

From Baba

Architectural

Mechanisms

Metadata

Raw Storage


Case Study (1)

Korea Institute of Science and Technology Information (KISTI)
Dept. of Computer Intelligence Research


Case Study (2)

Architecture of InSciTe Adaptive Service


Case Study (3)

Semantic Analysis: Text Data to Ontology


Case Study (4)

Semantic Analysis: Ontology Schema


Case Study (5)

Semantic Analysis: Example of Semantic Analysis


Case Study (6)

InSciTe Service Functions – (Hybrid Vehicle)


• Technology Navigation
• Technology Trend
• Core Element Technology
• Convergence Technology
• Agent Level
• Agent Partner
• Integrated Roadmap
• Report


Case Study (7)

In 2013, about 10 billion triples will be extracted from diverse sites


Site                                          Number of triples
Freebase                                          1,015,762,951
Yago                                                224,949,079
DBPedia                                             449,383,705
DBLP                                                 81,986,947
baseKB                                              147,549,529
Etc (WhoisWho,NYTimes,LinkedObervedData,…)        2,296,838,760
Total                                             4,216,470,971
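The total in the table can be checked with a few lines of Python. This is a minimal sketch: the counts are copied from the slide, while the names `TRIPLE_COUNTS`, `STATED_TOTAL`, `TARGET_2013`, and `summarize` are hypothetical. It also assumes that the table reports triples collected so far and treats "about 10 billion" as exactly 10,000,000,000 for illustration.

```python
# Triple counts per site, copied from the slide's table.
TRIPLE_COUNTS = {
    "Freebase": 1_015_762_951,
    "Yago": 224_949_079,
    "DBPedia": 449_383_705,
    "DBLP": 81_986_947,
    "baseKB": 147_549_529,
    "Etc": 2_296_838_760,
}

STATED_TOTAL = 4_216_470_971   # "Total" row of the table
TARGET_2013 = 10_000_000_000   # ASSUMPTION: "about 10 billion" taken as exact


def summarize(counts, target):
    """Return (total, per-site share of the total, fraction of the target)."""
    total = sum(counts.values())
    shares = {site: n / total for site, n in counts.items()}
    return total, shares, total / target


total, shares, progress = summarize(TRIPLE_COUNTS, TARGET_2013)

print(f"computed total: {total:,}")
print(f"matches stated total: {total == STATED_TOTAL}")
print(f"fraction of assumed 2013 target: {progress:.1%}")
for site, share in sorted(shares.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{site:<10} {share:6.1%}")
```

Under these assumptions the rows add up to the stated total, and the "Etc" group is the largest single contributor.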


Case Study (8)

In 2013, System Architecture of InSciTe Adaptive Service


Procedure for developing a reference model for Big Data

4. Deriving use-cases for applying the Big Data

3. Defining a concept model / a reference model / a framework for Big Data

2. Establishing visions and strategies for achieving the goal of Big Data

1. Eliciting requirements and analyzing the environment of Big Data


We are here


A lifecycle of Big Data

[Diagram: a cycle around "Big Data" with four numbered stages (1. 2. 3. 4.) and the labels Data, Insight, Action, Decision]

• Collection/Identification
• Repository/Registry
• Semantic Intellectualization
• Integration

• Data Curation
• Data Scientist
• Data Engineer

• Workflow
• Data Quality

• Analytics / Prediction

• Visualization


Reference Model for Big Data

A Reference Model for Big Data


Data Layer

Platform Layer

Data Semantic Intellectualization

Data Integration

Data Quality Management

Big Data Management

Data Curation

Service Layer

Analysis & Prediction

Security

Data Visualization

Service Support Layer

Workflow Management

Interface

Data Collection

Data Identification (Data Mining & Metadata Extraction)

Data Registry

Data Repository

Interface

Interface
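The layered structure can be sketched as a small data model. The transcript does not preserve which component sits in which layer, nor the vertical order of the layers, so the grouping, the ordering, and the treatment of "Big Data Management" and "Security" as cross-cutting are all assumptions made for illustration. The names `Layer`, `REFERENCE_MODEL`, `CROSS_CUTTING`, `layer_of`, and `interfaces` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Layer:
    name: str
    components: tuple[str, ...]


# ASSUMPTION: the component-to-layer grouping and the layer order below are
# guesses; the slide's flattened text does not make them explicit.
REFERENCE_MODEL = (
    Layer("Data Layer", (
        "Data Collection",
        "Data Identification",
        "Data Registry",
        "Data Repository",
    )),
    Layer("Platform Layer", (
        "Data Semantic Intellectualization",
        "Data Integration",
        "Data Quality Management",
        "Data Curation",
    )),
    Layer("Service Layer", (
        "Analysis & Prediction",
        "Data Visualization",
    )),
    Layer("Service Support Layer", (
        "Workflow Management",
    )),
)

# ASSUMPTION: treated as spanning all layers rather than belonging to one.
CROSS_CUTTING = ("Big Data Management", "Security")


def layer_of(component):
    """Return the name of the layer holding `component`, or None."""
    for layer in REFERENCE_MODEL:
        if component in layer.components:
            return layer.name
    return None


def interfaces(model):
    """Return one (lower, upper) pair per boundary between adjacent layers."""
    return [(a.name, b.name) for a, b in zip(model, model[1:])]


for lower, upper in interfaces(REFERENCE_MODEL):
    print(f"Interface: {lower} <-> {upper}")
```

With four layers this yields three boundaries, which is at least consistent with the three "Interface" labels on the slide.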


Reference Model for Big Data

A Reference Model for Big Data

(The same layered model as on the previous slide, annotated with the numbers below, which appear to be SC32 standard numbers.)

9075, 13249, 11179, 19763, ???


Conclusions

Summary
• Analyzing the circumstances of Big Data
• Building a framework for Big Data
• Defining a detailed procedure to create the Big Data

Discussion
Possible suggestions
• New Working Group for the reference model of Big Data; New Work Items could be derived from the model
• New Study Group

Future work
Discussion of the concept of NWI
• 2013. 11. Interim meetings
Propose an extended reference model of Big Data (NWI)
• 2014. 5. Plenary meeting
