pourquoi le big data open source ?

19

Le GTLL face au déﬁ du déluge des données Stefane Fermigier, Data Tuesday, fév. 2013

Upload: stefane-fermigier

Post on 05-Dec-2014

709 views

Category:

Technology

1 download

Report

Download

Embed Size (px):

DESCRIPTION

TRANSCRIPT

Page 1: Pourquoi le big data open source ?

Le GTLL face au défidu déluge des données

Stefane Fermigier, Data Tuesday, fév. 2013

Page 2: Pourquoi le big data open source ?

There is a tsunami of data that is crashing onto the beaches of the civilized world. This is a tidal wave of unrelated, growing data formed in bits

and bytes, coming in an unorganized, uncontrolled, incoherent cacophony of foam. It's filled with flotsam and jetsam. It's filled with the sticks and bones and shells of inanimate and

animate life. None of it is easily related, none of it comes with any organizational methodology.

Richard Saul Wurman, in “Information Architects” (1996)

Page 3: Pourquoi le big data open source ?

Pourquoi le big dataopen source ?

Page 4: Pourquoi le big data open source ?

Facteurs économiques

Source: Michael Driscoll

Page 5: Pourquoi le big data open source ?

Facteur technique

Page 6: Pourquoi le big data open source ?

Page 7: Pourquoi le big data open source ?

Pourquoi le big dataopen source ?

• Expertise historique en scalabilité horizontale (cf. Beowulf, Google, etc.)

• Majors de l’internet (cf. Google, Yahoo!, Facebook, Twitter) imprégnés de culture open source, et dont le business model tourne autour de l’accumulation des données

• Efficience de l’open source comme modèle d’innovation ouverte, de développement et de diffusion de l’innovation

Page 8: Pourquoi le big data open source ?

Page 9: Pourquoi le big data open source ?

Page 10: Pourquoi le big data open source ?

Mission du GT

“Développer l’écosystème du Libreen Ile-de-France”

Page 11: Pourquoi le big data open source ?

65 PME/ETI

17 Grands Groupes

28 Etablissementsde Recherche et Formation

Page 12: Pourquoi le big data open source ?

Distributed / Cloud Embedded

Roadmap technologique

Page 13: Pourquoi le big data open source ?

Distributed / Cloud Embedded

Dev. Tools Middleware Big / Open Data

Roadmap technologique

Page 14: Pourquoi le big data open source ?

Web 2.0 / 3.0 Enterprise Apps

Distributed / Cloud Embedded

Dev. Tools Middleware Big / Open Data

Roadmap technologique

Page 15: Pourquoi le big data open source ?

Projets: 33Effort: 140 M€Aide: 52 M€

R&D collaborative depuis 5 ans

Page 16: Pourquoi le big data open source ?

3 “grands défis”

• Qualité logicielle

• “After PC”

• Déluge des données

Page 17: Pourquoi le big data open source ?

Focus sur le Big Data

Stockage (NoSQL, NewSQL)

Traitement (MapReduce, etc.)

Indexation

Collecte & injection

Infra & sys. management

Data Viz

Page 18: Pourquoi le big data open source ?

Page 19: Pourquoi le big data open source ?

Plus d’infos

Livre blanc disponiblesur www.fermigier.com

Site Web:www.gt-logiciel-libre.org

http://www.fermigier.com

http://www.fermigier.com

http://www.gt-logiciel-libre.org

http://www.gt-logiciel-libre.org

Big Data Open Source Software Projects Documentation

Big Draw on Tour Source Book

Open Source Data Science Elaborando uma plataforma de Big ...€¦ · Open Source Data Science Elaborando uma plataforma de Big Data & Analytics 100% Open Source com apoio do Pentaho

Pourquoi choisir un CMS Open Source ?

Open Source Security Tools for Big Data

Apache Tajo - An open source big data warehouse

POURQUOI ? POURQUOI ?

Scaling to Infinity - Open Source meets Big Data

Atelier I19 Pourquoi le big data et le monitoring vont changer nos vies de touriste connecté

Big Data : SQL, NoSQL ? Pourquoi faire un choix ?

Big Geo Data: Open Source and Open Standards

Créer une communauté open source: pourquoi ? comment ?

Qu’est ce qu’un lien externe et pourquoi c’est important ... · 2 - Pourquoi le lien est important aux yeux de Google Le lien externe est… source d’une meilleure position

Open-Source Big Data Analytics in · PDF fileOpen-Source Big Data Analytics in Healthcare ... PostgresQL; Vocabulary tables with loading ... scalable to ‘big data’

Open Source Platforms for Big Data Analytics

HPCC Systems - Open source, Big Data Processing & Analytics

Big data as a source for official statistics

Bacterial Source Tracking - Big Cypress Creekbcc.tamu.edu/media/6134/bigcypress_stakeholder_26aug10_final.pdf · Bacterial Source Tracking. Big Cypress Creek. Bacteria Assessment

Hack Harvard 2012: Open Source is Big Business

24 June 2015: Open-source big data insight

Un approccio Innovativo e Open Source ai Big Dataforges.forumpa.it › assets › Speeches › 20602 › co_17_tripodi_massi… · Un approccio Innovativo e Open Source ai Big Data

Pourquoi Drupal est le Meilleur CMS/WEM Entreprise Open Source

Big Data Open Source com Hadoop

Big Data Open Source Software and Projects Introduction

A Statistical Framework for Analysing Big Data · business case for using a particular Big Data source, if the Big Data source cannot provide valid statistical inferences, and if

Architecture Big Data open source S.M.A.C.K

Big Data Infrastructure for source integration and

Un RTI Open Source, pourquoi et comment? · Un RTI Open Source, pourquoi et comment?. In : Séminaires ISCLP 2009 logiciels . libres et technologies Open source, Toulouse, CEAT, 18-19

Hitech Fasteners, the big source for small fasteners

UNCLASSIFIED//FOR OFFICIAL USE ONLY UNCLASS/FOUO …info.publicintelligence.net/INSCOM-BigData.pdf · AllAll-All-Source Source Analytics for ‘Big Data’Analytics for ‘Big Data

Billrun - open source billing designed for big data

Leveraging open source for big data stack

Solving BIG problems with Open Source: Condor

Big Data & Why Open Source Protection · Big Data & Open Source Protection Why & How. ... • 50% of organizations leveraging Hadoop/HBase 3 Big Data and Open Source Database adoption

Formation Bâtiment Durable - environnement.brussels · Source: formation PMP II. Pourquoi et comment isoler ? Isolation des parois translucides . 51 Source: fiche ENE 06 II. Pourquoi