Data Science Program by Code for Tomorrow

Data Science Program 2013.10.14 Taiwan Everyday-Life Data Science Team Training Program Chia-Kai Liu (CK)

Upload: ckliu

Post on 28-Jan-2015

DESCRIPTION

Introduction of the Data Science Program organized by Code for Tomorrow. (http://datasci.co/)

TRANSCRIPT

Page 1: Data Science Program by Code for Tomorrow

Data Science Program

2013.10.14

Taiwan Everyday-Life Data Science Team Training Program

Chia-Kai Liu (CK)

Page 2: Data Science Program by Code for Tomorrow

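Code for Tomorrow aims to promote "open development." It encourages governments, cities, and businesses to seize emerging digital opportunities and to develop data-driven services and investments that matter to people's livelihoods.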

http://codefortomorrow.org/

Page 3: Data Science Program by Code for Tomorrow

Some Activities

• Open Data Day

• Data Weekend

• Earth Day Workshop

• Open Food Repository (開放食庫)

• Good Rice (好愛米)

• Project 615

Page 4: Data Science Program by Code for Tomorrow

87%

13%

Taiwan

Other countries

Demography

Page 5: Data Science Program by Code for Tomorrow

Data Science Program

Page 6: Data Science Program by Code for Tomorrow

What the hell is Data Science?

Page 7: Data Science Program by Code for Tomorrow

The study of the space of problems that can be solved with data.

Data Science

http://youtu.be/XmxHj-bAH0A

Page 11: Data Science Program by Code for Tomorrow

Data Science Process

Page 12: Data Science Program by Code for Tomorrow

Course Plan

Unit #1

● Introduction

● Data Exploration & Visualization

Unit #2

● Statistics & Probability

Unit #3

● Data Mining

(one week break)

Unit #4

● Group Presentation & Discussions

* Each unit is 4 hours long, on a Saturday afternoon.

Page 13: Data Science Program by Code for Tomorrow

How would you measure the average commuting time of all employees in your company?

http://store.hubbardresearch.com/How-To-Measure-Anything-p/htma-signed-001.htm

Rule of Five
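
The Rule of Five, as described in Hubbard's How to Measure Anything, says that a population's median has a 93.75% chance of lying between the smallest and largest values of a random sample of five. Note that it is a statement about the median, not the mean asked about on the slide. Below is a minimal Python sketch of the arithmetic plus a simulation check. The commuting times are randomly generated placeholders rather than real data, and the function names are illustrative.

```python
import random
import statistics


def rule_of_five_probability(sample_size: int = 5) -> float:
    """Chance that the population median lies between a random sample's min and max."""
    # Each draw lands above the median with probability 1/2, so the median is
    # missed only when every draw falls on the same side (all above or all below).
    return 1 - 2 * 0.5 ** sample_size


def simulate(population, trials=20_000, sample_size=5, seed=0):
    """Estimate the same probability by repeatedly sampling from `population`."""
    rng = random.Random(seed)
    median = statistics.median(population)
    hits = 0
    for _ in range(trials):
        sample = rng.sample(population, sample_size)
        if min(sample) <= median <= max(sample):
            hits += 1
    return hits / trials


# Hypothetical commuting times in minutes (made-up, skewed distribution).
_rng = random.Random(42)
commute_minutes = [_rng.lognormvariate(3.4, 0.5) for _ in range(1001)]

print(rule_of_five_probability())   # exact value from the formula
print(simulate(commute_minutes))    # simulated estimate, close to the formula
```

In practice this means asking five randomly chosen employees already gives a usable range for the median commute, provided the five are picked at random.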

Page 14: Data Science Program by Code for Tomorrow

Potential Topics

Page 17: Data Science Program by Code for Tomorrow

http://sfgov.maps.arcgis.com/apps/OnePane/basicviewer/index.html?appid=26c723bc512948c6bf9103fb73e83ffe

Pedestrian injury and fatality map, San Francisco, USA

5% of streets

55% of severe-injury and fatality cases
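
The two figures on this slide read as a concentration statistic: a small share of streets carries a large share of the severe cases. A rough sketch of how such a share could be computed from per-street counts follows. The counts are invented so that the result mirrors the slide's ratio; they are not San Francisco data, and `top_share` is a hypothetical helper.

```python
def top_share(counts, top_percent):
    """Share of all incidents that occur on the worst `top_percent` percent of streets."""
    ranked = sorted(counts, reverse=True)
    # Round the number of streets up, and always include at least one street.
    k = max(1, (len(ranked) * top_percent + 99) // 100)
    total = sum(ranked)
    return sum(ranked[:k]) / total if total else 0.0


# Hypothetical severe-incident counts for 20 streets (made-up numbers).
incidents_per_street = [11] + [1] * 9 + [0] * 10

print(top_share(incidents_per_street, 5))  # share on the top 5% of streets
```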

Page 18: Data Science Program by Code for Tomorrow

http://accidentgis.appspot.com/start

Traffic accident map, Hsinchu City, Taiwan

Page 19: Data Science Program by Code for Tomorrow

http://ariofsevit.com/hubway/maps.php

City bike-share data analysis, Boston, USA

Page 20: Data Science Program by Code for Tomorrow

http://goo.gl/57QmuA

Designated urban renewal areas, Taipei City, Taiwan

Page 21: Data Science Program by Code for Tomorrow

International Development

http://data.worldbank.org/

Page 22: Data Science Program by Code for Tomorrow

Some Examples

Page 23: Data Science Program by Code for Tomorrow

Good Rice (好愛米)

Page 25: Data Science Program by Code for Tomorrow

http://youtu.be/sWB_A9tEd8w

Page 26: Data Science Program by Code for Tomorrow

http://youtu.be/sWB_A9tEd8w

Page 28: Data Science Program by Code for Tomorrow

Earth Day Workshop 2013.04.27

http://goo.gl/jqaaG

Page 29: Data Science Program by Code for Tomorrow

Earth Day Workshop

Ronny, CK, Dongpo

AllenKNY

Page 30: Data Science Program by Code for Tomorrow

Agricultural product traceability records

Cadastral data

Soil pollution data

Other data

Database

Data

Consumers

Page 31: Data Science Program by Code for Tomorrow

Agricultural product traceability records

http://data.coa.gov.tw/

Page 32: Data Science Program by Code for Tomorrow

Cadastral data

http://easymap.land.moi.gov.tw/K02Web/K02Land.jsp

Page 33: Data Science Program by Code for Tomorrow

Soil pollution data

http://gis.epa.gov.tw/LayerListn.aspx

Page 34: Data Science Program by Code for Tomorrow

Income data

Page 35: Data Science Program by Code for Tomorrow

Key Contributors: - Dongpo Deng - Ronny Wang - CK

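The Good Rice project (好愛米計畫) uses geographic information systems to cross-analyze open government data, including soil heavy-metal contamination surveys, agricultural product traceability records, and village-level income. Consumers choosing packaged rice can thus not only eat with peace of mind but also do some good by supporting the economies of disadvantaged areas. The project also built an open database of soil heavy-metal contamination for later comparison against other crops or for other uses.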

http://goo.gl/f3nZ0

Good Rice Project
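
The cross-analysis described on this slide combines three datasets: where a rice product was grown, whether that area has recorded soil contamination, and how high local incomes are. The sketch below stands in for that idea with a plain dictionary join instead of a GIS. Every record, village name, field name, and the ranking rule (uncontaminated areas first filtered in, then lowest income first) is a made-up assumption, not the project's actual data or method.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RiceProduct:
    name: str
    village: str  # growing location taken from the traceability record


# Hypothetical inputs standing in for the three open datasets.
products = [
    RiceProduct("Rice A", "Village 1"),
    RiceProduct("Rice B", "Village 2"),
    RiceProduct("Rice C", "Village 3"),
]
soil_polluted = {"Village 1": False, "Village 2": True, "Village 3": False}
village_income = {"Village 1": 520_000, "Village 2": 480_000, "Village 3": 450_000}


def recommend(products, soil_polluted, village_income):
    """Keep products from villages with no recorded contamination, lowest income first."""
    # Villages missing from the soil data are excluded rather than assumed clean.
    safe = [p for p in products if soil_polluted.get(p.village) is False]
    return sorted(safe, key=lambda p: village_income.get(p.village, float("inf")))


ranked = recommend(products, soil_polluted, village_income)
print([p.name for p in ranked])
```

Treating unknown villages as not recommended is a deliberately cautious choice for the sketch; a real spatial join would match parcels against contamination polygons rather than village names.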

Page 36: Data Science Program by Code for Tomorrow

Urban Design

Page 37: Data Science Program by Code for Tomorrow

http://horizonroylin.blogspot.tw/2013/09/urban-design-data-1-design-process-and.html

Urban renewal analysis, New York City, USA

Page 38: Data Science Program by Code for Tomorrow

http://horizonroylin.blogspot.tw/2013/09/urban-design-data-1-design-process-and.html

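Urban renewal analysis, New York City, USA (2)

Subway passenger entries and exits; industrial land distribution

Land-use zoning; regional rail network

Hypothesis:

This is a post-industrial area where economic activity is depressed and foot traffic is sparse. The large vacant site at its center cuts the two sides of the city off from each other, and the government hopes to re-plan it....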

Page 39: Data Science Program by Code for Tomorrow

http://horizonroylin.blogspot.tw/2013/09/urban-design-data-1-design-process-and.html

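Urban renewal analysis, New York City, USA (3)

Conversion of factories to warehousing; population and land use

Design concept:

We hope to consolidate the large logistics demand above the train yard and, by integrating it with elevated roads, rail, and surface streets, achieve higher logistics and storage efficiency. This would free up the high-value waterfront land next to Manhattan for more residential and commercial space.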

“Mega Storage”

Page 40: Data Science Program by Code for Tomorrow

The Relationship Between Elections and Residents' Income

Page 41: Data Science Program by Code for Tomorrow

DPP vote share in the 2010 mayoral election

2010 taxable income (NT$)

• 0 - 463,000

• 463,000 - 516,000

• 516,000 - 562,000

• 562,000 - 617,000

• 617,000 - 689,000

• 689,000 - 803,000

• 803,000 - 1,060,000

http://www.slideshare.net/ckliu/data-for-development
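
The legend on this slide groups 2010 taxable income into seven brackets. Assigning an income figure to its bracket can be sketched as below. The bracket edges are copied from the legend; the rule that a boundary value falls into the higher bracket, and the function name, are assumptions, since the slide does not say how boundaries are handled.

```python
from bisect import bisect_right

# Lower edges of the seven brackets in the legend (NT$), plus the top edge.
LOWER_EDGES = [0, 463_000, 516_000, 562_000, 617_000, 689_000, 803_000]
TOP_EDGE = 1_060_000


def income_bracket(income: int) -> str:
    """Return the legend label of the bracket that contains `income`."""
    if income < 0 or income > TOP_EDGE:
        raise ValueError(f"income {income} is outside the legend's range")
    # bisect_right sends a value equal to an edge into the higher bracket (assumption).
    i = bisect_right(LOWER_EDGES, income) - 1
    upper = LOWER_EDGES[i + 1] if i + 1 < len(LOWER_EDGES) else TOP_EDGE
    return f"{LOWER_EDGES[i]:,} - {upper:,}"


print(income_bracket(500_000))
```

Once each district has a bracket, the vote shares can be grouped by bracket to look for the pattern the map suggests.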

Page 42: Data Science Program by Code for Tomorrow

Lecturers

Mentors

Who We Need

Statistics

Data Mining

Data Visual Analysis

High-Level Introduction

Volunteers