mediaeval 2015 - geo_ml @ mediaeval placing task 2015

11

Click here to load reader

Upload: multimediaeval

Post on 20-Jan-2017

178 views

Category:

Education


0 download

TRANSCRIPT

Page 1: MediaEval 2015 - Geo_ML @ MediaEval Placing Task 2015

Geo ML @ MediaEval Placing Task 2015

Geo ML @ MediaEval Placing Task 2015MediaEval 2015, Wurzen, Germany

Nghia Duong-Trung, Martin Wistuba, Lucas Rego Drumond, LarsSchmidt-Thieme

Information Systems and Machine Learning Lab (ISMLL)University of Hildesheim, Germany

September 14-15, 2015

Duong-Trung et al., Information Systems and Machine Learning Lab (ISMLL), University of Hildesheim, Germany 1 / 8

Page 2: MediaEval 2015 - Geo_ML @ MediaEval Placing Task 2015

Geo ML @ MediaEval Placing Task 2015

Outline

Three-step approach

Experimental results

Discussions and outlook

Duong-Trung et al., Information Systems and Machine Learning Lab (ISMLL), University of Hildesheim, Germany 2 / 8

Page 3: MediaEval 2015 - Geo_ML @ MediaEval Placing Task 2015

Geo ML @ MediaEval Placing Task 2015 Three-step approach

Outline

Three-step approach

Experimental results

Discussions and outlook

Duong-Trung et al., Information Systems and Machine Learning Lab (ISMLL), University of Hildesheim, Germany 3 / 8

Page 4: MediaEval 2015 - Geo_ML @ MediaEval Placing Task 2015

Geo ML @ MediaEval Placing Task 2015 Three-step approach

Notations

I m, v , n geotagged multimedia items in the training and test data,number of features respectively

I Each item is annotated with a geo-location y ∈ R2, y = (y lat , y lon)where y lat ∈ R is the latitude and y lon ∈ R is the longitude

I X train ∈ Rm×n, and the respective labels Y train ∈ Rm×2

I find a model f : Rn → R2 such that for some test data X test ∈ Rv×n,the error

∑vi=1 d(f (X test

i ),Y testi ) is minimal

Duong-Trung et al., Information Systems and Machine Learning Lab (ISMLL), University of Hildesheim, Germany 3 / 8

Page 5: MediaEval 2015 - Geo_ML @ MediaEval Placing Task 2015

Geo ML @ MediaEval Placing Task 2015 Three-step approach

Three-step approach

I Find a model f : Rn → R2

I Three steps:

I find a cluster assignment vector c ∈ Cm, where the i-th element cicontains the cluster assigned to the i-th instance based on itsgeo-location yi

I learn a model on X train and c in order to map the test instances tothose clusters

I predicted geo-location ytesti is the geo-location of the nearest neighborin the same cluster

Duong-Trung et al., Information Systems and Machine Learning Lab (ISMLL), University of Hildesheim, Germany 4 / 8

Page 6: MediaEval 2015 - Geo_ML @ MediaEval Placing Task 2015

Geo ML @ MediaEval Placing Task 2015 Three-step approach

Three-step approach

Algorithm 1 Three-step approach

Require: X train, X test , Y train, cost s, number of clusters k1: {Step 1: k-means clustering}2: c← Kmeans(Y train, k)3: {Step 2: Linear SVM}4: g ← LinearSVM(X train, s, c)5: {Step 3: k-nearest neighbor}6: for i = 1 . . . v do7: ci ← g(X test

i )8: X ,Y ← rows of X train,Y train belonging to cluster ci9: yi ← 1NNRegression(X ,Y )

10: end for11: return yi

Duong-Trung et al., Information Systems and Machine Learning Lab (ISMLL), University of Hildesheim, Germany 5 / 8

Page 7: MediaEval 2015 - Geo_ML @ MediaEval Placing Task 2015

Geo ML @ MediaEval Placing Task 2015 Experimental results

Outline

Three-step approach

Experimental results

Discussions and outlook

Duong-Trung et al., Information Systems and Machine Learning Lab (ISMLL), University of Hildesheim, Germany 6 / 8

Page 8: MediaEval 2015 - Geo_ML @ MediaEval Placing Task 2015

Geo ML @ MediaEval Placing Task 2015 Experimental results

Experimental results

I Implementation achieves a median error of 352.47 km to the test set.

distance # items percentage

0.001 km 504 0.05 %

0.01 km 1051 0.11 %

1 km 11849 1.25 %

10 km 287807 30.03 %

100 km 418831 44.09 %

1000 km 566791 59.67 %

10000 km 911364 95.94 %

40000 km 949889 100.00 %

Duong-Trung et al., Information Systems and Machine Learning Lab (ISMLL), University of Hildesheim, Germany 6 / 8

Page 9: MediaEval 2015 - Geo_ML @ MediaEval Placing Task 2015

Geo ML @ MediaEval Placing Task 2015 Discussions and outlook

Outline

Three-step approach

Experimental results

Discussions and outlook

Duong-Trung et al., Information Systems and Machine Learning Lab (ISMLL), University of Hildesheim, Germany 7 / 8

Page 10: MediaEval 2015 - Geo_ML @ MediaEval Placing Task 2015

Geo ML @ MediaEval Placing Task 2015 Discussions and outlook

Discussion and outlook

I Conceptual to understand and simple to implement

I Easily replace and integrate new techniques in the workflow withoutredesigning the complete system

I Outlook

I Only apply K-means on Y train without any textual knowledge orlanguage models

I Do not apply feature ranking, dimension reduction techniques. (alreadydid in another following up paper)

Duong-Trung et al., Information Systems and Machine Learning Lab (ISMLL), University of Hildesheim, Germany 7 / 8

Page 11: MediaEval 2015 - Geo_ML @ MediaEval Placing Task 2015

Geo ML @ MediaEval Placing Task 2015 Discussions and outlook

Thank you for your attention.

Duong-Trung et al., Information Systems and Machine Learning Lab (ISMLL), University of Hildesheim, Germany 8 / 8