SELF-TAUGHT LEARNING: TRANSFER LEARNING FROM UNLABELED DATA

Richard Wen

rwen@ryerson.ca

April 7, 2017

Article Summary for Knowledge Discovery Course Instructed by Dr. Cherie Ding

OUTLINE

1. Introduction

2. Methods

3. Results

4. Discussion and Conclusion

INTRODUCTION

LEARNING

WHAT WE SEE

WHAT THE COMPUTER SEES

HIGHER LEVEL FEATURES

WHY?

Labeled data is expensive

Abundance of unlabeled data

Less restrictive data requirements

METHODS

TWO STAGES

1. Learn representation with unlabeled data

2. Apply to labeled data for classification

LABELED TRAINING DATA

A set of m labeled training examples with:

x as n-dimensional feature vectors

y as the corresponding class labels in {1, ..., C}

subscript l indicating a labeled example

Raina et al. (2007)
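
The labeled set described on this slide can be written out as follows. This is a sketch in the notation of Raina et al. (2007); the symbol T for the set is used here only for illustration.

```latex
T = \left\{ \left(x_l^{(1)}, y^{(1)}\right), \left(x_l^{(2)}, y^{(2)}\right), \dots, \left(x_l^{(m)}, y^{(m)}\right) \right\},
\qquad x_l^{(i)} \in \mathbb{R}^n, \quad y^{(i)} \in \{1, \dots, C\}
```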

UNLABELED TRAINING DATA

A set of k unlabeled training examples with:

x as n-dimensional feature vectors

subscript u indicating an unlabeled example

Raina et al. (2007)
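
The unlabeled set described on this slide can be written in the same style. This is a sketch in the notation of Raina et al. (2007); note that no labels y appear.

```latex
\left\{ x_u^{(1)}, x_u^{(2)}, \dots, x_u^{(k)} \right\},
\qquad x_u^{(i)} \in \mathbb{R}^n
```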

LEARNING BASES (BASIC ELEMENTS)

Optimize the weights a_j and bases b_j to:

1. Reconstruct each x_u as a weighted linear combination of the bases

2. Encourage the weights a_j to be sparse (mostly zero)

Raina et al. (2007)
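
The two goals above correspond to the two terms of the sparse coding objective in Raina et al. (2007). As a sketch in that paper's notation: the first term is the reconstruction error, the second is an L1 penalty that pushes weights toward zero, and beta is a trade-off parameter that does not appear on this slide. The norm constraint keeps the bases from growing without bound.

```latex
\begin{aligned}
\min_{b,\,a} \quad & \sum_{i=1}^{k} \Big\| x_u^{(i)} - \sum_{j} a_j^{(i)} b_j \Big\|_2^2 + \beta \, \big\| a^{(i)} \big\|_1 \\
\text{s.t.} \quad & \| b_j \|_2 \le 1 \quad \text{for all } j
\end{aligned}
```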

LEARNING FEATURES

Using the learned bases b_j, compute sparse features from the labeled data as input to supervised algorithms

Raina et al. (2007)
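
The feature step above keeps the bases fixed and solves for the sparse weights of one labeled example. A minimal sketch follows, assuming plain iterative soft-thresholding (ISTA) as the solver, which is not necessarily the solver used by Raina et al. (2007). The function names, the penalty beta, the iteration count, and the toy bases are all hypothetical and chosen only for illustration.

```python
def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(ui * vi for ui, vi in zip(u, v))


def soft_threshold(z, t):
    """Shrink z toward zero by t; values within [-t, t] become exactly 0."""
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0


def reconstruct(bases, a):
    """Weighted linear combination sum_j a[j] * bases[j]."""
    n = len(bases[0])
    return [sum(a[j] * bases[j][d] for j in range(len(bases))) for d in range(n)]


def sparse_features(x, bases, beta, n_iter=2000):
    """Approximately minimise ||x - sum_j a_j b_j||^2 + beta * ||a||_1 over a.

    ISTA: a gradient step on the squared error, then soft-thresholding.
    """
    # Upper bound on the gradient's Lipschitz constant (assumption: a simple
    # Frobenius-norm bound is used instead of the exact largest eigenvalue).
    lipschitz = 2.0 * sum(dot(b, b) for b in bases)
    step = 1.0 / lipschitz
    a = [0.0] * len(bases)
    for _ in range(n_iter):
        x_hat = reconstruct(bases, a)
        residual = [xi - hi for xi, hi in zip(x, x_hat)]
        grad = [-2.0 * dot(b, residual) for b in bases]
        a = [soft_threshold(aj - step * gj, step * beta) for aj, gj in zip(a, grad)]
    return a


# Toy example: three unit-norm bases in two dimensions (more bases than dimensions).
toy_bases = [[1.0, 0.0], [0.0, 1.0], [0.6, 0.8]]
x_l = [3.0, 4.0]  # equals 5 times the third basis
features = sparse_features(x_l, toy_bases, beta=0.1, n_iter=5000)
print([round(w, 3) for w in features])
```

In this toy case the weights should approach [0, 0, 4.95]: the single basis aligned with x_l carries the reconstruction and the other two weights are driven to zero, which is the sparsity the slide describes. The resulting vector would then be passed to a supervised classifier in place of, or alongside, the raw features.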

SPARSE FEATURES EXAMPLE

Raina et al. (2007)

RESULTS

CLASSIFICATION EXPERIMENTS

Principal Component Analysis (PCA) vs Raw vs Sparse Coding (SC) Features

Support Vector Machine (SVM)

Gaussian Discriminant Analysis (GDA)

HANDWRITTEN DIGITS AND ENGLISH CHARACTERS

Improvements when SC was used together with Raw features for characters

SC did not perform as well alone for characters

SC generally performed better for digits

Improvements ranged from ~1-7%

REUTERS WEBPAGES AND ARTICLES

SC generally performed well for webpages and articles

Improvements ranged from ~8-21%

KERNELS (SIMILARITY FUNCTIONS)

Sparse coding kernel compared against linear, polynomial, and Radial Basis Function (RBF) kernels

Outperforms the standard kernel choices above

Improvements ranged from ~6-13%
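
For reference, the three standard kernels named above can be sketched as plain similarity functions between two feature vectors. The parameter names and default values (degree, coef0, gamma) are assumptions for illustration; the slide does not state which settings were used in the experiments, and the sparse coding kernel itself is not reproduced here.

```python
import math


def linear_kernel(u, v):
    """Linear kernel: the inner product of u and v."""
    return sum(ui * vi for ui, vi in zip(u, v))


def polynomial_kernel(u, v, degree=2, coef0=1.0):
    """Polynomial kernel: (u . v + coef0) ** degree."""
    return (linear_kernel(u, v) + coef0) ** degree


def rbf_kernel(u, v, gamma=0.5):
    """RBF kernel: exp(-gamma * squared Euclidean distance)."""
    sq_dist = sum((ui - vi) ** 2 for ui, vi in zip(u, v))
    return math.exp(-gamma * sq_dist)


u, v = [1.0, 2.0], [3.0, 4.0]
print(linear_kernel(u, v), polynomial_kernel(u, v), rbf_kernel(u, v))
```

Each function returns a larger value for more similar inputs; an SVM uses such a function in place of an explicit feature mapping.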

DISCUSSION AND CONCLUSION

DISCUSSION

Can be applied to different domains

Can have more basis vectors b_j than input dimensions n (an overcomplete basis)

Unlabeled data must still have some structure

Other algorithms can be modified for self-taught learning

CONCLUSION

Find higher level representations of patterns

Use of inexpensive unlabeled data

Self-taught learning as a machine learning framework

REFERENCES

Raina, R., Battle, A., Lee, H., Packer, B., & Ng, A. Y. (2007). Self-taught learning. Proceedings of the 24th International Conference on Machine Learning - ICML '07. doi:10.1145/1273496.1273592
