using big data and machine learning to protect your online service

62

Upload: truongtu

Post on 14-Feb-2017

218 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Using Big Data and Machine Learning to Protect Your Online Service
Page 3: Using Big Data and Machine Learning to Protect Your Online Service

www.microsoft.com/learning

http://developer.microsoft.com http://microsoft.com/technet

http://channel9.msdn.com/Events/TechEd

Page 5: Using Big Data and Machine Learning to Protect Your Online Service
Page 6: Using Big Data and Machine Learning to Protect Your Online Service
Page 7: Using Big Data and Machine Learning to Protect Your Online Service

Using big data to find

service attack surface

Using machine learning to

detect malicious requests

Page 8: Using Big Data and Machine Learning to Protect Your Online Service
Page 9: Using Big Data and Machine Learning to Protect Your Online Service
Page 10: Using Big Data and Machine Learning to Protect Your Online Service
Page 11: Using Big Data and Machine Learning to Protect Your Online Service
Page 12: Using Big Data and Machine Learning to Protect Your Online Service
Page 13: Using Big Data and Machine Learning to Protect Your Online Service
Page 14: Using Big Data and Machine Learning to Protect Your Online Service

Top-ranked:

www.microsoftkey.ml

Page 15: Using Big Data and Machine Learning to Protect Your Online Service

Can peek at past via web.archive.org

Page 16: Using Big Data and Machine Learning to Protect Your Online Service
Page 17: Using Big Data and Machine Learning to Protect Your Online Service
Page 18: Using Big Data and Machine Learning to Protect Your Online Service
Page 19: Using Big Data and Machine Learning to Protect Your Online Service
Page 20: Using Big Data and Machine Learning to Protect Your Online Service

Username Password guess

[email protected] abcdefg

[email protected] 123456

[email protected] password

[email protected] princess

[email protected] monkey

[email protected] 12345678

[email protected] Password

Username Password

[email protected] abcdefg

[email protected] abcdefg

[email protected] abcdefg

[email protected] abcdefg

[email protected] abcdefg

[email protected] abcdefg

[email protected] abcdefg

Other strategies: replay {username, password} pairs from leaked datasets

Slide by Cormac Herley, Microsoft Research

Page 21: Using Big Data and Machine Learning to Protect Your Online Service
Page 22: Using Big Data and Machine Learning to Protect Your Online Service
Page 23: Using Big Data and Machine Learning to Protect Your Online Service
Page 24: Using Big Data and Machine Learning to Protect Your Online Service
Page 25: Using Big Data and Machine Learning to Protect Your Online Service

Cooling

Chillers · Air handling

Operations

Provisioning · Monitoring · Security

Page 26: Using Big Data and Machine Learning to Protect Your Online Service

2014-09-30 05:25:12 127.0.0.1 GET http://www.bing.com/?q=surface - 80 -127.0.0.1 Mozilla/5.0+(Windows+NT+6.3;+WOW64;+Trident/7.0;+rv:11.0) -200 0 0 1359

Page 27: Using Big Data and Machine Learning to Protect Your Online Service

Get code from:

https://github.com/alissonsol/TechEd.Europe.2014

Page 28: Using Big Data and Machine Learning to Protect Your Online Service

~10-100 billion requests

Page 29: Using Big Data and Machine Learning to Protect Your Online Service
Page 30: Using Big Data and Machine Learning to Protect Your Online Service
Page 31: Using Big Data and Machine Learning to Protect Your Online Service
Page 32: Using Big Data and Machine Learning to Protect Your Online Service
Page 33: Using Big Data and Machine Learning to Protect Your Online Service
Page 34: Using Big Data and Machine Learning to Protect Your Online Service
Page 35: Using Big Data and Machine Learning to Protect Your Online Service
Page 36: Using Big Data and Machine Learning to Protect Your Online Service
Page 37: Using Big Data and Machine Learning to Protect Your Online Service

M

Page 38: Using Big Data and Machine Learning to Protect Your Online Service

M

O

Page 39: Using Big Data and Machine Learning to Protect Your Online Service

M

O

D

Page 40: Using Big Data and Machine Learning to Protect Your Online Service

M

O

D

E

Page 41: Using Big Data and Machine Learning to Protect Your Online Service

M

O

D

E

L

Page 42: Using Big Data and Machine Learning to Protect Your Online Service
Page 43: Using Big Data and Machine Learning to Protect Your Online Service

Source:

Machine learning in action

By: Harrington, Peter.

Manning Publications

2012

Movie Tag # of kicks # of kisses LabelA 3 104 RomanceB 2 100 RomanceC 1 81 RomanceD 101 10 ActionE 99 5 ActionF 98 2 Action? 18 90 Unknown

Page 44: Using Big Data and Machine Learning to Protect Your Online Service

AB

C

DEF

?

0

20

40

60

80

100

120

0 20 40 60 80 100 120

# o

f k

isse

s

# of kicks

Page 45: Using Big Data and Machine Learning to Protect Your Online Service

Tag # of kicks # of kisses Label Distance

? 18 90 Unknown 0.0

A 3 104 Romance 20.5

B 2 100 Romance 18.9

C 1 81 Romance 19.2

D 101 10 Action 115.3

E 99 5 Action 117.4

F 98 2 Action 118.9

Page 46: Using Big Data and Machine Learning to Protect Your Online Service

Tag # of kicks # of kisses Label Distance

? 18 90 Unknown 0.0

B 2 100 Romance 18.9

C 1 81 Romance 19.2

A 3 104 Romance 20.5

D 101 10 Action 115.3

E 99 5 Action 117.4

F 98 2 Action 118.9

if k = 3 then 3 nearest neighbors are: Romance

Page 47: Using Big Data and Machine Learning to Protect Your Online Service

AB

C

DEF

?

0

20

40

60

80

100

120

0 20 40 60 80 100 120

# o

f k

isse

s

# of kicks

𝜇0

𝜇1

Repeat

1. Compute centroids for k sets

2. Reassign point to group with closest centroid

3. Stop criteria: max iterations or no change(no change on centroids or group assignments)

Minimize

𝐽 =

𝑗=0

𝑘−1

𝑛∈𝑆𝑗

[𝑥𝑛 − 𝜇𝑗]2

Page 48: Using Big Data and Machine Learning to Protect Your Online Service

Tag # of kicks # of kisses CentroidA 3 104 𝜇0

(2, 95)B 2 100C 1 81D 101 10 𝜇1

(99.33, 5.67)E 99 5F 98 2? 18 90 ?

𝑑𝑖𝑠𝑡𝑎𝑛𝑐𝑒2 𝜇0 = (18 − 2)2+(90 − 95)2= 281

𝑑𝑖𝑠𝑡𝑎𝑛𝑐𝑒2 𝜇1 = (18 − 99.33)2+(90 − 5.67)2= 13727.22

Page 49: Using Big Data and Machine Learning to Protect Your Online Service

AB

C

D

EF

?

0

20

40

60

80

100

120

0 20 40 60 80 100 120

# o

f k

isse

s

# of kicks

Page 50: Using Big Data and Machine Learning to Protect Your Online Service

A

R

R

R

R

R

R

R

R

R

R

R

R

A

A

R

R

R

R

R

R

R

R

R

R

R

A

A

A

R

R

R

R

R

R

R

R

R

R

A

A

A

A

R

R

R

R

R

R

R

R

R

A

A

A

A

A

R

R

R

R

R

R

R

R

A

A

A

A

A

A

R

R

R

R

R

R

R

A

A

A

A

A

A

A

R

R

R

R

R

R

A

A

A

A

A

A

A

A

R

R

R

R

R

A

A

A

A

A

A

A

A

A

R

R

R

R

A

A

A

A

A

A

A

A

A

A

R

R

R

A

A

A

A

A

A

A

A

A

A

A

R

R

A

A

A

A

A

A

A

A

A

A

A

A

R

A

A

A

A

A

A

A

A

A

A

A

A

A

0

20

40

60

80

100

120

0 20 40 60 80 100 120

Page 51: Using Big Data and Machine Learning to Protect Your Online Service
Page 52: Using Big Data and Machine Learning to Protect Your Online Service
Page 53: Using Big Data and Machine Learning to Protect Your Online Service

Param Size Keywords Attacksurface 7 0 N

<script>alert("XSS");</script> 30 3 Y<script src="XXS.js"></script> 30 2 Y

%3Cscript%3Ealert(%22XSS%22)%3B%3C%2Fscript%3E 46 3 ?

𝐻𝑎𝑟𝑑𝑠ℎ𝑖𝑝: 𝑔𝑟𝑜𝑢𝑛𝑑 𝑡𝑟𝑢𝑡ℎ…

Page 54: Using Big Data and Machine Learning to Protect Your Online Service
Page 55: Using Big Data and Machine Learning to Protect Your Online Service

Scoring Action

True Positive

True Negative

False Positive Boosting GT

False Negative Boosting GT

Page 56: Using Big Data and Machine Learning to Protect Your Online Service
Page 57: Using Big Data and Machine Learning to Protect Your Online Service
Page 58: Using Big Data and Machine Learning to Protect Your Online Service
Page 59: Using Big Data and Machine Learning to Protect Your Online Service
Page 60: Using Big Data and Machine Learning to Protect Your Online Service

SDLSDL+ pentest

Big Data + ML

SDL + pentest

+ Big Data

+ ML

Page 61: Using Big Data and Machine Learning to Protect Your Online Service
Page 62: Using Big Data and Machine Learning to Protect Your Online Service