universal adversarial perturbations seyed-mohsen moosavi...

Universal adversarial perturbationsSeyed-Mohsen Moosavi-Dezfooli*, Alhussein Fawzi+, Omar Fawzi†, Pascal Frossard*

*EPFL, +UCLA,†ENS-LyonLTS

Round number0 1 2 3

Fooling percentage

76 80 79

cashmachine

pillow

computerkeyboard

diningtable

envelopemedicinechest

microwave

mosquitonet

pencilbox

platerack

refrigerator

television

wardrobe

windowshade

CaffeNet GoogLeNet

VGG-16 VGG-19

ResNet-152

Random

Adv. pert. (DF)

Adv. pert. (FGS)

Sum of adv.

ImageNet bias

Universal

Fooling percentage of the perturbations with the same norm

Robustness of deep networksFor image classifiers, achieving robustness to perturbations is a crucial re-quirement

The algorithmFinding universal perturbation using the set X

Misclassification rateHigh misclassification rate on unseen test data

What is special in these perturbations?Comparison of universal perturbation to other types of perturbations

Doubly universal perturbationsThe ability to generalize well across different neural networks

Universal adversarial perturbationA single perturbation causing misclassification with high probability

Feedbacking does not help!Augmenting the training data with perturbed images

Explaining universal perturbationsThe decision boundary in the vicinity of natural images is highly correlated!

Perturbed labelsExistence of dominant labels that suck most other labels.

New theoretical analysis:Check our new work: “Analysis of universal adversarial perturbations”

Random noise achieving 90% fooling rate

E.g. adversarial perturbations causing misclassification

High volume classification regions!

... because universal pertur-bations are diverse!

Whale Turtle

BallonJoystick Flag pole

Face powder

Labrador

LabradorChihuahua

Chihuahua1 50’000

Plot of singular values

Random vectors

Normals of the decision boundary

“Universal adversarial perturbations”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017, Honolulu, US.

github.com/lts4/universal

Lampshade

PerturbationClassifier Lampshade?k̂

Algorithm Computation of universal perturbations.

1: input: Data points X , classifier k̂, desired �p norm ofthe perturbation ξ, desired accuracy on perturbed sam-ples δ.

2: output: Universal perturbation vector v.3: Initialize v ← 0.4: while Err (Xv) ≤ 1− δ do5: for each datapointxi ∈ X do6: if k̂(xi + v) = k̂(xi) then7: Compute the minimal perturbation that

sendsxi + v to the decision boundary:

∆vi ← argminr

‖r‖2 s.t. k̂(xi + v + r) �= k̂(xi).

8: Update the perturbation:

v ← Pp,ξ(v +∆vi).

9: end if10: end for11: end while

x1,2,3

∆v 1

x1,2,3

R3∆v 1

x1,2,3

v∆v 2

Cross-model fooling percentage for VGG-16

VGG-19

ResNet-152

GoogLeNet

CaffeNet

Round 0 Round 1

Round 2 Round 3

universal adversarial perturbations seyed-mohsen moosavi...

Documents

!+v measurements @ cms -...

university of udine - cineca (seyed... · seyed amir...

seyed mostafa sadeghi - uah

mohsen mohammed taqi mohsen al-saleh statistics-6000 ·...

carnap and quine on analyticity - ruor.uottawa.ca...

seyed mohammad_karimi.pdf

kolivand mohsen

seyed portfolio

cytotaxonomy of some onobrychis (fabaceae) species · pdf...

sparsefool: a few pixels make a big difference · 2019. 6....

seyed mahyad aghigh_msc

journal of complementary and integrative medicine ·...

v. united states, petition no. p-1939-13, u.s. response to...

a non-invasive insight into soft-tissue sarcomas shm ......a...

post-transplant lymphoproliferative disorder after liver...

prof. albert m. k. cheng - university of houstonprof. albert...

curriculum vitae professor seyed abbas shojaosadati ·...

robustness and evasion attacks - stanford university ·...

1 classiﬁcation regions of deep neural networks1...

robustness of classifiers: from adversarial to random noise...