PAC Learning and the VC Dimension


Page 1: PAC Learning and The VC Dimension
Page 2: PAC Learning and The VC Dimension

Fix a rectangle (unknown to you):

From An Introduction to Computational Learning Theory by Kearns and Vazirani

Page 3: PAC Learning and The VC Dimension

Draw points from some fixed unknown distribution:

Page 4: PAC Learning and The VC Dimension

You are told the points and whether they are in or out:

Page 5: PAC Learning and The VC Dimension

You propose a hypothesis:

Page 6: PAC Learning and The VC Dimension

Your hypothesis is tested on points drawn from the same distribution:

Page 7: PAC Learning and The VC Dimension

We want an algorithm that:
◦ With high probability will choose a hypothesis that is approximately correct (hence “Probably Approximately Correct”, or PAC).

Page 8: PAC Learning and The VC Dimension

Choose the minimum area rectangle containing all the positive points:

[Figure: tightest-fit rectangle hypothesis h around the positive points]
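A minimal sketch of this tightest-fit learner (the function names are mine, not the slides'):

```python
# The hypothesis h is the smallest axis-aligned rectangle containing
# every positive training point (assumes at least one positive example).
import numpy as np

def fit_rectangle(X, y):
    """X: (m, 2) points; y: (m,) labels in {+1, -1}. Returns (lo, hi) corners of h."""
    pos = X[y == 1]
    return pos.min(axis=0), pos.max(axis=0)

def predict(h, X):
    lo, hi = h
    return np.where(np.all((X >= lo) & (X <= hi), axis=1), 1, -1)
```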

Page 9: PAC Learning and The VC Dimension

Derive a PAC bound:

For fixed:
◦ R : Rectangle
◦ D : Data Distribution
◦ ε : Test Error
◦ δ : Probability of failing
◦ m : Number of Samples

[Figure: target rectangle R with hypothesis h inside it]

Page 10: PAC Learning and The VC Dimension

We want to show that with high probability the mass (measured with respect to D) of the error region between h and R is bounded by ε:

[Figure: hypothesis h inside target R; the shaded region between them has mass < ε]

Page 11: PAC Learning and The VC Dimension

It suffices to split the error region into four strips (top, bottom, left, right) and show that, with high probability, each strip has mass (with respect to D) at most ε/4:

[Figure: the top strip of the error region between h and R has mass < ε/4]

Page 12: PAC Learning and The VC Dimension

Define T to be the region that contains exactly ε/4 of the mass in D sweeping down from the top of R.

Let T′ be the top strip of the actual error region (the part of R above the top edge of h). Then:

p(T′) > ε/4 = p(T) iff T′ contains T

T′ contains T iff none of our m samples are from T

What is the probability that all samples miss T?

[Figure: strips T′ and T sweeping down from the top of R, with p(T) = ε/4]

Page 13: PAC Learning and The VC Dimension

What is the probability that all m samples miss T? Since p(T) = ε/4 and the samples are drawn independently from D, it is (1 − ε/4)^m.

What is the probability that we miss any of the four strips?
◦ Union Bound: at most 4(1 − ε/4)^m

[Figure: strips T′ and T at the top of R, with p(T) = ε/4]
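Spelled out as a worked bound (the notation is mine, not the slides': E_i is the event that all m samples miss strip T_i, for the four strips):

```latex
\Pr[\mathrm{err}_D(h) > \epsilon]
  \le \Pr[E_1 \cup E_2 \cup E_3 \cup E_4]
  \le \sum_{i=1}^{4} \Pr[E_i]
  = 4\left(1 - \frac{\epsilon}{4}\right)^{m}
```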

Page 14: PAC Learning and The VC Dimension

[Figure: union bound illustration, p(A ∪ B) ≤ p(A) + p(B) for events A and B]

Page 15: PAC Learning and The VC Dimension

What is the probability that all m samples miss T: (1 − ε/4)^m

What is the probability that we miss any of the four strips?
◦ Union Bound: 4(1 − ε/4)^m

[Figure: strip T at the top of R, with p(T) = ε/4]

Page 16: PAC Learning and The VC Dimension

The probability that any strip retains weight greater than ε/4 after m samples is at most 4(1 − ε/4)^m.

If we fix m such that 4(1 − ε/4)^m ≤ δ, then with probability 1 − δ we achieve an error rate of at most ε.

[Figure: strip T at the top of R, with p(T) = ε/4]
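To see the guarantee in action, here is a quick Monte Carlo check; it is a sketch under assumptions of my own (D uniform on [0,1]², target R = [0.2, 0.8]², the helper names), none of which come from the slides:

```python
# Monte Carlo check of the PAC guarantee for the tightest-fit rectangle
# learner. Assumed setup: D uniform on [0,1]^2, target R = [0.2, 0.8]^2.
import numpy as np

rng = np.random.default_rng(0)
eps, delta = 0.1, 0.05
m = int(np.ceil(4 / eps * np.log(4 / delta)))   # m satisfying 4(1 - eps/4)^m <= delta

lo_R, hi_R = np.array([0.2, 0.2]), np.array([0.8, 0.8])
area_R = np.prod(hi_R - lo_R)                   # mass of R under the uniform D

failures, trials = 0, 1000
for _ in range(trials):
    X = rng.random((m, 2))
    pos = X[np.all((X >= lo_R) & (X <= hi_R), axis=1)]
    if len(pos) == 0:
        err = area_R                            # empty hypothesis misses all of R
    else:
        lo_h, hi_h = pos.min(axis=0), pos.max(axis=0)
        err = area_R - np.prod(hi_h - lo_h)     # h lies inside R, so the error
                                                # region is R minus h
    failures += err > eps
print(f"m = {m}, empirical failure rate = {failures / trials:.3f} (bound: delta = {delta})")
```

The empirical failure rate comes out far below δ, which is expected: the union bound is loose.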

Page 17: PAC Learning and The VC Dimension

Common Inequality: (1 − x) ≤ e^(−x)

We can show: 4(1 − ε/4)^m ≤ 4e^(−mε/4) ≤ δ

Obtain a lower bound on the samples: m ≥ (4/ε) ln(4/δ)
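As a sanity check on the arithmetic, a minimal sketch (the helper name is mine):

```python
import math

def pac_sample_bound(eps, delta):
    """Smallest integer m with (4 / eps) * ln(4 / delta) <= m, which
    guarantees 4 * (1 - eps/4)**m <= delta."""
    return math.ceil(4 / eps * math.log(4 / delta))

print(pac_sample_bound(0.1, 0.05))   # 176 samples suffice for eps=0.1, delta=0.05
```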

Page 18: PAC Learning and The VC Dimension

The VC dimension provides a measure of the complexity of a “hypothesis space”, or the “power” of a “learning machine”.

Higher VC dimension implies the ability to represent more complex functions

The VC dimension is the maximum number of points that can be arranged so that f shatters them.

What does it mean to shatter?

Page 19: PAC Learning and The VC Dimension

A classifier f can shatter a set of points if and only if, for every assignment of labels to those points, f achieves zero training error.

Example: f(x,b) = sign(x·x − b), which labels as positive exactly the points outside the circle of radius √b centered at the origin.
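A brute-force shattering check makes the definition concrete; this is a sketch with my own function names and a grid of candidate thresholds b:

```python
# f(x, b) = sign(x . x - b) shatters a point set iff every +/-1 labeling
# of the points is realized by some threshold b.
import itertools
import numpy as np

def f(X, b):
    return np.where(np.sum(X * X, axis=1) - b > 0, 1, -1)

def shatters(X, bs):
    labelings = {tuple(f(X, b)) for b in bs}
    return all(y in labelings
               for y in itertools.product([-1, 1], repeat=len(X)))

bs = np.linspace(-1.0, 10.0, 1000)
print(shatters(np.array([[1.0, 0.0]]), bs))              # True: one point
print(shatters(np.array([[1.0, 0.0], [2.0, 0.0]]), bs))  # False: the labeling
                                                         # (+, -) would need b < 1
                                                         # and b >= 4 at once
```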

Page 20: PAC Learning and The VC Dimension

What is the VC Dimension of the classifier:
◦ f(x,b) = sign(x·x − b)

Page 21: PAC Learning and The VC Dimension

Conjecture: VC-Dim = 1

Easy Proof (lower bound): a single point x with x·x = 1 is shattered, since b = 0 labels it positive and b = 2 labels it negative.

Page 22: PAC Learning and The VC Dimension

Harder Proof (upper bound): no two points can be shattered. Take x_1, x_2 with x_1·x_1 ≤ x_2·x_2; the labeling (+, −) is unachievable, since x_1·x_1 > b forces x_2·x_2 > b. Hence VC-Dim = 1.

Page 23: PAC Learning and The VC Dimension

VC Dimension Conjecture: 4 (axis-aligned rectangles)
◦ Lower bound: four points arranged in a diamond can be shattered by axis-aligned rectangles.

Page 24: PAC Learning and The VC Dimension

VC Dimension Conjecture: 4

Upper bound (more difficult): no five points can be shattered. Among any five points, pick a topmost, bottommost, leftmost, and rightmost point; any axis-aligned rectangle containing those four must also contain the fifth, so labeling the four positive and the fifth negative is unachievable (see the brute-force check below).
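A brute-force check of both directions (a sketch; the diamond point set and the grid of candidate rectangles are my own choices):

```python
# Axis-aligned rectangles shatter 4 points arranged in a diamond, but
# adding a 5th point at the center makes shattering impossible.
import itertools
import numpy as np

def rect_label(lo, hi, X):
    return tuple(np.where(np.all((X >= lo) & (X <= hi), axis=1), 1, -1))

def shattered_by_rectangles(X, grid):
    labelings = set()
    for lo in itertools.product(grid, grid):
        for hi in itertools.product(grid, grid):
            labelings.add(rect_label(np.array(lo), np.array(hi), X))
    return all(y in labelings
               for y in itertools.product([-1, 1], repeat=len(X)))

grid = np.linspace(-2.0, 2.0, 9)
diamond = np.array([[0, 1], [0, -1], [1, 0], [-1, 0]], dtype=float)
print(shattered_by_rectangles(diamond, grid))     # True: all 16 labelings
five = np.vstack([diamond, [[0.0, 0.0]]])
print(shattered_by_rectangles(five, grid))        # False: the center point
                                                  # can't be excluded alone
```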

Page 25: PAC Learning and The VC Dimension

What is the VC Dimension of:
◦ f(x,{w,b}) = sign(w · x + b)
◦ x in R^d

Proof (lower bound):
◦ Pick point locations {x_1, …, x_n}: take the origin together with the standard basis vectors e_1, …, e_d (so n = d + 1).
◦ The adversary gives label assignments {y_1, …, y_n} and you choose the weights w_1, …, w_d and b: setting b to the origin's label and w_i = 2y_i gives sign(w · x_i + b) = y_i for every point, so these d + 1 points are shattered.
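A small sketch of this construction (variable names are mine):

```python
# Shatter {origin, e_1, ..., e_d} with f(x) = sign(w . x + b):
# set b = label(origin) and w_i = 2 * label(e_i).
import numpy as np

def fit_labels(y_origin, y_basis):
    w = 2.0 * np.asarray(y_basis, dtype=float)
    b = float(y_origin)
    return w, b

d = 3
y_origin, y_basis = -1, [1, -1, 1]
w, b = fit_labels(y_origin, y_basis)
X = np.vstack([np.zeros(d), np.eye(d)])         # the origin plus e_1..e_d
print(np.sign(X @ w + b).astype(int).tolist())  # [-1, 1, -1, 1], as requested
```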

Page 26: PAC Learning and The VC Dimension
Page 27: PAC Learning and The VC Dimension

Proof (upper bound): VC-Dim = d + 1
◦ Take any d + 2 points; observe that the last point can always be expressed as an affine combination of the other d + 1:
x_{d+2} = Σ_i a_i x_i, with Σ_i a_i = 1

Page 28: PAC Learning and The VC Dimension

Proof (upper bound), continued:
◦ Since w · x_{d+2} + b = Σ_i a_i (w · x_i + b), choosing the labels y_i = sign(a_i) (for a_i ≠ 0) forces sign(w · x_{d+2} + b) = +1. The labeling that additionally sets y_{d+2} = −1 therefore cannot be realized, so no d + 2 points are shattered and VC-Dim = d + 1.

Page 29: PAC Learning and The VC Dimension