probability theory overview and analysis of randomized algorithms prepared by john reif, ph.d....
TRANSCRIPT
Probability Theory Overview and Analysis of Randomized
Algorithms
Prepared by
John Reif, Ph.D.
Analysis of Algorithms
Probability Theory Topics
a) Random Variables: Binomial and Geometric
b) Useful Probabilistic Bounds and Inequalities
Readings
• Main Reading Selections:– CLR, Chapter 5 and Appendix C
Probability Measures
• A probability measure (Prob) is a mapping from a set of events to the reals such that:1) For any event A
0 Prob(A) 12) Prob (all possible events) = 13) If A, B are mutually exclusive
events, thenProb(A B) = Prob (A) + Prob (B)
Conditional Probability
• Define
Prob(A B)Prob(A | B)
Prob(B)
for Prob(B) > 0
Bayes’ Theorem
• If A1, …, An are mutually exclusive and contain all events then
ii n
jj=1
j j j
PProb(A | B)
P
where P = Prob(B | A ) Prob(A )
Random Variable A (Over Real Numbers)
• Density Function
Af (x) = Prob(A=x)
Random Variable A (cont’d)
• Prob Distribution Function
x
A A
-
F (x) = Prob(A x) = f (x) dx
Random Variable A (cont’d)
• If for Random Variables A,B
• Then “A upper bounds B” and “B lower bounds A”
A Bx F (x) F (x)
A
B
F (x) Prob (A x)
F (x) Prob (B x)
Expectation of Random Variable A
• Ā is also called “average of A” and “mean of A” = A
A
-
E(A) = A = x f (x) dx
Variance of Random Variable A
2 2 2 2A
2 2A
-
σ (A A) A (A)
where 2nd momoment A x f (x) dx
Variance of Random Variable A (cont’d)
Discrete Random Variable A
Discrete Random Variable A (cont’d)
Discrete Random Variable A Over Nonnegative Numbers
• Expectation
Ax=0
E(A) = A = x f (x)
Pair-Wise Independent Random Variables
• A,B independent if
Prob(A B) = Prob(A) X Prob(B)
• Equivalent definition of independence
A B A B
A B A B
A B A B
f (x) = f (x) f (x)
M (s) = M (s) M (s)
G (z) = G (z) G (z)
Bounding Numbers of Permutations
• n! = n x (n-1) x 2 x 1= number of permutations of n objects
• Stirling’s formulan! = f(n) x (1+o(1)), where
n -nf(n) = n e 2πn
Bounding Numbers of Permutations (cont’d)
• Note– Tighter bound
= number of permutations of n objects taken k at a time
1 1(12n+1) 12nf(n) e < n! < f(n) e
n!
(n-k)!
Bounding Numbers of Combinations
= number of (unordered)
combinations of n objects
taken k at a time
• Bounds (due to Erdos & Spencer, p. 18)
n n!
k k! (n-k)!
2 3
2
k kk 2n 6nn n e
~ (1 o(1))k k!
3
4for k = o n
Bernoulli Variable
• Ai is 1 with prob P and 0 with prob 1-P
• Binomial Variable• B is sum of n independent Bernoulli
variables Ai each with some probability p
BINOMIAL with parameters n,p
B 0
i=1 n
probability P B B+1
procedure
begin
for to do
with do
output
end
B is Binomial Variable with Parameters n,p
n pmean
2 np (1-p)variance
B is Binomial Variable with Parameters n,p (cont’d)
x n-xn Prob(B=x) = p (1-p)
xdensity fn
xk n-k
k=0
n Prob(B x) = p (1-p)
kdistribution fn
Poisson Trial
• Ai is 1 with prob Pi and 0 with prob 1-Pi
• Suppose B’ is the
sum of n independent Poisson trials
Ai with probability Pi for i > 1, …, n
Hoeffding’s Theorem
• B’ is upper bounded by a Binomial Variable B
• Parameters n,p where
n
ii=1
Pp=
n
Geometric Variable V• parameter p
xx 0 Prob(V=x) = p(1-p)
GEOMETRIC parameter p
V 0
loop with probability 1-p
exit
procedure
begin
goto
Probabilistic Inequalities
• For Random Variable A
2 2 2
A
A (A)
mean
variance
Markov and Chebychev Probabilistic Inequalities
• Markov Inequality (uses only mean)
• Chebychev Inequality (uses mean and variance)
Prob (A x)x
2
2Prob ( A )
Example of use of Markov and Chebychev Probabilistic Inequalities
• If B is a Binomial with parameters n,p
2
npThen Prob (B x)
xnp (1-p)
Prob ( B np )
Gaussian Density Function
2x-
21
(x) = e2π
Normal Distribution
• Bounds x > 0 (Feller, p. 175)
x
-
(X) = (Y) dY
3
1 1 Ψ(x)(x) 1 (x)
x x x
x [0,1]
x 1 xx (1) (x) - (0) =
22πe 2π
Sums of Independently Distributed Variables
• Let Sn be the sum of n independently distributed variables A1, …, An
• Each with mean and variance
• So Sn has mean and variance 2
n
2
n
Strong Law of Large Numbers: Limiting to Normal Distribution• The probability density function of
to normal distribution (x)
• Hence
Prob
nn
(S - )T = as nlimits
n( S - x) (x) as n
Strong Law of Large Numbers (cont’d)
• So
Prob
(since 1- (x) (x)/x)
n( S - x) 2(1 (x))
2 (x)/x
Moment Generating Functions and Chernoff Bounds
Advanced Material
Moments of Random Variable A (cont’d)
• n’th Moments of Random Variable A
• Moment generating function
n nA
-
A x f (x) dx
sxA A
-
sA
M (s) e f (x) dx
= E(e )
Moments of Random Variable A (cont’d)
• NoteS is a formal parameter
n
0
nAAn
d M (s)
dss
Moments of Discrete Random Variable A
• n’th moment
n nA
x=0
A x f (x)
Probability Generating Function of Discrete Random Variable A
2 " ' ' 2Avariance σ (1) (1) ( (1))A A AG G G
x AA A
x=0
G (z) = z f (x) E(z )
" 2A2nd derivative G (1) = A A
A1st derivative G ' (1) = A
• If A1, …, An independent with same distribution
1
1 1
1 2 n
n
B A
n n
B A B A
Then if B = A A ... A
f (x) = f (x)
M (s) = M (s) , G (z) = G (z)
1 iA Af (x) = f (x) for i = 1, ... n
Moments of AND of Independent Random Variables
Generating Function of Binomial Variable B with Parameters n,p
• Interesting fact
xn k k n-k
k=0
nG(z) = (1-p+pz) = z p (1-p)
k
1Prob(B= ) =
n
Generating Function of Geometric Variable with parameter p
k k
k=0
pG(Z) = Z (p(1-p) ) =
1-(1-p)Z
Chernoff Bound of Random Variable A
• Uses all moments• Uses moment generating function
By setting x = ’ (s) 1st derivative minimizes bounds
-sxA
γ(s) - sxA
γ(s) - s γ'(s)
Prob (A x) e M (s) for s 0
= e where γ(s) = ln (M (s))
e
Chernoff Bound of Discrete Random Variable A
• Choose z = z0 to minimize above bound
• Need Probability Generating function
-xAProb (A x) z G (z) for z 1
x AA A
x 0
G (z) = z f (x) = E(z )
Chernoff Bounds for Binomial B with parameters n,p
• Above mean x
n-x x
x xx-μ 1
-x - μ 2
Prob (B x)
n-
n-x x
1 e since 1 e
x x
e for x μe
Chernoff Bounds for Binomial B with parameters n,p (cont’d)
• Below mean x
n-x x
Prob (B x)
n-
n-x x
Anguin-Valiant’s Bounds for Binomial B with Parameters n,p
• Just above mean
• Just below mean
2 μ-ε
2
= np for 0 < < 1
Prob (B (1+ ) ) e
2 μ-ε
3
for 0 < < 1
Prob (B (1- ) ) e
Anguin-Valiant’s Bounds for Binomial B with Parameters n,p (cont’d)
Tails are bounded by Normal distributions
Binomial Variable B with Parameters p,N and Expectation = pN• By Chernoff
Bound for p < ½
• Raghavan-Spencer bound for any > 0
nN 2
NProb (B ) 2 p
2
μ
(1+ )Prob (B (1+ ) )
(1+ )
e
Probability Theory
Prepared by
John Reif, Ph.D.
Analysis of Algorithms