on the on/off problem - max planck societyaws/mknoetig_onoff_problem.pdf · analysis logic two step...
TRANSCRIPT
On the On/Off Problem summary of: [Knoetig2014]title image: [Abdo et al.2009]
Max L. Ahnen
Bayes Forum Max L. Ahnen 2014-12-19 1
Analysis logic
Figure: [Caldwell2012]
Inspiration:[Caldwell and Kröninger2006]
Here: closed form specialcase.
Bayes Forum Max L. Ahnen 2014-12-19 2
Analysis logic
Two step procedure:
• Find out significance of measurement
• Calculate signal credibility intervals if detection, or UL if not.
Bayes Forum Max L. Ahnen 2014-12-19 3
The On/Off Measurement
Figure: 9gag.com/gag/4792101
Bayes Forum Max L. Ahnen 2014-12-19 4
The On/Off Measurement
Figure: [Berge et al.2007]
Bayes Forum Max L. Ahnen 2014-12-19 5
The On/Off Measurement
Figure: [Berge et al.2007]
Event numbers in bins follow Poissondistribution
PP (N|λ) =e−λλN
N!(1)
Bayes Forum Max L. Ahnen 2014-12-19 6
The On/Off Measurement
Figure: [Berge et al.2007]
Null hypothesis H0 Likelihood (bg only)
P (Non, Noff|λbg, H0) =
PP (Non|αλbg) PP (Noff|λbg) (2)
With a signal in the On region — H1
P (Non, Noff|λs,λbg, H1) =
PP (Non|λs + αλbg) PP (Noff|λbg) (3)
Bayes Forum Max L. Ahnen 2014-12-19 7
Discovery or not?
Figure: [Abdo et al.2009]
Bayes Forum Max L. Ahnen 2014-12-19 8
My problem
High-energy astrophysics: few counts. Available methods:Bayesian Frequentist
• Tail-area probability based(reject hypothesis withoutalternative)[Gillessen and Harney2005]
• Subjective Bayesian,introducing anotherparameter[Gregory2005]
• most not suitable for lowcount numbers[Li and Ma1983]
• trouble at the border ofparameter space[Rolke et al.2005]
Bayes Forum Max L. Ahnen 2014-12-19 9
My problem
None cover the full problem:Significance of a measurement + Signal estimation
Bayes Forum Max L. Ahnen 2014-12-19 10
Objective Bayesian Analysis
proceeds by
• Modeling initial uncertainty using ’non informative’ Priors,usually improper
• Using Bayes theorem to find proper posterior probabilitydistributions, given data
Bayes Forum Max L. Ahnen 2014-12-19 11
Objective Bayesian Analysis
Problems:
• There is basic agreement on objective Bayesian estimation
• Unfortunately no agreement on objective Bayesian hypothesistesting!
Bayes Forum Max L. Ahnen 2014-12-19 12
Bayes Factors
Use Bayes rule for hypothesis testing
P (Hi|Non, Noff) =P (Non, Noff|Hi) P0 (Hi)
P (Non, Noff)(4)
+ law of total probability => the Bayes factor Bij:
P (Hi|Non, Noff)
P (Hj|Non, Noff)=
P (Non, Noff|Hi) P0 (Hi)
P (Non, Noff|Hj) P0 (Hj)
=
∫P(
Non, Noff|~λi , Hi
)P0
(~λi |Hi
)d~λi∫
P(
Non, Noff|~λj , Hj
)P0
(~λj |Hj
)d~λj
· P0 (Hi)
P0 (Hj)
= Bij · Pij (5)
Bayes Forum Max L. Ahnen 2014-12-19 13
Bayes Factors
Further assuming a complete set of N exclusive rival hypothesissuch that P(Hi ∧ Hj) = 0 for i 6= j one can express the posteriormodel probability with Eqn. 5 conveniently as
P (Hi|Non, Noff) =B0i
1 +∑N−1
j=1 Bj0Pj0(6)
..but in this case (On/Off problem, {H0, H1}) the difference betweenmaking decisions with Eqn. 5 and Eqn. 6 is small. One can alsoargue that there is always systematic errors and therefore no“complete set”
Bayes Forum Max L. Ahnen 2014-12-19 14
Jeffreys’s RuleHarold Jeffreys revived the objective Bayesian view with his work,when people turned to Fisher tests, p-values, ... His suggestion wasto use a prior that yielded the same answer, no matter whatparametrization:
P0
(~λi |Hi
)∝
√det
[I(~λi |Hi
)], (7)
Ikl
(~λi |Hi
)= −E
∂2 ln L(
Non, Noff|~λi , Hi
)∂λk∂λl
, (8)
where Ikl denotes the Fisher information matrix, L the likelihoodfunction (either Eqn. 2 or 3), E the expectation value with respect tothe model with index i and ~λi
Bayes Forum Max L. Ahnen 2014-12-19 15
Jeffreys’s Rule — Benefits
• Almost always defined
• Transforms properly to give results indep. of parametrization
• Almost always gives proper posterior
• Often arises from more general treatments as limiting case
Bayes Forum Max L. Ahnen 2014-12-19 16
Jeffreys’s Rule — Weaknesses
• Violates the (controversial) likelihood principle that all evidenceis embedded in the likelihood. -> but all objective Bayesianmethods do
• Fisher information matrix must exist -> it does in this case[Knoetig2014]
• Can fail badly ( posterior does not converge to “true" result ) inhigher-dimensional problems -> has to be checked!
Bayes Forum Max L. Ahnen 2014-12-19 17
objective Bayes Factors?
Remember:
Bij =
∫P(
Non, Noff|~λi , Hi
)P0
(~λi |Hi
)d~λi∫
P(
Non, Noff|~λj , Hj
)P0
(~λj |Hj
)d~λj
(9)
but P0
(~λi |Hi
)from Jeffrey’s rule is only defined up to constant ci!
Bayes Forum Max L. Ahnen 2014-12-19 18
objective Bayes Factors?
Suggestion: Variation of[Spiegelhalter and Smith1981],[Ghosh and Samanta2002]
Imagine dataset with smallest physical sample size —Non = Noff = 0. Then
B01 = 1 + ε; ‖ε‖ = rather small. (10)
(evidence that exists must be weak, because of 0 counts) ->calibrate the undefined ratio of constants ci
cj!
Bayes Forum Max L. Ahnen 2014-12-19 19
Objective Bayesian hypothesis testing —Summary assumptions
• Bayesian hypothesis testing in the On/Off problem with Bayesfactors
• Jeffreys’s rule prior for each model
• Calibrate the undefined constants by”when you see nothing you do not learn (much)“
Bayes Forum Max L. Ahnen 2014-12-19 20
Objective Bayesian hypothesis testing — Results
B01 =c0
c1· γδ
(11)
where
γ := (1 + 2Noff)α12+Non+Noff (12)
·Γ(
12
+ Non + Noff
)δ := 2 (1 + α)Non+Noff Γ (1 + Non + Noff) (13)
·2 F1
(12
+ Noff, 1 + Non + Noff;32
+ Noff;−1α
)c0
c1=
2 arctan(
1√α
)√π
(14)
[Knoetig2014]Bayes Forum Max L. Ahnen 2014-12-19 21
Objective Bayesian hypothesis testing — Results
Claim detection when Bayes factor B01 is low. If the counted eventslead to detection -> infer signal, assuming H1!
Bayes Forum Max L. Ahnen 2014-12-19 22
Objective Bayesian estimation
This is less controversial as the undefined constants cancel out.Bayes rule gives:
P (λs,λbg|Non, Noff, H1) = (15)P (Non, Noff|λs,λbg, H1) P0 (λs,λbg|H1)∫∞
0∫∞
0 P (Non, Noff|λs,λbg, H1) P0 (λs,λbg|H1) dλsdλbg
+ marginalization + Jeffreys’s prior
Bayes Forum Max L. Ahnen 2014-12-19 23
Objective Bayesian estimation — Results
P (λs|Non, Noff, H1) = PP (Non + Noff|λs) (16)
·U[
12 + Noff, 1 + Noff + Non,
(1 + 1
α
)λs
]2F̃1
(12 + Noff, 1 + Noff + Non; 3
2 + Noff;− 1α
)This can be used for quoting credibility intervals or, if the detectionthreshold was not reached, upper limits. [Caldwell and Kröninger2006],[Knoetig2014]
Bayes Forum Max L. Ahnen 2014-12-19 24
First example: GRB080825C
Figure: [Abdo et al.2009]
Non = 15, Noff = 19,α = 33/525 => B01 = 9.66×10−10 Detection!
Bayes Forum Max L. Ahnen 2014-12-19 25
First example: GRB080825C
aL
0 5 10 15 20 25 300.00
0.02
0.04
0.06
0.08
0.10
Λs
PHΛ
sÈN
on,N
off,H
1L
GRB 080825C
Λs=13.28-3.49+4.16
Figure: [Knoetig2014]
published value: λs = 13.7 [Abdo et al.2009]
Bayes Forum Max L. Ahnen 2014-12-19 26
Second example: GRB080330
Non = 0, Noff = 15,α = 0.123 => B01 = 2.29 Upper Limit!
bL
0 5 10 15 20 25 300.0
0.2
0.4
0.6
0.8
1.0
1.2
Λs
PHΛ
sÈN
on,N
off,H
1L
GRB 080330
Λs<4.10 È99%CL
Figure: [Knoetig2014]
published value: λs < 2.40 [Acciari et al.2011]
Bayes Forum Max L. Ahnen 2014-12-19 27
Validation
In order to compare to [Li and Ma1983] frequentist result use”Bayesian“ z-value [Gillessen and Harney2005]
Sb =√
2 erf−1 [1− B01] . (17)
B01 = 5.7 · 10−7 would correspond to ”5 sigma“
Bayes Forum Max L. Ahnen 2014-12-19 28
Validation, hypothesis test
BJeff.
BGreg.
0 5 10 1510
-9
10-7
10-5
0.001
0.1
10
Non
B0
1ÈN
off
=5
Α=0.2
SbHBJeff.L
SbHBGreg.L
SLiMa
0 50 100 1500
2
4
6
8
10
NonS
HB 01
ÈN off
=3
00
L
Α=0.2
Few counts: compared to [Gregory2005], mostly within one count.Many counts: close to frequentist result [Li and Ma1983]
Bayes Forum Max L. Ahnen 2014-12-19 29
Validation, signal estimation
PJeff.
PGreg.
0 5 10 15 200.0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
Λs
PHΛ
sÈN
on,N
off=
5,H
1L
Non=1
Non=7
Non=13
Α=0.2 aL
PJeff.
PGreg.
Norm.
0 50 100 1500.00
0.02
0.04
0.06
0.08
0.10
Λs
PHΛ
sÈN
on,N
off=
80
,H1L
Non=20
Non=50 Non=100 Non=150
Α=0.2 bL
Figure: [Knoetig2014]
Similar to [Gregory2005], both converge to classical result in the manycounts case.
Bayes Forum Max L. Ahnen 2014-12-19 30
Conclusion
Claiming detections, setting credibility intervals, or setting upperlimits can be unified over the whole On/Off problem parameterrange in one consistent objective Bayesian method.
Sample implementation (Python, Mathematica):https://polybox.ethz.ch/public.php?service=files&t=29958626f8bd78e4e8fbdbc7f7955c49
Bayes Forum Max L. Ahnen 2014-12-19 31
Bibliography I
Abdo, A. et al.: 2009,Astrophys. J. 707(1), 580
Acciari, V. A. et al.: 2011,Astrophys. J. 743, 62
Berge, D., Funk, S., and Hinton, J.: 2007,Astron. Astrophys. 466, 1219
Caldwell, A.: 2012,in Bayes Forum Munich
Caldwell, A. and Kröninger, K.: 2006,Phys. Rev. D 74(9), 092003
Ghosh, J. K. and Samanta, T.: 2002,J. Statist. Plann. Inference 103, 205
Gillessen, S. and Harney, H. L.: 2005,Astron. Astrophys. 430, 355
Gregory, P.: 2005,Bayesian logical data analysis for the physical sciences,Cambridge University Press
Bayes Forum Max L. Ahnen 2014-12-19 32
Bibliography II
Knoetig, M. L.: 2014,Astrophys. J. 790(2), 106
Li, T. P. and Ma, Y. Q.: 1983,Astrophys. J. 272, 317
Rolke, W. A., López, A. M., and Conrad, J.: 2005,Nucl. Instrum. Methods A 551(2), 493
Spiegelhalter, D. J. and Smith, A. F. M.: 1981,J. R. Statist. Soc. B 44(3), 377
Bayes Forum Max L. Ahnen 2014-12-19 33