arxiv:1801.08300v1 [stat.ap] 25 jan 2018 · pdf filearxiv:1801.08300v1 [stat.ap] 25 jan 2018...

27
arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam Bandyopadhyay 1 and Soumita Modak 2 Department of Statistics, Calcutta University, India email: [email protected] 1 , [email protected] 2 Abstract We consider the problem of estimation of a bivariate density func- tion with support ℜ× [0, ), where a classical bivariate kernel estima- tor causes boundary bias due to the non-negative variable. To over- come this problem, we propose four kernel density estimators whose performances are compared in terms of the mean integrated squared error. Simulation study shows that the estimator based on our pro- posed normal-gamma (NG) kernel performs best, whose applicability is demonstrated using two astronomical data sets. keywords: bivariate density estimation, product of classical and gamma ker- nels, NG kernels, astronomical application. 1 Introduction Let X i = X(X i1 ,...,X id ), i =1,...,n be a given sample of n independent realizations of a d-dimensional random variable X(x 1 ,...,x d ) from a popula- tion characterized by a continuous d-variate probability density f (x 1 ,...,x d )= f , which is unknown. In this paper, we consider the problem of estimating f by kernel density estimator, in which f with support d can be estimated by a dvariate classical kernel estimator (see, for example, Silverman, 1986; Wand and Jones, 1995), but the same causes boundary bias in case of bounded or semi-bounded support. To solve this problem in univariate set-up, the as- sociated kernels are proposed (see, for example, Chen, 1999, 2000; Libengu´ e, 2013; Igarashi and Kakizawa, 2014), whereas, in multivariate set-up, the boundary bias can be omitted by using the product of d univariate associ- ated kernels (see, for example, Bouerzmarni and Rombouts, 2010). In the context of multivariate associated kernel, Kokonendjia and Som´ e (2015) pro- pose a bivariate beta kernel with a correlation structure. Now, when the support is a cartesian product of and bounded or semi-bounded sets, then 1

Upload: phamminh

Post on 05-Feb-2018

216 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

arX

iv:1

801.

0830

0v1

[st

at.A

P] 2

5 Ja

n 20

18

Bivariate density estimation using normal-gamma kernel with applicationto astronomy

Uttam Bandyopadhyay1 and Soumita Modak2

Department of Statistics, Calcutta University, India

email: [email protected], [email protected]

Abstract

We consider the problem of estimation of a bivariate density func-tion with support ℜ× [0,∞), where a classical bivariate kernel estima-tor causes boundary bias due to the non-negative variable. To over-come this problem, we propose four kernel density estimators whoseperformances are compared in terms of the mean integrated squarederror. Simulation study shows that the estimator based on our pro-posed normal-gamma (NG) kernel performs best, whose applicabilityis demonstrated using two astronomical data sets.

keywords: bivariate density estimation, product of classical and gamma ker-nels, NG kernels, astronomical application.

1 Introduction

Let Xi = X(Xi1, . . . , Xid), i = 1, . . . , n be a given sample of n independentrealizations of a d-dimensional random variable X(x1, . . . , xd) from a popula-tion characterized by a continuous d-variate probability density f(x1, . . . , xd) =f , which is unknown. In this paper, we consider the problem of estimating fby kernel density estimator, in which f with support ℜd can be estimated by ad−variate classical kernel estimator (see, for example, Silverman, 1986; Wandand Jones, 1995), but the same causes boundary bias in case of bounded orsemi-bounded support. To solve this problem in univariate set-up, the as-sociated kernels are proposed (see, for example, Chen, 1999, 2000; Libengue,2013; Igarashi and Kakizawa, 2014), whereas, in multivariate set-up, theboundary bias can be omitted by using the product of d univariate associ-ated kernels (see, for example, Bouerzmarni and Rombouts, 2010). In thecontext of multivariate associated kernel, Kokonendjia and Some (2015) pro-pose a bivariate beta kernel with a correlation structure. Now, when thesupport is a cartesian product of ℜ and bounded or semi-bounded sets, then

1

Page 2: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

f can be estimated using the product of univariate classical kernels and uni-variate associated kernels. Here, in particular, we discuss the estimation ofa bivariate density function with support ℜ× [0,∞).

In this regard, we study the properties of the estimators, based on theproduct of a univariate classical kernel and a univariate gamma kernel, inSection 2. Section 3 proposes bivariate density estimators based on NG ker-nels. Section 4 discusses the relative performances of the estimators throughsimulation study. In Section 5, two astronomical data sets have been success-fully modelled using NG kernel. Conclusion and future study are discussedin Section 6.

2 Product of classical and gamma kernels

For the sake of simplicity, we show the computations for d = 2, which caneasily be extended to d > 2. Consider a bivariate continuous density func-tion f(x1, x2) = f(x) satisfying (i) x ∈ ℜ × [0,∞), (ii) f is twice con-tinuously partially differentiable on ℜ × [0,∞), (iii)

{∂f/∂x1}2dx < ∞,∫

{∂f/∂x2}2dx < ∞,∫

{x2∂2f/∂x2

2}2dx < ∞. To estimate f , we considerthe estimator as follows

f1(x) =1

nh

n∑

i=1

K

(

x1 −Xi1

h

)

Kx2/b2+1,b2(Xi2), (1)

whereK is the classical kernel satisfying (a)K(−t) = K(t), (b)∫∞−∞K(t)dt =

1, (c)∫∞−∞ tK(t)dt = 0, (d)

∫∞−∞ t2K(t)dt = k2 6= 0, with bandwidth h(> 0)

satisfying h → 0 and nhb → ∞ as n → ∞. Kx2/b2+1,b2 is the first class ofgamma kernels (Chen, 2000) defined as

Kx2/b2+1,b2(t) =tx2/b2e−t/b2

b2(x2/b2+1)Γ(x2/b2 + 1),

where Γ is the gamma function, with bandwidth b2(> 0) satisfying b → 0 andnhb → ∞ as n → ∞. Bandwidths of the kernels are so chosen as to make theamount of smoothing in the same scale for both the kernels. In general, anyassociated kernel can be used here. However, we choose the gamma kerneldue to its flexible properties (see, for example, Chen, 2000).

2

Page 3: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

Now, using y1 = x1 − ht, we get

E{f1(x)} =

∫ ∞

−∞

∫ ∞

0

[

h−1K

(

x1 − y1h

)]

[Kx2/b2+1,b2(y2)]f(y1, y2)dy1dy2

=

∫ ∞

−∞

∫ ∞

0

K(t)Kx2/b2+1,b2(y2)f(x1 − ht, y2)dtdy2

=

∫ ∞

−∞K(t)Eξx2

[f(x1 − ht, y2)]dt,

(2)

where ξx2 ∼ gamma(x2/b2 + 1, b2).

Again, a Taylor series expansion gives

Eξx2[f(x1 − ht, ξx2)] = f(x1, x2) + Eξx2

(x1 − ht− x1)f1(x)

+ Eξx2(ξx2 − x2)f

2(x) +1

2Eξx2

{(x1 − ht− x1)2}f 11(x)

+1

2Eξx2

{(ξx2 − x2)2}f 22(x) +

1

2Eξx2

{(−ht)(ξx2 − x2)}f 12(x)

+1

2Eξx2

{(−ht)(ξx2 − x2)}f 21(x) + o(h2 + b2)

= f(x)− htf 1(x) + Eξx2(ξx2 − x2 + b2)f 2(x) + b2f 2(x)

+1

2h2t2f 11(x) +

1

2V ar(ξx2)f

22(x) + o(h2 + b2),

where f j = ∂f/∂xj and f jj′ = ∂2f/∂xj′∂xj , j, j′ = 1, 2. Then, substituting

the last expression in (2), we get

E{f1(x)} = f(x) +1

2k2h

2f 11(x) + b2{f 2(x) +1

2x2f

22(x)}+ o(h2 + b2),

which implies

Bias{f1(x)} =1

2k2h

2f 11(x)+b2{f 2(x)+1

2x2f

22(x)}+o(h2+b2) = O(h2+b2).

(3)This shows estimator f1 is free of boundary bias and the integrated squared

3

Page 4: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

bias is given by

{Bias(f1(x))}2dx =1

4k22h

4

{f 11(x)}2dx+ b4∫

{f 2(x) +1

2x2f

22(x)}2dx

+ k2h2b2

f 11(x){f 2(x) +1

2x2f

22(x)}dx+ o(h4 + b4).

(4)

Now,

V ar{f1(x)} = n−1V ar{K(Xi1)Kx2/b2+1,b2(Xi2)}= n−1E{K(Xi1)Kx2/b2+1,b2(Xi2)}2 +O(n−1)

= n−1h−1

∫ ∞

−∞

∫ ∞

0

K2(t)K2x2/b2+1,b2(y2)f(x1 − ht, y2)dtdy2 +O(n−1)

and∫ ∞

0

K2x2/b2+1,b2(y2)f(x1 − ht, y2)dy2 = Bb(x2)Eηx2

{f(x1 − ht, ηx2)},

where ηx2 ∼ gamma(2x2/b2 + 1, b2) and Bb(x2) =

b−2Γ(2x2/b2+1)

22x2/b2+1Γ2(x2/b2+1)

. Using

Lemma 3 of Brown and Chen (1999), we get

Bb(x2) ∼

1

2√πb−1x−1/2 if x2

b2→ ∞,

Γ(2κ+ 1)

21+2κΓ2(κ+ 1)b−2 if x2

b2→ κ (a non-negative constant),

which implies

V ar{f1(x)} ∼

1

2√πn−1h−1b−1k3x

−1/22 f(x) if x2

b2→ ∞,

Γ(2κ+ 1)

21+2κΓ2(κ+ 1)n−1h−1b−2k3f(x) if x2

b2→ κ,

(5)

where k3 =∫∞−∞K2(t)dt. Expressions (3) and (5) imply that for h → 0, b →

0, and nhb → ∞ as n → ∞, the nonparametric density estimator f1(x)

4

Page 5: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

is consistent for the true density function f(x) at each point x. Now, forδ = b2−ǫ with 1 < ǫ < 2,

∫ ∞

−∞

∫ ∞

0

V ar{f1(x)}dx1dx2 =

∫ ∞

−∞

∫ δ

0

V ar{f(x)}dx1dx2 +

∫ ∞

−∞

∫ ∞

δ

V ar{f(x)}dx1dx2

=1

2√πn−1h−1b−1k3

∫ ∞

−∞

∫ ∞

δ

x−1/22 f(x)dx1dx2 +O(n−1h−1b−ǫ)

=1

2√πn−1h−1b−1k3

∫ ∞

−∞

∫ ∞

0

x−1/22 f(x)dx1dx2 + o(n−1h−1b−1),

(6)

provided∫∞−∞

∫∞0

x−1/22 f(x)dx1dx2 is finite.

Combining (4) and (6), the mean integrated squared error (MISE) isobtained as

MISE{f1(x)} =

{Bias(f1(x))}2dx+

V ar{f1(x)}dx

=1

4k22h

4

{f 11(x)}2dx+ b4∫

{f 2(x) +1

2x2f

22(x)}2dx (7)

+ k2h2b2

f 11(x){f 2(x) +1

2x2f

22(x)}dx+1

2√πn−1h−1b−1k3

x−1/22 f(x)dx

+ o(n−1h−1b−1 + h4 + b4)

(8)

and the leading terms in (7) give the expression of the corresponding asymp-totic mean integrated squared error (AMISE). AMISE is optimal for hopt =h0n

−1/6 and bopt = b0n−1/6, where h0 and b0 are constants, i.e. the optimal

bandwidths for kernel density estimator f1 are O(n−1/6) and O(n−1/3) whichgive

AMISE{f1(x)}opt =[

1

4k22h

40

{f 11(x)}2dx+ b40

{f 2(x) +1

2x2f

22(x)}2dx

+ k2h20b

20

f 11(x){f 2(x) +1

2x2f

22(x)}dx

+1

2√πn−1h−1

0 b−10 k3

x−1/22 f(x)dx

]

n−2/3.

5

Page 6: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

For h = b = h0, the optimal h0 is given by

[ 12√πk3

x−1/22 f(x)dx

2∫

{

12k2f 11(x) + f 2(x) + 1

2x2f 22(x)

}2

dx

]1/6

n−1/6,

which gives

AMISEopt =3

22/3

[∫

{

1

2k2f

11(x) + f 2(x) +1

2x2f

22(x)

}2

dx

]1/3[1

2√πk3

x−1/22 f(x)dx

]2/3

n−2/3.

Another estimator of f is considered as

f2(x) =1

nh

n∑

i=1

K(x1 −Xi1

h)Kρb2 (x2),b2(Xi2), (9)

where Kρb2 (x),b2 is the second class of gamma kernels (Chen, 2000) defined as

ρb2(x2) =

x2/b2 if x2 ≥ 2b2,

1

4(x2/b

2)2 + 1 if x2 ∈ [0, 2b2).

(10)

So,

Bias{f2(x)} =

1

2k2h

2f 11(x) +1

2b2x2f

22(x) + o(h2 + b2) if x2 ≥ 2b2,

1

2k2h

2f 11(x) + b2{ρb2(x2)− x2/b2}f 2(x) + o(h2 + b2) if x2 ∈ [0, 2b2),

(11)

which shows the boundary unbiasedness of estimator f2, and

V ar{f2(x)} ∼

1

2√πn−1h−1b−1k3x

−1/22 f(x) if x2

b2→ ∞,

Γ(κ2/2 + 1)

21+κ2/2Γ2(κ2/4 + 1)n−1h−1b−2k3f(x) if x2

b2→ κ (a non-negative constant),

6

Page 7: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

imply

MISE{f2(x)} =1

4k22h

4

{f 11(x)}2dx+1

4b4∫

{x2f22(x)}2dx

+1

2k2h

2b2∫

f 11(x){x2f22(x)}dx+ o(h4 + b4)

+1

2√πn−1h−1b−1k3

x−1/22 f(x)dx+ o(n−1h−1b−1).

For h = b = h0, the optimal h0 is given by

[ 12√πk3

x−1/22 f(x)dx

12

{

k2f 11(x) + x2f 22(x)

}2

dx

]1/6

n−1/6,

which corresponds to

AMISEopt =3

24/3

[∫

{

k2f11(x) + x2f

22(x)

}2

dx

]1/3[1

2√πk3

x−1/22 f(x)dx

]2/3

n−2/3.

Observe that AMISEopt{f1(x)} ≥ AMISEopt{f2(x)} implies f2 is expected

to have a better asymptotic performance than f1.

3 Bivariate density estimation using normal-gamma (NG)kernel

Consider the density function KΘ of a bivariate normal-gamma distributiondefined as (Bernardo and Smith, 2000):

KΘ(t1, t2) = NG(t1, t2|Θ = (µ, λ, α, β)) = N(t1|µ, (λt2)−1)Ga(t2|α, β)

=

λt22π

e−(t1−µ)2λt2

2 × βα

Γ(α)tα−12 e−βt2

(12)

with µ ∈ ℜ, λ > 0, α > 0, β > 0, where N and Ga, respectively, standfor normal and gamma distributions. Using (12), we define the followingestimator of f as

f3(x) =1

n

n∑

i=1

KΘ1(Xi), (13)

7

Page 8: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

where KΘ1 is the NG kernel with Θ = Θ1 = (x1, 1/(|x1|b1 + b21)(x2 +b2), x2/b2 + 2, 1/b2) such that the bandwidths b1, b2 → 0 and nb1b2 → ∞as n → ∞.

Then,

E{f3(x)} =

∫ ∞

−∞

∫ ∞

0

KΘ1(y1, y2)dy1dy2 = E{f(ξx)},

where ξx = (ξx1, ξx2) ∼ NG(ξx1, ξx2|x1, 1/(|x1|b1+b21)(x2+b2), x2/b2+2, 1/b2),which implies E(ξx1) = x1, E(ξx1) = x2 + 2b2, V ar(ξx1) = |x1|b1 + b21 andV ar(ξx2) = x2b2 + 2b22. By Taylor series expansion we get (see, Appendix(A.1)),

E{f(ξx)} = f(x) + E(ξx1 − x1)f1(x) + E(ξx2 − x2)f

2(x)

+1

2E{(ξx1 − x1)(ξx2 − x2)}{f 12(x) + f 21(x)}

+1

2E(ξx1 − x1)

2f 11(x) +1

2E(ξx2 − x2)

2f 22(x) + o(b1 + b2)

= f(x) + b1{1

2|x1|f 11(x)}+ b2{2f 2(x) +

1

2x2f22(x)}+ o(b1 + b2).

Therefore,

Bias{f3(x)} = b1{1

2|x1|f 11(x)}+b2{2f 2(x)+

1

2x2f

22(x)}+o(b1+b2) = O(b1+b2),

(14)which shows estimator f3 is free of boundary bias, and the integrated squaredbias is given by

{Bias(f3(x))}2dx =1

4b21

{x1f11(x)}2dx

+ b22

{2f 2(x) +1

2x2f

22(x)}2dx

+ b1b2

{|x1|f 11(x)}{2f 2(x) +1

2x2f22(x)}dx+ o(b21 + b22).

(15)

8

Page 9: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

The variance of f3(x) is

V ar{f3(x)} ≈

1

4π√

en−1

b−1/21 b

−1/22 |x1|−1/2

x−1/22 f(x) if |x1|/b1 → ∞, x2/b2 → ∞,

Γ(2κ2 + 7/2)√

π22κ2+9/2√

(κ1 + 1)(κ2 + 1)Γ2(κ2 + 2)n−1

b−11 b

−12 f(x) if |x1|/b1 → κ1, x2/b2 → κ2,

Γ(2κ2 + 7/2)√

π22κ2+9/2√

(κ2 + 1)Γ2(κ2 + 2)n−1

b−1/21 b

−12 |x1|−1/2

f(x) if |x1|/b1 → ∞, x2/b2 → κ2,

1

4π√

e√

κ1 + 1n−1

b−11 b

−1/22 x

−1/22 f(x) if |x1|/b1 → κ1, x2/b2 → ∞,

for non-negative constants κ1, κ2 (see, Appendix (A.2)), and∫

V ar{f3(x)}dx =1

4π√en−1b

−1/21 b

−1/22

|x1|−1/2x−1/22 f(x)dx+o(n−1b

−1/21 b

−1/22 ),

(16)

assuming∫

|x1|−1/2x−1/22 f(x)dx < ∞ (see, Appendix (A.3)).

Now, combining (15) and (16), we get the expression of the MISE asfollows

MISE{f3(x)} =1

4b21

{|x1|f 11(x)}2dx+ b22

{2f 2(x) +1

2x2f

22(x)}2dx

+ b1b2

|x1|f 11(x){2f 2(x) +1

2x2f22(x)}dx

+1

4π√en−1b

−1/21 b

−1/22

|x1|−1/2x−1/22 f(x)dx+ o(b21 + b22 + n−1b

−1/21 b

−1/22 ).

For b1 = b2 = b0, the optimal b0 is given by

[ 14π

√e

|x1|−1/2x−1/22 f(x)dx

2∫

{12|x1|f 11(x) + 2f 2(x) + 1

2x2f 22(x)}2dx

]1/3

n−1/3,

for which the optimal AMISE is obtained as

AMISEopt =3

22/3

[∫

{

1

2|x1|f 11(x) + 2f 2(x) +

1

2x2f

22(x)

}2

dx

]1/3

[

1

4π√e

|x1|−1/2x−1/22 f(x)dx

]2/3

n−2/3.

We propose another estimator of f using (12) as follows

f4(x) =1

n

n∑

i=1

KΘ2(Xi), (17)

9

Page 10: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

where KΘ2 is the NG kernel with Θ = Θ2 = (x1, 1/(|x1|b1 + b21)(x2 +b2), αb2(x2), 1/b2) and

αb2(x2) =

{

x2/b2 if x2 ≥ 3b2,19(x2/b2)

2 + 2 if x2 ∈ [0, 3b2),

such that b1, b2 → 0, nb1b2 → ∞ as n → ∞. Then, in the similar fashion asin estimator f3, we obtain

Bias{f4(x)} =

{

12{b1|x1|f 11(x) + b2x2f

22(x)}+ o(b1 + b2) if x2 ≥ 3b2,12b1|x1|f 11(x) + b2{αb2(x2)− x2

b2}f 2(x) + o(b1 + b2) if x2 ∈ [0, 3b2),

establishing its boundary unbiasedness, and

V ar{f4(x)} ≈

14π

en−1b

−1/21 b

−1/22 |x1|−1/2x

−1/22 f(x) if |x1|/b1 → ∞, x2/b2 → ∞,

Γ(2/9κ22+7/2)

π22/9κ2

2+9/2√(κ1+1)(κ2+1)Γ2(1/9κ2

2+2)

n−1b−11 b−1

2 f(x) if |x1|/b1 → κ1, x2/b2 → κ2,

Γ(2/9κ22+7/2)

π22/9κ2

2+9/2√

(κ2+1)Γ2(1/9κ22+2)

n−1b−1/21 b−1

2 |x1|−1/2f(x) if |x1|/b1 → ∞, x2/b2 → κ2,

14π

e√

κ1+1n−1b−1

1 b−1/22 x

−1/22 f(x) if |x1|/b1 → κ1, x2/b2 → ∞,

for non-negative constants κ1, κ2. Hence,

MISE{f4(x)} =1

4b21

{x1f11(x)}2dx+

1

4b22

{x2f22(x)}2dx

+1

2b1b2

|x1|x2f11(x)f 22(x)dx+ o(b21 + b22)

+1

4π√en−1b

−1/21 b

−1/22

|x1|−1/2x−1/22 f(x)dx+ o(n−1b

−1/21 b

−1/22 ).

For b1 = b2 = b0, the optimal b0 is given by

[ 14π

√e

|x1|−1/2x−1/22 f(x)dx

12

{|x1|f 11(x) + x2f 22(x)}2dx

]1/3

n−1/3

and, therefore, the optimal AMISE is obtained as

AMISEopt =3

24/3

[∫

{

|x1|f 11(x) + x2f22(x)

}2

dx

]1/3

[

1

4π√e

|x1|−1/2x−1/22 f(x)dx

]2/3

n−2/3.

10

Page 11: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

4 Simulation study

In simulation study, we consider the sample sizes (n) 100 and 200, with 1, 000replications, for the following target distributions of different shapes, whereC,HN,N,Ga,Exp, LG, TN correspond to Cauchy, Half-Normal, Normal,Gamma, Exponential, Logistic and Truncated Normal (truncated at zero)distributions respectively, with l, s, sh, rt respectively stand for the location,scale, shape and rate of the corresponding distribution.1) Product of C(l = 0, s = 1) and HN(s = 2), say, f1 (Fig. 1).2) Product of 0.2 ×N(l = −3, s = 1) + 0.6× C(l = 0, s = 1) + 0.2 ×N(l =3, s = 1) and Ga(sh = 3, s = 1), say f2 (Fig. 2).3) Product of 0.4 ×N(l = −3, s = 3) + 0.2× C(l = 0, s = 1) + 0.4 ×N(l =3, s = 3) and Exp(rt = 1), say f3 (Fig. 3).4) Product of 0.5 × LG(l = −1, s = 0.5) + 0.5 × LG(l = 1.5, s = 0.7) and0.6× TN(l = 0, s = 0.5) + 0.4× TN(l = 1.3, s = 0.25), say f4 (Fig. 4).

For comparison purpose among different kernel density estimators, thebandwidths are considered within a range such that h = b =

(b1) =√

(b2) = h1 = h2; where (h1, h2) is the bandwidth vector correspondingto the product of two classical Gaussian (0, 1) kernels, which is the 5–thestimator considered and denoted by f5. For each replication with each es-timator, the bandwidths are chosen by minimizing the integrated squarederror; where the range of integration for each target distribution is specifiedsuch that the value of the bivariate density function is ignorable beyond theconsidered range. The arithmetic mean and the standard deviation of theintegrated squared error (ISE) and the bandwidth vector (BW) are reportedin Table 1.

From the table it is observed, as expected, that with increasing samplesize, the mean of ISE and BW of all estimators are decreasing for all targetdistributions considered. As we can see, f4 performs best in all situations andf3 does worst among the first four estimators. f2 performs second best for thefirst three target distributions, whereas with distribution f4, f2 outperformsf1 for n = 100, but is dominated by f1 for n = 200. This is because f1asymptotically performs better than f2 for distribution f4, and the requiredconvergence rate is reached at n = 200. The first four estimators performsignificantly better than f5 reflecting the boundary bias problem of classicalkernels, except for distribution f2, where f1 and f3 have mean ISE very closeto that of f5. In fact, f3 is outperformed by f5, because distribution f2 haslow density close to zero and hence the boundary bias of f5 comes out to be

11

Page 12: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

close to the bias of f1 and f3.

5 Application

Applicability of density estimator f4 is demonstrated using two sets of astro-nomical data. For given data set X1 = (X11, X12),X2 = (X21, X22), ...,Xn =(Xn1, Xn2) of size n, the bandwidths are selected by minimizing

f 2(x)dx− 2

n2

i

f−i(Xi),

where f−i(Xi) is an estimate of the target distribution f at the point Xi,based on the given data set excluding the observation Xi.

Our first data set consists of information on gamma-ray bursts (GRBs),the brightest explosion in the universe to date since the Big Bang. Wecollect data from the fourth BATSE Gamma-Ray Burst Catalog (revised)(Paciesas et al., 1999) for the variable T90, a measure of burst duration, isthe time in second within which 90% of the flux arrive, and the variableP256, peak flux measured in count per square centimeter per second on the256 millisecond time scale, on 1496 long-duration GRBs (i.e. GRBs withT90 > 2 seconds). As usually done in astronomical studies to analyse thedata set with huge variation in the values of the variables, we also considerthe logarithm transformation of the variables, and study the relation betweenlog10(T90) and log10(P256) (see, Fig. 5). Fig. 6 shows their bivariate density,estimated by f4, is a multimodal distribution.

Next data set contains information on 1131 early-type galaxies (ETGs)at redshift (z) ranging from 0.06 to 2.67, where the data is collected fromForster et al., 2009; Saracco et al., 2009; Taylor et al., 2010; Damjanov etal., 2011; Papovich et al., 2012; Chen et al., 2013; McLure et al., 2013, andSzomoru et al., 2013. Fig. 7 shows the plot of log10Re (Re: effective radius inkiloparsec) versus z for the galaxies, and the corresponding bivariate densityestimated by f4 is shown in Fig. 8. Fig. 8 indicates two significantly differentpatterns in the density function, where there is a high density function forthe nearby ETGs, i.e. galaxies with redshift close to zero, and a low densityfunction for the ETGs in higher redshift region.

12

Page 13: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

6 Discussion

We consider the estimation of a bivariate density function, with supportℜ × [0,∞), using the product of classical and associated kernels. We alsosuggest two new nonparametric density estimators based on normal-gammakernels. The proposed estimators are proved to be free from boundary bias,whereas simulation study shows the estimator f4 performs best among itspossible competitors. Practical implementation of estimator f4 includes twoimportant fields of astronomical study. In simulation, the bandwidth vectoris selected by optimization method such that the integrated squared erroris minimized. With increasing dimension, even if for the bivariate case, thisselection procedure can be time consuming. Hence a fast algorithm is neededand bandwidth selection based on the specific kernel estimator is desired.In future study, full bandwidth matrix instead of diagonal matrix can beused, e.g. Kokonendjia and Some (2015) propose a bivariate beta kernelwith a correlation structure. Bivariate analysis can be tried to extend tomultivariate case other than the product kernel estimators like f1 and f2.

13

Page 14: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

Table 1: Simulation study of the kernel density estimators

Estimator ISE (mean) ISE (sd) 1st BW 1st BW 2nd BW 2nd BW

×106 ×106 (mean) (sd) (mean) (sd)

n= 100, distributionf1f1 2, 984 1, 360 0.481 0.072 0.358 0.095

f2 2, 644 1, 435 0.520 0.090 0.576 0.158

f3 3, 058 1, 414 0.302 0.072 0.208 0.060

f4 2, 384 1, 282 0.230 0.070 0.390 0.113

f5 4, 476 1, 620 0.516 0.073 0.485 0.110n= 200, distributionf1

f1 2, 024 893 0.418 0.053 0.283 0.071

f2 1, 720 880 0.447 0.068 0.483 0.142

f3 2, 108 954 0.247 0.050 0.169 0.042

f4 1, 608 822 0.184 0.052 0.341 0.097

f5 3, 278 1, 056 0.457 0.060 0.392 0.084n= 100, distributionf2

f1 1, 700 475 1.105 0.463 0.211 0.072

f2 1, 607 460 1.035 0.425 0.249 0.074

f3 1, 720 566 0.950 0.444 0.116 0.052

f4 1, 480 469 0.719 0.348 0.176 0.044

f5 1, 716 464 1.015 0.432 0.722 0.148n= 200, distributionf2

f1 1, 216 341 0.729 0.272 0.185 0.049

f2 1, 137 334 0.683 0.227 0.224 0.050

f3 1, 272 361 0.613 0.343 0.099 0.039

f4 1, 071 306 0.439 0.232 0.160 0.034

f5 1, 242 343 0.694 0.242 0.663 0.104n= 100, distributionf3

f1 2, 271 990 1.611 0.442 0.151 0.053

f2 1, 985 1, 108 1.964 0.549 0.204 0.056

f3 2, 303 1, 094 1.115 0.428 0.081 0.039

f4 1, 701 1, 037 1.035 0.396 0.126 0.048

f5 4, 160 1, 301 1.987 0.521 0.173 0.069n= 200, distributionf3

f1 1, 667 669 1.293 0.374 0.128 0.040

f2 1, 439 722 1.580 0.487 0.177 0.050

f3 1, 663 731 0.881 0.343 0.067 0.030

f4 1, 207 653 0.846 0.319 0.107 0.037

f5 3, 207 909 1.754 0.445 0.137 0.057n= 100, distributionf4

f1 10, 941 2, 417 0.729 0.191 0.071 0.039

f2 10, 791 2, 347 0.738 0.212 0.151 0.153

f3 11, 875 2, 791 0.429 0.173 0.045 0.025

f4 10, 574 2, 420 0.333 0.150 0.085 0.055

f5 12, 308 3, 115 0.749 0.179 0.147 0.039n= 200, distributionf4

f1 8, 183 1, 863 0.593 0.120 0.048 0.024

f2 8, 339 1, 889 0.619 0.132 0.075 0.081

f3 8, 860 2, 185 0.295 0.101 0.034 0.015

f4 8, 101 1, 867 0.248 0.091 0.056 0.029

f5 9, 322 2, 262 0.607 0.113 0.124 0.024

14

Page 15: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

−3−2

−10

12

3

0

1

2

3

4

0.02

0.04

0.06

0.08

0.10

0.12

x1

x2

f1

Figure 1: Surface plot of the distribution f1.

15

Page 16: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

−4

−2

0

2

4

0

1

2

3

45

6

0.00

0.01

0.02

0.03

0.04

0.05

x1

x2

f2

Figure 2: Surface plot of the distribution f2.

16

Page 17: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

−6−4

−20

24

6

0.0

0.5

1.0

1.5

2.0

2.5

3.0

0.02

0.04

0.06

0.08

0.10

0.12

x1

x2

f3

Figure 3: Surface plot of the distribution f3.

17

Page 18: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

−2

−1

0

1

23

0.0

0.5

1.0

1.5

0.05

0.10

0.15

0.20

0.25

x1x2

f4

Figure 4: Surface plot of the distribution f4.

18

Page 19: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

−0.5 0.0 0.5 1.0 1.5 2.0

0.5

1.0

1.5

2.0

2.5

log10(P256)

log 1

0(T90

)

Figure 5: Scatter plot of the GRB data set.

19

Page 20: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

−0.5 0.0 0.5 1.0 1.5 2.00.5

1.0

1.5

2.0

2.5

0.2

0.4

0.6

0.8

log10(P256)

log10(T90)

f

Figure 6: Surface plot of the density of GRB data set, estimated by f4 withBW= (0.013, 0.016).

20

Page 21: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

−0.5 0.0 0.5 1.0

0.0

0.5

1.0

1.5

2.0

2.5

log10(Re)

z

Figure 7: Scatter plot of the ETG data set.

21

Page 22: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

−0.5

0.0

0.5

0.5

1.0

1.5

2.0

2.5

0.5

1.0

1.5

2.0

2.5

3.0

log10(Re)

z

f

Figure 8: Surface plot of the density of ETG data set, estimated by f4 withBW= (0.02, 0.02).

22

Page 23: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

7 Appendix

7.1 (A.1)

{Eξx1 ,ξx2(ξx1 − x1)}f 1(x) = f 1(x){Eξx1 |ξx2 ,ξx2 (ξx1 − x1)}

= f 1(x)[

ξx1

ξx2

(ξx1 − x1)N(ξx1|µ, (λξ2)−1)Ga(ξx2|α, β)dξx1dξx2

]

= f 1(x)[

ξx2

{

ξx1

(ξx1 − x1)N(ξx1|µ, (λξ2)−1)dξx1

}

Ga(ξx2 |α, β)dξx2

]

= f 1(x)[

ξx2

{Eξx1 |ξx2 (ξx1 − Eξx1 |ξx2 (ξx1))}Ga(ξx2|α, β)dξx2

]

= 0, since ξx1 |ξx2 ∼ N(ξx1 |µ, (λξ2)−1).

Similarly,{Eξx1 ,ξx2

(ξx2 − x2)}f 2(x) = 2b2f2(x).

Again,

1

2E{(ξx1 − x1)(ξx2 − x2)}f 12(x) =

1

2f 12(x)

[

ξx2

(ξx2 − x2){

ξx1

(ξx1 − x1)N(ξx1 |µ, (λξ2)−1)dξx1

}

Ga(ξx2|α, β)dξx2

]

= 0.

Similarly,

1

2E{(ξx1 − x1)(ξx2 − x2)}f 21(x) = 0.

Now,

1

2E{(ξx1 − x1)

2}f 11(x) =1

2f 11(x)

[

ξx2

{

ξx1

(ξx1 − x1)2N(ξx1 |µ, (λξ2)−1)dξx1

}

Ga(ξx2|α, β)dξx2

]

=1

2f 11(x)

[

ξx2

{V arξx1 |ξx2 (ξx1)}Ga(ξx2|α, β)dξx2

]

=1

2f 11(x)λ−1

{

ξx2

ξ−1x2Ga(ξx2|α, β)dξx2

}

, since V arξx1 |ξx2 (ξx1) = (λξx2)−1

=1

2f 11(x)λ−1(α− 1)−1β

=1

2|x1|f 11(x).

23

Page 24: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

Similarly,

1

2E{(ξx2 − x2)

2}f 22(x) =1

2f 22(x)

{

V ar(ξx2) + (2b2)2}

=1

2f 22(x)(x2b2 + 6b22), since V ar(ξx2) = x2b2 + 2b22.

7.2 (A.2)

V ar{f(x)} = n−1V ar{KΘ(Xi)}= n−1E{KΘ(Xi)}2 +O(n−1).

Now,

Eξx1 ,ξx2{KΘ(Xi)}2 =

ξx2

[

ξx1

{N(ξx1 |µ, (λξ2)−1}2dξx1

]

{Ga(ξx2|α, β)}2dξx2

=

√λ√ηx2

2√π

N(ηx1 |µ, (2ληx2)−1){Ga(ξx2|α, β)}2dξx2

= Bb1,b2(x1, x2)N(ηx1 |µ, (2ληx2)−1)Ga(ηx2 |2α− 1/2, 2β),

where Bb1,b2(x1, x2) =√λ√

Γ(2α−1/2)√π22α+1/2Γ(α)2

β1/2.

Now writing the gamma function in terms of R(z) =√2πe−zzz+1/2/Γ(z+1)

for z ≥ 0 we obtain,√

Γ(2α− 1/2)

Γ(α)2=

1√πe

22α−3/2{

1 +1

4(α− 1)

}2α−1 R2(α− 1)

R(2α− 3/2),

which implies (Brown and Chen (1999)) Bb1,b2(x1, x2) ≤ 14π

√e

√λβ for α →

∞. Therefore,

Bb1,b2(x1, x2) ≈

14π

√eb−1/21 b

−1/22 |x1|−1/2x

−1/22 if x1/b1 → ∞ and x2/b2 → ∞,

Γ(2κ2+7/2)√π22κ2+9/2

√(κ1+1)(κ2+1)Γ2(κ2+2)

b−11 b−1

2 if x1/b1 → κ1 and x2/b2 → κ2,

Γ(2κ2+7/2)√π22κ2+9/2

√(κ2+1)Γ2(κ2+2)

b−1/21 b−1

2 |x1|−1/2 if x1/b1 → ∞ and x2/b2 → κ2,

14π

√e√κ1+1

b−11 b

−1/22 x

−1/22 if x1/b1 → κ1 and x2/b2 → ∞,

for non-negative constants κ1, κ2.

24

Page 25: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

7.3 (A.3)

Now, for δ1 = b1−ǫ11 with 0 < ǫ1 < 1 and δ2 = b1−ǫ2

2 with 0 < ǫ2 < 1,

∫ ∞

−∞

∫ ∞

0V ar{f3(x)}dx1dx2 = 2

∫ ∞

0

∫ ∞

0V ar{f3(x)}dx1dx2

= 2

[∫ δ1

0

∫ δ2

0V ar{f3(x)}dx1dx2 +

∫ δ1

0

∫ ∞

δ2

V ar{f3(x)}dx1dx2

+

∫ ∞

δ1

∫ δ2

0V ar{f3(x)}dx1dx2 +

∫ ∞

δ1

∫ ∞

δ2

V ar{f3(x)}dx1dx2]

= 2

[

1

4π√e√κ1 + 1

n−1b−11 b

−1/22

∫ δ1

0

∫ ∞

δ2

x−1/22 f(x)dx1dx2

+Γ(2κ2 + 7/2)

√π22κ2+9/2

(κ2 + 1)Γ2(κ2 + 2)n−1b

−1/21 b−1

2

∫ ∞

δ1

∫ δ2

0|x1|−1/2f(x)dx1dx2

+1

4π√en−1b

−1/21 b

−1/22

∫ ∞

δ1

∫ ∞

δ2

|x1|−1/2x−1/22 f(x)dx1dx2

+O(n−1b−ǫ11 b−ǫ2

2 )

]

= 2

[

1

4π√en−1b

−1/21 b

−1/22

∫ ∞

0

∫ ∞

0|x1|−1/2x

−1/22 f(x)dx1dx2

+O(n−1b−1/21 b

−1/22 )

]

=1

4π√en−1b

−1/21 b

−1/22

∫ ∞

−∞

∫ ∞

0|x1|−1/2x

−1/22 f(x)dx1dx2

+O(n−1b−1/21 b

−1/22 )

with∫∞−∞

∫∞0 |x1|−1/2x

−1/22 f(x)dx1dx2 finite.

References

[1] Bernardo, J. M. and Smith, A. F. M. (2000), Bayesian Theory, John Wiley &Sons, Ltd , Chichester.

[2] Bouezmarni, T. and Rombouts, J. V. K. (2010), Nonparametric density estima-tion for multivariate bounded data, Journal of Statistical Planning and Inference,140, 139-152.

25

Page 26: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

[3] Brown, B. M. and Chen, S. X. (1999), Beta-Bernstein smoothing for regressioncurves with compact support, Scandinavian Journal of Statistics, 26, 47–59.

[4] Chen, S. X. (1999), A beta kernel estimation for density functions, Computa-tional Statistics and Data Analysis, 31, 131-145.

[5] Chen, S. X. (2000), Probability density function estimation using gamma ker-nels, Annals of the Institute of Statistical Mathematics, 52, 471-480.

[6] Chen, Z., Shu, C. G., Burgarella, D., Buat, V., Huang, J. -S., Luo, Z. J. (2013),Properties and morphologies of Lyman break galaxies at z ∼ 1 in the ChandraDeep Field South, inferred from spectral energy distributions, Monthly Noticesof the Royal Astronomical Society, 431, 2080–2105.

[7] Damjanov, I., Abraham, R. G., Glazebrook, K., McCarthy, P. J., Caris, E.,Carlberg, R. G., Chen, H. -W., Crampton, D., Green, A. W., Jørgensen, I.,Juneau, S., Le Borgne, D., Marzke, R. O., Mentuch, E., Murowinski, R., Roth,K., Savaglio, S., Yan, H. (2011), Red Nuggets at High Redshift: Structural Evo-lution of Quiescent Galaxies Over 10 Gyr of Cosmic History, The AstrophysicalJournal, 739, L44–L49.

[8] Forster Schreiber, N. M., Genzel, R., Bouche, N., Cresci, G., Davies, R.,Buschkamp, P., Shapiro, K., Tacconi, L. J., Hicks, E. K. S., Genel, S., Shap-ley, A. E., Erb, D. K., Steidel, C. C., Lutz, D., Eisenhauer, F., Gillessen, S.,Sternberg, A., Renzini, A., Cimatti, A., Daddi, E., Kurk, J., Lilly, S., Kong,X., Lehnert, M. D., Nesvadba, N., Verma, A., McCracken, H., Arimoto, N.,Mignoli, M., Onodera, M. (2009), The SINS Survey: SINFONI Integral FieldSpectroscopy of z ∼ 2 Star-forming Galaxies, The Astrophysical Journal, 706,1364–1428.

[9] Igarashi, G. and Kakizawa, Y. (2014), Re-formulation of inverse Gaussian,reciprocal inverse Gaussian, and BirnbaumSaunders kernel estimators, Statistics& Probability Letters, 84, 235–246.

[10] Kokonendji, C. C. and Some, S. M. (2015), On multivariate associated kernelsfor smoothing general density functions, arXiv:1502.01173.

[11] Libengue, F. G. (2013), Methode Non-Parametrique par Noyaux AssociesMixtes et Applications, Ph.D. Thesis Manuscript (in French) to Universite deFranche-Comte, Besancon, France & Universite de Ouagadougou, Burkina Faso,June 2013, LMB no. 14334, Besancon.

[12] McLure, R. J., Pearce, H. J., Dunlop, J. S., Cirasuolo, M., Curtis-Lake, E.,Bruce, V. A., Caputi, K. I., Almaini, O., Bonfield, D. G., Bradshaw, E. J.,

26

Page 27: arXiv:1801.08300v1 [stat.AP] 25 Jan 2018 · PDF filearXiv:1801.08300v1 [stat.AP] 25 Jan 2018 Bivariate density estimation using normal-gamma kernel with application to astronomy Uttam

Buitrago, F., Chuter, R., Foucaud, S., Hartley, W. G., Jarvis, M. J. (2013), Thesizes, masses and specific star formation rates of massive galaxies at 1.3 < z <1.5: strong evidence in favour of evolution via minor mergers, Monthly Noticesof the Royal Astronomical Society, 428, 1088–1106.

[13] Paciesas, W. S., Meegan, C. A., Pendleton, G. N., Briggs, M. S., Kouveliotou,C., Koshut, T. M., Lestrade, J. P., McCollough, M. L., Brainerd, J. J., Hakkila,J., Henze, W., Preece, R. D., Connaughton, V., Kippen, R. M., Mallozzi, R. S.,Fishman, G. J., Richardson, G. A., Sahi, M. (1999), The fourth batse gamma-ray burst catalog (revised), The Astrophysical Journal Supplement Series, 122,465–495.

[14] Papovich C., Bassett, R., Lotz, J. M., van der Wel, A., Tran, K. -V., Finkel-stein, S. L., Bell, E. F., Conselice, C. J., Dekel, A., Dunlop, J. S., Guo, Y.,Faber, S. M., Farrah, D., Ferguson, H. C., Finkelstein, K. D., Haussler, B.,Kocevski, D. D., Koekemoer, A. M., Koo, D. C., McGrath, E. J., McLure, R.J., McIntosh, D. H., Momcheva, I., Newman, J. A., Rudnick, G., Weiner, B.,Willmer, C. N. A., Wuyts, S. (2012), CANDELS Observations of the StructuralProperties of Cluster Galaxies at z = 1.62, The Astrophysical Journal, 750,93–106.

[15] Saracco, P., Longhetti, M. and Andreon, S. (2009), The population of early-type galaxies at 1 < z < 2 - new clues on their formation and evolution, MonthlyNotices of the Royal Astronomical Society, 392, 718–732.

[16] Silverman, B. W. (1986), Density Estimation for Statistics and Data Analysis,Chapman and Hall, London.

[17] Szomoru, D., Franx, M., van Dokkum, P. G., Trenti, M., Illingworth, G. D.,Labbe, I., Oesch, P. (2013), The Stellar Mass Structure of Massive Galaxiesfrom z = 0 to z = 2.5: Surface Density Profiles and Half-mass Radii, TheAstrophysical Journal, 763, 73–83.

[18] Taylor, E. N., Franx M., Glazebrook, K., Brinchmann, J., Van Der Wel A. andVan Dokkum P. G. (2010), On the Dearth of Compact, Massive, Red SequenceGalaxies in the Local Universe, The Astrophysical Journal, 720, 723–741.

[19] Wand, M.P. and Jones, M.C., 1995. Kernel Smoothing, Chapman and Hall,New York.

27