MATH 450: Mathematical statistics

Vu Dinh

Department of Mathematical Sciences, University of Delaware

August 28th, 2018

Vu Dinh MATH 450: Mathematical statistics

General information

Classes: Tuesday & Thursday 9:30-10:45 am, Gore Hall 115

Office hours: Tuesday–Wednesday 1-2:30 pm, Ewing Hall 312

Website: http://vucdinh.github.io/m450f18

Textbook: Modern mathematical statistics with applications (Second Edition). Devore and Berk. Springer, 2012.

Evaluation

Overall scores will be computed as follows: 25% homework, 10% quizzes, 25% midterm, 40% final.

No letter grades will be given for homework, midterm, or final. Your letter grade for the course will be based on your overall score.

The lowest homework scores and the lowest quiz score will be dropped.

Here are the letter grades you can achieve according to your overall score:

≥ 90%: At least A
≥ 75%: At least B
≥ 60%: At least C
≥ 50%: At least D
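
As a rough illustration of the weighting above, the following sketch computes an overall score and the guaranteed minimum letter grade. The function names and the example student are made up for illustration. Each category score is assumed to be a percentage in [0, 100] that has already been computed after dropping the lowest scores.

```python
# Weights from the evaluation scheme above (fractions of the overall score).
WEIGHTS = {"homework": 0.25, "quizzes": 0.10, "midterm": 0.25, "final": 0.40}

# Cutoffs for the guaranteed minimum letter grade, highest first.
CUTOFFS = [(90.0, "A"), (75.0, "B"), (60.0, "C"), (50.0, "D")]


def overall_score(scores):
    """Weighted average of category percentages (each in [0, 100])."""
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS)


def minimum_letter(score):
    """Return the guaranteed minimum letter, or None below every cutoff."""
    for cutoff, letter in CUTOFFS:
        if score >= cutoff:
            return letter
    return None


# Hypothetical student, for illustration only.
example = {"homework": 88.0, "quizzes": 70.0, "midterm": 80.0, "final": 76.0}
example_score = overall_score(example)
print(round(example_score, 2), minimum_letter(example_score))
```

For this hypothetical student the overall score is 22 + 7 + 20 + 30.4 = 79.4, which guarantees at least a B.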

Homework

Assignments will be posted on the website every other Tuesday (starting from the second week) and will be due on Thursday of the following week, at the beginning of lecture.

No late homework will be accepted.

The lowest homework scores will be dropped in the calculation of your overall homework grade.

Quizzes and exams

At the end of some chapters, there will be a short quiz during class.

The quiz dates will be announced at least one class in advance.

The lowest quiz score will be dropped.

There will be a midterm on 10/25 and a final exam during exam week.

Data analysis

Open source statistical system R

http://cran.r-project.org/

Tentative schedule

Topics

Textbook: Modern mathematical statistics with applications (Second Edition). Devore and Berk. Springer, 2012.

Week 1     Chapter 6: Statistics and Sampling Distributions
Week 4     Chapter 7: Point Estimation
Week 7     Chapter 8: Confidence Intervals
Week 10    Chapter 9: Test of Hypothesis
           Chapter 10: Two-sample inference
Week 14    Regression

Mathematical statistics

Statistics is a branch of mathematics that deals with the collection, organization, analysis, interpretation and presentation of data.

“...analysis, interpretation and presentation of data” → mathematical statistics

descriptive statistics: the part of statistics that describes data

inferential statistics: the part of statistics that draws conclusions from data

Modelling uncertainties

Modern statistics is about making predictions in the presence of uncertainties.

“It is difficult to make predictions, especially about the future.”

Inferential statistics

population: a well-defined collection of objects of interest

when desired information is available for all objects in the population, we have what is called a census → very expensive

a sample, a subset of the population, is selected

Random sample

Definition

The random variables $X_1, X_2, \ldots, X_n$ are said to form a (simple) random sample of size $n$ if

1. the $X_i$'s are independent random variables

2. every $X_i$ has the same probability distribution
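
The two conditions can be illustrated with a small simulation. This is a minimal sketch that assumes a fair six-sided die as the common distribution. The function name `draw_random_sample` and the seed are hypothetical, and a pseudo-random generator only approximates true independence.

```python
import random


def draw_random_sample(n, seed=None):
    """Simulate n independent draws from one distribution (a fair die)."""
    rng = random.Random(seed)
    # Every draw uses the same distribution (uniform on 1..6) and does not
    # depend on earlier draws: the two conditions of the definition.
    return [rng.randint(1, 6) for _ in range(n)]


sample = draw_random_sample(10, seed=450)
sample_mean = sum(sample) / len(sample)
print(sample, sample_mean)
```

Here `sample` plays the role of the observed values x1, ..., xn of the random variables X1, ..., Xn.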

Review: Probability

Overview

Axioms of probability

Conditional probability and independence

Random variables

Special distributions

Bivariate and multivariate distributions

Most important parts

Expectation and variance of random variables (discrete and continuous)

Computations with normal distributions

Bivariate and multivariate distributions

Random variables

random variables are used to model uncertainties

Notation:

random variables are denoted by uppercase letters (e.g., X); the calculated/observed values of the random variables are denoted by lowercase letters (e.g., x)

Random variable

Definition

Let S be the sample space of an experiment. A real-valued function $X : S \to \mathbb{R}$ is called a random variable of the experiment.
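
To make the definition concrete, the sketch below treats a random variable literally as a function on a sample space. The experiment (tossing a coin twice) and the choice of X as the number of heads are assumptions made for illustration, not taken from the slides.

```python
from itertools import product

# Sample space S of the experiment "toss a coin twice".
S = list(product("HT", repeat=2))


def X(outcome):
    """Random variable X: S -> R, the number of heads in the outcome."""
    return float(outcome.count("H"))


values = {outcome: X(outcome) for outcome in S}
print(values)
```

If the coin is fair, each of the four outcomes has probability 1/4, so X takes the values 0, 1, 2 with probabilities 0.25, 0.5, 0.25.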

Discrete random variable

Definition

A random variable X is discrete if the set of all possible values of X is either

finite, or

countably infinite.

Note: A set A is countably infinite if its elements can be put in one-to-one correspondence with the set of natural numbers, i.e., we can index the elements of A as a sequence

$$A = \{x_1, x_2, \ldots, x_n, \ldots\}$$

Discrete random variable

A discrete random variable X is described by its probability mass function (pmf) $p(x) = P(X = x)$.

Represent the probability mass function

As a table

As a function:

$$p(x) = \begin{cases} \frac{1}{2}\left(\frac{2}{3}\right)^{x} & \text{if } x = 1, 2, 3, \ldots, \\ 0 & \text{elsewhere} \end{cases}$$
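
A pmf must sum to 1 over all possible values. The sketch below checks this for the function above using exact rational arithmetic. The helper name `partial_sum` is hypothetical, and the geometric-series identity in the comment is a standard fact rather than something stated on the slide.

```python
from fractions import Fraction


def p(x):
    """pmf p(x) = (1/2) * (2/3)**x for x = 1, 2, 3, ...; 0 elsewhere."""
    if isinstance(x, int) and x >= 1:
        return Fraction(1, 2) * Fraction(2, 3) ** x
    return Fraction(0)


def partial_sum(n):
    """Sum of p(x) for x = 1, ..., n."""
    return sum((p(x) for x in range(1, n + 1)), Fraction(0))


# The partial sums equal 1 - (2/3)**n, which tends to 1 as n grows.
for n in (1, 5, 50):
    print(n, float(partial_sum(n)))
```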

Expectation

The expected value of a random variable X is also called the mean, or the mathematical expectation, or simply the expectation of X. It is variously denoted by $E[X]$, $E(X)$, $EX$, $\mu_X$, or $\mu$.

Exercise

Problem

A random variable X has the following pmf table

X            0     1     2
probability  0.25  0.5   0.25

What is the expected value of X?
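
One way to check an answer to this exercise is sketched below. It assumes the standard definition for a discrete random variable, E[X] = sum over x of x * p(x). The function name `expectation` is chosen here for illustration.

```python
# pmf table from the exercise: value -> probability.
pmf = {0: 0.25, 1: 0.5, 2: 0.25}


def expectation(pmf):
    """E[X] = sum of x * p(x) over all possible values x."""
    return sum(x * prob for x, prob in pmf.items())


print(expectation(pmf))  # 0*0.25 + 1*0.5 + 2*0.25 = 1.0
```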

Law of the unconscious statistician (LOTUS)
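
For reference, the standard statement of LOTUS for a discrete random variable $X$ with pmf $p$ and a real-valued function $g$ is given below. The symbol $g$ is introduced here and is assumed to match the notation used in the lecture.

```latex
E[g(X)] = \sum_{x} g(x)\, p(x)
```

In words: to average $g(X)$, weight each value $g(x)$ by the probability of $x$. The distribution of $g(X)$ itself is never needed.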

Exercise

Problem

A random variable X has the following pmf table

X            0     1     2
probability  0.25  0.5   0.25

What is $E[X^2 - X]$?

Compute $\mathrm{Var}[X]$.

$$\mathrm{Var}[X] = E[(X - E[X])^2] = E[X^2] - (E[X])^2$$
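
Both parts of this exercise are applications of LOTUS, which for a discrete random variable reads E[g(X)] = sum over x of g(x) * p(x). The sketch below evaluates them numerically and computes the variance with both forms of the formula. The helper name `expect` is hypothetical.

```python
# pmf table from the exercise: value -> probability.
pmf = {0: 0.25, 1: 0.5, 2: 0.25}


def expect(g, pmf):
    """LOTUS: E[g(X)] = sum of g(x) * p(x)."""
    return sum(g(x) * prob for x, prob in pmf.items())


mean = expect(lambda x: x, pmf)                 # E[X]
second_moment = expect(lambda x: x ** 2, pmf)   # E[X^2]
answer = expect(lambda x: x ** 2 - x, pmf)      # E[X^2 - X]
variance = second_moment - mean ** 2            # E[X^2] - (E[X])^2
variance_direct = expect(lambda x: (x - mean) ** 2, pmf)  # E[(X - E[X])^2]
print(answer, variance, variance_direct)
```

The two variance computations agree, as the identity above says they must. Note also that E[X^2 - X] = E[X^2] - E[X] by linearity of expectation.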

Continuous random variable

Definition

Let X be a random variable. Suppose that there exists a nonnegative real-valued function $f : \mathbb{R} \to [0, \infty)$ such that for any subset of real numbers A, we have

$$P(X \in A) = \int_A f(x)\,dx$$

Then X is called absolutely continuous or, for simplicity, continuous. The function f is called the probability density function, or simply the density function of X.

Whenever we say that X is continuous, we mean that it is absolutely continuous and hence satisfies the equation above.

Properties

Let X be a continuous r.v. with density function f. Then

$$\int_{-\infty}^{\infty} f(x)\,dx = 1$$

For any fixed constants $a \le b$,

$$P(a \le X \le b) = \int_a^b f(x)\,dx$$

$$P(X = a) = 0$$

$$P(a < X < b) = P(a \le X < b) = P(a < X \le b) = P(a \le X \le b)$$

Probability density function

Expectation
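
For a continuous random variable $X$ with density function $f$, the standard definition, stated here for reference and valid when the integral converges absolutely, is:

```latex
E[X] = \int_{-\infty}^{\infty} x\, f(x)\, dx
```

This mirrors the discrete case, with the sum over values replaced by an integral and the pmf replaced by the density.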

LOTUS
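
For a continuous random variable $X$ with density function $f$ and a real-valued function $g$, the standard form of the law of the unconscious statistician is given below for reference. The symbol $g$ is introduced here.

```latex
E[g(X)] = \int_{-\infty}^{\infty} g(x)\, f(x)\, dx
```

Taking $g(x) = x^2$ gives $E[X^2]$, which is what the variance formula $\mathrm{Var}[X] = E[X^2] - (E[X])^2$ requires.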

Example

Problem

Let X be a continuous r.v. with density function

$$f(x) = \begin{cases} c e^{-2x} & \text{if } x \ge 0 \\ 0 & \text{otherwise} \end{cases}$$

where c is some unknown constant.

Compute c.

Compute $P(X \in [1, 2])$.

Compute $E[X]$ and $\mathrm{Var}(X)$.
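
Working the integrals by hand gives c = 2 (since the integral of e^{-2x} over [0, ∞) is 1/2), P(X ∈ [1, 2]) = e^{-2} - e^{-4}, E[X] = 1/2 and Var(X) = 1/4. The sketch below is a numerical sanity check of those values, not a substitute for the analytic solution. The helper `integrate`, the midpoint rule, and the truncation point `UPPER = 20` are choices made here for illustration.

```python
import math


def integrate(h, a, b, steps=100_000):
    """Midpoint-rule approximation of the integral of h over [a, b]."""
    width = (b - a) / steps
    return width * sum(h(a + (i + 0.5) * width) for i in range(steps))


UPPER = 20.0  # assumed truncation point; exp(-40) is negligible

# Normalisation: the integral of c * exp(-2x) over [0, inf) must equal 1.
c = 1.0 / integrate(lambda x: math.exp(-2 * x), 0.0, UPPER)


def f(x):
    """Density f(x) = c * exp(-2x) for x >= 0, and 0 otherwise."""
    return c * math.exp(-2 * x) if x >= 0 else 0.0


prob = integrate(f, 1.0, 2.0)                            # P(1 <= X <= 2)
mean = integrate(lambda x: x * f(x), 0.0, UPPER)         # E[X]
second = integrate(lambda x: x * x * f(x), 0.0, UPPER)   # E[X^2]
variance = second - mean ** 2                            # Var(X)

print(c, prob, mean, variance)
print(math.exp(-2) - math.exp(-4))  # closed form for P(1 <= X <= 2)
```

The printed numerical values should agree with the closed forms to several decimal places.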
