No Title

STAT 350: Lecture 1

Reading: Here is a list of sections in the text which I will be assuming you have some familiarity with: Chapter 1, 2.1-5, 2.7, 2.9-11, 3.1-3, 4.1, 4.4, 5.1-7, 15.1-4, 16.1-16.10, 17.1-17.7, 19.1-19.6, 19.8, 20.1-3, 21.1, 26.1-2, 27.1-3, Appendix A

Subject of this course:

Values $Y_1,\ldots,Y_n$ of a ``response'' or ``dependent'' variable are measured under different ``conditions''.
Goal: understand influence of conditions on response.
Role of statistics: response is subject random fluctuation or error.

Basic Statistical Model

Additive errors:

$\begin{displaymath}Y=\mu+\epsilon \end{displaymath}$

Assume ${\rm E}(\epsilon)=0$ (or define $\mu={\rm E}(Y)$ and deduce that ${\rm E}(\epsilon)=0$ ).

For a sample of size n:

$\begin{displaymath}Y_i = \mu_i + \epsilon_i \quad ; \quad {\rm E}(\epsilon_i)=0 \quad i=1,\ldots,n \end{displaymath}$

Goal now: relate $\mu_i$ to ``conditions'' for measurement i.

``Condition'' summarized by values of ``covariates''

$\begin{displaymath}x_{ij} = \mbox{ value of $j$th covariate for $i$th response} \end{displaymath}$

Linear Models

Often we assume

$\begin{displaymath}\mu_i = x_{i1} \beta_1 + x_{i2} \beta_2 + \cdots + x_{ip} \beta_p \end{displaymath}$

where $\beta_1,\ldots,\beta_p$ are parameters (usually unknown).

Key is:

$\mu$ is a linear function of

$\begin{displaymath}\beta = \left(\begin{array}{c} \beta_1 \\ \vdots \\ \beta_p\end{array} \right) \end{displaymath}$
This makes it a linear model.
The x_ij are known.

A useful alternative description:

$\begin{displaymath}\frac{\partial \mu_i}{\partial \beta_j} \left( = x_{ij} \right) \mbox{ is {\bf known}} \end{displaymath}$

Example: Thermoluminescence Dating

Thermoluminescence (TL) dating can be used to determine how old a piece of pottery is or how old a sand dune is. When a piece of pottery is found by an archaeologist it is ground up and split into small samples. The samples are irradiated with different amounts of gamma radiation and then heated in an oven. At temperatures around 300 Celsius they glow with blue light called thermoluminescence. The amount of light given off, Y depends on the dose D of radiation given by the analyst (and also on the amount of radiation --cosmic rays or radiation from trace isotopes in the ground-- to which the pot or sand was exposed while buried).

Several models are in use:

a straight-line model,

$\begin{displaymath}Y_i=\beta_1+\beta_2D_i+\epsilon_i \end{displaymath}$
a quadratic model,

$\begin{displaymath}Y_i=\beta_1+\beta_2D_i+\beta_3D_i^2+\epsilon_i \end{displaymath}$
a cubic model,

$\begin{displaymath}Y_i=\beta_1+\beta_2D_i+\beta_3D_i^2+\beta_4D_i^3+\epsilon_i \end{displaymath}$
and a saturating exponential model,

$\begin{displaymath}Y_i=\beta_1[1-\exp\{(-(\beta_2D_i+\beta_3)\}]+\epsilon_i \, . \end{displaymath}$

Of these the first three are linear models while the fourth is not. In the first three cases the mean $\mu_i$ can be differentiated with respect to any $\beta_j$ and you get a known (measured) constant. Thus for example in the second model

$\begin{displaymath}(x_{i,1},x_{i,2},x_{i,3}) = (1,D_i,D_i^2) \, . \end{displaymath}$

For the last model, however, the derivatives depend on unknown parameters, such as,

$\begin{displaymath}\frac{\partial \mu_i}{\partial \beta_1} = 1-\exp\{(-(\beta_2D_i+\beta_3)\} \end{displaymath}$

which is not known since it involves $\beta_2$ and $\beta_3$ .

Here is a plot of the data with the least squares line drawn in.

}

Here is the same plot with the least squares fit of the quadratic model.

You will see that the two fits are virtually indistinguishable. In this course we will want to test the hypothesis that the $\beta_3$ term in the quadratic model can be left out. To see the importance of this consider the use to which these models are put. Physicists reckon that the intercept term $\beta_1$ which is the amount of TL if you don't add any radiation is the TL due to the exposure to cosmic rays and so on while buried. The total exposure while buried is equivalent to some dose D_eq of added radiation called the ``equivalent dose'', equivalent in the sense that $\beta_1 = \beta_2D_{eq}$ if a straight line model is appropriate. This dose is measured by finding the value of D which would produce a predicted TL equal to 0, that is, by extending the graph to negative doses until the fit crosses the x axis. In the next plot you can see that the linear and quadratic fits do cross the x axis (or y=0) at different places.

We will be fitting linear models (and near the end of the course non-linear models like the saturating exponential) by least squares. We will examine residual plots to judge whether or not the model assumptions are adequate:

In this case the plot shows clear signs of heteroscedasticity -- unequal variances.

We will look at Q-Q plots of the residuals to judge normality.

The plot is not straight and so the assumption of normally distributed errors would also be in doubt; this problem is probably irrelevant in view of the heteroscedasticity, however.

$next$ $up$

Richard Lockhart
1999-01-04