In linear regression mean response and predicted response are values of the dependent variable calculated from the regression parameters and a given value of the independent variable. The values of these two responses are the same, but their calculated variances are different.
|
Contents
|
In straight line fitting the model is

where
is the response variable,
is the explanatory variable, εi is the random error, and
and
are parameters. The predicted response value for a given explanatory value, xd, is given by

while the actual response would be

Expressions for the values and variances of
and
are given in linear regression.
Mean response is an estimate of the mean of the y population associated with xd, that is
. The variance of the mean response is given by

This expression can be simplified to

To demonstrate this simplification, one can make use of the identity

The predicted response distribution is the predicted distribution of the residuals at the given point xd. So the variance is given by
![\text{Var}\left(y_d - \left[\hat{\alpha} + \hat{\beta}x_d\right]\right) = \text{Var}\left(y_d\right) + \text{Var}\left(\hat{\alpha} + \hat{\beta}x_d\right) .](http://wpcontent.answcdn.com/wikipedia/en/math/5/7/a/57a3ca62186080677a9c875c38452ef9.png)
The second part of this expression was already calculated for the mean response. Since
(a fixed but unknown parameter that can be estimated), the variance of the predicted response is given by
![\text{Var}\left(y_d - \left[\hat{\alpha} + \hat{\beta}x_d\right]\right) = \sigma^2 + \sigma^2\left(\frac{1}{m} + \frac{\left(x_d - \bar{x}\right)^2}{\sum (x_i - \bar{x})^2}\right) = \sigma^2\left(1+\frac{1}{m} + \frac{\left(x_d - \bar{x}\right)^2}{\sum (x_i - \bar{x})^2}\right) .](http://wpcontent.answcdn.com/wikipedia/en/math/d/c/6/dc6794127d42f8abecf969de90c5a7e7.png)
The
confidence intervals are computed as
. Thus, the confidence interval for predicted response is wider than the interval for mean response. This is expected intuitively – the variance of the population of
values does not shrink when one samples from it, because the random variable εi does not decrease, but the variance of the mean of the
does shrink with increased sampling, because the variance in
and
decrease, so the mean response (predicted response value) becomes closer to
.
This is analogous to the difference between the variance of a population and the variance of the sample mean of a population: the variance of a population is a parameter and does not change, but the variance of the sample mean decreases with increased samples.
The general linear model can be written as

Therefore since
the general expression for the variance of the mean response is

where M is the covariance matrix of the parameters, given by
.|
|
This article includes a list of references, related reading or external links, but its sources remain unclear because it lacks inline citations. Please improve this article by introducing more precise citations. (November 2010) |
Draper, N.R., Smith, H. (1998) Applied Regression Analysis. Wiley. ISBN 0-471-17082-8
|
||||||||||||||||||||||||||||||||||||||||||||||
This entry is from Wikipedia, the leading user-contributed encyclopedia. It may not have been reviewed by professional editors (see full disclaimer)