Share on Facebook Share on Twitter Email
Answers.com

Consistent estimator

 
Wikipedia: Consistent estimator
A sequence of estimators {T1, T2, T3, …} for parameter θ (true value of which is 4). This sequence is consistent: the estimators are getting more and more concentrated near the true value of θ. The limiting distribution of the sequence is a degenerate random variable which equals θ with probability 1.

In statistics, a sequence of estimators for parameter θ is said to be consistent (or asymptotically consistent) if this sequence converges in probability to θ. Intuitively, this means that estimators taken far enough in the sequence are more likely to be in the vicinity of the parameter being estimated, and in the limit they will be arbitrarily close to θ with probability one.

In practice one usually constructs a single estimator as a function of an available sample of size n, and then imagines being able to keep collecting data and expanding the sample ad infinitum. In this way one would obtain a sequence of estimators indexed by n and the notion of consistency will be understood as the sample size “tends to infinity”. If this sequence converges in probability to the true value of the parameter being estimated, we call it a consistent estimator; otherwise the estimator is said to be inconsistent.

This interpretation of sample size “growing to infinity” is pervasive in modern statistics, although it is also criticized as being too unrealistic. Indeed, in those rare cases when additional data can be collected, the statistician is usually forced to use more refined models as more data becomes available, meaning that the “true” parameter θ does not remain fixed and in fact its dimension expands.[1]

Consistency with convergence in probability is sometimes referred to as weak consistency. The notion can be extended to other modes of convergence of random variables. In particular, a sequence is said to be strongly consistent if it converges almost surely to the true parameter.

Contents

Definition

Loosely speaking, an estimator Tn of θ is said to be consistent if [2]

T_n\ \xrightarrow{p}\ \theta,\ \text{as}\ n\to\infty,

where →p denotes convergence in probability.

More precise definition takes into account the fact that θ is actually unknown, and thus the convergence above must take place for every possible value of θ: Let \{p_\theta:\theta\in\Theta\} be a family of distributions, and let Xθ={X1, X2, … : Xi ~ pθ} denote a sample from the distribution pθ. Suppose {Tn(Xθ)} is a sequence of estimators for parameter g(θ). Usually Tn will be based on the first n observations of a sample. Then this sequence {Tn} is said to be (weakly) consistent on g(Θ) if [3]

\operatorname{p}\!\!\lim_{n\to\infty} T_n(X^{\theta}) = g(\theta),\ \ \text{for all}\ \theta\in\Theta,

or plugging in the definition of convergence in probability:

\lim_{n\to\infty}\Pr\!\Big[|T_n(X^\theta)-g(\theta)|\geq\varepsilon\Big]=0,\ \ \text{for all}\ \theta\in\Theta,\ \text{and all}\ \varepsilon>0.

We use g(θ) in this definition instead of simply θ, because often one is interested in estimating only a certain subvector of the underlying parameter, and regard the rest of the components of θ as nuisance. In the next example we estimate the location parameter of the model, but not the scale:

Example: sample mean for normal random variables

Suppose one has a sequence of observations {X1, X2, …} from a N(μ,σ2) distribution. To estimate μ based on the first n observations we use the sample mean: Tn = (X1 + … + Xn)/n. This defines a sequence of estimators, indexed by the sample size n.

Now, from the properties of the normal distribution, it is known that Tn will itself be normally distributed, with mean μ and variance σ2/n. Equivalently, \sqrt{n}(T_n-\mu)/\sigma has a standard normal distribution. Then

\Pr\!\Big[|T_n-\mu|\geq\varepsilon\Big] = \Pr\!\left[ \sqrt{n}|T_n-\mu|/\sigma \geq \sqrt{n}\varepsilon/\sigma \right] = 2\big(1-\Phi(\sqrt{n}\varepsilon/\sigma)\big)\ \to\ 0

as n tends to infinity, for any fixed ε > 0. Therefore, the sequence Tn of sample means is consistent for the population mean μ.

Establishing consistency

The notion of asymptotic consistency is very close, almost synonymous to the notion of convergence in probability. As such, any theorem, lemma, or property which establishes convergence in probability may be used to prove the consistency. Many such tools exist:

  • In order to demonstrate consistency directly from the definition one can use inequality [4]
    \Pr\!\big[h(T_n-\theta)\geq\varepsilon\big] \leq \frac{1}{\varepsilon} \operatorname{E}\big[h(T_n-\theta)\big],
    the most common choice for function h being either the absolute value (in which case it is known as Markov inequality), or the quadratic function (respectively Chebychev's inequality).
  • Another useful result is the continuous mapping theorem: if Tn is consistent for θ and g(·) is a real-valued function continuous at point θ, then g(Tn) will be consistent for g(θ):[5]
    T_n\ \xrightarrow{p}\ \theta\ \quad\Rightarrow\quad g(T_n)\ \xrightarrow{p}\ g(\theta)
  • Slutsky's theorem can be used to combine several different estimators, or an estimator with a non-random covergent sequence. If Tn →pα, and Sn →pβ, then [6]
    \begin{align}
  & T_n + S_n \ \xrightarrow{p}\ \alpha+\beta, \\
  & T_n   S_n \ \xrightarrow{p}\ \alpha \beta, \\
  & T_n / S_n \ \xrightarrow{p}\ \alpha/\beta,\ \text{provided that}\ \beta\neq0
  \end{align}
  • If estimator Tn is given by an explicit formula, then most likely the formula will employ sums of random variables, and then the law of large numbers can be invoked: for a sequence {Xn} of random variables and under suitable conditions,
    \frac{1}{n}\sum_{i=1}^n g(X_i) \ \xrightarrow{p}\ \operatorname{E}[\,g(X)\,]
  • If estimator Tn is defined implicitly, for example as a value that maximizes certain objective function, then more complicated argument involving uniform convergence in probability or stochastic equicontinuity has to be used. [7]

See also

  • Fisher consistency — alternative although rarely used concept of consistency for the estimators
  • Consistent test — the notion of consistency in context of hypothesis testing

References

  1. ^ Le Cam, ch.7.1
  2. ^ Amemiya, def.3.4.2
  3. ^ Lehman, p.332
  4. ^ Amemiya, (3.2.5)
  5. ^ Amemiya, Th.3.2.6
  6. ^ Amemiya, Th.3.2.7
  7. ^ Newey, Ch.2

Further reading


Search unanswered questions...
Enter a question here...
Search: All sources Community Q&A Reference topics
 
 

 

Copyrights:

Wikipedia. This article is licensed under the Creative Commons Attribution/Share-Alike License. It uses material from the Wikipedia article "Consistent estimator" Read more