Share on Facebook Share on Twitter Email
Answers.com

Empirical distribution function

 
Wikipedia: Empirical distribution function

In statistics, an empirical distribution function is a cumulative probability distribution function that concentrates probability 1/n at each of the n numbers in a sample.

Let X1, …, Xn be iid real random variables with the cdf F(x). The empirical distribution function n(x) is a step function defined by

\hat F_n(x) = \frac{ \mbox{number of elements in the sample} \leq x}n = 
\frac{1}{n} \sum_{i=1}^n I(X_i \le x),

where I(A) is the indicator of event A.

For fixed x, I(Xi ≤ x) is a Bernoulli random variable with parameter p = F(x), hence nF̂n(x) is a binomial random variable with mean nF(x) and variance nF(x)(1 − F(x)).

Asymptotical properties

\hat F_n(x)\xrightarrow{\mathrm{a.s.}} F(x) for fixed x (a.s. denotes almost sure convergence).
In other words, n(x) is a consistent unbiased estimator of the cumulative distribution function F(x).
\sqrt{n}(\hat F_n(x)-F(x))
converges in distribution to a normal distribution N(0, F(x)(1 − F(x))) for fixed x.
The Berry–Esséen theorem provides the rate of this convergence.
\|\hat F_n(x)-F(x)\|_\infty\to 0\text{ with probability }1.
The Dvoretzky–Kiefer–Wolfowitz inequality provides the rate of this convergence.
\sqrt{n}\|\hat F_n(x)-F(x)\|_\infty
converges in distribution to the Kolmogorov distribution, provided that F(x) is continuous.
The Kolmogorov–Smirnov test for goodness-of-fit is based on this fact.
\sqrt{n}(\hat F_n-F),
as a process indexed by x, converges in law in the Skorokhod space D(-\infty,\infty) to a Gaussian process B(F(x)), where B(t) is the Brownian bridge.

See also


Search unanswered questions...
Enter a question here...
Search: All sources Community Q&A Reference topics
 
 

 

Copyrights:

Wikipedia. This article is licensed under the Creative Commons Attribution/Share-Alike License. It uses material from the Wikipedia article "Empirical distribution function" Read more