outlier

Share on Facebook Share on Twitter Email
(out''ər) pronunciation
n.
  1. One whose domicile lies at an appreciable distance from his or her place of business.
  2. A value far from most others in a set of data: "Outliers make statistical analyses difficult" (Harvey Motulsky).
  3. Geology. A portion of stratified rock separated from a main formation by erosion.

An observation that is very different to other observations in a set of data. Since the most common cause is recording error, it is sensible to search for outliers (by means of summary statistics and plots of the data) before conducting any detailed statistical modelling.




Outlier. A small data set showing an obvious outlier. Whenever feasible, data should be plotted, since including an outlier will usually make nonsense of any calculations.
Various indicators are used to identify outliers. One is that an observation has a value that is more than 2.5 standard deviations from the mean. Another is that an observation has a value that lies more than 1.5I beyond the upper or the lower quartile, where I is the interquartile range (see boxplot).

If there is only a single outlier present, then an effective test is the Dixon test. Denoting the kth largest observation by y(k), the test statistic is either



depending on whether y(n) appears unusually large, or y(1) appears unusually small. Special tables are required in order to determine significance.

For data from a normal distribution, the test statistic of the Grubbs test, suggested by Grubbs in 1969, is G, given by
G=1/s max {y(n), y(1)},
where and s are the sample mean and standard deviation.

The Rosner test for multiple outliers relies on ordering the n observations in terms of their distance from the overall mean, . Let ym be the observation that is the mth closest to and let the mean and standard deviation of the m−1 observations closest to the overall mean be m−1 and sm−1. The decision as to whether ym is an outlier is based on the value of






[Co]

1. An example of something (e.g. a site or object) found a long way detached from the main distribution of similar things.

2. A standing stone outside and some distance apart from a stone circle.


(in statistics) a data point that falls far outside the values of others in a data set. The nature of outliers should be understood prior to further analysis: those arising from measurement errors will distort the statistics and hence interpretation of the data; those that are genuine, however, may point to unexpected and hence highly interesting results.

Previous:outer membrane, ounce, ouabain
Next:ov+, ova, ovalbumin

An extremely high or low value lying beyond the range of the bulk of the data.

Post a question - any question - to the WikiAnswers community:

Copyrights:

Mentioned in

Hkakabo Razi (mountain, Myanmar)
Wienerwald (geographical area, Austria)