Share on Facebook Share on Twitter Email
Answers.com

Cluster sampling

 
Sci-Tech Dictionary: cluster sampling
(′kləs·tər ′sam·pliŋ)

(statistics) A random sampling plan in which the population is subdivided into groups called clusters so that there is small variability within clusters and large variability between clusters.


Search unanswered questions...
Enter a question here...
Search: All sources Community Q&A Reference topics
Accounting Dictionary: Cluster Sampling
Top

Method of selecting groups of units. The first unit of each group is selected with the use of a random number table. This allows selection of more than one item at a time. In cluster sampling, the population is broken into groups of items, and a Random Sample is selected from all the clusters. Each cluster becomes a sampling unit. After determining the adequate number of clusters, the auditor has a choice of either examining all items in a cluster (one-stage) or only a random number of items in the cluster (two-stage). Cluster sampling requires computing the mean for the individual sampling unit and multiplying this by the number of units in the population to determine the population's estimated value. The precision limit on this estimate must also be computed. Cluster sampling lowers sampling cost and the cost to replace the sample. Sample selection is made easier; however, less statistical efficiency exists. Applications of cluster sampling measure variables such as inventory value and the balance in accounts receivable. It can also be used to measure attributes.

Sports Science and Medicine: cluster sampling
Top

A sample from a representative group where it is not practicable to sample the entire population. For example, if samples of students from universities were required, one university could be selected randomly then a random selection of students made from within the one establishment.

Wikipedia: Cluster sampling
Top

Cluster sampling is a sampling technique used when "natural" groupings are evident in a statistical population. It is often used in marketing research. In this technique, the total population is divided into these groups (or clusters) and a sample of the groups is selected. Then the required information is collected from the elements within each selected group. This may be done for every element in these groups or a subsample of elements may be selected within each of these groups. The technique works best when most of the variation in the population is within the groups, not between them.

Contents

Cluster elements

Elements within a cluster should ideally be as heterogeneous as possible, but there should be homogeneity between cluster means. Each cluster should be a small scale representation of the total population. The clusters should be mutually exclusive and collectively exhaustive. A random sampling technique is then used on any relevant clusters to choose which clusters to include in the study. In single-stage cluster sampling, all the elements from each of the selected clusters are used. In two-stage cluster sampling, a random sampling technique is applied to the elements from each of the selected clusters.

The main difference between cluster sampling and stratified sampling is that in cluster sampling the cluster is treated as the sampling unit so analysis is done on a population of clusters (at least in the first stage). In stratified sampling, the analysis is done on elements within strata. In stratified sampling, a random sample is drawn from each of the strata, whereas in cluster sampling only the selected clusters are studied. The main objective of cluster sampling is to reduce costs by increasing sampling efficiency. This contrasts with stratified sampling where the main objective is to increase precision.

Aspects of cluster sampling

One version of cluster sampling is area sampling or geographical cluster sampling. Clusters consist of geographical areas. Because a geographically dispersed population can be expensive to survey, greater economy than simple random sampling can be achieved by treating several respondents within a local area as a cluster. It is usually necessary to increase the total sample size to achieve equivalent precision in the estimators, but cost savings may make that feasible.

In some situations, cluster analysis is only appropriate when the clusters are approximately the same size. This can be achieved by combining clusters. If this is not possible, probability proportionate to size sampling is used. In this method, the probability of selecting any cluster varies with the size of the cluster, giving larger clusters a greater probability of selection and smaller clusters a lower probability. However, if clusters are selected with probability proportionate to size, the same number of interviews should be carried out in each sampled cluster so that each unit sampled has the same probability of selection.

Cluster sampling is used to estimate high mortalities in cases such as wars, famines and natural disasters.[1]

Advantages

  • Can be cheaper than other methods - e.g. fewer travel expenses, administration costs

Disadvantages

  • Higher sampling error, which can expressed in the so-called "design effect", the ratio between the number of subjects in the cluster study and the number of subjects in an equally reliable, randomly sampled unclustered study.[2]

See also

References

  1. ^ Article by David Brown / Washington Post
  2. ^ Kerry and Bland (1998). Statistics notes: The intracluster correlation coefficient in cluster randomisation. British Medical Journal, 316, 1455-1460.

 
 

 

Copyrights:

Sci-Tech Dictionary. McGraw-Hill Dictionary of Scientific and Technical Terms. Copyright © 2003, 1994, 1989, 1984, 1978, 1976, 1974 by McGraw-Hill Companies, Inc. All rights reserved.  Read more
Accounting Dictionary. Dictionary of Accounting Terms. Copyright © 2005 by Barron's Educational Series, Inc. All rights reserved.  Read more
Sports Science and Medicine. The Oxford Dictionary of Sports Science & Medicine. Copyright © Michael Kent 1998, 2006, 2007. All rights reserved.  Read more
Wikipedia. This article is licensed under the Creative Commons Attribution/Share-Alike License. It uses material from the Wikipedia article "Cluster sampling" Read more