Category: 

What Is Cluster Sampling?

Cluster sampling is popular during elections.
Article Details
  • Written By: Tricia Ellis-Christensen
  • Edited By: O. Wallace
  • Last Modified Date: 15 October 2014
  • Copyright Protected:
    2003-2014
    Conjecture Corporation
  • Print this Article
Free Widgets for your Site/Blog
Dolphins have the most teeth of any mammal, sometimes over 260, yet they almost never chew their food.  more...

October 20 ,  1973 :  The "Saturday Night Massacre"  more...

Cluster sampling is a technique that generates statistics about certain populations. It has a specific format required to obtain an appropriate sample, and though this sampling can help accurately gauge some information, it is not thought as accurate as simple random samples, where all groups of the same size have the same exact chance of being selected. Despite lacking the assurance that comes from using random samples, cluster sampling is used frequently in business and other applications.

The basic procedure for creating a cluster sample is to divide the full population into some sort of meaningful groups. For instance, McDonald’s® might want a sense of what the most popular item ordered on their menu is. They might create a cluster/group for each McDonald’s store. They would then pick some of these clusters and obtain a sample from all people in that group. They could keep track of each customer’s order and decide which menu item is most popular or survey customers eating, but the company would only survey or track people in the chosen clusters; they’d also try to get all people at selected clusters.

This type of sampling is very popular on big voting nights. A natural division exists between voter precincts, but by choosing some of the precincts and surveying or using exit polls at the chosen ones, there’s often a good sense what issues or what elected officials appear to be winning. The results are extrapolated to the entire population, and they’re often fairly representative of it.

Ad

When people study statistics, they often find it challenging to remember the features of cluster sampling as opposed to the features of stratified sampling. The two have some similarities and key differences that are worth understanding.

In a stratified sample, a population is also divided into groups, though number of groups tends to be smaller. A population could be divided by gender, age, income, and region in which they live, and comparing the result of each group may be part of the reason the stratified sample is performed. The huge and appreciable difference between stratified and cluster methods is that when the groups are created, some members from each group or strata are selected. With a cluster, when clusters are created, the whole population of some of the clusters are used.

The degree to which this method works tends to depend on what is being evaluated and how diverse of a population clusters represent. Say a statistician decided to break down voting precincts in a predominantly Republican state and create clusters of some of them to look for predictions about a national election. These results would likely be skewed and not representative of the complete population in the US. On the other hand, cluster sampling with exit polling in a Republican or Democrat state could say a lot about the voting trends in the individual state.

Ad

More from Wisegeek

You might also Like

Discuss this Article

SteamLouis
Post 3

I'm doing a survey for my class assignment. I have a question, if we survey everyone in a cluster, we are preventing bias right?

My cluster is a classroom in our school. Do I need to survey each person in the class? My teacher said that I can survey each person, or sample a group within the class. But if I take sample, than the result might be biased right?

I will have to spend more time and it will be harder to survey everyone in the class. But I also want to have the best result. Which should I do?

burcidi
Post 2

I work for a health organization and we do a lot of surveying about healthcare.

Sampling is not as easy at it seems. There are a lot of things to consider. But we always start out with three main issues. First we decide what it is we are trying to measure, who we are measuring it for and how precise the data needs to be.

We provide the data to different organizations and agencies and it can impact health policy at different stages. So we have to be real careful about the sampling. We use cluster sampling if the survey is small and there is not much funding for it. We also do two stage cluster samples to get more precise data.

serenesurface
Post 1

I think that it would be better to use cluster sampling to gather information about the cluster group rather than generalize information onto other groups that were not sampled.

It's easy to reach wrong conclusions if we use cluster sampling and apply it to other populations. Maybe some might want to do this, and I see it happening a lot when people want to prove a point. Especially companies who are trying to market a product might want to do this. But the information might not be correct.

When I read about data in product reports, I always consider what the cluster sample size is. The smaller the size, the more dependable the information is for me. If the sample size is not reported, then there is no way of knowing.

Post your comments

Post Anonymously

Login

username
password
forgot password?

Register

username
password
confirm
email