Name-Based Cluster Sampling

February 7, 2010

Douglas A. Ferguson, Name-Based Cluster Sampling, Sociological Methods & Research 2009 37: 590-598.

An innovative method is proposed for generating valid national samples of online e-mail addresses that are proportionate to the population. Although multistage cluster sampling is not new, ...


Peter D. Killworth, Christopher McCarty, Eugene C. Johnsen, H. Russell Bernard, and Gene A. Shelley, Investigating the Variation of Personal Network Size Under Unknown Error Conditions, Sociological Methods & Research 2006 35: 84-112.

February 7, 2010
This article estimates the variation in personal network size, using respondent data containing two systematic sources of error. The data are the proportion of respondents who, on average, claim to know zero, one, and two people in various subpopulations, such as “people who are widows under the age of 65” or “people who are diabetics.” The two kinds of error, transmission error (respondents are unaware that someone in their network is in a subpopulation) and barrier error (something causes a respondent to know more or fewer people than would be expected in a subpopulation), are hard to quantify. The authors show how to estimate the shape of the probability density function (pdf) of the number of people known to a random individual by assuming that respondents give what they assume to be accurate responses based on incorrect knowledge. It is then possible to estimate the relative effective sizes of subpopulations and produce an internally consistent theory. These effective sizes permit an evaluation of the shape of the pdf, which, remarkably, agrees with earlier estimates.

Key Words: social networks • errors • probability density function
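
To make the data described in the abstract above concrete, here is a minimal Python sketch of how the proportions of respondents claiming to know zero, one, and two people in a subpopulation relate to a pdf of network size. It is not the authors' estimation procedure. The function names, the sample responses, and the toy pdf are all hypothetical, and the second function assumes a simple binomial model (each contact belongs to the subpopulation independently with probability p, with no transmission or barrier error), which the abstract does not state.

```python
from collections import Counter
from math import comb


def observed_proportions(reported_counts, ks=(0, 1, 2)):
    """Share of respondents who report knowing exactly k people, for each k in ks."""
    n = len(reported_counts)
    if n == 0:
        raise ValueError("need at least one respondent")
    tally = Counter(reported_counts)
    return {k: tally[k] / n for k in ks}


def expected_proportions(size_pdf, p, ks=(0, 1, 2)):
    """Expected share reporting exactly k, given a pdf over network size.

    size_pdf maps a network size c to its (possibly unnormalized) weight.
    p is the fraction of the whole population in the subpopulation.
    ASSUMPTION (not from the abstract): each of the c contacts is in the
    subpopulation independently with probability p, and respondents report
    without transmission or barrier error.
    """
    total = sum(size_pdf.values())
    if total <= 0:
        raise ValueError("size_pdf must have positive total weight")
    return {
        k: sum(
            (w / total) * comb(c, k) * p**k * (1 - p) ** (c - k)
            for c, w in size_pdf.items()
            if c >= k  # a network of size c cannot contain more than c members
        )
        for k in ks
    }


if __name__ == "__main__":
    # Hypothetical answers to "how many diabetics do you know?" from 8 respondents.
    answers = [0, 0, 1, 2, 5, 0, 1, 3]
    print(observed_proportions(answers))

    # Hypothetical toy pdf: half of people know 100 others, half know 300.
    toy_pdf = {100: 0.5, 300: 0.5}
    print(expected_proportions(toy_pdf, p=0.01))
```

Comparing the observed proportions with those expected under a candidate pdf gives a rough sense of how the shape of the pdf constrains the data. The errors the abstract describes would push the observed values away from this idealized model.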