next up previous
Next: ICPSR PROCESSING INFORMATION Up: No Title Previous: CONTINUOUS MONITORING STUDY DESIGN

SAMPLING INFORMATION FOR CONTINUOUS MONITORING

                SAMPLING INFORMATION FOR CONTINUOUS MONITORING

          The telephone survey design for continuous monitoring involved
     RDD sample design issues of some complexity. The sample objective was
     a uniform number of interviews in each "week" (a 17 day interviewing
     period) of a 46 week study period. We should be able to treat each of
     the weekly samples more or less independently. That is, the user
     should be able to take the weekly samples and aggregate them in
     various ways to represent time periods (months, quarters, pre-primary
     intervals) of interest to their research.

          The constraints on the sample design were also clear: time and
     money. Given the strong relationship between time and cost in
     telephone interviewing operations, the chosen sample design had to be
     one which minimized the amount of time that interviewers must spend
     in reaching respondent households. A proven method for reducing
     contact time is the two-stage RDD design originally suggested by
     Warren Mitofsky and Joe Waksberg.

          Very briefly, this design utilizes the A.T.& T. listing of
     telephone central office codes. Each record on the listing is an area
     code-central office code combination, e.g., 313-764; thus each record
     represents 10,000 distinct telephone numbers (e.g., 313-764-0000
     through 313-764-9999). Another way to put it is that each record
     represents 100 clusters of 100 consecutive telephone numbers
     (0000-0099,0100-0199, etc.).

          Each primary selection is one such group of 100 consecutive
     telephone numbers, designated by randomly generating a single 4-digit
     number. For example if (313) 764-4424 is generated, the cluster
     containing numbers 4400-4499 is tentatively designated for selection.
     These primary numbers are called; if they are not working household
     numbers, the clusters in which they fall are not selected. If they
     are working household numbers, their clusters are selected into the
     sample and a specified number of additional four-digit numbers within
     the same cluster is generated. For example, if the desired sample
     size is nine, eight more 4-digit numbers within the hundred series
     would be selected.

          While the Waksberg-Mitofsky method is cost-effective, it
     sacrifices something in precision because of its clustered nature.
     The NES implementation of this design for Continuous Monitoring
     spreads the use of each primary stage sample one-hundred series over
     the 46 week course of the study -- maximizing the distribution of the
     sample and minimizing the clustering effects for short time interval
     analyses. At the end of the 46-week study period, the complete sample
     will contain roughly 700 primary stage numbers (clusters) of 5
     interviews each.

          In the Waksberg-Mitofsky two-stage selection, the several
     numbers selected from each cluster at the second stage are used
     within the same sample period. In the NES variation, each cluster
     that is selected produces one telephone number per week. (This
     telephone number translates into a label of a sample coversheet).
     When there is an interview or some other kind of final disposition of
     the coversheet, the cluster is not used further in the sample week.
     Clusters of primary numbers are in the sample for two weeks, then
     rotate out for 8 weeks. The assignment of clusters produces a 50%
     overlap from week to week. The intent of this overlap is to introduce
     some correlation among observations for short, adjacent intervals of
     time. If successful, the time 1 to time 2 correlations will yield
     improved precision for estimates of change between the two periods.

          As the study design is implemented, it is important to note that
     certain coversheet dispositions mean that a cluster can be
     "re-dialed" within the sample week. For example, if the telephone
     number on the coversheet is of a business (non-household) then the
     next number in the primary number series can replace it.  Other
     redialing situations are non-working numbers or non-sample residences
     (institutions). Some sample coversheets, even though they do not
     yield an interview, cannot be replaced. These include refusals,
     non-interviews of valid respondents, and households with no eligible
     respondents.

          Once a household is reached, selection of respondents within the
     household proceeds by listing all persons within that household
     (male, oldest to youngest; female, oldest to youngest); determining
     which of the residents are eligible (18 on election day, U. S.
     citizens) and using a Kish selection table to randomly determine the
     respondents.

          The method results in slightly unequal probabilities of
     selection. That is, respondents in households with TWO telephone
     numbers are twice as likely to be selected as those with only one
     telephone number. Also, respondents from households with TWO eligible
     residents are only half as likely to be selected as respondents who
     are the only eligible adult.

          The NES staff has compared weighted and unweighted frequency
     distributions for a number of variables. Results indicate that the
     data may be treated as an equal probability sample, i.e., that
     selection weights are not required. Those interested in a further
     discussion of the point should see working paper #5. (See Appendix C).


Walter Mebane
Mon Dec 3 06:08:49 EST 2001