IX. Pretesting

Webpage last modified: 2008-Oct-04

Introduction

Pretesting involves a series of activities designed to evaluate a survey instrument's capacity to collect the desired data, the capabilities of the selected mode of data collection, and/or the overall adequacy of the field procedures. Throughout this text we refer to a "pretest" as the collection of all those techniques and activities that allow researchers to evaluate survey questions and/or survey procedures before data collection begins. In contrast, we use the term "pilot study" to refer to pretesting procedures that employ all the procedures and materials involved in data collection-- i.e., a dress rehearsal before the actual data collection begins. Table 1 is a summary of the most commonly used pretesting techniques, such as pilot study, cognitive interviews, focus groups, and behavior coding.

Differences across cultures/countries may or may not exist at any stage of the data collection process. When multiple languages are used in the same survey, pretesting the different language versions is an essential part of ensuring measurement equivalence and of testing translations with the target population (see Translation). In addition, it is often not possible to employ the same mode of data collection across different countries participating in a cross-national project. This may be due to several factors; cost and infrastructure constraints and literacy issues are the most common. It is important to test in advance the suitability of the selected mode for the survey topic and population (see Data Collection). Pretesting techniques may have limited application in a given context and culture. Research into how strategies may need to be tailored to suit different populations is only beginning to be undertaken systematically.

Guidelines

Goal: To ensure that the versions of the survey instrument adequately convey the intended research questions, measure the intended attitudes, values, reported facts and/or behaviors, and ascertain that the collections of data are conducted according to specified study protocols in every country and in every language.

  1. Identify what the pretest should achieve and choose a pretest design that best fits the study goals.
    Rationale
    Procedural steps
    Lessons learned
  2. Combine pretesting techniques to create a comprehensive design plan that takes advantage of the strengths and minimizes the weaknesses of each method.
    Rationale
    Procedural steps
    Lessons learned
  3. Train or hire staff members who are able to adequately implement the chosen pretesting technique(s).
    Rationale
    Procedural steps
    Lessons learned
  4. Conduct the pilot study in the same mode of data collection (interviewer-administered or self-administered) as the survey.
    Rationale
    Procedural steps
    Lessons learned
  5. Conduct the pretest with the same target population as the target population for the survey.
    Rationale
    Procedural steps
    Lessons learned
  6. Pretest the survey instrument or part of it in each country and in each language.
    Rationale
    Procedural steps
    Lessons learned
  7. Evaluate the results of the pretest.
    Rationale
    Procedural steps
    Lessons learned
  8. Fully document the pretesting protocol and findings.
    Rationale
    Procedural steps
    Lessons learned

Glossary

Audio computer-assisted self-interview (ACASI)
A mode in which the respondent uses a computer which displays the question on screen and plays audio recordings of the questions to the respondent, who then enters his/her answers
Behavior coding
Systematic coding of the interviewer-respondent interaction in order to identify problems that arise during the question-answer process
Cognitive interviews
A pretesting method designed to uncover problems in survey items by having respondents think out loud while answering the question
(Cognitive) vignettes
Brief stories/scenarios describing hypothetical situations or persons and their behaviors, to which respondents are asked to react in order to allow the researcher to explore contextual influences on respondents' response formation processes.
Context effects
The impact of question context, such as the order or layout of questions, on survey responses
Coordinating center
A research center that facilitates and organizes cross-national research activities
Double-barreled (questions)
Survey questions that inadvertently ask about two objects at once
Field pilot study
Small-scale rehearsals of the data collection conducted before the main survey
Focus group
Small group discussions under the guidance of a moderator, often used in qualitative research that can also be used to test survey questionnaires and survey protocols
Item nonresponse/ item-missing data/ missing data
The lack of information on individual data items for a sample element where other data items were successfully obtained
Loaded questions/words
Questions that are worded in such a manner as to invite respondents to respond in a particular way
Measurement equivalence
Equivalence of the calibration system used in the questionnaire and the translation
Mode
Method of data collection
Pilot study
A pretesting technique that involves all procedures and materials that will be employed in data collection; a dress rehearsal before the actual data collection begins
Pretesting
A collection of techniques and activities that allow researchers to evaluate survey questions and/or survey procedures before data collection begins
Primacy
Context effects in which the placement of the item at the beginning of a list of response options increases the likelihood that it will be selected by the respondent
Recency
Context effects in which the placement of the item at the end of a list of response options increases the likelihood that it will be selected by the respondent
Response distributions
A description of the values and probabilities that a particular response was selected
Sampling frame/sample frame
Lists or materials used to identify all elements (e.g., persons, households, establishments) of a survey population from which the sample will be selected. These lists or materials can include maps of areas in which the elements can be found, lists of members of a professional association and registries of addresses or persons.
Target population
The finite population for which the survey sponsor wants to make inferences using the sample statistics.

References

[1] Belson, W. A. (1981). The design and understanding of survey questions. Aldershot, UK: Gower.

[2] Bickart, B.& Felcher, M. (1996). Expanding and enhancing the use of verbal protocols in survey research. In N. Schwarz & S. Sudman (Eds.), Answering Questions: Methodology for Determining the Cognitive and Communicative Processes in Survey Research. San Francisco: Jossey-Bass, 115-142.

[3] Bloom, L. (1981). The linguistic shaping of thought. Hillsdale, NJ: Lawrence Erlbaum Associates.

[4] Bolton, R., & Bronkhorst, T. (1996). Questionnaire Pretesting: Computer-Assisted Coding of Concurrent Protocols. In N. Schwarz & S. Sudman (Eds.), Answering Questions: Methodology for Determining Cognitive and Communicative Processes in Survey Research. San Francisco: Jossey-Bass.

[5] Davis, D. K., Blair, J., Crawley, E. L., Craig, K. M., Rappaport, M. S. B., Baker, C. A., & Hanson, S. L. (2001). Census 2000 quality survey instrument pilot test. Report submitted by Development Associates, Inc., Arlington, VA. Washington, DC: US Census Bureau, Center for Survey Methods Research.

[6] Davis, W., & DeMaio, T. (1993). Comparing the think-aloud interviewing technique with standard interviewing in the redesign of a dietary recall questionnaire. Alexandria, VA: ASA Section on Survey Research Methods.

[7] Fowler, F.J. Jr. (1995). Improving survey questions. Vol. 38, Applied social research methods series. Thousand Oaks, CA: Sage Publications.

[8] Gfroerer, J., Eyerman, J., & Chromy, J. (Eds.) (2002). Redesigning an ongoing national household survey: Methodological issues. DHHS Publication No. SMA 03-3768. Rockville, MD: Substance Abuse and Mental Health Services Administration, Office of Applied Studies.

[9] Goerman, P., Caspar, R., Sha, M., McAvinchey, G., & Quiroz, R. (2007). Census bilingual questionnaire research, Final round 2 report. Retrieved June 3, 2008, from http://www.census.gov/srd/papers/pdf/ssm2007-27.pdf

[10] Groves, R., Fowler, F. Couper, M. Lepkowski, J., Singer, E. & Tourangeau, R. (2004). Survey methodology. Hoboken, NJ: Wiley & Sons.

[11] Hansen, S.E., and Couper, M. (2004). Usability Testing as a Means of Evaluating Computer Assisted Survey Instruments. In S. Presser et al. (Eds.), Methods for Testing and Evaluating Survey Questionnaires. New York: John Wiley & Sons.

[12] Hunt, S.D., Sparkman, R. D., & Wilcox, J. B. (1982). The pretest in survey research: Issues and preliminary findings. Journal of Marketing Research 19, 269-273.

[13] Mangione, T., Fowler, F. J., & Oksenberg, L. (1992). Question characteristics and interviewer effects. Journal of Official Statistics 8(3), 293-307.

[14] Martin. E., & Gerber, E. (2004). Results from recent methodological research on the Hispanic origin and race questions. Paper presented to the Population Association of America.

[15] Miller, K., Fitzgerald, R., Caspar, D., M., Gray, M., Nunes, C., Padilla, J.L., et al. (2008). Design and analysis of cognitive interviews for cross-national testing. Paper to be presented at the International Conference on Survey Methods in Multinational, Multiregional and Multicultural Contexts, Berlin, Germany.

[16] Pan, Y. (2004). Cognitive interviews in languages other than English: Methodological and research issues. Proceedings of the American Statistical Association, Section on Survey Research Methods.

[17] Redline, C., R. Smiley, M. L., DeMaio, T., & Dillman, D. (1999). Beyond concurrent interviews: An evaluation of cognitive interviewing techniques for self-administered questionnaires. Proceedings of the American Statistical Association, Section on Survey Research Methods.

[18] Reeve, B., & Mâsse, L. (2004). Item Response Theory (IRT) Modeling for Questionnaire Evaluation. In S. Presser et al (Eds.), Methods for Testing and Evaluating Survey Questionnaires. New York: Wiley.

[19] Rossi, P., & Anderson, A. (1982). The factorial survey approach: An introduction. In P. Rossi & S. Nock (Eds.), Measuring social judgments: The factorial survey approach. Thousand Oaks, CA: Sage.

[20] Saris, W., van der Veld, W., & Gallhofer. I. (2004). Development and Improvement of Questionnaires Using Predictions of Reliability and Validity. In S Presser et al (Eds.), Methods for Testing and Evaluating Survey Questionnaires. New York: Wiley.

[21] Snijkers, G. (2002). Cognitive laboratory experiences on pretesting computerized questionnaires and data quality. Heerlen: Statistics Netherlands.

[22] Tarnai, J., & Moore, D. (2004). Methods for Testing and Evaluating Computer-Assisted Questionnaires. In S. Presser et al (Eds.), Methods for Testing and Evaluating Survey Questionnaires. New York: Wiley.

[23] Willis, G. B., & Lessler, J. T. (1999). Question appraisal system BRFSS-QAS: A guide for systematically evaluating survey question wording. Report prepared for CDC/NCCDPHP/Division of Adult and Community Health Behavioral Surveillance Branch.

Return to top

Previous chapter | Next chapter | Home

© 2008 The authors of the Guidelines hold the copyright. Please contact us if you wish to publish any of this material in any form.