Webpage last modified: 2008-Oct-04
Pretesting involves a series of activities designed to evaluate a survey instrument's capacity to collect the desired data, the capabilities of the selected mode of data collection, and/or the overall adequacy of the field procedures. Throughout this text we refer to a "pretest" as the collection of all those techniques and activities that allow researchers to evaluate survey questions and/or survey procedures before data collection begins. In contrast, we use the term "pilot study" to refer to pretesting procedures that employ all the procedures and materials involved in data collection-- i.e., a dress rehearsal before the actual data collection begins. Table 1 is a summary of the most commonly used pretesting techniques, such as pilot study, cognitive interviews, focus groups, and behavior coding.
Differences across cultures/countries may or may not exist at any stage of the data collection process. When multiple languages are used in the same survey, pretesting the different language versions is an essential part of ensuring measurement equivalence and of testing translations with the target population (see Translation). In addition, it is often not possible to employ the same mode of data collection across different countries participating in a cross-national project. This may be due to several factors; cost and infrastructure constraints and literacy issues are the most common. It is important to test in advance the suitability of the selected mode for the survey topic and population (see Data Collection). Pretesting techniques may have limited application in a given context and culture. Research into how strategies may need to be tailored to suit different populations is only beginning to be undertaken systematically.
Goal: To ensure that the versions of the survey instrument adequately convey the intended research questions, measure the intended attitudes, values, reported facts and/or behaviors, and ascertain that the collections of data are conducted according to specified study protocols in every country and in every language.
Pretesting is largely cost and time driven. To make the best use of the various pretesting techniques, it should be determined in advance what questions have to be addressed — whether the researchers want to test all field procedures, or only the survey instrument (or parts of it), or the equivalence of the survey instrument across languages and modes of data collection. Pretesting should be done in each country participating in the research; however, pretesting techniques may vary across countries, depending on testing traditions and availability of resources and trained staff. Even if some or all of the questions have been used in other studies, pretesting for the local context is necessary in order to assess their performance in the mode and question order of the current study, as well as with the target population.
Table 1 presents the most commonly used pretesting techniques. It provides a brief description, strength and weaknesses, and context in which a particular technique is typically used.
Pretesting techniques often supplement one another, and can logically be combined to maximize the efficiency of the pretest design (see Table 1). It is important to take cultural norms/traditions, as well as certain interviewer characteristics (see Data Collection), into account when choosing pretesting methods. The most appropriate combinations of pretesting techniques may vary across countries involved in the study. This should be taken into account when results from the different pretests are evaluated and compared.
The selected pretesting procedures may require skills other than those that available interviewers possess. For example, cognitive interviewing requires an interviewing style which is different from traditional standardized interviewing. It is possible for traditional interviewers to conduct cognitive interviews, but they require additional training to ensure they do not approach the cognitive interview as they would a standardized interview. Sufficient time should be allowed to train staff members and develop protocols that correspond to the selected pretest design.
Interviewer-administered questionnaires involve listening; self-administered questionnaires involve reading. Interviewer-administered questionnaires involve social interaction between the interviewer and the respondent; self-administered do not. Interviewer-administered questionnaires do not require the respondent to navigate and worry about skip patterns; self-administered do.
Interviewer-administered and self-administered questionnaires produce different context effects (e.g., recency and primacy) and provoke different needs for socially desirable responding (see Data Collection). In order to determine how well proposed procedures will work in the field, pilot studies should be conducted in the same mode as the final survey.
In order to most effectively pretest the survey instrument or field procedures, conduct the pretest with respondents from the intended target population or, as relevant, a sub-group within the target population. Similarly, the population of a pilot study should be an adequate reflection of the survey target population. For example, if the survey design involves oversampling of certain ethnic groups, the pretest sample should also include reasonable representation of these groups. A pretest with sample persons from the target population will most accurately reflect what will happen during real data collection in terms of cooperation, respondent performance, total interview length, questionnaire performance, survey costs, etc.
For all pretesting techniques:
For pilot studies:
Countries/cultures differ by language and/or cultural systems. When possible, the natural flow of the survey instrument should be tested for each culture and language in order to avoid awkward conversational situations, question order with unpredictable culture-dependent context effects, question repetition not intended in the source, or other culture-specific problems.
The goal of the pretest is to identify problems in the questionnaire and study design in each country. The results of the pretest have to be evaluated to determine the best way to fix existing problems without introducing new ones. Changes to the survey instrument and design should be considered in the context of the whole study—changes that fix a problem in one country may introduce a problem in another. The coordinating center should decide whether minor differences that still preserve the measurement equivalence of the survey instrument across countries can be tolerated (see Translation). Any introduced changes should also be pretested to avoid unforeseen errors (also see Survey Instrument Design).
Providing a permanent record of problems encountered during the pretest(s) and any changes made to the questionnaire, respondent materials, and field procedures aids staff and researchers working on similar studies or on later rounds of the same study.
In a manner consistent across countries:
[1] Belson, W. A. (1981). The design and understanding of survey questions. Aldershot, UK: Gower.
[2] Bickart, B.& Felcher, M. (1996). Expanding and enhancing the use of verbal protocols in survey research. In N. Schwarz & S. Sudman (Eds.), Answering Questions: Methodology for Determining the Cognitive and Communicative Processes in Survey Research. San Francisco: Jossey-Bass, 115-142.
[3] Bloom, L. (1981). The linguistic shaping of thought. Hillsdale, NJ: Lawrence Erlbaum Associates.
[4] Bolton, R., & Bronkhorst, T. (1996). Questionnaire Pretesting: Computer-Assisted Coding of Concurrent Protocols. In N. Schwarz & S. Sudman (Eds.), Answering Questions: Methodology for Determining Cognitive and Communicative Processes in Survey Research. San Francisco: Jossey-Bass.
[5] Davis, D. K., Blair, J., Crawley, E. L., Craig, K. M., Rappaport, M. S. B., Baker, C. A., & Hanson, S. L. (2001). Census 2000 quality survey instrument pilot test. Report submitted by Development Associates, Inc., Arlington, VA. Washington, DC: US Census Bureau, Center for Survey Methods Research.
[6] Davis, W., & DeMaio, T. (1993). Comparing the think-aloud interviewing technique with standard interviewing in the redesign of a dietary recall questionnaire. Alexandria, VA: ASA Section on Survey Research Methods.
[7] Fowler, F.J. Jr. (1995). Improving survey questions. Vol. 38, Applied social research methods series. Thousand Oaks, CA: Sage Publications.
[8] Gfroerer, J., Eyerman, J., & Chromy, J. (Eds.) (2002). Redesigning an ongoing national household survey: Methodological issues. DHHS Publication No. SMA 03-3768. Rockville, MD: Substance Abuse and Mental Health Services Administration, Office of Applied Studies.
[9] Goerman, P., Caspar, R., Sha, M., McAvinchey, G., & Quiroz, R. (2007). Census bilingual questionnaire research, Final round 2 report. Retrieved June 3, 2008, from http://www.census.gov/srd/papers/pdf/ssm2007-27.pdf
[10] Groves, R., Fowler, F. Couper, M. Lepkowski, J., Singer, E. & Tourangeau, R. (2004). Survey methodology. Hoboken, NJ: Wiley & Sons.
[11] Hansen, S.E., and Couper, M. (2004). Usability Testing as a Means of Evaluating Computer Assisted Survey Instruments. In S. Presser et al. (Eds.), Methods for Testing and Evaluating Survey Questionnaires. New York: John Wiley & Sons.
[12] Hunt, S.D., Sparkman, R. D., & Wilcox, J. B. (1982). The pretest in survey research: Issues and preliminary findings. Journal of Marketing Research 19, 269-273.
[13] Mangione, T., Fowler, F. J., & Oksenberg, L. (1992). Question characteristics and interviewer effects. Journal of Official Statistics 8(3), 293-307.
[14] Martin. E., & Gerber, E. (2004). Results from recent methodological research on the Hispanic origin and race questions. Paper presented to the Population Association of America.
[15] Miller, K., Fitzgerald, R., Caspar, D., M., Gray, M., Nunes, C., Padilla, J.L., et al. (2008). Design and analysis of cognitive interviews for cross-national testing. Paper to be presented at the International Conference on Survey Methods in Multinational, Multiregional and Multicultural Contexts, Berlin, Germany.
[16] Pan, Y. (2004). Cognitive interviews in languages other than English: Methodological and research issues. Proceedings of the American Statistical Association, Section on Survey Research Methods.
[17] Redline, C., R. Smiley, M. L., DeMaio, T., & Dillman, D. (1999). Beyond concurrent interviews: An evaluation of cognitive interviewing techniques for self-administered questionnaires. Proceedings of the American Statistical Association, Section on Survey Research Methods.
[18] Reeve, B., & Mâsse, L. (2004). Item Response Theory (IRT) Modeling for Questionnaire Evaluation. In S. Presser et al (Eds.), Methods for Testing and Evaluating Survey Questionnaires. New York: Wiley.
[19] Rossi, P., & Anderson, A. (1982). The factorial survey approach: An introduction. In P. Rossi & S. Nock (Eds.), Measuring social judgments: The factorial survey approach. Thousand Oaks, CA: Sage.
[20] Saris, W., van der Veld, W., & Gallhofer. I. (2004). Development and Improvement of Questionnaires Using Predictions of Reliability and Validity. In S Presser et al (Eds.), Methods for Testing and Evaluating Survey Questionnaires. New York: Wiley.
[21] Snijkers, G. (2002). Cognitive laboratory experiences on pretesting computerized questionnaires and data quality. Heerlen: Statistics Netherlands.
[22] Tarnai, J., & Moore, D. (2004). Methods for Testing and Evaluating Computer-Assisted Questionnaires. In S. Presser et al (Eds.), Methods for Testing and Evaluating Survey Questionnaires. New York: Wiley.
[23] Willis, G. B., & Lessler, J. T. (1999). Question appraisal system BRFSS-QAS: A guide for systematically evaluating survey question wording. Report prepared for CDC/NCCDPHP/Division of Adult and Community Health Behavioral Surveillance Branch.
Previous chapter | Next chapter | Home
© 2008 The authors of the Guidelines hold the copyright. Please contact us if you wish to publish any of this material in any form.