Skip to content

Fieldwork paradata

Additional data collected during the interview process (paradata) are available. These consist of call records, timings data and other information collected by the interviewers during the interview.

Call records

Call record files have information on the number of calls made as well as the issue number, interviewer identifier (scrambled), time and date and the outcome of each call. This is available in the data file w_callrec_ip.

Address response form

Information collected in the address response form (ARF) by interviewers while contacting each household and requesting household members to participate in the survey is available in w_hhsamp_ip. This includes data on the area surrounding the address, the type of accommodation and other information that the interviewer can observe for both responding and non-responding households. Reasons for refusal are also available. Interviewers also record some information about the quality of the interview and persons present during the interview process. This is available along with substantive data collected during adult individual interviews (including proxy interviews) in w_indresp_ip. From Wave 7 onwards the ARF was no longer used.

Timings data files

Timings data files (w_ptimings and w_htimings) include data on the time taken to complete each question and module in the individual and household questionnaires. In IP1, the start and end times are given for blocks of questions, where blocks are one or more question modules. The times are given in seconds. From IP2 onwards the times are given in seconds for individual questions. If the variables are asked in a loop or multi-choice format, the variable name is suffixed with the multi-choice item number or loop iteration count. In Waves 5 to 9 the timings data for interviews completed by web are per screen rather than per question, although most screens contain only a single question. Where there are multiple questions per screen this is documented in the questionnaire. Waves 7 onwards are released in CSV format because the variable names are long strings that are truncated when imported into Stata. From Waves 7 onwards the timings files are w_hhgrid_timings, w_hhint_timings, and w_indint_timings.

The IP11 timings data included an error which has been corrected (see the example Stata code for matching files).

Interviewer characteristics

The interviewer ID w_intnum can be linked to the main survey’s cross-wave file xivdata which contains interviewer characteristics. This file is available from the UK Data Service as a separate dataset (SN 8579), under Special Licence agreement.

Keystroke paradata

For IP11 there is an additional paradata file (k_keystroke_paradata), which contains information automatically recorded from CAPI and web respondents, while they answered the questions in the modules “HMRC consent” and “HMRC consent follow-up” (early and late versions). For each question in these modules the strings in the variables k_keystrokes1 and k_keystrokes2 record the question name, the response category selected, and the timestamp when the interviewer or respondent clicked ‘next’. The variable k_keystrokes1 is truncated for some cases and the remainder of the string can be found in k_keystrokes2.


Email newsletter

Sign up to our newsletter