Methodological issues of using administrative data to improve the quality of survey data 1 |
|
Convenor | Dr Emanuela Sala (Dipartimento di Sociologia e Ricerca Sociale, Università di MIlano Bicocca ) |
Coordinator 1 | Dr Jonathan Burton (ISER, University of Essex) |
Coordinator 2 | Dr Gundi Knies (ISER, University of Essex) |
Questions on sensitive topics like income often produce considerable item-nonresponse or measurement error. While several studies examine item-nonresponse little is known about misreporting. Misreporting might be explained by social desirability bias, which may lead to overreporting or underreporting of characteristics.
We use interviewer and respondent characteristics to estimate their impact on the extent of deviation between reported and recorded incomes. To do so we combine data from the German National Educational Panel Study’s adult cohort with administrative data from the Federal Employment Agency. These longitudinal data stem from mandatory notifications of employers to the social security system.
We consider how credit records may be used to gauge and enhance survey data quality. The analysis focuses on a recent survey about debt collection which used de-identified credit records as a sampling frame, including to oversample consumers considered more likely to have debt collection experiences. The extensive information in the credit-bureau data for both respondents and nonrespondents provides a strong basis for investigating nonresponse bias. Comparing self-reported demographic data for respondents with auxiliary demographic information in the credit-bureau data sheds light on the reliability of such auxiliary data. Similarly, we compare self-reported financial data.
This project analyzes recall error in the year of retirement by comparing self-reports of respondents of the Survey of Health, Ageing and Retirement in Europe (SHARE) with administrative records of the same person provided by the German Pension Fund. A comparison of the two data sources show that 36.5 % of the respondents misreported the year they retired. Based on research from cognitive psychology, different determinants to explain the error are identified. The results show that cognitive abilities and characteristics of the event (as its salience or the time passed since the retirement) are correlated with the recall error.
The paper examines recall error in life course data by comparing survey data and administrative data month by month. A multilevel approach is used to determine the relative importance of factors relating to the errors, such as length of the recall period or length of the episode, and characteristics of the respondents, such as gender, education level, age, region and structural complexity of the life course.