Representing the population: Improving European sampling practices 2 |
|
Chair | Dr Annette Scherpenzeel (SHARE – Survey of Health, Ageing and Retirement in Europe ) |
Coordinator 1 | Mrs Johanna Bristle (SHARE – Survey of Health, Ageing and Retirement in Europe) |
Coordinator 2 | Dr Stefan Zins (ESS-GESIS) |
In average, about 1.3 percent of the entire European population lives in retirement and nursing homes, prisons or refugee hostels. The institutional population is usually excluded in most social surveys. Survey researchers justify the non-coverage with practical concerns and the assumed higher costs of data collection. Following Tourangeau’s scheme of hard-to-survey populations, the institutional population can be classified as hard-to-sample, hard-to-contact and hard-to-interview in comparison with the majority of private population. Our paper focuses on the relevance of people living in retirement and nursing homes for surveys. This subgroup adds up to 40 percent of the entire institutional population. Furthermore, within the European population aged 85 years or older 13 percent live in an institution; far more than in every other age cohort.
The paper is motivated by the overarching question whether surveys of the general population need additional efforts to extent the coverage due to the distinctiveness of the institutional population. Residents living in retirement and nursing homes differ significantly from their non-institutionalized counterparts in socio-demographic, socio-economic, and medical characteristics, as the 2011 European censuses and some survey research in European countries and the USA prove. Those differences could limit the ability to do inference on the whole population, if estimates are based on sample data that excludes the institutional population. Taking the deviations in demographic, economic, and geographic variables as a starting point, we conduct a semisynthetic simulation study. The simulation is based on two assumptions. The first assumption is that the population living in retirement and nursing homes deviates significantly from the respective age cohorts living in private households. They differ mainly in the distribution of gender, functional and cognitive impairments, health status, civil status, and social networks. The second assumption is that the institutional population itself is rather homogenous and depicts a lower degree of variance than the general population. On the basis of these two assumptions, we use the Monte-Carlo simulation study to detect a possible bias of sample estimates if the institutional population is systematically underrepresented or excluded from the target population. The results also allow us to assess the possibilities and limitations of using survey weights to compensate for frame imperfection caused by the exclusion of the institutional population.
EU-SILC in Austria is a sample survey on income and living conditions of persons living in private households carried out on a yearly basis and using an integrated rotational design. This means that every year about one fourth of the sample is replaced by a new rotational group. EU-SILC is the main source of indicators on household income and living conditions such as the at-risk-of-poverty rate (AROP) defined as the rate of persons living in private households with an equivalised disposable income below 60% of the median.
Since participation in the EU-SILC survey is voluntary unit nonresponse inevitably happens and is most common in the first year of the survey. If response patterns differ for characteristics that are highly correlated with the main EU-SILC indicators such as AROP unit nonresponse bias will occur.
From 2012 onwards EU-SILC in Austria has been using income register data from administrative sources that are linked to the sample on micro-level as a source for collecting information about most components of the household income. Since this information is also available for the sampling frame it can be used for unit nonresponse analysis and the computation of weights. One of the most important objectives of the weighting procedure of EU-SILC is to counter potential unit nonresponse bias related to the estimators of the main indicators.
The present work compares procedures for nonresponse weighting based on inverse response probabilities or calibration. The probability of taking part in the survey can be estimated by various methods, for instance logistic regressions or pattern recognition techniques such as artificial neural networks. These methods rely heavily on the quality of register data available for respondents and nonrespondents. Calibration methods adjust the weights to external marginal distributions that also rely on administrative sources.
Finally, a Monte Carlo simulation is carried out to assess unit nonresponse bias by comparing estimators based on different weighting procedures.