
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>doi: 10.3233/sji-190518
Data integration is becoming a crucial task in National Statistical Institutes in order to exploit the information provided by already existing data sources. Here the focus is on statistical matching methods; they are designed to integrate data stemming out from traditional sample surveys referred to the same target population. In particular, this work shows how popular statistical learning techniques can be beneficial for matching purposes. Two proposals are presented, having a different final scope: the creation of a “fused” data set or the assessment of the uncertainty due to the typical statistical matching scenario. The characteristics of these procedures are investigated through a series of simulations and in an application to real survey data. The achieved results are encouraging and show that some statistical learning techniques can be very effective in exploiting the information provided by already existing survey data, permitting a reduction of the uncertainty determined by the typical statistical matching setting.
| citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 6 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
