Unbiased regression estimation under correlated linkage errors
Linkage errors can occur when probability-based methods are used to link records from two or more distinct data sets corresponding to the same target population. Recent research on allowing for these errors when carrying out regression analysis based on linked data assumes that the linkage errors are independent when more than two data sets are used to generate these data. In this paper, we extend these results to accommodate the more realistic scenario of dependent linkage errors. Our simulation results show that an incorrect assumption of independent linkage errors can lead to insufficient linkage error bias correction, while an approach that allows for correlated linkage errors appears to overcome this problem.