Kim, G. and Chambers, Ray, Regression Analysis under Probabilistic Multi-Linkage, Centre for Statistical and Survey Methodology, University of Wollongong, Working Paper 10-11, 2011, 15.
Linkage errors can occur when probability-based methods are used to link records from two distinct data sets corresponding to the same target population. Current approaches to modifying standard methods of regression analysis to allow for these errors only deal with the case of two linked data sets and assume that the linkage process is complete, i.e. all records on the two data sets are linked. This paper extends these ideas to accommodate the situation when more than two data sets are probabilistically linked and the linkage is incomplete.