Centre for Statistical & Survey Methodology Working Paper Series

Publication Date



Linkage errors can occur when probability-based methods are used to link records from two distinct data sets corresponding to the same target population. Current approaches to modifying standard methods of regression analysis to allow for these errors only deal with the case of two linked data sets and assume that the linkage process is complete, i.e. all records on the two data sets are linked. This paper extends these ideas to accommodate the situation when more than two data sets are probabilistically linked and the linkage is incomplete.