
Record linkage is increasingly used, especially in medical studies, to combine data from different databases that refer to the same entities. The linked data can bring analysts novel and valuable knowledge that is impossible to obtain from a single database. However, linkage errors are usually unavoidable, regardless of record linkage methods, and ignoring these errors may lead to biased estimates. While different methods have been developed to deal with the linkage errors in the generalized linear model, there is not much interest on Cox regression model, although this is one of the most important statistical models in clinical and epidemiological research. In this work, we propose an adjusted estimating equation for secondary Cox regression analysis, where linked data have been prepared by a third‐party operator, and no information on matching variables is available to the analyst. Through a Monte Carlo simulation study, the proposed method is shown to lead to substantial bias reductions in the estimation of the parameters of the Cox model caused by false links. An asymptotically unbiased variance estimator for the adjusted estimators of Cox regression coefficients is also proposed. Finally, the proposed method is applied to a linked database from the Brest stroke registry in France.
Models, Statistical, 330, [SDV]Life Sciences [q-bio], Applications of statistics to biology and medical sciences; meta analysis, 519, [SDV] Life Sciences [q-bio], adjusted estimating equation, linkage error, Bias, Data Interpretation, Statistical, Linear Models, Humans, Regression Analysis, Computer Simulation, secondary analysis, variance estimation, Cox regression, Semantic Web
Models, Statistical, 330, [SDV]Life Sciences [q-bio], Applications of statistics to biology and medical sciences; meta analysis, 519, [SDV] Life Sciences [q-bio], adjusted estimating equation, linkage error, Bias, Data Interpretation, Statistical, Linear Models, Humans, Regression Analysis, Computer Simulation, secondary analysis, variance estimation, Cox regression, Semantic Web
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
