The University of Southampton
University of Southampton Institutional Repository

Cox regression with linked data

Cox regression with linked data
Cox regression with linked data
Record linkage is increasingly used, especially in medical studies, to combine data from different databases that refer to the same entities. The linked data can bring analysts novel and valuable knowledge that is impossible to obtain from a single database. However, linkage errors are usually unavoidable, regardless of record linkage methods, and ignoring these errors may lead to biased estimates. While different methods have been developed to deal with the linkage errors in the generalized linear model, there is not much interest on Cox regression model, although this is one of the most important statistical models in clinical and epidemiological research. In this work, we propose an adjusted estimating equation for secondary Cox regression analysis, where linked data have been prepared by a third-party operator, and no information on matching variables is available to the analyst. Through a Monte Carlo simulation study, the proposed method is shown to {lead to substantial bias reductions in the estimation of the parameters of the Cox model} caused by false links. An asymptotically unbiased variance estimator for the adjusted estimators of Cox regression coefficients is also proposed. Finally, the proposed method is applied to a linked database from the Brest stroke registry in France.
adjusted estimating equation, cox regression, linkage error, secondary analysis, variance estimation
0277-6715
Vo, Thanh Huan
554c7447-a8ae-4c2d-9803-7433a2c57d22
Garès, Valérie
73853505-c0d3-4f20-89e7-2fdb1641637b
Zhang, Li-Chun
a5d48518-7f71-4ed9-bdcb-6585c2da3649
Happe, André
ba2540cb-498c-4553-92d4-10251c19726a
Oger, Emmanuel
c64ec62b-e6f4-4769-9afb-f326da43eba5
Paquelet, Stéphane
c1ed0dad-34fa-4c7f-a5db-cb9fb6e0c3d8
Chauvet, Guillaume
6be88507-a2b2-41cb-9779-e82ca2291e0b
Vo, Thanh Huan
554c7447-a8ae-4c2d-9803-7433a2c57d22
Garès, Valérie
73853505-c0d3-4f20-89e7-2fdb1641637b
Zhang, Li-Chun
a5d48518-7f71-4ed9-bdcb-6585c2da3649
Happe, André
ba2540cb-498c-4553-92d4-10251c19726a
Oger, Emmanuel
c64ec62b-e6f4-4769-9afb-f326da43eba5
Paquelet, Stéphane
c1ed0dad-34fa-4c7f-a5db-cb9fb6e0c3d8
Chauvet, Guillaume
6be88507-a2b2-41cb-9779-e82ca2291e0b

Vo, Thanh Huan, Garès, Valérie, Zhang, Li-Chun, Happe, André, Oger, Emmanuel, Paquelet, Stéphane and Chauvet, Guillaume (2023) Cox regression with linked data. Statistics in Medicine. (In Press)

Record type: Article

Abstract

Record linkage is increasingly used, especially in medical studies, to combine data from different databases that refer to the same entities. The linked data can bring analysts novel and valuable knowledge that is impossible to obtain from a single database. However, linkage errors are usually unavoidable, regardless of record linkage methods, and ignoring these errors may lead to biased estimates. While different methods have been developed to deal with the linkage errors in the generalized linear model, there is not much interest on Cox regression model, although this is one of the most important statistical models in clinical and epidemiological research. In this work, we propose an adjusted estimating equation for secondary Cox regression analysis, where linked data have been prepared by a third-party operator, and no information on matching variables is available to the analyst. Through a Monte Carlo simulation study, the proposed method is shown to {lead to substantial bias reductions in the estimation of the parameters of the Cox model} caused by false links. An asymptotically unbiased variance estimator for the adjusted estimators of Cox regression coefficients is also proposed. Finally, the proposed method is applied to a linked database from the Brest stroke registry in France.

Text
LinkageCoxRegression - Accepted Manuscript
Download (591kB)

More information

Accepted/In Press date: 1 November 2023
Keywords: adjusted estimating equation, cox regression, linkage error, secondary analysis, variance estimation

Identifiers

Local EPrints ID: 484072
URI: http://eprints.soton.ac.uk/id/eprint/484072
ISSN: 0277-6715
PURE UUID: ac3f608b-d5fe-4289-ab9c-eee6643282d0
ORCID for Li-Chun Zhang: ORCID iD orcid.org/0000-0002-3944-9484

Catalogue record

Date deposited: 09 Nov 2023 18:09
Last modified: 01 Nov 2024 05:01

Export record

Contributors

Author: Thanh Huan Vo
Author: Valérie Garès
Author: Li-Chun Zhang ORCID iD
Author: André Happe
Author: Emmanuel Oger
Author: Stéphane Paquelet
Author: Guillaume Chauvet

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×