University of Southampton Institutional Repository

Validation and utility of a computerized South Asian names and group recognition algorithm in ascertaining South Asian ethnicity in the national renal registry

Nitsch, D., Kadalayil, L., Mangtani, P., Steenkamp, R., Ansell, D., Tomson, C., Dos Santos Silva, I. and Roderick, P. (2009) Validation and utility of a computerized South Asian names and group recognition algorithm in ascertaining South Asian ethnicity in the national renal registry. QJM: An International Journal of Medicine, 102 (12), pp. 865-872. (doi:10.1093/qjmed/hcp142).

Record type: Article


Background: the UK Renal Registry (UKRR) reports on equity and quality of renal replacement therapy (RRT). Ethnic origin is a key variable, but it is recorded for only 76% of patients overall in the UKRR, and its completeness varies widely between renal centres. Most South Asians have distinctive names.
Aim: to test the relative performance of a computerized name recognition algorithm (SANGRA) in identifying South Asian names using the UKRR database.
Design: cross-sectional study of patients (n = 27 832) starting RRT in 50 renal centres in England and Wales from 1997 to 2005.
Methods: kappa statistics were used to assess the degree of agreement of SANGRA coding with existing ethnicity information in UKRR centres.
Results: in 12 centres outside London (number of patients = 7555) with 11% (n = 747) self-ascribed South Asian ethnicity, the level of agreement between SANGRA and self-ascribed ethnicity was high (κ = 0.91, 95% CI 0.90–0.93). In two London centres (n = 779) with 21% (n = 165) self-ascribed South Asian ethnicity, SANGRA's agreement with self-ascribed ethnicity was lower (κ = 0.60, 95% CI 0.54–0.67), primarily due to difficulties in distinguishing between South Asian ethnicity and other non-White ethnic minorities. Use of SANGRA increased the number of patients defined as South Asian from 1650 to 2076, with no overall change in the percentage of South Asians. Kappa values showed no obvious association with the degree of missing data in returns to the UKRR.
Conclusion: SANGRA's use, taking into account its lower validity in London, allows increased power and generalizability both for ethnic-specific analyses and for analyses where adjustment for ethnic origin is important.
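
Note on the kappa statistic: the agreement measure named in the Methods can be illustrated with a minimal sketch of Cohen's kappa for two binary classifications (for example, name-based coding versus self-ascribed ethnicity). The function name `cohens_kappa` and the counts below are hypothetical and chosen only for illustration; they are not taken from the study, and this is not the authors' code.

```python
def cohens_kappa(both_yes, a_only, b_only, both_no):
    """Cohen's kappa for two binary raters, given the cells of a 2x2 table.

    both_yes: both methods code the patient as South Asian
    a_only:   only method A codes the patient as South Asian
    b_only:   only method B codes the patient as South Asian
    both_no:  neither method codes the patient as South Asian
    """
    n = both_yes + a_only + b_only + both_no
    if n == 0:
        raise ValueError("the table is empty")

    # Observed proportion of patients on whom the two methods agree.
    observed = (both_yes + both_no) / n

    # Agreement expected by chance, from each method's marginal rates.
    a_yes = (both_yes + a_only) / n
    b_yes = (both_yes + b_only) / n
    expected = a_yes * b_yes + (1 - a_yes) * (1 - b_yes)

    if expected == 1:
        raise ValueError("kappa is undefined when chance agreement is 1")

    # Kappa rescales observed agreement so that 0 means chance-level
    # agreement and 1 means perfect agreement.
    return (observed - expected) / (1 - expected)


# Hypothetical counts, for illustration only (not study data).
kappa = cohens_kappa(both_yes=40, a_only=10, b_only=10, both_no=40)
print(round(kappa, 2))
```

Because kappa corrects for chance agreement, it is more informative than raw percentage agreement when one category (here, South Asian ethnicity) is a minority of the sample.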

Full text not available from this repository.

More information

Published date: December 2009


Local EPrints ID: 72935
ISSN: 1460-2725
PURE UUID: b1dda863-cae7-4835-8e01-f5a81571a1ae

Catalogue record

Date deposited: 25 Feb 2010
Last modified: 18 Jul 2017 23:52

Author: D. Nitsch
Author: L. Kadalayil
Author: P. Mangtani
Author: R. Steenkamp
Author: D. Ansell
Author: C. Tomson
Author: I. Dos Santos Silva
Author: P. Roderick
