Simplifying record linkage: Software and strategy

https://doi.org/10.1016/0010-4825(87)90010-2Get rights and content

Abstract

Although the methodology of record linkage is fairly well developed, there is a need for less expensive methods and simpler software to facilitate trying out different tactics to generate good linkages. The present work has built on a fourth generation language SAS (Statistical Analysis System) with accompanying macroprocessor, to develop a user-friendly and flexible system for both exact and probabilistic matching. The major features of the LINKS system are presented and illustrated using 1979–1984 information from the Manitoba Health Services Commission (MHSC) registry file with the Canadian Mortality Data Base. Initial runs with exact, then probabilistic, matching linked approximately 91% of the Vital Statistics records to corresponding MHSC records. Subsequent modification of parameters improved the linkage to 95%.

References (20)

  • H.B. Newcombe et al.

    Reliability of computerized versus manual searches in a study of the health of Eldorado uranium workers

    Comput. Biol. Med.

    (1983)
  • L.L. Roos et al.

    The art and science of record linkage: methods that work with few identifiers

    Comput. Biol. Med.

    (1986)
  • M.G. Arellano et al.

    The California automated mortality linkage system (CAMLIS)

    Am. J. publ. Hlth

    (1984)
  • D.N. Wentworth et al.

    An evaluation of the security administration master beneficiary record file and the national death index in the ascertainment of vital status

    Am. J. publ. Hlth

    (1983)
  • F. Scheuren

    Methodologic issues in linkage of multiple data bases

  • D. Case et al.

    Large system implementation in SAS: the resource management system using SAS, SAS/FSP, SAS/GRAPH and the Merrill database

  • G.W. Grammas et al.

    Software productivity as a strategic variable

    Interfaces

    (1985)
  • M.E. Smith

    Record linkage: present status and methodology

    J. clin. Comput.

    (1984)
  • SAS Institute
There are more references available in the full text version of this article.

Cited by (31)

  • Use of graph theory measures to identify errors in record linkage

    2014, Computer Methods and Programs in Biomedicine
    Citation Excerpt :

    One organisation estimated the false positive error rate of their linkage, after extensive manual review, at 0.3 per cent [10]. There are two standard approaches to improving overall linkage quality [11]. The first is to focus on the parameters and settings used within the linkage process itself.

  • The BOYS algorithm for determining optimum matching rules

    1993, Computational Statistics and Data Analysis
View all citing articles on Scopus
View full text