The development of a data-matching algorithm to define the ‘case patient’Shelley Cox A B , Rohan Martin A , Piyali Somaia A and Karen Smith A
Australian Health Review 37(1) 54-59 https://doi.org/10.1071/AH11161
Submitted: 19 March 2012 Accepted: 2 July 2012 Published: 21 December 2012
Objectives. To describe a model that matches electronic patient care records within a given case to one or more patients within that case.
Method. This retrospective study included data from all metropolitan Ambulance Victoria electronic patient care records (n = 445 576) for the time period 1 January 2009–31 May 2010. Data were captured via VACIS (Ambulance Victoria, Melbourne, Vic., Australia), an in-field electronic data capture system linked to an integrated data warehouse database. The case patient algorithm included ‘Jaro–Winkler’, ‘Soundex’ and ‘weight matching’ conditions.
Results. The case patient matching algorithm has a sensitivity of 99.98%, a specificity of 99.91% and an overall accuracy of 99.98%.
Conclusions. The case patient algorithm provides Ambulance Victoria with a sophisticated, efficient and highly accurate method of matching patient records within a given case. This method has applicability to other emergency services where unique identifiers are case based rather than patient based.
What is known about the topic? Accurate pre-hospital data that can be linked to patient outcomes is widely accepted as critical to support pre-hospital patient care and system performance.
What does this paper add? There is a paucity of literature describing electronic matching of patient care records at the patient level rather than the case level. Ambulance Victoria has developed a complex yet efficient and highly accurate method for electronically matching patient records, in the absence of a patient-specific unique identifier. Linkage of patient information from multiple patient care records to determine if the records are for the same individual defines the ‘case patient’.
What are the implications for practitioners? This paper describes a model of record linkage where patients are matched within a given case at the patient level as opposed to the case level. This methodology is applicable to other emergency services where unique identifiers are case based.
Additional keywords: ambulance, deterministic linkage, electronic patient care record, pre-hospital, probabilistic linkage, record linkage.
References Newgard CD, Zive D, Malveau S, Leopold R, Worrall W, Sahni R. Developing a statewide emergency medical services database linked to hospital outcomes: a feasibility study. Prehosp Emerg Care 2011; 15 303–19.
| Developing a statewide emergency medical services database linked to hospital outcomes: a feasibility study.CrossRef | 21612384PubMed |
 Cox S, Currell A, Harriss L, Barger B, Cameron P, Smith K. Evaluation of the Victorian state adult pre-hospital trauma triage criteria. Injury 2012; 43 573–81.
| Evaluation of the Victorian state adult pre-hospital trauma triage criteria.CrossRef | 21074157PubMed |
 Cox S, Smith K, Currell A, Harriss L, Barger B, Cameron P. Differentiation of confirmed major trauma patients and potential major trauma patients using pre-hospital trauma triage criteria. Injury 2011; 42 889–95.
| Differentiation of confirmed major trauma patients and potential major trauma patients using pre-hospital trauma triage criteria.CrossRef | 20430387PubMed |
 Li B, Quan H, Fong A, Lu M. Assessing record linkage between health care and Vital Statistics databases using deterministic methods. BMC Health Serv Res 2006; 6 48
| Assessing record linkage between health care and Vital Statistics databases using deterministic methods.CrossRef | 16597337PubMed |
 Dean JM, Vernon DD, Cook L, Nechodom P, Reading J, Suruda A. Probabilistic linkage of computerized ambulance and inpatient hospital discharge records: a potential tool for evaluation of emergency medical services. Ann Emerg Med 2001; 37 616–26.
| Probabilistic linkage of computerized ambulance and inpatient hospital discharge records: a potential tool for evaluation of emergency medical services.CrossRef | 11385330PubMed |
 Matthew AJ. Probabilistic linkage of large public health data files. Stat Med 1995; 14 491–8.
 Newman TB, Brown AN. Use of commercial record linkage software and vital statistics to identify patient deaths. J Am Med Inform Assoc 1997; 4 233–7.
| Use of commercial record linkage software and vital statistics to identify patient deaths.CrossRef | 9147342PubMed |
 Tromp M, Meray N, Ravelli ACJ, Reitsma JB, Bonsel GJ. Ignoring dependency between linking variables and its impact on the outcome of probabilistic record linkage studies. J Am Med Inform Assoc 2008; 15 654–60.
| Ignoring dependency between linking variables and its impact on the outcome of probabilistic record linkage studies.CrossRef | 18579842PubMed |
 Edelman LS, Cook L, Saffle JR. Using probabilistic linkage of multiple databases to describe burn injuries in Utah. J Burn Care Res 2009; 30 983–92.
| 19826268PubMed |
 Sauleau EA, Paumier J-P, Buemi A. Medical record linkage in health information systems by approximate string matching and clustering. BMC Med Inform Decis Mak 2005; 5 32
| Medical record linkage in health information systems by approximate string matching and clustering.CrossRef | 16219102PubMed |
 Crilly JL, O’Dwyer JA, O’Dwyer MA, Lind JF, Peters JAL, Tippett VC, et al Linking ambulance, emergency department and hospital admissions data: understanding the emergency journey. Med J Aust 2011; 194 S34–7.
| 21401486PubMed |
 Australian Bureau of Statistics. Year Book Australia, 2005; 2005. Available at http://www.abs.gov.au/Ausstats/abs@.nsf/0/5B10622703A022B0CA256F7200833007?opendocument [verified 28 June 2010].
 Australian Bureau of Statistics. Regional Population Growth, Australia, 2008–2009; 2009. Available at http://www.abs.gov.au/ausstats/abs@.nsf/Products/3218.0~2008-09~Main+Features~Victoria?OpenDocument [verified 24 June 2010].
 Jaro M. Advances in record linking methodology as applied to the 1985 census of Tampa Florida. J Amer Statist Assoc 1989; 84 414–20.
 Winkler W. Overview of record linkage and current research directions. Washington, DC: Bureau of the Census; 2006.
 Jaro M. Probabilistic linkage of large public health data file. Stat Med 1995; 14 491–8.
| Probabilistic linkage of large public health data file.CrossRef | 7792443PubMed |
 Winkler W. The state of record linkage and current research problems. Washington, DC: Bureau of the Census; 1999.
 Oracle. Oracle Warehouse Builder User’s Guide 10g Release 2 (10.2.0.2). Oracle Corporation; 2009. Available at http://download.oracle.com/docs/cd/B31080_01/doc/owb.102/b28223/ref_dataquality.htm#i1187308 [verified 28 June 2010].
 Black P. Soundex in Dictionary of Algorithms and Data Structures. U.S. National Institute of Standards and Technology; 2010. Available at http://xlinux.nist.gov/dads/HTML/soundex.html [verified 29 September 2011].
 VACIS Clinical Data Warehouse. VACIS Clinical Data Warehouse Technical Case Patient Matching. Melbourne: Ambulance Victoria; 2010.