Radiation Protection Dosimetry Advance Access originally published online on October 30, 2006
Radiation Protection Dosimetry 2007 123(3):318-322; doi:10.1093/rpd/ncl160
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Scanning an individual monitoring database for multiple occurrencies using bi-gram analysis
NRG Radiation and Environment, P.O. Box 3094, 6800 ES Arnhem, The Netherlands
* Corresponding author: j.vandijk{at}nrg-nl.com
Received August 21, 2006, amended September 29, 2006, accepted September 29, 2006
| Abstract |
|---|
Maintaining the integrity of the databases is one of the important aspects of quality assurance at individual monitoring services and national dose registers. This paper presents a method for finding and preventing the occurrence of duplicate entries in the databases that can occur, e.g. because of a variable spelling or misspelling of the name. The method is based on bi-gram text analysis techniques. The methods can also be used for retrieving dose data in historical databases in the framework of dose reconstruction efforts of persons of whom the spelling of the name as originally entered, possibly decades ago, is uncertain.