Radiation Protection Dosimetry Advance Access published online on March 6, 2008
Radiation Protection Dosimetry, doi:10.1093/rpd/ncm470
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
INTERNAL DOSIMETRY VERIFICATION AND VALIDATION DATABASE
Los Alamos National Laboratory, Los Alamos, NM 87545, USA
* Corresponding author: guthrie{at}lanl.gov
| ABSTRACT |
|---|
|
|
|---|
Simulated-data internal dosimetry cases for use in intercomparison exercises or as a software verification and validation tool have been published on the internet (www.lanl.gov/bayesian/software Bayesian software package II). A user may validate their internal dosimetry code or method using this simulated bioassay data. Or, the user may choose to try out the Los Alamos National Laboratory codes ID and UF, which are also supplied. A Poisson–lognormal model of data uncertainty is assumed. A collection of different possible models for each nuclide (e.g. solubility types and particle sizes) are used. For example, for 238Pu, 14 different biokinetic models or types (8 inhalation, 4 wound and 2 ingestion) are assumed. Simulated data are generated for all the assumed biokinetic models, both for incidents, where the time of intake is known, and for non-incidents, where it is not. For the dose calculations, the route of intake, but not the biokinetic model, is considered to be known. The object is to correctly calculate the known true dose from simulated data covering a period of time. A correct result has been defined in two ways: (1) that the credible limits of the calculated dose include the correct dose and (2) that the calculated dose is within a factor of 2 of the correct dose.
| INTRODUCTION |
|---|
|
|
|---|
The IDEAS 2005(1) project and predecessor intercomparison exercises had as their goal the harmonisation of internal dosimetry methods, through cross-checking and discussion by participants of results obtained for a given set of test cases. To this end also we have web-published the verification and validation (VV) test cases used for the internal dosimetry programme at Los Alamos National Laboratory (LANL)(2) for the nuclides 3H, 234U and 238Pu. Etherinton et al.(3) discuss something similar in principle to what is done here (the HPA method), where simulated data are used to test a particular internal dosimetry algorithm. The advantages of these types of tests are that all aspects of the correct answer are known, which allows detailed checking of the mathematical calculation. Of course, scientific questions remain about possible biokinetic behaviours outside of the imagined scope of possibilities.
A Poisson–lognormal model of data uncertainty is assumed(4,5) in order to simulate as accurately as possible the actual measurement process. As discussed in Doerfel et al.(6), two sources of uncertainty can be identified: type-A or measurement uncertainty, which is related to the Poisson uncertainty of the counting measurement, and type-B or normalisation uncertainty, which is related to uncertainty or variability of the normalisation factor applied to the measured net counts to obtain the quantity of interest (e.g. excretion rate). There is experimental evidence, e.g. Moss et al.(7), that normalisation factors are described by lognormal distributions, and this is a standard assumption (e.g. it is the basis of the lognormal model discussed in Doerfel et al.(6)).
| METHOD |
|---|
|
|
|---|
A collection of different possible models for each nuclide (e.g. solubility types and particle sizes) are used. In Bayesian terminology, the collection of biokinetic models used to generate the data constitutes a biokinetic prior. The situation described here is favourable for a Bayesian interpretation of the data, i.e. the same biokinetic prior is used to analyse the data as was used to generate the data, although this need not be the case.
For example, for 238Pu, 14 different biokinetic models are assumed: 8 inhalation, 4 wound and 2 ingestion. The two ingestion models correspond to choosing the gut absorption factor f1 as the recommended value for ICRP66-type S or M(8,9). These models are denoted by the three-character symbols GIS and GIM. Six of the inhalation models are ICRP66-type M and S, with particle sizes (AMAD) of 1, 5 and 10 µm. These are denoted by three-character symbols such as IS0, for inhalation, type S, 10 µm AMAD, etc. The other two inhalation models and the wound models are defined in Table 1, using ICRP66(9) terminology.
|
Simulated data are generated for all the assumed biokinetic types, both for incidents where the time of intake is known and for non-incidents where it is not. For the dose calculations, the route of intake, but not the biokinetic model, is considered to be known. The object is to correctly calculate the known true dose from simulated data covering a period of time. A correct result has been defined in two ways (other definitions could be used): (1) that the credible limits of the calculated dose include the correct dose, and (2) that the calculated dose is within a factor of 2 of the correct dose. Credible limits are the Bayesian nomenclature for the interval of the quantity of interest such that the probability is some specified small amount (usually 5%) that the quantity is less than the lower limit or greater than the upper limit. For a general discussion of Bayesian terminology, see Miller et al.(10).
There are two types of errors that need to be checked, false negatives and false positives(10). The false-negative database, which is discussed here, contains simulated data from intakes producing the smallest E(50) doses of regulatory interest in the USA. For plutonium and uranium, these are assumed to be 1 mSv for incident-related intakes and 5 mSv for non-incident-related intakes. A simple per cent-correct score can be given (% correct out of total number of cases). A false-positive database would contain a very large number of cases generated from zero true intake, a small fraction of which might be falsely interpreted as intakes. This database is not discussed further here.
As an example, consider a non-incident-related case for 238Pu where the biokinetics are chosen to be IS5. The bioassay data to be generated consist of routine urine samples taken every 6 months over a total work history of 10 years (20 samples in all). In the interval between the first and second data points, a time of intake is generated from a uniform distribution. The intake amount corresponding to a 5 mSv E(50) dose is calculated from the biokinetic model. Then using this intake amount and the time of intake, the urine excretion (the signal) is calculated for all data points. As explained in Miller(5), the measurement situation under consideration is a counting measurement that detects N gross counts coupled with a sample-blank background measurement that detects NB background counts (in R counting periods). There is assumed to be an additional lognormally distributed background b in actual samples that does not show up in the sample blanks, e.g. the uranium environmental background often present in uranium urine bioassay samples. This background is assumed to be known from other measurements. The counts are assumed to have Poisson distributions
|
| (1) |
(the signal, e.g. 24-h urine excretion rate) is assumed to be related to µ by a lognormally distributed normalisation factor F, |
|
Uncertainty or variability of F is assumed to arise from two sources: (1) fn, the basic normalisation uncertainty associated with the measurement type, assumed to have logarithmic standard deviation Sn (e.g. biological variability, sample collection variability) and (2) fm, the normalisation uncertainty associated with the measurement itself, assumed to have logarithmic standard deviation Sm (e.g. uncorrected variability of radiochemical yield).
In this way, the counts and background counts are generated for each data point using the value of
calculated from the intake, time of intake and biokinetic model. Sometimes, e.g. for 238Pu and certain times after intake and certain biokinetic models, the gross counts are approximately equal to the background counts. Since in this study the true dose is always positive, if the calculated final determination of dose comes out to be zero, this would be an example of a false-negative error.
| ORGANISATION OF THE SIMULATED DATA |
|---|
|
|
|---|
The software contains the VV test cases for 238Pu, 234U and 3H for the LANL internal dosimetry programme. In the spirit of harmonisation of worldwide internal dosimetry, these cases are made available for others to use with codes and methods of their choice. The user can make detailed comparisons of results using their method with both the known correct result and with those obtained using the LANL ID(11) and UF(12) codes, which are also supplied.
Each test case is in its individual subfolder. For example, the file/BayesII/calcdata/id/pu238/wrk/PIS0R3/bioassay.in contains the bioassay data for the case denoted by PIS0R3. The structure of the BIOASSAY.IN file is explained in the Windows help file ID.CHM.
The case labelling system is as follows, using the case PIS0R3 as an example.
- The first character P denotes plutonium.
- The characters IS0 (inhalation, type S, 10 µm AMAD) denote a biokinetic model or type as explained in the help file /BayesII/progs/id/ID.CHM. The biokinetic types correspond to biokinetic interpolation table files in the folder/BayesII/dfs/pu238/icrp60.
- R denotes a non-incident-related intake detected through routine bioassay, where the date of intake is unknown, rather than an intake associated with an incident (I), where the date of intake as well as some information about the magnitude of intake is known.
- 3 is a sequence number that indicates the random number seed and the data treatment (1–3 exact Poisson, 4–6 Gaussian approximation, 7–9 censored data, 1 4 7 have same seeds, etc).
= 1.9 y–1 and E(50) from Emin = 0.5 to Emax = 50 µSv. The number of intakes has a Poisson distribution with mean number =
ln(Emax/Emin) = 8.7 y–1. The label H13R03 means there are thirteen (13) non-incident-related intakes, and 03 is a sequence number labelling the random number seed. Simulated and actual LANL 3H data are shown in Figure 1.
|
For 234U, the label UIM508 implies a single non-incident-related intake with biokinetic type IM5 (inhalation, type M, 5 µm AMAD), and 08 is a sequence number labelling the random number seed. There is a lognormally distributed environmental urine background, determined empirically from LANL data to have whole population median of 3 mBq/d and lognormal standard deviation of S = 1.3 (
g = 3.67). Simulated 234U data are shown in Figure 2.
|
| SUMMARY OF RESULTS USING LOS ALAMOS CODES ID AND UF |
|---|
|
|
|---|
A summary of results using the Los Alamos Codes ID and UF is shown in Table 2. The quantity E(50) is the 50-year committed effective dose.
|
Table 2 illustrates some aspects of such a VV exercise using simulated data.
- Different data treatments are possible, and they yield different results. At one end of the spectrum considered is the use of actual measured count quantities that are used to calculate the exact Poisson–lognormal likelihood functions describing the data. Such a data treatment in practice depends on count quantities being provided by the analytical laboratory. Sometimes, only the result and measurement uncertainty standard deviation are provided. The Gaussian approximation of the likelihood function might be used in these cases, incorporating both measurement (type A) and estimated normalisation (type B) uncertainties into a single uncertainty quantity as described by Miller(5). Alternatively, a lognormal model might be used as advocated by Doerfel et al.(6); however, such a model will not accommodate small or negative data. With a lognormal model, data less than a few standard deviations positive are sometimes censored, meaning that such a measurement result is designated less than limit of detection and no further information reported. The data treatment labelled censored is essentially this approach.
- Whether or not an incident has occurred is important, and an incident-related internal dosimetry case is distinct from a non-incident case. When an incident has occurred, the time of intake is known and additional bioassay samples are collected. Since more information is available and there is only a single intake, intakes are generally easier to detect.
- The dose quantity needs to be defined. Total E(50) for all years generally has smaller uncertainty than the E(50) for any given year.
- The criterion defining correct needs to be defined. Two reasonable but arbitrary choices are illustrated.
- Different algorithms, such as ID and UF, produce different results. The reason for having more than one algorithm is that one method (ID) is meant to be definitive, a straightforward evaluation of Bayes theorem using Markov Chain Monte Carlo without approximations, whereas the other method (UF) is a faster approximation using 1-D numerical integration and summation. For single-intake situations, the two methods are mathematically equivalent, even though numerically they are quite different. So, the comparisons serve as a check for the mathematical approximations and the numerical techniques.
- And, since these are complex computer algorithms, the use of different computation platforms needs to be checked. The desktop platform is a Windows workstation, whereas the supercomputer platform(14) is a Linux cluster.
|
| MISSED CASES |
|---|
|
|
|---|
The cases that are outside of Bayesian credible limits (missed, i.e. 10 of 93 cases) using most effective method (ID code, exact Poisson likelihood) are shown in Table 3. Two of the missed cases are also shown in Figures 2 and 3.
|
In Table 3, the number of chain iterations per potential intake is the number of Markov Chain Monte Carlo iterations divided by the number of intakes, as discussed in Miller et al.(11). Each chain iteration involves a forward biokinetics calculation (table lookup), and the number of chain iterations is a rough indicator of the computation time required. For incident-related cases, the number of potential intakes is 1. For non-incident-related cases, the number of potential intakes is 19 (1 intake in each interval between bioassay measurements), so the number of chain iterations was 19 times larger than the quantity given in the table for non-incident cases. The quantity
2/Ndata uses
2 calculated from the Poisson–lognormal model, as discussed in Miller (5). If this quantity were much larger than 1, it would indicate a statistical inconsistency, where the Bayesian expectation values of the bioassay data (the fit) are not, on average, within a standard deviation from the data. The probability of intake is defined by Miller et al.(15). When the data are indistinguishable from zero, this probability is about 0.5. As it becomes more and more certain that the data are not consistent with 0, this probability approaches 1. The tritium case is missed because the rather narrow credible limits marginally exclude the true value. There are no missed cases using the factor of 2 definition of correct.
Type M uranium is missed for a similar reason. However, type S 10 µm AMAD uranium does not provide enough signal to be detected.
For plutonium, two 5 mSv non-incident cases are missed. One is a LANL-specific biokinetic model (ICD) describing an inhalation intake intermediate between types M and S. The intake is detected with high probability, but the dose is missed because of misidentification of biokinetic type for two out of three cases. The other case has been discussed and is shown in Figure 3. The 1 mSv plutonium incident cases that are missed involve two LANL-specific biokinetic models: IEE, delayed onset biokinetics for 238Pu and WTA, a wound model having two long-time retention compartments. In addition, type S 1 µm AMAD biokinetics is also missed. These cases simply do not provide enough signal to be distinguishable from background.
| DISCUSSION |
|---|
|
|
|---|
The effectiveness of exact likelihood calculations, particularly in relation to data censoring at 2 standard deviations (87.5 versus 33% correct for 238Pu case), shows the importance of the analytical laboratory reporting count quantities and use of the exact likelihood calculation for a situation such as plutonium. Data censoring here means that the likelihood function used is that corresponding to a less than limit of detection measurement. All the data are used. Another approach, widely used but not considered here, is to simply ignore all data below the limit of detection.
The missed case for plutonium shown in Figure 3 is a good illustration of the problem of data censoring. For this case, all of the data are less than 2 standard deviations positive (measurement divided by measurement uncertainty standard deviation <2 for all data points). The censored data are therefore equivalent to data from zero true result. Even though outside of the Bayesian credible limits, the ID code result for these same data detects an intake with probability 0.77 and has calculated credible limits of (0, 4.7) mSv, a little off the correct result of 5 mSv.
| REFERENCES |
|---|
|
|
|---|
- Hurtgen C., et al. IDEAS/IAEA Intercomparison Exercise on Internal Dose Assessment. Available on http://www.sckcen.be/sckcen_en/publications/other_reports/blgreports/BLG1018.pdf.
- Bertelli L., Miller G., Little T., Guilmette R., Glasser S. Internal dose assessment data management system for a large population of Pu workers. Radiat. Prot. Dosim (2007) 9 October (Epub ahead of print).
- Etherington G., Birchall A., Puncher A. M., Molokanov A., Blanchardon E. Uncertainties in doses from intakes of radionuclides assessed from monitoring measurements. Radiat. Prot. Dosim (2006) 121:40–51.
[Abstract/Free Full Text] - Miller G., Martz H. F., Little T., Guilmette R. Using exact Poisson likelihood functions in Bayesian interpretation of counting measurements. Health Phys (2002) 83:512–518.[CrossRef][ISI][Medline]
- Miller G. Statistical modelling of Poisson/log-normal data. Radiat. Prot. Dosim. advance access published on 12 January 2007; doi:10.1093/rpd/ncl544.
- Doerfel H., et al. General guidelines for the estimation of committed effective dose from incorporation of monitoring data. (2006) 6. Project IDEAS Report FZKA 7243.
- Moss W. D., Campbell E. E., Schulte H. F., Tietjen G. L. A study of the variations found in plutonium urinary data. Health Phys (1969) 17,:571–578.[ISI][Medline]
- International Commission on Radiological Protection (ICRP). Individual monitoring for internal exposure of workers (1997) Oxford: Pergamon Press. ICRP Publication 78.
- International Commission on Radiological Protection (ICRP). Human respiratory tract model for radiological protection. (1994) Oxford: Pergamon Press. ICRP Publication 66.
- Miller G., Inkret W. C., Schillaci M. E., Martz H. F., Little T. Analyzing bioassay data using Bayesian methods—a primer. Health Phys (2000) 78(6):598–613.[CrossRef][ISI][Medline]
- Miller G., Martz H. F., Little T., Guilmette R. Bayesian internal dosimetry calculations using Markov chain Monte Carlo. Radiat. Prot. Dosim (2002) 98:191–198.[Abstract]
- Miller G., Inkret W. C., Martz H. F. Internal dosimetry intake estimation using Bayesian methods. Radiat. Prot. Dosimetry (1999) 82:5–17.[Abstract]
- Miller G., Inkret W. C., Little T. T., Martz H. F., Schillaci M. E. Bayesian prior probability distributions for internal dosimetry. Radiat. Prot. Dosim (2001) 94(4):347–352.[Abstract]
- Miller G., Bertelli L., Little T., Guilmette R. Markov chain Monte Carlo for internal dosimetry on a supercomputer cluster. In: Applied Modeling and Computations in Nuclear Science—Semkow Pomme, Jeroome, eds. (2006) American Chemical Society. Chapter 7.
- Miller G., Martz H., Little T., Bertelli L. Bayesian hypothesis testing—use in interpretation of measurements. Health Phys. (2008).
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||


