Validation of algorithmic CT image quality metrics with preferences of radiologists

Standard

Validation of algorithmic CT image quality metrics with preferences of radiologists. / Cheng, Yuan; Abadi, Ehsan; Smith, Taylor Brunton; Ria, Francesco; Meyer, Mathias; Marin, Daniele; Samei, Ehsan.

In: MED PHYS, Vol. 46, No. 11, 11.2019, p. 4837-4846.

Research output: SCORING: Contribution to journalSCORING: Journal articleResearchpeer-review

Harvard

Cheng, Y, Abadi, E, Smith, TB, Ria, F, Meyer, M, Marin, D & Samei, E 2019, 'Validation of algorithmic CT image quality metrics with preferences of radiologists', MED PHYS, vol. 46, no. 11, pp. 4837-4846. https://doi.org/10.1002/mp.13795

APA

Cheng, Y., Abadi, E., Smith, T. B., Ria, F., Meyer, M., Marin, D., & Samei, E. (2019). Validation of algorithmic CT image quality metrics with preferences of radiologists. MED PHYS, 46(11), 4837-4846. https://doi.org/10.1002/mp.13795

Vancouver

Bibtex

@article{a7215445f9dd4ef989a22054962444d3,
title = "Validation of algorithmic CT image quality metrics with preferences of radiologists",
abstract = "PURPOSE: Automated assessment of perceptual image quality on clinical Computed Tomography (CT) data by computer algorithms has the potential to greatly facilitate data-driven monitoring and optimization of CT image acquisition protocols. The application of these techniques in clinical operation requires the knowledge of how the output of the computer algorithms corresponds to clinical expectations. This study addressed the need to validate algorithmic image quality measurements on clinical CT images with preferences of radiologists and determine the clinically acceptable range of algorithmic measurements for abdominal CT examinations.MATERIALS AND METHODS: Algorithmic measurements of image quality metrics (organ HU, noise magnitude, and clarity) were performed on a clinical CT image dataset with supplemental measures of noise power spectrum from phantom images using techniques developed previously. The algorithmic measurements were compared to clinical expectations of image quality in an observer study with seven radiologists. Sets of CT liver images were selected from the dataset where images in the same set varied in terms of one metric at a time. These sets of images were shown via a web interface to one observer at a time. First, the observer rank ordered the CT images in a set according to his/her preference for the varying metric. The observer then selected his/her preferred acceptable range of the metric within the ranked images. The agreement between algorithmic and observer rankings of image quality were investigated and the clinically acceptable image quality in terms of algorithmic measurements were determined.RESULTS: The overall rank-order agreements between algorithmic and observer assessments were 0.90, 0.98, and 1.00 for noise magnitude, liver parenchyma HU, and clarity, respectively. The results indicate a strong agreement between the algorithmic and observer assessments of image quality. Clinically acceptable thresholds (median) of algorithmic metric values were (17.8, 32.6) HU for noise magnitude, (92.1, 131.9) for liver parenchyma HU, and (0.47, 0.52) for clarity.CONCLUSIONS: The observer study results indicated that these algorithms can robustly assess the perceptual quality of clinical CT images in an automated fashion. Clinically acceptable ranges of algorithmic measurements were determined. The correspondence of these image quality assessment algorithms to clinical expectations paves the way toward establishing diagnostic reference levels in terms of clinically acceptable perceptual image quality and data-driven optimization of CT image acquisition protocols.",
author = "Yuan Cheng and Ehsan Abadi and Smith, {Taylor Brunton} and Francesco Ria and Mathias Meyer and Daniele Marin and Ehsan Samei",
note = "{\textcopyright} 2019 American Association of Physicists in Medicine.",
year = "2019",
month = nov,
doi = "10.1002/mp.13795",
language = "English",
volume = "46",
pages = "4837--4846",
journal = "MED PHYS",
issn = "0094-2405",
publisher = "AAPM - American Association of Physicists in Medicine",
number = "11",

}

RIS

TY - JOUR

T1 - Validation of algorithmic CT image quality metrics with preferences of radiologists

AU - Cheng, Yuan

AU - Abadi, Ehsan

AU - Smith, Taylor Brunton

AU - Ria, Francesco

AU - Meyer, Mathias

AU - Marin, Daniele

AU - Samei, Ehsan

N1 - © 2019 American Association of Physicists in Medicine.

PY - 2019/11

Y1 - 2019/11

N2 - PURPOSE: Automated assessment of perceptual image quality on clinical Computed Tomography (CT) data by computer algorithms has the potential to greatly facilitate data-driven monitoring and optimization of CT image acquisition protocols. The application of these techniques in clinical operation requires the knowledge of how the output of the computer algorithms corresponds to clinical expectations. This study addressed the need to validate algorithmic image quality measurements on clinical CT images with preferences of radiologists and determine the clinically acceptable range of algorithmic measurements for abdominal CT examinations.MATERIALS AND METHODS: Algorithmic measurements of image quality metrics (organ HU, noise magnitude, and clarity) were performed on a clinical CT image dataset with supplemental measures of noise power spectrum from phantom images using techniques developed previously. The algorithmic measurements were compared to clinical expectations of image quality in an observer study with seven radiologists. Sets of CT liver images were selected from the dataset where images in the same set varied in terms of one metric at a time. These sets of images were shown via a web interface to one observer at a time. First, the observer rank ordered the CT images in a set according to his/her preference for the varying metric. The observer then selected his/her preferred acceptable range of the metric within the ranked images. The agreement between algorithmic and observer rankings of image quality were investigated and the clinically acceptable image quality in terms of algorithmic measurements were determined.RESULTS: The overall rank-order agreements between algorithmic and observer assessments were 0.90, 0.98, and 1.00 for noise magnitude, liver parenchyma HU, and clarity, respectively. The results indicate a strong agreement between the algorithmic and observer assessments of image quality. Clinically acceptable thresholds (median) of algorithmic metric values were (17.8, 32.6) HU for noise magnitude, (92.1, 131.9) for liver parenchyma HU, and (0.47, 0.52) for clarity.CONCLUSIONS: The observer study results indicated that these algorithms can robustly assess the perceptual quality of clinical CT images in an automated fashion. Clinically acceptable ranges of algorithmic measurements were determined. The correspondence of these image quality assessment algorithms to clinical expectations paves the way toward establishing diagnostic reference levels in terms of clinically acceptable perceptual image quality and data-driven optimization of CT image acquisition protocols.

AB - PURPOSE: Automated assessment of perceptual image quality on clinical Computed Tomography (CT) data by computer algorithms has the potential to greatly facilitate data-driven monitoring and optimization of CT image acquisition protocols. The application of these techniques in clinical operation requires the knowledge of how the output of the computer algorithms corresponds to clinical expectations. This study addressed the need to validate algorithmic image quality measurements on clinical CT images with preferences of radiologists and determine the clinically acceptable range of algorithmic measurements for abdominal CT examinations.MATERIALS AND METHODS: Algorithmic measurements of image quality metrics (organ HU, noise magnitude, and clarity) were performed on a clinical CT image dataset with supplemental measures of noise power spectrum from phantom images using techniques developed previously. The algorithmic measurements were compared to clinical expectations of image quality in an observer study with seven radiologists. Sets of CT liver images were selected from the dataset where images in the same set varied in terms of one metric at a time. These sets of images were shown via a web interface to one observer at a time. First, the observer rank ordered the CT images in a set according to his/her preference for the varying metric. The observer then selected his/her preferred acceptable range of the metric within the ranked images. The agreement between algorithmic and observer rankings of image quality were investigated and the clinically acceptable image quality in terms of algorithmic measurements were determined.RESULTS: The overall rank-order agreements between algorithmic and observer assessments were 0.90, 0.98, and 1.00 for noise magnitude, liver parenchyma HU, and clarity, respectively. The results indicate a strong agreement between the algorithmic and observer assessments of image quality. Clinically acceptable thresholds (median) of algorithmic metric values were (17.8, 32.6) HU for noise magnitude, (92.1, 131.9) for liver parenchyma HU, and (0.47, 0.52) for clarity.CONCLUSIONS: The observer study results indicated that these algorithms can robustly assess the perceptual quality of clinical CT images in an automated fashion. Clinically acceptable ranges of algorithmic measurements were determined. The correspondence of these image quality assessment algorithms to clinical expectations paves the way toward establishing diagnostic reference levels in terms of clinically acceptable perceptual image quality and data-driven optimization of CT image acquisition protocols.

U2 - 10.1002/mp.13795

DO - 10.1002/mp.13795

M3 - SCORING: Journal article

C2 - 31465538

VL - 46

SP - 4837

EP - 4846

JO - MED PHYS

JF - MED PHYS

SN - 0094-2405

IS - 11

ER -