Noninferiority of Artificial Intelligence–Assisted Analysis of Ki-67 and Estrogen/Progesterone Receptor in Breast Cancer Routine Diagnostics

Standard

Noninferiority of Artificial Intelligence–Assisted Analysis of Ki-67 and Estrogen/Progesterone Receptor in Breast Cancer Routine Diagnostics. / Abele, Niklas ; Tiemann, Katharina; Krech, Till; Wellmann, Axel; Schaaf, Christian P; Länger, Florian; Peters, Anja ; Donner, Andreas ; Keil, Felix ; Daifalla, Khalid ; Mackens, Marina ; Mamilos, Andreas ; Minin, Evgeny ; Krümmelbein, Michel ; Krause, Linda; Stark, Maria; Zapf, Antonia; Päpper, Marc; Hartmann, Arndt; Lang, Tobias.

In: MODERN PATHOL, Vol. 36, No. 3, 100033, 03.2023, p. 100033.

Research output: SCORING: Contribution to journalSCORING: Journal articleResearchpeer-review

Harvard

Abele, N, Tiemann, K, Krech, T, Wellmann, A, Schaaf, CP, Länger, F, Peters, A, Donner, A, Keil, F, Daifalla, K, Mackens, M, Mamilos, A, Minin, E, Krümmelbein, M, Krause, L, Stark, M, Zapf, A, Päpper, M, Hartmann, A & Lang, T 2023, 'Noninferiority of Artificial Intelligence–Assisted Analysis of Ki-67 and Estrogen/Progesterone Receptor in Breast Cancer Routine Diagnostics', MODERN PATHOL, vol. 36, no. 3, 100033, pp. 100033. https://doi.org/10.1016/j.modpat.2022.100033

APA

Abele, N., Tiemann, K., Krech, T., Wellmann, A., Schaaf, C. P., Länger, F., Peters, A., Donner, A., Keil, F., Daifalla, K., Mackens, M., Mamilos, A., Minin, E., Krümmelbein, M., Krause, L., Stark, M., Zapf, A., Päpper, M., Hartmann, A., & Lang, T. (2023). Noninferiority of Artificial Intelligence–Assisted Analysis of Ki-67 and Estrogen/Progesterone Receptor in Breast Cancer Routine Diagnostics. MODERN PATHOL, 36(3), 100033. [100033]. https://doi.org/10.1016/j.modpat.2022.100033

Vancouver

Bibtex

@article{1f8e26a7d79d42cbb44bd05f57750364,
title = "Noninferiority of Artificial Intelligence–Assisted Analysis of Ki-67 and Estrogen/Progesterone Receptor in Breast Cancer Routine Diagnostics",
abstract = "Image analysis assistance with artificial intelligence (AI) has become one of the great promises over recent years in pathology, with many scientific studies being published each year. Nonetheless, and perhaps surprisingly, only few image AI systems are already in routine clinical use. A major reason for this is the missing validation of the robustness of many AI systems: beyond a narrow context, the large variability in digital images due to differences in preanalytical laboratory procedures, staining procedures, and scanners can be challenging for the subsequent image analysis. Resulting faulty AI analysis may bias the pathologist and contribute to incorrect diagnoses and, therefore, may lead to inappropriate therapy or prognosis. In this study, a pretrained AI assistance tool for the quantification of Ki-67, estrogen receptor (ER), and progesterone receptor (PR) in breast cancer was evaluated within a realistic study set representative of clinical routine on a total of 204 slides (72 Ki-67, 66 ER, and 66 PR slides). This represents the cohort with the largest image variance for AI tool evaluation to date, including 3 staining systems, 5 whole-slide scanners, and 1 microscope camera. These routine cases were collected without manual preselection and analyzed by 10 participant pathologists from 8 sites. Agreement rates for individual pathologists were found to be 87.6% for Ki-67 and 89.4% for ER/PR, respectively, between scoring with and without the assistance of the AI tool regarding clinical categories. Individual AI analysis results were confirmed by the majority of pathologists in 95.8% of Ki-67 cases and 93.2% of ER/PR cases. The statistical analysis provides evidence for high interobserver variance between pathologists (Krippendorff's α, 0.69) in conventional immunohistochemical quantification. Pathologist agreement increased slightly when using AI support (Krippendorff α, 0.72). Agreement rates of pathologist scores with and without AI assistance provide evidence for the reliability of immunohistochemical scoring with the support of the investigated AI tool under a large number of environmental variables that influence the quality of the diagnosed tissue images.",
author = "Niklas Abele and Katharina Tiemann and Till Krech and Axel Wellmann and Schaaf, {Christian P} and Florian L{\"a}nger and Anja Peters and Andreas Donner and Felix Keil and Khalid Daifalla and Marina Mackens and Andreas Mamilos and Evgeny Minin and Michel Kr{\"u}mmelbein and Linda Krause and Maria Stark and Antonia Zapf and Marc P{\"a}pper and Arndt Hartmann and Tobias Lang",
year = "2023",
month = mar,
doi = "https://doi.org/10.1016/j.modpat.2022.100033",
language = "English",
volume = "36",
pages = "100033",
journal = "MODERN PATHOL",
issn = "0893-3952",
publisher = "NATURE PUBLISHING GROUP",
number = "3",

}

RIS

TY - JOUR

T1 - Noninferiority of Artificial Intelligence–Assisted Analysis of Ki-67 and Estrogen/Progesterone Receptor in Breast Cancer Routine Diagnostics

AU - Abele, Niklas

AU - Tiemann, Katharina

AU - Krech, Till

AU - Wellmann, Axel

AU - Schaaf, Christian P

AU - Länger, Florian

AU - Peters, Anja

AU - Donner, Andreas

AU - Keil, Felix

AU - Daifalla, Khalid

AU - Mackens, Marina

AU - Mamilos, Andreas

AU - Minin, Evgeny

AU - Krümmelbein, Michel

AU - Krause, Linda

AU - Stark, Maria

AU - Zapf, Antonia

AU - Päpper, Marc

AU - Hartmann, Arndt

AU - Lang, Tobias

PY - 2023/3

Y1 - 2023/3

N2 - Image analysis assistance with artificial intelligence (AI) has become one of the great promises over recent years in pathology, with many scientific studies being published each year. Nonetheless, and perhaps surprisingly, only few image AI systems are already in routine clinical use. A major reason for this is the missing validation of the robustness of many AI systems: beyond a narrow context, the large variability in digital images due to differences in preanalytical laboratory procedures, staining procedures, and scanners can be challenging for the subsequent image analysis. Resulting faulty AI analysis may bias the pathologist and contribute to incorrect diagnoses and, therefore, may lead to inappropriate therapy or prognosis. In this study, a pretrained AI assistance tool for the quantification of Ki-67, estrogen receptor (ER), and progesterone receptor (PR) in breast cancer was evaluated within a realistic study set representative of clinical routine on a total of 204 slides (72 Ki-67, 66 ER, and 66 PR slides). This represents the cohort with the largest image variance for AI tool evaluation to date, including 3 staining systems, 5 whole-slide scanners, and 1 microscope camera. These routine cases were collected without manual preselection and analyzed by 10 participant pathologists from 8 sites. Agreement rates for individual pathologists were found to be 87.6% for Ki-67 and 89.4% for ER/PR, respectively, between scoring with and without the assistance of the AI tool regarding clinical categories. Individual AI analysis results were confirmed by the majority of pathologists in 95.8% of Ki-67 cases and 93.2% of ER/PR cases. The statistical analysis provides evidence for high interobserver variance between pathologists (Krippendorff's α, 0.69) in conventional immunohistochemical quantification. Pathologist agreement increased slightly when using AI support (Krippendorff α, 0.72). Agreement rates of pathologist scores with and without AI assistance provide evidence for the reliability of immunohistochemical scoring with the support of the investigated AI tool under a large number of environmental variables that influence the quality of the diagnosed tissue images.

AB - Image analysis assistance with artificial intelligence (AI) has become one of the great promises over recent years in pathology, with many scientific studies being published each year. Nonetheless, and perhaps surprisingly, only few image AI systems are already in routine clinical use. A major reason for this is the missing validation of the robustness of many AI systems: beyond a narrow context, the large variability in digital images due to differences in preanalytical laboratory procedures, staining procedures, and scanners can be challenging for the subsequent image analysis. Resulting faulty AI analysis may bias the pathologist and contribute to incorrect diagnoses and, therefore, may lead to inappropriate therapy or prognosis. In this study, a pretrained AI assistance tool for the quantification of Ki-67, estrogen receptor (ER), and progesterone receptor (PR) in breast cancer was evaluated within a realistic study set representative of clinical routine on a total of 204 slides (72 Ki-67, 66 ER, and 66 PR slides). This represents the cohort with the largest image variance for AI tool evaluation to date, including 3 staining systems, 5 whole-slide scanners, and 1 microscope camera. These routine cases were collected without manual preselection and analyzed by 10 participant pathologists from 8 sites. Agreement rates for individual pathologists were found to be 87.6% for Ki-67 and 89.4% for ER/PR, respectively, between scoring with and without the assistance of the AI tool regarding clinical categories. Individual AI analysis results were confirmed by the majority of pathologists in 95.8% of Ki-67 cases and 93.2% of ER/PR cases. The statistical analysis provides evidence for high interobserver variance between pathologists (Krippendorff's α, 0.69) in conventional immunohistochemical quantification. Pathologist agreement increased slightly when using AI support (Krippendorff α, 0.72). Agreement rates of pathologist scores with and without AI assistance provide evidence for the reliability of immunohistochemical scoring with the support of the investigated AI tool under a large number of environmental variables that influence the quality of the diagnosed tissue images.

U2 - https://doi.org/10.1016/j.modpat.2022.100033

DO - https://doi.org/10.1016/j.modpat.2022.100033

M3 - SCORING: Journal article

C2 - 36931740

VL - 36

SP - 100033

JO - MODERN PATHOL

JF - MODERN PATHOL

SN - 0893-3952

IS - 3

M1 - 100033

ER -