New Method of Paired Comparison for Improved Observer Shortage Using Deep Learning Models

  • Tabata Nariaki
    Department of Health Sciences, Graduate School of Medical Sciences, Kyushu University Department of Radiology, Fukuoka University Chikushi Hospital
  • Ijichi Tetsuya
    Department of Radiology, Fukuoka University Chikushi Hospital
  • Itai Hirotaka
    Department of Health Sciences, Graduate School of Medical Sciences, Kyushu University Department of Radiology, National Hospital Organization Kyushu Medical Center
  • Tateishi Masaru
    Department of Health Sciences, Graduate School of Medical Sciences, Kyushu University
  • Kita Kento
    Department of Health Sciences, Graduate School of Medical Sciences, Kyushu University
  • Obata Asami
    Department of Radiology, Fukuoka University Chikushi Hospital
  • Kawahara Yuna
    Department of Radiology, Fukuoka University Chikushi Hospital
  • Sonoda Lisa
    Department of Radiology, Fukuoka University Chikushi Hospital
  • Katou Shinichi
    Department of Radiology, Fukuoka University Chikushi Hospital
  • Inoue Toshirou
    Department of Radiology, Fukuoka University Chikushi Hospital
  • Ideguchi Tadamitsu
    Department of Health Sciences, Faculty of Medical Sciences, Kyushu University

Bibliographic Information

Other Title
  • 一対比較法における深層学習を用いた観察者不足改善手法の提案
  • イチ ツイヒカクホウ ニ オケル シンソウ ガクシュウ オ モチイタ カンサツシャ ブソク カイゼン シュホウ ノ テイアン

Search this article

Description

<p> Purpose: The aim of this study was to validate the potential of substituting an observer in a paired comparison with a deep-learning observer.Methods: Phantom images were obtained using computed tomography. Imaging conditions included a standard setting of 120 kVp and 200 mA, with tube current variations ranging from 160 mA, 120 mA, 80 mA, 40 mA, and 20 mA, resulting in six different imaging conditions. Fourteen radiologic technologists with >10 years of experience conducted pairwise comparisons using Ura’s method. After training, VGG16 and VGG19 models were combined to form deep learning models, which were then evaluated for accuracy, recall, precision, specificity, and F1value. The validation results were used as the standard, and the results of the average degree of preference and significance tests between images were compared to the standard if the results of deep learning were incorporated.Results: The average accuracy of the deep learning model was 82%, with a maximum difference of 0.13 from the standard regarding the average degree of preference, a minimum difference of 0, and an average difference of 0.05. Significant differences were observed in the test results when replacing human observers with AI counterparts for image pairs with tube currents of 160 mA vs. 120 mA and 200 mA vs. 160 mA.Conclusion: In paired comparisons with a limited phantom (7-point noise evaluation), the potential use of deep learning was suggested as one of the observers.</p>

Journal

References(17)*help

See more

Details 詳細情報について

Report a problem

Back to top