Evaluation of GPT-4’s Chest X-Ray Impression Generation: A Reader Study on Performance and Perception
Exploring the generative capabilities of the multimodal GPT-4, our study uncovered significant differences between radiological assessments and automatic evaluation metrics for chest x-ray impression generation and revealed radiological bias.
Main Authors: | , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
JMIR Publications
2023-12-01
|
Series: | Journal of Medical Internet Research |
Online Access: | https://www.jmir.org/2023/1/e50865 |