Evaluation of GPT-4’s Chest X-Ray Impression Generation: A Reader Study on Performance and Perception

Exploring the generative capabilities of the multimodal GPT-4, our study uncovered significant differences between radiological assessments and automatic evaluation metrics for chest x-ray impression generation and revealed radiological bias.

Bibliographic Details
Main Authors: Sebastian Ziegelmayer, Alexander W Marka, Nicolas Lenhart, Nadja Nehls, Stefan Reischl, Felix Harder, Andreas Sauter, Marcus Makowski, Markus Graf, Joshua Gawlitza
Format: Article
Language:English
Published: JMIR Publications 2023-12-01
Series:Journal of Medical Internet Research
Online Access:https://www.jmir.org/2023/1/e50865