Addressing Data Bias Problems for Chest X-ray Image Report Generation

Philipp Harzig (University of Augsburg), Yan-Ying Chen (FX Pal), Francine Chen (FX Palo Alto Laboratory), Rainer Lienhart (Universitat Augsburg)

Abstract
Automatic medical report generation from chest X-ray images is one possibility for assisting doctors to reduce their workload. However, the different patterns and data distribution of normal and abnormal cases can bias machine learning models. Previous attempts did not focus on isolating the generation of the abnormal and normal sentences in order to increase the variability of generated paragraphs. To address this, we propose to separate abnormal and normal sentence generation by using two different word LSTMs in a hierarchical LSTM model. We conduct an analysis on the distinctiveness of generated sentences compared to the BLEU score, which increases when less distinct reports are generated. We hope our findings will help to encourage the development of new metrics to better verify methods of automatic medical report generation.

DOI
10.5244/C.33.199
https://dx.doi.org/10.5244/C.33.199

Files
Paper (PDF)

BibTeX
@inproceedings{BMVC2019,
title={Addressing Data Bias Problems for Chest X-ray Image Report Generation},
author={Philipp Harzig and Yan-Ying Chen and Francine Chen and Rainer Lienhart},
year={2019},
month={September},
pages={199.1--199.11},
articleno={199},
numpages={11},
booktitle={Proceedings of the British Machine Vision Conference (BMVC)},
publisher={BMVA Press},
editor={Kirill Sidorov and Yulia Hicks},
doi={10.5244/C.33.199},
url={https://dx.doi.org/10.5244/C.33.199}
}