Automated Medical Report Generation for ECG Data: Bridging Medical Text and Signal Processing with Deep Learning

Amnon Bleich

Antje Linnemann

Bjoern H. Diem

Tim OF Conrad

December 05, 2024

Recent advances in deep learning and natural language generation have significantly improved image captioning, enabling automated, human-like descriptions for visual content. In this work, we apply these captioning techniques to generate clinician-like interpretations of ECG data. This study leverages existing ECG datasets accompanied by free-text reports authored by healthcare professionals (HCPs) as training data. These reports, while often inconsistent, provide a valuable foundation for automated learning. We introduce an encoder-decoder-based method that uses these reports to train models to generate detailed descriptions of ECG episodes. This represents a significant advancement in ECG analysis automation, with potential applications in zero-shot classification and automated clinical decision support.
The model is tested on various datasets, including both 1- and 12-lead ECGs. It significantly outperforms the state-of-the-art reference model by Qiu et al., achieving a METEOR score of 55.53% compared to 24.51% achieved by the reference model. Furthermore, several key design choices are discussed, providing a comprehensive overview of current challenges and innovations in this domain.
The source codes for this research are publicly available in our Git repository this https URL

https://doi.org/10.48550/arXiv.2412.04067

BIFOLD AUTHORS

Prof. Dr. Tim Conrad