TY - CHAP
T1 - LLM-Driven Medical Document Analysis
T2 - Enhancing Trustworthy Pathology and Differential Diagnosis
AU - Kang, Lei
AU - Fu, Xuanshuo
AU - Terrades, Oriol Ramos
AU - Vazquez-Corral, Javier
AU - Valveny, Ernest
AU - Karatzas, Dimosthenis
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
PY - 2026/9/17
Y1 - 2026/9/17
N2 - Medical document analysis plays a crucial role in extracting essential clinical insights from unstructured healthcare records, supporting critical tasks such as differential diagnosis. Determining the most probable condition among overlapping symptoms requires precise evaluation and deep medical expertise. While recent advancements in large language models (LLMs) have significantly enhanced performance in medical document analysis, privacy concerns related to sensitive patient data limit the use of online LLMs services in clinical settings. To address these challenges, we propose a trustworthy medical document analysis platform that fine-tunes a LLaMA-v3 using low-rank adaptation, specifically optimized for differential diagnosis tasks. Our approach utilizes DDXPlus, the largest benchmark dataset for differential diagnosis, and demonstrates superior performance in pathology prediction and variable-length differential diagnosis compared to existing methods. The developed web-based platform allows users to submit their own unstructured medical documents and receive accurate, explainable diagnostic results. By incorporating advanced explainability techniques, the system ensures transparent and reliable predictions, fostering user trust and confidence. Extensive evaluations confirm that the proposed method surpasses current state-of-the-art models in predictive accuracy while offering practical utility in clinical settings. This work addresses the urgent need for reliable, explainable, and privacy-preserving artificial intelligence solutions, representing a significant advancement in intelligent medical document analysis for real-world healthcare applications.
AB - Medical document analysis plays a crucial role in extracting essential clinical insights from unstructured healthcare records, supporting critical tasks such as differential diagnosis. Determining the most probable condition among overlapping symptoms requires precise evaluation and deep medical expertise. While recent advancements in large language models (LLMs) have significantly enhanced performance in medical document analysis, privacy concerns related to sensitive patient data limit the use of online LLMs services in clinical settings. To address these challenges, we propose a trustworthy medical document analysis platform that fine-tunes a LLaMA-v3 using low-rank adaptation, specifically optimized for differential diagnosis tasks. Our approach utilizes DDXPlus, the largest benchmark dataset for differential diagnosis, and demonstrates superior performance in pathology prediction and variable-length differential diagnosis compared to existing methods. The developed web-based platform allows users to submit their own unstructured medical documents and receive accurate, explainable diagnostic results. By incorporating advanced explainability techniques, the system ensures transparent and reliable predictions, fostering user trust and confidence. Extensive evaluations confirm that the proposed method surpasses current state-of-the-art models in predictive accuracy while offering practical utility in clinical settings. This work addresses the urgent need for reliable, explainable, and privacy-preserving artificial intelligence solutions, representing a significant advancement in intelligent medical document analysis for real-world healthcare applications.
KW - Differential Diagnosis
KW - Explainability
KW - Large Language Models
KW - Low-Rank Adaptation
KW - Medical Document Analysis
KW - Pathology
UR - https://www.scopus.com/pages/publications/105017371419
UR - https://www.mendeley.com/catalogue/f292ab6c-665f-3714-87b7-acb7e76a4cbd/
U2 - 10.1007/978-3-032-04624-6_36
DO - 10.1007/978-3-032-04624-6_36
M3 - Chapter
SN - 9783032046239
VL - 16025
T3 - Lecture Notes in Computer Science
SP - 613
EP - 628
BT - LLM-Driven Medical Document Analysis: Enhancing Trustworthy Pathology and Differential Diagnosis
A2 - Yin, Xu-Cheng
A2 - Karatzas, Dimosthenis
A2 - Lopresti, Daniel
ER -