Evaluating large language models as graders of medical short answer questions: a comparative analysis with expert human graders.
Med Educ Online 2025-08-24
Large language models underperform in European general surgery board examinations: a comparative study with experts and surgical residents.
BMC Med Educ 2025-08-24
The performance of ChatGPT on medical image-based assessments and implications for medical education.
BMC Med Educ 2025-08-23
Graph retrieval augmented large language models for facial phenotype associated rare genetic disease.
NPJ Digit Med 2025-08-23
Does ChatGPT update itself? Accuracy of ChatGPT in tympanostomy tube guidance: A comparative analysis with current literature.
Eur Arch Otorhinolaryngol 2025-08-23
CARE-AD: a multi-agent large language model framework for Alzheimer's disease prediction using longitudinal clinical notes.
NPJ Digit Med 2025-08-23
Perplexity and proximity: Large language model perplexity complements semantic distance metrics for the detection of incoherent speech.
J Biomed Inform 2025-08-23