Evaluating generative AI models for explainable pathological feature extraction in lung adenocarcinoma grading assessment and prognostic model construction.
Int J Surg 2025-05-28
The In-depth Comparative Analysis of Four Large Language AI Models for Risk Assessment and Information Retrieval from Multi-Modality Prostate Cancer Work-up Reports.
World J Mens Health 2025-01-01
Comparative Performance of Anthropic Claude and OpenAI GPT Models in Basic Radiological Imaging Tasks.
J Med Imaging Radiat Oncol 2025-04-08
Multimodal Generative AI for Anatomic Pathology-A Review of Current Applications to Envisage the Future Direction.
Adv Anat Pathol 2025-04-29
Evaluating the reference accuracy of large language models in radiology: a comparative study across subspecialties.
Diagn Interv Radiol 2025-05-12
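This study found that Claude 3.5 Sonnet was the most accurate model at generating radiology references, with 80.8% of its references correct and only 3.1% fabricated, clearly outperforming the other models. By contrast, ChatGPT and Google Gemini 1.5 Pro were less accurate, with fabrication rates as high as 60.6%. Accuracy also varied across radiology subspecialties. Overall, Claude 3.5 Sonnet showed high academic reliability, whereas the other models risk misleading readers, and their citation capabilities still need improvement.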
Evaluation of large language models in generating pulmonary nodule follow-up recommendations.
Eur J Radiol Open 2025-05-20
Evaluation of generative AI assistance in clinical nephrology: Assessing GPT-4, GPT-4o, Gemini 1.0 Ultra, and PaLM 2 in patient interaction and renal biopsy interpretation.
Digit Health 2025-06-05