Can American Board of Surgery in Training Examinations be passed by Large Language Models? Comparative assessment of Gemini, Copilot, and ChatGPT.
Am Surg 2025-05-12
Large Language Models for Intraoperative Decision Support in Plastic Surgery: A Comparison between ChatGPT-4 and Gemini.
Medicina (Kaunas) 2024-06-27
Performance evaluation of ChatGPT-4.0 and Gemini on image-based neurosurgery board practice questions: A comparative analysis.
J Clin Neurosci 2025-02-12
Accuracy and quality of ChatGPT-4o and Google Gemini performance on image-based neurosurgery board questions.
Neurosurg Rev 2025-03-25
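This study evaluated two large language models (LLMs), GPT-4o and Google Gemini, on image-based neurosurgery board questions. Across the 379 questions analyzed, GPT-4o answered 51.45% correctly, clearly outperforming Gemini at 39.58%. GPT-4o performed particularly well in areas such as pathology and radiology, and it also did better on questions requiring complex reasoning. Although GPT-4o's answers were of higher quality, both models still performed worse on image-based questions than on traditional examinations, which highlights the challenges of machine vision and medical image interpretation.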
Comparative Analysis of ChatGPT-4o and Gemini Advanced Performance on Diagnostic Radiology In-Training Exams.
Cureus 2025-04-21
Transforming Neurosurgical Practice with Large Language Models: Comparative Performance of ChatGPT-Omni and Gemini in Complex Case Management.
World Neurosurg 2025-05-22
Battle of the Bots: Assessing the Ability of Four Large Language Models to Tackle Different Surgery Topics.
Am Surg 2025-05-27
Transforming neurosurgical practice with large language models: comparative performance of ChatGPT-omni and Gemini in complex case management.
J Neurosurg Sci 2025-06-05
Evaluating Large Language Models on American Board of Anesthesiology-style Anesthesiology Questions: Accuracy, Domain Consistency, and Clinical Implications.
J Cardiothorac Vasc Anesth 2025-06-15