Performance of the ChatGPT large language model for decision support in community pharmacy.
ChatGPT 大型語言模型在社區藥局決策支持中的表現。
Br J Clin Pharmacol 2024-08-27
Application of Large Language Models in Medical Training Evaluation-Using ChatGPT as a Standardized Patient: Multimetric Assessment.
大型語言模型在醫學訓練評估中的應用 - 使用 ChatGPT 作為標準化病人:多指標評估。
J Med Internet Res 2025-01-01
Humans Continue to Outperform Large Language Models in Complex Clinical Decision-Making: A Study with Medical Calculators.
人類在複雜臨床決策中持續超越大型語言模型:一項使用醫療計算器的研究。
ArXiv 2025-01-13
Factors Associated With the Accuracy of Large Language Models in Basic Medical Science Examinations: Cross-Sectional Study.
與大型語言模型在基礎醫學科學考試準確性相關的因素:橫斷面研究。
JMIR Med Educ 2025-01-23
Assessment of large language models in medical quizzes for clinical chemistry and laboratory management: implications and applications for healthcare artificial intelligence.
大型語言模型在臨床化學和實驗室管理醫學測驗中的評估:對醫療人工智慧的影響與應用。
Scand J Clin Lab Invest 2025-02-19
Assessing ChatGPT 4.0's Capabilities in the United Kingdom Medical Licensing Examination (UKMLA): A Robust Categorical Analysis.
ChatGPT 4.0 在英國醫學執照考試(UKMLA)中的能力評估:一項嚴謹的類別分析
Sci Rep 2025-04-15
Evaluating the Accuracy and Reliability of Large Language Models (ChatGPT, Claude, DeepSeek, Gemini, Grok, and Le Chat) in Answering Item-Analyzed Multiple-Choice Questions on Blood Physiology.
大型語言模型(ChatGPT、Claude、DeepSeek、Gemini、Grok 及 Le Chat)在回答血液生理學題項分析選擇題時之準確性與可靠性評估
Cureus 2025-05-09
Evaluating and leveraging large language models in clinical pharmacology and therapeutics assessment: From exam takers to exam shapers.
在臨床藥理學與治療學評估中評價與應用大型語言模型:從考生到考題設計者
Br J Clin Pharmacol 2025-06-10
最新研究發現,像 ChatGPT-4 Omni 這類大型語言模型,在 CPT 和歐洲處方考試的表現跟醫學生差不多,甚至更厲害,特別是在知識和開藥技巧上。這些 AI 還能揪出題目寫不清楚的地方,不只適合當教學工具,也有助於改進考題品質。
PubMedDOI