Evaluating the Performance of ChatGPT4.0 Versus ChatGPT3.5 on the Hand Surgery Self-Assessment Exam: A Comparative Analysis of Performance on Image-Based Questions.
評估 ChatGPT4.0 與 ChatGPT3.5 在手外科自我評估考試中的表現:基於影像問題的表現比較分析。
Cureus 2025-02-17
Human versus artificial intelligence: evaluating ChatGPT's performance in conducting published systematic reviews with meta-analysis in chronic pain research.
人類與人工智慧:評估 ChatGPT 在慢性疼痛研究中進行已發表的系統性回顧與統合分析的表現。
Reg Anesth Pain Med 2025-02-16
Evaluating Large Language Model Performance to Support the Diagnosis and Management of Patients with Primary Immune Disorders.
評估大型語言模型在支持原發性免疫疾病患者診斷和管理中的表現。
J Allergy Clin Immunol 2025-02-16
Leveraging AI and customer reviews to evaluate technology used by people with disabilities.
利用人工智慧和顧客評價來評估殘障人士使用的技術。
Disabil Rehabil Assist Technol 2025-02-16
Aligning large language models with radiologists by reinforcement learning from AI feedback for chest CT reports.
透過AI反饋的強化學習,將大型語言模型與放射科醫生對齊,以生成胸部CT報告。
Eur J Radiol 2025-02-15