Large language model uncertainty proxies: discrimination and calibration for medical diagnosis and treatment.
大型語言模型不確定性代理:醫療診斷和治療的區分與校準。
J Am Med Inform Assoc 2024-10-13
Systematic analysis of ChatGPT, Google search and Llama 2 for clinical decision support tasks.
ChatGPT、Google 搜尋和 Llama 2 在臨床決策支援任務中的系統性分析。
Nat Commun 2024-03-09
Quality of Answers of Generative Large Language Models vs Peer Patients for Interpreting Lab Test Results for Lay Patients: Evaluation Study.
生成式大型語言模型與同儕患者對於解釋普通患者的檢驗結果的回答品質:評估研究。
ArXiv 2024-03-30
Quality of Answers of Generative Large Language Models Versus Peer Users for Interpreting Laboratory Test Results for Lay Patients: Evaluation Study.
生成式大型語言模型與同儕用戶對於解釋普通患者的實驗室檢驗結果的答案品質:評估研究。
J Med Internet Res 2024-04-17
Off-the-shelf Large Language Models (LLM) Are Of Insufficient Quality To Provide Medical Treatment Recommendations, While Customization of LLMs Result In Quality Recommendations.
現成的大型語言模型 (LLM) 在提供醫療治療建議方面的質量不足,而定制化的 LLM 則能產生高質量的建議。
Arthroscopy 2024-10-05
Evaluating the use of large language models to provide clinical recommendations in the Emergency Department.
評估大型語言模型在急診科提供臨床建議的應用。
Nat Commun 2024-10-08