How accurate are ChatGPT-4 responses in chronic urticaria? A critical analysis with information quality metrics.
ChatGPT-4 回應於慢性蕁麻疹的準確性如何?以資訊品質指標進行的批判性分析
World Allergy Organ J 2025-06-30
Battle of the Bots: Solving Clinical Cases in Osteoarticular Infections With Large Language Models.
機器人大對決:運用大型語言模型解決骨關節感染的臨床病例
Mayo Clin Proc Digit Health 2025-06-30
這項研究比較15款大型語言模型在骨關節感染臨床案例的表現,OpenEvidence 和 Microsoft Copilot 答對率最高(94.4%),ChatGPT-4o 和 Gemini 2.5 Pro 也有92.8%。不同感染類型下,各模型表現不一。結果顯示先進 LLMs 有潛力協助臨床決策,但在正式醫療應用前,還需要更多驗證和優化。
相關文章PubMedDOI推理
Does the Quality and Readability of Information Related to Varicocele Obtained from ChatGPT 4.0 Remain Consistent Across Different Models of Inquiry?
ChatGPT 4.0 所提供的 Varicocele 相關資訊,其品質與可讀性在不同提問模式下是否保持一致?
World J Mens Health 2025-06-29
cfMethylPre: deep transfer learning enhances cancer detection based on circulating cell-free DNA methylation profiling.
cfMethylPre:深度遷移學習提升基於循環游離DNA甲基化分析的癌症偵測
Brief Bioinform 2025-06-29
DrugTar Improves Druggability Prediction by Integrating Large Language Models and Gene Ontologies.
DrugTar 結合大型語言模型與基因本體論提升藥物可成藥性預測能力
Bioinformatics 2025-06-29