Precision Grounding: Augmenting Large Language Models with Evidence-Based Databases for Trustworthy Genetic Variant Summarization.
精準基礎:結合實證型資料庫以提升大型語言模型於遺傳變異摘要的可信度
medRxiv 2025-06-30
An evaluation of the reliability and readability of large language models in the dissemination of traumatic brain injury information.
大型語言模型在傳播創傷性腦損傷資訊時之可靠性與可讀性評估
Digit Health 2025-06-30
A comparative analysis of machine learning models and human expertise for nursing intervention classification.
機器學習模型與人類專業在護理介入分類上的比較分析
JAMIA Open 2025-06-30
Cross-institutional dental electronic health record entity extraction via generative artificial intelligence and synthetic notes.
跨機構牙科電子健康紀錄實體擷取:結合生成式人工智慧與合成病歷
JAMIA Open 2025-06-30
Standardized clinical assessments and advanced AI-driven instruments used to evaluate neurofunctional deficits, including within biomarker based framework, in Parkinson's disease - human intelligence made vs. AI models - systematic review.
帕金森氏症中以生物標記為基礎之架構下,評估神經功能缺損所使用的標準化臨床評估與先進AI驅動儀器——人類智慧與AI模型之比較的系統性回顧
Front Med (Lausanne) 2025-06-30
How accurate are ChatGPT-4 responses in chronic urticaria? A critical analysis with information quality metrics.
ChatGPT-4 回應於慢性蕁麻疹的準確性如何?以資訊品質指標進行的批判性分析
World Allergy Organ J 2025-06-30
Battle of the Bots: Solving Clinical Cases in Osteoarticular Infections With Large Language Models.
機器人大對決:運用大型語言模型解決骨關節感染的臨床病例
Mayo Clin Proc Digit Health 2025-06-30
這項研究比較15款大型語言模型在骨關節感染臨床案例的表現,OpenEvidence 和 Microsoft Copilot 答對率最高(94.4%),ChatGPT-4o 和 Gemini 2.5 Pro 也有92.8%。不同感染類型下,各模型表現不一。結果顯示先進 LLMs 有潛力協助臨床決策,但在正式醫療應用前,還需要更多驗證和優化。
相關文章PubMedDOI推理
Does the Quality and Readability of Information Related to Varicocele Obtained from ChatGPT 4.0 Remain Consistent Across Different Models of Inquiry?
ChatGPT 4.0 所提供的 Varicocele 相關資訊,其品質與可讀性在不同提問模式下是否保持一致?
World J Mens Health 2025-06-29