Comparative study of ChatGPT and human evaluators on the assessment of medical literature according to recognised reporting standards.
BMJ Health Care Inform 2023-10-23
Automated Paper Screening for Clinical Reviews Using Large Language Models: Data Analysis Study.
J Med Internet Res 2024-01-29
Can large language models replace humans in systematic reviews? Evaluating GPT-4's efficacy in screening and extracting data from peer-reviewed and grey literature in multiple languages.
Res Synth Methods 2024-03-14
Potential roles of large language models in production of systematic reviews and meta-analyses.
J Med Internet Res 2024-05-31
Title and abstract screening for literature reviews using large language models: an exploratory study in the biomedical domain.
Syst Rev 2024-06-15
The potential and pitfalls of using a large language model such as ChatGPT, GPT-4, or LLaMA as a clinical assistant.
J Am Med Inform Assoc 2024-07-17
Human-Comparable Sensitivity of Large Language Models in Identifying Eligible Studies Through Title and Abstract Screening: 3-Layer Strategy Using GPT-3.5 and GPT-4 for Systematic Reviews.
J Med Internet Res 2024-08-16