Methods to evaluate risk of bias in LLM studies, Tools for assessing risk of bias in LLM research, Impact of bias evaluation on the validity of LLM findings
執行時間
7.42963 秒
花費Token
138
Assessing the Risk of Bias in Randomized Clinical Trials With Large Language Models.
使用大型語言模型評估隨機臨床試驗的偏倚風險。
JAMA Netw Open / / 2024-05-22
Streamlining Systematic Reviews: Harnessing Large Language Models for Quality Assessment and Risk-of-Bias Evaluation.
利用大型語言模型優化系統性文獻回顧:品質評估與偏倚風險評估。
Cureus / / 2023-09-08
Integrating large language models in systematic reviews: a framework and case study using ROBINS-I for risk of bias assessment.
將此醫學文章的標題翻譯為繁體中文:「將大型語言模型整合到系統性評論中:以 ROBINS-I 進行偏倚風險評估的框架和案例研究。」
BMJ Evid Based Med / / 2024-02-21
Cost, Usability, Credibility, Fairness, Accountability, Transparency, and Explainability Framework for Safe and Effective Large Language Models in Medical Education: Narrative Review and Qualitative Study.
醫學教育中安全有效大型語言模型的成本、可用性、可信度、公平性、責任制、透明度和可解釋性框架:敘事性回顧與質性研究。
JMIR AI / / 2024-06-14
Harnessing LLMs for multi-dimensional writing assessment: Reliability and alignment with human judgments.
利用大型語言模型進行多維寫作評估:可靠性及與人類評價的一致性。
Heliyon / / 2024-08-08
Fighting reviewer fatigue or amplifying bias? Considerations and recommendations for use of ChatGPT and other Large Language Models in scholarly peer review.
在學術同儕審查中使用 ChatGPT 和其他大型語言模型時,如何避免審稿人疲勞或加劇偏見?考量與建議。
Res Sq / / 2023-07-12
The policies on the use of large language models in radiological journals are lacking: a meta-research study.
放射學期刊中大型語言模型使用政策的不足:一項元研究。
Insights Imaging / / 2024-08-01
Shadows of wisdom: Classifying meta-cognitive and morally grounded narrative content via large language models.
智慧的陰影:透過大型語言模型將元認知和道德基礎敘事內容進行分類。
Behav Res Methods / / 2024-05-29
Title and abstract screening for literature reviews using large language models: an exploratory study in the biomedical domain.
使用大型語言模型進行文獻回顧的標題和摘要篩選:生物醫學領域的探索性研究。
Syst Rev / / 2024-06-15
Fighting reviewer fatigue or amplifying bias? Considerations and recommendations for use of ChatGPT and other large language models in scholarly peer review.
在學術同儕評審中使用 ChatGPT 和其他大型語言模型時,如何避免評審者疲勞或加劇偏見?考量與建議。
Res Integr Peer Rev / / 2023-07-22
Large Language Models in Ophthalmology Scientific Writing: Ethical Considerations Blurred Lines or Not at All?
眼科科學寫作中的大型語言模型:模糊邊界還是完全不存在的道德考量?
Am J Ophthalmol / / 2023-11-11
Large language models show human-like content biases in transmission chain experiments.
大型語言模型在傳播鏈實驗中展現出類似人類的內容偏見。
Proc Natl Acad Sci U S A / / 2023-11-12