Accuracy of Prospective Assessments of 4 Large Language Model Chatbot Responses to Patient Questions About Emergency Care: Experimental Comparative Study.
大型語言模型聊天機器人對患者急救問題的前瞻性評估準確性:實驗比較研究。
J Med Internet Res 2024-11-04
Evaluating human ability to distinguish between ChatGPT-generated and original scientific abstracts.
評估人類區分 ChatGPT 生成與原創科學摘要的能力。
Updates Surg 2025-01-24
Identification of dental related ChatGPT generated abstracts by senior and young academicians versus artificial intelligence detectors and a similarity detector.
年長與年輕學者對於牙科相關 ChatGPT 生成摘要的識別,與人工智慧檢測器及相似性檢測器的比較。
Sci Rep 2025-04-02
Comparison of performance of artificial intelligence tools in answering emergency medicine question pool: ChatGPT 4.0, Google Gemini and Microsoft Copilot.
人工智慧工具於急診醫學題庫作答表現之比較:ChatGPT 4.0、Google Gemini 與 Microsoft Copilot
Pak J Med Sci 2025-04-28
Evaluating Accuracy and Readability of Responses to Midlife Health Questions: A Comparative Analysis of Six Large Language Model Chatbots.
六種大型語言模型聊天機器人對中年健康問題回答之準確性與可讀性評估:比較分析
J Midlife Health 2025-05-07
研究比較六款聊天機器人回答中年健康問題的表現,發現 Meta AI 答案最準確、最有條理,Perplexity 最容易閱讀。整體來說,這些聊天機器人對中年健康教育有幫助,但表現有差異,選擇合適的工具很重要。
PubMedDOI
From Algorithms to Academia: An Endeavor to Benchmark AI-Generated Scientific Papers against Human Standards.
從演算法到學術界:以人類標準評估 AI 生成科學論文的嘗試
Arch Bone Jt Surg 2025-05-07
Chatbots' Role in Generating Single Best Answer Questions for Undergraduate Medical Student Assessment: Comparative Analysis.
Chatbots 在產生醫學生單一最佳答案題目中的角色:比較分析
JMIR Med Educ 2025-05-30
Evaluation of AI-Based Chatbots in Liver Cancer Information Dissemination: A Comparative Analysis of GPT, DeepSeek, Copilot, and Gemini.
AI 聊天機器人在肝癌資訊傳播中的評估:GPT、DeepSeek、Copilot 與 Gemini 之比較分析
Oncology 2025-06-10