Benchmarking four large language models' performance of addressing Chinese patients' inquiries about dry eye disease: A two-phase study.
Heliyon 2024-08-08
Harnessing LLMs for multi-dimensional writing assessment: Reliability and alignment with human judgments.
Heliyon 2024-08-08
Scoring story recall for individual differences research: Central details, peripheral details, and automated scoring.
Behav Res Methods 2024-08-07
Assessing the use of the novel tool Claude 3 in comparison to ChatGPT 4.0 as an artificial intelligence tool in the diagnosis and therapy of primary head and neck cancer cases.
Eur Arch Otorhinolaryngol 2024-08-07
Leveraging a large language model to predict protein phase transition: A physical, multiscale, and interpretable approach.
Proc Natl Acad Sci U S A 2024-08-07