Performance of ChatGPT in the In-Training Examination for Anesthesiology and Pain Medicine Residents in South Korea: Observational Study.
南韓麻醉學與疼痛醫學住院醫師在訓練考試中 ChatGPT 的表現:觀察性研究。
JMIR Med Educ 2024-09-16
On the development and validation of large language model-based classifiers for identifying social determinants of health.
基於大型語言模型的分類器在識別健康社會決定因素中的開發與驗證。
Proc Natl Acad Sci U S A 2024-09-16
How Aligned are Human Chart Takeaways and LLM Predictions? A Case Study on Bar Charts with Varying Layouts.
人類圖表摘要與大型語言模型預測的對齊程度如何?以不同佈局的條形圖為案例研究。
IEEE Trans Vis Comput Graph 2024-09-16
AdversaFlow: Visual Red Teaming for Large Language Models with Multi-Level Adversarial Flow.
AdversaFlow:針對大型語言模型的多層對抗流可視化紅隊測試。
IEEE Trans Vis Comput Graph 2024-09-16
The future of large language models in social science research: Reply to Berger (2024) and Carrillo et al. (2024).
大型語言模型在社會科學研究中的未來:回覆 Berger (2024) 和 Carrillo et al. (2024)。
Am Psychol 2024-09-16
A workflow for human-centered machine-assisted hypothesis generation: Commentary on Banker et al. (2024).
以人為中心的機器輔助假設生成工作流程:對 Banker et al. (2024) 的評論。
Am Psychol 2024-09-16