CARDBiomedBench: A Benchmark for Evaluating Large Language Model Performance in Biomedical Research.
bioRxiv 2025-01-27
Unveiling GPT-4V's hidden challenges behind high accuracy on USMLE questions: Observational Study.
J Med Internet Res 2025-02-07
Assessing the performance of zero-shot visual question answering in multimodal large language models for 12-lead ECG image interpretation.
Front Cardiovasc Med 2025-02-21
Evaluating the performance of large language & visual-language models in cervical cytology screening.
NPJ Precis Oncol 2025-05-23
MedBLIP: A multimodal method of medical question-answering based on fine-tuning large language model.
Comput Med Imaging Graph 2025-06-08