MedExpQA: Multilingual benchmarking of Large Language Models for Medical Question Answering.
Artif Intell Med 2024-08-09
Q-BENCH: A Benchmark for Multi-modal Foundation Models on Low-level Vision from Single Images to Pairs.
IEEE Trans Pattern Anal Mach Intell 2024-08-21
Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruction Tuning.
IEEE Trans Vis Comput Graph 2024-09-10
Assessing the performance of zero-shot visual question answering in multimodal large language models for 12-lead ECG image interpretation.
Front Cardiovasc Med 2025-02-21