Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruction Tuning.
透過視覺參考指導調整推進多模態大型語言模型在圖表問題回答中的應用。
IEEE Trans Vis Comput Graph 2024-09-10
Enhancing Data Literacy On-demand: LLMs as Guides for Novices in Chart Interpretation.
提升數據素養即時需求:以LLMs為新手在圖表解讀中的指導。
IEEE Trans Vis Comput Graph 2024-06-12
Q-BENCH: A Benchmark for Multi-modal Foundation Models on Low-level Vision from Single Images to Pairs.
Q-BENCH:一個針對單幅圖像到成對圖像的低階視覺多模態基礎模型的基準。
IEEE Trans Pattern Anal Mach Intell 2024-08-21
Fine-Tuned Large Language Model for Visualization System: A Study on Self-Regulated Learning in Education.
針對視覺化系統的微調大型語言模型:教育中自我調節學習的研究。
IEEE Trans Vis Comput Graph 2024-09-10
An Empirical Evaluation of the GPT-4 Multimodal Language Model on Visualization Literacy Tasks.
對GPT-4多模態語言模型在視覺素養任務上的實證評估。
IEEE Trans Vis Comput Graph 2024-09-10
How Aligned are Human Chart Takeaways and LLM Predictions? A Case Study on Bar Charts with Varying Layouts.
人類圖表摘要與大型語言模型預測的對齊程度如何?以不同佈局的條形圖為案例研究。
IEEE Trans Vis Comput Graph 2024-09-16