A Multimodal Large Language Model Framework for Intelligent Perception and Decision-Making in Smart Manufacturing.
Sensors (Basel) 2025-05-28
MGFusion: a multimodal large language model-guided information perception for infrared and visible image fusion.
Front Neurorobot 2025-01-07
When language and vision meet road safety: Leveraging multimodal large language models for video-based traffic accident analysis.
Accid Anal Prev 2025-06-05
The Synergy Between Data and Multi-Modal Large Language Models: A Survey From Co-Development Perspective.
IEEE Trans Pattern Anal Mach Intell 2025-06-06