MGFusion: a multimodal large language model-guided information perception for infrared and visible image fusion.
MGFusion:一種多模態大型語言模型引導的紅外與可見光影像融合信息感知。
Front Neurorobot 2025-01-07
Cockpit-Llama: Driver Intent Prediction in Intelligent Cockpit via Large Language Model.
Cockpit-Llama:透過大型語言模型在智慧駕駛艙中預測駕駛者意圖。
Sensors (Basel) 2025-01-11
Query by Example: Semantic Traffic Scene Retrieval Using LLM-Based Scene Graph Representation.
以範例查詢:使用基於大型語言模型(LLM)的場景圖表示進行語意交通場景檢索
Sensors (Basel) 2025-04-26
Collision risk prediction and takeover requirements assessment based on radar-video integrated sensors data: A system framework based on LLM.
基於雷達-影像整合感測器數據的碰撞風險預測與接管需求評估:一個基於LLM的系統架構
Accid Anal Prev 2025-05-06
A Multimodal Large Language Model Framework for Intelligent Perception and Decision-Making in Smart Manufacturing.
智慧製造中用於智能感知與決策的多模態大型語言模型框架
Sensors (Basel) 2025-05-28
When language and vision meet road safety: Leveraging multimodal large language models for video-based traffic accident analysis.
當語言與視覺相遇於道路安全:運用多模態大型語言模型進行基於影片的交通事故分析
Accid Anal Prev 2025-06-05
Argus: Leveraging Multiview Images for Improved 3-D Scene Understanding With Large Language Models.
Argus:結合多視角影像與大型語言模型以提升3D場景理解
IEEE Trans Neural Netw Learn Syst 2025-06-25