JailbreakLens: Visual Analysis of Jailbreak Attacks Against Large Language Models.
JailbreakLens:針對大型語言模型的 Jailbreak 攻擊之視覺化分析
IEEE Trans Vis Comput Graph 2025-06-02
AdversaFlow: Visual Red Teaming for Large Language Models with Multi-Level Adversarial Flow.
AdversaFlow:針對大型語言模型的多層對抗流可視化紅隊測試。
IEEE Trans Vis Comput Graph 2024-09-16
PromptAid: Visual Prompt Exploration, Perturbation, Testing and Iteration for Large Language Models.
PromptAid: 大型語言模型的視覺提示探索、擾動、測試與迭代。
IEEE Trans Vis Comput Graph 2025-03-03
Backdoor Attacks and Countermeasures in Natural Language Processing Models: A Comprehensive Security Review.
自然語言處理模型中的後門攻擊與對策:全面的安全性回顧。
IEEE Trans Neural Netw Learn Syst 2025-03-03
JailbreakHunter: A Visual Analytics Approach for Jailbreak Prompts Discovery From Large-Scale Human-LLM Conversational Datasets.
JailbreakHunter:一種從大規模人類-LLM對話數據集中發現越獄提示的視覺分析方法。
IEEE Trans Vis Comput Graph 2025-04-11
Large Language Model-Powered Protected Interface Evasion: Automated Discovery of Broken Access Control Vulnerabilities in Internet of Things Devices.
大型語言模型驅動的保護介面規避:自動化發現物聯網裝置中的Broken Access Control漏洞
Sensors (Basel) 2025-05-14