Backdoor Attacks and Countermeasures in Natural Language Processing Models: A Comprehensive Security Review.
自然語言處理模型中的後門攻擊與對策:全面的安全性回顧。
IEEE Trans Neural Netw Learn Syst 2025-03-03
Large language models can consistently generate high-quality content for election disinformation operations.
大型語言模型可以持續生成高品質內容,用於選舉虛假資訊操作。
PLoS One 2025-03-17
Large Language Models for Synthetic Dataset Generation of Cybersecurity Indicators of Compromise.
用於生成網路安全威脅指標(Indicators of Compromise, IoC)合成資料集的大型語言模型
Sensors (Basel) 2025-05-14
JailbreakLens: Visual Analysis of Jailbreak Attacks Against Large Language Models.
JailbreakLens:針對大型語言模型的 Jailbreak 攻擊之視覺化分析
IEEE Trans Vis Comput Graph 2025-06-02