Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods.
大型語言模型增強強化學習的調查:概念、分類法與方法。
IEEE Trans Neural Netw Learn Syst 2025-03-03
A framework for mitigating malicious RLHF feedback in LLM training using consensus based reward.
一個基於共識獎勵的框架,用於減輕在大型語言模型訓練中惡意 RLHF 反饋的影響。
Sci Rep 2025-03-18