Reinforcement Learning With LLMs Interaction For Distributed Diffusion Model Services.
分散式擴散模型服務中結合 LLMs 互動的強化學習
IEEE Trans Pattern Anal Mach Intell 2025-06-30
LLMER: Crafting Interactive Extended Reality Worlds with JSON Data Generated by Large Language Models.
LLMER:利用大型語言模型生成的 JSON 數據創建互動擴展現實世界。
IEEE Trans Vis Comput Graph 2025-03-10
A framework for mitigating malicious RLHF feedback in LLM training using consensus based reward.
一個基於共識獎勵的框架,用於減輕在大型語言模型訓練中惡意 RLHF 反饋的影響。
Sci Rep 2025-03-18