A framework for mitigating malicious RLHF feedback in LLM training using consensus based reward.
Sci Rep 2025-03-18
Robust privacy amidst innovation with large language models through a critical assessment of the risks.
J Am Med Inform Assoc 2025-03-20
Adaptive Compressed-based Privacy-preserving Large Language Model for Sensitive Healthcare.
IEEE J Biomed Health Inform 2025-04-08
Adapting Generative Large Language Models for Information Extraction from Unstructured Electronic Health Records in Residential Aged Care: A Comparative Analysis of Training Approaches.
J Healthc Inform Res 2025-05-01
Ethical Privacy Framework for Large Language Models in Smart Healthcare: A Comprehensive Evaluation and Protection Approach.
IEEE J Biomed Health Inform 2025-06-04
DIRI: Adversarial Patient Reidentification with Large Language Models for Evaluating Clinical Text Anonymization.
AMIA Jt Summits Transl Sci Proc 2025-06-12
Reinforcement Learning With LLMs Interaction For Distributed Diffusion Model Services.
IEEE Trans Pattern Anal Mach Intell 2025-06-30
Harnessing Moderate-Sized Language Models for Reliable Patient Data Deidentification in Emergency Department Records: Algorithm Development, Validation, and Implementation Study.
JMIR AI 2025-07-03