Enhancing semantical text understanding with fine-tuned large language models: A case study on Quora Question Pair duplicate identification.
透過微調大型語言模型增強語義文本理解:以 Quora Question Pair 重複識別為案例研究。
PLoS One 2025-01-10
A scientific-article key-insight extraction system based on multi-actor of fine-tuned open-source large language models.
基於多演員的微調開源大型語言模型的科學文章關鍵洞察提取系統。
Sci Rep 2025-01-10
Comparative evaluation and performance of large language models on expert level critical care questions: a benchmark study.
大型語言模型在專家級重症護理問題上的比較評估與表現:基準研究。
Crit Care 2025-02-10
這項研究評估了五個大型語言模型(LLMs)在重症醫學中的表現,針對1181道選擇題進行測試。結果顯示,GPT-4o的準確率最高,達93.3%,其次是Llama 3.1 70B(87.5%)和Mistral Large 2407(87.9%)。所有模型的表現都超過隨機猜測和人類醫師,但GPT-3.5-turbo未顯著優於醫師。儘管準確性高,模型仍有錯誤,需謹慎評估。GPT-4o成本高昂,對能源消耗引發關注。總體而言,LLMs在重症醫學中展現潛力,但需持續評估以確保負責任的使用。
PubMedDOI
Fine-tuning large language models for improved health communication in low-resource languages.
為低資源語言改善健康溝通的大型語言模型微調。
Comput Methods Programs Biomed 2025-02-23
Comparing large Language models and human annotators in latent content analysis of sentiment, political leaning, emotional intensity and sarcasm.
比較大型語言模型與人類標註者在情感、政治傾向、情緒強度和諷刺的潛在內容分析中的表現。
Sci Rep 2025-04-03
Benchmarking large language models for biomedical natural language processing applications and recommendations.
大型語言模型在生物醫學自然語言處理應用中的基準測試與建議。
Nat Commun 2025-04-05
Careful design of Large Language Model pipelines enables expert-level retrieval of evidence-based information from syntheses and databases.
精心設計的大型語言模型(Large Language Model, LLM)流程可實現專家級的循證資訊檢索,來自綜合分析與資料庫。
PLoS One 2025-05-15
Automating and Evaluating Large Language Models for Accurate Text Summarization Under Zero-Shot Conditions.
在零樣本條件下自動化與評估大型語言模型以提升文本摘要的準確性
AMIA Jt Summits Transl Sci Proc 2025-06-12