APBench and benchmarking large language model performance in fundamental astrodynamics problems for space engineering.
Sci Rep 2025-03-06
Large language models for human-machine collaborative particle accelerator tuning through natural language.
Sci Adv 2025-01-01
CARDBiomedBench: A Benchmark for Evaluating Large Language Model Performance in Biomedical Research.
bioRxiv 2025-01-27
Benchmarking large language models for biomedical natural language processing applications and recommendations.
Nat Commun 2025-04-05
Achieving GPT-4o level performance in astronomy with a specialized 8B-parameter large language model.
Sci Rep 2025-04-21
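AstroSage-Llama-3.1-8B is an AI model built specifically for astronomy and trained on a large body of astronomy-related data. On astronomy tests it outperforms other models in its size class and can even rival GPT-4o. It is now freely available for research and educational use.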
Industrial applications of large language models.
Sci Rep 2025-04-21
3DBench: A scalable benchmark for object and scene-level instruction-tuning of 3D large language models.
Neural Netw 2025-05-17