RxnNano：通过分层课程学习训练紧凑型大语言模型以进行化学反应和逆合成预测

出处: RxnNano:Training Compact LLMs for Chemical Reaction and Retrosynthesis Prediction via Hierarchical Curriculum Learning

发布: 2026年3月4日

📄 中文摘要

化学反应预测在加速药物发现和合成规划中至关重要。尽管数据驱动模型取得了一定进展，但当前方法受到参数和数据集规模过度强调的限制。一些方法结合了评估技术，绕过了反应表示中的基本挑战，未能捕捉到深层的化学直觉，如反应常识和拓扑原子映射逻辑。核心挑战在于如何将这些知识融入模型中。为此，提出了一个统一框架，通过三项关键创新优先考虑化学理解而非规模：1）潜在化学一致性目标，将反应建模为在连续化学空间中的运动；2）分层课程学习策略，逐步引导模型学习复杂的化学知识；3）紧凑型大语言模型设计，以提高计算效率和预测准确性。

🏷️ 相关标签

#化学反应预测 #逆合成 #分层课程学习 #大语言模型 #化学理解

📄 English Summary

RxnNano:Training Compact LLMs for Chemical Reaction and Retrosynthesis Prediction via Hierarchical Curriculum Learning

Chemical reaction prediction is crucial for accelerating drug discovery and synthesis planning. Despite advancements in data-driven models, current approaches are limited by an overemphasis on scaling parameters and datasets. Some methods, coupled with evaluation techniques, bypass fundamental challenges in reaction representation and fail to capture deep chemical intuitions such as reaction common sense and topological atom mapping logic. The core challenge lies in instilling this knowledge into the models. A unified framework is proposed that prioritizes chemical understanding over scale through three key innovations: (1) a Latent Chemical Consistency objective that models reactions as movements in a continuous chemical space; (2) a hierarchical curriculum learning strategy that gradually guides the model to learn complex chemical knowledge; and (3) a compact large language model design to enhance computational efficiency and prediction accuracy.

数据源: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace 等

📄 中文摘要

🏷️ 相关标签

📄 English Summary

RxnNano:Training Compact LLMs for Chemical Reaction and Retrosynthesis Prediction via Hierarchical Curriculum Learning

🏷️ Related Tags

📚 相关文章

AI 编程创造了新一类创作者。我就是其中之一。

人工智能成为我学习的助手

Claude CLI "泄露": 没有人赢，AI 仍然幻觉，企业仍在犯同样的错误