RxnNano:通过分层课程学习训练紧凑型大语言模型以进行化学反应和逆合成预测

📄 中文摘要

化学反应预测在加速药物发现和合成规划中至关重要。尽管数据驱动模型取得了一定进展,但当前方法受到参数和数据集规模过度强调的限制。一些方法结合了评估技术,绕过了反应表示中的基本挑战,未能捕捉到深层的化学直觉,如反应常识和拓扑原子映射逻辑。核心挑战在于如何将这些知识融入模型中。为此,提出了一个统一框架,通过三项关键创新优先考虑化学理解而非规模:1)潜在化学一致性目标,将反应建模为在连续化学空间中的运动;2)分层课程学习策略,逐步引导模型学习复杂的化学知识;3)紧凑型大语言模型设计,以提高计算效率和预测准确性。

📄 English Summary

RxnNano:Training Compact LLMs for Chemical Reaction and Retrosynthesis Prediction via Hierarchical Curriculum Learning

Chemical reaction prediction is crucial for accelerating drug discovery and synthesis planning. Despite advancements in data-driven models, current approaches are limited by an overemphasis on scaling parameters and datasets. Some methods, coupled with evaluation techniques, bypass fundamental challenges in reaction representation and fail to capture deep chemical intuitions such as reaction common sense and topological atom mapping logic. The core challenge lies in instilling this knowledge into the models. A unified framework is proposed that prioritizes chemical understanding over scale through three key innovations: (1) a Latent Chemical Consistency objective that models reactions as movements in a continuous chemical space; (2) a hierarchical curriculum learning strategy that gradually guides the model to learn complex chemical knowledge; and (3) a compact large language model design to enhance computational efficiency and prediction accuracy.

Powered by Cloudflare Workers + Payload CMS + Claude 3.5

数据源: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace 等