BioBridge:连接蛋白质与语言以增强生物推理能力的框架
📄 中文摘要
现有的蛋白质语言模型(PLMs)在多任务适应性方面常常受到限制,并且在多样的生物学背景下表现出较差的泛化能力。相比之下,通用的大型语言模型(LLMs)缺乏对蛋白质序列的解释能力,并且在领域特定知识方面存在不足,限制了其进行有效生物语义推理的能力。为此,提出了BioBridge,一个用于蛋白质理解的领域自适应持续预训练框架。该框架采用领域增量持续预训练(DICP)方法,将蛋白质领域知识与通用推理语料库同时注入LLM,有效缓解了灾难性遗忘问题。跨模态对齐机制进一步增强了模型的表现。
📄 English Summary
BioBridge: Bridging Proteins and Language for Enhanced Biological Reasoning with LLMs
Existing Protein Language Models (PLMs) often exhibit limited adaptability to multiple tasks and poor generalization across diverse biological contexts. In contrast, general-purpose Large Language Models (LLMs) lack the ability to interpret protein sequences and fall short in domain-specific knowledge, which limits their capacity for effective biosemantic reasoning. To address these issues, BioBridge is proposed as a domain-adaptive continual pretraining framework for protein understanding. This framework employs Domain-Incremental Continual Pre-training (DICP) to simultaneously infuse protein domain knowledge and general reasoning corpus into an LLM, effectively mitigating catastrophic forgetting. Additionally, a cross-modal alignment mechanism enhances the model's performance.
Powered by Cloudflare Workers + Payload CMS + Claude 3.5
数据源: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace 等