分子Transformer中的电路、特征与启发式机制 (Circuits, Features, and Heuristics in Molecular Transformers) - 专知论文

会员服务 ·

0

分子 · Transformer · 启发式 · 结构 · 表示 ·

Circuits, Features, and Heuristics in Molecular Transformers

翻译：分子Transformer中的电路、特征与启发式机制

Kristof Varadi,Mark Marosi,Peter Antal

Transformers generate valid and diverse chemical structures, but little is known about the mechanisms that enable these models to capture the rules of molecular representation. We present a mechanistic analysis of autoregressive transformers trained on drug-like small molecules to reveal the computational structure underlying their capabilities across multiple levels of abstraction. We identify computational patterns consistent with low-level syntactic parsing and more abstract chemical validity constraints. Using sparse autoencoders (SAEs), we extract feature dictionaries associated with chemically relevant activation patterns. We validate our findings on downstream tasks and find that mechanistic insights can translate to predictive performance in various practical settings.

翻译：Transformer模型能够生成有效且多样化的化学结构，但其捕捉分子表示规则的内部机制尚不明确。本文对基于药物类小分子训练的自回归Transformer进行机制分析，从多个抽象层次揭示其能力背后的计算结构。我们识别出与低层次句法解析及更抽象的化学有效性约束相一致的计算模式。通过稀疏自编码器（SAEs），我们提取了与化学相关激活模式关联的特征词典。我们在下游任务中验证了这些发现，并证明机制性见解可转化为多种实际场景中的预测性能提升。

0

相关内容

【NeurIPS2025】大型语言模型中关系解码线性算子的结构

【NeurIPS2025】大型语言模型中关系解码线性算子的结构

专知会员服务

10+阅读 · 11月2日

【ICML2025】生成模型中潜空间的Hessian几何结构

【ICML2025】生成模型中潜空间的Hessian几何结构

专知会员服务

17+阅读 · 6月15日

图机器学习与分子分析，NUS- Xavier Bresson教授讲解,附视频与Slides

图机器学习与分子分析，NUS- Xavier Bresson教授讲解,附视频与Slides

专知会员服务

15+阅读 · 2023年1月27日

【超越消息传递:图神经网络的物理启发范式】Beyond Message Passing: a Physics-Inspired Paradigm for Graph Neural Networks

【超越消息传递:图神经网络的物理启发范式】Beyond Message Passing: a Physics-Inspired Paradigm for Graph Neural Networks

专知会员服务

17+阅读 · 2022年5月10日

语义相似性算法演化论文，29页pdf，Evolution of Semantic Similarity - A Survey

语义相似性算法演化论文，29页pdf，Evolution of Semantic Similarity - A Survey

专知会员服务

44+阅读 · 2020年4月30日

AAAI 2022 | ProtGNN：自解释图神经网络

AAAI 2022 | ProtGNN：自解释图神经网络

专知

10+阅读 · 2022年2月28日

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

专知

13+阅读 · 2020年4月1日

【NeurIPS2019】图变换网络：Graph Transformer Network

【NeurIPS2019】图变换网络：Graph Transformer Network

专知

245+阅读 · 2019年11月18日

论文浅尝 | 当知识图谱遇上零样本学习——零样本学习综述

论文浅尝 | 当知识图谱遇上零样本学习——零样本学习综述

开放知识图谱

22+阅读 · 2018年9月26日

LibRec 每周算法：DeepFM

LibRec 每周算法：DeepFM

LibRec智能推荐

14+阅读 · 2017年11月6日

Hamilton-Jacibi方程的弱KAM理论

国家自然科学基金

2+阅读 · 2017年12月31日

AFF1和AFF4形成分子开关调控细胞分化

国家自然科学基金

0+阅读 · 2015年12月31日

半参数空间自回归模型的理论研究及应用

国家自然科学基金

1+阅读 · 2015年12月31日

拟南芥非编码RNA HID1参与红光介导的光形态建成调控的分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

基于自主学习的Ad hoc Agent序贯决策研究

国家自然科学基金

46+阅读 · 2015年12月31日

Solvable Tuple Patterns and Their Applications to Program Verification

Arxiv

0+阅读 · 12月13日

Near-Optimal Experiment Design in Linear non-Gaussian Cyclic Models

Arxiv

0+阅读 · 12月4日

Structural and Spectral Properties of Strictly Interval Graphs

Arxiv

0+阅读 · 12月1日

Bayesian ICA with super-Gaussian Source Priors

Arxiv

0+阅读 · 11月14日

Cluster Catch Digraphs with the Nearest Neighbor Distance

Arxiv

0+阅读 · 11月11日

VIP会员

文章信息

相关主题

相关VIP内容

【NeurIPS2025】大型语言模型中关系解码线性算子的结构

【NeurIPS2025】大型语言模型中关系解码线性算子的结构

专知会员服务

10+阅读 · 11月2日

【ICML2025】生成模型中潜空间的Hessian几何结构

【ICML2025】生成模型中潜空间的Hessian几何结构

专知会员服务

17+阅读 · 6月15日

图机器学习与分子分析，NUS- Xavier Bresson教授讲解,附视频与Slides

图机器学习与分子分析，NUS- Xavier Bresson教授讲解,附视频与Slides

专知会员服务

15+阅读 · 2023年1月27日

【超越消息传递:图神经网络的物理启发范式】Beyond Message Passing: a Physics-Inspired Paradigm for Graph Neural Networks

【超越消息传递:图神经网络的物理启发范式】Beyond Message Passing: a Physics-Inspired Paradigm for Graph Neural Networks

专知会员服务

17+阅读 · 2022年5月10日

语义相似性算法演化论文，29页pdf，Evolution of Semantic Similarity - A Survey

语义相似性算法演化论文，29页pdf，Evolution of Semantic Similarity - A Survey

专知会员服务

44+阅读 · 2020年4月30日

热门VIP内容

开通专知VIP会员享更多权益服务

【MIT博士论文】加速科学发现的因果建模实践算法

机器人、无人机与实时影像：应对城市爆炸威胁的三大技术方案

《美国国家科学院：美空军部数字化转型报告》2025最新150页

大模型推理时代的知识编辑

相关资讯

AAAI 2022 | ProtGNN：自解释图神经网络

AAAI 2022 | ProtGNN：自解释图神经网络

专知

10+阅读 · 2022年2月28日

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

专知

13+阅读 · 2020年4月1日

【NeurIPS2019】图变换网络：Graph Transformer Network

【NeurIPS2019】图变换网络：Graph Transformer Network

专知

245+阅读 · 2019年11月18日

论文浅尝 | 当知识图谱遇上零样本学习——零样本学习综述

论文浅尝 | 当知识图谱遇上零样本学习——零样本学习综述

开放知识图谱

22+阅读 · 2018年9月26日

LibRec 每周算法：DeepFM

LibRec 每周算法：DeepFM

LibRec智能推荐

14+阅读 · 2017年11月6日

相关论文

Solvable Tuple Patterns and Their Applications to Program Verification

Arxiv

0+阅读 · 12月13日

Near-Optimal Experiment Design in Linear non-Gaussian Cyclic Models

Arxiv

0+阅读 · 12月4日

Structural and Spectral Properties of Strictly Interval Graphs

Arxiv

0+阅读 · 12月1日

Bayesian ICA with super-Gaussian Source Priors

Arxiv

0+阅读 · 11月14日

Cluster Catch Digraphs with the Nearest Neighbor Distance

Arxiv

0+阅读 · 11月11日

相关基金

Hamilton-Jacibi方程的弱KAM理论

国家自然科学基金

2+阅读 · 2017年12月31日

AFF1和AFF4形成分子开关调控细胞分化

国家自然科学基金

0+阅读 · 2015年12月31日

半参数空间自回归模型的理论研究及应用

国家自然科学基金

1+阅读 · 2015年12月31日

拟南芥非编码RNA HID1参与红光介导的光形态建成调控的分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

基于自主学习的Ad hoc Agent序贯决策研究

国家自然科学基金

46+阅读 · 2015年12月31日

微信扫码咨询专知VIP会员