通过因果关系调解分析,测试培训前语言模型对分配的理解 (Testing Pre-trained Language Models' Understanding of Distributivity via Causal Mediation Analysis) - 专知论文

会员服务 ·

0

Analysis · 可理解性 · MoDELS · 知识 (knowledge) · 语言模型化 ·

2022 年 9 月 11 日

Testing Pre-trained Language Models' Understanding of Distributivity via Causal Mediation Analysis

翻译：通过因果关系调解分析,测试培训前语言模型对分配的理解

Pangbo Ban,Yifan Jiang,Tianran Liu,Shane Steinert-Threlkeld

To what extent do pre-trained language models grasp semantic knowledge regarding the phenomenon of distributivity? In this paper, we introduce DistNLI, a new diagnostic dataset for natural language inference that targets the semantic difference arising from distributivity, and employ the causal mediation analysis framework to quantify the model behavior and explore the underlying mechanism in this semantically-related task. We find that the extent of models' understanding is associated with model size and vocabulary size. We also provide insights into how models encode such high-level semantic knowledge.

翻译：培训前语言模型在多大程度上掌握了有关分配现象的语义学知识? 在本文中,我们引入了DetNLI,这是针对分配性产生的语义差异的自然语言推断的新诊断数据集,并使用因果调解分析框架来量化模式行为和探索这一与语义相关任务的基本机制。我们发现,模型的理解程度与模型大小和词汇大小相关。我们还提供了如何将这种高级语义学知识编码的模型的洞察力。

0

相关内容

Analysis

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

ExBert — 可视化分析Transformer学到的表示

ExBert — 可视化分析Transformer学到的表示

专知会员服务

32+阅读 · 2019年10月16日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

气固两相在循环流化床传质过程中的计算传质学方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

纳米晶TiO2中可控缺陷对光电化学性能影响及机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

鼓泡流化床中局部不均匀结构与"三传一反"关系研究

国家自然科学基金

0+阅读 · 2012年12月31日

CRMP2对MCAO大鼠的神经保护作用

国家自然科学基金

0+阅读 · 2012年12月31日

靶向信号BTLA抑制TH1细胞极化治疗类风湿关节炎的研究

国家自然科学基金

0+阅读 · 2009年12月31日

BayesGmed: An R-package for Bayesian Causal Mediation Analysis

BayesGmed: An R-package for Bayesian Causal Mediation Analysis

Arxiv

0+阅读 · 2022年10月21日

A Linguistic Investigation of Machine Learning based Contradiction Detection Models: An Empirical Analysis and Future Perspectives

Arxiv

0+阅读 · 2022年10月19日

A Survey of Deep Causal Model

Arxiv

45+阅读 · 2022年9月19日

The Causal Learning of Retail Delinquency

Arxiv

15+阅读 · 2020年12月17日

UniViLM: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation

UniViLM: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation

Arxiv

19+阅读 · 2020年2月15日

VIP会员

文章信息

相关主题

知识 (knowledge)

语言模型化

相关VIP内容

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

ExBert — 可视化分析Transformer学到的表示

ExBert — 可视化分析Transformer学到的表示

专知会员服务

32+阅读 · 2019年10月16日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

锚定情报：合成欺骗时代的地面真相

NeurIPS 2025 | NMKE：基于神经元归因与动态稀疏掩码的终身知识编辑

《无人军用移动机器人中密码学与导航系统的集成：当前趋势与前景综述》

【MIT博士论文】弱监督学习：理论、方法与应用

相关资讯

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

BayesGmed: An R-package for Bayesian Causal Mediation Analysis

BayesGmed: An R-package for Bayesian Causal Mediation Analysis

Arxiv

0+阅读 · 2022年10月21日

A Linguistic Investigation of Machine Learning based Contradiction Detection Models: An Empirical Analysis and Future Perspectives

Arxiv

0+阅读 · 2022年10月19日

A Survey of Deep Causal Model

Arxiv

45+阅读 · 2022年9月19日

The Causal Learning of Retail Delinquency

Arxiv

15+阅读 · 2020年12月17日

UniViLM: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation

UniViLM: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation

Arxiv

19+阅读 · 2020年2月15日

相关基金

气固两相在循环流化床传质过程中的计算传质学方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

纳米晶TiO2中可控缺陷对光电化学性能影响及机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

鼓泡流化床中局部不均匀结构与"三传一反"关系研究

国家自然科学基金

0+阅读 · 2012年12月31日

CRMP2对MCAO大鼠的神经保护作用

国家自然科学基金

0+阅读 · 2012年12月31日

靶向信号BTLA抑制TH1细胞极化治疗类风湿关节炎的研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员