SMART: 判决作为文本评价的基本单位 (SMART: Sentences as Basic Units for Text Evaluation) - 专知论文

会员服务 ·

0

BASIC · 泛函 · Extensibility · MoDELS · 相关系数 ·

2022 年 8 月 1 日

SMART: Sentences as Basic Units for Text Evaluation

翻译：SMART: 判决作为文本评价的基本单位

Reinald Kim Amplayo,Peter J. Liu,Yao Zhao,Shashi Narayan

from arxiv, code coming soon

Widely used evaluation metrics for text generation either do not work well with longer texts or fail to evaluate all aspects of text quality. In this paper, we introduce a new metric called SMART to mitigate such limitations. Specifically, We treat sentences as basic units of matching instead of tokens, and use a sentence matching function to soft-match candidate and reference sentences. Candidate sentences are also compared to sentences in the source documents to allow grounding (e.g., factuality) evaluation. Our results show that system-level correlations of our proposed metric with a model-based matching function outperforms all competing metrics on the SummEval summarization meta-evaluation dataset, while the same metric with a string-based matching function is competitive with current model-based metrics. The latter does not use any neural model, which is useful during model development phases where resources can be limited and fast evaluation is required. Finally, we also conducted extensive analyses showing that our proposed metrics work well with longer summaries and are less biased towards specific models.

翻译：广泛使用的文本生成评价指标要么与较长的文本不起作用,要么无法评估文本质量的所有方面。在本文中,我们引入了称为SMART的新指标,以缓解这些限制。具体地说,我们把判决作为匹配的基本单位而不是象征性,并使用对软匹配候选人和参考判决的匹配功能。候选判决也与源文件中的判决进行比较,以便能够进行依据(例如事实质量)评估。我们的结果显示,我们拟议的指标与基于模型的匹配功能的系统层面相关性超过了SummEval总配对元评价数据集中所有相互竞争的指标,而同一基于字符串的匹配功能与目前基于模型的指数具有竞争力。后者不使用任何神经模型,这种模型在模型开发阶段有用,因为那里的资源有限,需要快速评估。最后,我们还进行了广泛的分析,表明我们拟议的指标与较长的概要很有效,而且不太偏向特定模型。

0

相关内容

BASIC

Beginner's All－purpose Symbolic Instruction Code（初学者通用的符号指令代码），刚开始被作者写做 BASIC，后来被微软广泛地叫做 Basic 。

【Hugging Face】指导文本生成与约束波束搜索🤗Transformers，Guiding Text Generation with Constrained Beam Search in 🤗 Transformers

【Hugging Face】指导文本生成与约束波束搜索🤗Transformers，Guiding Text Generation with Constrained Beam Search in 🤗 Transformers

专知会员服务

22+阅读 · 2022年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

大岩桐叶序调控相关基因的克隆及功能分析

国家自然科学基金

0+阅读 · 2013年12月31日

上调转录因子Nrf2用于造血干细胞抗氧化保护的研究

国家自然科学基金

0+阅读 · 2012年12月31日

钾离子门控通道基因TREK-1在前列腺癌发病中的作用及其表达调控研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于关联分析的野生毛花猕猴桃AsA富集相关基因发掘及功能解析

国家自然科学基金

0+阅读 · 2012年12月31日

油菜BnICE1基因与MAP激酶信号途径在调控植物耐寒性中的相互作用研究

国家自然科学基金

0+阅读 · 2011年12月31日

A Contrastive Framework for Neural Text Generation

Arxiv

0+阅读 · 2022年9月26日

News Summarization and Evaluation in the Era of GPT-3

Arxiv

0+阅读 · 2022年9月26日

Implementing contact angle boundary conditions for second-order Phase-Field models of wall-bounded multiphase flows

Arxiv

0+阅读 · 2022年9月24日

XF2T: Cross-lingual Fact-to-Text Generation for Low-Resource Languages

Arxiv

0+阅读 · 2022年9月22日

Attention Bottlenecks for Multimodal Fusion

Arxiv

31+阅读 · 2021年6月30日

VIP会员

文章信息

相关主题

相关VIP内容

【Hugging Face】指导文本生成与约束波束搜索🤗Transformers，Guiding Text Generation with Constrained Beam Search in 🤗 Transformers

【Hugging Face】指导文本生成与约束波束搜索🤗Transformers，Guiding Text Generation with Constrained Beam Search in 🤗 Transformers

专知会员服务

22+阅读 · 2022年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【MIT博士论文】弱监督学习：理论、方法与应用

Andrej Karpathy：2025 年 LLM 年度回顾（2025 LLM Year in Review）

锚定情报：合成欺骗时代的地面真相

NeurIPS 2025 | NMKE：基于神经元归因与动态稀疏掩码的终身知识编辑

相关资讯

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

相关论文

A Contrastive Framework for Neural Text Generation

Arxiv

0+阅读 · 2022年9月26日

News Summarization and Evaluation in the Era of GPT-3

Arxiv

0+阅读 · 2022年9月26日

Implementing contact angle boundary conditions for second-order Phase-Field models of wall-bounded multiphase flows

Arxiv

0+阅读 · 2022年9月24日

XF2T: Cross-lingual Fact-to-Text Generation for Low-Resource Languages

Arxiv

0+阅读 · 2022年9月22日

Attention Bottlenecks for Multimodal Fusion

Arxiv

31+阅读 · 2021年6月30日

相关基金

大岩桐叶序调控相关基因的克隆及功能分析

国家自然科学基金

0+阅读 · 2013年12月31日

上调转录因子Nrf2用于造血干细胞抗氧化保护的研究

国家自然科学基金

0+阅读 · 2012年12月31日

钾离子门控通道基因TREK-1在前列腺癌发病中的作用及其表达调控研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于关联分析的野生毛花猕猴桃AsA富集相关基因发掘及功能解析

国家自然科学基金

0+阅读 · 2012年12月31日

油菜BnICE1基因与MAP激酶信号途径在调控植物耐寒性中的相互作用研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员