Semantic matching is of central significance to the answer selection task which aims to select correct answers for a given question from a candidate answer pool. A useful method is to employ neural networks with attention to generate sentences representations in a way that information from pair sentences can mutually influence the computation of representations. In this work, an effective architecture,multi-size neural network with attention mechanism (AM-MSNN),is introduced into the answer selection task. This architecture captures more levels of language granularities in parallel, because of the various sizes of filters comparing with single-layer CNN and multi-layer CNNs. Meanwhile it extends the sentence representations by attention mechanism, thus containing more information for different types of questions. The empirical study on three various benchmark tasks of answer selection demonstrates the efficacy of the proposed model in all the benchmarks and its superiority over competitors. The experimental results show that (1) multi-size neural network (MSNN) is a more useful method to capture abstract features on different levels of granularities than single/multi-layer CNNs; (2) the attention mechanism (AM) is a better strategy to derive more informative representations; (3) AM-MSNN is a better architecture for the answer selection task for the moment.
翻译:语义匹配对于从候选答题库中选择对特定问题正确答案的答案的答案选择任务具有核心意义。 一种有用的方法是使用神经网络,注意生成句子表达方式,使来自对等句的信息能够对表达方式的计算产生相互影响。 在这项工作中,在选择答案的任务中引入了一个有效的结构,即具有关注机制的多尺寸神经网络(AM-MSNNN),这个结构平行地捕捉了比单层CNN和多层CNN更多的语言微粒。同时,它通过关注机制扩展了句子表达方式,从而包含了不同类型问题的更多信息。关于选择答案的三种基准任务的经验性研究表明了所有基准中拟议模型的功效及其优于竞争者。实验结果显示:(1) 多尺寸神经网络(MSNNN)是比单层/多层CNNNNP更有用的一种方法,用来捕捉不同微粒度不同层次的抽象特征;(2) 注意机制(AM)是一个更好的战略,以获得更多信息性陈述;(3) AM-MSNNNNNM是选择答案时一个更好的结构。