Summary: | In this work, we present a new unsupervised and language-independent methodology to detect the relations of textual generality. For this, we introduce a particular case of Textual Entailment (TE), namely Textual Entailment by Generality (TEG). TE aims to capture primary semantic inference needs across applications in Natural Language Processing (NLP). Since 2005, in the TE Recognition (RTE) task, systems have been asked to automatically judge whether the meaning of a portion of the text, the Text (<i>T</i>), entails the meaning of another text, the Hypothesis (<i>H</i>). Several novel approaches and improvements in TE technologies demonstrated in RTE Challenges are signaling renewed interest towards a more in-depth and better understanding of the core phenomena involved in TE. In line with this direction, in this work, we focus on a particular case of entailment, entailment by generality, to detect the relations of textual generality. In text, there are different kinds of entailments, yielded from different types of implicative reasoning (lexical, syntactical, common sense based), but here, we focus just on TEG, which can be defined as an entailment from a specific statement towards a relatively more general one. Therefore, we have <inline-formula><math display="inline"><semantics><mrow><mi>T</mi><mover><mo>→</mo><mi>G</mi></mover><mi>H</mi></mrow></semantics></math></inline-formula> whenever the premise <i>T</i> entails the hypothesis <i>H</i>, this also being more general than the premise. We propose an unsupervised and language-independent method to recognize TEGs, from a pair <inline-formula><math display="inline"><semantics><mrow><mo>〈</mo><mi>T</mi><mo>,</mo><mi>H</mi><mo>〉</mo></mrow></semantics></math></inline-formula> having an entailment relation. To this end, we introduce an Informative Asymmetric Measure (IAM) called Simplified Asymmetric InfoSimba (AISs), which we combine with different Asymmetric Association Measures (AAM). In this work, we hypothesize about the existence of a particular mode of TE, namely TEG. Thus, the main contribution of our study is highlighting the importance of this inference mechanism. Consequently, the new annotation data seem to be a valuable resource for the community.
|