SANTM: efficient self-attention-driven network for text matching

Prayag Tiwari
,
Amit Kumar Jaiswal
,
Sahil Garg
,
Ilsun You

University of Padua
,
Aalto University
,
University of Leeds
,
École de technologie supérieure
,
Soonchunhyang University

Research Output: Contribution to journal Article Peer-review

Abstract

Self-attention mechanisms have recently been embraced for a broad range of text-matching applications. Self-attention model takes only one sentence as an input with no extra information, i.e., one can utilize the final hidden state or pooling. However, text-matching problems can be interpreted either in symmetrical or asymmetrical scopes. For instance, paraphrase detection is an asymmetrical task, while textual entailment classification and question-answer matching are considered asymmetrical tasks. In this article, we leverage attractive properties of self-attention mechanism and proposes an attention-based network that incorporates three key components for inter-sequence attention: global pointwise features, preceding attentive features, and contextual features while updating the rest of the components. Our model follows evaluation on two benchmark datasets cover tasks of textual entailment and question-answer matching. The proposed efficient Self-attention-driven Network for Text Matching outperforms the state of the art on the Stanford Natural Language Inference and WikiQA datasets with much fewer parameters.

Publication Information

Output type

Research Output: Contribution to journal Article Peer-review

Original language

English

Article number

Journal (Volume, Issue Number)

ACM Transactions on Internet Technology (Volume 22, Issue 3)

Publication milestones

Accepted/In press - 01/09/2020
Published - 29/11/2021

Publication status

Published - 29/11/2021

ISSN

1533-5399

External Publication IDs

Scopus: 85137624463

Access to documents

10.1145/3426971

Link

Link to publication in Scopus, opens in new tab

Publication metrics

Metrics

Scopus