Skip to search boxSkip to navigationSkip to main content

Understanding negative sampling in knowledge graph embedding

  • Jing Qian
    ,
  • Gangmin Li
    ,
  • Katie Atkinson
    ,
  • Yong Yue
Research Output: Contribution to journal Article Peer-review

Open access

Abstract

Knowledge graph embedding (KGE) is to project entities and relations of a knowledge graph (KG) into a low-dimensional vector space, which has made steady progress in recent years. Conventional KGE methods, especially translational distance-based models, are trained through discriminating positive samples from negative ones. Most KGs store only positive samples for space efficiency. Negative sampling thus plays a crucial role in encoding triples of a KG. The quality of generated negative samples has a direct impact on the performance of learnt knowledge representation in a myriad of downstream tasks, such as recommendation, link prediction and node classification. We summarize current negative sampling approaches in KGE into three categories, static distribution-based, dynamic distribution-based and custom cluster-based respectively. Based on this categorization we discuss the most prevalent existing approaches and their characteristics. It is a hope that this review can provide some guidelines for new thoughts about negative sampling in KGE.

Publication Information

Output type

Research Output: Contribution to journal Article Peer-review

Original language

English

Journal (Volume, Issue Number)

International Journal of Artificial Intelligence and Applications (Volume 12, Issue 1)

Publication milestones

  • Published - 31/01/2021

Publication status

Published - 31/01/2021

ISSN

0976-2191

External Publication IDs

  • handle.net: 10547/626424