Measuring Semantic Similarity by Latent Relational Analysis

From National Research Council Canada

Download	View accepted manuscript: Measuring Semantic Similarity by Latent Relational Analysis (PDF, 253 KiB)
Author	Search for: Turney, Peter¹
Affiliation	National Research Council of Canada. NRC Institute for Information Technology
Format	Text, Article
Conference	The Nineteenth International Joint Conference on Artificial Intelligence (IJCAI-05), July 30 - August 5, 2005, Edinburgh, Scotland
Abstract	This paper introduces Latent Relational Analysis (LRA), a method for measuring semantic similarity. LRA measures similarity in the semantic relations between two pairs of words. When two pairs have a high degree of relational similarity, they are analogous. For example, the pair cat:meow is analogous to the pair dog:bark. There is evidence from cognitive science that relational similarity is fundamental to many cognitive and linguistic tasks (e.g., analogical reasoning). In the Vector Space Model (VSM) approach to measuring relational similarity, the similarity between two pairs is calculated by the cosine of the angle between the vectors that represent the two pairs. The elements in the vectors are based on the frequencies of manually constructed patterns in a large corpus. LRA extends the VSM approach in three ways: (1) patterns are derived automatically from the corpus, (2) Singular Value Decomposition is used to smooth the frequency data, and (3) synonyms are used to reformulate word pairs. This paper describes the LRA algorithm and experimentally compares LRA to VSM on two tasks, answering college-level multiple-choice word analogy questions and classifying semantic relations in noun-modifier expressions. LRA achieves state-of-the-art results, reaching human-level performance on the analogy questions and significantly exceeding VSM performance on both tasks.
Publication date	2005
In	Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence (IJCAI-05).
Language	English
NRC number	NRCC 48255
NPARC number	5764414
Export citation	Export as RIS
Report a correction	Report a correction (opens in a new tab)
Record identifier	e98f097a-24d2-4420-93f3-812be915dcec
Record created	2009-03-29
Record modified	2020-10-09

Date modified:: 2025-04-04