Using latent semantic indexing as a measure of conceptual association for noun compound disambiguation

Alan M. Buckeridge, Richard F.E. Sutcliffe

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Noun compounds are a frequently occurring yet highly ambiguous construction in natural language; their interpretation relies on extra-syntactic information. Several statistical methods for compound disambiguation have been reported in the literature; however, a striking feature of all these approaches is that disambiguation relies on statistics derived from unambiguous compounds in training, meaning they are prone to the problem of sparse data. Other researchers have overcome this difficulty somewhat by using manually crafted knowledge resources to collect statistics on "concepts" rather than noun tokens, but have sacrificed domain-independence by doing so. We report here on work investigating the application of Latent Semantic Indexing [4], an Information Retrieval technique, to the task of noun compound disambiguation. We achieved an accuracy of 84%, indicating the potential of applying vector-based distributional information measures to syntactic disambiguation.
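
The abstract does not spell out the procedure, so the following is only a minimal sketch of how an LSI-derived similarity could serve as a conceptual association score for bracketing a three-noun compound. Everything concrete in it is an assumption made for illustration: the toy corpus, the choice of `k = 2` latent dimensions, the helper names (`build_term_document_matrix`, `lsi_term_vectors`, `cosine`, `bracket`), and the adjacency-style decision rule that compares the association of the first two nouns with that of the last two. It is not the authors' implementation.

```python
import numpy as np

def build_term_document_matrix(docs):
    """Return a raw-count term-by-document matrix and a term -> row index map."""
    vocab = sorted({tok for doc in docs for tok in doc.split()})
    index = {term: i for i, term in enumerate(vocab)}
    matrix = np.zeros((len(vocab), len(docs)))
    for j, doc in enumerate(docs):
        for tok in doc.split():
            matrix[index[tok], j] += 1.0
    return matrix, index

def lsi_term_vectors(matrix, k):
    """Project terms into a k-dimensional latent space via truncated SVD."""
    u, s, _ = np.linalg.svd(matrix, full_matrices=False)
    return u[:, :k] * s[:k]  # scale each retained dimension by its singular value

def cosine(a, b):
    """Cosine similarity, defined as 0.0 when either vector has zero length."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0

def bracket(n1, n2, n3, vectors, index):
    """Assumed adjacency-style rule: group the more strongly associated pair."""
    left = cosine(vectors[index[n1]], vectors[index[n2]])
    right = cosine(vectors[index[n2]], vectors[index[n3]])
    return "left" if left >= right else "right"

# Hypothetical toy corpus; a real experiment would use a large text collection.
docs = [
    "computer science course",
    "computer science course",
    "computer science lecture",
    "department office",
    "department office",
]

matrix, index = build_term_document_matrix(docs)
vectors = lsi_term_vectors(matrix, k=2)

# "left" corresponds to [[computer science] department],
# "right" to [computer [science department]].
print(bracket("computer", "science", "department", vectors, index))
```

Because the scores come from a latent space built over all term co-occurrences rather than from counts of the compound's own noun pairs, such a measure can still assign an association to pairs that never appear together in the corpus, which is the sparse-data concern the abstract raises.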

Original language: English
Title of host publication: Artificial Intelligence and Cognitive Science - 13th Irish Conference, AICS 2002, Proceedings
Editors: Michael O’Neill, Richard F. E. Sutcliffe, Conor Ryan, Malachy Eaton, Niall J. L. Griffith
Publisher: Springer Verlag
Pages: 12-19
Number of pages: 8
ISBN (Electronic): 3540441840, 9783540441847
Publication status: Published - 2002
Event: 13th Irish Conference on Artificial Intelligence and Cognitive Science, AICS 2002 - Limerick, Ireland
Duration: 12 Sep 2002 to 13 Sep 2002

Publication series

Name: Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)
Volume: 2464
ISSN (Print): 0302-9743

Conference

Conference: 13th Irish Conference on Artificial Intelligence and Cognitive Science, AICS 2002
Country/Territory: Ireland
City: Limerick
Period: 12/09/02 to 13/09/02
