Document Type

Conference Paper

Rights

This item is available under a Creative Commons License for non-commercial use only

Disciplines

Computer Sciences

Publication Details

Salton, G., Ross, R. & Kelleher, J. (2016). Idiom token classification using sentential distributed semantics.

Abstract

Idiom token classification is the task of deciding for a set of potentially idiomatic phrases whether each occurrence of a phrase is a literal or idiomatic usage of the phrase. In this work we explore the use of Skip-Thought Vectors to create distributed representations that encode features that are predictive with respect to idiom token classification. We show that classifiers using these representations have competitive performance compared with the state of the art in idiom token classification. Importantly, however, our models use only the sentence containing the tar- get phrase as input and are thus less dependent on a potentially inaccurate or in- complete model of discourse context. We further demonstrate the feasibility of using these representations to train a competitive general idiom token classifier.

Share

COinS