HLT — Metaphor Identification (overview)
A lecture-level summary of the metaphor identification course, from MIPVU to neural classifiers, sketched as a lexical note for the wiki.
This is a lexical note — a longer, lecture-style write-up that contains several distinct ideas. The atomic zettels mipvu-procedure, conceptual-metaphor-theory, and metaphor-as-classification are extracted from it.
What is metaphor identification, computationally?
Two traditions sit underneath the field:
-
MIPVU (“Metaphor Identification Procedure VU”) — the linguistic annotation procedure developed at the VU. It walks each lexical unit and asks whether its contextual meaning differs from a more basic meaning, in which case it is marked as metaphor-related (see mipvu-procedure).
-
Conceptual Metaphor Theory — the cognitive-linguistic frame from Lakoff & Johnson, in which metaphors map source domains onto target domains. ARGUMENT IS WAR. LIFE IS A JOURNEY. The cognitive level, not the lexical one (see conceptual-metaphor-theory).
These are different operationalisations of the same phenomenon, and a lot of recent NLP work is essentially a tug-of-war between them.
Computational systems
In NLP, metaphor identification has been framed as token-level binary classification: each token is or is not metaphor-related. Early systems used hand-crafted features (concreteness, imageability) on top of WordNet relations; modern systems use contextual embeddings. See metaphor-as-classification.
Open problems
Annotation cost is high. Disagreement between annotators is high. Cross-domain generalisation is poor. The relationship between MIPVU and Conceptual Metaphor Theory is unresolved at the dataset level.