Michael Wiegand, Marc Schulder, & Josef Ruppenhofer. (n.d.). Opinion Holder and Target Extraction for Verb-based Opinion Predicates -- The Problem is Not Solved. In Proceedings of the 6th Workshop in Computational Approaches to Subjectivity and Sentiment Analysis. Lisbon, Portugal.
Problem description
This article focuses on the extraction of opinion roles for verb-based opinion predicates, noting that automatic discrimination of these roles is a challenging task, since the position of opinion holder (OH) and opinion target (OT) depend on the verb and even may vary depending on the point of view, as demonstrated in the examples below:
- Peter/OH dislikes Marry/OT.
- Peter/OT disappoints Marry/OH.
- Peter/OH consoles Marry/OT.
- Peter/OT consoles Marry/OH.
- These people/OT are gossiping a lot.
The authors conclude, that lexical resources would need to include information on the argument positions of a verb's opinion roles - a requirement which neither FrameNet nor WordNet meet at this stage. Another problem is, that the
review domain which is often used for sentiment analysis tasks does
not seem to be
suitable for opinion holder extraction, since
- product reviews typically focus on the author's view on a particular product, and
- adjectives are much more frequently used than verbs in this domain.
The authors conclude that other domains (such as News media) might be much more relevant to the problem of opinion holder and target extraction.
Automatic extraction of opinion roles
The paper also includes approaches to automatically extract opinion roles that are not very successful at this stage:
- Similarity based: Opinion roles do not correspond to the similarity of opinion verbs, as experiments performed with the WordNet::Similarity metric demonstrate (for instance: rage, temper, fate and fear are similar to outrage, but have a reverse selectional preference).
- Machine learning: experiments which learn selectional preferences from MPQA using (a) the sequence labeler MultiRel and (b) convolution kernels show that the obtained F-measure amounts to ~72% in-domain and only ~45% across-domains.