Mining comparative opinions from customer reviews for Competitive Intelligence
Key Concepts
- companies apply CI methods that
- monitor the reception of their own products and services
- analyze information on their competitors activities
- comparative opinions are valuable information sources for identifying the relative weaknesses and strengths of products.
- the authors distinguish between rule-based relation extraction methods with automatically and manually defined rules. The first group is further separated into (i) feature-based methods, that rely on features such as POS tags and entity types for relation extraction, and (ii) kernel-based methods that represent examples and define kernels to compute similarities in a high-dimensional space.
- comparative relation extraction obtains directed relation with the following four arguments: $$Relation Type and Direction(Product_1, Product_1, Attribute, Sentimental Phrase)$$
- Examples relations:
- >(Nokia N95, iPhone, camera, better)
- ~(Pearl, Curve, camera, high resolution)
- >(iPhone, Curve, screen, bigger
Method
The authors use conditional random fields to model (a) the relation between relations and entities, and (b) between relations and words.Linguistic Features
- Capitalization information (identify product names), Word type, POS tags
- Prefixes and suffixes: recognize sentiment phrases (bett-er, quick-er, fast-est, ...)
- Indicator phrases for comparisons (in contrast to, unlike, compare with, ...)
- Syntactic paths (identify comparisons)
- Grammatical roles (products are often subjects, attributes entities are objects, ...)