Towards extracting coherent user concerns and their hierarchical organization from user reviews

Ligaj Pradhan, Chengcui Zhang, Steven Bethard

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Scopus citations

Abstract

Mining user reviews to discover what the user likes and dislikes is vital to understanding user behaviors. Topic modeling techniques have been extensively used to discover meaningful topics for user reviews and to discover user behaviors. Extracted topics may be a mixture of different concepts and hence very likely to be less coherent and unclear, especially when extracting a relatively small number of topics. As such, we propose a method that extracts a relatively large number of topics using a topic modeling technique and relies on hierarchical clustering to exploit semantic distances between topics, to generate a small number of highly coherent and clear topics. We also compare this set of topics representing hidden user concerns extracted by our approach with those derived using LDA (Latent Dirichlet Allocation) and a hierarchical variant called Pachinko Allocation Model (PAM) and show that our method generates more coherent user concerns. Further, we also demonstrate how a hierarchical model of user concerns can be automatically generated by exploiting our approach. Such a hierarchy may help capture the conceptual distances between various user concerns and inherent similarities between users having those concerns.

Original languageEnglish (US)
Title of host publicationProceedings - 2016 IEEE 17th International Conference on Information Reuse and Integration, IRI 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages582-590
Number of pages9
ISBN (Electronic)9781509032075
DOIs
StatePublished - 2016
Externally publishedYes
Event17th IEEE International Conference on Information Reuse and Integration, IRI 2016 - Pittsburgh, United States
Duration: Jul 28 2016Jul 30 2016

Publication series

NameProceedings - 2016 IEEE 17th International Conference on Information Reuse and Integration, IRI 2016

Conference

Conference17th IEEE International Conference on Information Reuse and Integration, IRI 2016
Country/TerritoryUnited States
CityPittsburgh
Period7/28/167/30/16

Keywords

  • Coherent user concerns
  • Hierarchical clustering
  • Recommendation systems
  • Topic modeling
  • User reviews

ASJC Scopus subject areas

  • Information Systems
  • Information Systems and Management

Fingerprint

Dive into the research topics of 'Towards extracting coherent user concerns and their hierarchical organization from user reviews'. Together they form a unique fingerprint.

Cite this