ISTC-CC Abstract
Jointly Modeling Aspects, Ratings and Sentiments for Movie Recommendations
Proceedings of ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD’14), August 2014.
Qiming Diao, Minghui Qiu, Chao-Yuan Wu*, Alexander J. Smola*^, Jing Jiang, Chong Wang*
Singapore Mgt. University
* Carnegie Mellon University
^ Google
Recommendation and review sites offer a wealth of information beyond ratings. For instance, on IMDb users leave reviews, commenting on different aspects of a movie (e.g., actors, plot, visual effects) and expressing their sentiments (positive or negative) on these aspects. This suggests that uncovering aspects and sentiments will allow us to gain a better understanding of users, movies, and the process involved in generating ratings.
The ability to answer questions such as “Does this user care more about the plot or about the special effects?” or “What is the quality of the movie in terms of acting?” helps us to understand why certain ratings are generated. This can be used to provide more meaningful recommendations.
In this work we propose a probabilistic model based on collaborative filtering and topic modeling. It allows us to capture the interest distribution of users and the content distribution for movies; it provides a link between interest and relevance on a per-aspect basis; and it allows us to differentiate between positive and negative sentiments on a per-aspect basis. Unlike prior work, our approach is entirely unsupervised and does not require knowledge of the aspect-specific ratings or genres for inference.
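To make the per-aspect link between user interest and movie quality concrete, here is a minimal sketch in Python. It is not the model from the paper: the function names, the aspect list, and all numbers are hypothetical, and the sketch simply assumes that a predicted rating can be read as an interest-weighted average of per-aspect scores. The actual model learns such quantities from review text without supervision.

```python
# Illustrative only: hypothetical names and numbers, not the paper's model.
ASPECTS = ["acting", "plot", "visual effects"]


def normalize(weights):
    """Scale non-negative weights so they sum to 1 (an interest distribution)."""
    total = sum(weights)
    return [w / total for w in weights]


def predict_rating(user_interest, movie_aspect_scores):
    """Weight each per-aspect movie score by how much the user cares about it."""
    interest = normalize(user_interest)
    return sum(w * s for w, s in zip(interest, movie_aspect_scores))


# Two hypothetical users with different interests over ASPECTS.
plot_fan = [1.0, 3.0, 1.0]
effects_fan = [1.0, 1.0, 3.0]

# One hypothetical movie: strong plot, weak visual effects (1-10 scale).
movie = [6.0, 9.0, 4.0]

plot_fan_rating = predict_rating(plot_fan, movie)
effects_fan_rating = predict_rating(effects_fan, movie)
print(plot_fan_rating, effects_fan_rating)
```

Under these assumed numbers, the user who cares most about plot receives a higher predicted rating for this movie than the user who cares most about visual effects, which is the kind of per-aspect explanation the abstract describes.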
We evaluate our model on a live copy crawled from IMDb. Our model offers superior performance by joint modeling. Moreover, we are able to address the cold start problem: by utilizing the information inherent in reviews, our model demonstrates improvement for new users and movies.
FULL PAPER: pdf