Topic Tracking in News Streams using Latent Factor Models


The increasing amount of published news articles and messages in social media make it hard for users to find the relevant information and to track interesting topics. Relevant news is hidden in a haystack of irrelevant data. Text-mining techniques have been developed to extract implicit, hidden information. These techniques analyze big datasets and compute "latent" features based on implicit correlations between documents and events. In this paper we develop a system that applies the latent factor models on data streams. Our method allows us detecting the dominant topics and tracking the changes in the relevant topics. In addition, we explain how the extracted knowledge is used for computing recommendations based on trending topics and terms. We evaluate our system on a stream of news messages published on the micro-blogging service Twitter. The evaluation shows that our system efficiently extracts topics and provides valuable insights into the continuously changing news stream helping users quickly identifying the most relevant information as well as current trends.

author = {Jens Meiner and Andreas Lommatzsch},
title = {Topic Tracking in News Streams using Latent Factor Models},
booktitle = {Proc. of the 16th International Conference on Innovations for Community Services (I4CS)},
year = {2016},
series = {I4CS '16, CCIS 648},
pages = {173-191},
isbn = {10.007/978-3-319-49466-1_12},
location = {Vienna, Austria},
publisher = {Springer International Publishing},
address = {Heidelberg, Germany},
keywords = {latent factor models, data streams, topic tracking}
Jens Meiners, Andreas Lommatzsch
Conference Paper
16th International Conference on Innovations for Community Services, Vienna, Austria