Investigating the Applicability of current Machine-Learning based Subjectivity Detection Algorithms on German Texts

Abstract

In the field of subjectivity detection, algorithms automatically classify pieces of text into fact or opinion. Many different approaches have been successfully evaluated on English or Chinese texts. Nevertheless the assumption that these algorithms equally perform on all other languages cannot be verified yet. It is our intention to encourage more research in other languages, making a start with German. Therefore, this work introduces a German corpus for subjectivity detection on German news articles. We carry out this study in which we choose a number of state of the art subjectivity detection approaches and implement them. Finally we show and compare these algorithms' performances and give advice on how to use and extend the introduced dataset.

@inproceedings{atalla_2011,
 author = {Malik Atalla and Christian Scheel and Ernesto William De Luca and Sahin Albayrak},
 title = {Investigating the Applicability of current Machine-Learning based Subjectivity Detection Algorithms on German Texts},
 booktitle = {Proceedings of the Workshop on Robust Unsupervised and Semisupervised Methods in Natural Language Processing (ROBUS2011, in conjunction with RANLP 2011)},
 year = {2011},
 isbn =  {978-954-452-017-5},
 location = {Hissar, Bulgaria},
 publisher = {INCOMA Ltd.},
 address = {Shoumen, Bulgaria},
}
Autoren:
Malik Atalla, Christian Scheel, Ernesto William De Luca, Sahin Albayrak
Kategorie:
Tagungsbeitrag
Jahr:
2011
Ort:
Proceedings of the Workshop on Robust Unsupervised and Semisupervised Methods in Natural Language Processing (in conjunction with RANLP 2011)
Link: