Exploiting Conversational Features to Detect High-Quality Blog Comments

Abstract

In this work, we present a method for classifying the quality of blog comments using Linear-Chain Conditional Random Fields (CRFs). This approach is found to yield high accuracy on binary classification of high-quality comments, with conversational features contributing strongly to the accuracy. We also present a new corpus of blog data in conversational form, complete with user-generated quality moderation labels from the science and technology news blog Slashdot.

Publication
In Proceedings of the Canadian Conference on Artificial Intelligence (CAI) 2011. St. John's, Newfoundland. (short paper)