Exploring a Rich Design Space for Text Summarization

Abstract

Text summarization is a critical task to deal with the ever increasing amount of information available online. There is abundant research in this area, however, because inputs and outputs of the summarization process can vary substantially, it is often difficult to compare and integrate the existing methods. To tackle this challenge, in this talk I will introduce a set of dimensions to characterize a rich design space for summarization systems. I will then present two summarization frameworks we have been working on, one for evaluative text (e.g. customer reviews) and one for conversations (e.g., discussion forums), and discuss how they fit into the proposed design space. It will be clear that our two frameworks represent points which are quite far apart in the design space. One summarizer generates abstractive summaries of evaluative text (e.g. customer reviews) by applying a probabilistic discourse parser and by aggregating the extracted information. In contrast, the summarizer of conversations generates extractive multimedia summaries by first performing topic modeling and by then visualizing the source text annotated with the resulting topics. A comparison between these two very different frameworks will hopefully shed some light on the pros and cons of different approaches to summarization and on how to move forward in this important area of NLP.

Date
Event
Academic talk
Location
SCSE Meeting Room (N4-02A-35), NTU
Links

Speaker: Prof. Giuseppe Carenini

Bio: Giuseppe Carenini is a Professor in Computer Science at UBC (Vancouver, Canada). Giuseppe has broad interdisciplinary interests. His work on natural language processing and information visualization to support decision making has been published in over 100 peer-reviewed papers (including best paper at UMAP-14 and ACM-TiiS-14). Dr. Carenini was the area chair for “Sentiment Analysis, Opinion Mining, and Text Classification” of ACL 2009, the area chair for “Summarization and Generation” of NAACL 2012, the Program Co-Chair for IUI 2015, and the Program Co-Chair for SigDial 2016. He has also co-edited an ACM-TIST Special Issue on “Intelligent Visual Interfaces for Text Analysis”. In 2011, he published a co-authored book on “Methods for Mining and Summarizing Text Conversations”. In his work, Dr. Carenini has also extensively collaborated with industrial partners, including Microsoft and IBM. Giuseppe was awarded a Google Research Award, an IBM CASCON Best Exhibit Award, and a Yahoo Faculty Research Award in 2007, 2010 and 2016 respectively.