Investigating sentence weighting components for automatic summarisation

Liang, SF, Devlin, Siobhan and Tait, John (2006) Investigating sentence weighting components for automatic summarisation. Information Processing & Management, 146-153. (doi:10.1016/j.ipm.2006.05.010).

Record type: Article

Abstract

The work described here initially formed part of a triangulation exercise to establish the effectiveness of the Query Term Order algorithm. The methodology produced subsequently proved to be a reliable indicator of quality for summarising English web documents. We utilised the human summaries from the Document Understanding Conference data, and generated queries automatically for testing the QTO algorithm. Six sentence weighting schemes that made use of Query Term Frequency and QTO were constructed to produce system summaries, and this paper explains the process of combining and balancing the weighting components. We also examined the five automatically generated query terms in their different permutations to check if the automatic generation of query terms resulting bias. The summaries produced were evaluated by the ROUGE-1 metric, and the results showed that using QTO in a weighting combination resulted in the best performance. We also found that using a combination of more weighting components always produced improved performance compared to any single weighting component.

Text

IP&M_revised-clean.pdf - Other

Download (172kB)