Abstract
The use of text summaries in information-seeking research has focused on query-based summaries. Extracting content that resembles the query alone, however, ignores the greater context of the document. Such context may be central to the purpose and meaning of the document. We developed a generic, a query-based, and a hybrid summarizer, each with differing amounts of document context. The generic summarizer used a blend of discourse information and information obtained through traditional surface-level analysis. The query-based summarizer used only query-term information, and the hybrid summarizer used some discourse information along with query-term information. The validity of the generic summarizer was shown through an intrinsic evaluation using a well-established corpus of human-generated summaries. All three summarizers were then compared in an information-seeking experiment involving 297 subjects. Results from the information-seeking experiment showed that the generic summaries outperformed all others in the browse tasks, while the query-based and hybrid summaries outperformed the generic summary in the search tasks. Thus, the document context of generic summaries helped users browse, while such context was not helpful in search tasks. Such results are interesting given that generic summaries have not been studied in search tasks and the that majority of Internet search engines rely solely on query-based summaries.
Original language | English (US) |
---|---|
Pages (from-to) | 111-141 |
Number of pages | 31 |
Journal | ACM Transactions on Information Systems |
Volume | 24 |
Issue number | 1 |
DOIs | |
State | Published - 2006 |
Keywords
- Browse
- Generic summaries
- Indicative summaries
- Information seeking
- Natural language processing
- Search
- Summarization
- Text processing
ASJC Scopus subject areas
- Information Systems
- General Business, Management and Accounting
- Computer Science Applications