Abstract
This paper describes the participation of the Technical University of Catalonia in the CLEF 2007 Question Answering on Speech Transcripts track. For the processing of manual transcripts we have deployed a robust factual Question Answering that uses minimal syntactic information. For the handling of automatic transcripts we combine the QA system with a novel Passage Retrieval and Answer Extraction engine, which is based on a sequence alignment algorithm that searches for "sounds like" sequences in the document collection. We have also enriched the NERC with phonetic features to facilitate the recognition of named entities even when they are incorrectly transcribed.
| Original language | English (US) |
|---|---|
| Journal | CEUR Workshop Proceedings |
| Volume | 1173 |
| State | Published - 2007 |
| Externally published | Yes |
| Event | 2007 Cross Language Evaluation Forum Workshop, CLEF 2007, co-located with the 11th European Conference on Digital Libraries, ECDL 2007 - Budapest, Hungary Duration: Sep 19 2007 → Sep 21 2007 |
Keywords
- Phonetic distance
- Question answering
- Spoken document retrieval
ASJC Scopus subject areas
- General Computer Science