Abstract
Information extraction (IE) is the process of scanning text for information relevant to some interest, including extracting entities, relations, and, most challenging, events-or who did what to whom, when, and where. It requires deeper analysis than keyword searches, but its aims fall short of the very hard and long-termproblemof text understanding, wherewe seek to capture all the information in a text, alongwith the speaker’s or writer’s intention. IE represents a midpoint on this spectrum, where the aim is to capture structured informationwithout sacrificing feasibility. IE typically focuses on surface linguistic phenomena that do not require deep inference, and it focuses on the phenomena that are most frequent in texts.
| Original language | English (US) |
|---|---|
| Title of host publication | Handbook of Natural Language Processing, Second Edition |
| Publisher | CRC Press |
| Pages | 511-532 |
| Number of pages | 22 |
| ISBN (Electronic) | 9781420085938 |
| ISBN (Print) | 9781420085921 |
| State | Published - Jan 1 2010 |
| Externally published | Yes |
ASJC Scopus subject areas
- General Computer Science
- General Economics, Econometrics and Finance
- General Business, Management and Accounting
Fingerprint
Dive into the research topics of 'Information extraction'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS