Skip to main navigation Skip to search Skip to main content

Information extraction

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

Information extraction (IE) is the process of scanning text for information relevant to some interest, including extracting entities, relations, and, most challenging, events-or who did what to whom, when, and where. It requires deeper analysis than keyword searches, but its aims fall short of the very hard and long-termproblemof text understanding, wherewe seek to capture all the information in a text, alongwith the speaker’s or writer’s intention. IE represents a midpoint on this spectrum, where the aim is to capture structured informationwithout sacrificing feasibility. IE typically focuses on surface linguistic phenomena that do not require deep inference, and it focuses on the phenomena that are most frequent in texts.

Original languageEnglish (US)
Title of host publicationHandbook of Natural Language Processing, Second Edition
PublisherCRC Press
Pages511-532
Number of pages22
ISBN (Electronic)9781420085938
ISBN (Print)9781420085921
StatePublished - Jan 1 2010
Externally publishedYes

ASJC Scopus subject areas

  • General Computer Science
  • General Economics, Econometrics and Finance
  • General Business, Management and Accounting

Fingerprint

Dive into the research topics of 'Information extraction'. Together they form a unique fingerprint.

Cite this