Classify First, and Then Extract: Prompt Chaining Technique for Information Extraction

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This work presents a new task-aware prompt design and example retrieval approach for information extraction (IE) using a prompt chaining technique. Our approach divides IE tasks into two steps: (1) text classification to understand what information (e.g., entity or event types) is contained in the underlying text and (2) information extraction for the identified types. Initially, we use a large language model (LLM) in a few-shot setting to classify the contained information. The classification output is used to select the relevant prompt and retrieve the examples relevant to the input text. Finally, we ask a LLM to do the information extraction with the generated prompt. By evaluating our approach on legal IE tasks with two different LLMs, we demonstrate that the prompt chaining technique improves the LLM's overall performance in a few-shot setting when compared to the baseline in which examples from all possible classes are included in the prompt. Our approach can be used in a low-resource setting as it does not require a large amount of training data. Also, it can be easily adapted to many different IE tasks by simply adjusting the prompts. Lastly, it provides a cost benefit by reducing the number of tokens in the prompt.

Original languageEnglish (US)
Title of host publicationNLLP 2024 - Natural Legal Language Processing Workshop 2024, Proceedings of the Workshop
EditorsNikolaos Aletras, Ilias Chalkidis, Leslie Barrett, Catalina Goanta, Daniel Preotiuc-Pietro, Gerasimos Spanakis
PublisherAssociation for Computational Linguistics (ACL)
Pages303-317
Number of pages15
ISBN (Electronic)9798891761834
StatePublished - 2024
Event6th Natural Legal Language Processing Workshop 2024, NLLP 2024, co-located with the 2024 Conference on Empirical Methods in Natural Language Processing - Miami, United States
Duration: Nov 16 2024 → …

Publication series

NameNLLP 2024 - Natural Legal Language Processing Workshop 2024, Proceedings of the Workshop

Conference

Conference6th Natural Legal Language Processing Workshop 2024, NLLP 2024, co-located with the 2024 Conference on Empirical Methods in Natural Language Processing
Country/TerritoryUnited States
CityMiami
Period11/16/24 → …

ASJC Scopus subject areas

  • Language and Linguistics
  • Computational Theory and Mathematics
  • Software
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Classify First, and Then Extract: Prompt Chaining Technique for Information Extraction'. Together they form a unique fingerprint.

Cite this