3M-Transformers for Event Coding on Organized Crime Domain

Erick Skorupa Parolin, Latifur Khan, Javier Osorio, Patrick T. Brandt, Vito D’Orazio, Jennifer Holmes

Research output: Chapter in Book/Report/Conference proceedingConference contribution

10 Scopus citations

Abstract

Political scientists and security agencies increasingly rely on computerized event data generation to track conflict processes and violence around the world. However, most of these approaches rely on pattern-matching techniques constrained by large dictionaries that are too costly to develop, update, or expand to emerging domains or additional languages. In this paper, we provide an effective solution to those challenges. Here we develop the 3M-Transformers (Multilingual, Multi-label, Multitask) approach for Event Coding from domain specific multilingual corpora, dispensing external large repositories for such task, and expanding the substantive focus of analysis to organized crime, an emerging concern for security research. Our results indicate that our 3M-Transformers configurations outperform state-of-the-art usual Transformers models (BERT and XLM-RoBERTa) for coding events on actors, actions and locations in English, Spanish, and Portuguese languages.

Original languageEnglish (US)
Title of host publication2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781665420990
DOIs
StatePublished - 2021
Event8th IEEE International Conference on Data Science and Advanced Analytics, DSAA 2021 - Virtual, Online, Portugal
Duration: Oct 6 2021Oct 9 2021

Publication series

Name2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021

Conference

Conference8th IEEE International Conference on Data Science and Advanced Analytics, DSAA 2021
Country/TerritoryPortugal
CityVirtual, Online
Period10/6/2110/9/21

Keywords

  • Deep neural networks
  • event coding
  • Multi-task learning
  • Natural language processing
  • Organized crime
  • Transfer learning

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Signal Processing
  • Information Systems and Management
  • Statistics, Probability and Uncertainty

Fingerprint

Dive into the research topics of '3M-Transformers for Event Coding on Organized Crime Domain'. Together they form a unique fingerprint.

Cite this