

Title: 3M-Transformers for Event Coding on Organized Crime Domain
Political scientists and security agencies increasingly rely on computerized event data generation to track conflict processes and violence around the world. However, most of these approaches depend on pattern-matching techniques constrained by large dictionaries that are too costly to develop, update, or expand to emerging domains or additional languages. In this paper, we provide an effective solution to those challenges: the 3M-Transformers (Multilingual, Multi-label, Multitask) approach for event coding from domain-specific multilingual corpora, which dispenses with large external repositories for this task and expands the substantive focus of analysis to organized crime, an emerging concern for security research. Our results indicate that our 3M-Transformers configurations outperform standard state-of-the-art Transformer models (BERT and XLM-RoBERTa) for coding event actors, actions, and locations in English, Spanish, and Portuguese.
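As a rough illustration of the 3M idea (a sketch, not the paper's actual implementation), the snippet below wires a shared XLM-RoBERTa encoder to separate multi-label heads for actors, actions, and locations using the Hugging Face transformers library; the label counts, checkpoint name, and example sentence are illustrative assumptions.

```python
# Sketch: one multilingual encoder shared across three multi-label heads
# (actors, actions, locations). Label counts are placeholders.
import torch
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer

class MultiTaskEventCoder(nn.Module):
    def __init__(self, encoder_name="xlm-roberta-base",
                 n_actors=20, n_actions=20, n_locations=30):
        super().__init__()
        self.encoder = AutoModel.from_pretrained(encoder_name)
        hidden = self.encoder.config.hidden_size
        # One head per task; multi-label outputs are sigmoid-activated at train time.
        self.actor_head = nn.Linear(hidden, n_actors)
        self.action_head = nn.Linear(hidden, n_actions)
        self.location_head = nn.Linear(hidden, n_locations)

    def forward(self, input_ids, attention_mask):
        out = self.encoder(input_ids=input_ids, attention_mask=attention_mask)
        pooled = out.last_hidden_state[:, 0]  # sentence vector from the first token
        return (self.actor_head(pooled),
                self.action_head(pooled),
                self.location_head(pooled))

tokenizer = AutoTokenizer.from_pretrained("xlm-roberta-base")
model = MultiTaskEventCoder()
batch = tokenizer(["Police seized a cocaine shipment in Porto."],
                  return_tensors="pt", padding=True, truncation=True)
actor_logits, action_logits, location_logits = model(**batch)
# Training would apply nn.BCEWithLogitsLoss to each head against multi-hot
# label vectors and combine the three losses into a single objective.
```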
Award ID(s):
1931541
NSF-PAR ID:
10470307
Author(s) / Creator(s):
Publisher / Repository:
IEEE
Date Published:
Page Range / eLocation ID:
1 to 10
Format(s):
Medium: X
Location:
Porto, Portugal
Sponsoring Org:
National Science Foundation
More Like this
  1. Political and social scientists monitor, analyze, and predict political unrest and violence to prevent (or mitigate) harm and to promote the management of global conflict. They do so using event coder systems, which extract structured representations from news articles that are then used to design forecast models and event-driven continuous monitoring systems. Existing methods rely on expensive, manually annotated dictionaries and do not support multilingual settings. To advance global conflict management, we propose a novel model, Multi-CoPED (Multilingual Multi-Task Learning BERT for Coding Political Event Data), which exploits multi-task learning and state-of-the-art language models for coding multilingual political events. This eliminates the need for expensive dictionaries by leveraging BERT models' contextual knowledge through transfer learning. The multilingual experiments demonstrate the superiority of Multi-CoPED over existing event coders, improving absolute macro-averaged F1-scores by 23.3% and 30.7% for coding events in English and Spanish corpora, respectively. We believe that such substantial performance improvements can help reduce harm to people at risk of violence.
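As an illustration of the multi-task objective mentioned above (a generic sketch under assumed task names and label sizes, not Multi-CoPED's published code), the per-task cross-entropy losses of heads sharing one encoder can simply be weighted and summed:

```python
# Sketch of a joint multi-task loss: one shared encoder forward pass feeds
# several task heads, and their cross-entropy losses are summed into one
# objective that is backpropagated once. Tensors below are stand-ins.
import torch
import torch.nn as nn

def multitask_loss(task_logits, task_labels, task_weights=None):
    """task_logits / task_labels are dicts keyed by task name."""
    ce = nn.CrossEntropyLoss()
    total = 0.0
    for task, logits in task_logits.items():
        weight = 1.0 if task_weights is None else task_weights.get(task, 1.0)
        total = total + weight * ce(logits, task_labels[task])
    return total

# Random tensors standing in for real head outputs and gold labels.
logits = {"actor": torch.randn(4, 10), "action": torch.randn(4, 25)}
labels = {"actor": torch.randint(0, 10, (4,)),
          "action": torch.randint(0, 25, (4,))}
loss = multitask_loss(logits, labels)  # loss.backward() updates encoder and all heads
```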
  2. International dark web platforms operating within multiple geopolitical regions and languages host a myriad of hacker assets such as malware, hacking tools, hacking tutorials, and malicious source code. Cybersecurity analytics organizations employ machine learning models trained on human-labeled data to automatically detect these assets and bolster their situational awareness. However, the lack of human-labeled training data is prohibitive when analyzing foreign-language dark web content. In this research note, we adopt the computational design science paradigm to develop a novel IT artifact for cross-lingual hacker asset detection (CLHAD). CLHAD automatically leverages the knowledge learned from English content to detect hacker assets in non-English dark web platforms. CLHAD encompasses a novel adversarial deep representation learning (ADREL) method, which generates multilingual text representations using generative adversarial networks (GANs). Drawing upon the state of the art in cross-lingual knowledge transfer, ADREL is a novel approach to automatically extract transferable text representations and facilitate the analysis of multilingual content. We evaluate CLHAD on Russian, French, and Italian dark web platforms, demonstrate its practical utility in hacker asset profiling, and conduct a proof-of-concept case study. Our analysis suggests that cybersecurity managers may benefit more from focusing on Russian-language content to identify sophisticated hacking assets. In contrast, financial hacker assets are scattered among several dominant dark web languages. Managerial insights for security managers are discussed at operational and strategic levels.
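The adversarial alignment idea behind ADREL can be sketched generically as a feature encoder trained to fool a language discriminator; the layer sizes, optimizers, and random stand-in batches below are assumptions for illustration, not the CLHAD architecture itself.

```python
# Generic GAN-style alignment step: the discriminator learns to tell English
# features from non-English features, and the encoder is then updated to
# fool it, pushing both languages toward a shared representation space.
import torch
import torch.nn as nn

dim = 256
encoder = nn.Sequential(nn.Linear(300, dim), nn.ReLU(), nn.Linear(dim, dim))
discriminator = nn.Sequential(nn.Linear(dim, 64), nn.ReLU(), nn.Linear(64, 1))
opt_enc = torch.optim.Adam(encoder.parameters(), lr=1e-4)
opt_disc = torch.optim.Adam(discriminator.parameters(), lr=1e-4)
bce = nn.BCEWithLogitsLoss()

en_batch = torch.randn(32, 300)  # stand-in for English text features
ru_batch = torch.randn(32, 300)  # stand-in for non-English text features

# 1) Discriminator step: learn to separate the two languages (encoder frozen via detach).
opt_disc.zero_grad()
d_loss = (bce(discriminator(encoder(en_batch).detach()), torch.ones(32, 1)) +
          bce(discriminator(encoder(ru_batch).detach()), torch.zeros(32, 1)))
d_loss.backward()
opt_disc.step()

# 2) Encoder step: fool the discriminator (label flipped for the non-English batch).
opt_enc.zero_grad()
g_loss = bce(discriminator(encoder(ru_batch)), torch.ones(32, 1))
g_loss.backward()
opt_enc.step()
```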
  3. Modern NLP applications have enjoyed a great boost from neural network models. Such deep neural models, however, are not applicable to most human languages due to the lack of annotated training data for various NLP tasks. Cross-lingual transfer learning (CLTL) is a viable method for building NLP models for a low-resource target language by leveraging labeled data from other (source) languages. In this work, we focus on the multilingual transfer setting, where training data in multiple source languages is leveraged to further boost target language performance. Unlike most existing methods that rely only on language-invariant features for CLTL, our approach coherently utilizes both language-invariant and language-specific features at the instance level. Our model leverages adversarial networks to learn language-invariant features, and mixture-of-experts models to dynamically exploit the similarity between the target language and each individual source language. This enables our model to learn effectively what to share between various languages in the multilingual setup. Moreover, when coupled with unsupervised multilingual embeddings, our model can operate in a zero-resource setting where neither target language training data nor cross-lingual resources are available. Our model achieves significant performance gains over prior art, as shown in an extensive set of experiments over multiple text classification and sequence tagging tasks.
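A minimal sketch of the mixture-of-experts component described above, with one expert per source language combined by a softmax gate conditioned on the input representation; the feature size, number of experts, and class count are illustrative assumptions.

```python
# Sketch of a mixture-of-experts layer: per-source-language experts whose
# predictions are blended by input-dependent gate weights.
import torch
import torch.nn as nn

class MixtureOfExperts(nn.Module):
    def __init__(self, dim=128, n_experts=3, n_classes=5):
        super().__init__()
        self.experts = nn.ModuleList([nn.Linear(dim, n_classes) for _ in range(n_experts)])
        self.gate = nn.Linear(dim, n_experts)

    def forward(self, x):
        weights = torch.softmax(self.gate(x), dim=-1)                   # (batch, n_experts)
        expert_out = torch.stack([e(x) for e in self.experts], dim=1)   # (batch, n_experts, n_classes)
        return (weights.unsqueeze(-1) * expert_out).sum(dim=1)          # gated combination

moe = MixtureOfExperts()
features = torch.randn(8, 128)  # stand-in for (language-invariant + language-specific) features
logits = moe(features)
```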
  4. Transformers have been shown to work well for the task of English euphemism disambiguation, in which a potentially euphemistic term (PET) is classified as euphemistic or non-euphemistic in a particular context. In this study, we expand on the task in two ways. First, we annotate PETs for vagueness, a linguistic property associated with euphemisms, and find that transformers are generally better at classifying vague PETs, suggesting linguistic differences in the data that impact performance. Second, we present novel euphemism corpora in three different languages: Yoruba, Spanish, and Mandarin Chinese. We perform euphemism disambiguation experiments in each language using multilingual transformer models mBERT and XLM-RoBERTa, establishing preliminary results from which to launch future work. 
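One plausible way to set up this kind of euphemism disambiguation (a sketch of the general recipe, not the authors' released code) is to fine-tune a multilingual sequence classifier on sentences containing the potentially euphemistic term; the checkpoint and example sentence are illustrative.

```python
# Sketch: binary euphemism disambiguation as multilingual sequence
# classification (0 = non-euphemistic, 1 = euphemistic in context).
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("xlm-roberta-base")
model = AutoModelForSequenceClassification.from_pretrained(
    "xlm-roberta-base", num_labels=2)

sentence = "After a long illness, she passed away last spring."  # PET: "passed away"
inputs = tokenizer(sentence, return_tensors="pt", truncation=True)
with torch.no_grad():
    logits = model(**inputs).logits
prediction = logits.argmax(dim=-1).item()
# Meaningful predictions require fine-tuning first on labeled
# (sentence, PET, euphemistic/non-euphemistic) examples with cross-entropy.
```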
  5. We investigate how Multilingual BERT (mBERT) encodes grammar by examining how the high-order grammatical feature of morphosyntactic alignment (how different languages define what counts as a “subject”) is manifested across the embedding spaces of different languages. To understand if and how morphosyntactic alignment affects contextual embedding spaces, we train classifiers to recover the subjecthood of mBERT embeddings in transitive sentences (which do not contain overt information about morphosyntactic alignment) and then evaluate them zero-shot on intransitive sentences (where subjecthood classification depends on alignment), within and across languages. We find that the resulting classifier distributions reflect the morphosyntactic alignment of their training languages. Our results demonstrate that mBERT representations are influenced by high-level grammatical features that are not manifested in any one input sentence, and that this is robust across languages. Further examining the characteristics that our classifiers rely on, we find that features such as passive voice, animacy and case strongly correlate with classification decisions, suggesting that mBERT does not encode subjecthood purely syntactically, but that subjecthood embedding is continuous and dependent on semantic and discourse factors, as is proposed in much of the functional linguistics literature. Together, these results provide insight into how grammatical features manifest in contextual embedding spaces, at a level of abstraction not covered by previous work.
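A minimal sketch of such a probing setup, assuming a simple first-subword word-to-token alignment and a scikit-learn logistic regression as the probe (both assumptions made for illustration):

```python
# Sketch of a subjecthood probe: contextual mBERT embeddings of argument
# tokens from transitive sentences train a classifier that is then applied
# unchanged (zero-shot) to the sole argument of intransitive sentences.
import torch
from sklearn.linear_model import LogisticRegression
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-multilingual-cased")
mbert = AutoModel.from_pretrained("bert-base-multilingual-cased")

def word_embedding(sentence, word_index):
    """Embedding of one word, naively assuming one subword per word."""
    enc = tokenizer(sentence, return_tensors="pt")
    with torch.no_grad():
        hidden = mbert(**enc).last_hidden_state[0]
    return hidden[word_index + 1].numpy()  # +1 skips the [CLS] token

# Toy transitive training data: (sentence, word position, 1 = subject / 0 = object).
train = [("The dog chased the cat", 1, 1), ("The dog chased the cat", 4, 0)]
X = [word_embedding(s, i) for s, i, _ in train]
y = [label for _, _, label in train]
probe = LogisticRegression(max_iter=1000).fit(X, y)

# Zero-shot transfer: score the single argument of an intransitive sentence.
subject_prob = probe.predict_proba([word_embedding("The dog slept", 1)])[0, 1]
```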