Transformer-based cynical expression detection in a corpus of Spanish YouTube reviews

Samuel González-López, Steven Bethard

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Consumers of services and products exhibit a wide range of behaviors on social networks when they are dissatisfied. In this paper, we consider three types of cynical expressions – negative feelings, specific reasons, and attitude of being right – and annotate a corpus of 3189 comments in Spanish on car analysis channels from YouTube. We evaluate both token classification and text classification settings for this problem, and compare performance of different pre-trained models including BETO, SpanBERTa, Multilingual Bert, and RoBERTuito. The results show that models achieve performance above 0.8 F1 for all types of cynical expressions in the text classification setting, but achieve lower performance (around 0.6-0.7 F1) for the harder token classification setting.

Original languageEnglish (US)
Title of host publicationWASSA 2023 - 13th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, Proceedings of the Workshop
EditorsJeremy Barnes, Orphee De Clercq, Roman Klinger
PublisherAssociation for Computational Linguistics (ACL)
Pages194-201
Number of pages8
ISBN (Electronic)9781959429876
DOIs
StatePublished - 2023
Event13th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, WASSA 2023 - Toronto, Canada
Duration: Jul 14 2023 → …

Publication series

NameProceedings of the Annual Meeting of the Association for Computational Linguistics
ISSN (Print)0736-587X

Conference

Conference13th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, WASSA 2023
Country/TerritoryCanada
CityToronto
Period7/14/23 → …

ASJC Scopus subject areas

  • Computer Science Applications
  • Linguistics and Language
  • Language and Linguistics

Fingerprint

Dive into the research topics of 'Transformer-based cynical expression detection in a corpus of Spanish YouTube reviews'. Together they form a unique fingerprint.

Cite this