TY - GEN
T1 - Deep Learning for Hate Speech Detection
T2 - 33rd ACM Web Conference, WWW 2024
AU - Lee, Kyuhan
AU - Ram, Sudha
N1 - Publisher Copyright:
© 2024 Copyright is held by the owner/author(s).
PY - 2024/5/13
Y1 - 2024/5/13
N2 - A crucial element in the combat against hate speech is the development of efficient algorithms for automatically detecting hate speech. Previous research, however, has primarily neglected important insights from the field of psychology literature, particularly the relationship between personality and hate, resulting in suboptimal performance in hate speech detection. To this end, we propose a novel framework for detecting hate speech focusing on people’s personality factors reflected in their writing. Our framework has two components: (i) a knowledge distillation model for fully automating the process of personality inference from text and (ii) a personality-based deep learning model for hate speech detection. Our approach is unique in that it incorporates low-level personality factors, which have been largely neglected in prior literature, into automated hate speech detection and proposes novel deep learning components for fully exploiting the intricate relationship between personality and hate (i.e., intermediate personality factors). The evaluation shows that our model significantly outperforms state-of-the-art baselines. Our study paves the way for future research by incorporating personality aspects into the design of automated hate speech detection. In addition, it offers substantial assistance to online social platforms and governmental authorities facing challenges in effectively moderating hate speech.
AB - A crucial element in the combat against hate speech is the development of efficient algorithms for automatically detecting hate speech. Previous research, however, has primarily neglected important insights from the field of psychology literature, particularly the relationship between personality and hate, resulting in suboptimal performance in hate speech detection. To this end, we propose a novel framework for detecting hate speech focusing on people’s personality factors reflected in their writing. Our framework has two components: (i) a knowledge distillation model for fully automating the process of personality inference from text and (ii) a personality-based deep learning model for hate speech detection. Our approach is unique in that it incorporates low-level personality factors, which have been largely neglected in prior literature, into automated hate speech detection and proposes novel deep learning components for fully exploiting the intricate relationship between personality and hate (i.e., intermediate personality factors). The evaluation shows that our model significantly outperforms state-of-the-art baselines. Our study paves the way for future research by incorporating personality aspects into the design of automated hate speech detection. In addition, it offers substantial assistance to online social platforms and governmental authorities facing challenges in effectively moderating hate speech.
KW - deep learning
KW - hate speech detection
KW - personality
UR - http://www.scopus.com/inward/record.url?scp=85194492717&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85194492717&partnerID=8YFLogxK
U2 - 10.1145/3589335.3652502
DO - 10.1145/3589335.3652502
M3 - Conference contribution
AN - SCOPUS:85194492717
T3 - WWW 2024 Companion - Companion Proceedings of the ACM Web Conference
SP - 1667
EP - 1671
BT - WWW 2024 Companion - Companion Proceedings of the ACM Web Conference
PB - Association for Computing Machinery, Inc
Y2 - 13 May 2024 through 17 May 2024
ER -