Aspect Based Multilabel Text Classification for Identifying Dangerous Speech Twitter Text

Yulian, Findawati and Kresna Adhi, Pramana and Agus Budi, Raharjo and Totok, Wahyu Abadi and Diana, Purwitasari (2022) Aspect Based Multilabel Text Classification for Identifying Dangerous Speech Twitter Text. International Conference of Information and Communication Technology (ICoICT).

[img] Text
Aspect Based Multilabel Text Classification for Identifying Dangerous Speech Twitter Text-1.pdf

Download (1MB)

Abstract

As part of hate speech, dangerous speech is any expression that can increase the risk of committing violence against other people. So far, hate speech research only explains whether some sentences is categorized as hate speech. It does not explain aspects of the sentences that make them called dangerous speech. Aspects of dangerous speech are social context, historical context, dehumanization, the accusation in the mirror, women and children attack, loyalty to the group, and group threat. This study uses the multi-label text classification method to determine dangerous speeches on Twitter texts based on seven aspects. Then, we assign a weighted score from those aspects to differentiate dangerous and hate speech. Based on the test results show the best performance is the Naive Bayes method with label-based subset accuracy (±36%), instance-based (average) accuracy (±86%) and classification accuracy (±77%). However, even though Naive Bayes has the best performance in terms of instance based (average) accuracy, the average difference between all methods with Naive Bayes is only ± 0.014, this indicates that other methods also produce quite good performance.

Item Type: Article
Subjects: H Social Sciences > H Social Sciences (General)
Divisions: Faculty of Social and Political Science > School of Communication Science
Depositing User: Totok Wahyu Abadi
Date Deposited: 15 Jun 2023 10:01
Last Modified: 15 Jun 2023 10:01
URI: http://eprints.umsida.ac.id/id/eprint/12004

Actions (login required)

View Item View Item