Hate Speech and Offensive Language Detection in Twitter Data Using Machine Learning Classifiers

Shah, Seyed Muzaffar Ahmad; Singh, Satwinder

Hate Speech and Offensive Language Detection in Twitter Data Using Machine Learning Classifiers

dc.contributor.author	Shah, Seyed Muzaffar Ahmad
dc.contributor.author	Singh, Satwinder
dc.date.accessioned	2024-01-21T10:48:41Z
dc.date.accessioned	2024-08-14T05:05:35Z
dc.date.available	2024-01-21T10:48:41Z
dc.date.available	2024-08-14T05:05:35Z
dc.date.issued	2023-05-03T00:00:00
dc.description.abstract	Social media is rapidly growing in popularity and has its advantages and disadvantages. Users posting their daily updates and opinions on social media may inadvertently hurt the feelings of others. Detecting hate speech and harmful information on social media is critical these days, lest it led to calamity. In this research, machine learning classifiers such as Na�ve Bayes, support vector machines, logistic regression, and pre-trained models BERT and RoBERTa, developed by Google and Facebook, respectively, are used to detect hate speech and offensive content from Twitter data on a newly created dataset that included tweets and articles/blogs. The sentiments were obtained using the VADER sentiment analyzer. The results depicted that the pre-trained classifiers outperformed the machine learning classifiers utilized in this study. An accuracy score of 96% and 93% was scored by BERT and RoBERTa, respectively, on the tweet dataset, whereas on a dataset of articles/blogs, accuracy of 97% and 98%, respectively, was achieved by both the classifiers outperforming other classifiers used in this work. Further, it can also be depicted that neutral content is shared more in articles/blogs, hate content is mostly shared equally in both the tweets and article/blogs, whereas offensive content is shared higher in tweets than articles/blogs. � 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.	en_US
dc.identifier.doi	10.1007/978-981-19-7455-7_17
dc.identifier.isbn	9789811974540
dc.identifier.issn	23673370
dc.identifier.uri	https://kr.cup.edu.in/handle/32116/3921
dc.identifier.url	https://link.springer.com/10.1007/978-981-19-7455-7_17
dc.language.iso	en_US	en_US
dc.publisher	Springer Science and Business Media Deutschland GmbH	en_US
dc.subject	BERT	en_US
dc.subject	Hate speech	en_US
dc.subject	Offensive language	en_US
dc.subject	RoBERTa	en_US
dc.subject	Tweets	en_US
dc.subject	VADER	en_US
dc.title	Hate Speech and Offensive Language Detection in Twitter Data Using Machine Learning Classifiers	en_US
dc.title.journal	Lecture Notes in Networks and Systems	en_US
dc.type	Conference paper	en_US
dc.type.accesstype	Closed Access	en_US

Collections

Computer Science And Technology - Research Publications

Hate Speech and Offensive Language Detection in Twitter Data Using Machine Learning Classifiers

Files

Collections