Department Of Computer Science And Technology

Permanent URI for this communityhttps://kr.cup.edu.in/handle/32116/79

Browse

Search Results

Now showing 1 - 2 of 2
  • Item
    Hate Speech and Offensive Language Detection in Twitter Data Using Machine Learning Classifiers
    (Springer Science and Business Media Deutschland GmbH, 2023-05-03T00:00:00) Shah, Seyed Muzaffar Ahmad; Singh, Satwinder
    Social media is rapidly growing in popularity and has its advantages and disadvantages. Users posting their daily updates and opinions on social media may inadvertently hurt the feelings of others. Detecting hate speech and harmful information on social media is critical these days, lest it led to calamity. In this research, machine learning classifiers such as Na�ve Bayes, support vector machines, logistic regression, and pre-trained models BERT and RoBERTa, developed by Google and Facebook, respectively, are used to detect hate speech and offensive content from Twitter data on a newly created dataset that included tweets and articles/blogs. The sentiments were obtained using the VADER sentiment analyzer. The results depicted that the pre-trained classifiers outperformed the machine learning classifiers utilized in this study. An accuracy score of 96% and 93% was scored by BERT and RoBERTa, respectively, on the tweet dataset, whereas on a dataset of articles/blogs, accuracy of 97% and 98%, respectively, was achieved by both the classifiers outperforming other classifiers used in this work. Further, it can also be depicted that neutral content is shared more in articles/blogs, hate content is mostly shared equally in both the tweets and article/blogs, whereas offensive content is shared higher in tweets than articles/blogs. � 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
  • Item
    Comparison of Public and Critics Opinion About the Taliban Government Over Afghanistan Through Sentiment Analysis
    (Springer Science and Business Media Deutschland GmbH, 2023-05-03T00:00:00) Reza, Md Majid; Singh, Satwinder; Kundra, Harish; Reza, Md Rashid
    The usage of social media has increased exponentially these days. People worldwide are sharing their opinions on different platforms such as Twitter, personal blogs, Facebook, and other similar platforms. Twitter has grown in popularity as a platform for people to express their thoughts and opinions on many different topics. The data from Twitter about the Taliban has been examined in this research work, and various machine learning algorithms have been applied including SVM, LR, and random forest. Text sentiments have been captured via TextBlob. Among the machine learning models applied, SVM outperformed all other models and achieved an accuracy score of around 94% on the tweet dataset and logistic regression outperformed other models with an accuracy score of 83% on the news article dataset. � 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.