Department Of Computer Science And Technology

Permanent URI for this communityhttps://kr.cup.edu.in/handle/32116/79

Browse

Search Results

Now showing 1 - 2 of 2
  • Item
    Semi-supervised labeling: a proposed methodology for labeling the twitter datasets
    (Springer, 2022-01-28T00:00:00) Jan, Tabassum Gull; Khurana, Surinder Singh; Kumar, Munish
    Twitter has nowadays become a trending microblogging and social media platform for news and discussions. Since the dramatic increase in its platform has additionally set off a dramatic increase in spam utilization in this platform. For Supervised machine learning, one always finds a need to have a labeled dataset of Twitter. It is desirable to design a semi-supervised labeling technique for labeling newly prepared recent datasets. To prepare the labeled dataset lot of human affords are required. This issue has motivated us to propose an efficient approach for preparing labeled datasets so that time can be saved and human errors can be avoided. Our proposed approach relies on readily available features in real-time for better performance and wider applicability. This work aims at collecting the most recent tweets of a user using Twitter streaming and prepare a recent dataset of Twitter. Finally, a semi-supervised machine learning algorithm based on the self-training technique was designed for labeling the tweets. Semi-supervised support vector machine and semi-supervised decision tree classifiers were used as base classifiers in the self-training technique. Further, the authors have applied K means clustering algorithm to the tweets based on the tweet content. The principled novel approach is an ensemble of semi-supervised and unsupervised learning wherein it was found that semi-supervised algorithms are more accurate in prediction than unsupervised ones. To effectively assign the labels to the tweets, authors have implemented the concept of voting in this novel approach and the label pre-directed by the majority voting classifier is the actual label assigned to the tweet dataset. Maximum accuracy of 99.0% has been reported in this paper using a majority voting classifier for spam labeling. � 2022, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
  • Item
    Deep Ensemble Approach for COVID-19 Fake News Detection from Social Media
    (Institute of Electrical and Electronics Engineers Inc., 2021-10-20T00:00:00) Priya, Anu; Kumar, Abhinav
    Social media networks such as Facebook and Twitter are overwhelmed with COVID-19-related posts during the outbreak. People have also posted several fake news among the massive COVID-19-related social media posts. Fake news has the potential to create public fear, weaken government credibility, and pose a serious threat to social order. This paper provides a deep ensemble-based method for detecting COVID-19 fake news. An ensemble classifier is made up of three different classifiers: Support Vector Machine, Dense Neural Network, and Convolutional Neural Network. The extensive experiments with the proposed ensemble model and eight different conventional machine learning classifiers are carried out using the character and word n-gram TF-IDF features. The results of the experiments show that character n-gram features outperform word n-gram features. The proposed deep ensemble classifier performed better, with a weighted Fl -score of 0.97 in contrast to numerous conventional machine learning classifiers and deep learning classifiers. � 2021 IEEE