Identifying Multiple Categories for News Headlines using Text Analysis

Main Article Content

Shilpa Patil
V Lokesha
Anuradha S G

Abstract

As entire world is witnessing this digital transformation giving rise to change in every aspect of human life. At Present, news classification is one of the valuable tools to improve the accessibility, and efficiency of news articles, due to the availability of huge digital data demanding accurate classification for users easy. The focus of our article is to build a context classifier for the news categories and evaluate the sentiments of text-based headlines. The Experimental study is carried out on 15 News categories available in the ‘News_Category_Dataset_v2’ dataset from Kaggle. The results of our study reveal the sentiment of the text associated with these news categories using a polarity index, with values ranging from positive and negative.

Article Details

How to Cite
1.
Patil S, V Lokesha, Anuradha S G. Identifying Multiple Categories for News Headlines using Text Analysis. J. Int. Acad. Phys. Sci. [Internet]. 2024 Mar. 15 [cited 2024 May 3];28(1):71-83. Available from: https://www.iaps.org.in/journal/index.php/journaliaps/article/view/972
Section
Articles

References

S. Taj, B.B. Shaikh and A.F. Meghji; Sentiment analysis of news articles: a lexicon-based approach. In 2019 2nd international conference on computing, mathematics and engineering technologies (iCoMET), IEEE (2019) 1-5.

N. Isnaini, M.S. Mubarok and M.Y.A. Bakar; A multi-label classification on topics of Indonesian news using K-Nearest Neighbor. In Journal of Physics: Conference Series, IOP Publishing, 1192-1 (2019), 012027.

O. Fuks; Classification of News Dataset, Standford University, 2018.

F. Fanny, Y. Muliono and F. Tanzil; A comparison of text classification methods k-NN, Naïve Bayes, and support vector machine for news classification, Jurnal Informatika: Jurnal Pengembangan IT, 3-2 (2018), 157-160.

R Katari and M B Myneni; A survey on news classification techniques. In 2020 International Conference on Computer Science, Engineering and Applications (ICCSEA) IEEE, 2020.

J.T. Pintas, L.A.F. Fernandes and A.C.B. Garcia; Feature selection methods for text classification: a systematic literature review, Artif Intell Rev, 54 (2021), 6149–6200.

D.Singh and S Malhotra; Intra News Category Classification using N-gram TF-IDF Features and Decision Tree Classifier, IJSART, 4 (2018), 508-514.

F.A. Pozzi, E. Fersini, E. Messina and B. Liu; Challenges of sentiment analysis in social networks: an overview, Sentiment analysis in social networks, (2017) 1-11.

P. Harjule, A. Gurjar, H. Seth and P. Thakur; Text classification on Twitter data. In 2020 3rd International Conference on Emerging Technologies in Computer Engineering: Machine Learning and Internet of Things (ICETCE) IEEE (2020) 160-164.

T. Sabri, O. El Beggar and M. Kissi; Comparative study of Arabic text classification using feature vectorization methods, Procedia Computer Science, 198 (2022), 269-275.

Y. HaCohen-Kerner, D. Miller and Y. Yigal; The influence of preprocessing on text classification using a bag-of-words representation, PloS one, 15-5 (2020), e0232525.

Y. Wang and L. Zhu; Research on improved text classification method based on combined weighted model, Concurrency and Computation: Practice and Experience, 32-6 (2020), e5140.