Decoding the Language of Deception: A Textual Analysis of Fraud Trends in News Media with Advanced NLP Technique

Authors

  • Nugroho Petronas Carigali Ketapang II Ltd (PCK2L)

DOI:

https://doi.org/10.21532/apfjournal.v9i2.333

Keywords:

Natural Language Processing (NLP), Fraud Detection, Sentiment Analysis, News Media Analysis, Topic Modelling

Abstract

This research aims to unveil the extent and nature of deception in news media outlets, employing advanced natural language processing (NLP) techniques. NLP has reached an exciting point in its development, which has made it a key tool for analyzing large scale textual data across industries. As a result, fraud detection has become one of the most important use cases of NLP because it can reveal hidden patterns and linguistic indicators. The traditional analysis of fraud is usually based on financial and transactional data, but the analysis of text provides a new view to the way language affects the perception of trustworthiness and deceit. Through integrating various methodologies such as sentiment analysis, topic modeling, and named entity recognition, the investigation meticulously analyzes news articles to identify subtle linguistic indicators of fraudulent behavior. Utilizing specialized NLP software packages like VADER, spaCy, Gensim, and NLTK, the study effectively detects intricate patterns in the representation of fraud within media narratives and monitors shifts in public attitudes towards deceitful actions. The results reveal distinct linguistic patterns and trends in the portrayal of fraud, offering novel insights into the media’s role in shaping public perception of deception. These findings provide significant contributions to the theoretical framework of AI-driven news analysis and have practical implications for journalism and policymaking. This research not only sets new benchmarks in media monitoring and analysis by merging computational linguistics with fraud detection techniques but also plays a crucial role in enhancing the integrity of information dissemination and fostering a more accurately informed public sphere.

References

Fisher, I. E., Garnsey, M. R., and Hughes, M. E. (2016) Natural Language Processing in Accounting, Auditing and Finance: A Synthesis of the Literature with a Roadmap for Future Research. Intelligent Systems in Accounting, Finance and Management, 23(3), 157-214. https://doi.org/10.1002/isaf.1386

García-Méndez, S., de Arriba-Pérez, F., Barros-Vila, A. González-Castaño, F. J., Costa-Montenegro, E. (2023). Automatic Detection of Relevant Information, Predictions and Forecasts in Financial News Through Topic Modelling with Latent Dirichlet Allocation. Applied Intelligence: The International Journal of Research on Intelligent Systems for Real Life Complex Problems, 53, 19610–19628. https://doi.org/10.1007/s10489-023-04452-4.

Goel, S., Gangolly, J., Faerman, S R., & Uzuner, Ö. (2010, January 1). Can Linguistic Predictors Detect Fraudulent Financial Filings?. Journal of Emerging Technologies in Accounting, 7(1), 25-46. https://doi.org/10.2308/jeta.2010.7.1.25.

Hajek, P., Henriques, R. (2017) Mining Corporate Annual Reports for Intelligent Detection of Financial Statement Fraud–A Comparative Study of Machine Learning Methods. Knowledge-Based Systems, 128, 139–152. https://doi.org/10.1016/j.knosys.2017.05.001.

Jurgovsky, J., Granitzer, M., Ziegler, K., Calabretto, S., Portier, P. E., He-Guelton, L., Caelen, O. (2018) Sequence Classification For Credit-Card Fraud Detection. Expert Systems with Applications, 100, 234-245.

Khurana, D., Koli, A., Khatter, K., Singh, S. (2022). Natural Language Processing: State of The Art, Current Trends and Challenges. Multimedia Tools and Applications, 82(6), 1-25.

Mayopu, R. G., Wang, Y. -Y., & Chen, L. -S. (2023). Analyzing Online Fake News Using Latent Semantic Analysis: Case of USA Election Campaign. Big Data and Cognitive Computing, 7(2), 1-19. https://doi.org/10.3390/bdcc7020081.

Minhas, S., and Hussain, A. (2016). From spin to swindle: identifying falsification in financial text. Cognitive Computation, 8(4), 729-745. https://doi.org/10.1007/s12559-016-9413-9.

Nomoto, T. (2023). Keyword Extraction: A Modern Perspective. SN Computer Science, 4, 1-19. https://doi.org/10.1007/s42979-022-01481-7.

Turing, A. M. (1950). Computing Machinery and Intelligence. Mind, 59(236), 433–460, https://doi.org/10.1093/mind/LIX.236.433.

Downloads

Published

2024-12-12

How to Cite

Nugroho. (2024). Decoding the Language of Deception: A Textual Analysis of Fraud Trends in News Media with Advanced NLP Technique. Asia Pacific Fraud Journal, 9(2), 225–239. https://doi.org/10.21532/apfjournal.v9i2.333