Combatting Falsehoods and Discriminatory Speech with NLP and ML Techniques

Dr. Pawan Whig; Veeramani Ganesan; Srinivas Venkata

Combatting Falsehoods and Discriminatory Speech with NLP and ML Techniques

Authors

Dr. Pawan Whig Dean Research VIPS-TC
Veeramani Ganesan Software engineer, Edison, New Jersey, USA
Srinivas Venkata STAFF DATA ENGINEER Informatica IICS-Integration Technology at Teradata

Keywords:

fake news, hate specch

Abstract

The rise of fake news and hate speech in the digital era has been a major challenge to the integrity of information and the well-being of society. This paper explores the use of machine learning and natural language processing (NLP) to detect and prevent the spread of false information and discriminatory speech. We review recent advances in NLP and ML algorithms for detecting fake news and hate speech, including text classification, sentiment analysis, and representation learning. Additionally, we present case studies and real-world applications of these techniques, highlighting their strengths and limitations. The goal of this paper is to provide a comprehensive overview of the current state-of-the-art in NLP and ML for combating fake news and hate speech, and to inspire further research in this important area.

References

T. Fritz and A. Klingler, “The d-Separation Criterion in Categorical Probability,” 2023. [Online]. Available: http://jmlr.org/papers/v24/22-0916.html.

P. Whig, A. Velu, and R. R. Naddikatu, “The Economic Impact of AI-Enabled Blockchain in 6G-Based Industry,” in AI and Blockchain Technology in 6G Wireless Network, Springer, Singapore, 2022, pp. 205–224.

Y. Alkali, I. Routray, and P. Whig, “Strategy for Reliable, Efficient and Secure IoT Using Artificial Intelligence.,” IUP Journal of Computer Sciences, vol. 16, no. 2, 2022.

P. Whig, A. Velu, and P. Sharma, “Demystifying Federated Learning for Blockchain: A Case Study,” in Demystifying Federated Learning for Blockchain and Industrial Internet of Things, IGI Global, 2022, pp. 143–165.

P. Whig, S. Kouser, A. Velu, and R. R. Nadikattu, “Fog-IoT-Assisted-Based Smart Agriculture Application,” in Demystifying Federated Learning for Blockchain and Industrial Internet of Things, IGI Global, 2022, pp. 74–93.

P. Whig, A. Velu, and R. Ready, “Demystifying Federated Learning in Artificial Intelligence With Human-Computer Interaction,” in Demystifying Federated Learning for Blockchain and Industrial Internet of Things, IGI Global, 2022, pp. 94–122.

P. Whig, A. Velu, and A. B. Bhatia, “Protect Nature and Reduce the Carbon Footprint With an Application of Blockchain for IIoT,” in Demystifying Federated Learning for Blockchain and Industrial Internet of Things, IGI Global, 2022, pp. 123–142.

P. Whig, A. Velu, and R. R. Nadikattu, “Blockchain Platform to Resolve Security Issues in IoT and Smart Networks,” in AI-Enabled Agile Internet of Things for Sustainable FinTech Ecosystems, IGI Global, 2022, pp. 46–65.

H. Jupalle, S. Kouser, A. B. Bhatia, N. Alam, R. R. Nadikattu, and P. Whig, “Automation of human behaviors and its prediction using machine learning,” Microsystem Technologies, pp. 1–9, 2022.

M. Anand, A. Velu, and P. Whig, “Prediction of Loan Behaviour with Machine Learning Models for Secure Banking,” Journal of Computer Science and Engineering (JCSE), vol. 3, no. 1, pp. 1–13, 2022.

G. Chopra and P. WHIG, “Using machine learning algorithms classified depressed patients and normal people,” International Journal of Machine Learning for Sustainable Development, vol. 4, no. 1, pp. 31–40, 2022.

A. Velu and P. Whig, “Studying the impact of the COVID vaccination on the world using data analytics,” Vivekananda J Res, vol. 10, no. 1, pp. 147–160, 2022.

Y. Khera, P. Whig, and A. Velu, “efficient effective and secured electronic billing system using AI,” Vivekananda Journal of Research, vol. 10, pp. 53–60, 2021.

Alqahtani, S., & Mahmood, T. (2019). A survey on fake news: history, detection approaches, and opportunities. Journal of Information Science, 45(3), 306-328.

Badjatiya, P., Gupta, D., Gupta, P., & Varma, V. (2017). Deep learning for hate speech detection in tweets. Proceedings of the 26th International Conference on World Wide Web Companion, 759-760.

Buntain, C., & Golbeck, J. (2017). Automatically identifying fake news in popular Twitter threads. Proceedings of the Tenth International Conference on Web and Social Media, 2-11.

Burnap, P., Williams, M. L., Sloan, L., Rana, O., & Housley, W. (2015). Tweeting the terror: modelling the social media reaction to the Woolwich terrorist attack. Social Network Analysis and Mining, 5(1), 10.

Cui, W., Chen, Z., Liu, T., & Wang, S. (2019). Exploiting lexicon based approach for fake news detection in social media networks. Journal of Parallel and Distributed Computing, 128, 14-24.

De Sarkar, S., Pal, A., & Bandyopadhyay, S. (2019). A survey of natural language processing techniques for hate speech detection. Journal of Information Science, 45(3), 357-373.

Gaur, M., & Das, A. (2018). Deep learning based techniques for fake news detection. Proceedings of the 4th International Conference on Computing Communication and Automation, 1-4.

Goyal, P., Ferrara, E., & De Choudhury, M. (2018). Online extreme hate speech detection using hatebase and neural network classifiers. Proceedings of the 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 444-451.

Gupta, N., Kumaraguru, P., & Castillo, C. (2018). TweetCred: a real-time credibility assessment of content on Twitter. Proceedings of the 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 617-620.

Imran, M., Castillo, C., Diaz, F., & Vieweg, S. (2016). Processing social media messages in mass emergency: a survey. ACM Computing Surveys, 47(4), 67.

Kshirsagar, A., & Saini, S. (2019). Fake news detection using machine learning and natural language processing. International Journal of Computer Sciences and Engineering, 7(9), 447-451.

Lakkaraju, H., Saha, K., & Veturi, Y. (2017). Interpretable decision sets for accurate and intuitive detection of diverse subgroups. Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 977-986.

Mirza, F., & Zahoor, S. (2018). Detection of fake news using machine learning techniques. Proceedings of the 2018 International Conference on Frontiers of Information Technology, 86-91.

Perez-Rosas, V., Kleinberg, B., Lefevre, A., & Mihalcea, R. (2018). Automatic detection of fake news. Proceedings of the 27th International Conference on Computational Linguistics, 3391-3401.

Saleh, M., Potthast, M., & Hagen, M. (2019). Hierarchical attention networks for fake news detection. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 3428-3438.

Shu, K., Sliva, A., Wang, S., Tang, J., & Liu, H. (2017). Fake news detection on social media: a data mining perspective. ACM SIGKDD Explorations Newsletter, 19(1), 22-36.

Thelwall, M., Stuart, E., & Wilkinson, D. (2019). Information quality discussions in politics on Twitter. Journal of Information Science, 45(3), 329-340.

Wang, W., Chen, Y., & Cao, J. (2020). A novel fake news detection model based on bi-directional LSTM and tree-structured convolutional neural network. Journal of Ambient Intelligence and Humanized Computing, 11(3), 1063-1073.

Waseem, Z., & Hovy, D. (2016). Hateful symbols or hateful people? Predictive features for hate speech detection on Twitter. Proceedings of the NAACL-HLT 2016, 88-93.

A. Kumar, S. Gupta, N. Yathiraju, S. Bose Chakraborty, K. Dodda and D. Verma, "The Novel E-Way of Identifying the Face Mask and To Ware the System in the Crowd Management System," 2023 3rd International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE), Greater Noida, India, 2023, pp. 999-1002, doi: 10.1109/ICACITE57410.2023.10182427.

R. Pandey, S. Saha, N. Yathiraju, I. S. Abdulrahman, R. Nittala and V. Tripathi, "Integration of RFID and Image Processing for Surveillance ABased Security System," 2023 3rd International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE), Greater Noida, India, 2023, pp. 380-384, doi: 10.1109/ICACITE57410.2023.10182987.

M. Nagaraju Naik, A. Kaur, N. Yathiraju, S. Das and K. Pant, "Improved and Accurate Face Mask Detection Using Machine Learning in the Crowded Places," 2023 3rd International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE), Greater Noida, India, 2023, pp. 572-576, doi: 10.1109/ICACITE57410.2023.10182567.

N. Yathiraju, P. Raman, R. Madala, P. Surgonda Patil, A. Kumar and S. Ashwin, "Research and Innovation to Market Development: Artificial Intelligence in Business," 2023 Eighth International Conference on Science Technology Engineering and Mathematics (ICONSTEM), Chennai, India, 2023, pp. 1-6, doi: 10.1109/ICONSTEM56934.2023.10142715.

N. Yathiraju, A. Sankar, S. Sandhiya, S. k. R, S. K and R. S, "Cardiac Disease Prediction for Heart Monitoring using Data Mining Techniques," 2022 International Interdisciplinary Humanitarian Conference for Sustainability (IIHC), Bengaluru, India, 2022, pp. 1282-1287, doi: 10.1109/IIHC55949.2022.10060047.

Yathiraju, N., & Mohapatra, A. (2023). "The Implications of IoT in the Modern Healthcare Industry post COVID-19," International Journal of Smart Sensor and Adhoc Network: Vol. 3: Iss. 4, Article 3.

DOI: 10.47893/IJSSAN.2023.1226

Yathiraju, N., & Dash, B. (2023). Gamification Of E-Wallets With The Use Of Defi Technology-A Revisit To Digitization In Fintech. International Journal of Engineering, Science, 3(1).

Yathiraju, N., & Dash, B. (2023). BIG DATA AND METAVERSE REVOLUTIONIZING THE FUTURISTICFINTECH INDUSTRY,” International Journal of Computer Science & Information Technology(IJCSIT) Vol 15, No.1, 2023. DOI: 10.5121/ijcsit.2023.15101

Ahammad, D. S. H. ., & Yathiraju, D. . (2021). Maternity Risk Prediction Using IOTModule with Wearable Sensor and Deep Learning Based Feature Extraction andClassification Technique. Research Journal of Computer Systems and Engineering,2(1), 40:45

Citation Indices	All	Since 2018
Citation	50854	30996
h-index	28	23
i10-index	119	72

Year	Rate
2024	12.6%
2023	18.3%

Combatting Falsehoods and Discriminatory Speech with NLP and ML Techniques