Abstract:
Religion is one’s relation to what he or she regards as holy, sacred, spiritual, or worthy of especial reverence. Religious extremism is the advocacy of extreme measures over a religion whereas religious extremists are even willing to murder as they provide sanctions for violence in the service of God. Sri Lanka has a tragic history of religious and ethnic extremism and the Easter Sunday attack coordinated by a radical Islamic group that killed over 300 and injured another several hundred can be identified as the recent climax of these events. In this modern information age, it is evident that these radical extremist groups utilize social media for spreading their extreme ideologies due to its free and unregulated nature. If there were a mechanism to even slightly identify the possibility of tragic incidents like Easter Sunday bombing, the 300 souls who had to sacrifice their lives for an unreasonable cause would be still alive happily. In this research, we propose a predictive methodology for identifying any upcoming religious extremism-based threats in Sri Lanka using social media intelligence. We aim to specifically address Sri Lanka’s multi-lingual culture by analyzing all the bilingual social media posts in Sinhala and Tamil languages. A hybrid sentiment analysis methodology consisting of a Machine Learning model and a sentiment lexicon was trained on carefully chosen labelled social media text data and each text was classified as either religious-extreme or not, using Naïve Bayes, SVM, and Random Forest algorithms. When comparing their results, we were able to achieve the best results with the Naïve Bayes algorithm resulting in an accuracy of 81% for Sinhala tweets while Random Forest algorithm resulted in an accuracy of 73% for Tamil tweets proving that social media intelligence can be used to predict religious extremism-based threats.