OpinAIS: An Artificial Immune System-based Framework for Opinion Mining
DOI:
https://doi.org/10.9781/ijimai.2015.333Keywords:
Classification, Artificial Intelligence, Analysis, Evolutionary Algorithm, Machine LearningAbstract
This paper proposes the design of an evolutionary algorithm for building classifiers specifically aimed towards performing classification and sentiment analysis over texts. Moreover, it has properties taken from Artificial Immune Systems, as it tries to resemble biological systems since they are able to discriminate harmful from innocuous bodies (in this case, the analogy could be established with negative and positive texts respectively). A framework, namely OpinAIS, is developed around the evolutionary algorithm, which makes it possible to distribute it as an open-source tool, which enables the scientific community both to extend it and improve it. The framework is evaluated with two different public datasets, the first involving voting records for the US Congress and the second consisting in a Twitter corpus with tweets about different technology brands, which can be polarized either towards positive or negative feelings; comparing the results with alternative machine learning techniques and concluding with encouraging results. Additionally, as the framework is publicly available for download, researchers can replicate the experiments from this paper or propose new ones.Downloads
References
[1] A. Agarwal, B. Xie, I. Vovsha, O. Rambow, and R. Passonneau. Sentiment Analysis of Twitter Data. In Proc. LSM, pages 30-38, 2011.
[2] E. Cambria, T. Mazzocco, and A. Hussain. Application of MultiDimensional Scaling and Artificial Neural Networks for Biologically Inspired Opinion Mining. BICA, 4:41-53, 2013.
[3] E. Cambria, B. Schuller, Y. Xia, and C. Havasi. New Avenues in Opinion Mining and Sentiment Analysis. IEEE Intelligent Systems, 28(2):15-21, 2013.
[4] J. H. Carter. The Immune System as a Model for Classification and Pattern Recognition. Genetic Programming and Evolvable Machines, 7:28-41, 2000.
[5] H. Chen and D. Zimbra. AI and Opinion Mining. IEEE Intelligent Systems, 25(3):74-76, 2010.
[6] H. Cordobés, A. Fernández-Anta, L. F. Chiroque, F. Pérez, T. Redondo, and A. Santos. Graph-based Techniques for Topic Classification of Tweets in Spanish. IJIMAI, 2:31-37, 2014.
[7] S. Dasgupta and V. Ng. Mine the Easy, Classify the Hard: a SemiSupervised Approach to Automatic Sentiment Classification. In Proc. ACL-IJCNLP, pages 701-709, 2009.
[8] L. N. de Castro and J. Timmis. Artificial Immune Systems: A New Computational Intelligence Approach. Springer, 2002.
[9] L. N. de Castro and F. J. Von Zuben. The Clonal Selection Algorithm with Engineering Applications. In Proc. GECCO, pages 36-37, 2000.
[10] L. N. de Castro and F. J. Von Zuben. Learning and Optimization Using the Clonal Selection Principle. IEEE Transactions on Evolutionary Computation, 6:239-251, 2002.
[11] L. Dey and S. M. Haque. Opinion Mining from Noisy Text Data. IJDAR, 12:205-226, 2009.
[12] K. D. Elgert. Immunology - Understanding the Immune System. John Wiley & Sons, Inc., 1996.
[13] T. Fawcett. An Introduction to ROC Analysis. Pattern Recognition Letters, 27:861-874, 2006.
[14] M. Govindarajan. Sentiment Analysis of Movie Reviews using Hybrid Method of Naive Bayes and Genetic Algorithm. IJACR, 3(4):139-145, 2013.
[15] M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten. The WEKA Data Mining Software: An Update. SIGKDD Explorations, 11(1):10-18, 2009.
[16] R. R. S. Jandail, P. Sharma, and C. Agrawal. A Survey on Sentiment Analysis and Opinion Mining: A Need for an Organization and Requirement of a Customer. IJETAE, 4(3):17-24, 2014.
[17] A. N. Jebaseeli and E. Kirubakaran. Genetic Optimized Neural Network Algorithm to Improve Classification Accuracy for Opinion Mining of M-Learning Reviews. IJETTCS, 2(3):345-349, 2013.
[18] J. Khairnar and M. Kinikar. Machine Learning Algorithms for Opinion Mining and Sentiment Classification. IJSRP, 3(6), 2013.
[19] E. Kouloumpis, T. Wilson, and J. Moore. Twitter sentiment analysis: The good the bad and the omg! In Proc. ICWSM, 2011.
[20] S. Leon-Rojas, U. Kirschenmann, and M. Wolpers. We Have No Feelings, We Have Emoticons ;-). In Proc. ICALT, pages 642-646, 2012.
[21] S. Li, Z. Wang, G. Zhou, and S. Y. M. Lee. Semi-Supervised Learning for Imbalanced Sentiment Classification. In Proc. IJCAI, pages 1826-1831, 2011.
[22] B. Liu. Sentiment analysis: A multifaceted problem. IEEE Intelligent Systems, 25(3):76-80, 2010.
[23] B. Liu and L. Zhang. A Survey of Opinion Mining and Sentiment Analysis. In Mining Text Data, pages 415-463. Springer, 2012.
[24] D. Maynard, K. Bontcheva, and D. Rout. Challenges in developing opinion mining tools for social media. In Proc. LREC Workshop, pages 15-22, 2012.
[25] N. Mishra and C. K. Jha. Classification of Opinion Mining Techniques. IJCA, 56(13):1-6, 2012.
[26] A. Pak and P. Paroubek. Twitter as a corpus for sentiment analysis and opinion mining. In Proc. LREC, pages 1320-1326, 2010.
[27] B. Pang and L. Lee. Opinion Mining and Sentiment Analysis. FTIR, 2(1-2):1-135, 2008.
[28] B. Pang, L. Lee, and S. Vaithyanathan. Thumbs Up? Sentiment Classification using Machine Learning Techniques. In Proc. EMNLP, pages 79-86, 2002.
[29] M. F. Porter. An Algorithm for Suffix Stripping. Program, 14(3):130- 137, 1980.
[30] M. A. Potter and K. A. De Jong. The Coevolution of Antibodies for Concept Learning. In Proc. PPSN, pages 530-539, 1998.
[31] M. Puteh, N. Isa, S. Puteh, and N. A. Redzuan. Sentiment Mining of Malay Newspaper (SAMNews) Using Artificial Immune System. In Proc. WCE, 2013.
[32] A. Puurula and S.-H. Myaeng. Integrated Instanceand Class-based Generative Modeling for Text Classification. In Proc. ADCS, pages 66-73, 2013.
[33] Y. Saez, C. Navarro, A. Mochón, and P. Isasi. A System for Personality and Happiness Detection. IJIMAI, 2:7-15, 2014.
[34] N. Samsudin, M. Puteh, A. R. Hamdan, and M. Z. A. Nazri. Is Artificial Immune System Suitable for Opinion Mining. In Proc. DMO, pages 131-136, 2012.
[35] N. Samsudin, M. Puteh, A. R. Hamdan, and M. Z. A. Nazri. Immune Based Feature Selection for Opinion Mining. In Proc. WCE, 2013. [36] J. C. Schlimmer. Concept Acquisition through Representational Adjustment. PhD thesis, University of California, Irvine, 1987.
[37] B. Schuller and T. Knaup. Learning and Knowledge-Based Sentiment Analysis in Movie Review Key Excerpts. In Toward Autonomous, Adaptive, and Context-Aware Multimodal Interfaces. Theoretical and Practical Issues, volume 6456 of LNCS, pages 448-472. Springer, 2011.
[38] N. R. Sharma and V. D. Chitre. Opinion Mining, Analysis and its Challenges. IJIACS, 3(1):59-65, 2014.
[39] G. Stylios, C. D. Katsis, and D. Christodoulakis. Using Bio-inspired intelligence for Web opinion Mining. IJCA, 87(5):36-43, 2014.
[40] S. Tan, X. Cheng, Y. Wang, and H. Xu. Adapting Naive Bayes to Domain Adaptation for Sentiment Analysis. In Advances in Information Retrieval, volume 5478 of LNCS, pages 337-349. Springer, 2009.
[41] J. Twycross and S. Cayzer. An Immune-based Approach to Document Classification. Technical report, HP Laboratories, Filton Road, Stoke Gifford, Bristol U.K., 2002.
[42] G. Vinodhini and R. M. Chandrasekaran. Sentiment Analysis and Opinion Mining: A Survey. IJARCSSE, 2(6), 2012.
[43] A. Watkins, J. Timmis, and L. Boggess. Artificial Immune Recognition System (AIRS): An Immune-Inspired Supervised Learning Algorithm. Genetic Programming and Evolvable Machines, 5:291-317, 2004.
[44] J. Wilbur and K. Sirotkin. The Automatic Identification of Stop Words. JIS, 18:45-55, 1992.
Downloads
Published
-
Abstract32
-
PDF22






