Document Type : Original/Review Paper

Authors

1 Department of Electrical and Computer Engineering, University of Hormozgan, Bandar Abbas, Iran and Deep Learning Research Group, University of Hormozgan, Bandar Abbas, Iran

2 Department of Electrical and Computer Engineering, University of Hormozgan, Bandar Abbas, Iran.

3 Markaz-e Elmi Karbordi Bandar Abbas 1, University of Applied Science and Technology, Farahani Boulevard, Bandar Abbas, 79199_33153, Iran.

Abstract

A Question Answering System (QAS) is a special form of information retrieval which consists of three parts: question processing, information retrieval, and answer selection. Determining the type of question is the most important part of QAS as it affects other following parts. This study uses effective features and ensemble classification to improve the QAS performance by increasing the accuracy of question type identification. We use the gravitational search algorithm to select the features and perform ensemble classification. The proposed system is extensively tested on different datasets using four types of experiments: (1) neither feature selection nor ensemble classification, (2) feature selection without ensemble classification, (3) ensemble classification without feature selection, and (4) feature selection with ensemble classification. These four kinds of experiments are carried out under the differential evolution algorithm and gravitational search algorithm. The experimental results show that the proposed method outperforms compared to state-of-the-art methods in previous researches.

Keywords

[1] M. Ramprasath and S. Hariharan, "A survey on question answering system," International Journal of Research and Reviews in Information Sciences, vol. 2, no. 1, pp. 171-179, 2012.
[2] P. Shafiei and M. Parsa, "Designing question answering system using genetic algorithm," in The 4th Data Mining Conference of Sharif university of Tehran, Tehran, 2010 (in Persian).
[3] M. R. Kangavari, S. Ghandchi, and M. Golpour, "A new model for question answering systems," World Academy of Science, Engineering and Technology, vol. 42, pp. 506-513, 2008.
[4] U. Hermjakob, "Parsing and question classification for question answering," in Proceedings of the ACL 2001 workshop on open-domain question answering, 2001, pp. 1-6.
[5] D. Zhang and W. S. Lee, "Question classification using support vector machines," in Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval, 2003, pp. 26-32.
[6] A. Tahri and O. Tibermacine, "DBPedia based factoid question answering system," International Journal of Web and Semantic Technology, vol. 4, no. 3, pp. 23, 2013.
[7] M. Shamsfard and M. A. Yarmohammadi, "A semantic approach to extract the final answer in SBUQA question answering system," International Journal of Digital Content Technology and its Applications, vol. 4, no. 7, pp. 165-176, 2010.
[8] O. Kolomiyets and M.F.  Moens, "A survey on question answering technology from an information retrieval perspective," Information Sciences, vol. 181, no. 24, pp. 5412-5434, 2011.
[9] A. Mollaei, S. Rahati-Quchani, and A.  Estaji, "Persian Question Classification Accuracy Improvement by Using Question Informer based on Conditional Random Fields Model," in the 2nd International eConference on Computer and Knowledge Engineering (ICCKE), 2012, pp. 295-300.
[10] B. Liu, Z. Hao, X. Yang, and X. Lin, "Chinese question classification with support vector machine," IJCSNS International Journal of Computer Science and Network Security, vol. 6, pp. 231-240, 2006.
[11] W. Zhang, J. Chen, and Y. Niu, "Research on Chinese question classification based on hownet and dependency parsing," in 2011 3rd International Workshop on Intelligent Systems and Applications, 2011, pp. 1-4.
[12] A. A. Aburomman and M. B. I. Reaz, "A novel SVM-kNN-PSO ensemble method for intrusion detection system," Applied Soft Computing, vol. 38, pp. 360-372, 2016.
[13] I. Syarif, E. Zaluska, A. Prugel-Bennett, and G. Wills, "Application of bagging, boosting and stacking to intrusion detection," in International Workshop on Machine Learning and Data Mining in Pattern Recognition, 2012, pp. 593-602.
[14] E. Bahri, N. Harbi, and H. N. Huu, "Approach based ensemble methods for better and faster intrusion detection," in Computational Intelligence in Security for Information Systems, 2011, pp. 17-24.
[15] A. Ekbal and S. Saha, "Weighted vote based classifier ensemble selection using genetic algorithm for named entity recognition," in International Conference on Application of Natural Language to Information Systems, 2010, pp. 256-267.
[16] A. Ekbal and S. Saha, "Weighted vote-based classifier ensemble for named entity recognition: A genetic algorithm-based approach," ACM Transactions on Asian Language Information Processing (TALIP), vol. 10, no. 2, pp 1-37, 2011.
[17] Z. Shojaee, S. A. Shahzadeh Fazeli, E. Abbasi, and F. Adibnia. "Feature Selection based on Particle Swarm Optimization and Mutual Information." Journal of AI and Data Mining, vol. 9, no. 1, pp.39-44, 2021.
[18] K. H. Nabavi and A. Kabir, "Ensemble classification: creating diversity and combination rules," Scientific Journal of Iran Computer Association, vol. 3, no. 3, 2005 (in Persian).
[19] E. Rashedi, H. Nezamabadi-Pour, and S. Saryazdi, "GSA: a gravitational search algorithm," Information sciences, vol. 179, no. 13, pp. 2232-2248, 2009.
[20] K. Hacioglu and W. Ward, "Question classification with support vector machines and error correcting codes," in Companion Volume of the Proceedings of HLT-NAACL 2003-Short Papers, 2003, pp. 28-30.
[21] X. Li and D. Roth, "Learning question classifiers: the role of semantic information," Natural Language Engineering, vol. 12, no. 3, pp. 229-249, 2006.
[22] A. A. Yahya and A. Osman, "Automatic classification of questions into Bloom's cognitive levels using support vector machines," in The International Arab Conference on Information Technology, Naif Arab University for Security Science (NAUSS), 2011, pp. 1-6.
[23] L. Wang, H. Zhang, D. Wang, and J. Huang, "Chinese question classification based on semantic gram and SVM," in 2009 International Forum on Computer Science-Technology and Applications, 2009,
vol. 1, pp. 432-435.
[24] P. Blunsom, K. Koicik, J. R. Curran, "Question classification with log-linear models," in Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, 2006, pp. 615-6616.
[25] S. K. Ray, S. Singh, and B. P. Joshi, "A semantic approach for question classification using WorldNet and Wikipedia," Pattern Recognition letters, vol. 31, no. 13, pp. 1935-1943, 2010.
[26] X. Li and D. Roth, "Learning question classifiers," in COLING 2002: The 19th International Conference on Computational Linguistics, 2002.
[27] Z. Huang, M. Thint, and Z. Qin, "Question classification using head words and their hypernyms," in Proceedings of the 2008 Conference on empirical methods in natural language processing, 2008, pp. 927-936.
[28] M. Mohd and R. Hashmy, "Question classification using a knowledge-based semantic kernel," in Soft Computing: Theories and Applications, 2018: springer, pp. 599–606.
[29] A. Anjomshoaa, H. Nezamabadi-Pour, and H. Sanatnama, "Filtering of spam emails from valid emails using two methods of feature selection," in The First Iranian Conference on Pattern Recognition and Image Analysis, Birjand, 2012 (in Persian).
[30] M. Ganji, A. Yousef, N. Charkari, and M. Saniee-Abadeh, "Predicting protein interaction based on colonial competition algorithm," in 7th International Conference on Information Technology and Knowledge, no. 10, 2015 (in Persian).
[31] Y. Ghanbari, A. Harifi, and S. Ghanbari, "Recognition of Persian Manuscript Numbers Using Gabor Filter, Principal Component Analysis and Ensemble Classification," in 2nd International Conference on Electrical Engineering, no. 10, 2015 (in Persian).
[32] U. K. Sikdar, A. Ekbal, and S. Saha, "MODE: Multiobjective Differential Evolution for feature selection and classifier ensemble," Soft Computing, vol. 19, no. 12, pp. 3529-3549, 2015.
[33] B. Loni, "Enhanced Question Classification with Optimal Combination of Features", M.S. thesis, Dept. Media and Knowledge Eng., Delft Univ., Delft, NL, 2011.
[34] Y. Ghanbari, Sh. Golzari, and S. Doraisamy, "Feature Selection for questions classification using cuckoo evolutionary algorithm," in the Proceedings of Second International Conference on Technology, Communication and Knowledge, ICTCK2015, 2015 (in Persian).
[35] S. Bakhtiar-Ghale-Taki, M. H. Dezfoulian, and M. Mansourizadeh, "Provide a combined method for scoring answers in question answering systems, " in 7th International Conference of Information Technology and Knowledge, Hamadan, 2013, pp. 5-30. (in Persian).
[36] Z. Huang, M. Thint, and Z.  Qin, "Question classification using head words and their hypernyms," in Proceedings of the 2008 Conference on empirical methods in natural language processing, 2008, pp. 927-936.