Document Type : Original/Review Paper

Authors

1 Department of Computer Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran.

2 Content & E-Services Research Group, IT Research Faculty, ICT Research Institute, Tehran, Iran.

3 Department of Computer Engineering, Amirkabir University of Technology, Tehran, Iran.

4 Department of Computer Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran

Abstract

Recently, learning classifier systems are used to control physical robots, sensory robots, and intelligent rescue systems. The most important challenge in these systems, which are models of real environments, is its non-markov quality. Therefore, it is necessary to use memory to store system states in order to make decisions based on a chain of previous states. In this research, a memory-based XCS is proposed to help use more effective rules in classifier by identifying efficient rules. The proposed model was implemented on five important maze maps and led to a reduction in the number of steps to reach the goal and also an increase in the number of successes in reaching the goal in these maps.

Keywords

[1] Lanzi, Pier L. "Learning classifier systems: from foundations to applications.", No. 1813. Springer Science & Business Media, 2000.
 
[2] J. Holland, L. Booker, M. Colombetti, M. Dorigo, D. Goldberg, S. Forrest et al., "What Is a Learning Classifier System?," In Learning Classifier Systems. vol. 1813, Springer Berlin Heidelberg, pp. 3-32, 2000.
[3]           S. W. Wilson, "Classifier fitness based on accuracy," Evol. Comput., vol. 3, pp. 149-175, 1995.
[4] Bernadó-Mansilla, Ester, and Josep M. Garrell-Guiu. "Accuracy-based learning classifier systems: models, analysis and applications to classification tasks." Evolutionary computation, vol .11, no. 3 pp. 209-238, 2003.
[5] J. H. Holmes, P. L. Lanzi, W. Stolzmann, and S. W. Wilson, "Learning classifier systems: New models, successful applications," Information Processing Letters, vol. 82, pp. 23-30, 2002.
[6] M. Shariat Panahi, A. Karkhaneh Yousefi, and M. Khorshidi, "Combining accuracy and success-rate to improve the performance of eXtended Classifier System (XCS) for data-mining and control applications," Engineering Applications of Artificial Intelligence, vol. 26, pp. 1930-1935, 2013.
[7] Irfan, Muhammad et al. "Enhancing learning classifier systems through convolutional autoencoder to classify underwater images." Soft Computing , vol . 25, no . 15, pp. 10423-10440, 2021.
[8] Irfan, Muhammad et al. "Knowledge extraction and retention based continual learning by using convolutional autoencoder-based learning classifier system." Information Sciences 591, pp. 287-305, 2022
[9] Kato, Jefferson Satoshi, and Adriana Sbicca. "Bounded Rationality, Group Formation and the Emergence of Trust: An Agent-Based Economic Model. " Computational Economics, pp. 1-29, 2021.
[10] Liu, Yi. Learning Classifier Systems for Understanding Patterns in Data. Diss. Open Access Te Herenga Waka-Victoria University of Wellington, 2021.
[11]  Hansmeier, Tim, and Marco Platzner. "An experimental comparison of explore/exploit strategies for the learning classifier system XCS." Proceedings of the Genetic and Evolutionary Computation Conference Companion. 2021.
[12] Guendouzi, Wassila, and Abdelmadjid Boukra. "A new differential evolution algorithm for cooperative fuzzy rule mining: application to anomaly detection. " Evolutionary Intelligence, pp. 1-12, 2021.
[13] Hochberger, Christian, Lars Bauer, and Thilo Pionteck. Architecture  of Computing Systems. Springer International Publishing, 2021.
[14] Büttner, Johannes, and Sebastian Von Mammen. "Training a Reinforcement Learning Agent based on XCS in a Competitive Snake Environment." 2021 IEEE Conference on Games (CoG). IEEE, 2021.
[15]        B. Bartin, "Use of learning classifier systems in microscopic toll plaza simulation models," IET Intelligent Transport Systems, vol. 13, pp. 860-869, 2019.
[16]        M. R. Karlsen and S. Moschoyiannis, "Evolution of control with learning classifier systems," Applied network science, vol. 3, pp. 1-36, 2018.
[17]         M. Butz and D. Goldberg, "Generalized State Values in an Anticipatory Learning Classifier System," in Anticipatory Behavior in Adaptive Learning Systems. vol. 2684, M. Butz, O. Sigaud, and P. Gérard, Eds., ed: Springer Berlin Heidelberg, pp. 282-301, 2003.
[18]        M. V. Butz, T. Kovacs, P. L. Lanzi, and S. W. Wilson, "Toward a theory of generalization and learning in XCS," Evolutionary Computation, IEEE Transactions on, vol. 8, pp. 28-46, 2004.
[19]        P. Gérard and O. Sigaud, "YACS: Combining Dynamic Programming with Generalization in Classifier Systems," in Advances in Learning Classifier Systems. vol. 1996, P. Luca Lanzi, W. Stolzmann, and S. Wilson, Eds., ed: Springer Berlin Heidelberg, pp. 52-69, 2001.
[20]        J. H. Holland, "Escaping brittleness: the possibilities of general-purpose learning algorithms applied to parallel rule-based systems," in Computation & intelligence, F. L. George, Ed., ed: American Association for Artificial Intelligence, pp. 275-304, 1995.
[21]        P. L. Lanzi, "An analysis of generalization in the xcs classifier system," Evol. Comput., vol. 7, pp. 125-149, 1999.
[22]        P. L. Lanzi, D. Loiacono, S. W. Wilson, and D. E. Goldberg, "Generalization in the XCSF Classifier System: Analysis, Improvement, and Extension," Evol. Comput., vol. 15, pp. 133-168, 2007.
[23]        M. Iqbal, W. Browne, and M. Zhang, "XCSR with Computed Continuous Action," in AI 2012: Advances in Artificial Intelligence. vol. 7691, M. Thielscher and D. Zhang, Eds., ed: Springer Berlin Heidelberg, pp. 350-361, 2012.
[24]         M. Iqbal, W. N. Browne, and Z. Mengjie, "Reusing Building Blocks of Extracted Knowledge to Solve Complex, Large-Scale Boolean Problems," Evolutionary Computation, IEEE Transactions on, vol. 18, pp. 465-480, 2013.
[25]        F. Freschi and M. Repetto, "Multi-objective optimization by a modified artificial immune system algorithm," presented at the Proceedings of the 4th international conference on Artificial Immune Systems, Banff, Alberta, Canada, 2005.
[26] H. Asadul Rehman, M. Iqbal, I. Younas, and M. Bashir, "Learning Regular Expressions using XCS-based Classifier System," International Journal of Pattern Recognition and Artificial Intelligence, vol. 34, no. 10, p. 2051011, 2019.
[27]        J. Khan, A. Alam, J. Hussain, and Y.-K. Lee, "EnSWF: effective features extraction and selection in conjunction with ensemble learning methods for document sentiment classification," Applied Intelligence, vol. 49, pp. 3123-3145, Aug  2019.
[28]        K. Shafi and H. A. Abbass, "A survey of learning classifier systems in games," IEEE Computational Intelligence Magazine, vol. 12, pp. 42-55, 2017.
[29]        I. M. Alvarez, W. N. Browne, and M. Zhang, "Human-inspired scaling in learning classifier systems: Case study on the n-bit multiplexer problem set," in Proceedings of the Genetic and Evolutionary Computation Conference , pp. 429-436, 2016.
[30]        M. Tubishat, M. A. M. Abushariah, N. Idris, and I. Aljarah, "Improved whale optimization algorithm for feature selection in Arabic sentiment analysis," Applied Intelligence, vol. 49, pp. 1688-1707, May  2019.
[31]         Z. Zang, D. Li, and J. Wang, "Learning classifier systems with memory condition to solve non-Markov problems," Soft Computing, vol. 19, pp. 1679-1699, June 2015.
[32] A. L. Thomaz and C. Breazeal, "Teachable robots: Understanding human teaching behavior to build more effective robot learners," Artificial Intelligence, vol. 172, pp. 716-737, 2008.
[33]        L. M. Saksida, S. M. Raymond, and D. S. Touretzky, "Shaping robot behavior using principles from instrumental conditioning," Robotics and Autonomous Systems, vol. 22, pp. 231-249, 1997.
[34]         M. Dorigo and M. Colombetti, "Robot shaping: developing autonomous agents through learning," Artificial Intelligence, vol. 71, pp. 321-370, 1994.
[35]        D. Cliff and S. Ross, "Adding temporary memory to ZCS," Adapt. Behav., Vol. 3, pp. 101-150, 1994.
[36]        S. Wilson, "Classifier systems and the animat problem," Machine Learning, vol. 2, pp. 199-228, Nov 1987.
[37]         G. E. P. Box, "Evolutionary operation: a method for increasing industrial productivity," Applied statistics : a journal of the Royal Statistical Society, vol. 6, pp. 81-101, 1957.
[38]        L. B. Booker, "Intelligent behavior as an adaptation to the task environment," University of Michigan, 1982.
[39]         L. B. Booker, "Improving the Performance of Genetic Algorithms in Classifier Systems," presented at the Proceedings of the 1st International Conference on Genetic Algorithms, 1985.
[40]        L. B. Booker, "Classifier systems that learn internal world models,"Mach. Lang., vol. 3, pp. 161-192, 1988.
[41]        L. B. Booker, "Triggered Rule Discovery in Classifier Systems," presented at the Proceedings of the 3rd International Conference on Genetic Algorithms, 1989.
[42]        S. W. Wilson, "Zcs: A zeroth level classifier system," Evol. Comput., vol. 2, pp. 1-18, 1994.
[43]        R. S. Sutton and A. G. Barto, "Toward a modern theory of adaptive networks: expectation and prediction," Psychol Rev, vol. 88, pp. 135-70, 1981.
[44]        S. W. Wilson, "Classifiers that approximate functions," vol. 1, pp. 211-234, 2002.
[45]        L. Bull, "Two Simple Learning Classifier Systems," in Foundations of Learning Classifier Systems. Vol. 183, L. Bull and T. Kovacs, Eds., ed: Springer Berlin Heidelberg, pp. 63-89, 2005.
[46]        L. Bull, "A brief history of learning classifier systems: from CS-1 to XCS and its variants," Evolutionary Intelligence, pp. 1-16,  Jan 2015.
[47]        A. Hamzeh, S. Hashemi, A. Sami, and A. Rahmani, "A Recursive Classifier System for Partially Observable Environments," Fundam. Inform., vol. 97, pp. 15-40, 2009.
[48]  Later            A. Hamzeh and A. Rahmani, "A New Architecture for Learning Classifier Systems to Solve POMDP Problems," Fundam. Inform., vol. 84, pp. 329-351, 2008.
[49]         R. Preen and L. Bull, "Discrete and fuzzy dynamical genetic programming in the XCSF learning classifier system," Soft Computing, vol. 18, pp. 153-167, Jan 2014.
[50]        T. Ke, L. Jing, H. Lv, L. Zhang, and Y. Hu, "Global and local learning from positive and unlabeled examples,"Applied Intelligence, vol. 48, pp. 2373-2392, August 2018.
[51]        L. Lu, Q. Lin, H. Pei, and P. Zhong, "The aLS-SVM based multi-task learning classifiers,"Applied Intelligence, vol. 48, pp. 2393-2407, August  2018.
[52]        I. M. Alvarez, W. N. Browne, and M. Zhang, "Compaction for Code Fragment Based Learning Classifier Systems," Cham,  pp. 41-53, 2016.
[53] M. Moradi and J. Hamidzadeh, "Ensemble-based Top-k Recommender System Considering Incomplete Data," Journal of AI and Data Mining, vol. 7, no. 3, pp. 393-402, 2019.