Optimized Information Integration in Data Mining Using Ensemble Classification

Authors

  • C Priya
  • K Kumuthapriya
  • R Sankar
  • Anantha Raman Rathinam
  • M Venkatesan
  • K Jeevitha
  • Mohd Miskeen Ali
  • Dennis Surendar

Keywords:

Data mining, fusion, classification, rule-based mining, maximum entropy

Abstract

Effectively integrating categorisation rules is crucial in data mining to improve predicted consistency and robustness. Traditional methods, including ensemble techniques and weighted rule aggregation, frequently do not maintain the structural integrity of classifier parameters. This study introduces an optimised information integration framework utilising maximum entropy classifiers, wherein classifier fusion is accomplished via the preservation of probabilistic parameters. The method combines non-parametric wave functions such as Dirichlet and Wishart for handling continuous distributions and uses regression analysis statistics across regulated input dimensions for generated classification. The wave parameters are systematically categorised first-order or higher-order populations, facilitating scalable implementation. Fusion is achieved by the multiplication of hyper-distributions, resulting in streamlined assignment formulae that preserve probabilistic attributes. The proposed strategy guarantees the preservation of critical hyper-distributions during integration, allowing their effective application in future organised training phases. This maximum entropy-based fusion methodology improves classifier interoperability and provides a reliable method for optimised information integration in complex data mining contexts.

References

[1]. M. Zhang, H. Song, S. Lv, Y.Li, X.Yu, and J. Bao, “Research on the Multi-sensors Information Fusion Technique Based on the Neural Networks and its Application,” Int. Workshop on Knowledge Discovery and Data Mining, pp. 93–96, 2009.

[2]. B. Verma, and A. Rahman, “Cluster Oriented Ensemble Classifier: Impact of Multi-cluster Characterisation on Ensemble Classifier Learning,” IEEE Tr. on Knowledge and Data Engineering, vol. 24, no. 4, pp. 605–618, 2011.

[3]. X. Ceamanos, B. Waske, J. A. Benediktsson, J. Chanussot, M. Fau- vel, and J. R. Sveinsson, “A Classifier Ensemble Based on Fusion of Support Vector Machines for Classifying Hyperspectral Data,” International Journal of Image and Data Fusion, vol. 1, no. 4, pp. 293–307, 2010.

[4]. P. Gray, A. Preece, N. Fiddian, W. Gray, T. Bench-Capon, M. Shave,N.Azarmi, I. Wiegand, M. Ashwell, M. Beer, Z. Cui, B. Diaz, S. Embury, K. Hui, A.C. Jones, D.M. Jones, G.J.L. Kemp, E.W. Lawson, K. Lunn, P. Marti, J. Shao, P.R.S. Visser, “KRAFT: Knowledge Fusion from Distributed Databases and Knowledge Bases,” in Proceesdings of the 8th International Workshop on Database and Expert Systems Applications, pp. 682–691, 1997.

[5]. K. Ying Hui, and P. Gray, “Constraint and Data Fusion in a Distributed Information System,” in Advances in Databases, ser. Lecture Notes in Computer Science, S. Embury, N. Fiddian,W.Gray, and A. Jones, Eds. Springer Berlin, Heidelberg, vol. 1405, pp. 181–182, 1998.

[6]. K. Ying Hui, “Knowledge Fusion and Constraint Solving in a Distributed Environment,” Ph.D. dissertation, Department of Computing Science, University of Aberdeen, 2000.

[7]. G. Pavlin, P. De Oude, M. Maris, J. Nunnink, and T. Hood, “A Multi-Agent Systems Approach to Distributed Bayesian Information Fusion,” Information Fusion, vol. 11, no. 3, pp. 267–282, 2010

[8]. E. Santos Jr, J. Wilkinson, and E. Santos, “Bayesian Knowledge Fusion,” in Proc. of the 22nd Int. FLAIRS Conf., 2009, pp. 559–564, 2009.

[9]. Y. Wang, B. Wu, and J. Hu, “A Semantic Knowledge Fusion Method Based on Topic Maps,” in Workshop on Intelligent Information Technology Application, pp. 74–76, 2007.

[10]. H. Lu and B. Feng, “An Intelligent Topic Map-Based Approach to Detecting and Resolving Conflicts for Multi-Resource Knowledge Fusion,” Information Technology Journal, vol. 8, no. 8, pp. 1242–1248, 2009.

[11]. A. Smirnov, M. Pashkin, N. Chilov, and T. Levashova, “KSNET- Approach to Knowledge Fusion from Distributed Sources,” Computing and Informatics, vol. 22, no. 2, pp. 105–142, 2003.

[12]. O. Buchtala, and B. Sick, “Techniques for the Fusion of Symbolic Rules in Distributed Organic Systems,” in IEEE Mountain Workshop on Adaptive and Learning Systems, pp. 85-90, 2006.

[13]. G. Prakash, and A. Khan, “Investigate the Role of Impulsivity in Decision Making During Gambling Task: Case Study,” International Journal of MC Square Scientific Research, vol. 4, no. 1, 2013

[14]. S. Murugan, T.R. Ganesh Babu, and C. Srinivasan, “Underwater Object Recognition Using KNN Classifier,” International Journal of MC Square Scientific Research vol. 9, no. 3, pp. 48-52, 2017.

[15]. A. Budai, R. Bock, A. Maier, J. Hornegger, and G. Michelson, “Robust vessel segmentation in fundus images,” International Journal of Biomedical Imaging, vol. 2013, no. 1, pp. 1-9, 2013.

Downloads

Published

30-08-2025

How to Cite

[1]
C. Priya, “Optimized Information Integration in Data Mining Using Ensemble Classification”, Inno. Intell. Syst. Adv. Eng, vol. 1, no. 2, pp. 23–30, Aug. 2025, Accessed: Mar. 16, 2026. [Online]. Available: https://iisae.org/index.php/IISAE/article/view/10

Share