•  
  •  
 

Chemical Technology, Control and Management

Abstract

An analytical method for determining informative sets of features (INP) is developed, taking into account the resource for criteria based on the use of a measure of dispersion of classified objects. The areas of existence of the solution are defined. The statements and properties for the Fischer-type information criterion are proved, using which the proposed analytical method for determining the INP guarantees optimal results in the sense of maximizing the selected functional. The appropriateness of choosing this type of informative criterion is justified. A method for transforming attributes is proposed. The universality of the method in relation to the type of features is shown. An algorithm for implementing this method is given. In addition, the paper discusses the dynamics of the growth of information volume in the world, problems related to big data, as well as problems and tasks of pre-processing data. The relevance of reducing the dimension of the feature space for performing data processing and visualization without unnecessary difficulties is proved. The disadvantages of existing methods and algorithms for selecting an informative set of features are shown.

First Page

57

Last Page

64

References

  1. https://regnum.ru/news/it/2574265.html (data obrasheniya: 15.01.2020)
  2. Jiawei Han, Micheline Kamber, Jian Pei. Data mining : concepts and techniques// 3rd ed. by Elsevier Inc., USA, 2012.
  3. Zamyatin A.V. Intellektualniy analiz dannix//Tomsk : Izd.dom TomGU, 2016.
  4. Jian Long Zhou, Fang Chen. Human and Machine Learning: Visible, Explainable, Trustworthy and Transparent//Springer, Human-Computer interaction Series, 2018, Switzerland, p.482.
  5. Zagoruyko N.G. Prikladnie metodi analiza dannix i znaniy//Novosibirsk: IM SO RAN, 1999, str. 270.
  6. Nasma M. Sovremennie tendensii metodov intellektualnogo analiza dannix: metod klasterizatsii//Moskovskiy ekonomicheskiy jurnal, №6, Rossiya, 2019.
  7. Nishanov A.X., Akbaraliev B.B., Ruzibaev O.B., Xujaev O.K. Sravnitelniy analiz algoritmov na osnove nechetkogo K-srednix s primeneniem razlichnix metrik // Kimyoviy texnologiya, nazorat va boshqaruv, Xalqaro ilmiy-texnikaviy jurnal, 2014 yil, 6-son, 78-82 b.
  8. Kamilov M.M., Nishanov A.H., Akbaraliev B.B. About one clustering algorithm in intellectual data analysis// Proceedings of ICEIC2008, June 24-27, 2008, Tashkent, pp. 476-478.
  9. Kamilov M.M., Nishanov A.H., Akbaraliev B.B. Methods of forming of optimal sign space for object recognition in the class of logic-heuristic algorithms// Fourth World Conference on Intelligent Systems for Industrial Automation - WCIS 2006, Tashkent.
  10. Akbaraliev B.B. Formirovanie informativnix naborov priznakov v slojnix sistemax raspoznavaniya// TATU xabarlari, №2, 2007, 47-50 b.
  11. Raxmanov A.T., Akbaraliev B.B., Ergashev A.K. Ob odnom metode sokrashenie razmernosti ob’ema viborki v intellektualnom analize dannix//“Informatika va Energetika muammolari” O’zbekiston jurnali, 1-son, 2011, 76-79 b.
  12. Sunita Beniwal, Jitender Arora. Classification and Feature Selection Techniques in Data Mining//International Journal of Engineering Research & Technology (IJERT), ISSN: 2278-0181, Vol. 1 Issue 6, August – 2012.
  13. Huiqing Liu, Jinyan Li, Limsoon Wong. A Comparative Study on Feature Selection and Classification Methods Using Gene Expression Profiles and Proteomic Patterns//Genome Informatics 13: 51-60, 2002.
  14. Krasnyanskiy M.N. i dr. Sravnitelniy analiz metodov mashinnogo obucheniya dlya resheniya zadachi klassifikatsii dokumentov nauchno-obrazovatelnogo uchrejdeniya // Vestnik VGU, seriya: Sistemniy analiz i informatsionnie texnologii, 2018, № 3, str.173-182.
  15. Lbov G. S. Metodi obrabotki raznotipnix eksperimentalnix dannix // Novosibirsk: Nauka, Sib.otd., 1981. - 160 s.
  16. Juravlev Yu.I. Izbrannie nauchnie trudi//M: Izdatelstvo Magistr, 1998. – 420s.
  17. Zhang, L., Luo, M., Liu, J., Li, Z., Zheng, Q. Diverse fuzzy c-means for image clustering //Pattern Recognition LettersVolume 130, February 2020, Pages 275-283.
  18. Santra, D., Basu, S.K., Mandal, J.K., Goswami, S. Rough set based lattice structure for knowledge representation in medical expert systems: Low back pain management case study//Expert Systems with Applications Volume 145, 1 May 2020, 113084
  19. Xiong, Y., Zuo, R. Recognizing multivariate geochemical anomalies for mineral exploration by combining deep learning and one-class support vector machine//Computers and Geosciences Volume 140, July 2020, 104484
  20. Gai, J., Shen, J., Wang, H., Hu, Y. A Parameter-Optimized DBN Using GOA and Its Application in Fault Diagnosis of Gearbox//Shock and Vibration, Volume 2020, 2020, 4294095.
  21. Raja, P.S., Thangavel, K. Missing value imputation using unsupervised machine learning techniques//Soft Computing 24(6), с. 4361-4392,2020.
  22. Wang,D., Tian,F., Yang,S.X., Jiang,D., Cai,B. Improved deep CNN with parameter initialization for data analysis of near-infrared spectroscopy sensors//Sensors (Switzerland) Volume 20, Issue 3, 1 February 2020, 874, 20(3),874, 2020.
  23. Lou, P., Jimeno Yepes, A., Zhang, Z., Li, C., Wren, J. BioNorm: Deep learning-based event normalization for the curation of reaction databases//Bioinformatics Volume 36, Issue 2, 15 January 2020, Pages 611-620.
  24. Fu, S., Liu, X. A new method to solve the problem of facing less learning samples in signal modulation recognition//Eurasip Journal on Wireless Communications and Networking Volume 2020, Issue 1, 1 December 2020, 8.
  25. Wei, D., Chen, T., Li, S., Zhao, Y., Li, T. Adaptive dictionary learning based on local configuration pattern for face recognition//Eurasip Journal on Advances in Signal Processing Volume 2020, Issue 1, 1 December 2020, 20.
  26. Ala’raj, M., Majdalawieh, M., Abbod, M.F. Improving binary classification using filtering based on k-NN proximity graphs//Journal of Big Data, Volume 7, Issue 1, 1 December 2020, 15.
  27. Mishra, G., Vishwakarma, V.P., Aggarwal, A. Constrained L1-optimal sparse representation technique for face recognition/Optics and Laser Technology Volume 129, September 2020, 106232.
  28. Kibbey, T.C.G., Jabrzemski, R., O'Carroll, D.M.Supervised machine learning for source allocation of per- and polyfluoroalkyl substances (PFAS) in environmental samples//Chemosphere Volume 252, August 2020, 126593.
  29. Shen, Z., Man, Z., Cao, Z., Zheng, J. A new intelligent pattern classifier based on structured sparse representation //Computers and Electrical Engineering Volume 84, June 2020, 106641.
  30. Nishanov А.Kh., Djurayev G.P., Kasanova М.Kh. Improved algorithms for calculating evaluations in processing medical data // National Institute of Science Communication and Information Resources (NISCAIR)-India, 2019,-3158-3165.
  31. Kamilov M., Nishanov A., Beglerbekov R. Modified stages of algorithms for computing estimates in the space of informative features // International Journal of Innovative Technology and Exploring Engineering (2019) 8(6).
  32. Nishanov A. Avazov E. Akbaraliyev B. Partial selection method and algorithm for determining graph-based traffic routes in a real-time environment// International Journal of Innovative Technology and Exploring Engineering (2019) 8(6) 696-698 ISSN: 22783075.
  33. Emary E. Zawbaa H. Hassanien A. Binary ant lion approaches for feature selection// Neurocomputing. 2016 vol: 213. DOI 10.1016/j.neucom.2016.03.101. ISSN 18728286.
  34. Yong Z. Dun-wei G. Wan-qiu Z. Feature selection of unreliable data using an improved multi-objective PSO algorithm// Neurocomputing. 2016 vol: 171. DOI 10.1016/j.neucom.2015.07.057. ISSN 18728286.
  35. Zhang Y. Gong D. Sun X. Guo Y. A. PSO-based multi-objective multi-label feature selection method in classification.// Scientific Reports. 2017 vol: 7 (1). DOI 10.1038/s41598-017-00416-0. ISSN 20452322.

Share

COinS
 
 

To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.