Review of Automata Processor of KNN Algorithm Applications in Machine Learning

Authors

  • Ismail Rashid Fadulilahi Assistant Professor, Department of Computer Science, Faculty of Information and Communication Technology, SDD University of Business and Integrated Development Studies, Wa, Ghana.
  • Fredrick Kuupille Assistant Professor, Department of Informatics, Faculty of Information and Communication Technology, SDD University of Business and Integrated Development Studies, Wa, Ghana.
  • Yahaya Haleem Assistant Professor, Department of Informatics, Faculty of Information and Communication Technology, SDD University of Business and Integrated Development Studies, Wa, Ghana.

DOI:

https://doi.org/10.69996/jcai.2025025

Keywords:

Automata Processor, KNN, Similarity, NFA, Applications

Abstract

This paper examined the concepts and application of the kNN algorithm relating to machine learning characteristics on automata processes. The algorithm applications focused on network congestion, demand, and supply relating to price and time, and finally, the usage of transaction mode in financial transactions in terms of deposit and withdrawal. The study reviewed machine learning as an aspect of artificial intelligence in computing systems that improves and automatically applies to a variety of
complex mathematical computations to address problems using similarity search on query techniques with a given dataset. This was applied with k-nearest neighbors (kNN) developing an effective design of the automata processor (AP)) related to non-von Neumann architecture that focused on nondeterministic finite automaton (NFA) driven by the execution model; According to the research, KNN performance is affected by a number of factors, including parameter normalization, the value of K, and the distance
between the two points on the graph. The study also revealed some challenges of the kNN algorithm, such as computational efficiency, selecting the optimal value of k, high-dimensional data handling, and finally, noise and outlier sensitivity. The results of the kNN algorithm indicated common values of each MSE for both demand and supply prediction, and the network congestion. The optimal k-value is 2, with a
corresponding 66.67% without congestion. Finally, with regard to the financial transaction, the kNN algorithm computes each customer transaction frequency with the selected bank account number carried
out within the range of the query provided.

References

[1] D. Achlioptas, “Database-friendly random projections,” Proceedings of the Twentieth ACM SIGMOD-SIGACT-SIGART Symposium on Principles, 2001.

[2] U. Anand, M. Shivkumar and J. Roheet, "Comparative study of SVM and KNN Forsignature verification," Journal of Statistics and Management Systems, vol. 23, no. 2, pp. 191198, 2020

[3] A. Andoni and P.Indyk, “Near-Optimal Hashing Algorithms for Approximate Nearest Neighbor in High Dimensions,” 47th Annual IEEE Symposium onFoundations of Computer Science (FOCS’06), Berkeley, CA, USA, 2006.

[4] S. Akila and U.S Reddy, “Cost-sensitive Risk Induced Bayesian Inference Bagging (RIBIB) for credit card fraud detection,” J Comput Sci., vol.27, pp.247–54, 2018.

[5] A. S. Arefin, C.Riveros and R. Berretta, “A software tool for fast and scalable kNN computation using gpus,” PublicLibrary of Science, vol. 7, pp. e44000, 2012.

[6] R. Agrawal, C.Faloutsos,andA.N. Swami. “Efficient similarity search in sequence databases,” in Proceedings of the 4th International Conference on Foundations of Data Organization and Algorithms, FODO ’93, London, UK, pp. 69–84, 1993.

[7] L. Aronovich, R. Asher, E.Bachmat, H.Bitner, Hirsch and S. Klein, “The design of a similarity-based deduplication system,” in Proceedings of SYSTOR 2009: The Israeli Experimental Systems Conference, SYSTOR ’09, New York, NY, USA, pp. 6:1–6:14, 2009.

[8] B.E. Boser, I.MGuyon. andV. N. Vapnik, “A Training Algorithm for Optimal Margin Classifiers,” Proceedings of the Fifth Annual Workshop on Computational Learning Theory, Pittsburgh, pp.144- 152, 1992.

[9] L. Breiman, and R. Ihaka, “Nonlinear Discriminant Analysis via Scaling and ACE,” Department of Statistics, University of California, Berkeley.

[10] K. Chakrabarti and S.Mehrotra, “Local dimensionality reduction: a new approach to indexing high dimensional spaces,” In Proceedings of the 26th International Conference on Very Large Data Bases (VLDB ‘00), Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 89–100, 2000

[11] Y. Chen, T. Luo, S. Liu, S. Zhang, L. He et al., “A machine-learning supercomputer,” in 2014 47th Annual IEEE/ACM International Symposium on Micro architecture (MICRO), pp. 609–622, 2014.

[12] C. Cortes and V. Vapnik. “Support-Vector Networks,” Machine Learning, pp.273-297, 1995.

[13] T.M. Cover and P. D, “Nearest neighbor pattern classification,” IEEE Trans Inf Theory, vol.13, no.1, pp.21–7, 1967.

[14] P. Dlugosch, D. Brown, P. Glendenning, M. Leventhal and H. Noyes, “An efficient and scalablesemiconductor architecture for parallel automata processing,” IEEE Transactions on Parallel and Distributed Systems, vol. 25, no. 12, pp. 3088–3098, 2014

[15] R. Edwin, and Z. Bogdan, “Comparison of Support Vector Machine, Random Forest and NeuralNetwork Classifiers for Tree Species Classification on Airborne Hyperspectral APEX Images,” European Journal of Remote Sensing, vol.50, pp.144-154, 2017.

[16] B. Ghimire, J. Rogan, V. R. Galiano, P. Panday and V. Neeti, “An Evaluation of Bagging, boosting, and Random forests for Land-Cover Classification in Cape Cod, Massachusetts, USA,” GI Science and Remote Sensing, vol.49, pp.623-643, 2012.

[17] P. Hall and B. Samworth, “Choice of neighbor order in nearest-neighbor classification,” The Annals of Statistics, vol. 36, no.5, pp.2135-2152, 2008.

[18] O. Halvani, M., Steinebach and R. Zimmermann, “Authorship Verification via K-Nearest Neighbor Estimation,” Notebook PAN at CLEF, CLEF 2013 Working Notes, Valencia, 2013.

[19] N.A. Hamid and S.N. Jarif, “Handwritten recognition using SVM, KNN and neural Network,” arXiv (Cornell University), 2017.

[20] T. Hastie, R. Tibshirani and J. Friedman, “The Elements of Statistical Learning: Kernel Methods in Machine Learning,” Annals of Statistics, vol.36, no.3, pp. 1171–1220, 2008.

[21] S. Jingwen, D. Weixing and S. Niancai, “A Survey of kNN Algorithm,” Information Engineering and Applied Computing, pp.1 – 10, 2018.

[22] I. Kamel and C. Faloutsos, “Hilbert R-tree: An Improved R-tree using Fractals,” In Proceedings of the 20th International Conference on Very Large Data Bases (VLDB ‘94), 1994.

[23] M. Kusner, Y. Sun, N.Kolkin and K.Q. Weinberger, “From Word Embeddings To Document Distances” in Proceedings of the 32nd International Conference on Machine Learning (ICML-15), pp. 957–966, 2015.

[24] V. Maniraj and G. Varja, “Supervised Learning Algorithm in Classification: A Comparison of SVM and KNN,” International Research Journal of Education and Technology, vol. 7, no.2 , pp 93 – 97, 2025

[25] S.E. Mohsen and A. Scholz, “Human activity recognition using K-Nearest Neighbor Machine Learning Algorithm,” In Smart innovation, systems and technologies, pp. 304–313, 2021.

[26] T. I. Purcell, C. Donner, M. Cammarano, H.W. Jensen and P. Hanrahan, “Photon mapping on programmable graphics hardware,” in Proceedings of the ACM SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware, pp. 41–50, 2003.

[27] K.H. Rajib, N.U. Mohammed, U.M. Ashraf, A. Sunil and K. Ansam, “Enhancing K-nearest neighbor algorithm: a comprehensive review and performance analysis of modifications,” Journal of Big Data, vol.11, 2024.

[28] J.P.Rigol-Sanchez, M. Chica-Olmo and F. Abarca-Hernandez, “Artificial Neural Networks as a Tool for Mineral Potential Mapping with GIS,” International Journal of Remote Sensing, vol.24, pp.1151- 1156, 2003.

[29] J. Roy, “High Performance Pattern Matching using the Automata Processor,” In Proc. of IPDPS’16, 2016

[30] M. Satyanarayana, “Machine Learning: KNN Algorithm a Review,” International Journal of Software and Hardware Research in Engineering, vol.7, no.6, 2019.

[31] L. Shenshen, W. Cheng, L. Ying Liu and J. Liheng, “Design and evaluation of a parallel k-nearest neighbor algorithm on CUDA-enabled GPU,” 2010 IEEE 2nd Symposium on Web Society (SWS), pp.53-60, 2010.

[32] J. Sivic and A. Zisserman, “Video google: A text retrieval approach to object matching in videos,” in Proceedings of the Ninth IEEE International Conference on Computer Vision, vol.2, pp. 1470– 1481, 2003

[33] Y. Song, H. Jian ,Z. Ding, Z. Hongyuan and G. Lee, “Iknn: Informative k-nearest neighbour pattern classification,” in Knowledge Discovery in Databases, pp. 248– 264, 2007.

[34] A. Statnikov, L. Wang and C.F. Aliferis, “A Comprehensive Comparison of Random Forests and Support Vector Machines for Microarray-Based Cancer Classification,” BMC Bioinformatics, vol.9, no. 319, 2008.

[35] P. Professionals, "Advantages and Disadvantages of KNN Algorithm in Machine Learning," disadvantages-of-knn, 2017.

[36] A. Torralba, R. Fergus and W.T. Freeman, “80 million tiny images: A large data set for nonparametric object and scene recognition,” IEEE Trans Pattern Anal. Mach. Intell., vol. 30, pp. 1958–1970, 2008.

[37] B. Venkateswarlu and G. Rekha, “Exploring the Power and Practical Applications of K-Nearest Neighbours (KNN) in Machine Learning,” Journal of Computer Allied Intelligence, vol.02, no.01, pp.8-15, 2024.

[38] B. Waske and M. Braun, “Classifier Ensembles for Land Cover Mapping Using Multitemporal SAR Imagery,” ISPRS Journal of Photogrammetry and Remote Sensing, vol.64, pp.450-457 2009.

[39] X. Wu, V. Kumar, J. Ross, I. Quinlan, Q. Ghosh et al., “Top 10 algorithms in data mining,” Knowl. Inf. Syst., vol. 14, pp. 1–37, 2007.

[40] C. Yang, X. Yu and L. Yang, “Continuous KNN join processing for real-time recommendation,” 2014 IEEE International Conference on Data Mining, Shenzhen, China, 2014.

[41] C. Zanchettin, B.L.D. Bezerra and W.W. Azevedo, “A KNN-SVM hybrid model for cursive handwriting recognition,” The 2012 International Joint Conference on Neural Networks (IJCNN), Brisbane, QLD, Australia, 2012.

[42] K. Zhou, “Regular expression acceleration on the micron automata processor: Brill tagging as a case study,” Big Data CA, 2015.

Downloads

Published

2025-10-31

How to Cite

Ismail Rashid Fadulilahi, Fredrick Kuupille, & Yahaya Haleem. (2025). Review of Automata Processor of KNN Algorithm Applications in Machine Learning. Journal of Computer Allied Intelligence(JCAI, ISSN: 2584-2676), 3(5), 54-67. https://doi.org/10.69996/jcai.2025025