MACHINE LEARNING APPROACH FOR TRANSFORMER CONDITION ASSESSMENT USING K-MEANS CLUSTERING AND MULTI-CLASSIFIER MODELS

Authors

  • Zulfiana Safitri Majid State Polytechnic of Ujung Pandang
  • Andarini Asri State Polytechnic of Ujung Pandang
  • Musfirah Putri Lukman State Polytechnic of Ujung Pandang
  • Wisna Saputri Alfira WS State Polytechnic of Ujung Pandang
  • Auliya Nabila State Polytechnic of Ujung Pandang
(*) Corresponding Author

DOI:

https://doi.org/10.34288/jri.v8i3.539

Keywords:

Transformer Fault, Machine Learning, Clustering, Classification, DGA

Abstract

Transformers play a critical role in power systems, yet their degradation is often difficult to detect due to complex influencing factors. Conventional diagnostic methods, such as Dissolved Gas Analysis (DGA), are time-consuming and rely heavily on expert interpretation. This study proposes a machine learning approach for transformer condition assessment by combining clustering and classification techniques. K-Means clustering is first applied to identify patterns in transformer condition data without prior labeling, with the optimal number of clusters determined as three using the Elbow Method. The resulting clusters are then used as pseudo-labels to train multiple classification models, including KNN, Decision Tree, SVM, Gradient Boosting, Extra Trees, and Voting Classifier. The results show that all models achieve high performance, with accuracy above 94%. Ensemble methods, particularly Gradient Boosting and Voting Classifier, achieve the best performance with an accuracy of 98.30%. These findings demonstrate that the proposed approach effectively improves transformer condition assessment and supports faster and more reliable maintenance decision-making.

Downloads

Download data is not yet available.

References

Anwar, M. T., Hadikurniawati, W., Winarno, E., & Widiyatmoko, W. (2020). Performance Comparison of Data Mining Techniques for Rain Prediction Models in Indonesia. 2020 3rd International Seminar on Research of Information Technology and Intelligent Systems (ISRITI), 83–88.

Arumugam, S. (2021). Failure diagnosis and root-cause analysis of in-service and defective distribution transformers. Energies, 14(16), 4997.

Balcan, M.-F., & Sharma, D. (2024). Learning accurate and interpretable tree-based models. ArXiv Preprint ArXiv:2405.15911.

Bishop, C. M., & Nasrabadi, N. M. (2006). Pattern recognition and machine learning (Vol. 4, Number 4). Springer.

Chakraborty, J., Pradhan, D. K., & Nandi, S. (2024). A multiple k-means cluster ensemble framework for clustering citation trajectories. Journal of Informetrics, 18(2), 101507.

Dicoding Indonesia. (2024, October 25). Belajar Machine Learning untuk Pemula. Dicoding Academy.

Du, K.-L., Jiang, B., Lu, J., Hua, J., & Swamy, M. N. S. (2024). Exploring kernel machines and support vector machines: Principles, techniques, and future directions. Mathematics, 12(24), 3935.

Hastie, T., Tibshirani, R., & Friedman, J. (2009). The elements of statistical learning. Springer series in statistics New-York.

Ikotun, A. M., Ezugwu, A. E., Abualigah, L., Abuhaija, B., & Heming, J. (2023). K-means clustering algorithms: A comprehensive review, variants analysis, and advances in the era of big data. Information Sciences, 622, 178–210.

Khan, M., Hooda, B. K., Gaur, A., Singh, V., Jindal, Y., Tanwar, H., Sharma, S., Sheoran, S., Vishwakarma, D. K., & Khalid, M. (2024). Ensemble and optimization algorithm in support vector machines for classification of wheat genotypes. Scientific Reports, 14(1), 22728.

Kurniawan, A., Rahmawati, Y., & Putranto, H. (2019). Studi Performa Transformator Daya Menggunakan Metode Health Index di Gardu Induk Waru Sidoarjo. SinarFe7, 2(1), 33–38.

MacQueen, J. (1967). Multivariate observations. Proceedings Ofthe 5th Berkeley Symposium on Mathematical Statisticsand Probability, 1, 281–297.

Majid, Z. S., Bachtiar, M. I., Faraby, M. D., Widyaningsih, D., Nurfadhilah, A., & Ridhwan, M. (2025). Machine Learning-Based Analysis of Transformer Insulation Failure. 2025 8th International Seminar on Research of Information Technology and Intelligent Systems (ISRITI), 245–250.

Mazarei, A., Sousa, R., Mendes-Moreira, J., Molchanov, S., & Ferreira, H. M. (2025). Online boxplot derived outlier detection. International Journal of Data Science and Analytics, 19(1), 83–97.

Medina, R. D., Romero, A. A., Mombello, E. E., & Rattá, G. (2017). Assessing degradation of power transformer solid insulation considering thermal stress and moisture variation. Electric Power Systems Research, 151, 1–11.

Park, Y. S., & Lee, Y.-S. (2011). Diagnostic cluster analysis of mathematics skills. IERI Monograph Series: Issues and Methodologies in Large-Scale Assessments, 4, 75–108.

Pensa, R. G., Crombach, A., Peignier, S., & Rigotti, C. (2025). Explaining Random Forest and XGBoost with Shallow Decision Trees by Co-clustering Feature Importance. Machine Learning, 114(12), 287.

Ridho Tri Putra Nanda, M. (2023). Perhitungan Health Index Untuk Menentukan Umur dan Kondisi Transformator Daya Kapasitas 80 MVA.

Sable, N. P., Patil, R. V, Deore, M., Bhimanpallewar, R., & Mahalle, P. N. (2024). Machine Learning Based Agricultural Profitability Recommendation Systems: A Paradigm Shift in Crop Cultivation. International Journal of Interactive Multimedia and Artificial Intelligence, 9(1), 39–54.

Singh, A., Halgamuge, M. N., & Lakshmiganthan, R. (2017). Impact of different data types on classifier performance of random forest, naive bayes, and k-nearest neighbors algorithms. International Journal of Advanced Computer Science and Applications, 8(12).

Sinsomboonthong, S. (2022). Performance comparison of new adjusted min‐max with decimal scaling and statistical column normalization methods for artificial neural network classification. International Journal of Mathematics and Mathematical Sciences, 2022(1), 3584406.

Suyal, M., & Sharma, S. (2024). A review on analysis of k-means clustering machine learning algorithm based on unsupervised learning. Journal of Artificial Intelligence and Systems, 6(1), 85–95.

Zhang, S., Li, X., Zong, M., Zhu, X., & Wang, R. (2017). Efficient kNN classification with different numbers of nearest neighbors. IEEE Transactions on Neural Networks and Learning Systems, 29(5), 1774–1785.

Zhao, S., Rui, C., & Zhang, Y. (2013). MICkNN: multi-instance covering kNN algorithm. Tsinghua Science and Technology, 18(4), 360–368.

Downloads

Published

2026-06-16

How to Cite

Majid, Z. S., Asri, A., Putri Lukman, M., Saputri Alfira WS, W., & Nabila, A. (2026). MACHINE LEARNING APPROACH FOR TRANSFORMER CONDITION ASSESSMENT USING K-MEANS CLUSTERING AND MULTI-CLASSIFIER MODELS. Jurnal Riset Informatika, 8(3), 416–425. https://doi.org/10.34288/jri.v8i3.539

Issue

Section

Articles