A SEASONAL IMPUTATION METHOD FOR ADDRESSING MISSING DATA IN ENVIRONMENTAL IOT SENSOR TIME SERIES

Authors

  • Ardiansyah Ramadhan Telkom University https://orcid.org/0009-0005-5126-1176
  • Surya Micrandi Nasution Telkom University
  • Reza Rendian Septiawan Telkom University
  • I Kadek Nuary Trisnawan Telkom University
  • Angel Metanosa Afinda Telkom University
(*) Corresponding Author

DOI:

https://doi.org/10.34288/jri.v8i2.475

Keywords:

Internet of Things, Seasonal imputation, Incomplete data, Time-series analysis

Abstract

Missing and incomplete observations in Environmental IoT sensor networks reduce data reliability and disrupt analyses, especially for temperature and humidity time series exhibiting strong diurnal seasonality. This study develops and evaluates a seasonal imputation method to address missing data in IoT-based environmental monitoring, using a workflow of anomaly detection, outlier removal, time-of-day-aware imputation, and performance evaluation under varying missing-rate scenarios. Key challenges include sensor noise, connectivity issues, and intermittent hardware failures, which degrade data integrity and affect trend analysis, forecasting, and anomaly detection. To mitigate these, the method uses hourly and minute-level seasonal patterns after filtering out physically unrealistic values. Experimental results show high accuracy and robustness in reconstructing temperature and humidity data: temperature imputation achieves MAE values of approximately 0.86–0.87°C, and humidity yields MAE values of 3.92–4.01%RH, with no performance drop even at 50% data loss. The imputed series preserves natural diurnal dynamics without introducing distortions, effectively restoring continuity and structural consistency in environmental IoT time series for reliable modeling, feature extraction, and decision support.

Downloads

Download data is not yet available.

Author Biographies

Ardiansyah Ramadhan, Telkom University

Computer Engineering Study Program

Surya Micrandi Nasution, Telkom University

Computer Engineering Study Program

Reza Rendian Septiawan, Telkom University

Computer Engineering Study Program

I Kadek Nuary Trisnawan, Telkom University

Computer Engineering Study Program

Angel Metanosa Afinda, Telkom University

Computer Engineering Study Program

References

Bhattacharya, S., Kamper, F., & Beirlant, J. (2023). Outlier detection based on extreme value theory and applications. Scandinavian Journal of Statistics, 50(3), 1466–1502. https://doi.org/10.1111/sjos.12665

Bhuyan, M. H., Bhattacharyya, D. K., & Kalita, J. K. (2014). Network anomaly detection: Methods, systems and tools. IEEE Communications Surveys and Tutorials, 16(1), 303–336. https://doi.org/10.1109/SURV.2013.052213.00046

Boukerche, A., Zheng, L., & Alfandi, O. (2020). Outlier Detection: Methods, Models, and Classification. ACM Computing Surveys, 53(3). https://doi.org/10.1145/3381028

Budiwati, T., Tanti, D. A., Indrawati, A., & Santoso, M. (2022). Analisis Kualitas Udara dan Hubunganya dengan Deposisi Basah di Semarang. In Dinamika dan Teknologi Atmosfer Benua Maritim Indonesia (pp. 185–200).

Chicco, D., Warrens, M. J., & Jurman, G. (2021). The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation. PeerJ Computer Science, 7, 1–24. https://doi.org/10.7717/PEERJ-CS.623

Dau, H. A., Silva, D. F., Petitjean, F., Forestier, G., Bagnall, A., Mueen, A., & Keogh, E. (2018). Optimizing dynamic time warping’s window width for time series data mining applications. Data Mining and Knowledge Discovery, 32(4), 1074–1120. https://doi.org/10.1007/s10618-018-0565-y

Eckardt, M., & Kerber, W. (2024). Property rights theory, bundles of rights on IoT data, and the EU Data Act. European Journal of Law and Economics. https://doi.org/10.1007/s10657-023-09791-8

Fatyanosa, T. N., Data, M., Firdausanti, N. A., Prayoga, P. H. N., Mendonça, I., & Aritsugi, M. (2023). ImputAnom: Anomaly Detection Framework Using Imputation Methods for Univariate Time Series. In P. Delir Haghighi, E. Pardede, G. Dobbie, V. Yogarajan, N. A. S. ER, G. Kotsis, & I. Khalil (Eds.), Information Integration and Web Intelligence (pp. 56–61). Springer Nature Switzerland.

Fatyanosa, T. N., Firdausanti, N. A., Prayoga, P. H. N., Kuriu, M., Aritsugi, M., & Mendonça, I. (2024). Meta-learning for vessel time series data imputation method recommendation. Expert Systems with Applications, 251. https://doi.org/10.1016/j.eswa.2024.124016

Hamel, P., Ding, N., Cherqui, F., Zhu, Q., Walcker, N., Bertrand-Krajewski, J. L., Champrasert, P., Fletcher, T. D., McCarthy, D. T., Navratil, O., & Shi, B. (2024). Low-cost monitoring systems for urban water management: Lessons from the field. Water Research X, 22. https://doi.org/10.1016/j.wroa.2024.100212

Hudda, S., & Haribabu, K. (2025). A review on WSN based resource constrained smart IoT systems. Discover Internet of Things, 5(1), 56. https://doi.org/10.1007/s43926-025-00152-2

Indrawati, A., Tanti, D. A., Ambarsari, N., Ma’ruf, I. F., Sumaryati, Setyawati, W., Pusfitasari, E. D., Nugroho, G. A., Cholianawati, N., Sinatra, T., Halimurrahman, Triani, L. R., Nafiisyanti, A., Putra, S. I., Nurlatifah, A., Huo, M., & Awaludin, A. (2024). Spatiotemporal distribution in chemical composition of wet atmospheric deposition in Bandung Indonesia. Environmental Science and Pollution Research. https://doi.org/10.1007/s11356-024-35485-y

Jalasena, A., Rachmawati, L. M., Hadi Syahputra, M. F., Hasmul, N. A., Suwandi, Indah Utami, A. R., Nugroho, B. S., & Chandra, I. (2023). PRELIMINARY STUDY OF URGENCY TO MONITOR INDOOR AIR QUALITY AT TELKOM EDUCATION AREA IN THE POST-PANDEMIC. IOP Conference Series: Earth and Environmental Science, 1194(1). https://doi.org/10.1088/1755-1315/1194/1/012004

Kwok, C.-F., Qian, G., & Kuleshov, Y. (2023). Analyzing Error Bounds for Seasonal-Trend Decomposition of Antarctica Temperature Time Series Involving Missing Data. Atmosphere, 14(2). https://doi.org/10.3390/atmos14020193

Laha, S. R., Pattanayak, B. K., & Pattnaik, S. (2022). Advancement of Environmental Monitoring System Using IoT and Sensor: A Comprehensive Analysis. AIMS Environmental Science, 9(6), 771–800. https://doi.org/10.3934/environsci.2022044

Lee, J. H., & Charles Huber, J. (2021). Evaluation of Multiple Imputation with Large Proportions of Missing Data: How Much Is Too Much? In Iran J Public Health (Vol. 50, Issue 7). https://creativecommons.org/licenses/by-nc/4.0/

Mahajan, M., Kumar, S., Pant, B., & Tiwari, U. K. (2020). Incremental Outlier Detection in Air Quality Data Using Statistical Methods. 2020 International Conference on Data Analytics for Business and Industry: Way Towards a Sustainable Economy (ICDABI), 1–5. https://doi.org/10.1109/ICDABI51230.2020.9325683

Mara Ribeiro, S. I., & Leite de Castro, C. I. (2022). Missing Data in Time Series: A Review of Imputation Methods and Case Study. In Journal of the Brazilian Society on Computational Intelligence (SBIC) (Vol. 20, Issue 1).

Mohammed, M. B., Zulkafli, H. S., Adam, M. B., Ali, N., & Baba, I. A. (2021). Comparison of five imputation methods in handling missing data in a continuous frequency table. AIP Conference Proceedings, 2355(1), 040006. https://doi.org/10.1063/5.0053286

Mois, G., Folea, S., & Sanislav, T. (2017). Analysis of Three IoT-Based Wireless Sensors for Environmental Monitoring. IEEE Transactions on Instrumentation and Measurement, 66(8), 2056–2064. https://doi.org/10.1109/TIM.2017.2677619

Muhr, D., Affenzeller, M., & Küng, J. (2023). A Probabilistic Transformation of Distance-Based Outliers. Machine Learning and Knowledge Extraction, 5(3), 782–802. https://doi.org/10.3390/make5030042

Mustofa, A. H., Chandra, I., Salam, R. A., & Rachmawati, L. M. (2022). Vertical Structure of PM2.5and CO2Concentrations in the Greater Bandung Air Basin. Journal of Physics: Conference Series, 2377(1). https://doi.org/10.1088/1742-6596/2377/1/012052

Nassif, A. B., Talib, M. A., Nasir, Q., & Dakalbab, F. M. (2021). Machine Learning for Anomaly Detection: A Systematic Review. In IEEE Access (Vol. 9, pp. 78658–78700). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ACCESS.2021.3083060

Pham, T. M., Pandis, N., & White, I. R. (2022). Missing data, part 2. Missing data mechanisms: Missing completely at random, missing at random, missing not at random, and why they matter. American Journal of Orthodontics and Dentofacial Orthopedics, 162(1), 138–139. https://doi.org/10.1016/j.ajodo.2022.04.001

Qin, Y., Ren, G., Zhang, P., Wu, L., & Wen, K. (2021). An imputation method for the climatic data with strong seasonality and spatial correlation. Theoretical and Applied Climatology, 144(1–2), 203–213. https://doi.org/10.1007/s00704-021-03537-9

Ramadhan, A., Chandra, I., Indrawati, A., Tanti, D. A., Mendonça, I., & Aritsugi, M. (2025). Decision support system using the expert-fuzzy method in validating real-time acid rain measurements. Journal of Reliable Intelligent Environments, 11(4). https://doi.org/10.1007/s40860-025-00258-y

Ramadhan, A., Indrawati, A., Tanti, D. A., Setyawati, W., & Chandra, I. (2025). AI-Driven Imputation and Correlation Analysis of Environmental Parameters for Understanding pH Variability in Acid Rain. 2025 IEEE International Conference on Networking, Intelligent Systems, and IoT (ICONS-IoT), 75–81. https://doi.org/10.1109/ICONS-IoT65216.2025.11211132

Saleem, S., Aslam, M., & Rukh Shaukat, M. (2021). A REVIEW AND EMPIRICAL COMPARISON OF UNIVARIATE OUTLIER DETECTION METHODS (Vol. 37, Issue 4).

Salvatore, D., Reagle, D., & New York Chicago San Francisco Lisbon London Madrid Mexico City Milan New Delhi San Juan Seoul Singapore Sydney Toronto, M.-H. (2002). Theory and Problems of STATISTICS AND ECONOMETRICS SECOND EDITION Schaum’s Outline Series. https://doi.org/10.1036/0071395687

Sarker, I. H. (2021). Data Science and Analytics: An Overview from Data-Driven Smart Computing, Decision-Making and Applications Perspective. In SN Computer Science (Vol. 2, Issue 5). Springer. https://doi.org/10.1007/s42979-021-00765-8

Seo, S. (2006). A Review and Comparison of Methods for Detecting Outliers in Univariate Data Sets. University of Pittsburgh.

Seu, K., Kang, M.-S., & Lee, H. (2022). An Intelligent Missing Data Imputation Techniques: A Review. www.joiv.org/index.php/joiv

Shadbahr, T., Roberts, M., Stanczuk, J., Gilbey, J., Teare, P., Dittmer, S., Thorpe, M., Torné, R. V., Sala, E., Lió, P., Patel, M., Preller, J., Lió, P., Walton, N., Holzer, M., Wassin, M., Escudero Sánchez, L., Babar, J., Prosch, H., … Schönlieb, C. B. (2023). The impact of imputation quality on machine learning classifiers for datasets with missing values. Communications Medicine, 3(1). https://doi.org/10.1038/s43856-023-00356-z

Singh, D., Dahiya, M., Kumar, R., & Nanda, C. (2021). Sensors and systems for air quality assessment monitoring and management: A review. Journal of Environmental Management, 289, 112510. https://doi.org/https://doi.org/10.1016/j.jenvman.2021.112510

Smiti, A. (2020). A critical overview of outlier detection methods. Computer Science Review, 38, 100306. https://doi.org/https://doi.org/10.1016/j.cosrev.2020.100306

Vaicdan, F., Chandra, I., & Suhendi, A. (2019). Pengamatan Konsentrasi Massa PM2.5 di Cekungan Udara Bandung Raya (Observation of PM2.5 mass concentrations in the Greater Bandung air basin).

Vasenin, D., Pasetti, M., Astolfi, D., Savvin, N., Rinaldi, S., & Berizzi, A. (2024). Incorporating Seasonal Features in Data Imputation Methods for Power Demand Time Series. IEEE Access, 12, 103520–103536. https://doi.org/10.1109/ACCESS.2024.3434652

Yu, Y., Zhu, Y., Li, S., & Wan, D. (2014). Time series outlier detection based on sliding window prediction. Mathematical Problems in Engineering, 2014. https://doi.org/10.1155/2014/879736

Zhao, Y., & Udell, M. (2024). gcimpute: A Package for Missing Data Imputation. Journal of Statistical Software, 108(4), 1–27. https://doi.org/10.18637/jss.v108.i04

Downloads

Published

2026-03-15

How to Cite

Ramadhan, A., Nasution, S. M., Septiawan, R. R., Trisnawan, I. K. N., & Afinda, A. M. (2026). A SEASONAL IMPUTATION METHOD FOR ADDRESSING MISSING DATA IN ENVIRONMENTAL IOT SENSOR TIME SERIES. Jurnal Riset Informatika, 8(2), 215–229. https://doi.org/10.34288/jri.v8i2.475

Issue

Section

Articles