ML Modeling for Carbonate Reservoir Characterization
DOI:
https://doi.org/10.29017/scog.v49i1.1969Keywords:
carbonate rock, machine learning, permeability, porosity, reservoir characterizationAbstract
Reservoir characterization is essential for understanding rock and fluid behavior in hydrocarbon field development. In the Baturaja Formation, South Sumatra Basin, this process is challenging due to heterogeneity resulting from depositional and diagenetic variations. Limited core data and the high cost of conventional analysis encourage the use of machine learning (ML). This study aims to predict formation, facies, porosity, and permeability using ML algorithms and to assess the impact of feature augmentation. The dataset includes well log and core data from 13 wells. The workflow consists of preprocessing, feature selection, feature engineering, and supervised learning using Decision Tree, Random Forest, XGBoost, and KNN. Performance was evaluated using the F1-score for classification and MAE/RMSE for regression, followed by blind testing on wells HARLEY and XSR. Random Forest achieved the best formation prediction (F1-score 0.9890; blind test 0.9975) because the well data fall within the range of the training data distribution, although accuracy decreased in XSR due to differences in data distribution. XGBoost was the most accurate for facies prediction, improving from an F1-score of 0.9648 to 0.9741 after feature augmentation. For porosity and permeability, Random Forest produced the lowest errors, although permeability remained challenging in heterogeneous carbonates. Overall, ML provides an efficient and accurate approach, with Random Forest and XGBoost performing best, and feature augmentation consistently enhancing model generalization.
References
Al-Mudhafar, W. J., Hasan, A. A., Abbas, M. A., & Wood, D. A. (2025). Machine learning with hyperparameter optimization applied in facies-supported permeability modeling in carbonate oil reservoirs. Scientific Reports, 15(1), 12939. https://doi.org/10.1038/s41598-025-95490-0
Bruce, P., Bruce, A., & Gedeck, P. (2020). Practical Statistics for Data Scientists (2nd ed.). O’Reilly Media, Inc.
Chowdhury, Md. S., Tanjil, H., & Akter, S. (2019). Production Logging and its Implementation: A Technical Review. International Journal of Petroleum and Petrochemical Engineering, 5, 42–51. https://doi.org/10.20431/2454-7980.0502004
Downey, A. B. (2014). Think Stats Exploratory Data Analysis in Python. Green Tea Press.
Ginger, D., & Fielding, K. (2005). The Petroleum Systems and Future Potential of The South Sumatra Basin. Proceedings Indonesian Petroleum Association, 30th Annual Convention & Exhibition, 67–89.
Hafwandi, B. S., Irawan, D., & Yasutra, A. (2023). Study of Permeability Prediction Using Hydraulic Flow Unit (HFU) and Machine Learning Method in “BSH” Field. PETRO:Jurnal Ilmiah Teknik Perminyakan, 12(2), 98–121. https://doi.org/10.25105/petro.v12i2.15763
He, X., Zhu, W., Kwak, H., Yousef, A., & Hoteit, H. (2024). Deep learning-assisted Bayesian framework for real-time CO2 leakage locating at geologic sequestration sites. Journal of Cleaner Production, 448, 141484. https://doi.org/10.1016/j.jclepro.2024.141484
Lv, M., Li, K., Cai, J., Mao, J., Gao, J., & Xu, H. (2025). Evaluation of landslide susceptibility based on SMOTE-Tomek sampling and machine learning algorithm. PLOS One, 20(5), e0323487. https://doi.org/10.1371/journal.pone.0323487
Ma, Y. Z. (2019). Quantitative Geosciences: Data Analytics, Geostatistics, Reservoir Characterization and Modeling. Springer International Publishing. https://doi.org/10.1007/978-3-030-17860-4
Nugroho, I. D. R., Trisna, M. D., & Saroji, S. (2024). An Implementation of XGBoost and Random Forest Algorithm to Estimate Effective Porosity and Permeability on Well log data at Fajar Field, South Sumatera Basin, Indonesia. INDONESIAN JOURNAL OF APPLIED PHYSICS, 14(2), 271. https://doi.org/10.13057/ijap.v14i2.82901
Sastra, M. M., & Rohmana, R. C. (2024). Perbandingan Metode Klasifikasi Machine Learning: Studi Kasus Prediksi Jenis Litologi Berdasarkan Data Well Log Pada Formasi Sleipner, North Sea. Jurnal Teknik, 13(2), 51–63.
Septiano, J., Yasutra, A., & Rahmawati, S. D. (2022). Build of Machine Learning Proxy Model for Prediction of Wax Deposition Rate in Two Phase Flow Water-Oil. Scientific Contributions Oil and Gas, 45(1), 34–48. https://doi.org/10.29017/SCOG.45.1.922
Seyyedattar, M., Zendehboudi, S., & Butt, S. (2020). Technical and Non-technical Challenges of Development of Offshore Petroleum Reservoirs: Characterization and Production. Natural Resources Research, 29(3), 2147–2189. https://doi.org/10.1007/s11053-019-09549-7
Tukey, J. W. (1977). Exploratory Data Analysis. Addison-Wesley Publishing Company.
Wardhana, R., Yasutra, A., Irawan, D., & Haidar, M. (2022). A Case Study of Primary and Secondary Porosity Effect for Permeability Value in Carbonate Reservoir using Differential Effective Medium and Adaptive Neuro-Fuzzy Inference System Method. Scientific Contributions Oil and Gas, 45, 50–59. https://doi.org/10.29017/SCOG.45.1.923
Wardhana, S. G., Pakpahan, H. J., Simarmata, K., Pranowo, W., & Purba, H. (2021). Algoritma Komputasi Machine Learning untuk Aplikasi Prediksi Nilai Total Organic Carbon (TOC). Lembaran Publikasi Minyak Dan Gas Bumi, 55(2), 75–87. https://doi.org/10.29017/LPMGB.55.2.606
Wulandari, Y. P., & Rosid, M. S. (2023). Identifying Pore Type of Lagoon and Barrier Carbonate to Model Shear Wave Velocity in Salawati Basin by Differential Kuster-ToksÖz. POSITRON, 13(1), 1. https://doi.org/10.26418/positron.v13i1.63761
Published
Issue
Section
License
Copyright (c) 2026 © Copyright by Authors. Published by LEMIGAS

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors are free to Share — copy and redistribute the material in any medium or format for any purpose, even commercially Adapt — remix, transform, and build upon the material for any purpose, even commercially.
The licensor cannot revoke these freedoms as long as you follow the license terms, under the following terms Attribution — You must give appropriate credit , provide a link to the license, and indicate if changes were made . You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.









