ML Modeling for Carbonate Reservoir Characterization

Authors

  • Sayyidah Adilia Mahfudhoh Faculty of Engineering, Universitas Jember
  • Welayaturromadhona Faculty of Engineering, Universitas Jember

DOI:

https://doi.org/10.29017/scog.v49i1.1969

Keywords:

carbonate rock, machine learning, permeability, porosity, reservoir characterization

Abstract

Reservoir characterization is essential for understanding rock and fluid behavior in hydrocarbon field development. In the Baturaja Formation, South Sumatra Basin, this process is challenging due to heterogeneity resulting from depositional and diagenetic variations. Limited core data and the high cost of conventional analysis encourage the use of machine learning (ML). This study aims to predict formation, facies, porosity, and permeability using ML algorithms and to assess the impact of feature augmentation. The dataset includes well log and core data from 13 wells. The workflow consists of preprocessing, feature selection, feature engineering, and supervised learning using Decision Tree, Random Forest, XGBoost, and KNN. Performance was evaluated using the F1-score for classification and MAE/RMSE for regression, followed by blind testing on wells HARLEY and XSR. Random Forest achieved the best formation prediction (F1-score 0.9890; blind test 0.9975) because the well data fall within the range of the training data distribution, although accuracy decreased in XSR due to differences in data distribution. XGBoost was the most accurate for facies prediction, improving from an F1-score of 0.9648 to 0.9741 after feature augmentation. For porosity and permeability, Random Forest produced the lowest errors, although permeability remained challenging in heterogeneous carbonates. Overall, ML provides an efficient and accurate approach, with Random Forest and XGBoost performing best, and feature augmentation consistently enhancing model generalization.

References

Al-Mudhafar, W. J., Hasan, A. A., Abbas, M. A., & Wood, D. A. (2025). Machine learning with hyperparameter optimization applied in facies-supported permeability modeling in carbonate oil reservoirs. Scientific Reports, 15(1), 12939. https://doi.org/10.1038/s41598-025-95490-0

Bruce, P., Bruce, A., & Gedeck, P. (2020). Practical Statistics for Data Scientists (2nd ed.). O’Reilly Media, Inc.

Chowdhury, Md. S., Tanjil, H., & Akter, S. (2019). Production Logging and its Implementation: A Technical Review. International Journal of Petroleum and Petrochemical Engineering, 5, 42–51. https://doi.org/10.20431/2454-7980.0502004

Downey, A. B. (2014). Think Stats Exploratory Data Analysis in Python. Green Tea Press.

Ginger, D., & Fielding, K. (2005). The Petroleum Systems and Future Potential of The South Sumatra Basin. Proceedings Indonesian Petroleum Association, 30th Annual Convention & Exhibition, 67–89.

Hafwandi, B. S., Irawan, D., & Yasutra, A. (2023). Study of Permeability Prediction Using Hydraulic Flow Unit (HFU) and Machine Learning Method in “BSH” Field. PETRO:Jurnal Ilmiah Teknik Perminyakan, 12(2), 98–121. https://doi.org/10.25105/petro.v12i2.15763

He, X., Zhu, W., Kwak, H., Yousef, A., & Hoteit, H. (2024). Deep learning-assisted Bayesian framework for real-time CO2 leakage locating at geologic sequestration sites. Journal of Cleaner Production, 448, 141484. https://doi.org/10.1016/j.jclepro.2024.141484

Lv, M., Li, K., Cai, J., Mao, J., Gao, J., & Xu, H. (2025). Evaluation of landslide susceptibility based on SMOTE-Tomek sampling and machine learning algorithm. PLOS One, 20(5), e0323487. https://doi.org/10.1371/journal.pone.0323487

Ma, Y. Z. (2019). Quantitative Geosciences: Data Analytics, Geostatistics, Reservoir Characterization and Modeling. Springer International Publishing. https://doi.org/10.1007/978-3-030-17860-4

Nugroho, I. D. R., Trisna, M. D., & Saroji, S. (2024). An Implementation of XGBoost and Random Forest Algorithm to Estimate Effective Porosity and Permeability on Well log data at Fajar Field, South Sumatera Basin, Indonesia. INDONESIAN JOURNAL OF APPLIED PHYSICS, 14(2), 271. https://doi.org/10.13057/ijap.v14i2.82901

Sastra, M. M., & Rohmana, R. C. (2024). Perbandingan Metode Klasifikasi Machine Learning: Studi Kasus Prediksi Jenis Litologi Berdasarkan Data Well Log Pada Formasi Sleipner, North Sea. Jurnal Teknik, 13(2), 51–63.

Septiano, J., Yasutra, A., & Rahmawati, S. D. (2022). Build of Machine Learning Proxy Model for Prediction of Wax Deposition Rate in Two Phase Flow Water-Oil. Scientific Contributions Oil and Gas, 45(1), 34–48. https://doi.org/10.29017/SCOG.45.1.922

Seyyedattar, M., Zendehboudi, S., & Butt, S. (2020). Technical and Non-technical Challenges of Development of Offshore Petroleum Reservoirs: Characterization and Production. Natural Resources Research, 29(3), 2147–2189. https://doi.org/10.1007/s11053-019-09549-7

Tukey, J. W. (1977). Exploratory Data Analysis. Addison-Wesley Publishing Company.

Wardhana, R., Yasutra, A., Irawan, D., & Haidar, M. (2022). A Case Study of Primary and Secondary Porosity Effect for Permeability Value in Carbonate Reservoir using Differential Effective Medium and Adaptive Neuro-Fuzzy Inference System Method. Scientific Contributions Oil and Gas, 45, 50–59. https://doi.org/10.29017/SCOG.45.1.923

Wardhana, S. G., Pakpahan, H. J., Simarmata, K., Pranowo, W., & Purba, H. (2021). Algoritma Komputasi Machine Learning untuk Aplikasi Prediksi Nilai Total Organic Carbon (TOC). Lembaran Publikasi Minyak Dan Gas Bumi, 55(2), 75–87. https://doi.org/10.29017/LPMGB.55.2.606

Wulandari, Y. P., & Rosid, M. S. (2023). Identifying Pore Type of Lagoon and Barrier Carbonate to Model Shear Wave Velocity in Salawati Basin by Differential Kuster-ToksÖz. POSITRON, 13(1), 1. https://doi.org/10.26418/positron.v13i1.63761

Published

25-02-2026

Issue

Section

Articles