Performance evaluation of machine learning pipelines for pore pressure prediction

dc.contributor.authorNaweed, MNM
dc.contributor.authorSuheerman, S
dc.contributor.authorDilkushan, SMDKR
dc.contributor.authorThiruchittampalam, S
dc.contributor.authorWickrama, MADMG
dc.date.accessioned2026-01-09T03:49:20Z
dc.date.issued2025
dc.description.abstractAccurate pore pressure prediction is critical for safe drilling operations. Conventional prediction methods, which rely on simplified empirical assumptions, often fail to capture the multivariate and non-linear relationships present in complex geological settings. Machine learning (ML) provides a data-driven approach that can model these complexities directly from well log data without relying on predefined physical equations. However, the practical application of ML is often inconsistent due to a lack of systematic understanding of how data preprocessing choices impact final model performance. This study aims to resolve this uncertainty by identifying the optimal combination of preprocessing strategy and ML algorithm for this task. A comparative analysis was conducted across four scenarios: raw data, outlier-capped data, feature-selected data, and combined preprocessing (outlier capping and feature selection) using six ML algorithms to systematically evaluate the effects of outlier capping and the removal of multicollinear features. The findings identify a tuned XGBoost model as the top performer (R² = 0.9789), achieving this optimal result on the raw, unprocessed dataset. This key finding, when analyzed in the context of the other experimental scenarios, demonstrates that removing linearly correlated features can be detrimental to advanced models and that the necessity of outlier treatment is algorithm dependent. This study concludes that while the data preparation strategy is universal, it is closely tied to algorithm choice, offering a context-aware framework to enhance model reliability and support interpretability in future research.
dc.identifier.conferenceInternational Symposium on Earth Resources Management and Environment - ISERME 2025
dc.identifier.departmentDepartment of Earth Resources Engineering
dc.identifier.doihttps://doi.org/10.31705/ISERME.2025.15
dc.identifier.emailmaheshwari@uom.lk
dc.identifier.facultyEngineering
dc.identifier.issn2961-5372
dc.identifier.pgnospp. 95-102
dc.identifier.placeMoratuwa, Sri Lanka
dc.identifier.proceedingProceedings of the 9th International Symposium on Earth Resources Management & Environment
dc.identifier.urihttps://dl.lib.uom.lk/handle/123/24706
dc.language.isoen
dc.publisherDepartment of Earth Resources Engineering, University of Moratuwa, Sri Lanka
dc.subjectEnsemble methods
dc.subjectFeature selection
dc.subjectGeomechanics
dc.subjectHyperparameter tuning
dc.subjectOutlier capping
dc.subjectXGBoost
dc.titlePerformance evaluation of machine learning pipelines for pore pressure prediction
dc.typeConference-Full-text

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ISERME.2025.15.pdf
Size:
644.42 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections