Concrete Crrotion
Concrete Crrotion
com/scientificreports
                                  The integration of Artificial Intelligence techniques, particularly Artificial Neural Networks (ANNs),
                                  has transformed predictive modeling in structural and durability engineering. This study investigates
                                  the use of ANN-based approaches to predict the corrosion rates of mild steel reinforcement embedded
                                  in cementitious composites subjected to clay-dominated soil environments. Key environmental
                                  parameters, sodium chloride (NaCl) content (0-4%), inhibitor dosage (DOI) (0-5%), and exposure
                                  duration (30-180 days), were selected as input variables. Two ANN architectures, Feedforward
                                  Backpropagation (FFBP) and Cascadeforward Backpropagation (CFBP), were developed and trained
                                  using 72 experimental data points extracted from the literature. The FFBP model outperformed
                                  CFBP in terms of predictive accuracy, achieving a correlation coefficient (R) of 0.998, a mean absolute
                                  percentage error (MAPE) of 30.43%, and a root mean square error (RMSE) of 0.071 during testing.
                                  Sensitivity analysis revealed that inhibitor dosage had the most significant influence on corrosion
                                  behavior, followed by NaCl concentration and exposure duration. The findings confirm that ANN
                                  models can effectively capture the nonlinear interactions governing corrosion progression, even under
                                  complex environmental conditions associated with clayey soils. This research provides a reliable and
                                  practical AI-driven framework for assessing corrosion risk, guiding material design, and enhancing
                                  long-term infrastructure durability in aggressive subsurface conditions. The study underscores the
                                  growing relevance of machine learning in simulating time-dependent deterioration processes in
                                  geotechnical and structural materials.
                                  Keywords Corrosion behavior prediction, Cementitious composite materials, Neural network modeling,
                                  Reinforced steel durability
                                  Abbreviations
                                  AAD	Average Absolute Deviation
                                  ANN	Artificial Neural Network
                                  ANNs	Artificial Neural Networks
                                  CFBP	Cascadeforward Backpropagation
                                  FFBP	Feedforward Backpropagation
                                  MAPE	Mean Absolute Percentage Error
                                  MLP	Multilayer Perceptron
                                  MSE	Mean Sum of Squares
                                  R2	Determination Coefficient
                                  R	Correlation Coefficient
                                  RMSE	Root Mean Square Error
                                  SVM	Support Vector Machine
                                  STDV	Standard Deviation
                                  Concrete has long served as the backbone of modern infrastructure, enabling the construction of buildings,
                                  highways, dams, and underground systems that shape cities and support economies1. Much of this infrastructure
1Geomechanics & Geotechnics, University of Kiel, Kiel, Germany. 2Department of Civil Engineering, Aligarh Muslim
                                 University, Aligarh, India. 3Narmada Hydroelectric Development Corporation Ltd., Bhopal, India. 4Kiewit Inc.,
                                 Denver, USA. email: shahbaz.10ahmad@gmail.com; shahbaz.ahmad@ifg.uni-kiel.de
                                  was built during the industrial and urban expansion of the last half-century2, with design lives of 50 to 100
                                  years. As these structures age, maintenance and rehabilitation become critical, particularly in aggressive
                                  environments that accelerate deterioration. Among such environments, clay-dominated soils present unique
                                  durability challenges due to their high moisture retention, low permeability, high ion exchange capacity, and
                                  often acidic pH. These geochemical characteristics intensify the electrochemical processes that drive steel
                                  corrosion in buried reinforced concrete. Studies have shown that clay-rich media accelerate chloride ingress and
                                  corrosion initiation3–6, with significant implications for underground material performance7–9. Understanding
                                  and predicting corrosion behavior in such settings is vital for extending the service life of reinforced concrete
                                  structures. Concrete durability modeling has evolved through three major scientific paradigms: empiricism,
                                  theory, and computation10. Empirical studies established early understanding through observation, while
                                  theoretical models provided frameworks for predicting complex behaviors. Computational methods, including
                                  finite element modeling (FEM), later introduced more refined simulations. Alkam11 predicted service life for RC
                                  structures in chloride environments, while Lin and Xiang12 developed a model incorporating environmental and
                                  material parameters. Ahmad13 reviewed corrosion monitoring methods and predictive models. Classic models
                                  by Bazant14, Morinaga15, and Wang and Zhao16 described corrosion-induced cracking through mechanical-
                                  expansion models and FEM. These works form the foundation for durability prediction. Finite element-based
                                  numerical models have evolved significantly, incorporating coupled processes such as heat, moisture, and ion
                                  transport to simulate chloride diffusion and corrosion-induced damage17. Although these models provide
                                  mechanistic insight, their reliance on material-specific parameters and complex boundary conditions limits
                                  their scalability.
                                      Moreover, such models are computationally intensive and less adaptable to large datasets arising from
                                  modern field monitoring campaigns, especially in geotechnical contexts involving expansive or problematic soils
                                  common in rural infrastructure18. To address these limitations, researchers are increasingly turning to artificial
                                  intelligence (AI) and machine learning (ML) approaches. Among them, artificial neural networks (ANNs) have
                                  gained prominence due to their capacity to learn nonlinear relationships from data, making them ideal for
                                  problems involving complex environmental interactions. Although ANNs have been in use for several decades,
                                  their continued relevance lies in their adaptability, transparency in architecture, and interpretability. Compared
                                  to more recent deep learning models such as convolutional neural networks (CNNs) and recurrent neural
                                  networks (RNNs), which are better suited for image processing and sequential time-series data respectively,
                                  shallow ANNs like FFBP and CFBP are computationally efficient, easier to train on smaller datasets, and
                                  highly effective for tabular experimental data with limited dimensionsas is the case in this study. Thus, ANN
                                  remains a relevant and practical modeling framework, particularly when explainability and fast convergence
                                  are important. AI has been extensively applied in civil engineering to predict material properties such as
                                  compressive strength19,20, crack propagation21–24, flexural and tensile strength25–28, shear capacity29,30, elastic
                                  modulus31, shrinkage32,33, and chloride diffusion34,35. Despite these advances, the direct application of ANNs to
                                  predict corrosion behavior in aggressive soil environments remains relatively limited36. However, recent studies
                                  have shown promise. Dong et al.37 applied ML to model steel corrosion embedded in soil, while Hosseinzadeh
                                  et al.38 predicted chloride resistance in concrete using AI. Song et al.39 explored interpretable ML for corrosion
                                  depth analysis, and Ji et al.40 applied recurrent neural networks for time-series corrosion forecasting. These
                                  developments reflect the growing relevance of AI-based models for durability assessments. While these
                                  contributions have improved mechanistic understanding, the integration of such physical models with data-
                                  driven approaches remains a significant research challenge41. Artificial intelligence, particularly ANNs, offers
                                  a paradigm shift by providing data-driven alternatives to conventional prediction models. These tools excel in
                                  scenarios with complex parameter interactions and incomplete mechanistic understanding. Furthermore, ANNs
                                  are adaptable, capable of being retrained or fine-tuned as new data becomes available, which makes them ideal
                                  for infrastructure monitoring and risk assessment frameworks. By demonstrating the feasibility and benefits of
                                  ANN-based corrosion modeling in clayey soils, the work contributes to a growing body of literature advocating
                                  for hybridized, intelligent infrastructure systems, (shown in Figure 1). To further explore this potential, the
                                  present study evaluates two ANN architectures, Feedforward Backpropagation (FFBP) and Cascadeforward
                                  Backpropagation (CFBP) for predicting corrosion rates of mild steel reinforcement in cementitious composites
                                  Fig. 1. Conceptual workflow showing corrosion progression of a buried reinforced concrete structure in clayey
                                  soil, key environmental inputs, and the artificial neural network (ANN) framework used for corrosion rate
                                  prediction.
                                  exposed to clay-rich soils. Input parameters include sodium chloride (NaCl) concentration, inhibitor dosage
                                  (DOI), and exposure duration (t). Literature sourced experimental data are used for model training and testing.
                                  Performance metrics such as mean absolute percentage error (MAPE) and coefficient of determination (R )
                                  are employed to evaluate model accuracy. Additionally, sensitivity analysis is conducted to identify the relative
                                  influence of each input parameter. The overarching goal is not only to improve predictive capability but also to
                                  provide actionable insights for material design and durability planning. By identifying the dominant factors that
                                  influence corrosion in clayey environments, this study supports more informed engineering decisions regarding
                                  the selection of inhibitors, exposure thresholds, and material formulations. These insights can directly contribute
                                  to extending the service life of infrastructure and reducing lifecycle maintenance costs. Beyond its technical
                                  scope, this work addresses a broader research need: bridging the gap between classical mechanistic models and
                                  emerging data-driven tools. As the volume of field and lab generated durability data grows, the integration of AI
                                  into predictive modeling will become increasingly essential. In this context, corrosion prediction represents a
                                  frontier where data, materials science, and machine learning converge. This study represents a step toward that
                                  integration. By demonstrating the feasibility and benefits of ANN-based corrosion modeling in clayey soils, the
                                  work contributes to a growing body of literature advocating for hybridized, intelligent infrastructure systems.
                                  As we move toward smarter cities and more resilient construction practices, embedding predictive intelligence
                                  within materials research is not just an advantage, it is a necessity.
                                  Methods
                                  Formulation of neural network model and data
                                  In order to map the relationship related to the rate of corrosion, an input-output scheme was used. From review
                                  of literature, it was concluded that the rate of corrosion depends upon: (i) Salinity level (NaCl), (ii) Dose of
                                  inhibitor (DOI), (iii) Exposure duration (t). The model thus takes the input in the form of causative factors
                                  namely NaCl, DOI and t and yields the output as Corrosion rate (CR).
                                                                            CR = f (NaCl, DOI, t)(1)
                                  The input and output variables involved in the present ANN model were first normalized within the range of 0
                                  to 1 as follows:
                                                                                      X − Xmin
                                                                             XN =                (2)
                                                                                     Xmax − Xmin
                                  where XN is the normalized value of X, and Xmax and Xmin are the maximum and minimum values of each
                                  variable. This normalization allowed the network to be trained more effectively.
                                      The dataset used in this study to train and validate the ANN models was derived from the experimental work
                                  of Akhtar et al.42, which investigated the corrosion behavior of reinforced concrete samples under controlled
                                  laboratory conditions. The corrosion rate data were generated by systematically varying three primary input
                                  parameters: sodium chloride (NaCl) concentration, corrosion inhibitor dosage (DOI), and exposure duration
                                  (t). These variables were selected due to their well-established influence on electrochemical corrosion processes
                                  affecting steel embedded in cementitious environments. The dataset comprises 72 distinct experimental
                                  observations, encompassing a representative range of conditions: NaCl concentrations from 0% to 4%, inhibitor
                                  dosages from 0% to 5%, and exposure durations between 30 and 180 days. The corrosion rate (CR) values,
                                  reported in mils per year (mpy), were derived from gravimetric mass loss testing as described in Akhtar et al.42.
                                  To prepare the data for model training, all input and output variables were normalized to a [0, 1] scale using min-
                                  max normalization to ensure uniform scaling, reduce bias due to magnitude differences, and improve training
                                  stability. Subsequently, the dataset was randomly partitioned into training, validation, and testing subsets using
                                  an 80:10:10 split. The training set (80%) was used to derive the model, while the remaining 20% of the data,
                                  unseen during training was reserved for validation and testing to ensure unbiased performance evaluation.
                                      In the present work, different types of networks were considered and trained using a back-propagation
                                  algorithm. The resulting neural network models are referred to as Feedforward Backpropagation (FFBP) and
                                  Cascadeforward Backpropagation (CFBP). In this study, ANN models with a single hidden layer were developed.
                                  Identifying the number of neurons in the input and output layers is straightforward, as it is determined by the
                                  input and output variables considered in the physical process model. However, determining the optimal number
                                  of hidden layer nodes required a trial-and-error approach to identify the best network configuration. The optimal
                                  architecture was determined by varying the number of hidden neurons, aiming to minimize the difference
                                  between the predicted values from the neural network model and the desired output. Generally, as the number of
                                  neurons in the hidden layer increases, the network’s prediction capability improves initially and then stabilizes.
                                  For training, a gradient descent algorithm was employed, with the number of training epochs set to 1000. The
                                  performance of all neural network model configurations was evaluated based on the coefficient of correlation (R)
                                  between the predicted values and the desired output, Mean Absolute Percentage Error (MAPE), and Root Mean
                                  Square Error (RMSE). The training was stopped when either an acceptable level of error was achieved or when
                                  the maximum number of iterations was reached. The neural network model configuration that minimized MAPE
                                  and optimized R was selected and the entire analysis was repeated several times. The architecture of the present
                                  ANN model is shown in Figure 2. The dataset used in this study, shown in Table 142, highlights the progressive
                                  increase in corrosion rate (CR) over time under conditions with no sodium chloride (NaCl) or inhibitor dosage
                                  (DOI). This data provides critical insights into the time-dependent behavior of corrosion, forming the basis for
                                  further analysis and modeling. The determination of the network configuration, including the optimal number
                                  of hidden nodes, was a critical step in this study. While the number of input and output nodes (I = 3 and
                                  O = 1, respectively) was straightforward, as dictated by the causative factors and output variable, optimizing the
                                  number of hidden layer neurons required an iterative trial-and-error process. Various configurations were tested
                                  to achieve a balance between minimizing error metrics–such as Mean Absolute Percentage Error (MAPE) and
                                  Root Mean Square Error (RMSE)and maximizing the coefficient of correlation (R). The optimal configurations
                                  identified for the Feedforward Backpropagation (FFBP) and Cascadeforward Backpropagation (CFBP) models
                                  were 20 and 22 hidden neurons, respectively, as summarized in Table 2. Additionally, the learning rate and
                                  momentum factor were set at 0.5 and 0.7, respectively, to ensure stable and efficient convergence during training.
                                  Training was stopped when the error reached a predefined threshold of 0.0001 or when 1000 epochs were
                                  completed. This systematic optimization process ensured the development of robust and accurate ANN models
                                  for corrosion prediction. All ANN models, including FFBP and CFBP architectures, were implemented using the
                                  Neural Network Toolbox in MATLAB (MathWorks, Natick, MA, USA)43.
                                                           Network
                                  Model       Algorithm    Configuration       Learning Rate    Momentum Function
                                                           I         H    O
                                              FFBP                             0.5              0.7
                                  Model-M                  3         20   1
                                              CFBP         3         22   1    0.5              0.7
                                  Table 2. Network Architecture of the Present ANN Model. I , H , and O indicate the number of input, hidden,
                                  and output nodes, respectively
                                  Here, Oi and Pi are the observed and predicted values, while O and P are their respective means.
                                    2. Nash-Sutcliffe Efficiency Coefficient (E)
                                    The Nash-Sutcliffe efficiency coefficient (E ) assesses the predictive power of models. It is defined as:
                                                                                              ∑n
                                                                                                   (Oi − Pi )2
                                                                                      E = 1 − ∑i=1             (4)
                                                                                                            (Oi − O)
                                                                                                             2
                                                                                               n
                                                                                                      i=1
                                     E = 1 indicates a perfect match, E = 0 suggests the model is as accurate as the mean of the observed data,
                                  and E < 0 reflects unacceptable performance.
                                     3. Root Mean Squared Error (RMSE)
                                     RMSE is a measure of the difference between values predicted by a model and those observed. It is given by:
                                                                                     √ ∑n
                                                                                                    (Oi − Pi )2 (5)
                                                                        RM SE =               i=1
                                                                                                     n
                                  Fig. 3. (a-h) Variation of performance parameters with number of hidden neurons in the CFBP model.
                                  “Training,” “Validation,” and “Testing” correspond to the respective subsets of the 80:10:10 data split. “All”
                                  represents the model’s performance across the full dataset, included for visualization purposes.
                                  better generalization across all subsets. Although the FFBP model demonstrated better results (Table 4), the
                                  connection weights and biases for the CFBP architecture are also provided in Table 5 for completeness and
                                  potential reference in future comparative studies.
                                     The performance of the developed ANN models in this study demonstrates a notable improvement over
                                  existing models reported in the literature. Few previous studies have achieved correlation coefficients R ranging
                                  Fig. 4. (a-h) Variation of performance parameters with number of hidden neurons in the FFBP model.
                                  “Training,” “Validation,” and “Testing” correspond to the respective subsets of the 80:10:10 data split. “All”
                                  represents the model’s performance across the full dataset, included for visualization purposes.
                                  from 0.55 to 0.61 with mean absolute percentage errors MAPE between 39% and 53% when predicting corrosion
                                  depths in steel and zinc under atmospheric conditions46. Another study focusing on corrosion current density
                                  prediction in reinforced concrete by Nikoo et al.47 employed a self-organizing feature map (SOFM) optimized
                                  with a genetic algorithm to predict corrosion current density in reinforced concrete, achieving an R of 0.978 and
                                  RMSE of 0.02 during the testing phase. In contrast, our ANN model achieved an R of 0.999 with significantly
                                  lower RMSE and MAPE values, indicating superior predictive accuracy. Moreover, while prior models often
                                  utilized diverse and less controlled datasets, our approach benefits from a well-structured and consistent dataset
                                  derived from controlled laboratory experiments, enhancing the model’s reliability. To our knowledge, this study
                                  is among the first to apply ANN techniques specifically to predict corrosion rates in clay-rich geotechnical
                                  environments, addressing a niche yet critical area in infrastructure durability assessment.
                                      The FFBP model achieved the highest coefficient of correlation (R) and the lowest MAPE and RMSE values.
                                  During training, the FFBP model recorded R = 0.999, MAPE = 37.12%, and RMSE = 0.037. During testing, it
                                  achieved R= 0.998, MAPE = 30.43%, and RMSE = 0.071. While the MAPE appears higher than RMSE or R,
                                  this is largely due to the wide range of corrosion rates in the dataset (0.014 3.60 mpy). As a percentage-based
                                  metric, MAPE is sensitive to small actual values, where even minor absolute errors can yield disproportionately
                                  large percentage deviations. This limitation is common in skewed datasets; therefore, MAPE is interpreted in
                                  conjunction with RMSE and R, which together confirm the model’s strong predictive performance. Overall, all
                                  models exhibited small MAPE, RMSE, and SI values during training, indicating reliable performance. Slightly
                                  higher error values were observed during validation, reflecting the inherent variability in unseen data. Despite
                                  this, the ANN models maintained consistently high correlation during testing, underscoring their robustness.
                                  To summarize, the network configuration of the FFBP model, along with the corresponding weights and
                                  bias matrices provided in Table 4, is recommended for general use in predicting corrosion rates. Its superior
                                  performance across all error metrics establishes it as the reliable model for practical applications.
                                  Sensitivity analysis
                                  Sensitivity tests were conducted to ascertain the relative significance of each of the independent parameter (input
                                  neurons) on the corrosion rate given by equation 1. In the sensitivity analysis, each input neuron was in turn
                                  eliminated from the model, and its influence on the prediction of corrosion rate was evaluated in terms of the
                                  R2 , MAPE, RMSE, E, and SI criteria. The network architecture of the problem as shown in Table 2 considered
                                  in the present sensitivity analysis consist of a hidden layer with 20 neurons in layer for FFBP and 22 neurons
                                  in CFBP respectively. Comparison of different neural networks models with one of the independent parameter
                                  removed in each case is presented in Tables 6 and 7.
                                      The results shown in Table 6 and Heat Map (Figure 7) presents that, for the most suitable model (FFBP), the
                                  inhibitor dosage (DOI) is the most significant input parameter for predicting the corrosion rate. This conclusion
                                  is supported by the sharp drop in the correlation coefficient (R) from 0.996 to 0.246 and the corresponding
                                  increase in MAPE from 27.88% to 552.8% when DOI is excluded from the model. This further indicates that the
                                  presence of corrosion inhibitor plays a dominant role in governing electrochemical behavior under the tested
                                  environmental conditions. The variables in order of decreasing level of sensitivity for FFBP model are: DOI,
                                  NaCl, t. These findings are consistent with recent literature on the relative importance of NaCl concentration,
                                  inhibitor dosage, and exposure time in influencing steel corrosion42. Sodium chloride promotes chloride-
                                  induced depassivation of steel and accelerates corrosion initiation48. Inhibitor dosage has been widely reported
                                  as a key controlling factor in mitigating corrosion through passive layer stabilization49. Exposure time governs
                                  the extent of electrochemical interaction and progressive deterioration under sustained conditions50.
                                      The sensitivity analysis of the FFBP and CFBP models suggests that ’t’ have only a marginal influence on the
                                  resulting corrosion rate compared to the other parameters. However, considering the limitation and uncertainties
                                  in the data, a full-fledged network involving all input variables would be desirable. In view of the variability in
                                  the outcome resulting from application of different analytical schemes (ANNs models), it is felt that the network
                                  that requires all input quantities may be followed for generality.
                                  Practical applications
                                  The proposed ANN models provide a robust computational tool for predicting corrosion rates in reinforced
                                  cementitious composites, with significant implications for real-world applications. By leveraging the ability of
                                  artificial neural networks to simulate complex interactions, these models can address several critical challenges
                                  in the design, maintenance, and sustainability of civil infrastructure.
                                  1. Design Optimization The models can guide the design of durable concrete mixtures by optimizing the
                                     dosage of corrosion inhibitors based on site-specific environmental conditions, such as chloride exposure
                                       levels. This capability ensures that the design specifications are tailored to minimize corrosion risks, thereby
                                       enhancing the longevity of structures.
                                  2.   Infrastructure Maintenance Predictions generated by the models can inform infrastructure maintenance
                                       schedules, enabling proactive and timely interventions. For example, bridges, dams, and culverts exposed
                                       to saline environments are particularly vulnerable to corrosion. The ANN models can help prioritize these
                                       structures for inspection and remediation, reducing the likelihood of structural failures.
                                  3.   Cost Reduction Early predictions of high corrosion rates in specific environmental conditions allow for
                                       better material selection and preemptive adjustments during the design phase. This reduces long-term main-
                                       tenance costs and the need for expensive retrofitting. Moreover, optimizing inhibitor dosage ensures efficient
                                       use of materials, minimizing waste and associated costs.
                                  4.   Policy and Planning Civil engineering firms and regulatory bodies can use the model’s outputs to establish
                                       guidelines and policies for chloride content limits, minimum inhibitor dosages, and maintenance standards.
                                       Such data-driven policies can improve overall infrastructure resilience while promoting sustainable develop-
                                       ment.
                                  5.   Emergency Retrofitting In critical situations where rapid corrosion poses an immediate risk, the models can
                                       identify high-priority structures requiring urgent retrofitting. This capability supports emergency response
                                       planning and helps prevent catastrophic failures by ensuring that resources are allocated effectively.
                                  6.   Sustainability and Environmental Impact By facilitating the design of corrosion-resistant materials and
                                       reducing unnecessary overdesign, the models contribute to more sustainable construction practices. This
                                       aligns with global goals to reduce the environmental footprint of infrastructure projects.
                                  Future work may focus on incorporating additional variables such as pH, temperature, and moisture content to
                                  enhance prediction accuracy. Coupling the ANN framework with geospatial data could support corrosion risk
                                  mapping, while integrating probabilistic models would help quantify uncertainty. Expanding the dataset with
                                  multi-source experimental data may also enable more generalizable and hybrid AI modeling approaches.
                                  Conclusions
                                  This study demonstrates the potential of artificial neural networks (ANNs) as effective computational tools for
                                  predicting corrosion rates in reinforced cementitious composites subjected to clay-dominated soil environments.
                                  By leveraging their flexibility and capacity to capture nonlinear relationships, ANN models offer a robust
                                  framework for assessing durability-related risks.
                                  1. Effectiveness of ANNs: The ANN models successfully predicted corrosion rates ranging from 0.014 to
                                     3.60 mpy using three key input variables: sodium chloride concentration (0-4%), inhibitor dosage (0-5%),
                                  Fig. 7. Refined sensitivity analysis heatmap of the FFBP model showing the effect of excluding input variables
                                  on predictive accuracy. The inhibitor dosage (DOI) is identified as the most critical input.
                                       and exposure time (30-180 days). The best-performing model, FFBP, achieved a testing correlation coeffi-
                                       cient of R = 0.998, MAPE = 30.43%, and RMSE = 0.071, outperforming traditional regression-based ap-
                                       proaches.
                                  2.   Development of a Generalized Model: A generalized model for predicting corrosion rates using ANNs
                                       has been successfully developed, demonstrating low prediction errors and high correlation across all data
                                       subsets.
                                  3.   Recommended Network Configuration: The Feedforward Backpropagation (FFBP) network exhibited
                                       consistently better performance compared to the Cascadeforward Backpropagation (CFBP) model, which
                                       showed signs of overfitting during testing. The FFBP architecture is thus recommended for practical imple-
                                       mentation.
                                  4.   Sensitivity Analysis Insights: Sensitivity analysis revealed that inhibitor dosage (DOI) is the most influen-
                                       tial parameter, as removing it caused R to drop from 0.9985 to 0.7806. Sodium chloride concentration also
                                       significantly impacted model accuracy, while exposure time had a lesser but still notable effect. These results
                                       underscore the importance of including all input parameters to maintain model robustness.
                                  5.   Dataset Scope and Limitations: While the model demonstrated strong predictive performance, it was
                                       trained on a relatively small and uniform dataset (72 samples) from a single study. Future work should incor-
                                       porate larger and more diverse datasets across varied environmental conditions to improve model scalability
                                       and field applicability.
                                  Data availability
                                  The datasets generated and/or analyzed during the current study are available from the corresponding author
                                  upon reasonable request.
                                  References
                                   1. Han, B., Ding, S. & Yu, X. Intrinsic self-sensing concrete and structures: A review. Measurement 59, 110–128. https://do             i.org/10.1 
                                      016/j.meas urement.20 14.09.048 (2015).
                                   2. Schlangen, E. Foreword. In Pacheco-Torgal, F. et al. (eds.) Eco-Efficient Repair and Rehabilitation of Concrete Infrastructures,
                                      Woodhead Publishing Series in Civil and Structural Engineering, xvii, https://doi.org/10.1016/B978-0-08-102181-1.00030-7
                                      (Woodhead Publishing, 2018).
                                   3. Dhandapani, Y., Neithalath, N. & Thomas, C. Performance of calcined clay concrete under chloride exposure: A review of durability
                                      and transport properties. Materials and Structures 57, 1–22. https://doi.org/10.1617/s11527-024-02426-7 (2024).
                                   4. Ali, S., Qasrawi, H., Khan, M. I. & Abbas, S. Chloride-induced corrosion in reinforced concrete: Mechanisms, modelling and
                                      mitigation strategies. RSC Advances 14, 30112–30137. https://doi.org/10.1039/D4RA05506C (2024).
                                   5. Kroviakov, I., Kolesnikov, A. & Nikolaev, A. Durability of fiber-reinforced concrete in acidic soil environments. Concrete and
                                      Construction Materials 5, 6. https://doi.org/10.3390/constrmater5010006 (2025).
                                   6. Bastidas, D. M. & Bastidas, J. M. Editorial: Corrosion and durability of reinforced concrete structures. Frontiers in Materials 7, 170.
                                      https://doi.org/10.3389/fmats.2020.00170 (2020).
                                   7. Ahmad, S., Rizvi, Z. & Wuttke, F. Unveiling soil thermal behavior under ultra-high voltage power cable operations. Scientific
                                      Reports 15, 7315. https://doi.org/10.1038/s41598-025-91831-1 (2025).
                                   8. Ahmad, S., Rizvi, Z., Khan, M. A., Ahmad, J. & Wuttke, F. Experimental study of thermal performance of the backfill material
                                      around underground power cable under steady and cyclic thermal loading. Materials Today: Proceedings 17, 85–95. https://do                        i.or
                                      g/10.1 016/j.matp
                                                                     r.2019.06. 404 (2019).
                                   9. Ahmad, S. et al. Evolution of temperature field around underground power cable for static and cyclic heating. Energies 14, 8191.
                                      https://doi.org/10.3390/en14238191 (2021).
                                  10. Li, Z. et al. Machine learning in concrete science: applications, challenges, and best practices. npj Computational Materials 8, 127.
                                      https://doi.org/10.1038/s41524-022-00810-x (2022).
                                  11. Alkam, M. K. & Alqam, M. Prediction of the service life of a reinforced concrete column under chloride environment. Advances in
                                      Materials Science and Engineering 2015 (2015).
                                  12. Lin, G., Liu, Y. & Xiang, Z. Numerical modeling for predicting service life of reinforced concrete structures exposed to chloride
                                      environments. Cement and Concrete Composites 32 (2010).
                                  13. Ahmad, S. Reinforcement corrosion in concrete structures, its monitoring and service life prediction–a review. Cement and
                                      Concrete Composites 25 (2002).
                                  14. Bazant, Z. P. Physical model for steel corrosion in concrete sea structures–application. ASCE Journal of Structural Division 105
                                      (1979).
                                  15. Morinaga, S. Prediction of service lives of reinforced concrete buildings based on the corrosion rate of reinforcing steel. In
                                      Proceedings of Building Materials and Components, 1155–1166 (Brighton, UK, 1990).
                                  16. Wang, X. M. & Zhao, H. Y. The residual service life prediction of r.c. structures. In et al., S. N. (ed.) Durability of building materials
                                      and components 6, 1107–1114 (E and FN Spon, 1993).
                                  17. Dagher, H. J. & Kulendran, S. Finite element model of corrosion damage in concrete structures. ACI Structural Journal 89, 699–708
                                      (1992).
                                  18. Farooq, F. & Hasan, M. Geotechnical behaviour and utilisation of sustainable waste with expensive soil in rural roads as subgrade
                                      material. Indian Geotechnical Journal https://doi.org/10.1007/s40098-024-00934-5 (2024).
                                  19. Yeh, I. C. Modeling of strength of high-performance concrete using artificial neural networks. Cement and Concrete Research 28,
                                      1797–1808 (1998).
                                  20. Ouyang, B. et al. Predicting concrete s strength by machine learning: Balance between accuracy and complexity of algorithms. ACI
                                      Materials Journal 117, 125–134 (2020).
                                  21. Bhamare, D. K., Saikia, P., Rathod, M. K., Rakshit, D. & Banerjee, J. A machine learning and deep learning based approach to
                                      predict the thermal performance of phase change material integrated building envelope. Building and Environment 199, 107927
                                      (2021).
                                  22. Moreh, F. et al. Deep neural networks for crack detection inside structures. Scientific Reports 14, 4439. https://doi .org/10.10 38/s41
                                      598- 024-54494-y (2024).
                                  23. Moreh, F., Hasan, Y., Rizvi, Z. H., Wuttke, F. & Tomforde, S. Wave-based neural network with attention mechanism for damage
                                      localization in materials. In 2024 International Conference on Machine Learning and Applications (ICMLA), 122–129, https://d                      oi.o
                                      rg/10. 1109/ICMLA     61862.2024 .00023 (Miami, FL, USA, 2024).
                                  24. Moreh, F., Hasan, Y., Rizvi, Z. H., Wuttke, F. & Tomforde, S. Mcmn deep learning model for precise microcrack detection in various
                                      materials. In 2024 International Conference on Machine Learning and Applications (ICMLA), 1928–1933, https:/ /doi.org/1 0.1109/I
                                      CMLA61862.20 24.00297 (Miami, FL, USA, 2024).
                                  25. Xu, J. et al. Parametric sensitivity analysis and modelling of mechanical properties of normal- and high-strength recycled aggregate
                                      concrete using grey theory, multiple nonlinear regression and artificial neural networks. Construction and Building Materials 211,
                                      479–491 (2019).
                                  26. Nasr, D., Behforouz, B., Borujeni, P. R., Borujeni, S. A. & Zehtab, B. Effect of nanosilica on mechanical properties and durability
                                      of self-compacting mortar containing natural zeolite: Experimental investigations and artificial neural network modeling.
                                      Construction and Building Materials 229, 116888 (2019).
                                  27. Nguyen, H., Vu, T., Vo, T. P. & Thai, H. T. Efficient machine learning models for prediction of concrete strengths. Construction and
                                      Building Materials 266, 120950 (2021).
                                  28. Hossain, K. M., Anwar, M. S. & Samani, S. G. Regression and artificial neural network models for strength properties of engineered
                                      cementitious composites. Neural Computing and Applications 29, 631–645 (2018).
                                  29. Adhikary, B. B. & Mutsuyoshi, H. Artificial neural networks for the prediction of shear capacity of steel plate strengthened rc
                                      beams. Construction and Building Materials 18, 409–417 (2004).
                                  30. Elsanadedy, H. M., Abbas, H., Al-Salloum, Y. A. & Almusallam, T. H. Shear strength prediction of hsc slender beams without web
                                      reinforcement. Materials and Structures 49, 3749–3772 (2016).
                                  31. Xu, J. et al. Prediction of triaxial behavior of recycled aggregate concrete using multivariable regression and artificial neural
                                      network techniques. Construction and Building Materials 226, 534–554 (2019).
                                  32. Nehdi, M. L. & Soliman, A. M. Artificial intelligence model for early-age autogenous shrinkage of concrete. ACI Materials Journal
                                      109, 353–362 (2012).
                                  33. Mermerdaş, K. & Arbili, M. M. Explicit formulation of drying and autogenous shrinkage of concretes with binary and ternary
                                      blends of silica fume and fly ash. Construction and Building Materials 94, 371–379 (2015).
                                  34. Güneyisi, E., Gesoğlu, M., Özturan, T. & Özbay, E. Estimation of chloride permeability of concretes by empirical modeling:
                                      Considering effects of cement type, curing condition and age. Construction and Building Materials 23, 469–481 (2009).
                                  35. Asghshahr, M. S., Rahai, A. & Ashrafi, H. Prediction of chloride content in concrete using ann and cart. Magazine of Concrete
                                      Research 68, 1085–1098 (2016).
                                  36. Xu, Y. & Jin, R. Measurement of reinforcement corrosion in concrete adopting ultrasonic tests and artificial neural network.
                                      Construction and Building Materials 177, 125–133 (2018).
                                  37. Dong, Z. et al. Machine learning-based corrosion rate prediction of steel embedded in soil. Scientific Reports 14, 18194. https://do                            
                                      i.org/10.1 038/s41598 -024-68562 -w (2024).
                                  38. Hosseinzadeh, M. et al. An efficient machine learning approach for predicting concrete chloride resistance using a comprehensive
                                      dataset. Scientific Reports 13, 15024. https://doi.org/10.1038/s41598-023-42270-3 (2023).
                                  39. Song, Y. et al. Interpretable machine learning for maximum corrosion depth and influence factor analysis. npj Materials Degradation
                                      7, 9. https://doi.org/10.1038/s41529-023-00324-x (2023).
                                  40. Ji, H., Liu, J. C. & Ye, H. Time-series prediction of steel corrosion in concrete using recurrent neural networks. Nondestructive
                                      Testing and Evaluation 1–20, https://doi.org/10.1080/10589759.2025.2456668 (2025).
                                  41. Rizvi, Z. et al. Dynamic lattice element modelling of cemented geomaterials. In Prashant, A., Sachan, A. & Desai, C. (eds.) Advances
                                      in Computer Methods and Geomechanics, vol. 55 of Lecture Notes in Civil Engineering, https://d
                                                                                                                                            oi.org/10. 1007/978-9 81-15-0886 -8
                                      _53 (Springer, Singapore, 2020).
                                  42. Akhtar, S., Quraishi, M. A. & Arif, M. Use of chemical corrosion inhibitors for protection of metallic fibre reinforcement in
                                      ferrocement composites. Arabian Journal for Science and Engineering 34, 107–113 (2009).
                                  43. The MathWorks Inc. MATLAB version: 9.15.0 (R2023b). The MathWorks Inc., Natick, Massachusetts (2023).
                                  44. Ohazuruike, L. & Lee, K. J. A comprehensive review on clay swelling and illitization of smectite in natural subsurface formations
                                      and engineered barrier systems. Nuclear Engineering and Technology 55, 1495–1506. https://doi.org/10.1016/j.net.2023.01.007
                                      (2023).
                                  45. Kijjanon, A., Sumranwanich, T. & Tangtermsirikul, S. Steel corrosion and critical chloride content in concrete with calcined
                                      kaolinite clay, fly ash, and limestone powder under natural and accelerated marine environments. Structures 62, 106211. https://d
                                      oi.org/10.1 016/j.istr uc.2024.10 6211 (2024).
                                  46. Coelho, L. B. et al. Reviewing machine learning of corrosion prediction in a data-oriented perspective.. npj Materials Degradation
                                      6, 8. https://doi.org/10.1038/s41529-022-00218-4 (2022).
                                  47. Nikoo, M., Sadowski, L. & Nikoo, M. Prediction of the corrosion current density in reinforced concrete using a self-organizing
                                      feature map. Coatings 7, 160. https://doi.org/10.3390/coatings7100160 (2017).
                                  48. Chohan, I. M. et al. Effect of seawater salinity, ph, and temperature on external corrosion behavior and microhardness of offshore
                                      oil and gas pipeline: Rsm modelling and optimization. Scientific Reports 14, 16543. https://doi.org/10.1038/s41598-024-67463-2
                                      (2024).
                                  49. Chen, X., Wang, Z., Yu, S. & Li, G. Corrosion inhibition of carbon steel in nacl solution using a mixture of alkanol amine and
                                      calcium nitrite: Electrochemical and microscopic evaluation. International Journal of Electrochemical Science 19, 100802. https:// d
                                      oi.org/10 .1016/j.ij oes.2024.1 00802 (2024).
                                  50. Vera, R. et al. Atmospheric corrosion and impact toughness of steels: Case study in steels with and without galvanizing, exposed
                                      for 3 years in rapa nui island. Heliyon 9, e17811. https://doi.org/10.1016/j.heliyon.2023.e17811 (2023).
                                  Acknowledgements
                                  Shahbaz A. extends sincere gratitude to Kiel University DEAL-Konsortium for facilitating open-access publi-
                                  cation.
                                  Author contributions
                                  Conceptualization, S.A., M.A., and Sh.A.; methodology, S.A., M.A., F.A., and Sh.A.; software, M.A. and Sh.A.;
                                  experimental investigation, S.A., and Si.A.; resources, S.A. and M.A.; data curation, M.A., F.A., Si.A., and Sh.A.;
                                  writing original draft preparation, M.A., and Sh.A.; writing-review and editing, M.A., F.A. and Sh.A.; visuali-
                                  zation, M.A., and Sh.A.; supervision, S.A., M.A.; project administration, S.A., M.A.; funding acquisition, S.A.,
                                  M.A. All authors read and approved the final manuscript.
                                  Funding
                                  Open Access funding enabled and organized by Projekt DEAL.
Declarations
                                  Competing interests
                                  The authors declare no competing interests.
                                  Additional information
                                  Correspondence and requests for materials should be addressed to S.A.
                                  Reprints and permissions information is available at www.nature.com/reprints.
                                  Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and
                                  institutional affiliations.
                                  Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which
                                  permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give
                                  appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and
                                  indicate if changes were made. The images or other third party material in this article are included in the article’s
                                  Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included
                                  in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or
                                  exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy
                                  of this licence, visit http://creativecommons.org/licenses/by/4.0/.