Soil Water Content Modeling using Artificial Neural Network in Arid Soils of Algeria

A
Abdelhafid Youcef1,2
R
Rechachi Miled Zohra1,*
C
Chebbah Mohamed3
M
Menasria Hanane1
1Scientific and Technical Research Center on Arid Regions (CRSTRA), BP 1682 RP, 07000, Biskra, Algeria.
2University Mohamed Khider of Biskra, Algeria.
3University Abdelhafid Boussouf of Mila, Algeria.

Background: In order to model ecological and hydrological processes and schedule irrigation as efficiently as possible, soil water contents at field capacity (FC) and permanent wilting point (PWP) across a landscape are crucial. However, it takes a lot of time and money to measure them directly. An alternative is offered by pedotransfer functions (PTFs), which forecast them based on readily available or simpler soil variables.

Methods: The purpose of this study was to compare the performance of FC and PWP models, developed with artificial neural networks (ANN) to multiple linear regression (MLR). A total of 120 soil samples from various Biskra province locations were used. The developed models are evaluated using the statistical criteria of mean error (ME), coefficient of determination (R2) and root mean square error (RSME).

Result: The results show that the ANN performs better than the traditional MLR. The obtened results were very positive and indicated that the ANN approach held promise for forecasting soil water contents in our area.

Modeling ecological and hydrological processes and scheduling irrigations optimally require consideration of the soil’s water capacity or bioavailable water (Dobarco et al., 2018). However, it is frequently noted that water supplies are determined empirically and without consideration of this data, especially in arid and semi-arid regions. This, leads to environmental damage pollution such as rising surface water tables, nutrient leaching and a loss of water reserves. In order to effectively use the limited water resources that are available, soil water contents at field capacity (FC) and permanent wilting point (PWP) are essential hydraulic characteristics (Santra et al., 2018). However, due to the time-consuming nature of direct measurements and the impossibility of applying them to large areas, these properties are typically only known for a small number of soils (Haghverdi et al., 2012; McNeill et al., 2018). In order to get around these challenges, pedotransfer functions (PTFs) are an alternate method for estimating them from readily available or more readily measurably available soil data, such as bulkdensity, organic matter content and soil texture (Cosby et al., 1984; Saxton et al., 1986). Although there are no truly universal PTFs, the study area’s proximity and the parent materials’ similarity are crucial factors in determining the potential of PTFs (Morvan et al., 2004). Therefore, it is important to carefully evaluate the validity of any given PTF before extrapolating it outside of its geographic training area. Nonetheless, the majority of studies on pedotransfer functions (PTFs) have been produced in humid climates (Julià-Ferrer et al., 2004). Despite the existence of several studies that are interested in the pedotransfer’s function, such as those by Patil and Rajput (2014) and Mousavi (2015), in order to estimate and predict the amount of water retained in the soil The prediction of soil water retention properties has received relatively little attention, despite the fact that water availability is one of the primary factors limiting agricultural production in arides and semi-arides regions (Wösten et al., 2001; Khlosi et al., 2016; El Majou et al., 2016; Ghorbani et al., 2017; Santra et al., 2018). There has been relatively little published research on soil water properties for Algerian soils (Dridi and Dilimi, 2011 ; Dridi and Zemmouri, 2012). PTFs that describe soil water properties over large areas in Algeria’s Bas Sahara are in high demand for creating sustainable farming systems, but their availability is limited. Artificial neural networks (ANNs) and other machine learning algorithms have recently shown greater accuracy than other approaches in predicting the water content at FC and PWP (Morvan et al., 2004; Merdum et al., 2006). The main benefit of employing ANN techniques over traditional approaches is that they are completely nonparametric, meaning they do not consider the relationships between the input and output data (Santra et al., 2018). The goal of the current study was to predict soil water content at FC and PWP using artificial neural networks (ANN) and compare their performance to multiple linear regression (MLR) for various soil types in the region using relatively limited data on water content at FC and PWP.
Study area and data collection
 
As illustrated in Fig 1, Biskra province (eastern Algeria) is situated at the base of the saharan Atlas and covers about 21 509.80 km2. It is renowned for its phoenicicol heritage, estimated at 5 million of date palms and 98.478 ha of irrigated land (ANAT, 2003). This agro-ecological zone belongs to the Saharan bioclimatic stage. According to Word Reference Base classification (IUSS Working Group WRB, 2014), the main soil types in the province of Biskra are Fluvisols (50%), Calcisols and Gypsisols (37.8%) and solonshaks (13.1%) (ANAT, 2003). In all, 120 soil samples were collected from various locations throughout this region. All collected soil samples were analysed in the laboratory. The particle-size distribution (clay, silt and sand contents) were determined by the Robinson’s pipette method, while the soil cylinder cores (100 cm3) method was used to determine BD. Organic matter content (OM%) was determined by the wet oxidation method of Walkley and Black method and soil water contents at FC (-33 kPa) and PWP (-1500 kPa) were determined by pressure plate apparatus method.

Fig 1: Location of the study area with soil profile locations.


 
Statistical modeling 
 
The R program was used to carry out the statistical modeling in this investigation. To estimate FC and PWP, five parameters clay, silt, sand, OM and BD were employed as predictors. Seventy percent of the data were in the training subset after the data were randomly divided into training and testing subsets prior to statistical analysis. PTFs were trained and tested using 84 and 36 samples, respectively. The test and training data were both normalized. Based on their relationship to soil water contents and an assessment of multicollinearity between variables using the variance inflation factor (VIF), the soil variables were selected to serve as arguments for the development of models (Dobarco et al., 2018).
       
The backward stepwise method was used to select the most important input variables for MLR modeling, followed by the linear, quadratic and potential interaction terms of soil variables (Merdum et al., 2006 ; Rab et al., 2011). An equation of the following form was used to develop the PTF.
 
Y = b0 + b1X1...+ b5X+ b6X6
 
Where,
Y= The dependent variable.
b0= The intercept.
b01..bn= Regression coefficients.
X1 - X5= Independent variables referring to basic soil propreties.
       
The feed-forward multilayer perception (MLP) model was used for ANN modeling in order to predict FC and PWP. Clay, silt, BD and OM are the four input soil variables that make up the input layer. The input, hidden and output layers make up this network (Fig 2). The performance of the calibration and validation data sets determines how many hidden neurons are used (Van Looy et al., 2017). To create the hidden layer, the network’s input layer is weighted, summed and biased. A similar process is used to process the hidden neurons’ output, which is then converted by another activation function to generate the output. Through an iterative process, the objective function is minimized to obtain the weights and biases in ANN. Any transfer function, including log-sigmoid, tan-sigmoid and linear, can be used by neurons to produce their output. Since the tan-sigmoid (Tanh) transfer function is the most widely used transfer function in soil science, it was utilized in our case to create the output layer (Mohanty et al., 2015). Several statistical indices, including the mean of prediction (ME), root-mean-square error (RMSE) and coefficient of determination (R2), were used to assess the model’s performance. The most popular metrics for assessing a model’s performance are these statistical indices (Van Looy et al., 2017).

The coefficient of determination (R2):

 
Root-mean-square error (RMSE):

 
Mean of prediction (ME):


Where,
Yi= The mesured value.
Ŷi= The estimated value.
i= The average of the measured value.
N= The number of samples.

Fig 2: ANN model used in study.

According to the Table 1, both data sets showed statistically similar features, for example, BD and sand, silt and clay contents were 1.058-1.940 g.cm-3, 02-68%, 02-94% and 03-85%, respectively for the training data set. FC and PWP ranged from 0.03 to 0.42 g.g-1 and from 0.016 to 0.36g.g-1, respectively. It is obvious that the wide range of physical characteristics in Biskra province is caused by the presence of various soil types.

Table 1: Statistic parameters of soil attributes for the training and test sets.


       
Fig 3 shows that, despite having opposite polarity, both FC and PWP have a reasonably strong and smooth relationship (r>0.70) with clay and sand contents. Thus, it might be the most accurate indicator of FC and PWP. The relationships between BD, silt and OM and FC and PWP are rather slender. Additionally, it seems that BD has a bad relationship with PWP and FC. As a result, these characteristics by themselves are unlikely to be reliable indicators of FC and PWP; however, when combined with clay and sand in a multiple prediction model, they might enhance model performance. The relationship between FC and PWP seems to be solid and seamless. Because of its collinearity with clay, the sand content was eliminated. In order to predict FC and PWP, the following independent variables were selected for modeling: BD, silt, clay and OM content.

Fig 3: Correlation coefficients between soils attributs.


       
According to the results displayed in Table 2, the model 2 created using clay, silt and BD seems to perform better than the others during the training data sets. These predictor variables explained 76% and 62% of variation for FC and PWP respectively. Additionally, when combined with silt, BD seems to enhance the model’s performance, however, OM only slightly enhances the regression. Overall, FC outperformed PWP in terms of evaluation indices (R2, RMSE and ME). For FC, the R2 and RMSE values were 0.76 and 0.095, while for PWP were 0.62 and 0.1161. Both models included the indipendent variables silt, bulk density, BD-silt, for FC and silt- BD and quadratic clay for PWP.

Table 2: Coeficients of variables used in MLR.


       
As indicated in Table 3, the model 2 for FC and PWP, respectively, had the lowest level of RMSE and the highest level of R2. In general, a lower RMSE and higher R2 value are statistical indicators of the model’s strong performance. In our investigation, the R2 and RMSE  values for the training data set, are 0.761 and 0.095, for FC, while the values for PWP are 0.639 and 0,114, respectively. For the testing data set, the R2 and RMSE values are 0.798 and 0.080 for FC, while for the PWP these values are 0.652 and 0.105, respectively.

Table 3: Statistic performances of the ANN model.


       
For forecasting soil water contents at FC and PWP, the particle size distribution (PSD), OM and BD were frequently employed as predictors (Chakraborty et al., 2011; Wösten et al., 2001). In the present study, compared to BD and OM, particle size distribution demonstrated a stronger correlation with soil water content. Consequently, using clay, silt and BD provide a good prediction. Our results were consistent with those of Ostavari et al. (2015) Ghorbani et al., (2017) and Li et al. (2019). In MLR modeling, clay, silt and BD are the variables that explain 76 and 62% of variation in FC and PWP respectivly. The observed correlations between clay, FC and PWP are consistent with those reported for other soil types Cosby et al., (1984) and Rab et al., (2011). In our case, OM is unlikely to have an impact on the FC and PWP predictions. This could be as a result of the clay or sand content variation being significantly greater than the OM content variation. As a result, any connection between FC and OM would have been concealed. Our findings are comparable to those obtened by Minasny and McBratney (2002); Khlosi et al., (2016) and Santra et al., (2018). Rab et al., (2011) noted that Australian soils have very little organic matter (OM) and did not include it in their PTFs.
       
From Table 4, it can be clearly apparent that the ANN produced low RMSE and high R2  values during the training phase when compared to the conventional method MLR. The RMSE values varied from 0.082 to 0.090 g.g-1 and from 0.082 to 0.090 g.g-1 for FC and PWP, respectively. These values were lower than those of the MLR models, which ranged from 0.038 to 0.044 g.g-1. The Rvalues of the ANN models ranged from 0.713 to 0.879 and 0.713 to 0.843, respectively, whereas the MLR models showed the lower values with 0.715-0.788. Nonetheless, the t-statistic (P<0.05) indicates that there was no significant difference in the two methods’ performance for either FC or PWP. Similarity, Skalová et al. (2011) found that ANN performed slighly better than MLR with limeted data.

Table 4: Goodness-of-fit for MLR and ANN models in predicting soil water contents at FC and PWP.


       
In contrast to MLR models, ANN models were able to generate accurate predictions during the testing phase (Table 4). The RMSE of ANN models varied from 0.002 to 0.048 and were smaller than those of the MLR models, witch ranged from 0.031 to 0.057 and from 0.035 to 0.040, respectively. The R2 values of the ANN model ranged from 0.739 to 0.879 indicating a more accurate prediction than MLR models, for wich the values ranged from 0.647 to 0.821 and 0.692 to 0.849, respectively.
       
Overall, Fig 4 displays the performance of every prediction model created during the testing phase in the research area. The best-fit model for predicting FC and PWP was determined to be an ANN model with clay, silt and BD input variables. For all models, evaluation indices performed better at FC than PWP. Haghverdi et al., (2012) demonstrated that the better performance of either of data mining methods could be related to the PTF type and database characteristics rather than to inherent supremacy of either of the data mining methods. According to the developed PTFs, it was observed that ANN models with silt, clay and BD as predictors and four neurons in hidden layer had better performance in predicting soil water content at FC and PWP.
FC = 0.146-1.131×H1-0.207×H2-1.14×H3+0.292×H4
PWP = 0.719+0.355×H1-0.584×H2+1.131×H3-0.889×H4
Where H1 = Tanh (0.5× (-0.254-1.747×Clay-0.554×Silt-0.174×BD), H2 = Tanh (0.5×(0.032-1.048×Clay-2.980×Silt-0.075×BD), H3=Tanh (0.5×(-0.656+1.349×Clay+0.136´Silt-1.016×BD), H4 = Tanh (0.5× (1.815+1.555×Clay+0.049×Silt-0.180×BD)
       
Where Tanh stands for hyperbolic tangent of any given number.

Fig 4: Scatter plots of measured and predicted FC and PWP using ANN and MLR models in testing.

This study examines the performance of using ANN and MLR for estimating soil water contents at FC and PWP in arid soils of Algeria. Relative data samples were used to create different models in order to accomplish this goal. The results indicate that texture is stronger correlation with FC and PWP. Organic matter improve a lower prediction. Clay, silt and BD explain 76 and 62% of the variation in FC and PWP respectively. Based on statistical criteria, the obtened results indicate that the overall performance of ANN model was better than of MLR, but this difference was statististically not significant. Also, it was observed that ANN with three input variables clay, silt, BD and four neurons in hadden layer had better performance in predicting of FC and PWP. Therfore, the results were highly encouraging and suggested that ANN methods were promissing in modeling of soil water contents.
The present study was supported by..CRSTRA ...Scientific and Technical Research Center on arid Regions (CRSTRA).
 
Disclaimers
 
The views and conclusions expressed in this article are solely those of the authors and do not necessarily represent the views of their affiliated institutions.
       
The authors are responsible for the accuracy and completeness of the information provided, but do not accept any liability for any direct or indirect losses resulting from the use of this content.
 
Informed consent
 
 All animal procedures for experiments were approved by the Committee of Experimental Animal care.
The authors declare that there are no conflicts of interest regarding the publication of this article. No funding or sponsorship influenced the design of the study, data collection, analysis, decision to publish, or preparation of the manuscript.

  1. ANAT (Agence National d’Aménagement du Territoire) (2003). Schéma directeur des ressources en eau de la wilaya de Biskra. Dossier Agro-pédologique, 175p.

  2. Chakraborty, D., Mazumdar, S.P., Garg, R.N., Banerjee, S., Santra, P., Singh, R and Tomar, R.K. (2011). Pedotransfer functions for predicting points on the moisture retention curve of Indian soils. Indian J. Agric. Sci. 81(11): 1030.

  3. Cosby, B.J., Homberger, G.M., Clapp, R.B., Ginn, T.R. (1984). A statistical exploration of the relationships of soil moisture characteristics to the physical properties of soils. Water Resour Res. 20(6): 682-690. 

  4. Dobarco, M.R., Cousin, I., Bas, C.L., Martin, M.P. (2018). Pedotransfer functions for predicting available water capacity in French soils, their applicability domain and associated uncertainty. Geoderma. 336: 81-95. https://doi.org/ 10.1016/j.geoderma.2018.08.022.

  5. Dridi, B., Dilmi, A. (2011). Poids des différentes caractéristiques des sols dans l’estimation de leur rétention en eau, Cas des sols d’Algérie. Etude et Gestion des sols. 18(4): 247-258.

  6. Dridi, B., Zemmouri, S. (2012). Fonction de pédotransfert pour les sols de la plaine de la Mitidja (Algérie): Recherche de paramètres les plus pertinents pour la rétention en eau. Biotechnol. Agron. Soc. Environ. 16(2): 193-201.

  7. El Madjou, H., Kaba, R., Almesber, W., Bruand, A. (2016). Validation de l’estimation des propriétes de rétention en eau de sols syriens à partir de fonctions et classes de pédotransfert développées pour des sols français. Étude et Gestion des Sols. 23: 112-123.

  8. Ghorbani, M.A., Shamshirband, S., haghi, D.Z., Azani ABonakdari, H., Ebtehaj, I. (2017). Application of firefly algorith-based support vector maxchines for prediction of fiel capacity and permanent wilting point. Soil and Tillage Research. 172: 32-38.  https://dx.doi.org/10.1016/j.still.2017.04.009.

  9. Haghverdi, A., Cornelis, W.M., Ghahraman, B. (2012). A pseudo- continuous neural network approch for developing water retention pedotransfer functions with limited data. Journal of Hydrology. 442-443: 46-54. https://doi.org/10.1016/ j.jhydrol.2012.03.036.

  10. IUSS Working Group W.R.B. (2014). Word Reference Base for Soil Ressources : International soil classification system for naming soil and creating legends for soil maps. Word soil Ressources Report No 106. FAO, Rome. 

  11. Julià-Ferrer, M., Monreal, TE. Sánchez del Corral Jiménez, A, García Meléndez, E. (2004). Constructing a saturated hydraulic conductivity map of Spain using pedotransfer functions and spatial prediction. Geoderma. 123:  257-277. doi: 10.1016/j.geoderma.2004.02.011.

  12. Khlosi, M., Alhamdoosh, M., Douaik, A., Gabriels, D. and Cornelis, W.M. (2016). Enhanced pedotransfer functions with support vector machines to predict water retention of calcareous soil. European Journal of Soil Science. 67(3): 276-284. https://doi.org/10.1111/ejss.12345. 

  13. Li, X.D., Shao, M.A., Zhao, C.L. (2019). Estimating the field capacity and permanent wilting point at the regional scale for the hexi corridor in China using a state-space modeling approach. J. Soils Sediments. 19: 3805-3816. https:// doi.org/10.1007/s11368-019-02314-6.

  14. McNeill, S.J., Lilburne, L.R., Carrick, S., Webb, T.H., Cuthill, Y. (2018). Pedotransferfunctions for the soil water characteristics of New Zealandsoils using S-map information. Geoderma. 326: 96-110.https://doi.org/10.1016/j.geoderma.2018. 04.011.

  15. Merdum, H., Çinar, Ö., Meral, R., Apan, M. (2006). Comparison of artificial neural network and regression pedotransfer function for prediction of soil water retention and saturated conductivity. Soil and Tillage Research. 90: 108-116. doi: https:// doi:10.1016/j.still.2005.08.011.

  16. Minasny, B., McBratney, A. (2002). The neuro-m method for fitting neural network parametric pedotransfer functions. Soil Science Society of America Journal. 66(2): 352-361.

  17. Mohanty, M., Nishant, K., Painuli, D.K.S., Bandyopadhay, K.K., Hati, K.M., Reddy, K.S., Chaudhary, R.S. (2015). Modelling soil water contents at field capacity and permanent wilting point using artificial neural network for Indian soils. Nati. Acad. Sci. Lett. 38(5). doi: 10.1007/s40009-015-0358-4. 

  18. Morvan, X., Bruand, A., Cousin, I., Roque, J., Baran, N., Mouvet, C. (2004). Prédiction des propriétés de rétention en eau des sols d’un bassin versant à l’aide de fonctions de pédotransfert, influence de la densité apparente et de la teneur en éléments grossiers. Étude et gestion des sols. 11(2): 81-99.

  19. Mousavi, S.B. (2015). Employment of Pedo-transfer functions for predicting initial and final infiltration rates using Horton model and soil readily available characteristics. Indian Journal of Agricultural Researchdoi: 10.5958/0976-058X.2015.00060.8.

  20. Ostavari, Y., Sgari, K., Cornelis, WM. (2015). Performance evaluation of pedotransfer functions to predict field capacity and permanent wilting point using UNSODA and HYPES datasets. Arid Land Research and Management. 29: 383-398. doi: https://doi: 10.1080/15324982.1029649. 

  21. Patil, N.G. and Rajput, G.S. (2014). Surrogate estimation of soil water retention characteristics of seasonally mpounded soils. Indian Journal of Agricultural Research. doi: 10.5958/0976-058X.2014.01324.9.

  22. Rab, M.M., Chandra, S., Fisher, P.D., Robinson, N.J., Kitching, M., Aumann, C.D., Imhof, M. (2011). Modelling and prediction of soil water contents at field capacity and permanent wilting point of dryland cropping soils. Australian Jounal of Soil Research49: 389-407. doi: https://doi.10.1071/ SR10160-1838-2675X/11/050389.

  23. Santra, P., Kumar, M., Kumawat, R.N., Painuli, D.K., Hati, K.M., Heuvelink, G.B.M., Batjes, N.H. (2018). Pedotransfert functions to estimate soil water content at field capacity and permanent wilting point in hot Arid Western India. J. Earth. Sci. 127: 35. doi: https://doi.org/10.1007/s12040-018-0937-0j. jhydrol.2012.03.036.

  24. Saxton, K.E., Rawls, W., Romberger, J.S. and Papendick, R.I. (1986). Estimating generalized soil-water characteristics from texture. Soil Science Society of America Journal. 50(4): 1031-1036. doi: https://doi.org/10.2136/sssaj1986.0361 5995005000040039x.

  25. Skalová, J., Čistý, M., Bezák, J. (2011) Comparison of three regression models for determining water retention curves. J. Hydrl Hydromech. 59(4): 275-284. doi: https://doi:10.10.2478/ v10098-011-0023-7.

  26. Van Looy, K., Bouma, J., Herbst, M., Koestel, J., Minasny, B., Mishra, U., Montzka, C., Nemes, A., Pachepsky, Yakov, A., Padarian, J., Marcel, G.S., Tóth, B., Verhoef, A., Vanderborght, J., Ploeg, J., Weihermüller, L. (2017). Pedotransfer functions in earth system science: Challenges and perspectives. Rev. Geophys. 55: 1199-1256. doi: https:// doi.org/ 10.1002/2017RG000581.

  27. Wösten, J.H.M., Pachepsky, Y.A., Rawls, W.J. (2001). Pedotransfer functions: Bridging the gap between available basic soil data and missing soil hydraulic characteristics. Journal of Hydrology. 251: 123-150.

Soil Water Content Modeling using Artificial Neural Network in Arid Soils of Algeria

A
Abdelhafid Youcef1,2
R
Rechachi Miled Zohra1,*
C
Chebbah Mohamed3
M
Menasria Hanane1
1Scientific and Technical Research Center on Arid Regions (CRSTRA), BP 1682 RP, 07000, Biskra, Algeria.
2University Mohamed Khider of Biskra, Algeria.
3University Abdelhafid Boussouf of Mila, Algeria.

Background: In order to model ecological and hydrological processes and schedule irrigation as efficiently as possible, soil water contents at field capacity (FC) and permanent wilting point (PWP) across a landscape are crucial. However, it takes a lot of time and money to measure them directly. An alternative is offered by pedotransfer functions (PTFs), which forecast them based on readily available or simpler soil variables.

Methods: The purpose of this study was to compare the performance of FC and PWP models, developed with artificial neural networks (ANN) to multiple linear regression (MLR). A total of 120 soil samples from various Biskra province locations were used. The developed models are evaluated using the statistical criteria of mean error (ME), coefficient of determination (R2) and root mean square error (RSME).

Result: The results show that the ANN performs better than the traditional MLR. The obtened results were very positive and indicated that the ANN approach held promise for forecasting soil water contents in our area.

Modeling ecological and hydrological processes and scheduling irrigations optimally require consideration of the soil’s water capacity or bioavailable water (Dobarco et al., 2018). However, it is frequently noted that water supplies are determined empirically and without consideration of this data, especially in arid and semi-arid regions. This, leads to environmental damage pollution such as rising surface water tables, nutrient leaching and a loss of water reserves. In order to effectively use the limited water resources that are available, soil water contents at field capacity (FC) and permanent wilting point (PWP) are essential hydraulic characteristics (Santra et al., 2018). However, due to the time-consuming nature of direct measurements and the impossibility of applying them to large areas, these properties are typically only known for a small number of soils (Haghverdi et al., 2012; McNeill et al., 2018). In order to get around these challenges, pedotransfer functions (PTFs) are an alternate method for estimating them from readily available or more readily measurably available soil data, such as bulkdensity, organic matter content and soil texture (Cosby et al., 1984; Saxton et al., 1986). Although there are no truly universal PTFs, the study area’s proximity and the parent materials’ similarity are crucial factors in determining the potential of PTFs (Morvan et al., 2004). Therefore, it is important to carefully evaluate the validity of any given PTF before extrapolating it outside of its geographic training area. Nonetheless, the majority of studies on pedotransfer functions (PTFs) have been produced in humid climates (Julià-Ferrer et al., 2004). Despite the existence of several studies that are interested in the pedotransfer’s function, such as those by Patil and Rajput (2014) and Mousavi (2015), in order to estimate and predict the amount of water retained in the soil The prediction of soil water retention properties has received relatively little attention, despite the fact that water availability is one of the primary factors limiting agricultural production in arides and semi-arides regions (Wösten et al., 2001; Khlosi et al., 2016; El Majou et al., 2016; Ghorbani et al., 2017; Santra et al., 2018). There has been relatively little published research on soil water properties for Algerian soils (Dridi and Dilimi, 2011 ; Dridi and Zemmouri, 2012). PTFs that describe soil water properties over large areas in Algeria’s Bas Sahara are in high demand for creating sustainable farming systems, but their availability is limited. Artificial neural networks (ANNs) and other machine learning algorithms have recently shown greater accuracy than other approaches in predicting the water content at FC and PWP (Morvan et al., 2004; Merdum et al., 2006). The main benefit of employing ANN techniques over traditional approaches is that they are completely nonparametric, meaning they do not consider the relationships between the input and output data (Santra et al., 2018). The goal of the current study was to predict soil water content at FC and PWP using artificial neural networks (ANN) and compare their performance to multiple linear regression (MLR) for various soil types in the region using relatively limited data on water content at FC and PWP.
Study area and data collection
 
As illustrated in Fig 1, Biskra province (eastern Algeria) is situated at the base of the saharan Atlas and covers about 21 509.80 km2. It is renowned for its phoenicicol heritage, estimated at 5 million of date palms and 98.478 ha of irrigated land (ANAT, 2003). This agro-ecological zone belongs to the Saharan bioclimatic stage. According to Word Reference Base classification (IUSS Working Group WRB, 2014), the main soil types in the province of Biskra are Fluvisols (50%), Calcisols and Gypsisols (37.8%) and solonshaks (13.1%) (ANAT, 2003). In all, 120 soil samples were collected from various locations throughout this region. All collected soil samples were analysed in the laboratory. The particle-size distribution (clay, silt and sand contents) were determined by the Robinson’s pipette method, while the soil cylinder cores (100 cm3) method was used to determine BD. Organic matter content (OM%) was determined by the wet oxidation method of Walkley and Black method and soil water contents at FC (-33 kPa) and PWP (-1500 kPa) were determined by pressure plate apparatus method.

Fig 1: Location of the study area with soil profile locations.


 
Statistical modeling 
 
The R program was used to carry out the statistical modeling in this investigation. To estimate FC and PWP, five parameters clay, silt, sand, OM and BD were employed as predictors. Seventy percent of the data were in the training subset after the data were randomly divided into training and testing subsets prior to statistical analysis. PTFs were trained and tested using 84 and 36 samples, respectively. The test and training data were both normalized. Based on their relationship to soil water contents and an assessment of multicollinearity between variables using the variance inflation factor (VIF), the soil variables were selected to serve as arguments for the development of models (Dobarco et al., 2018).
       
The backward stepwise method was used to select the most important input variables for MLR modeling, followed by the linear, quadratic and potential interaction terms of soil variables (Merdum et al., 2006 ; Rab et al., 2011). An equation of the following form was used to develop the PTF.
 
Y = b0 + b1X1...+ b5X+ b6X6
 
Where,
Y= The dependent variable.
b0= The intercept.
b01..bn= Regression coefficients.
X1 - X5= Independent variables referring to basic soil propreties.
       
The feed-forward multilayer perception (MLP) model was used for ANN modeling in order to predict FC and PWP. Clay, silt, BD and OM are the four input soil variables that make up the input layer. The input, hidden and output layers make up this network (Fig 2). The performance of the calibration and validation data sets determines how many hidden neurons are used (Van Looy et al., 2017). To create the hidden layer, the network’s input layer is weighted, summed and biased. A similar process is used to process the hidden neurons’ output, which is then converted by another activation function to generate the output. Through an iterative process, the objective function is minimized to obtain the weights and biases in ANN. Any transfer function, including log-sigmoid, tan-sigmoid and linear, can be used by neurons to produce their output. Since the tan-sigmoid (Tanh) transfer function is the most widely used transfer function in soil science, it was utilized in our case to create the output layer (Mohanty et al., 2015). Several statistical indices, including the mean of prediction (ME), root-mean-square error (RMSE) and coefficient of determination (R2), were used to assess the model’s performance. The most popular metrics for assessing a model’s performance are these statistical indices (Van Looy et al., 2017).

The coefficient of determination (R2):

 
Root-mean-square error (RMSE):

 
Mean of prediction (ME):


Where,
Yi= The mesured value.
Ŷi= The estimated value.
i= The average of the measured value.
N= The number of samples.

Fig 2: ANN model used in study.

According to the Table 1, both data sets showed statistically similar features, for example, BD and sand, silt and clay contents were 1.058-1.940 g.cm-3, 02-68%, 02-94% and 03-85%, respectively for the training data set. FC and PWP ranged from 0.03 to 0.42 g.g-1 and from 0.016 to 0.36g.g-1, respectively. It is obvious that the wide range of physical characteristics in Biskra province is caused by the presence of various soil types.

Table 1: Statistic parameters of soil attributes for the training and test sets.


       
Fig 3 shows that, despite having opposite polarity, both FC and PWP have a reasonably strong and smooth relationship (r>0.70) with clay and sand contents. Thus, it might be the most accurate indicator of FC and PWP. The relationships between BD, silt and OM and FC and PWP are rather slender. Additionally, it seems that BD has a bad relationship with PWP and FC. As a result, these characteristics by themselves are unlikely to be reliable indicators of FC and PWP; however, when combined with clay and sand in a multiple prediction model, they might enhance model performance. The relationship between FC and PWP seems to be solid and seamless. Because of its collinearity with clay, the sand content was eliminated. In order to predict FC and PWP, the following independent variables were selected for modeling: BD, silt, clay and OM content.

Fig 3: Correlation coefficients between soils attributs.


       
According to the results displayed in Table 2, the model 2 created using clay, silt and BD seems to perform better than the others during the training data sets. These predictor variables explained 76% and 62% of variation for FC and PWP respectively. Additionally, when combined with silt, BD seems to enhance the model’s performance, however, OM only slightly enhances the regression. Overall, FC outperformed PWP in terms of evaluation indices (R2, RMSE and ME). For FC, the R2 and RMSE values were 0.76 and 0.095, while for PWP were 0.62 and 0.1161. Both models included the indipendent variables silt, bulk density, BD-silt, for FC and silt- BD and quadratic clay for PWP.

Table 2: Coeficients of variables used in MLR.


       
As indicated in Table 3, the model 2 for FC and PWP, respectively, had the lowest level of RMSE and the highest level of R2. In general, a lower RMSE and higher R2 value are statistical indicators of the model’s strong performance. In our investigation, the R2 and RMSE  values for the training data set, are 0.761 and 0.095, for FC, while the values for PWP are 0.639 and 0,114, respectively. For the testing data set, the R2 and RMSE values are 0.798 and 0.080 for FC, while for the PWP these values are 0.652 and 0.105, respectively.

Table 3: Statistic performances of the ANN model.


       
For forecasting soil water contents at FC and PWP, the particle size distribution (PSD), OM and BD were frequently employed as predictors (Chakraborty et al., 2011; Wösten et al., 2001). In the present study, compared to BD and OM, particle size distribution demonstrated a stronger correlation with soil water content. Consequently, using clay, silt and BD provide a good prediction. Our results were consistent with those of Ostavari et al. (2015) Ghorbani et al., (2017) and Li et al. (2019). In MLR modeling, clay, silt and BD are the variables that explain 76 and 62% of variation in FC and PWP respectivly. The observed correlations between clay, FC and PWP are consistent with those reported for other soil types Cosby et al., (1984) and Rab et al., (2011). In our case, OM is unlikely to have an impact on the FC and PWP predictions. This could be as a result of the clay or sand content variation being significantly greater than the OM content variation. As a result, any connection between FC and OM would have been concealed. Our findings are comparable to those obtened by Minasny and McBratney (2002); Khlosi et al., (2016) and Santra et al., (2018). Rab et al., (2011) noted that Australian soils have very little organic matter (OM) and did not include it in their PTFs.
       
From Table 4, it can be clearly apparent that the ANN produced low RMSE and high R2  values during the training phase when compared to the conventional method MLR. The RMSE values varied from 0.082 to 0.090 g.g-1 and from 0.082 to 0.090 g.g-1 for FC and PWP, respectively. These values were lower than those of the MLR models, which ranged from 0.038 to 0.044 g.g-1. The Rvalues of the ANN models ranged from 0.713 to 0.879 and 0.713 to 0.843, respectively, whereas the MLR models showed the lower values with 0.715-0.788. Nonetheless, the t-statistic (P<0.05) indicates that there was no significant difference in the two methods’ performance for either FC or PWP. Similarity, Skalová et al. (2011) found that ANN performed slighly better than MLR with limeted data.

Table 4: Goodness-of-fit for MLR and ANN models in predicting soil water contents at FC and PWP.


       
In contrast to MLR models, ANN models were able to generate accurate predictions during the testing phase (Table 4). The RMSE of ANN models varied from 0.002 to 0.048 and were smaller than those of the MLR models, witch ranged from 0.031 to 0.057 and from 0.035 to 0.040, respectively. The R2 values of the ANN model ranged from 0.739 to 0.879 indicating a more accurate prediction than MLR models, for wich the values ranged from 0.647 to 0.821 and 0.692 to 0.849, respectively.
       
Overall, Fig 4 displays the performance of every prediction model created during the testing phase in the research area. The best-fit model for predicting FC and PWP was determined to be an ANN model with clay, silt and BD input variables. For all models, evaluation indices performed better at FC than PWP. Haghverdi et al., (2012) demonstrated that the better performance of either of data mining methods could be related to the PTF type and database characteristics rather than to inherent supremacy of either of the data mining methods. According to the developed PTFs, it was observed that ANN models with silt, clay and BD as predictors and four neurons in hidden layer had better performance in predicting soil water content at FC and PWP.
FC = 0.146-1.131×H1-0.207×H2-1.14×H3+0.292×H4
PWP = 0.719+0.355×H1-0.584×H2+1.131×H3-0.889×H4
Where H1 = Tanh (0.5× (-0.254-1.747×Clay-0.554×Silt-0.174×BD), H2 = Tanh (0.5×(0.032-1.048×Clay-2.980×Silt-0.075×BD), H3=Tanh (0.5×(-0.656+1.349×Clay+0.136´Silt-1.016×BD), H4 = Tanh (0.5× (1.815+1.555×Clay+0.049×Silt-0.180×BD)
       
Where Tanh stands for hyperbolic tangent of any given number.

Fig 4: Scatter plots of measured and predicted FC and PWP using ANN and MLR models in testing.

This study examines the performance of using ANN and MLR for estimating soil water contents at FC and PWP in arid soils of Algeria. Relative data samples were used to create different models in order to accomplish this goal. The results indicate that texture is stronger correlation with FC and PWP. Organic matter improve a lower prediction. Clay, silt and BD explain 76 and 62% of the variation in FC and PWP respectively. Based on statistical criteria, the obtened results indicate that the overall performance of ANN model was better than of MLR, but this difference was statististically not significant. Also, it was observed that ANN with three input variables clay, silt, BD and four neurons in hadden layer had better performance in predicting of FC and PWP. Therfore, the results were highly encouraging and suggested that ANN methods were promissing in modeling of soil water contents.
The present study was supported by..CRSTRA ...Scientific and Technical Research Center on arid Regions (CRSTRA).
 
Disclaimers
 
The views and conclusions expressed in this article are solely those of the authors and do not necessarily represent the views of their affiliated institutions.
       
The authors are responsible for the accuracy and completeness of the information provided, but do not accept any liability for any direct or indirect losses resulting from the use of this content.
 
Informed consent
 
 All animal procedures for experiments were approved by the Committee of Experimental Animal care.
The authors declare that there are no conflicts of interest regarding the publication of this article. No funding or sponsorship influenced the design of the study, data collection, analysis, decision to publish, or preparation of the manuscript.

  1. ANAT (Agence National d’Aménagement du Territoire) (2003). Schéma directeur des ressources en eau de la wilaya de Biskra. Dossier Agro-pédologique, 175p.

  2. Chakraborty, D., Mazumdar, S.P., Garg, R.N., Banerjee, S., Santra, P., Singh, R and Tomar, R.K. (2011). Pedotransfer functions for predicting points on the moisture retention curve of Indian soils. Indian J. Agric. Sci. 81(11): 1030.

  3. Cosby, B.J., Homberger, G.M., Clapp, R.B., Ginn, T.R. (1984). A statistical exploration of the relationships of soil moisture characteristics to the physical properties of soils. Water Resour Res. 20(6): 682-690. 

  4. Dobarco, M.R., Cousin, I., Bas, C.L., Martin, M.P. (2018). Pedotransfer functions for predicting available water capacity in French soils, their applicability domain and associated uncertainty. Geoderma. 336: 81-95. https://doi.org/ 10.1016/j.geoderma.2018.08.022.

  5. Dridi, B., Dilmi, A. (2011). Poids des différentes caractéristiques des sols dans l’estimation de leur rétention en eau, Cas des sols d’Algérie. Etude et Gestion des sols. 18(4): 247-258.

  6. Dridi, B., Zemmouri, S. (2012). Fonction de pédotransfert pour les sols de la plaine de la Mitidja (Algérie): Recherche de paramètres les plus pertinents pour la rétention en eau. Biotechnol. Agron. Soc. Environ. 16(2): 193-201.

  7. El Madjou, H., Kaba, R., Almesber, W., Bruand, A. (2016). Validation de l’estimation des propriétes de rétention en eau de sols syriens à partir de fonctions et classes de pédotransfert développées pour des sols français. Étude et Gestion des Sols. 23: 112-123.

  8. Ghorbani, M.A., Shamshirband, S., haghi, D.Z., Azani ABonakdari, H., Ebtehaj, I. (2017). Application of firefly algorith-based support vector maxchines for prediction of fiel capacity and permanent wilting point. Soil and Tillage Research. 172: 32-38.  https://dx.doi.org/10.1016/j.still.2017.04.009.

  9. Haghverdi, A., Cornelis, W.M., Ghahraman, B. (2012). A pseudo- continuous neural network approch for developing water retention pedotransfer functions with limited data. Journal of Hydrology. 442-443: 46-54. https://doi.org/10.1016/ j.jhydrol.2012.03.036.

  10. IUSS Working Group W.R.B. (2014). Word Reference Base for Soil Ressources : International soil classification system for naming soil and creating legends for soil maps. Word soil Ressources Report No 106. FAO, Rome. 

  11. Julià-Ferrer, M., Monreal, TE. Sánchez del Corral Jiménez, A, García Meléndez, E. (2004). Constructing a saturated hydraulic conductivity map of Spain using pedotransfer functions and spatial prediction. Geoderma. 123:  257-277. doi: 10.1016/j.geoderma.2004.02.011.

  12. Khlosi, M., Alhamdoosh, M., Douaik, A., Gabriels, D. and Cornelis, W.M. (2016). Enhanced pedotransfer functions with support vector machines to predict water retention of calcareous soil. European Journal of Soil Science. 67(3): 276-284. https://doi.org/10.1111/ejss.12345. 

  13. Li, X.D., Shao, M.A., Zhao, C.L. (2019). Estimating the field capacity and permanent wilting point at the regional scale for the hexi corridor in China using a state-space modeling approach. J. Soils Sediments. 19: 3805-3816. https:// doi.org/10.1007/s11368-019-02314-6.

  14. McNeill, S.J., Lilburne, L.R., Carrick, S., Webb, T.H., Cuthill, Y. (2018). Pedotransferfunctions for the soil water characteristics of New Zealandsoils using S-map information. Geoderma. 326: 96-110.https://doi.org/10.1016/j.geoderma.2018. 04.011.

  15. Merdum, H., Çinar, Ö., Meral, R., Apan, M. (2006). Comparison of artificial neural network and regression pedotransfer function for prediction of soil water retention and saturated conductivity. Soil and Tillage Research. 90: 108-116. doi: https:// doi:10.1016/j.still.2005.08.011.

  16. Minasny, B., McBratney, A. (2002). The neuro-m method for fitting neural network parametric pedotransfer functions. Soil Science Society of America Journal. 66(2): 352-361.

  17. Mohanty, M., Nishant, K., Painuli, D.K.S., Bandyopadhay, K.K., Hati, K.M., Reddy, K.S., Chaudhary, R.S. (2015). Modelling soil water contents at field capacity and permanent wilting point using artificial neural network for Indian soils. Nati. Acad. Sci. Lett. 38(5). doi: 10.1007/s40009-015-0358-4. 

  18. Morvan, X., Bruand, A., Cousin, I., Roque, J., Baran, N., Mouvet, C. (2004). Prédiction des propriétés de rétention en eau des sols d’un bassin versant à l’aide de fonctions de pédotransfert, influence de la densité apparente et de la teneur en éléments grossiers. Étude et gestion des sols. 11(2): 81-99.

  19. Mousavi, S.B. (2015). Employment of Pedo-transfer functions for predicting initial and final infiltration rates using Horton model and soil readily available characteristics. Indian Journal of Agricultural Researchdoi: 10.5958/0976-058X.2015.00060.8.

  20. Ostavari, Y., Sgari, K., Cornelis, WM. (2015). Performance evaluation of pedotransfer functions to predict field capacity and permanent wilting point using UNSODA and HYPES datasets. Arid Land Research and Management. 29: 383-398. doi: https://doi: 10.1080/15324982.1029649. 

  21. Patil, N.G. and Rajput, G.S. (2014). Surrogate estimation of soil water retention characteristics of seasonally mpounded soils. Indian Journal of Agricultural Research. doi: 10.5958/0976-058X.2014.01324.9.

  22. Rab, M.M., Chandra, S., Fisher, P.D., Robinson, N.J., Kitching, M., Aumann, C.D., Imhof, M. (2011). Modelling and prediction of soil water contents at field capacity and permanent wilting point of dryland cropping soils. Australian Jounal of Soil Research49: 389-407. doi: https://doi.10.1071/ SR10160-1838-2675X/11/050389.

  23. Santra, P., Kumar, M., Kumawat, R.N., Painuli, D.K., Hati, K.M., Heuvelink, G.B.M., Batjes, N.H. (2018). Pedotransfert functions to estimate soil water content at field capacity and permanent wilting point in hot Arid Western India. J. Earth. Sci. 127: 35. doi: https://doi.org/10.1007/s12040-018-0937-0j. jhydrol.2012.03.036.

  24. Saxton, K.E., Rawls, W., Romberger, J.S. and Papendick, R.I. (1986). Estimating generalized soil-water characteristics from texture. Soil Science Society of America Journal. 50(4): 1031-1036. doi: https://doi.org/10.2136/sssaj1986.0361 5995005000040039x.

  25. Skalová, J., Čistý, M., Bezák, J. (2011) Comparison of three regression models for determining water retention curves. J. Hydrl Hydromech. 59(4): 275-284. doi: https://doi:10.10.2478/ v10098-011-0023-7.

  26. Van Looy, K., Bouma, J., Herbst, M., Koestel, J., Minasny, B., Mishra, U., Montzka, C., Nemes, A., Pachepsky, Yakov, A., Padarian, J., Marcel, G.S., Tóth, B., Verhoef, A., Vanderborght, J., Ploeg, J., Weihermüller, L. (2017). Pedotransfer functions in earth system science: Challenges and perspectives. Rev. Geophys. 55: 1199-1256. doi: https:// doi.org/ 10.1002/2017RG000581.

  27. Wösten, J.H.M., Pachepsky, Y.A., Rawls, W.J. (2001). Pedotransfer functions: Bridging the gap between available basic soil data and missing soil hydraulic characteristics. Journal of Hydrology. 251: 123-150.
In this Article
Published In
Agricultural Science Digest

Editorial Board

View all (0)