Elsevier

Science of The Total Environment

Volume 539, 1 January 2016, Pages 26-35
Science of The Total Environment

Mapping soil organic carbon content using spectroscopic and environmental data: A case study in acidic soils from NW Spain

https://doi.org/10.1016/j.scitotenv.2015.08.088Get rights and content

Highlights

  • We used FTIR–ATR data to model the distribution of SOC in topsoils from NW Spain.

  • SOC predictions using FTIR–ATR are similar to those obtained by wet chemistry data.

  • Climate is the main parameter influencing the accumulation of SOC in the study area.

  • This method is suitable to quickly map SOC in acidic soils under similar conditions.

Abstract

In this study we present a methodology to estimate and map the content of soil organic carbon (SOC) in topsoils using spectroscopic (FTIR–ATR) and environmental raster data. We determined the SOC content in 221 topsoil samples in Galicia (NW Spain) using the Walkley–Black method. FTIR–ATR spectroscopic data was measured upon the same set of samples. The Random Forest (RF) technique was used to link the measured SOC concentrations to the FTIR–ATR measurements in order to identify the relevant absorbance bands explaining most of the variability in SOC. We then used linear regression (MLR) to predict SOC concentrations from the selected FTIR–ATR bands as independent proxy. This model showed a good predictive performance (r-squared = 0.88; RSME = 2.14; ME = 0.05; RPD = 3.14), indicating that SOC can be effectively estimated from the identified spectral bands. Finally, we used Partial Least Squares (PLS) to model the spatial distribution of the predictor bands using a number of environmental raster maps (climate, land use and geology) as covariates. This new raster was used within the MLR model previously created to generalize the predictions of SOC in the whole study area. This approach shows that FTIR data can be used to map SOC while minimizing analytical costs and time efforts.

Introduction

Soil organic carbon (SOC) stock constitutes the largest pool of terrestrial organic carbon, acting as an important long-term sink for carbon released to the atmosphere by human activities (Bellon-Maurel and McBratney, 2011, Grinand et al., 2012, Lal, 2004, Madari et al., 2006, Pedersen et al., 2011). The evaluation of SOC stocks is important for a proper evaluation of the effect of the emission of greenhouse gases to the atmosphere under different climate-change scenarios (Abd-Elmabod et al., 2014, Lal, 2004, Page et al., 2013).

Climate, land use and the nature of the soil mineral fraction are environmental variables closely related to the amount of SOC stored in soils. At a global scale, climate has a clear effect on the amount of organic carbon stored in soils, being low temperatures and high precipitation the most favourable conditions for enhanced accumulation (Barford et al., 2001, Batjes, 1996, Melillo et al., 2002, Trumbore et al., 1996). Anthropogenic practices such as deforestation, drainage and forest fires are converting large areas into globally significant sources of carbon dioxide to the atmosphere (Lal, 2004, Monastersky, 2014, Moore et al., 2013). Soil mineralogy also has an effect on both the quantity and turnover of SOC in soils. Torn et al. (1997) showed a positive relationship between the presence of non-crystalline minerals and the SOC content, and it has been observed that amorphous phases of iron and aluminium oxy-hydroxides in soils derived from metamorphic and igneous basic rocks can form stable organo-mineral complexes with organic compounds, which promotes the accumulation of SOC (Álvarez et al., 1992, Carballas et al., 1979, García-Rodeja et al., 1987, Verde et al., 2004). Additionally, stable micro-aggregates, which protect organic compounds against microbial degradation, have been observed in soils with high clay contents (Jobbágy and Jackson, 2000, Lal, 2004, Torn et al., 1997).

Over the past few decades, Digital Soil Mapping has been used to predict and describe the spatial distribution of soil properties (Behrens et al., 2005, McBratney et al., 2003, Scull et al., 2003, Vaysse and Lagacherie, 2015). Digital Soil Mapping makes use of statistical algorithms to relate soil parameters, measured on field samples, to environmental auxiliary data and make predictions on a spatial basis. The approaches most commonly used to estimate SOC include multiple linear regression, ordinary kriging, co-kriging, regression-kriging and geographically weighted regression (Chaplot et al., 2001, Chen et al., 2000, Dobos et al., 2006, Grimm et al., 2008, Kumar and Lal, 2011, Kumar et al., 2012, Martin et al., 2014, Mishra et al., 2009, Phachomphon et al., 2010, Simbahan et al., 2006). Recent studies obtained good SOC predictions using Random Forest and Partial Least Squares Regression algorithms (Grimm et al., 2008, Rodríguez-Lado and Martínez-Cortizas, 2015, Were et al., 2015).

The development of methods for mapping SOC contents along extensive areas while minimizing sampling and laboratory analyses is still a challenge. Wet chemistry techniques to measure SOC concentrations, such as the Walkley–Black method, are time consuming and relatively expensive. During the last years, infrared spectroscopy has been proposed as a robust, rapid and effective alternative technique to evaluate soil compounds and properties such as organic carbon, total carbon, total nitrogen, potassium, phosphorus, organic matter and clay contents, CEC, pH or water potential by using statistical models (Ge et al., 2014, Linker, 2011, Soriano-Disla et al., 2014, Viscarra Rossel et al., 2006). Models using spectroscopic measurements to evaluate SOC concentrations mostly involve data either in the mid-infrared (MIR, Table 1) or near-infrared (NIR) regions, or a combination of data in the near-infrared and visible (VisNIR) regions (Nocita et al., 2014, Shi et al., 2015, Stenberg et al., 2010). Accurate models should present small standard errors, high R-squared values and high ratios of standard deviation (RPD) (Bellon-Maurel et al. 2012).

MIR (4000 to 400 cm 1) is a prominent region that clearly discriminates molecular functional groups, easily identifiable through spectral libraries (Reeves, 2010). Since bands in MIR can be affected by distortion or total absorption phenomena, the dilution of the samples before the measurement is often required (Linker, 2011, McCarty et al., 2002). MIR spectroscopy coupled to Fourier Transform Infrared Attenuated Total Reflectance (FTIR–ATR) is a powerful technique for quantitative and qualitative analyses which avoids the dilution of the samples required by MIR analysis and thus decreases the time required for each measurement (Ge et al., 2014). FTIR–ATR has been widely used to quantify different soil properties and processes such as nitrate concentrations and the kinetics of its transformation (Borenstein et al., 2006, Janh et al., 2006, Kira et al., 2014, Linker et al., 2004, Linker et al., 2005, Linker et al., 2006, Shaviv et al., 2003), the speciation and amount of organic and inorganic carbon and the sand and clay contents (Ge et al., 2014, Solomon et al., 2005), the adsorption mechanisms of phosphate and arsenate (Arai and Sparks, 2001, Sun and Doner, 1996), and it has been even used to perform agro-environmental classifications of soils (Aranda et al., 2014, Du et al., 2008).

Multivariate statistics constitute a highly suitable mean to analyse complex data such as data from spectroscopy (Linker, 2011). Algorithms such as Multiple Linear Regression (MLR), Principal Component Regression (PCR), Partial Least Squares regression (PLS), Artificial Neural Networks (ANN), Multivariate Adaptive Regression Splines (MARS) or Random Forest (RF) make use of spectroscopic data to predict specific soil properties (Bellon-Maurel and McBratney, 2011, Knox et al., 2015, Stenberg et al., 2010). MLR, PLS and PCR are mostly used for this purpose, while ANN, MARS and RF constitute more elaborated algorithms mainly used when the former methods fail to ascertain relationships between soil properties and the spectroscopic signal (Næs and Mevik, 2001, Viscarra Rossel et al., 2006). The identification of specific spectroscopic bands, within the whole spectroscopic signal, explaining a high proportion of variability of the soil property under investigation, is of special interest since it can provide information on the dominant constituents or processes influencing such property. Recent studies showed that non-linear methods, such as regression trees, can be used to identify specific spectroscopic bands that improve the performance of the obtained models. Knox et al. (2015) modelled soil carbon fractions using Random Forest and concluded that the most influential bands to explain SOC were those allocated around 2200 and 1700 cm 1. Peaks around 2000 cm 1 have been identified as Si–O vibration from quartz. These peaks are usually intense in mineral soils with poor organic matter content (Du and Zhou, 2009, Reeves and Smith, 2009, Soriano-Disla et al., 2014). Peaks around 1700 cm 1 were attributed to stretching of Cdouble bondO bonds from aldehydes, ketones and carboxylic acids associated to hydrophobic and hydrophilic compounds of soil organic matter (Ellerbrock and Kaiser, 2005, Pedersen et al., 2011, Simkovic et al., 2008, Viscarra Rossel and Behrens, 2010, Vohland et al., 2014).

Despite the good model performance obtained in these studies, the results can only be used to quantify the amount of SOC from spectroscopic data at sample scale. However studies showing how to translate the statistical relationships between SOC and spectroscopic data to a predictive model depicting SOC variability in space are lacking. In this study we present an approach to map SOC content in soil epipedons by using FTIR–ATR data and raster maps of environmental covariates. The objectives of this study are: i) to identify the relevant bands of FTIR–ATR data providing information on the SOC concentrations and make a multiple linear regression model (MLR) relating SOC and the relevant FTIR–ATR bands at plot scale, ii) to map the spatial distribution of these bands using a PLS model relating the FTIR–ATR values for these specific bands and a number of easily accessible environmental covariates, in the form of raster maps, and iii) to create a map of SOC using the initial MLR model and the maps of spectral bands formerly created by PLS. This study shows that FTIR–ATR data can be spatialized and used to evaluate SOC stocks while decreasing time and project costs.

Section snippets

Study site

Topsoil samples were collected in the autonomous region of Galicia (NW-Spain). This region is a transitional climatic area from oceanic hyper-humid to sub-humid conditions with a climate described as temperate subtropic with wet winters (Martínez-Cortizas et al., 1994). The dominant soil temperature regime is mesic (Taboada and García, 1999). There is a WE gradient of temperature, with the coldest areas corresponding to those locations in eastern longitudes. The dominant soil moisture regime is

Soil organic carbon

The relationship between the SOC measurements obtained by the Walkley–Black method (SOCWB) and the total carbon measured by combustion (CLECO) was modelled using a linear regression model (Eq. (1)). The r-squared value (r2 = 0.88) showed that in our samples almost all the carbon is present as oxidizable carbon, thus the SOC can be properly evaluated by the Walkley–Black method.SOCWB=0.8336CLECO+0.279

The mean SOC content in our dataset of samples is 7.4%, with a range of variation from 46.2% in

Discussion

In our study, absorbance at 1697 cm 1 appears as the most relevant band to predict the SOC in the studied area. According to bibliographic sources this peak corresponds to the stretching vibration of Cdouble bondO bond from aldehydes, ketones and carboxylic acids in hydrophobic and hydrophilic compounds of soil organic matter (Ellerbrock and Kaiser, 2005, Pedersen et al., 2011, Simkovic et al., 2008, Viscarra Rossel and Behrens, 2010, Vohland et al., 2014). Recent studies relating processes of

Conclusions

In this paper we developed a new methodology to estimate and map SOC content in epipedons by using the value of absorbance at 1697 cm 1 from the FTIR–ATR spectra. The statistical relationships between the information contained in this spectral band and the environmental covariates here employed are similar to those obtained with Walkley–Black SOC measurements and enabled to translate the spectroscopic information from sample/plot scale to regional scale and map SOC. The correlation coefficients

Acknowledgments

This research was funded by the autonomous Government of Galicia (Xunta de Galicia) through the research grant EM 2012/60. Luis Rodríguez Lado was supported by the Isidro Parga Pondal Research Programme and the Galician Research Plan I2C (Xunta de Galicia, 2011, 2014). We would like to thank David Romero and Carmen Pérez Llaguno for their support in the laboratory analyses. We specially acknowledge our colleagues at the Supercomputational Center of Galicia (CESGA) for the support in the

References (94)

  • S. Kumar et al.

    A geographically weighted regression kriging approach for mapping soil organic carbon stock

    Geoderma

    (2012)
  • R. Linker et al.

    Soil identification and chemometrics for direct determination of nitrate in soils using FTIR–ATR mid-infrared spectroscopy

    Chemosphere

    (2005)
  • R. Linker et al.

    Nitrate determination in soil pastes using attenuated total reflectance mid-infrared spectroscopy: improved accuracy via soil identification

    Biosyst. Eng.

    (2006)
  • B.E. Madari et al.

    Mid- and near-infrared spectroscopic assessment of soil compositional parameters and structural indices in two Ferralsols

    Geoderma

    (2006)
  • M.P. Martin et al.

    Evaluation of modelling approaches for predicting the spatial distribution of soil organic carbon stocks at the national scale

    Geoderma

    (2014)
  • A.B. McBratney et al.

    On digital soil mapping

    Geoderma

    (2003)
  • A.B. McBratney et al.

    Spectral soil analysis and inference systems: a powerful combination for solving the soil data crisis

    Geoderma

    (2006)
  • M. Nocita et al.

    Prediction of soil organic carbon content by diffuse reflectance spectroscopy using a local partial least square regression approach

    Soil Biol. Biochem.

    (2014)
  • J.A. Pedersen et al.

    Characterization of soil organic carbon in drained thaw-lake basins of Arctic Alaska using NMR and FTIR photoacoustic spectroscopy

    Org. Geochem.

    (2011)
  • K. Phachomphon et al.

    Estimating carbon stocks at a regional level using soil information and easily accessible auxiliary variables

    Geoderma

    (2010)
  • J.B. Reeves

    Near- versus mid-infrared diffuse reflectance spectroscopy for soil analysis emphasizing carbon and laboratory versus on-site analysis: where are we and what needs to be done?

    Geoderma

    (2010)
  • J.B. Reeves et al.

    The potential of mid- and near-infrared diffuse reflectance spectroscopy for determining major- and trace-element concentrations in soils from a geochemical survey of North America

    Appl. Geochem.

    (2009)
  • L. Rodríguez-Lado et al.

    Modelling and mapping organic carbon content of topsoils in an atlantic area of southwestern Europe (Galicia, NW-Spain)

    Geoderma

    (2015)
  • G.C. Simbahan et al.

    Fine resolution mapping of soil organic carbon based on multivariate secondary data

    Geoderma

    (2006)
  • I. Simkovic et al.

    Thermal destruction of osil water repellency and associated changes to soil organic matter as observed by FTIR spectroscopy

    Catena

    (2008)
  • B. Stenberg et al.

    Visible and near infrared spectroscopy in soil science

    Adv. Agron.

    (2010)
  • T. Taboada et al.

    Smectite formation produced by weathering in a coarse granite saprolite in Galicia (NW Spain)

    Catena

    (1999)
  • K. Vaysse et al.

    Evaluating Digital Soil Mapping approaches for mapping GlobalSoilMap properties from legacy data in Languedoc-Roussillon (France)

    Geoderma Reg.

    (2015)
  • R.A. Viscarra Rossel et al.

    Visible, near infrared, mid infrared or combined diffuse reflectance spectroscopy for simultaneous assessment of various soil properties

    Geoderma

    (2006)
  • M. Vohland et al.

    Determination of soil properties with visible to near- and mid-infrared spectroscopy: effects of spectral variable selection

    Geoderma

    (2014)
  • K. Were et al.

    A comparative assessment of support vector regression, artificial neural networks, and random forest for predicting and mapping soil organic carbon stocks across an Afromontane landscape

    Ecol. Indic.

    (2015)
  • S.K. Abd-Elmabod et al.

    Modelling soil organic carbon stocks along topographic transects under climate change scenarios using CarboSOIL

    Geophys. Res. Abstr.

    (2014)
  • H. Abdi

    Partial least squares regression and projection on latent structure regression (PLS Regression)

    Wiley Interdiscip. Rev. Comput. Stat.

    (2010)
  • E. Álvarez et al.

    Geochemical aspects of aluminium in forest soils in Galicia (N.W. Spain)

    Biogeochemistry

    (1992)
  • S. Bará et al.

    Calidad de estación del Pinus pinaster Ait. en Galicia

    (1983)
  • C.C. Barford et al.

    Factors controlling long- and short-term sequestration of atmospheric CO2 in a mid-latitude forest

    Science

    (2001)
  • N.H. Batjes

    Total carbon and nitrogen in the soils of the world

    Eur. J. Soil Sci.

    (1996)
  • T. Behrens et al.

    Digital soil mapping using artificial neural networks

    J. Plant Nutr. Soil Sci.

    (2005)
  • V. Bellon-Maurel et al.

    Critical review of chemometric indicators commonly used for assessing the quality of the prediction of soil attributes by NIR spectroscopy

    Trends Anal. Chem.

    (2012)
  • A. Borenstein et al.

    Determination of soil nitrate and water content using attenuated total reflectance spectroscopy

    Appl. Spectrosc.

    (2006)
  • L. Breiman

    Bagging predictors

    Mach. Learn.

    (1996)
  • L. Breiman

    Random forests — random features

    Technical Report 567

    (1999)
  • L. Breiman

    Random Forest

    Mach. Learn.

    (2001)
  • M. Carballas et al.

    Biodegradación y humificación de la material orgánica en suelos humíferos

    An. Edaf. Agrobiol.

    (1979)
  • V. Chaplot et al.

    Soil carbon storage prediction in temperate hydromorphic soils using a morphologic index and digital elevation model

    Soil Sci.

    (2001)
  • F. Chen et al.

    Field-scale mapping of surface soil organic carbon using remotely sensed imagery

    Soil Sci. Soc. Am. J.

    (2000)
  • F. Díaz et al.

    Capacidad productiva de los suelos de Galicia

    (1984)
  • Cited by (35)

    • Digital mapping of GlobalSoilMap soil properties at a broad scale: A review

      2022, Geoderma
      Citation Excerpt :

      These additional independent data can be collected using a design-based sampling strategy involving probability sampling and design-based estimation. Due to the high cost of additional soil sampling, only a few broad-scale studies (9.1%) have used independent validation for map evaluation (Thomas et al., 2015; Rial et al., 2016; Vaysse et al., 2017; Ellili Bargaoui et al., 2019). Lagacherie et al. (2019) suggested that if this independent validation is not conducted with a proper sampling density, it can lead to uncertain prediction performance assessments.

    View all citing articles on Scopus
    View full text