the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Observational benchmarks inform representation of soil organic carbon dynamics in land surface models
Kamal Nyaupane
Kyongmin Yeo
William J. Riley
Forrest M. Hoffman
Sagar Gautam
Representing soil organic carbon (SOC) dynamics in Earth system models (ESMs) is a key source of uncertainty in predicting carbon–climate feedbacks. Machine learning models can help identify dominant environmental controllers and establish their functional relationships with SOC stocks. The resulting knowledge can be integrated into ESMs to reduce uncertainty and improve predictions of SOC dynamics over space and time. In this study, we used a large number of SOC field observations (n=54 000), geospatial datasets of environmental factors (n=46), and two machine learning approaches (namely random forest, RF, and generalized additive modeling, GAM) to (1) identify dominant environmental controllers of global and biome-specific SOC stocks, (2) derive functional relationships between environmental controllers and SOC stocks, and (3) compare the identified environmental controllers and predictive relationships with those in models used in Phase 6 of the Coupled Model Intercomparison Project (CMIP6). Our results showed that the diurnal temperature, drought index, cation exchange capacity, and precipitation were important observed environmental predictors of global SOC stocks. While the RF model identified 14 environmental factors that describe climatic, vegetation, and edaphic conditions as important predictors of global SOC stocks (R2=0.61, RMSE = 0.46 kg m−2), current ESMs oversimplify the relationships between environmental factors and SOC, with precipitation, temperature, and net primary productivity explaining > 96 % of the variability in ESM-modeled SOC stocks. Further, our study revealed notable disparities among the functional relationships between environmental factors and SOC stocks simulated by ESMs compared with observed relationships. To improve SOC representations in ESMs, it is imperative to incorporate additional environmental controls, such as the cation exchange capacity, and refine the functional relationships to align more closely with observations.
- Article
(3849 KB) - Full-text XML
-
Supplement
(361 KB) - BibTeX
- EndNote
Soil is the largest actively cycling carbon pool in terrestrial ecosystems and stores almost twice the amount of carbon currently present in the atmosphere (Lal, 2016). Even a small change in soil carbon stocks can lead to large changes in the atmospheric CO2 concentration, influencing future climate change trajectories. Additionally, soils play a crucial role in capturing atmospheric CO2 through the storage of soil organic carbon (SOC) (Hinge et al., 2018). Therefore, the sequestration, protection, and sustainable management of SOC stocks can be a promising climate mitigation strategy (Lal, 2020). Accurately representing global SOC storage and its environmental controllers is essential for predicting realistic SOC changes under different land use and climate change scenarios. However, there is currently no consensus among Earth system models (ESMs) on how to represent the spatial distributions of global SOC storage and its fate under future climate change scenarios (Friedlingstein et al., 2014; Arora et al., 2020).
Multiple environmental variables, including climatic and topographic factors, land use history, and edaphic properties, have been identified as possible controllers of SOC storage (Luo et al., 2021; Mishra et al., 2022). However, current ESMs use only a limited number of environmental factors to represent SOC storage and dynamics. A recent study that compared SOC stocks from multiple ESMs with observations indicated a large knowledge gap in both ESMs and observations (Georgiou et al., 2021). Therefore, it is crucial to compare ESM simulations with global SOC observations to evaluate model performance and identify key environmental controllers of global SOC storage.
Benchmarking ESM simulations against observed data is a common approach for model evaluation (Luo et al., 2012; Collier et al., 2018). By comparing model simulations with observations, model strengths, deficiencies, and needed improvements can be identified. The insights gained from SOC benchmarking can lead to new ESM land model structures (by identifying key processes) and new parameterizations (by quantifying key relationships between SOC and environmental variables). Therefore, benchmarking analysis of ESMs is an effective tool for reducing uncertainties in predicting SOC dynamics and can provide more realistic information for managing SOC under changing climate conditions (Lauer et al., 2017).
Currently, ESMs predict SOC stocks primarily with model representations that depend on soil temperature, moisture, and belowground net primary production (Todd-Brown et al., 2013). ESMs capture the positive correlation between net primary productivity (NPP) and precipitation, leading to high SOC stocks in regions with high NPP and sufficient moisture (Sun et al., 2016). Higher temperatures increase soil respiration, which, in the short term, reduces SOC storage. Over the longer term, increased soil respiration can release nutrients, promoting plant growth and increasing belowground carbon inputs, thereby contributing to SOC stocks; however, the balance of these effects can take centuries to manifest (Mekonnen et al., 2022). In ESMs, the temperature sensitivity of soil respiration is often defined using the Q10 or Arrhenius equations (Wynn et al., 2006), although modifications for extreme temperatures are likely needed (Jiang et al., 2013; Azizi-Rad et al., 2022).
Among environmental factors, soil moisture plays a crucial role in plant growth, microbial activity, carbon inputs, and the decomposition of litter and SOC. Global soil carbon stocks correlate with mean annual precipitation, emphasizing the significance of water availability in SOC dynamics. The relationship between soil moisture and microbial activity follows a curve, reaching a maximum at the optimal moisture content. Variations in soil moisture can either inhibit or enhance microbial activity, impacting SOC decomposition rates and carbon cycling (Moyano et al., 2013, 2018; Wieder et al., 2018; Davidson et al., 2012). This nonlinear soil moisture function is crucial for predicting SOC turnover, although its exact form varies among models (Sierra et al., 2015). Different measures of soil moisture are vital for understanding water availability across different scales, serving as indicators for soil–water relationships and ecosystem functioning. Previous studies suggest various functional forms, such as linear, quadratic, or asymptotic, to capture the impact of moisture on soil microbial activity, with relative water saturation proving to be a reliable predictor across diverse soil types (Moyano et al., 2013). The temperature function in soil carbon models represents the sensitivity of SOC decomposition to temperature, as well as the availability of soluble substrates that drive carbon substrate decomposition (Davidson et al., 2012). Based on the Q10 equation, a 10 °C temperature increase roughly doubles the rate of soil respiration, reflecting increased microbial activity that accelerates organic matter decomposition and increases CO2 emissions from the soil. Recent research emphasizes that the variability in the temperature sensitivity of SOC decomposition is influenced by the microbial community composition. A comprehensive understanding of the temperature function requires accounting for microbial community dynamics in soil carbon models, as these dynamics are intricately linked to temperature. This includes both accelerated microbial activity and faster O2 depletion, which together affect soil oxygen dynamics. Despite this complexity, the empirical relationship between soil respiration and temperature, as represented by the Q10 relationship, remains essential for predicting the impacts of temperature change on soil carbon dynamics and understanding its global implications for carbon cycling (Lloyd and Taylor, 1994).
In a previous US continental-scale study, we derived empirical nonlinear relationships between SOC and environmental factors that achieved comparable prediction accuracy to a random forest (RF) machine learning approach (Mishra et al., 2022). In this study, we apply a similar approach using both global field observations and ESMs to (1) identify key observed environmental controllers of global SOC stocks and their functional relationships and (2) evaluate ESMs using these observational benchmarks. Simulated SOC stocks from three ESMs used in Phase 6 of the Coupled Model Intercomparison Project (CMIP6) – the Community Earth System Model (CESM; Hurrell et al., 2013), the UK Earth System Model (UKESM; Sellar et al., 2019), and the Beijing Climate Center model (BCC; Xiao-Ge et al., 2019) – were benchmarked with over 50 000 SOC profile observations across the globe. We used a machine learning (i.e., random forest) approach with 46 environmental factors to identify the key environmental controllers of SOC stocks at the global scale. We then applied a generalized additive model (GAM) to derive the predictive relationships between these key environmental factors and SOC stocks in both observations and ESM-simulated SOC stocks. The specific objectives of this study were to (1) identify dominant environmental controllers of SOC stocks in field observations and CMIP6 ESM simulations, (2) derive the observed and ESM-modeled functional relationships between environmental factors and SOC stocks, and (3) analyze these functional relationships to inform needed improvements in ESM representations of SOC dynamics.
2.1 Soil organic carbon stock observations
We used two datasets of SOC stocks: one for the upper 30 cm of soil (i.e., 0–30 cm) and one for the upper meter of soil (i.e., 0–100 cm). We note that, by limiting our analysis to these depth intervals, we may not be accounting for the total SOC stocks, as large SOC stocks can be found at much deeper depths in some soils (e.g., peatlands). The World Soil Information Service (WoSIS) compiled SOC profiles across the globe after quality assessment. The 2019 snapshot of the WoSIS dataset contained 111 380 soil profiles with SOC content information (given in grams of carbon per gram of soil) at different soil depths (Batjes et al., 2020). We estimated the SOC stock (g C m−2) at different soil layers using the following expression:
where G is the coarse fragment fraction (%), BD is the bulk density of soil (g m−3), and D is the soil layer depth (m).
When the measured bulk density value was absent from the dataset, we used a pedotransfer function (Yigini et al., 2018) to estimate the soil bulk density:
Here, OM is the organic matter, equivalent to SOC × 1.724, with SOC content in percent (%); α, β, and γ are fitting parameters. We found α=0.32, β=1.30, and γ=0.0089 after fitting WoSIS data to this equation.
Another dataset that we used in this study was compiled from Mishra et al. (2021). This dataset contained 2546 soil profiles with SOC stock (g C m−3) information from permafrost regions in North America, northern Eurasia, and the Qinghai–Tibet Plateau. In total, we used 113,926 soil profile observations from these two data sources. SOC stocks of different soil layers were then summed to SOC stocks at the 0–30 cm and 0–100 cm depth intervals. Because not all of these soil profiles covered the whole 0–30 cm or 0–100 cm intervals, we used a total of 54 000 soil profiles that included SOC stock information for both depth intervals. The geographical distributions of soil profiles used in this study are shown in Fig. 1. Because SOC stock values across the globe were highly skewed, we used a natural logarithm transformation in this study.
2.2 Environmental predictors of SOC stocks
The storage and cycling of SOC are controlled by multiple environmental factors. In this study, we used observations of 46 environmental variables, which represented major soil-forming factors (McBratney et al., 2003). A total of 21 of the 46 environmental variables were climatic variables, including the annual average temperature, precipitation, evapotranspiration, drought severity index, and statistics for different temporal scales (e.g., during the wettest and driest quarter in a year). A total of 13 of the 46 variables described soil properties (e.g., clay content, sand content, silt content, soil texture, pH, and cation exchange capacity). Six variables represented topographic factors (e.g., elevation and soil depth). Six variables represented land use and land cover types. All of the categorical variables were converted to integer variables, and the environmental variables were resampled to a common 1 km resolution. The environmental factors, their original spatial resolution, and data sources are provided Table S1 in the Supplement.
2.3 Selection of dominant environmental controllers of SOC stocks
We used RF to select dominant environmental predictors of SOC stocks within biomes and at the global scale in both observations and ESMs. RF is an ensemble learning method that is an extension of the classical classification and regression trees (CART) approach. Building a collection of uncorrelated CARTs through bootstrapping the samples and applying the random subspace method at each branch of the trees, RF improves prediction performance (Breiman, 2001; Wiesmeier et al., 2011; Mishra et al., 2020). RF is well known for its strength in modeling highly nonlinear relationships between the predictors and is robust to overfitting (Chagas et al., 2016). Moreover, RF is not very sensitive to the choice of the hyperparameters, making it one of the most popular off-the-shelf models for many classification and regression problems.
In this study, we trained the RF model using SOC content as a response variable and environmental factors as predictors. The model performance was evaluated using the coefficient of determination (R2) and root-mean-square error (RMSE). A 10-fold cross-validation was used to compute the R2 and RMSE values. Biome-specific analyses were conducted on a subset of the global dataset. For biome classification, we used the International Geosphere–Biosphere Programme (IGBP) land classes (Loveland and Belward, 1997). The “randomForest” package in R was used to train an RF model using all of the observed environmental factors in the dataset and to identify dominant environmental controllers of SOC stocks. Prior to fitting into the final model, we performed a potential collinearity test among the environmental variables by calculating pairwise correlations and the variance influence factor (VIF) values. Predictors showing a VIF value greater than 10 were omitted, leaving 14 uncorrelated environmental predictors of SOC stocks in the observations.
2.4 Generalized additive model
The generalized additive model (GAM) approach is an extension of generalized linear models that employs spline functions to model nonlinear relationships between predictor and response variables (Arnold et al., 2013). In GAMs, the relationship between the predictor and response variable can be modeled as follows (Hastie and Tibshirani, 1987):
Here, Y is the response variable (SOC), C is a constant, Xi represents the environmental controller variables, fi is a spline function for Xi, and p is the total number of environmental controllers. We used the “mgcv” package in R to build nonlinear relationships between environmental factors and SOC stocks using GAMs for both the SOC field observations and CMIP6 ESM-simulated SOC data (Arnold et al., 2013). The performance of GAMs was evaluated using the R2 and RMSE.
2.5 Earth system model outputs
We downloaded and aggregated the ESM-simulated SOC and environmental controller data from three ESMs that participated in CMIP6: the Community Earth System Model (Hurrell et al., 2013), the UK Earth System Model (Sellar et al., 2019), and the Beijing Climate Center model (Xiao-Ge et al., 2019). These ESMs included most of the environmental factors used by CMIP6 ESMs. ESMs did not report depth-dependent soil carbon projections, making direct comparison with depth-dependent SOC observations difficult. The majority of land models used in the ESMs were designed to simulate topsoil carbon for topsoil depth; thus, we assumed that the simulated soil carbon is contained within the top 1 m of the soil profile to simplify comparison with observations.
3.1 Descriptive statistics of SOC observations
The average global SOC stock within the 0–1 m depth interval was 13.5 kg C m−2, ranging from 0.14 to 435.3 kg C m−2. Our results indicate substantial variability in global-scale SOC observations, as the standard deviation (18.2 kg C m−2) is greater than the average SOC stock. Summary statistics of SOC stocks at the global scale and within different biomes are presented in Table 1. Boreal forests and temperate forests exhibited higher SOC stocks compared with other biomes, whereas tundra and the tropical and subtropical broadleaf forests showed lower and relatively similar average SOC stocks. Tundra and the tropical and subtropical grasslands and savannas exhibited similar and lower standard deviations with respect to SOC stock values. Conversely, boreal forests and shrublands showed higher standard deviations, indicating a broader range in SOC stock values. The distributions of total SOC stocks across different biomes are presented in Fig. 2.
3.2 Dominant environmental controllers of SOC stocks in observations and ESMs
At the global scale, we found that the diurnal temperature, drought severity index, annual temperature, and cation exchange capacity are the dominant environmental controllers of SOC stocks in the observations (Fig. 3). By including all environmental controllers, the RF model explained 61 % of observed global spatial variation in SOC stocks. The R2 values ranged from 43 % in boreal forests to 62 % in croplands (Table 2), with the importance of key environmental controllers varying across biomes (Fig. 4). In croplands, precipitation, drought, diurnal temperature, and cation exchange capacity were identified as the dominant controllers of SOC stocks. In grasslands, annual temperature, cation exchange capacity, and sand content were the dominant controllers. In forests, the cation exchange capacity, precipitation, and temperature were the dominant controllers, while in shrublands, annual temperature, soil pH, and cation exchange capacity were the most important controllers. In savannas, soil-related variables, temperature, and precipitation were the most important controllers. Across all land cover types, we found that the cation exchange capacity and seasonal climatic variables were the dominant environmental controllers of SOC stocks.
In contrast, the RF model with eight environmental variable predictors made near-perfect predictions of ESM-simulated SOC stocks (average R2=0.95; the R2 values for the UKESM, CESM, and BCC models were 0.99, 0.89, and 0.98, respectively). In contrast to the results obtained from the observed SOC stocks, the dominant controllers of ESM-simulated SOC stocks were annual temperature, NPP, and annual precipitation (Fig. 5), with NPP being by far the most dominant predictor of SOC stocks in the UKESM.
3.3 Predictive relationships between environmental factors and SOC stocks
The dominant environmental controllers of observed SOC stocks, as identified by the RF model, were used in a GAM to derive predictive relationships. We retrieved explicit analytical expressions by fitting splines derived from the GAM to the observation dataset. Despite its role as the sole carbon source for soil, our results did not show NPP as a strong controller of observed SOC stocks (Fig. 6a). In contrast to field observations, all ESMs showed a significant dependence (exponential increase) of SOC stocks on NPP. Our results also showed that observed SOC stocks increased almost linearly with observed annual precipitation (Fig. 6b). In contrast, ESMs showed different relationships between SOC and precipitation. We found a nonlinearly increasing SOC with precipitation in the CESM, an initial sharply increasing and then decreasing relationship in the UKESM, and a decreasing relationship in the BCC ESM. Regarding the relationship of SOC storage with soil texture and elevation, ESMs did not capture the observed relationships. Our results indicated that observed SOC stocks decreased with clay content in the interval between 0 % and 20 %, whereas they increased with clay content above 20 % (Fig. 6c). Similarly, observed SOC stocks increased with silt content up to 55 % and decreased thereafter (Fig. 6d).
SOC stock functional relationships differed between the three ESMs, and, in many cases, they also differed from the relationships that we derived from observations. In terms of the effects of annual temperature on modeled SOC storage, we found that SOC stocks decreased with increasing annual temperature and were most sensitive to temperature within the range of 0 and 10 °C (Fig. 6e). However, while the three ESMs captured the general negative relationship between SOC storage and temperature, none of them correctly described the varying sensitivity of SOC stocks across different temperature ranges, especially in extreme temperature ranges (< 0 and > 20 °C). With respect to representing the control of elevation on SOC storage, only the UKESM showed patterns consistent with observations: SOC storage remained stable at elevations below 2000 m and decreased at elevations above 2000 m (Fig. 6f).
Previous studies have suggested that the spatial variation in SOC depends on multiple environmental factors, such as climatic and edaphic variables, geography, and vegetation. In this study, we found that climatic variables (i.e., temperature and precipitation) are the most important controllers of global SOC stocks, followed by edaphic variables (i.e., cation exchange capacity), topography (i.e., elevation), and vegetation (i.e., NPP). Using boosted regression trees, Luo et al. (2021) studied edaphic and climatic controls on SOC dynamics at different soil depths and found that soil type and climatic variables are the most important variables for explaining SOC stocks (Luo et al., 2021). In this study, we found that seasonal climatic variables, such as diurnal temperature range and precipitation seasonality, are among the most important environmental controllers of the spatial variation in SOC stocks. This result indicates the critical role of seasonal and interannual climatic variables in understanding SOC dynamics.
The importance of climatic variables on global SOC storage emerges from their close links with processes that affect ecosystem productivity and soil microbial activity. Consistent with our findings, Wiesmeier et al. (2014) reported climatic variables (temperature and precipitation) as significant controllers of SOC stocks up to 1 m depth in German soils under oceanic climate conditions. Similarly, Sreenivas et al. (2014) used RF to predict SOC variability across semiarid and humid areas of India in the top 30 cm of soil and found that the top three environmental controllers were land cover, the mean temperature of the hottest months, and mean annual precipitation. In our analysis, the overall relative importance of climatic variables was significantly higher than other variables at both the global and biome scales.
Soil properties were identified as the second most important controllers of global SOC stocks. Soil properties influence various processes that govern soil carbon dynamics. For example, soil properties impact microbial activity, porosity, and oxygen availability in the soil profile, which directly or indirectly control soil water dynamics, plant growth, and SOC stocks. Consistent with our findings, Luo et al. (2021) reported that sand content, silt content, and soil pH were significant controllers of SOC stocks in all soil depths globally.
The Palmer drought index used in this study explicitly includes factors such as evapotranspiration, precipitation, and temperature. This index accounts for controls on the soil water balance, making it a better predictor than precipitation alone. Drought severity and duration play crucial roles in influencing the extent of soil carbon losses through microbial respiration (Borken and Matzner, 2009). A decline in soil CO2 efflux is observed as both the quantity and frequency of precipitation events decrease (Harper et al., 2005). In the initial phases of drought, elevated soil CO2 emissions occur due to the rapid response of plants and microorganisms to environmental stress (Ru et al., 2018). However, as drought intensifies, overall CO2 emissions diminish due to reduced root growth and microbial CO2 efflux caused by increasing soil dryness (Hasibeder et al., 2015). Similar to the impacts of drought duration, drought intensification results in a decrease in total CO2 emission by suppressing soil microbial activity and associated soil CO2 fluxes (Harper et al., 2005; Hu et al., 2020).
Cation exchange capacity is a soil property that indicates the active soil surface to which SOC can be adsorbed, with polyvalent metal cations playing a significant role in SOC stabilization by binding organic compounds to mineral surfaces (O'Brien et al., 2015; Solly et al., 2020). O'Brien et al. (2015) found that exchangeable soil Ca2+ is a significant predictor of SOC stocks. This relationship is supported by the mechanism in which Ca2+ and Mg2+ promote clay flocculation and bind organic matter to clay surfaces. Solly et al. (2020) reported that SOC and cation exchange capacity are significantly related in both topsoil and subsoil with a strong positive relationship.
After climatic factors and cation exchange capacity, topography and vegetation (NPP) were important controllers of observed global SOC stocks. The effect of NPP on observed SOC stocks was found to be relatively small (∼ 6 % in soils at 0–100 cm depth). Similar to our findings, Luo et al. (2021) reported that NPP explained about 10 % of the variation in SOC stocks. NPP provides the primary inputs of carbon to the soil, and NPP generally increases with moisture, temperature, and CO2 levels, up to a certain limit. NPP also depends on the availability of soil nutrients. Most ESMs tend to overestimate the increase in SOC pools in response to NPP increases (Todd-Brown et al., 2013). The effect of NPP on SOC also varies depending on biome type and soil depth (Luo et al., 2021; Georgiou et al., 2021). The contribution of NPP to SOC stocks mostly depends on how much NPP ends up in the soil and how it is translocated to different soil depths. Georgiou et al. (2021) reported a saturating relationship of SOC stocks with increasing NPP in a global observational dataset. However, Chen et al. (2018) reported high SOC stocks associated with increasing productivity and soil water-holding capacity (Chen et al., 2018).
The three CMIP6 ESMs that we analyzed predicted SOC stocks mostly as a function of temperature, precipitation, and NPP. These ESMs simulated positive correlations between SOC stocks and NPP (Fig. 6a), resulting in higher SOC stocks in areas with a high NPP (Shi et al., 2013; Sun et al., 2016). In these ESMs, the effects of temperature and precipitation on SOC stocks are driven by soil respiration. Most current ESMs simulate the response of soil respiration to temperature using either the Q10 or Arrhenius equations (Wynn et al., 2006), where higher temperatures lead to increased soil respiration, which, all else being equal, eventually reduces SOC stocks (Fig. 6e). Our results showed diverse controls of precipitation on SOC stocks in different ESMs. Todd-Brown et al. (2013) showed that ESMs simulate soil respiration as either (1) increasing monotonically with precipitation or (2) rising to a plateau under optimal precipitation and then experiencing a subsequent decline with further increases in precipitation. Consistent with those results, the ESMs that we analyzed in this study showed differences in the functional relationship between SOC storage and precipitation (Fig. 6b).
In this study, we found that while current ESMs consider key environmental controllers, such as soil temperature and moisture, with respect to regulating SOC storage, they show large inter-model variations in representing the functional relationships between these factors and SOC at the global scale. Meanwhile, none of the three ESMs investigated in this study align with the patterns that emerged from observations. These results signify potential unrealistic parameterization or missing critical processes in model representation. Moreover, our results highlight the importance of including other environmental factors in simulating global SOC storage. The observed global SOC stocks are influenced by processes beyond those currently considered in ESMs, such as temperature, precipitation, and NPP. Effects of other environmental factors, such as the drought severity index and cation exchange capacity should also be considered in future representations of SOC dynamics in ESMs. Our results highlight the critical role of observational data in benchmarking ESM simulations and informing model structures and parameterization. Although our findings cannot be directly used to develop model parameterizations, they can (1) point to categories of functional forms for controllers, (2) guide where efforts may be best applied to improve model functional forms (e.g., for the dominant controllers), and (3) provide information for modelers regarding where their models may differ significantly from observed functional forms for emergent relationships.
We note that although ecosystem-specific environmental factors (e.g., the presence or absence of certain species, such as earthworms or termites, or indicators of anthropogenic land management, such as the use of fertilizers or conservation agriculture practices) are not represented in the current generation of ESMs, incorporating these factors – specific to ecosystems like croplands or forests – may improve the accuracy of SOC stock predictions in comparison to observations.
Our results document a disagreement between the environmental controllers of SOC stocks in observations and CMIP6 ESM simulations. Specifically, while global SOC observations indicate that the diurnal temperature, drought index, annual temperature, cation exchange capacity, and other soil-related variables are critical for controlling SOC stocks at the global scale, ESMs overemphasize the roles of NPP, annual temperature, and annual precipitation in simulating SOC stocks. Moreover, the functional relationships between key environmental factors and SOC stocks show significant uncertainty among ESMs, with no agreement with those derived from observations. Optimizing current model parameterizations and developing new model structures that incorporate more processes in the soil carbon cycle are crucial for improving the simulation of global SOC storage in future ESMs. Our results highlight the importance of benchmarking ESMs against observations to improve the mechanistic understanding of the global soil carbon cycle.
The data and code used in this study are available at http://datadryad.org/stash/share/CqDKdhwmOsJeLt6isJe917o1ha8gmrjgpGz0WPG_JdI (Nyapane et al., 2024).
The supplement related to this article is available online at: https://doi.org/10.5194/bg-21-5173-2024-supplement.
KN conducted the analysis and prepared the original draft of the manuscript, UM conceptualized and supervised the study, and all other co-authors helped to review and edit the manuscript.
The contact author has declared that none of the authors has any competing interests.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.
This study was jointly supported by the Laboratory Directed Research and Development program of Sandia National Laboratories and the Reducing Uncertainties in Biogeochemical Interactions through Synthesis and Computation Science Focus Area (RUBISCO SFA), which is sponsored by the Regional and Global Model Analysis (RGMA) activity of the Earth Environmental Systems Modeling (EESM) program of the Earth and Environmental Systems Sciences Division (EESSD), Office of Biological and Environmental Research (BER), within the US Department of Energy's Office of Science. Sandia National Laboratories is a multi-mission laboratory managed and operated by National Technology and Engineering Solutions of Sandia, LLC, a wholly owned subsidiary of Honeywell International, Inc., for the US Department of Energy's National Nuclear Security Administration under contract no. DE-NA-0003525. Lawrence Berkeley National Laboratory (LBNL) is managed by the Regents of the University of California for the US Department of Energy under contract no. DE-AC02-05CH11231. Oak Ridge National Laboratory (ORNL) is managed by UT-Battelle, LLC, for the US Department of Energy under contract no. DE-AC05-00OR22725.
This research has been supported by the Office of Science (grant no. DE-NA-0003525).
This paper was edited by Kirsten Thonicke and reviewed by Lorenzo Menichetti, Renato Braghiere, and two anonymous referees.
Arnold, D., Wagner, P., and Baayen, R. B.: Using generalized additive models and random forests to model prosodic prominence in German, https://isca-speech.org/ (last access: 24 January 2023), 2013.
Arora, V. K., Katavouta, A., Williams, R. G., Jones, C. D., Brovkin, V., Friedlingstein, P., Schwinger, J., Bopp, L., Boucher, O., Cadule, P., Chamberlain, M. A., Christian, J. R., Delire, C., Fisher, R. A., Hajima, T., Ilyina, T., Joetzjer, E., Kawamiya, M., Koven, C. D., Krasting, J. P., Law, R. M., Lawrence, D. M., Lenton, A., Lindsay, K., Pongratz, J., Raddatz, T., Séférian, R., Tachiiri, K., Tjiputra, J. F., Wiltshire, A., Wu, T., and Ziehn, T.: Carbon–concentration and carbon–climate feedbacks in CMIP6 models and their comparison to CMIP5 models, Biogeosciences, 17, 4173–4222, https://doi.org/10.5194/bg-17-4173-2020, 2020.
Azizi-Rad, M., Guggenberger, G., Ma, Y., and Sierra, C. A.: Sensitivity of soil respiration rate with respect to temperature, moisture and oxygen under freezing and thawing, Soil Biol. Biochem., 165, 108488, https://doi.org/10.1016/j.soilbio.2021.108488, 2022.
Batjes, N. H., Ribeiro, E., and van Oostrum, A.: Standardised soil profile data to support global mapping and modelling (WoSIS snapshot 2019), Earth Syst. Sci. Data, 12, 299–320, https://doi.org/10.5194/essd-12-299-2020, 2020.
Breiman, L.: Random forests, Mach. Learn., 45, 5–32, https://doi.org/10.1023/A:1010933404324, 2001.
Borken, W. and Matzner, E.: Reappraisal of drying and wetting effects on C and N mineralization and fluxes in soils, Glob. Chang. Biol., 15, 808–824, https://doi.org/10.1111/j.1365-2486.2008.01681.x, 2009.
Chagas, C. D. S., Junior, W. D. C., Bhering, S. B., and Filho, B. C.: Spatial prediction of soil surface texture in a semiarid region using random forest and multiple linear regressions, Catena, 139, 232–240, https://doi.org/10.1016/j.catena.2016.01.001, 2016.
Chen, S., Wang, W., Xu, W., Wang, Y., Wan, H., Chen, D., Tang, Z., Tang, X., Zhou, G., Xie, Z., Zhou, D., Shangguan, Z., Huang, J., He, J. S., Wang, Y., Sheng, J., Tang, L., Li, X., Dong, M., Wu, Y., Wang, Q., Wang, Z., Wu, J., Stuart Chapin, F., and Bai, Y.: Plant diversity enhances productivity and soil carbon storage, P. Natl. Acad. Sci. USA, 115, 4027–4032, https://doi.org/10.1073/PNAS.1700298114, 2018.
Collier, N., Hoffman, F. M., Lawrence, D. M., Keppel-Aleks, G., Koven, C. D., Riley, W. J., Mu, M., and Randerson, J. T.: The International Land Model Benchmarking (ILAMB) system: design, theory, and implementation, J. Adv. Model. Earth Syst., 10, 2731–2754, https://doi.org/10.1029/2018MS001354, 2018.
Davidson, E. A., Samanta, S., Caramori, S. S., and Savage, K.: The Dual Arrhenius and Michaelis–Menten kinetics model for decomposition of soil organic matter at hourly to seasonal time scales, Glob. Change Biol., 18, 371–384, https://doi.org/10.1111/j.1365-2486.2011.02546.x, 2012.
Friedlingstein, P., Meinshausen, M., Arora, V. K., Jones, C. D., Anav, A., Liddicoat, S. K., and Knutti, R.: Uncertainties in CMIP5 climate projections due to carbon cycle feedbacks, J. Climate, 27, 511–526, https://doi.org/10.1175/JCLI-D-12-00579.1, 2014.
Georgiou, K., Malhotra, A., Wieder, W. R., Ennis, J. H., Hartman, M. D., Sulman, B. N., Berhe, A. A., Grandy, A. S., Kyker-Snowman, E., Lajtha, K., Moore, J. A. M., Pierson, D., and Jackson, R. B.: Divergent controls of soil organic carbon between observations and process-based models, Biogeochemistry, 156, 5–17, https://doi.org/10.1007/S10533-021-00819-2, 2021.
Harper, C. W., Blair, J. M., Fay, P. A., Knapp, A. K., and Carlisle, J. D.: Increased rainfall variability and reduced rainfall amount decreases soil CO2 flux in a grassland ecosystem, Glob. Chang. Biol., 11, 322–334, https://doi.org/10.1111/j.1365-2486.2005.00899.x, 2005.
Hasibeder, R., Fuchslueger, L., Richter, A., and Bahn, M.: Summer drought alters carbon allocation to roots and root respiration in mountain grassland, New Phytol., 205, 1117–1127, https://doi.org/10.1111/nph.13146, 2015.
Hastie, T. and Tibshirani, R.: Generalized additive models: Some applications, J. Am. Stat. Assoc., 82, 371–386, https://doi.org/10.1080/01621459.1987.10478440, 1987.
Hinge, G., Surampalli, R. Y., and Goyal, M. K.: Prediction of soil organic carbon stock using digital mapping approach in humid India Environ. Earth Sci., 77, 172, https://doi.org/10.1007/s12665-018-7374-x, 2018.
Hu, Z., Chen, H. Y. H., Yue, C., Gong, X. Y., Shao, J., Zhou, G., Wang, J., Wang, M., Xia, J., Li, Y., Zhou, X., and Michaletz, S. T.: Traits mediate drought effects on wood carbon fluxes, Glob. Chang. Biol., 26, 3429–3442, https://doi.org/10.1111/GCB.15088, 2020.
Hurrell, J. W., Holland, M. M., Gent, P. R., Ghan, S., Kay, J. E., Kushner, P. J., Lamarque, J. F., Large, W. G., Lawrence, D., Lindsay, K., and Lipscomb, W. H.: The community earth system model: a framework for collaborative research, B. Am. Meterol. Soc., 94, 1339–1360, https://doi.org/10.1175/BAMS-D-12-00121.1, 2013.
Jiang, H., Deng, Q., Zhou, G., Hui, D., Zhang, D., Liu, S., Chu, G., and Li, J.: Responses of soil respiration and its temperature/moisture sensitivity to precipitation in three subtropical forests in southern China, Biogeosciences, 10, 3963–3982, https://doi.org/10.5194/bg-10-3963-2013, 2013.
Lal, R.: Soil health and carbon management, Food Energy Secur., 5, 212–222, https://doi.org/10.1002/fes3.96, 2016.
Lal, R.: Managing soils for negative feedback to climate change and positive impact on food and nutritional security, Soil Sci. Plant Nutr., 66, 1–9, https://doi.org/10.1080/00380768.2020.1718548, 2020.
Lauer, A., Eyring, V., Righi, M., Buchwitz, M., Defourny, P., Evaldsson, M., Friedlingstein, P., de Jeu, R., de Leeuw, G., Loew, A., and Merchant, C. J.: Benchmarking CMIP5 models with a subset of ESA CCI Phase 2 data using the ESMValTool, Remote Sens. Environ., 203, 9-39, https://doi.org/10.1016/j.rse.2017.01.007, 2017.
Lloyd, J. and Taylor, J. A.: On the temperature dependence of soil respiration, Funct. Ecol., 8, 315–323, https://doi.org/10.2307/2389824, 1994.
Loveland, T. R. and Belward, A. S.: The igbp-dis global 1km land cover data set, discover: First results, Int. J. Remote Sens., 18, 3289–3295, https://doi.org/10.1080/014311697217099, 1997.
Luo, Y. Q., Randerson, J. T., Abramowitz, G., Bacour, C., Blyth, E., Carvalhais, N., Ciais, P., Dalmonech, D., Fisher, J. B., Fisher, R., Friedlingstein, P., Hibbard, K., Hoffman, F., Huntzinger, D., Jones, C. D., Koven, C., Lawrence, D., Li, D. J., Mahecha, M., Niu, S. L., Norby, R., Piao, S. L., Qi, X., Peylin, P., Prentice, I. C., Riley, W., Reichstein, M., Schwalm, C., Wang, Y. P., Xia, J. Y., Zaehle, S., and Zhou, X. H.: A framework for benchmarking land models, Biogeosciences, 9, 3857–3874, https://doi.org/10.5194/bg-9-3857-2012, 2012.
Luo, Z., Viscarra-Rossel, R. A., and Qian, T.: Similar importance of edaphic and climatic factors for controlling soil organic carbon stocks of the world, Biogeosciences, 18, 2063–2073, https://doi.org/10.5194/bg-18-2063-2021, 2021.
McBratney, A. B., Santos, M. M., and Minasny, B.: On digital soil mapping, Geoderma, 117, 3–52, https://doi.org/10.1016/S0016-7061(03)00223-4, 2003.
Mekonnen, Z. A., Riley, W. J., Randerson, J. T., Shirley, I. A., Bouskill, N. J., and Grant, R. F.: Wildfire exacerbates high-latitude soil carbon losses from climate warming, Environ. Res. Lett., 17, 094037, https://doi.org/10.1088/1748-9326/ac8be6, 2022.
Mishra, U., Gautam, S., Riley, W. J., and Hoffman, F. M.: Ensemble machine learning approach improves predicted spatial variation of surface soil organic carbon stocks in data limited northern circumpolar region, Front. Big Data, 3, 528441, https://doi.org/10.3389/FDATA.2020.528441/FULL, 2020.
Mishra, U., Hugelius, G., Shelef, E., Yang, Y., Strauss, J., Lupachev, A., J. W. Harden, J. W., Jastrow, J. D., Ping, C.-L., Riley, W. J., Schuur, E. A. G., Matamala, R., Siewert, M., Nave, L. E., Koven, C. D., M. Fuchs, M., Palmtag, J., Kuhry, P., Treat, C. C., Zubrzycki, S., Hoffman, F. M., Elberling, B., Camill, P., Veremeeva, A., and Orr. A.: Spatial heterogeneity and environmental predictors of permafrost region soil organic carbon stocks. Sci. Adv., 7, eaaz5236, https://doi.org/10.1126/sciadv.aaz5236, 2021.
Mishra, U., Yeo, K., Adhikari, K., Riley, W. J., Hoffman, F. M., Hudson, C., and Gautam, S.: Empirical relationships between environmental factors and soil organic carbon produce comparable prediction accuracy to machine learning, Soil Sci. Soc. Am. J., 86, 1611–1624, https://doi.org/10.1002/saj2.20453, 2022.
Moyano, F. E., Manzoni, S., and Chenu, C.: Responses of soil heterotrophic respiration to moisture availability: An exploration of processes and models, Soil Biol. Biochem., 59, 72–85, https://doi.org/10.1016/j.soilbio.2013.01.002, 2013.
Moyano, F. E., Vasilyeva, N., and Menichetti, L.: Diffusion limitations and Michaelis–Menten kinetics as drivers of combined temperature and moisture effects on carbon fluxes of mineral soils, Biogeosciences, 15, 5031–5045, https://doi.org/10.5194/bg-15-5031-2018, 2018.
Nyapane, K., Gautam, S., Mishra, U., Tao, F., Yeo, K., Riley, W. J., Hoffman, F. M.: Observational benchmarks inform representation of soil organic carbon dynamics in land surface models, Dryad [data set], http://datadryad.org/stash/share/CqDKdhwmOsJeLt6isJe917o1ha8gmrjgpGz0WPG_JdI (last access: 12 November 2024), 2024.
O'Brien, S. L., Jastrow, J. D., Grimley, D. A., and Gonzalez-Meler, M. A.: Edaphic controls on soil organic carbon stocks in restored grasslands, Geoderma, 251, 117–123, https://doi.org/10.1016/j.geoderma.2015.03.023, 2015.
Ru, J. Y., Zhou, Y. Q., Hui, D. F., Zheng, M. M., and Wan, S. Q.: Shifts of growing-season precipitation peaks decrease soil respiration in a semiarid grassland, Glob. Chang. Biol., 24, 1001–1011, https://doi.org/10.1111/gcb.13941, 2018.
Sellar, A. A., Jones, C. G., Mulcahy, J. P., Tang, Y., Yool, A., Wiltshire, A., O'Connor, F. M., Stringer, M., Hill, R., Palmieri, J., Woodward, S., de Mora, L., Kuhlbrodt, T., Rumbold, S. T., Kelley, D. I., Ellis, R., Johnson, C. E., Walton, J., Abraham, N. L., Andrews, M. B., Andrews, T., Archibald, A. T., Berthou, S., Burke, E., Blockley, E., Carslaw, K., Dalvi, M., Edwards, J., Folberth, G. A., Gedney, N., Griffiths, P. T., Harper, A. B., Hendry, M. A., Hewitt, A. J., Johnson, B., Jones, A., Jones, C. D., Keeble, J., Liddicoat, S., Morgenstern, O., Parker, R. J., Predoi, V., Robertson, E., Siahaan, A., Smith, R. S., Swaminathan, R., Woodhouse, M. T., Zeng, G., and Zerroukat, M.: UKESM1: Description and Evaluation of the U.K. Earth System Model, J. Adv. Model Earth Syst., 11, 4513–4558, https://doi.org/10.1029/2019MS001739, 2019.
Shi, X., Mao, J., Thornton, P. E., and Huang, M.: Spatiotemporal patterns of evapotranspiration in response to multiple environmental factors simulated by the Community Land Model, Environ. Res. Lett., 8, 024012, https://doi.org/10.1088/1748-9326/8/2/024012, 2013.
Sierra, C. A., Trumbore, S. E., Davidson, E. A., Vicca, S., and Janssens, I.: Sensitivity of decomposition rates of soil organic matter with respect to simultaneous changes in temperature and moisture, J. Adv. Model. Earth Syst., 7, 335–356, https://doi.org/10.1002/2014MS000358, 2015.
Solly, E. F., Weber, V., Zimmermann, S., Walthert, L., Hagedorn, F., and Schmidt, M. W. I.: A critical evaluation of the relationship between the effective cation exchange capacity and soil organic carbon content in Swiss forest soils, Front. For. Glob. Change, 3, 00098, https://doi.org/10.3389/ffgc.2020.00098, 2020.
Sreenivas, K., Sujatha, G., Sudhir, K., Kiran, D. V., Fyzee, M. A., Ravisankar, T., and Dadhwal, V. K.: Spatial assessment of soil organic carbon density through random forests based imputation, J. Indian Soc. Remote Sens., 42, 577–587, https://doi.org/10.1007/s12524-013-0332-x, 2014.
Sun, Y., Piao, S., Huang, M., Ciais, P., Zeng, Z., Cheng, L., Li, X., Zhang, X., Mao, J., Peng, S., Poulter, B., Shi, X., Wang, X., Wang, Y. P., and Zeng, H.: Global patterns and climate drivers of water-use efficiency in terrestrial ecosystems deduced from satellite-based datasets and carbon cycle models, Glob. Ecol. Biogeogr., 25, 311–323, https://doi.org/10.1111/GEB.12411, 2016.
Todd-Brown, K. E. O., Randerson, J. T., Post, W. M., Hoffman, F. M., Tarnocai, C., Schuur, E. A. G., and Allison, S. D.: Causes of variation in soil carbon simulations from CMIP5 Earth system models and comparison with observations, Biogeosciences, 10, 1717–1736, https://doi.org/10.5194/bg-10-1717-2013, 2013.
Wieder, W. R., Hartman, M. D., Sulman, B. N., Wang, Y.-P., Kover, C. D., and Bonan, G. B.: Carbon cycle confidence and uncertainty: Exploring variation among soil biogeochemical models, Glob. Chang Biol., 24, 1563–1579, https://doi.org/10.1111/gcb.13979, 2018.
Wiesmeier, M., Barthold, F., Blank, B., and Kögel-Knabner, I.: Digital mapping of soil organic matter stocks using Random Forest modeling in a semi-arid steppe ecosystem, Plant Soil, 340, 7–24, https://doi.org/10.1007/S11104-010-0425-Z, 2011.
Wiesmeier, M., Barthold, F., Spörlein, P., Geuß, U., Hangen, E., Reischl, A., Schilling, B., Angst, G., von Lützow, M., and Kögel-Knabner, I.: Estimation of total organic carbon storage and its driving factors in soils of Bavaria (southeast Germany), Geoderma Reg., 1, 67–78, https://doi.org/10.1016/j.geodrs.2014.09.001, 2014.
Wynn, J. G., Bird, M. I., Vellen, L., Grand-Clement, E., Carter, J., and Berry, S. L.: Continental-scale measurement of the soil organic carbon pool with climatic, edaphic, and biotic controls, Global Biogeochem. Cy., 20, GB1007, https://doi.org/10.1029/2005GB002576, 2006.
Xiao-Ge, X., Tong-Wen, W., Jie, Z., Fang, Z., Wei-Ping, L., Yan-Wu, Z., Yi-Xiong, L., Yong-Jie, F., Wei-Hua, J., Li, Z., Min, D., Xue-Li, S., Jiang-Long, L., Min, C., Qian-Xia, L., and Jing-Hui, Y.: Introduction of BCC models and its participation in CMIP6, Adv. Clim. Change Res., 15, 533, https://doi.org/10.12006/j.issn.1673-1719.2019.039, 2019.
Yigini, Y., Olmedo, G., Reiter, S., Baritz, R., and Viatkin, K.: Soil organic carbon mapping: cookbook, Food and Agriculture Organization of the United Nations, Rome, https://openknowledge.fao.org/server/api/core/bitstreams/5628102c-c895-416b-bbf2-728cadf3bc70/content (last access: 8 November 2024), 2018.