Articles | Volume 19, issue 5
https://doi.org/10.5194/bg-19-1377-2022
https://doi.org/10.5194/bg-19-1377-2022
Research article
 | 
07 Mar 2022
Research article |  | 07 Mar 2022

MODIS Vegetation Continuous Fields tree cover needs calibrating in tropical savannas

Rahayu Adzhar, Douglas I. Kelley, Ning Dong, Charles George, Mireia Torello Raventos, Elmar Veenendaal, Ted R. Feldpausch, Oliver L. Phillips, Simon L. Lewis, Bonaventure Sonké, Herman Taedoumg, Beatriz Schwantes Marimon, Tomas Domingues, Luzmila Arroyo, Gloria Djagbletey, Gustavo Saiz, and France Gerard
Abstract

The Moderate Resolution Imaging Spectroradiometer Vegetation Continuous Fields (MODIS VCF) Earth observation product is widely used to estimate forest cover changes and to parameterize vegetation and Earth system models and as a reference for validation or calibration where field data are limited. However, although limited independent validations of MODIS VCF have shown that MODIS VCF's accuracy decreases when estimating tree cover in sparsely vegetated areas such as tropical savannas, no study has yet assessed the impact this may have on the VCF-based tree cover data used by many in their research. Using tropical forest and savanna inventory data collected by the Tropical Biomes in Transition (TROBIT) project, we produce a series of calibration scenarios that take into account (i) the spatial disparity between the in situ plot size and the MODIS VCF pixel and (ii) the trees' spatial distribution within in situ plots. To identify if a disparity also exists in products trained using VCF, we used a similar approach to evaluate the finer-scale Landsat Tree Canopy Cover (TCC) product. For MODIS VCF, we then applied our calibrations to areas identified as forest or savanna in the International Geosphere-Biosphere Programme (IGBP) land cover mapping product. All IGBP classes identified as “savanna” show substantial increases in cover after calibration, indicating that the most recent version of MODIS VCF consistently underestimates woody cover in tropical savannas. We also found that these biases are propagated in the finer-scale Landsat TCC. Our scenarios suggest that MODIS VCF accuracy can vary substantially, with tree cover underestimation ranging from 0 % to 29 %. Models that use MODIS VCF as their benchmark could therefore be underestimating the carbon uptake in forest–savanna areas and misrepresenting forest–savanna dynamics. Because of the limited in situ plot number, our results are designed to be used as an indicator of where the product is potentially more or less reliable. Until more in situ data are available to produce more accurate calibrations, we recommend caution when using uncalibrated MODIS VCF data in tropical savannas.

1 Introduction

Tree cover values derived from Earth observation (EO) data form a fundamental part of ecological research. They are used to estimate forest cover change, biomass, and carbon stocks (Bastin et al., 2019; Giriraj et al., 2017; Saatchi et al., 2011; Song et al., 2014) and to help identify key areas for conservation efforts (Miles et al., 2006) and as a basis for climatic and vegetation modelling and model evaluation (Brovkin et al., 2013; Burton et al., 2019; Kelley et al., 2013). All this research, in turn, plays a vital role in informing local, regional, and global environmental policies (Harris et al., 2012). As such, an EO product's accuracy is important to consider, as any errors in the initial tree cover estimate can be further compounded in downstream work.

Only a handful of EO products provide global maps of percent tree cover or forest and shrub cover distributions (Bartholomé and Belward, 2005; Bicheron et al., 2008), and fewer still provide information stretching over at least a decade (Friedl et al., 2002; Hansen et al., 2003; Sexton et al., 2013; DiMiceli, 2017). Of these, one of the products most widely used in ecological modelling is the Moderate Resolution Imaging Spectroradiometer Vegetation Continuous Fields (MODIS VCF) product (DiMiceli, 2017). MODIS VCF is a yearly product that provides percent tree cover globally at a spatial resolution of 250 m. The most recent iteration (Collection 6) is available for the years 2000 through to 2020. Its quantitative measure of woody cover is recorded annually and is described as a percentage of ground cover, making it particularly suited for use in evaluating dynamic global models (Lasslop et al., 2018; Rabin et al., 2017), as a proxy for in situ data that are harder to collect (Kelley et al., 2019), and to help define parameters for calculating global tree restoration potential (Bastin et al., 2019). MODIS VCF is also used to train alternative products, such as the newer finer-scale Landsat Tree Canopy Cover (TCC) product (Sexton et al., 2013).

As the VCF product has progressed from Collection 1 to its current Collection 6, several validations using in situ field data or higher-resolution remotely sensed data as a reference measurement have been carried out. These have been few and limited to sites within a biome (Montesano et al., 2009), a region (Hansen et al., 2005; White et al., 2005), or within a country (Gao et al., 2014; Sexton et al., 2013). The MODIS VCF product evaluated was the most recent collection available at the time (i.e. Collection 3: Hansen et al., 2005; White et al., 2005; Collection 4: Montesano et al., 2009; Collection 5: Gao et al., 2015; Sexton et al., 2013). To our knowledge, no such independent validation experiment has yet been conducted on Collection 6, which produces tree cover estimates in the same manner as Collection 5 but with improvements made to the upstream inputs to enhance its accuracy (DiMiceli, 2017). Likewise, validation of the finer-scale TCC product has been limited to its penultimate version and to the taiga–tundra circumpolar region (Montesano et al., 2016).

The validations found that MODIS VCF may be less suitable for estimating tree cover in sparsely vegetated areas. Huang and Siegert (2006) noted that MODIS VCF classified large areas of land as “bare” where their land cover classification system identified it as sparsely vegetated. Montesano et al. (2009) found that MODIS VCF data (Collection 4) overestimated cover in areas of low tree cover in taiga–tundra transition zones. Sexton et al. (2013) found that the Collection 5 product overestimated cover in areas of low cover (below 20 %) and underestimated in areas of higher tree cover, while Gao et al. (2015) found that MODIS VCF can only partially discriminate between tropical forest and non-forest, struggling in areas that have greater heterogeneity. Similarly to MODIS VCF (Montesano et al., 2009), Montesano et al. (2016) revealed an overestimation of the taiga–tundra low tree covers in the finer-scale Landsat TCC, suggesting that using VCF as training has propagated these overestimations into the higher-resolution product. What is clear from the history of these validation and comparison experiments is that MODIS VCF has accuracy issues in areas with low woody-vegetation cover, which has implications when its tree cover estimates are treated as accurately representative of real-world conditions. Failure to account for VCF's difficulty in estimating low woody covers can, therefore, lead to miscalibrated models and estimations that do not reflect real-world conditions. This, in turn, has knock-on effects on environmental policymaking, conservation efforts, and future ecological research, especially in areas with vegetation cover types that are most prone to error.

Tropical savannas have woody covers that fall within the range particularly affected by the reported MODIS VCF errors. A large proportion of these savannas can be found in tropical developing countries (Boval and Dixon, 2012) and are predicted to be home to half of the world's population by 2050 (Archer et al., 2020). Tropical savannas are therefore highly vulnerable to anthropogenic change. In the face of a growing population, land fragmentation, and changing climate, a savanna's ability to maintain robust ecosystem functions is directly linked to the amount of woody cover present (Sankaran et al., 2006). As a result, the ability to accurately monitor the state, dynamics, and woody cover trends of tropical savannas is a vital part of understanding how and why savannas are changing in the tropics (Harris et al., 2012; Miles et al., 2006), while also improving modelled climate projections and vegetation dynamics for this complex biome.

In this study, we evaluate MODIS VCF Collection 6 in tropical savannas and forest areas by comparing VCF's tree cover percentage to corresponding field data. Similarly, we evaluate Landsat TCC (version 4) to explore if, when VCF is used as training, VCF biases are propagated. We then, for MODIS VCF, characterize the observed bias in woody covers across both savanna and forest ecosystems and apply our calibration across the tropics to highlight the regions most likely affected by these inaccuracies. We finish by discussing the implications the uncovered biases may have on tropical vegetation and terrestrial biogeochemical modelling.

2 Methods

2.1 EO products and field data

We used the MODIS VCF Collection 6 product (250 m spatial resolution, DiMiceli, 2017) with tree cover values averaged across the years 2006 through to 2009 to reflect the range of the field data collection period. MODIS VCF was downloaded using the MODIS R package (Hijmans, 2017) in R 3.5.2 (R Core Team, 2018). We used the 2005 and 2010 30 m Landsat TCC version 4 product (https://lcluc.umd.edu/metadata/global-30m-landsat-tree-canopy-version-4, last access: 31 October 2021) and worked with the 2005 and 2010 average values. The product was downloaded manually from https://e4ftl01.cr.usgs.gov/MEASURES/GFCC30TC.003/ (last access: 31 October 2021).

The in situ field data were sourced from the “Tropical Biomes in Transition” project (TROBIT) (https://www.forestplots.net, last access: 31 October 2021, Torello-Raventos et al., 2013) and accessed via the https://www.forestplots.net database (Lopez-Gonzalez et al., 2011, 2009). The data we used include the corner locations and the canopy area index (CAI) values for 17 forest and 31 savanna sites distributed across Australia, Brazil, Bolivia, Cameroon, and Ghana (Fig. 1 and Table A1 in the Appendix; Fig. 2 in Torello-Raventos et al., 2013). The TROBIT field campaigns were carried out over a 3-year period, from 2006 to 2009, and the field plots used in this study are 1 ha in size except for BFI-01 (0.5 ha), BFI-02 (0.5 ha), BFI-03 (0.5 ha), CTC-01 (0.93 ha), and VCR-01 (0.6 ha).

All the sites fall within the tropics, that is, within 23.5 north and south of the Equator and were selected in regions where savannas and forests were in close proximity and within ecotones or “zones of tension”. As such, the sites sampled show a large variation in physiognomy and edaphic and climatic conditions (Table S1 in the Supplement; Veenendaal et al., 2015).

https://bg.copernicus.org/articles/19/1377/2022/bg-19-1377-2022-f01

Figure 1Location of sampling sites in Africa, Australia, and South America from the TROBIT project (based on Fig. 2 of Torello-Raventos et al., 2013) shown in MODIS VCF (DiMiceli, 2017). Of the 63 field sites, only the 48 sites with available GPS coordinates were selected (https://www.forestplots.net).

The classification of the TROBIT sites as either “forest” or “savanna” is based on the parameters described in Torello-Raventos et al. (2013) and Veenendaal et al. (2015). A savanna is a natural land cover that is not a forest, bare ground, or body of water. Forest is defined as woody vegetation with an average tree height of or exceeding 6 m and a canopy area index (CAI) value of at least 0.3 for open forests and 0.7 for forests. In addition, floristic differences (i.e. dominance of savanna species) are used to differentiate forests from taller-growing savannas that have similar CAIs and tree heights (see Fig. 9 of Torello-Raventos et al., 2013).

There is some ambiguity in how savannas and “grasslands” are defined. Some modelling-based research treat the two biomes as different (Whitley et al., 2017), while studies based on plant functional traits group them together (Solofondranohatra et al., 2018; White et al., 2000). As there is some concern that MODIS VCF will struggle to pick up woody cover in areas with really sparse vegetation, in this paper we decided to treat grasslands as part of the savanna domain.

2.2 Converting in situ canopy area index to MODIS VCF–Landsat TCC percent tree cover

CAI is defined as the sum of the projected areas of individual tree crowns divided by the ground area. In the TROBIT project (Torello-Raventos et al., 2013; Veenendaal et al., 2015), plot-wide CAI is made up of the sum of the upper-stratum, mid-stratum, and subordinate-stratum crown areas. Membership to a stratum is determined by the tree's dbh (diameter at breast height; upper stratum: dbh > 10 cm, mid stratum: 2.5 cm < dbh < 10 cm, and subordinate stratum: dbh < 2.5 cm; height > 1.5 m). About 50 trees per stratum per plot were measured to derive plot-specific allometric relations between stem diameter and crown area (Supplement B of Torello-Raventos et al., 2013). These were then applied to the whole plot to establish plot-level CAI. For the allometric relationships, tree crowns were treated as circles, and the individual tree projected crown area was determined using the average of crown radii measured along the four cardinal points (i.e. from the centre of the stem to the distance furthest from the stem).

CAI values do not account for within-site tree canopy distribution patterns and the overlap between individual tree canopies. We account for this by converting each CAI value into a probability distribution function incorporating the following two extreme scenarios: “enforced overlap”, where the location probability of individual canopies increases linearly from 0 to 1 across a site, and “unenforced overlap”, where individual canopies follow a uniform random distribution pattern and canopy overlap is not purposefully introduced (Fig. 2). We repeated this 1000 times per CAI measurement to determine the probability distribution of expected CAI for each field plot.

Unlike CAI, which is the fraction of ground covered by tree crowns, the percent tree cover value from MODIS VCF (and so Landsat TCC) is defined as “the portion of the skylight orthogonal to the surface which is intercepted by trees” (Hansen et al., 2002). To make MODIS VCF and Landsat TCC comparable to tree cover derived from TROBIT plot CAIs, we divided these product values by 0.8 as suggested by Hansen et al. (2002). This is also the standard approach in most modelling studies using VCF (e.g. Lasslop et al., 2020; Kelley et al., 2013; Burton et al., 2019). The 0.8 value can be thought of as a gap correction factor (GCF) that accounts for within-canopy gaps. Although the GCF has been shown to vary with vegetation type (Lloyd et al., 2008; 0.34–0.60) and crown cover (Tang et al., 2019b; 0.70–0.96), we opted to use 0.8, as we found that it yielded more conservative results compared to a variable GCF. It also avoided introducing additional parameters into our analysis.

Next, to account for the difference in size between the MODIS VCF pixel (250 m × 250 m) and the smaller field plot size (100 m × 100 m), we calculated the possible percent tree cover an area the size of a TROBIT field plot could have, given the MODIS VCF percent tree cover for a MODIS-sized pixel. This was done for two extreme scenarios: “enforced clumping,” where all the tree cover for the given MODIS VCF value is forcibly “clumped” on one side of the pixel, or “unenforced clumping”, where “clumping” is not enforced and tree cover is distributed randomly within the pixel (Fig. 3). The clumping scenarios introduce possible variations in percent cover due to the area and location mismatch between a TROBIT field plot and a MODIS pixel. A probability distribution was generated for each MODIS VCF pixel by calculating percent tree cover values for 1000 samples (100 m × 100 m) randomly placed within the 250 m × 250 m MODIS VCF pixel.

https://bg.copernicus.org/articles/19/1377/2022/bg-19-1377-2022-f02

Figure 2Visual representation of the effects of enforcing overlap within a (100 m × 100 m) TROBIT site with a given canopy area index (CAI). (a) Overlap is not enforced, and individual crowns follow a uniform random distribution. (b) Overlap is enforced by linearly increasing the probability of a canopy being located more on one side of the site (i.e. here the right side of the site) than the other. This results in tree canopies “overlapping” to a greater extent, which affects how accurately CAI represents actual canopy cover.

Download

For Landsat TCC, where the Landsat TCC pixels (30 m × 30 m) are smaller than the TROBIT field sites, we calculated a TCC percent tree cover to match the TROBIT field site size by summing the percent tree cover within the TCC pixel part found inside the TROBIT field site and then dividing the sum by the TROBIT site area. As TROBIT site orientation was not recorded, we randomized the angle between the TROBIT site and TCC pixel grid for each of the 1000 samples when generating the probability distribution. Enforced clumping was performed as per MODIS VCF (Fig. 3), with the direction of clumping randomized.

https://bg.copernicus.org/articles/19/1377/2022/bg-19-1377-2022-f03

Figure 3(a, c) Example of the effects of unenforced and enforced clumping in a 250 m × 250 m MODIS VCF pixel with 50 % tree cover. Clumping all the cover (green) to one side of the pixel (c) affects the average canopy cover value of a 100 m × 100 m sized TROBIT site (black boxes). (b, d) Example of the effects of unenforced and enforced clumping on 30 m × 30 m Landsat TCC pixels with a mix of tree cover (green) and non-tree cover (brown). White dotted lines are TCC pixel boundaries. Clumping all the cover to one side of the pixel (d) affects the average canopy cover value of a 100 m × 100 m sized TROBIT site (black boxes).

Download

2.3 Calculating uncertainty under different clumping–overlap scenarios

We compared both MODIS VCF and Landsat TCC with TROBIT under four different scenarios: (1) unenforced overlap and clumping, (2) enforced overlap and unenforced clumping, (3) unenforced overlap and enforced clumping, and (4) enforced overlap and clumping. Comparisons were conducted by fitting the following logit function:

(1) logit ( Pixel ) = C 0 + Δ × log ( C τ 1 / ( 1 - C τ 2 ) ) ,

where C0, Δ, τ1, and τ2 are optimized parameters and Pixel and C are the MODIS VCF–Landsat TCC pixel (post-conversion as described in Sect. 2.2) and TROBIT site probability distributions, respectively. This is similar to a standard linear regression of logit-transformed data, accounting for maximum and minimum bounds of 0 %–100 % tree cover, with τ1 and τ2 allowing for a non-symmetric transformation of tree cover. To account for the probability density of each point, we inferred the parameters in Eq. (1) using a total least-squares Bayesian inference technique using a Metropolis–Hastings Markov chain Monte Carlo step. Priors were uninformed but physically bounded (i.e. Δ, τ1, τ2>0) to assume an increasing relationship between MODIS VCF–Landsat TCC and C. Equation (1) allowed us to assume normally distributed model errors, thus describing our conditional probability of observations for a given parameter combination by a normal distribution (Gelman et al., 2013). Each combination was run over 10 chains, with 1000 warm-up iterations and 10 000 sampling iterations. Optimization was performed using the rstan 2.19.2 (Stan Development Team, 2019) package in R 3.5.2 (R Core Team, 2018). Our optimization accounts for potential errors in TROBIT cover, which includes those caused by the allometric construction of the CAI, provided that the errors are unbiased and remain roughly consistent across sites (Gelman et al., 2013). As the TROBIT plots have relatively small total errors associated with the allometric relationships (Table B1 in Torello-Raventos et al., 2013), systematic errors are unlikely to affect our results.

2.4 Mapping MODIS VCF uncertainty across the tropics

We evaluated the impact of the MODIS VCF biases inferred from these regression equations across the tropics by inverting our calculation of MODIS VCF bias (Fig. A1) as follows: first, the inverse (i.e. solving for C) of Eq. (1) was applied to MODIS VCF values after conversion to a 100 m × 100 m pixel size grid (matching the field site area); then this calibrated value was translated back to the original 250 m × 250 m VCF pixel size. As the inverse of Eq. (1) has no analytical solution, we found the rounded percent value of C that minimizes the absolute difference between the left- and right-hand side of the equation. For computational feasibility, we constructed maps of the tropics with calibrated MODIS VCF values (Fig. A2) by randomly sampling 5 iterations from each of our 10 optimization chains (50 in total) and masking out pixels with cover types not considered forest or savanna in the 500 m MODIS Land Cover Type (MCD12Q1 – Collection 6) (Sulla-Menashe and Friedl, 2018).

We then used the MCD12Q1 product to identify the areas of forest and savanna across the tropics in the MODIS VCF product. MCD12Q1 is widely used by the global land surface modelling community (e.g. Sellar et al., 2019; Wiltshire et al., 2021) and describes land cover in terms of 17 global land cover classes as per the International Geosphere-Biosphere Programme (IGBP, Table 3 in Sulla-Menashe and Friedl, 2018). The product is based on the same spectroradiometer (MODIS) and temporal resolution as the VCF product. Referring to the definition of savanna of Veenendaal et al. (2015), the following land cover classes were chosen to represent savanna: closed shrublands, open shrublands, woody savanna, savanna, and grassland; meanwhile forest encompasses the following: evergreen needleleaf forests, evergreen broadleaf forests, deciduous needleleaf forests, deciduous broadleaf forests, and mixed forests. We subset MCD12Q1 to the tropical zone between 30 north and south of the Equator and took the median class for the 2006 to 2009 period, matching the field data collection period.

For a more detailed land-cover-specific evaluation, we extracted the calibrated 250 m MODIS VCF pixel values for each corresponding 500 m MCD12Q1 pixel to construct land-cover-specific MODIS VCF tree cover frequency distributions (Fig. A3). Our tree cover calibration by cover type for the four clumping–overlap regression combinations was then calculated by multiplying each cover type MODIS VCF frequency distribution (Fig. A3) with curves representing the median, 5 %, and 95 % confidence lines of the calibration equation ensembles.

3 Results

3.1 Comparing MODIS VCF and Landsat TCC to tree cover from TROBIT field sites

MODIS VCF underestimates tree cover within the 19 % to 81 % range across all four combinations of enforced–unenforced overlap and clumping (black line, Fig. 4). Below 12 %, MODIS VCF tree cover values do not significantly disagree with TROBIT field data and may instead be overestimating tree cover (50 % confidence, dashed line, Fig. 4). A similar pattern is seen when tree cover exceeds 84 %: MODIS VCF does not differ significantly from TROBIT when there is enforced overlap (i.e. when tree canopies are clustering towards one side increasing the degree of canopy overlap, Fig. 4 right) but may underestimate tree cover when overlap is not enforced (i.e. tree canopies are spaced randomly within the site, Fig. 4 left).

https://bg.copernicus.org/articles/19/1377/2022/bg-19-1377-2022-f04

Figure 4MODIS VCF percent tree cover versus percent tree cover from TROBIT field data, taking into account uncertainties associated with tree cover spatial distributions within a MODIS pixel and/or field site. The four combinations are the following: (1) no overlap and no clumping, where tree canopies are randomly distributed within both the pixel and site; (2) no overlap and maximum clumping, where tree canopies are clustered in one area of the pixel and randomly distributed throughout the field site; (3) with overlap and no clumping, where tree canopies are randomly distributed within the pixel but overlap substantially within the field site; and (4) with overlap and maximum clumping, where tree canopies are clustered to one side within the pixel and overlap substantially within the site. The bolded dashed line in black shows the 1 : 1 relationship. The solid lines represent the median of the respective regressions (green for forest, orange for savanna, and black for forest and savanna combined), and the thin lines represent the 5 % and 95 % confidence interval of their respective regression lines. The vertical error bars represent uncertainty introduced by clumping; the horizontal error bars represent the uncertainty introduced by overlap.

Download

There is a clear difference in how accurately MODIS VCF estimates tree cover in forested areas (in green, Fig. 4) as opposed to areas identified as savannas (in orange, Fig. 4). In savanna sites, MODIS VCF significantly and consistently underestimates tree cover regardless of the amount of overlap and clumping. Significant underestimation (at 95 % confidence) occurs when in situ tree cover exceeds 19 %–21 % (without enforced clumping) or 11 %–12 % (with enforced clumping). In forest sites, MODIS VCF does not show the same pattern of systematic underestimation. Divergence does occur at high covers, depending on the enforcement of overlap or clumping. MODIS VCF underestimates tree cover where tree cover exceeds 84 % (at the 95 % confidence interval) when neither overlap nor clumping is enforced and overestimates where tree cover exceeds 78 % (at 5 % confidence interval) when both overlap and clumping are enforced.

https://bg.copernicus.org/articles/19/1377/2022/bg-19-1377-2022-f05

Figure 5Landsat TCC percent tree cover versus percent tree cover from TROBIT field data, taking into account uncertainties associated with tree cover spatial distributions within a TCC pixel and/or field site. The four combinations are the following: (1) no overlap and no clumping, where tree canopies are randomly distributed within both the pixel and site; (2) no overlap and maximum clumping, where tree canopies are clustered in one area of the pixel and randomly distributed throughout the field site; (3) with overlap and no clumping, where tree canopies are randomly distributed within the pixel but overlap substantially within the field site; and (4) with overlap and maximum clumping, where tree canopies are clustered to one side within the pixel and overlap substantially within the site. The bolded dashed line in black shows the 1 : 1 relationship. The solid lines represent the median of the respective regressions (green for forest, orange for savanna, and black for forest and savanna combined), and the thin lines represent the 5 % and 95 % confidence interval of their respective regression lines. The vertical error bars represent uncertainty introduced by clumping; the horizontal error bars represent the uncertainty introduced by overlap.

Download

Similar patterns can be observed with Landsat TCC (black line, Fig. 5). There is a significant underestimation of tree cover in the lower cover ranges up to 59 % when there is enforced overlap and up to 82 % when overlap is not enforced. In savanna sites (orange line, Fig. 5) the underestimation (at 95 % confidence) is significant and consistent for covers below 75 %–80 % (without enforced overlap) or below 52 %–60 % (with enforced overlap). In forest sites (green line, Fig. 5) there is no systematic difference.

3.2 Global estimates of post-calibration change in tropical tree cover

We assessed the impact of VCF's underestimation of intermediate tree covers across the tropics (IGBP forest and savanna land cover classes), using a calibration based on the combined forest and savanna sites (black curve, Fig. 4) instead of using the savanna-only sites for a savanna-specific calibration (orange curve, Fig. 4). This is because there were few TROBIT sites representing savanna with MODIS VCF tree cover values exceeding 40 %, and global land cover maps disagree on the distribution of savannas within the forest–savanna ecotone (Herold et al., 2008).

https://bg.copernicus.org/articles/19/1377/2022/bg-19-1377-2022-f06

Figure 6(a) The post-calibration change in tree cover that is statistically significant (95 % interval) in the same direction (positive or negative calibration leading to an increase or decrease in tree cover, respectively) across all four scenarios. (b) Uncertainty range of the post-calibration change in tree cover, calculated as the 90th percentile (maximum of the four scenarios in Fig. A2) minus the 10th percentile (minimum of the four scenarios in Fig. A2). (c) Geographic distance to the closest TROBIT site sampled. (d) Regions coloured to denote priority for field surveying to constrain map uncertainty (based on multiplying the uncertainty range of each pixel with the pixel's geographical distance to the closest TROBIT site sampled).

The distribution of tree cover change after calibrating against field data are similar across the four scenarios (Fig. A2), and the regions where all four scenarios agree on the direction of change (positive and negative) are substantial (Fig. 6). However, there are some differences caused by the uncertainty introduced by different extents of overlap and clumping. While we see a significant increase in tree cover across all clumping–overlap combinations in many regions of tropical savannas and grasslands (Pennington et al., 2018), such as in the forest–savanna mosaics that surround Congolian rainforests, we do not see the same pattern in the Cerrado of Brazil. This is likely because the African forest–savanna regions fall within the range of MODIS VCF values that consistently undergo a positive calibration ( 30 %–50 %, see Fig. A1), while the Cerrado of Brazil does not.

Using field plots over a limited geographic extent creates uncertainty that may be unaccounted for in our analysis when our calibration is broadly applied across the highly variable tropical-forest–savanna ecotone. By multiplying the uncertainty range of our calibrations with the geographical distance to the closest sampled TROBIT site, we identified priority regions for further field surveying (Fig. 6 bottom). We found Southeast Asia, Central America, and Mexico are areas where additional in situ observations would greatly help improve confidence. Field data from the northwestern region of South America, the southeast of the African continent, and Madagascar would also help.

As our calibrations were based on a limited number of sites in a limited number of regions, it is important to note that the maps shown in Figs. 6 and A2 are not definitive. For instance, we found a significant tree cover decrease in the Sahel post-calibration in multiple scenarios, which runs counter to the results of Brandt et al. (2020), who found that tree cover was underestimated in this region. This disparity may be due to our lack of field sites in these more arid regions, further highlighting the importance of more in situ data for more accurate and precise calibration. Therefore, our calibrations are most useful in identifying areas where MODIS VCF estimates may be more or less reliable.

3.3 Post-calibration change in tree cover within different vegetation classes in tropical ecosystems

When looking at our calibration in more detail, we see that MODIS VCF significantly underestimates tree cover in all the IGBP land cover classes that we considered, regardless of overlap or clumping (95 % confidence interval) (Fig. 7). The most substantial and significant underestimation is in the classes “woody savannas” and savannas. The underestimation is the largest in woody savannas, except when clumping and overlap are enforced (in purple, Fig. 7). This is because the peak in the tree cover frequency distribution for savannas aligns with where the calibration for maximum overlap and clumping is the largest (i.e. at about 20 % tree cover; see Fig. A3), while the peak in cover distribution for woody savannas (26 %–67 %, Fig. A3) aligns with the cover range that undergoes the greatest change (Fig. 7) in the other clumping and overlap scenarios.

“Open shrublands” only show a small underestimation of tree cover, despite the woody cover definition (10 %–60 %) matching the range where MODIS VCF most underestimates tree cover (26 %–67 % cover). The discrepancy may be because the majority of the open-shrublands class commission error is with the grasslands class (see Table S6 in Sulla-Menashe et al., 2019). The MODIS VCF tree cover in areas classified as open shrublands is therefore likely to be lower than the IGBP definition would suggest (see Fig. A3), resulting in calibrations that are more conservative.

We found significant increases in tree cover for forests in every calibration scenario, though net change is not significant (95 % confidence) when overlap is enforced. This can be explained by the presence of both negative and positive calibrations in the higher ranges of tree cover when overlap is enforced. Similarly, the net change is insignificant across all clumping and overlap scenarios for the IGBP classes matching the lower ranges of tree cover (grasslands, closed shrublands, and open shrublands).

https://bg.copernicus.org/articles/19/1377/2022/bg-19-1377-2022-f07

Figure 7Percent change in tree cover post-calibration (clockwise: no enforced clumping or overlap (black), enforced clumping and no enforced overlap (blue), no enforced clumping and enforced overlap (red), and enforced clumping and overlap (pink)) in the forest supercategory and the five savanna classes. The palest tone indicates positive change; the mid tone indicates negative change; and the darkest tone indicates net change. Error bars denote the 5 %–95 % confidence interval; if the error bar extends past the x axis, the post-calibration change is not considered significant.

Download

4 Discussion

While MODIS VCF is a powerful and accessible tool to map tree cover, our field-data-based calibrations indicate that the latest MODIS VCF Collection 6 is missing a lot of woody cover, even when uncertainty introduced by site canopy overlap and clumping within the MODIS VCF pixel are accounted for. The Landsat TCC product, which may be viewed as an alternative with a higher spatial resolution, behaves in a similar manner. Our map (Fig. 6 top) highlights that this potential underestimation of woody cover is mainly occurring in tropical savannas. Moreover, the highest underestimation in the savanna classes occurs when there is no enforced overlap (Fig. A2) (i.e. when there is a uniform random distribution of trees) which is the scenario that most likely reflects the TROBIT savanna plots. This is evidenced by work done by Veenendaal et al. (2015), where TROBIT plots were tested for complete spatial randomness, and only minor indications of overlap were found. Woody savannas, as an example, may have their tree cover underestimated by up to 32 % (95 % confidence) when neither clumping nor overlap is enforced (grey tones, Fig. 7). If our results are representative of the tropics, then overall, MODIS VCF may be underestimating tropical tree cover by between 7 %–29 % for unenforced clumping and overlap or 0 %–21 % for when either clumping or overlap are enforced (5 %–95 % confidence).

An overestimation at the lower end of the cover (<20 %) (Hansen et al., 2002; Sexton et al., 2013) and underestimation in the lower to middle range of cover (20 %–60 %) have been identified in validations of previous MODIS VCF collections (Gross et al., 2018; Yang and Crews, 2019) and Landsat TCC versions (Montesano et al., 2016). According to its definition, MODIS VCF only maps trees that are 5 m or taller (Hansen et al., 2003), while the TROBIT CAI includes all trees with a minimum dbh of 2.5 cm, as well as trees with a height exceeding 1.5 m when dbh < 2.5 cm. This could explain our observed underestimation in the lower tree cover ranges for both MODIS VCF and Landsat TCC. In fact Montesano et al. (2016) showed an improved match between Landsat TCC and their lidar-derived tree cover reference data when reducing the height threshold from 5 to 2 m. However, because of how our field reference CAI is derived, we were not able to conclusively link the 5 m threshold to our observed underestimation.

On the other hand, when looking at the relationship between TROBIT's upper-stratum canopy height and the difference between TROBIT and VCF, we would have expected an increasing underestimation in the lower height ranges. Instead we found a low R2 and a mixture of under- and overestimations in heights between 0 and 10 m (Fig. A4). This suggests that the inclusion of trees below 5 m height in the TROBIT inventory does not fully explain the observed underestimation. However, as the relationship between upper canopy heights and the subordinate strata composition (and canopy cover thereof) varies widely depending on factors including ecosystem type and altitude (Rutten et al., 2015), more research needs to be done with in situ height data.

We also found discrepancies between the tree cover values derived from MODIS VCF and the corresponding class definition of the MCD12Q1 product (Fig. A3) which again suggests that the 5 m height threshold may not always apply in MODIS VCF. For example, MODIS VCF recorded tree cover in the open-shrublands and “closed-shrublands” classes of the MCD12Q1 product (Fig. A3), even though the height range for these classes is 1–2 m. For the savanna class, MODIS VCF yields a percent tree cover range that matches closely with the savanna class definition (between 10 % and 30 %), despite the differing tree thresholds for MODIS VCF and IGBP (5 m minimum for MODIS VCF and 2 m minimum for IGBP). These discrepancies suggest one of the following three things: open/closed shrublands and savannas contain trees taller than 5 m, MODIS VCF is distinguishing trees below the 5 m threshold, or some combination of both.

Another explanation for the discrepancy between the IGBP class definitions and those estimated through MODIS VCF could be the between-class omission and commission errors (Fig. 5 in the present paper and Table S6 in Sulla-Menashe et al., 2019). For example, the accuracy for closed shrublands is particularly low. It is mainly confused with open shrublands, woody savannas, and savannas. The majority of the open-shrublands class commission error is with the grasslands class, and there is confusion to a lesser extent between open shrublands, woody savannas, and savannas. Also, the “cropland/natural vegetation mosaics” class is often mapped as closed shrublands, woody savannas, savannas, or grasslands.

More work needs to be done to evaluate how effective both MODIS VCF, Landsat TCC, and MCD12Q1 are at implementing the height thresholds in their respective “tree” definitions, as this may have implications when MODIS VCF, Landsat TCC, and MCD12Q1 are used for global model calibration or validation.

Overall, our results suggest that the biases found in the previous collections may have persisted in Collection 6, despite reported improvement in accuracy (DiMiceli et al., 2017). This indicates that the biases introduced by binning the training data (Gerard et al., 2017) and using a CART (classification and regression tree) model (Hanan et al., 2013) are inherent and still present within this version of MODIS VCF. Similar results for MODIS VCF and Landsat TCC also suggest that by training TCC with VCF tree cover these biases have been propagated into the finer-scale product. There is a risk of bias propagation when MODIS VCF (or related products) is used as a single source for benchmarking models (e.g. Brandt et al., 2017; Lasslop et al., 2020; Burton et al., 2019; Kelley et al., 2019, 2021). To avoid this, studies should try use multiple data sources (e.g. for woody cover possibly the GEDI – Global Ecosystem Dynamics Investigation – lidar-based canopy cover product, Tang et al., 2019a) whether for model calibration/validation (e.g. Sellar et al., 2019; Wiltshire et al., 2021; Burton et al., 2021) or hypothesis testing (e.g. Taylor et al., 2012).

We suggest that while MODIS VCF and landsat TCC give a good overview of tree cover on a global scale, both should be used cautiously in savanna regions. Special care should be taken in savannas, a biome that has long been noted as being challenging for EO products to characterize, as solitary trees in the landscape tend to be missed by global tree cover products (Jung et al., 2006; Brandt et al., 2020). The poor performance of MODIS VCF and Landsat TCC in savannas in particular (Gaughan et al., 2013; Gross et al., 2018; Kumar et al., 2019) emphasizes the importance of continuous independent validation and re-calibration of these products. The ecosystem functions of savannas can vary drastically with just a slight difference in tree cover (Gaughan et al., 2013), and even slight errors may create issues in how we interpret the state and dynamics of the biome, which in turn affects how the land is managed.

Work on forest restoration potential would also be impacted. Bastin et al. (2019), for example, used MODIS VCF to estimate tree cover in agricultural land. As this tree cover is likely to have been underestimated substantially, the derived available land space for replanting may be less than projected, with the restoration potential being overestimated. However, our results also indicate an underestimated tree cover in woodier savannas and forests. Accounting for this, the restoration potential could actually be greater than anticipated because the carrying capacity of a unit of land could be greater than previously thought. Calibration could also result in a more uniform cover distribution across regions, producing a more gradual transition between low-cover savannas and high-cover forests. This could have implications for work that, for example, uses MODIS VCF to study forest–savanna dynamics and bi-stability (Lasslop et al., 2018; Wuyts et al., 2017; Xu et al., 2016).

To ensure the appropriate use of both products, we suggest that where field data are available, the products should be calibrated for use in the target region. However, calibrating on a large scale using field data as a reference presents several challenges. Firstly, different in situ measurement techniques tend to measure different types of tree cover (e.g. Fiala et al., 2006; Korhonen et al., 2006; Rautiainen et al., 2005), and each will require a specific conversion method to enable direct comparison with MODIS VCF or Landsat TCC. For example, the comparison of Montesano et al. (2016) did not acknowledge VCF's and thus TCC's “within canopy gaps”, which may explain their observed underestimation in covers above 80 %. In our case, to account for gaps between tree crowns, we applied the 0.8 “gap correction factor” to the CAI. However, the GCF and resulting tree cover could vary widely on a plot-by-plot basis (Lloyd et al., 2008). With further in situ data that describe tropical vegetation-type-specific GCF variation, we may be able to incorporate site-specific GCFs into our analysis.

There is also the uncertainty associated with the field data collection. In our case, the site-specific CAI standard errors (Supplement B in Torello-Raventos et al., 2013) are small and show no systematic bias and are therefore not expected to significantly change our results. However, our results in Fig. 6 have been extrapolated from a limited number of field sites, with somewhat limited distribution across the tropics. While our uncertainty map gives a good idea of areas of concern, for a more robust description of MODIS VCF's accuracy, we would need substantially more widely distributed in situ sites to represent the tropics. Field data collection remains costly and labour intensive, and adopting protocols that ensure standardization and data sharing via information platforms would facilitate large-scale validation exercises. In the case of validating woody cover products, including direct tree cover measurements such as crown area would also help.

However, the reality for much of pre-existing data is that they are not standardized, and field data (whether standardized or not) are not designed to be directly comparable to remote-sensing-derived variables. Here, our technique offers a potential solution to this. By accounting for uncertainties that arise from differences between the available in situ data and the remotely sensed variable, we are able to use in situ data, which may initially seem unsuitable, to carry out a more conservative evaluation.

Finally, factors such as cloud cover, landscape heterogeneity, phenology, vegetation type, and soil type affect the accuracy of remotely sensed products like MODIS VCF and Landsat TCC (Hansen et al., 2003; Huete et al., 1997; Smith et al., 2002). Data characterizing these at the plot level would help identify potential confounding factors affecting performance and so help further constrain uncertainties.

Alternatively, comparing MODIS VCF to other land cover maps or higher-resolution remotely sensed data are recommended (Gross et al., 2018; Lary and Lait, 2006; Montesano et al., 2016), though without a large-scale effort to re-calibrate MODIS VCF and products trained using VCF like Landsat TCC, the question of how appropriate MODIS VCF is for use in both forests and savannas in the tropics will remain. By highlighting the extent to which MODIS VCF struggles to estimate tree cover in tropical forests and savannas, we hope to inform the future use of this product and improve its useability.

5 Conclusion

We found that MODIS VCF significantly underestimates tree cover in tropical forests and savannas, even when within-site and pixel–site variation are accounted for in calibration. We also found that using MODIS VCF for training likely propagates these biases, even in the finer-scale Landsat TCC. As MODIS VCF is a product that is commonly used in a wide variety of ecological research including vegetation modelling, estimating restoration potential, and identifying forest–savanna bimodality, we stress that more independent work on validating and re-calibrating is required before its tree cover estimates can be relied upon in the tropics.

Appendix A

Table A1Site names, locations, canopy area index values, MODIS VCF percent tree cover values, cover type, and TROBIT site descriptions of the 48 TROBIT project plots used in this study. TROBIT site descriptions are based on Table S1 of Veenendaal et al. (2015).

Download Print Version | Download XLSX

https://bg.copernicus.org/articles/19/1377/2022/bg-19-1377-2022-f08

Figure A1The calibration curves developed for MODIS VCF based on four pixel–site mismatch scenarios (no clumping and no overlap, enforced clumping and no overlap, no clumping and enforced overlap, and enforced clumping and enforced overlap). The dashed line signifies the “ideal” 1 : 1 relationship wherein calibrated MODIS VCF is unchanged from the original MODIS VCF values. The shaded regions represent the 5 % to 95 % confidence intervals for the respective calibrated MODIS VCF values.

Download

https://bg.copernicus.org/articles/19/1377/2022/bg-19-1377-2022-f09

Figure A2The change in tree cover post-calibration for all four scenarios. Black dots indicate areas where the post-calibration values have a 95 % certainty of being positive (increasing cover) or negative (decreasing cover) calibrations. These uncertainty maps are indicators of areas where MODIS VCF estimates may be more or less reliable and cannot be used as definitive calibrations due to the limited number of field sites used as reference.

https://bg.copernicus.org/articles/19/1377/2022/bg-19-1377-2022-f10

Figure A3Frequency distributions of percent tree cover value as estimated by MODIS VCF across the forest supercategory and the following IGBP classes that by our definition count as part of the savanna domain: closed shrublands, open shrublands, woody savannas, savannas, and grasslands. Specific class definitions as per the user guide for the MODIS land cover product (Sulla-Menashe and Friedl, 2018).

https://bg.copernicus.org/articles/19/1377/2022/bg-19-1377-2022-f11

Figure A4TROBIT plot upper-stratum height versus the difference between MODIS VCF and TROBIT percent tree cover for the four clumping and overlap scenarios. Upper and lower bars represent the uncertainty range's 10th and 90th percentile, respectively, based on the convolution of MODIS VCF and TROBIT cover uncertainties from Fig. 4.

Download

Code and data availability

The code and data used to support the findings of this study are archived at https://github.com/douglask3/VCF_vs_sites with revision number fdda3ff (Adzhar et al., 2022).

Upon being granted access, the data are available at https://www.forestplots.net and can be found using the site names we have listed in Table A1 (Torello-Raventos et al., 2013).

MODIS VCF is accessible via the MODIS R package (http://CRAN.R-project.org/package=raster; Hijmans, 2017). The product itself should be attributed to DiMiceli (2017, https://doi.org/10.5067/MODIS/MOD44B.006).

TCC data are accessible at https://e4ftl01.cr.usgs.gov/MEASURES/GFCC30TC.003/ (Sexton et al., 2013).

Author contributions

RA, DIK, and FG designed the TROBIT–MODIS VCF pixel–site comparison technique. RA and FG collated TROBIT and corresponding MODIS VCF values, and CG collated Sexton et al. (2013) values. DIK and ND performed regression analysis and constructed global maps. RA wrote the first draft of the paper with input from DIK and FG. DIK plotted the figures. MTR, EV, TRF, OLP, SLL, BS, HT, BSM, TD, LA, GD, and GS carried out the extensive TROBIT field campaigns. MTR was the main person responsible for field data quality checking and digitizing. RA, DIK, FG, and ND contributed to the final manuscript.

Competing interests

The contact author has declared that neither they nor their co-authors have any competing interests.

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Acknowledgements

The contribution by Douglas I. Kelley was supported by the UK Natural Environment Research Council through the UK Earth System Modelling project (UKESM, grant no. NE/N017951/1). France Gerard was supported by the SUNRISE project (grant no. NE/R000131/1). Ning Dong was supported by the Australia Research Council (grant no. DP170103410). Beatriz Schwantes Marimon was supported by the National Council for Scientific and Technological Development (CNPq, grant no. 301153/2018-3).

Maps in Figs. 1 and 3 were constructed using raster 2.6-7 (Hijmans, 2017) and mapproj 1.2-5 (Brownrigg et al., 2017) in R 3.2.0 (R Core Team, 2018). Coastlines were obtained from maps 3.1.0 (Becker et al., 2016).

Special thanks go to Jon Lloyd for supervising Rahayu Adzhar at the start of the project, to Jeanette Kemp for her work in the Australian field sites, and to Azemiyah Abdul Rahim for her support during the production of this paper.

Financial support

This research has been supported by the Natural Environment Research Council (grant nos. NE/N017951/1 and NE/R000131/1) and the Australian Research Council (grant no. DP170103410). Beatriz Schwantes Marimon was supported by the National Council for Scientific and Technological Development (CNPq, grant no. 301153/2018-3).

Review statement

This paper was edited by Alexandra Konings and reviewed by three anonymous referees.

References

Archer, C., Penny, A., Templeman, S., McKenzie, M., Hunt, E., Toral, D., Diakhite, M., Nhlapo, T., Mawoko, D., Vergnani, L., Chamdimba, C., Diop, H., Kalanzi, B., Touitha, Y., Jackson, A., Mchugh, J., Chang, O., Mohamad, A., Hunter, E., and Lopez, C.: State of the Tropics 2020 Report, James Cook University, ISBN 978-0-6486803-7-6, 2020. 

Adzhar, R., Kelley, D. I., Dong, N., George, C., Torello Raventos, M., Veenendaal, E., Feldpausch, T. R., Philips, O. L., Lewis, S. L., Sonké, B., Taedoumg, H., Schwantes Marimon, B., Domingues, T., Arroyo, L., Djagbletey, G., Saiz, G., and Gerard, F.: VCF_vs_sites, GitHub [code], https://github.com/douglask3/VCF_vs_sites, last access: 26 February 2022. 

Bartholomé, E. and Belward, A. S.: GLC2000: a new approach to global land cover mapping from Earth observation data, Int. J. Remote Sens., 26, 1959–1977, https://doi.org/10.1080/01431160412331291297, 2005. 

Bastin, J. F., Finegold, Y., Garcia, C., Mollicone, D., Rezende, M., Routh, D., Zohner, C. M., and Crowther, T. W.: The global tree restoration potential, Science, 365, 76–79, https://doi.org/10.1126/science.aax0848, 2019. 

Becker, R. A., Minka, T. P., Wilks, A. R., Brownrigg, R., and Deckmyn, A.: maps: Draw Geographical Maps, http://CRAN.R-project.org/package=maps (last access: 1 July 2016), 2016. 

Bicheron, P., Defourny, P., Brockmann, C., Schouten, L., Vancutsem, C., Huc, M., Bontemps, S., Leroy, M., Frédéric, A., Herold, M., Ranera, F. and Arino, O.: GLOBCOVER: products description and validation report, MEDIAS-France, JRC49240, 2008. 

Boval, M. and Dixon, R. M.: The importance of grasslands for animal production and other functions: a review on management and methodological progress in the tropics, Animal, 6, 748–762, https://doi.org/10.1017/S1751731112000304, 2012. 

Brandt, M., Rasmussen, K., Penuelas, J., Tian, F., Schurgers, G., Verger, A., Mertz, O., Palmer, J., and Fensholt, R.: Human population growth offsets climate-driven increase in woody vegetation in sub-Saharan Africa, Nature Ecology & Evolution, 1, 0081, https://doi.org/10.1038/s41559-017-0081, 2017. 

Brandt, M., Tucker, C., Kariryaa, A., Rasmussen, K., Abel, C., Small, J., Chave, J., Rasmussen, L., Hiernaux, P., Diouf, A., Kergoat, L., Mertz, O., Igel, C., Gieseke, F., Schöning, J., Li, S., Melocik, K., Meyer, J., Sinno, S., and Fensholt, R.: An unexpectedly large count of trees in the West African Sahara and Sahel, Nature, 587, 78–82, https://doi.org/10.1038/s41586-020-2824-5, 2020. 

Brovkin, V., Boysen, L., Raddatz, T., Gayler, V., Loew, A., and Claussen, M.: Evaluation of vegetation cover and land-surface albedo in MPI-ESM CMIP5 simulations, J. Adv. Model. Earth Sy., 5, 48–57, https://doi.org/10.1029/2012MS000169, 2013. 

Brownrigg, R., Mcilroy, D., Minka, T. P., and Bivand, R.: mapproj: Map Projections, http://CRAN.R-project.org/package=mapproj (last access: 15 March 2018), 2017. 

Burton, C., Betts, R., Cardoso, M., Feldpausch, T. R., Harper, A., Jones, C. D., Kelley, D. I., Robertson, E., and Wiltshire, A.: Representation of fire, land-use change and vegetation dynamics in the Joint UK Land Environment Simulator vn4.9 (JULES), Geosci. Model Dev., 12, 179–193, https://doi.org/10.5194/gmd-12-179-2019, 2019. 

Burton, C., Kelley, D. I., Jones, C. D., Betts, R. A., Cardoso, M., and Anderson, L.: South American fires and their impacts on ecosystems increase with continued emissions, Clim. Resil. Sustain., https://doi.org/10.1002/cli2.8, online first, 2021. 

DiMiceli, M. C.: MOD44B MODIS/Terra Vegetation Continuous Fields Yearly L3 Global 250m SIN Grid V006, MOD44Bv006 [data set], https://doi.org/10.5067/MODIS/MOD44B.006, 2017. 

DiMiceli, C., Carroll, M., Sohlberg, R. A., Huang, C., Hansen, M. C., and Townshend, J. R. G.: Annual global automated MODIS vegetation continuous fields (MOD44B) at 250 m spatial resolution for data years beginning day 65, 2000–2014, collection 5 percent tree cover, version 6, University of Maryland [data set], https://doi.org/10.5067/MODIS/MOD44B.006, 2017. 

Fiala, A. C. S., Garman, S. L., and Gray, A. N.: Comparison of five canopy cover estimation techniques in the western Oregon Cascades, Forest Ecol. Manage., 232, 188–197, https://doi.org/10.1016/j.foreco.2006.05.069, 2006. 

Friedl, M. A., McIver, D. K., Hodges, J. C. F., Zhang, X. Y., Muchoney, D., Strahler, A. H., Woodcock, C. E., Gopal, S., Schneider, A., Cooper, A., Baccini, A., Gao, F., and Schaaf, C.: Global land cover mapping from MODIS: algorithms and early results, Remote Sens. Environ., 83, 287–302, https://doi.org/10.1016/S0034-4257(02)00078-0, 2002. 

Gao, Y., Mas, J. F., Paneque-Gálvez, J., Skutsch, M., Ghilardi, A., Pacheco, J. A. N., and Paniagua, I.: Validation of MODIS vegetation continuous fields in two areas in Mexico, in: 2014 Third International Workshop on Earth Observation and Remote Sensing Applications (EORSA), 14–18, https://doi.org/10.1109/EORSA.2014.6927840, 2014. 

Gao, Y., Ghilardi, A., Paneque-Gálvez, J., Skutsch, M. M., and Mas, J.: Validation of MODIS Vegetation Continuous Fields for monitoring deforestation and forest degradation: two cases in Mexico, Geocarto Int., 31, 1019–1031, https://doi.org/10.1080/10106049.2015.1110205, 2015. 

Gaughan, A., Holdo, R., and Anderson, T.: Using short-term MODIS time-series to quantify tree cover in a highly heterogeneous African savanna, Int. J. Remote Sens., 34, 6865–6882, https://doi.org/10.1080/01431161.2013.810352, 2013. 

Gelman, A., Carlin, J. B., Stern, H. S., Dunson, D. B., Vehtari, A., and Rubin, D. B.: Bayesian Data Analysis, 3rd Edn., CRC Press, https://doi.org/10.1201/9780429258411, 2013. 

Gerard, F., Hooftman, D., Langevelde, F., Veenendaal, E., White, S., and Lloyd, J.: MODIS VCF should not be used to detect discontinuities in tree cover due to binning bias. A comment on Hanan et al. (2014) and Staver and Hansen (2015), Global Ecol. Biogeogr., 26, 854–859, https://doi.org/10.1111/geb.12592, 2017. 

Giriraj, A., Babar, S., and Murthy, M.: Evaluating MODIS-vegetation continuous field products to assess tree cover change and forest fragmentation in India – A multi-scale satellite remote sensing approach, Egyptian Journal of Remote Sensing and Space Science, 20, 157–168, https://doi.org/10.1016/j.ejrs.2017.05.004, 2017. 

Gross, D., Achard, F., Dubois, G., Brink, A., and Prins, H. H. T.: Uncertainties in tree cover maps of Sub-Saharan Africa and their implications for measuring progress towards CBD Aichi Targets, Remote Sensing in Ecology and Conservation, 4, 94–112, https://doi.org/10.1002/rse2.52, 2018. 

Hanan, N., Tredennick, A., Prihodko, L., Bucini, G., and Dohn, J.: Analysis of stable states in global savannas: Is the CART pulling the horse?, Global Ecol. Biogeogr., 23, 259–263, https://doi.org/10.1111/geb.12122, 2013. 

Hansen, M. C., Defries, R., Townshend, J., Marufu, L., and Sohlberg, R.: Development of a MODIS tree cover validation data set for Western Province, Zambia, Remote Sens. Environ., 83, 320–335, https://doi.org/10.1016/S0034-4257(02)00080-9, 2002. 

Hansen, M. C., DeFries, R. S., Townshend, J. R. G., Carroll, M., Dimiceli, C., and Sohlberg, R.: Global Percent Tree Cover at a Spatial Resolution of 500 Meters: First Results of the MODIS Vegetation Continuous Fields Algorithm, Earth Interact., 7, 1–15, 2003. 

Hansen, M. C., Townshend, J., DeFries, R., and Carroll, M.: Estimation of tree cover using MODIS data at global, continental and regional/local scales, Int. J. Remote Sens., 26, 4359–4380, https://doi.org/10.1080/01431160500113435, 2005. 

Harris, N. L., Brown, S., Hagen, S. C., Saatchi, S. S., Petrova, S., Salas, W., Hansen, M. C., Potapov, P. V., and Lotsch, A.: Baseline map of carbon emissions from deforestation in tropical regions, Science, 336, 1573–1576, https://doi.org/10.1126/science.1217962, 2012. 

Herold, M., Mayaux, P., Woodcock, C., Baccini, A., and Schmullius, C.: Some challenges in global land cover mapping: An assessment of agreement and accuracy in existing 1 km datasets, Remote Sens. Environ., 112, 2538–2556, https://doi.org/10.1016/j.rse.2007.11.013, 2008. 

Hijmans, R. J.: raster: Geographic Data Analysis and Modeling, http://CRAN.R-project.org/package=raster (last access: 15 March 2018), 2017. 

Huang, S. and Siegert, F.: Land cover classification optimised to detect areas at risk of desertification in North China based on SPOT VEGETATION imagery, J. Arid Environ., 67, 308–327, https://doi.org/10.1016/j.jaridenv.2006.02.016, 2006. 

Huete, A. R., Liu, H., and van Leeuwen, W. J. D.: The use of vegetation indices in forested regions: issues of linearity and saturation, in: IGARSS'97. 1997 IEEE International Geoscience and Remote Sensing Symposium Proceedings, Remote Sensing – A Scientific Vision for Sustainable Development, IGARSS'97, 1997 IEEE International Geoscience and Remote Sensing Symposium Proceedings, Remote Sensing – A Scientific Vision for Sustainable Development, 1966–1968, Vol. 4, https://doi.org/10.1109/IGARSS.1997.609169, 1997. 

Jung, M., Henkel, K., Herold, M., and Churkina, G.: Exploiting synergies of global land cover products for carbon cycle modeling, Remote Sens. Environ., 101, 534–553, https://doi.org/10.1016/j.rse.2006.01.020, 2006. 

Kelley, D. I., Prentice, I. C., Harrison, S. P., Wang, H., Simard, M., Fisher, J. B., and Willis, K. O.: A comprehensive benchmarking system for evaluating global vegetation models, Biogeosciences, 10, 3313–3340, https://doi.org/10.5194/bg-10-3313-2013, 2013. 

Kelley, D. I., Bistinas, I., Whitley, R., Burton, C., Marthews, T. R., and Dong, N.: How contemporary bioclimatic and human controls change global fire regimes, Nat. Clim. Change, 9, 690–696, https://doi.org/10.1038/s41558-019-0540-7, 2019. 

Kelley, D. I., Burton, C., Huntingford, C., Brown, M. A. J., Whitley, R., and Dong, N.: Technical note: Low meteorological influence found in 2019 Amazonia fires, Biogeosciences, 18, 787–804, https://doi.org/10.5194/bg-18-787-2021, 2021. 

Korhonen, L., Korhonen, K., Rautiainen, M., and Stenberg, P.: Estimation of forest canopy cover: a comparison of field measurement techniques, Silva Fenn., 40, 315, https://doi.org/10.14214/sf.315, 2006. 

Kumar, S. S., Hanan, N. P., Prihodko, L., Anchang, J., Ross, C. W., Ji, W., and Lind, B. M.: Alternative Vegetation States in Tropical Forests and Savannas: The Search for Consistent Signals in Diverse Remote Sensing Data, Remote Sens., 11, 815, https://doi.org/10.3390/rs11070815, 2019. 

Lary, D. and Lait, L.: Using probability distribution functions for satellite validation, IEEE T. Geosci. Remote Sens., 44, 1359–1366, https://doi.org/10.1109/TGRS.2005.860662, 2006. 

Lasslop, G., Moeller, T., D'Onofrio, D., Hantson, S., and Kloster, S.: Tropical climate–vegetation–fire relationships: multivariate evaluation of the land surface model JSBACH, Biogeosciences, 15, 5969–5989, https://doi.org/10.5194/bg-15-5969-2018, 2018. 

Lasslop, G., Hantson, S., Harrison, S. P., Bachelet, D., Burton, C., Forkel, M., Forrest, M., Li, F., Melton, J. R., Yue, C., Archibald, S., Scheiter, S., Arneth, A., Hickler, T., and Sitch, S.: Global ecosystems and fire: Multi-model assessment of fire-induced tree-cover and carbon storage reduction, Glob. Change Biol., 26, 5027–5041, https://doi.org/10.1111/gcb.15160, 2020. 

Lloyd, J., Bird, M. I., Vellen, L., Miranda, A. C., Veenendaal, E. M., Djagbletey, G., Miranda, H. S., Cook, G., and Farquhar, G. D.: Contributions of woody and herbaceous vegetation to tropical savanna ecosystem productivity: a quasi-global estimate, Tree Physiol., 28, 451–468, https://doi.org/10.1093/treephys/28.3.451, 2008. 

Lopez-Gonzalez, G., Lewis, S. L., Burkitt, M., and Phillips, O. L.: ForestPlots.net: a web application and research tool to manage and analyse tropical forest plot data, J. Veg. Sci., 22, 610–613, https://doi.org/10.1111/j.1654-1103.2011.01312.x, 2011. 

Lopez-Gonzalez, G., Lewis, S. L., Burkitt, M., Baker T. R., and Phillips, O. L.: ForestPlots.net Database, https://www.forestplots.net (last access: 14 April 2020), 2009. 

Miles, L., Newton, A. C., DeFries, R. S., Ravilious, C., May, I., Blyth, S., Kapos, V., and Gordon, J. E.: A global overview of the conservation status of tropical dry forests, J. Biogeogr., 33, 491–505, https://doi.org/10.1111/j.1365-2699.2005.01424.x, 2006. 

Montesano, P., Nelson, R., Sun, G., Margolis, H., Kerber, A., and Ranson, K. J.: MODIS tree cover validation for the circumpolar taiga–tundra transition zone, Remote Sens. Environ., 113, 2130–2141, https://doi.org/10.1016/j.rse.2009.05.021, 2009. 

Montesano, P., Neigh, C., Sexton, J., Feng, M., Channan, S., Ranson, K., and Townshend, J.: Calibration and Validation of Landsat Tree Cover in the Taiga–Tundra Ecotone, Remote Sens., 8, 551, https://doi.org/10.3390/rs8070551, 2016. 

Pennington, R. T., Lehmann, C. E. R., and Rowland, L. M.: Tropical savannas and dry forests, Curr. Biol., 28, R541–R545, https://doi.org/10.1016/j.cub.2018.03.014, 2018. 

Rabin, S. S., Melton, J. R., Lasslop, G., Bachelet, D., Forrest, M., Hantson, S., Kaplan, J. O., Li, F., Mangeon, S., Ward, D. S., Yue, C., Arora, V. K., Hickler, T., Kloster, S., Knorr, W., Nieradzik, L., Spessa, A., Folberth, G. A., Sheehan, T., Voulgarakis, A., Kelley, D. I., Prentice, I. C., Sitch, S., Harrison, S., and Arneth, A.: The Fire Modeling Intercomparison Project (FireMIP), phase 1: experimental and analytical protocols with detailed model descriptions, Geosci. Model Dev., 10, 1175–1197, https://doi.org/10.5194/gmd-10-1175-2017, 2017. 

Rautiainen, M., Stenberg, P., and Nilson, T.: Estimating canopy cover in Scots pine stands, Silva Fenn., 39, 137–142, https://doi.org/10.14214/sf.402, 2005. 

R Core Team: R: A language and environment for statistical computing, R Foundation for Statistical Computing, Vienna, Austria, https://www.R-project.org/ (last access: 31 January 2020), 2018. 

Rutten, G., Ensslin, A., Hemp, A., and Fischer, M.: Vertical and Horizontal Vegetation Structure across Natural and Modified Habitat Types at Mount Kilimanjaro, PLOS ONE, 10, e0138822, https://doi.org/10.1371/journal.pone.0138822, 2015. 

Saatchi, S., Harris, N., Brown, S., Lefsky, M., Mitchard, E., Salas, W., Zutta, B., Buermann, W., Lewis, S., Hagen, S., Petrova, S., White, L., Silman, M., and Morel, A.: Benchmark map of forest carbon stocks in tropical regions across three continents, P. Natl. Acad. Sci. USA, 108, 9899–904, https://doi.org/10.1073/pnas.1019576108, 2011. 

Sankaran, M., Hanan, N., Scholes, R., Ratnam, J., Augustine, D., Cade, B., Gignoux, J., Higgins, S., Roux, X., Ludwig, F., Ardö, J., Banyikwa, F., Bronn, A., Bucini, G., Caylor, K., Coughenour, M., Diouf, A., Ekaya, W., Feral, C., and Zambatis, N.: Determinants of woody cover in African Savannas, Nature, 438, 846–849, https://doi.org/10.1038/nature04070, 2006. 

Sellar, A. A., Jones, C. G., Mulcahy, J. P., Tang, Y., Yool, A., Wiltshire, A., O'Connor, F. M., Stringer, M., Hill, R., Palmieri, J., Woodward, S., Mora, L. de, Kuhlbrodt, T., Rumbold, S. T., Kelley, D. I., Ellis, R., Johnson, C. E., Walton, J., Abraham, N. L., Andrews, M. B., Andrews, T., Archibald, A. T., Berthou, S., Burke, E., Blockley, E., Carslaw, K., Dalvi, M., Edwards, J., Folberth, G. A., Gedney, N., Griffiths, P. T., Harper, A. B., Hendry, M. A., Hewitt, A. J., Johnson, B., Jones, A., Jones, C. D., Keeble, J., Liddicoat, S., Morgenstern, O., Parker, R. J., Predoi, V., Robertson, E., Siahaan, A., Smith, R. S., Swaminathan, R., Woodhouse, M. T., Zeng, G., and Zerroukat, M.: UKESM1: Description and Evaluation of the U.K. Earth System Model, J. Adv. Model. Earth Sy., 11, 4513–4558, https://doi.org/10.1029/2019MS001739, 2019. 

Sexton, J. O., Song, X.-P., Feng, M., Noojipady, P., Anand, A., Huang, C., Kim, D.-H., Collins, K. M., Channan, S., DiMiceli, C., and Townshend, J. R.: Global, 30-m resolution continuous fields of tree cover: Landsat-based rescaling of MODIS vegetation continuous fields with lidar-based estimates of error, Int. J. Digit. Earth, 6, 427–448, https://doi.org/10.1080/17538947.2013.786146, 2013 (data available at: https://e4ftl01.cr.usgs.gov/MEASURES/GFCC30TC.003/, last access: 3 March 2022). 

Smith, J., Wickham, J., Stehman, S., and Yang, L.: Impacts of Patch Size and Land-Cover Heterogeneity on Thematic Image Classification Accuracy, Photogramm. Eng. Rem. S., 68, 65–70, 2002. 

Solofondranohatra, C. L., Vorontsova, M. S., Hackel, J., Besnard, G., Cable, S., Williams, J., Jeannoda, V., and Lehmann, C. E. R.: Grass Functional Traits Differentiate Forest and Savanna in the Madagascar Central Highlands, Front. Ecol. Evol., 6, 184, https://doi.org/10.3389/fevo.2018.00184, 2018. 

Song, X. P., Huang, C., Feng, M., Sexton, J. O., Channan, S., and Townshend, J. R.: Integrating global land cover products for improved forest cover characterisation: an application in North America, Int. J. Digit. Earth, 7, 709–724, https://doi.org/10.1080/17538947.2013.856959, 2014. 

Stan Development Team: RStan: The R Interface to Stan, http://mc-stan.org/ (last access: 31 January 2020), 2019. 

Sulla-Menashe, D. and Friedl, M. A.: User Guide to Collection 6 MODIS Land Cover (MCD12Q1 and MCD12C1) Product, https://lpdaac.usgs.gov/documents/112/MOD44B_User_Guide_V6.pdf (last access: 26 February 2022), 2018. 

Sulla-Menashe, D., Gray, J. M., Abercrombie, S. P., and Friedl, M. A.: Hierarchical mapping of annual global land cover 2001 to present: The MODIS Collection 6 Land Cover product, Remote Sens. Environ., 222, 183–194, https://doi.org/10.1016/j.rse.2018.12.013, 2019. 

Tang, H., Armston, J., Hancock, S., Marselis, S., Goetz, S., and Dubayah, R.: Characterizing global forest canopy cover distribution using spaceborne lidar, Remote Sens. Environ., 231, 111–262, https://doi.org/10.1016/j.rse.2019.111262, 2019a. 

Tang, H., Song, X.-P., Zhao, F. A., Strahler, A. H., Schaaf, C. L., Goetz, S., Huang, C., Hansen, M. C., and Dubayah, R.: Definition and measurement of tree cover: A comparative analysis of field-, lidar- and landsat-based tree cover estimations in the Sierra national forests, USA, Agr. Forest Meteorol., 268, 258–268, https://doi.org/10.1016/j.agrformet.2019.01.024, 2019b. 

Taylor, C., de Jeu, R., Guichard, F., Harris, P. P., and Dorigo, W. A.: Afternoon rain more likely over drier soils, Nature, 489, 423–426, https://doi.org/10.1038/nature11377, 2012. 

Torello-Raventos, M., Feldpausch, T., Veenendaal, E., Schrodt, F., Saiz, G., Domingues, T., Djagbletey, G., Ford, A., Kemp, J., Marimon, B., Marimon-Junior, B. H., Lenza, E., A Ratter, J., Maracahipes, L., Sasaki, D., Sonké, B., Zapfack, L., Taedoumg, H., Daniel, V., and Lloyd, J.: On the delineation of tropical vegetation types with an emphasis on forest/savanna transitions, Plant Ecol. Divers., 6, 101–137, https://doi.org/10.1080/17550874.2012.762812, 2013 (data available at: https://www.forestplots.net, last access: 31 October 2021). 

Veenendaal, E. M., Torello-Raventos, M., Feldpausch, T. R., Domingues, T. F., Gerard, F., Schrodt, F., Saiz, G., Quesada, C. A., Djagbletey, G., Ford, A., Kemp, J., Marimon, B. S., Marimon-Junior, B. H., Lenza, E., Ratter, J. A., Maracahipes, L., Sasaki, D., Sonké, B., Zapfack, L., Villarroel, D., Schwarz, M., Yoko Ishida, F., Gilpin, M., Nardoto, G. B., Affum-Baffoe, K., Arroyo, L., Bloomfield, K., Ceca, G., Compaore, H., Davies, K., Diallo, A., Fyllas, N. M., Gignoux, J., Hien, F., Johnson, M., Mougin, E., Hiernaux, P., Killeen, T., Metcalfe, D., Miranda, H. S., Steininger, M., Sykora, K., Bird, M. I., Grace, J., Lewis, S., Phillips, O. L., and Lloyd, J.: Structural, physiognomic and above-ground biomass variation in savanna–forest transition zones on three continents – how different are co-occurring savanna and forest formations?, Biogeosciences, 12, 2927–2951, https://doi.org/10.5194/bg-12-2927-2015, 2015.  

White, M., Shaw, J., and Ramsey, R.: Accuracy assessment of the vegetation continuous field tree cover product using 3954 ground plots in the south-western USA, Int. J. Remote Sens., 26, 2699–2704, https://doi.org/10.1080/01431160500080626, 2005. 

White, R. P., Murray, S., and Rohweder, M.: Pilot Analysis of Global Ecosystems: Grassland Ecosystems, World Resources Institute, ISBN 1-56973-461-5, 2000. 

Whitley, R., Beringer, J., Hutley, L. B., Abramowitz, G., De Kauwe, M. G., Evans, B., Haverd, V., Li, L., Moore, C., Ryu, Y., Scheiter, S., Schymanski, S. J., Smith, B., Wang, Y.-P., Williams, M., and Yu, Q.: Challenges and opportunities in land surface modelling of savanna ecosystems, Biogeosciences, 14, 4711–4732, https://doi.org/10.5194/bg-14-4711-2017, 2017. 

Wiltshire, A. J., Burke, E. J., Chadburn, S. E., Jones, C. D., Cox, P. M., Davies-Barnard, T., Friedlingstein, P., Harper, A. B., Liddicoat, S., Sitch, S., and Zaehle, S.: JULES-CN: a coupled terrestrial carbon–nitrogen scheme (JULES vn5.1), Geosci. Model Dev., 14, 2161–2186, https://doi.org/10.5194/gmd-14-2161-2021, 2021. 

Wuyts, B., Champneys, A. R., and House, J. I.: Amazonian forest-savanna bistability and human impact, Nat. Commun., 8, 15519, https://doi.org/10.1038/ncomms15519, 2017. 

Xu, C., Hantson, S., Holmgren, M., van Nes, E. H., Staal, A., and Scheffer, M.: Remotely sensed canopy height reveals three pantropical ecosystem states, Ecology, 97, 2518–2521, https://doi.org/10.1002/ecy.1470, 2016. 

Yang, X. and Crews, K.: Applicability analysis of MODIS tree cover product in Texas savanna, Int. J. Appl. Earth Obs., 81, 186–194, https://doi.org/10.1016/j.jag.2019.05.003, 2019. 

Download
Short summary
The MODIS Vegetation Continuous Fields (VCF) product underestimates tree cover compared to field data and could be underestimating tree cover significantly across the tropics. VCF is used to represent land cover or validate model performance in many land surface and global vegetation models and to train finer-scaled Earth observation products. Because underestimation in VCF may render it unsuitable for training data and bias model predictions, it should be calibrated before use in the tropics.
Altmetrics
Final-revised paper
Preprint