Assessing the sensitivity of multi-frequency passive microwave vegetation optical depth to vegetation properties
- 1Technische Universität Dresden, Institute of Photogrammetry and Remote Sensing, 01069 Dresden, Germany
- 2Technische Universität Wien, Department of Geodesy and Geoinformation, Vienna, Austria
- 3Leverhulme Centre for Wildfires, Environment, and Society, London, SW7 2AZ, UK
- 4Department of Physics, Imperial College London, London, SW7 2AZ, UK
- 5Planet, Wilhelminastraat 43A, 2011 VK Haarlem, The Netherlands
- 6Fenner School of Environment & Society, Australian National University, ACT 2601, Australia
- 7School of Engineering, Australian National University, ACT 2601, Australia
- 1Technische Universität Dresden, Institute of Photogrammetry and Remote Sensing, 01069 Dresden, Germany
- 2Technische Universität Wien, Department of Geodesy and Geoinformation, Vienna, Austria
- 3Leverhulme Centre for Wildfires, Environment, and Society, London, SW7 2AZ, UK
- 4Department of Physics, Imperial College London, London, SW7 2AZ, UK
- 5Planet, Wilhelminastraat 43A, 2011 VK Haarlem, The Netherlands
- 6Fenner School of Environment & Society, Australian National University, ACT 2601, Australia
- 7School of Engineering, Australian National University, ACT 2601, Australia
Abstract. Vegetation attenuates the microwave emission from the land surface. The strength of this attenuation is quantified in models in terms of the parameter Vegetation Optical Depth (VOD), and is influenced by the vegetation mass, structure, water content, and observation wavelength. Earth observation satellites operating in the microwave frequencies are used for global VOD retrievals, enabling the monitoring of vegetation status at large scales. VOD has been used to determine above-ground biomass, monitor phenology or estimate vegetation water status. VOD can be also used for constraining land surface models or modelling wildfires at large scale. Several VOD products exist differing by frequency/wavelength, sensor, and retrieval algorithm. Numerous studies present correlations or empirical functions between different VOD datasets and vegetation variables such as normalised difference vegetation index, leaf area index, gross primary production, biomass, vegetation height or vegetation water content. However, an assessment of the joint impact of land cover, vegetation biomass, leaf area, and moisture status on the VOD signal is challenging and has not yet been done.
This study aims to interpret the VOD signal as a multi-variate function of several descriptive vegetation variables. The results will help to select certain VOD wavelengths for specific applications and can guide the development of appropriate observation operators to integrate VOD with large-scale land surface models. Here we use VOD from the Land Parameter Retrieval Model (LPRM) of Ku-, X- and C-bands of the harmonised VODCA dataset and level 3 L-band derived from SMOS and SMAP sensors. Within a multivariable regression random forest model for simulating these VOD signals, leaf area index, live-fuel moisture content, above-ground biomass, and land cover are able to explain up to 0.95 of the variance (coefficient of determination). Thereby, the variance in L-band VOD is reproduced spatially and for Ku-, X- and C-band VOD spatially as well as temporally. Analyses of accumulated local effects demonstrate that Ku-, X- and C-band VOD is mostly sensitive to leaf area index and L-band VOD to above-ground biomass. However, for all VODs the global relationships with vegetation properties are non-monotonic and complex and differ with land cover type. This indicates that the use of simple global regressions to estimate single vegetation properties (e.g. above-ground biomass) from VOD is over-simplistic.
- Preprint
(1488 KB) -
Supplement
(3023 KB) - BibTeX
- EndNote
Luisa Schmidt et al.
Status: final response (author comments only)
-
RC1: 'Comment on bg-2022-85', Martin Baur, 20 Apr 2022
General comments:
Dear Authors,
Thank you for the submitted manuscript. I believe the presented analysis of multi-frequency VOD is interesting and complements recent work. I think the study design, analyzing multiple VOD products with statistical models and predictor datasets, gives new insights into VOD dynamics and might help to understand the underlying reasons for these dynamics at different frequencies. Nevertheless, I think the manuscript gives away o lot of potential by not communicating methodology and results in a clear and intuitive way. Some parts of the discussion make claims that are not fully supported by the presented figures and results. I believe a main reason for some of these problems is the fact that the main manuscript has five figures only and a lot of result presentation and discussion is done referencing figures in the supplementary document. This makes it very difficult to follow the storyline of the manuscript. If this problem arises from a figure limit, please try to aggregate results into higher level figures with multiple labeled panels, I think this might be the best way to improve the manuscript.
Specific comments:
- I am not sure about the usefulness of Fig. 1, most of the shown maps are known to many readers and have been shown in many studies. Even if a reader is unfamiliar with VOD at multiple frequencies Fig. 1 does not provide much information as VOD is normalized to 0-1. This removes the frequency component. Even if the patterns are interesting 80% of the maps is not even used in the rest of the study as LFMC is only available for some regions. Most of the RF model results and discussion does not really include and spatial component, so a reader would hardly go back to Fig. 1 and look at a specific VOD map.
- The use of the LFMC parameter seems to be motivated by Eq. 2. Predictor variables AGB, LFMC and landcover type explicitly and implicitly (through analysis of landcover classes) recreate Eq. 2. It seems a little bit unintuitive that this logic is applied when average LFMC over whole Europe is 100% or even larger. It seems like this dataset might only work well in Australia or Southern Africa. Even if this is not a problem for the GAMs and RF analysis, I can’t retrace why you would introduce Eq. 2 and then have a LFMC which is larger than 100% for nearly whole Europe. Even you could fix the range to 0-100% it is a problem that AGB is not dynamic in time. Furthermore, LFMC and LAI are probably quite similar as both originate from MODIS. At some point you say that LAI and LFMC are strongly correlated. I do not really see the point of introducing Eq. 2 and then not having the datasets to check whether this VOD formulation is sufficient or not.
- Section 2.5, which introduces the ALE figures should be simplified. I think it is crucial to have a sound understanding of what these plots show to understand the manuscript. I am sorry but currently I don’t. ALE plots are introduced relative to PDP plots, readers might not know about neither of them. Can you try to work on this section and describe what the y-axis in Fig. 4 really shows?
- Currently GAMs results are only presented in text and not in a figure. If it is a goal of this manuscript to show that VOD to predictor relationship is complex, and therefore not captured well by GAMs, it would be nice to have visual results. I guess you ended up with many figures, one for each model type, landcover and monthly or 8-daily. Nevertheless, it would greatly improve the manuscript if these results could be aggregated into high level figures.
- I don’t fully understand why all VOD products were normalized. Can you please explain this? I am not sure whether this is a requirement for the RF and GAMs model.
Technical corrections:
- I added multiple comments to the pdf of the manuscript. Please try to address them.
Thank you,
Martin Baur
- RC3: 'Reply on RC1', Anonymous Referee #1, 05 May 2022
-
RC2: 'Comment on bg-2022-85', Andrew Feldman, 03 May 2022
Schmidt et al. aim to determine the vegetation canopy information that describes satellite VOD variability across X, C, and L band frequencies. They specifically use regression models to predict VOD with aboveground biomass, leaf area index, and live fuel moisture content. They show mainly monthly scale results with a strong control of AGB on L-VOD and LAI on C- and X-VOD. This is a well-focused and neat study addressing a much needed objective for us to interpret global VOD time variations. The methods are well thought out. I find this to be important work and encourage its publication. I spent some time thinking about each figure and find them to have a lot of value, despite some caveats. The manuscript personally interests me and will indeed influence my future work. Nevertheless, I do have some major points below that need to be addressed. I hope they help the authors improve the manuscript and hope the authors expand on this line of thinking with future articles.
-Andrew Feldman
Major Comments
1) The major concern I have is that LFMC may not adequately partition only the plant saturation amount (i.e., relative water content, predawn water potential of leaves or xylem, etc.). As a result, the results may have a strong bias in overemphasizing the role of biomass changes because saturation considerations are not independently considered by any of the datasets. Namely, as shown in Table 1 of Konings et al. 2019 referenced in this study, LFMC is still a strong function of AGB. LFMC ultimately does not normalize out effects of saturation alone - a change in dry biomass will still influence LFMC even without a change in plant water status. Ideally, a plant water status parameter should be used that is insensitive to dry biomass. Additionally, LFMC is modeled from MODIS. MODIS is undesirably mainly a function of greenness/biomass (not always strongly a function of plant hydraulics) where use of this dataset relies heavily on how well plant hydraulics and water status are modeled (a highly uncertain modeling process). This is a lynchpin of a study that ultimately relies on partitioning effects of VOD signals into their components, where there is a bias of more certainty in the dry biomass and structure observations and less certainty in those about the plant hydraulic parameters.
One insight that suggests saturation may not be well modeled is that there is a large decline in explanation of 8-day versus monthly timescales (figures S1 and S2). While noise may act more at 8-day timescales reducing its R2 compared to monthly R2 as the authors discuss in lines 460-470, we also know that the short timescales are commonly describing plant saturation dynamics as described in the Konings et al. 2019 and Feldman et al. 2020 papers referenced here. There are also several other published works (see for example: https://doi.org/10.5194/bg-18-831-2021 with error analysis: 10.1109/JSTARS.2021.3124857) based on SMAP VOD that shows the sub-weekly timescale L-VOD variations are indicative of plant rehydration and water loss dynamics, especially in water-limited locations where the authors here tend to find some of the lowest R2 (lines 330-337). I am speculating (with help from fig. 4 that shows some stronger LVOD sensitivity to LFMC), but this issue could additionally be reflected in lower R2 for L-VOD than for X-VOD where potentially the former is more sensitive to canopy water status, especially in tree canopies. That is assuming the L-VOD and X-VOD retrieval processes are equivalent.
The issue is not enough to prevent publication of the study because there are still very valuable results here and reliable large scale partitioned plant water status information is practically unavailable. However, I strongly recommend that the authors explicitly address this issue beyond few sentences in 500-503. Either consider using a more detailed model output of plant water status (some common large scale DGVMs like LPJ and ED2 now have plant hydraulic schemes to provide canopy water status information) in place of the MODIS LFMC and/or do more to explain the uncertainties of the MODIS LFMC product and how that could bias the results as stated about. For example, I am not familiar with the Yebra et al. 2018 study - I think a summary of how they obtain LFMC and potential limitations relevant to its use in this study are needed.
2) I think the manuscript should emphasize the time dependence of the main results more. Even though monthly and 8-day timescales are different, the main predictors tend to be different and previous studies do not suggest that this would only be due to noise; the results nicely give evidence for what we have speculated all along that different aspects of the canopy (dry biomass, water, structure, etc.) influence VOD at different timescales. Specifically, the abstract (lines 31-33), overall results (lines 367-373), and much of the discussion are written as an absolute result. However, these statements mainly appear to be only a function of the monthly timescale. Future work should repeat the same analysis as equivalently as possible across different timescales.
Line-Specific Comments
Line 27: “…level 3 L-band derived…” should it be “…level 3 L-band [VOD] derived…”?
Line 47-52: Please reference some more recent work on the b parameter and other aspects by Kaitlin Togliatti et al.:
Togliatti et al., "Quantitative Assessment of Satellite L-Band Vegetation Optical Depth in the U.S. Corn Belt," in IEEE Geoscience and Remote Sensing Letters, vol. 19, pp. 1-5, 2022, Art no. 2500605, doi: 10.1109/LGRS.2020.3034174.
Line 118: I think the article would benefit from a more explicit research objective or question. Line 118 is very general and broad and has been done in various forms previously. I suggest adding some more nuance so the reader knows what the authors wish to establish.
Line 130: I want to double check that the authors were careful in keeping consistent retrieval processes for all VOD products here. It is a great step to normalize VOD at each frequency (lines 143-146). It is also good they all use LPRM (line 131). However, were there greatly different processes for choosing parameters like single scattering albedo and the modeling of surface roughness? Harmonizing VOD could potentially bias some results in disproportionately influencing certain components of the VOD power spectra (for example, monthly variances of VOD could have been greatly altered while the interannual VOD variance was not influenced).
Line 205-206: The authors means that the “AGB dataset is not representative…” not that AGB itself is not representative. AGB is certainly relevant to all plants.
Line 212: Please explain what “grid-search” means. Is this describing spatial pixels across the globe or a method within the random forest approach?
Line 218-219: I am not sure these definitions are common knowledge. What exactly is used to determine “minimum samples within a leaf”, “number of estimators”, etc.?
Line 230-234: It would be helpful to point out what the regressors are (f(x)) and what is being predicted? I am guessing VOD is g(mu) and vegetation structure, leaf, water, etc. observations are the (f(x))? Please clarify.
Line 236-237: The land cover maps are technically binary “dummy variables” here that tell whether or not to include AGB (values of zero or one). Or are they being used beyond this?
Table 2: A caution that for the “global” regression, one of the land cover “dummy variables” needs to be left out or else it will bias the regression. For example, if you have four binary regressors (tree, herb, shrub, crop), one needs to be chosen not to be in the regression.
Fig. 2 and Fig. S1: A point of major clarification: It may be helpful to walk the reader through what the authors are looking for here. Are you choosing between GAM and RF? Any other specific decisions? I am not sure how to interpret the difference between the short vegetation and tree cover models. If the tree cover models have AGB, they should be expected to have better variance explained by default because they have an additional parameter that the herbaceous pixels do not have. They are also comparing different regions. It may not be a one-to-one comparison. I am not sure how to also interpret the difference between the global model and land cover model (line 289-299). I am still struggling to understand what their difference is from Table 2 and whether it is a one-to-one comparison. One thing to consider is that models that have more regressors are forced to have higher R2 because adding regressors never reduces the variance explained (at least in least squares regression). These points should be clarified given that they influence the subsequent results.
Figure 3 and line 252: I make heavy use of linear regression models and least squares, but tend to not use ML approaches like random forest. The negative R2 are confusing because they are not possible in least squares, fundamentally because they are the square of the correlation coefficient which is forced to be non-negative. Negative R2 may be because a predictor (a constant like the y-intercept) is left out of the regression and therefore it is not interpreted like other R2. Can the authors clarify?
Figure 4: Is this monthly?
Section 3.2.2 and Figure 5: Given that I did not follow the specific differences between the land cover and global model, I am having trouble interpreting Figure 5 compared to Figure 4. Is the only difference that the global model (figure 4) includes all pixels, but the land cover model are different subsets of pixels? I am guessing the use of “dummy” variable regressors in the global model also has an effect that the land cover models do not? Please clarify.
Line 414: Local factors like what?
Line 419-422: See my major comment: this could in part be a limitation of not having an independent plant water status predictor. Lower prediction of VOD dynamics in these regions is not expected because VOD and soil moisture errors tend to be smaller in regions with herbaceous vegetation. R2 should therefore tend to be higher in these regions if we have relevant predictors with R2 in forested regions decreasing due to noise. See results from recent VOD error quantification work: see https://doi.org/10.1016/j.rse.2019.111257 and 10.1109/JSTARS.2021.3124857
Line 444: “Varies by wavelength.” This is microwave wavelength, correct? (as opposed to a timescale-dependent wavelength related to power spectra of the time series)
Luisa Schmidt et al.
Luisa Schmidt et al.
Viewed
HTML | XML | Total | Supplement | BibTeX | EndNote | |
---|---|---|---|---|---|---|
296 | 91 | 13 | 400 | 20 | 1 | 6 |
- HTML: 296
- PDF: 91
- XML: 13
- Total: 400
- Supplement: 20
- BibTeX: 1
- EndNote: 6
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1