the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Grazing enhances carbon cycling but reduces methane emission during peak growing season in the Siberian Pleistocene Park tundra site
Wolfgang Fischer
Christoph K. Thomas
Nikita Zimov
Download
- Final revised paper (published on 21 Mar 2022)
- Preprint (discussion started on 09 Jun 2021)
Interactive discussion
Status: closed
-
RC1: 'Comment on bg-2021-110', Anonymous Referee #1, 02 Jul 2021
General comments
The authors describe the effects of a manipulation of grazing herbivores density in a continuous permafrost tundra ecosystem on carbon fluxes during the growing season and some of its predictors. The core data is an extensive set of NEE and Reco measurements obtained over two weeks at peak growing season over three replicates of the high grazing density system and two replicates of the low grazing density system, and is accompanied by meteorological variables. Overall, gross fluxes GPP and Reco were increased in the high grazing density plots, concomitant with an increase air and soil temperature and a decreased soil moisture content, while NEE was largely unaffected. CH4 fluxes were lower in the high grazing density plots but with high variability between plots. The flux measurement dataset is valuable, and my main concerns lie in the choice of an unbalanced design which hampers statistical evaluation of the results, and that too few details are provided to justify the fact that initial conditions were comparable and throughout the Methods section. I would therefore recommend that these aspects be thoroughly improved before publication and try to provide suggestions for such improvements.
I appreciate that, since initial submission, further information has been added regarding the choice of the two replicates in the UGR site, but still think that beyond the choice of these particular plots, the decision of having only two control plots should at least be better motivated and the limitations it implies better discussed. Overall, that does not entirely appease my concerns regarding the unbalanced study design. On the contrary, by choosing two sites that are close to the average within that transect, rather than e.g. at random, the mean value is preserved but the variance is artificially deflated, possibly biasing comparisons. While this may or may not affect Reco dynamics so much, as the differences are marked and variability seems limited, the differences observed in other variables and in particular CH4 fluxes could be artefactual for this reason. In addition, UGR-2 is described as standing out on several aspects, from the lower soil temperature to the higher time lag of soil vs air temperature, to having only half as many flux measurements as the other plots (so that there is, in total, twice as much flux data for GR than UGR plots), which explains in part the poor measured vs modelled fit of GPP for UGR-2.
I do not see a simple way to solve this issue, but perhaps simulating data based on flux measurements from Kwon et al 2016, using the relationship between the fluxes in the current study and those in Kwon et al 2016, could allow to carry out a sensitivity analysis to determine whether the choice of these 2 replicates affected the findings.Comparing the “grazed” and “ungrazed” treatments is central to this manuscript, but their identity prior to the experiment is unclear from the information currently provided. For instance, as it is now GR sites are described as a wet lowland tundra that gets flooded every year, while UGR sites are a wet tussock tundra floodplain, it is unclear whether the distinction between the two is intended to avoid repetition or to convey a more fundamental difference between the sites. Beyond the more detailed comments below requesting that more detailed data on initial conditions should be presented if available, I would suggest reorganizing the part of the Methods section where the sites are described. In addition to the general geographic and climatic information about the area, it could be easier to follow if the authors would first describe the similarities between the two sites prior to manipulation (e.g. flooding, vegetation, etc.) before delving into what makes them distinct.
Specific comments
L30: Might be worth citing the recent review by Mekkonen et al 2021 found here: https://doi.org/10.1088/1748-9326/abf28b
L53-56: It’s not clear how Pleistocene Park and the measurements presented here address non-growing season carbon cycling, in fact the abstract (L14-15) explicitly states that those are not addressed. I suggest reformulating or removing the reference to non-growing season fluxes from this section altogether.
L58-63: It’s perhaps more a question of personal taste, but I find it a bit confusing to present hypotheses that are not tested in the current study (e.g. above-belowground partitioning of GPP, decrease in respiration from colder permafrost in the winter). I would think these wider considerations about the aims of Pleistocene Park itself would fit better in the discussion, or presented differently than a list of hypotheses near the end of the introduction, as that might give the reader the wrong impression that those are the hypotheses addressed in this study.
L83-86: Can you provide more precise information on that site, such as for how long the density of grazing herbivores has been increased, and by how much compared to the “ungrazed” site which arguably hosts large grazing herbivores in lower densities such as the rest of the Arctic, unless the site is too wet and not visited by grazers?
L91-93 and 101-103: Can you provide more precise information on e.g. vegetation composition and soil conditions prior to the onset of manipulating density of grazing herbivores? Something similar to Table 2 in Kwon et al 2016 would be a good start to support the central assumption that the sites were initially similar.
L103-104: For future reference, it would be good to mention to which plot numbers in Kwon et al 2016 the UGR-1 and UGR-2 sites in this study refer to. 2-0 and 2-2 are mentioned further in the discussion (L300) and should be mentioned here instead, but do not appear directly relatable to the denomination in Kwon et al 2016: does that mean Control-0 and Control-2?
L108-112: Please clarify how many probes were used and where. From the phrasing I would have expected one probe per plot for temperature, and three probes per site for moisture, but from Fig.1 and the corresponding Results section it seems like there was only one probe in one of the GR plots and two in UGR. Please also clarify what was considered 0cm depth in the water-logged tussock tundra (e.g. water-table, top of the tussocks, between tussocks), I assume it is the soil surface between tussocks but I shouldn’t have to make assumptions in the Methods section. Please also mention the logging interval and the procedure used for producing the interpolated data presented in Fig.1.
L113-115: Please clarify what is meant by plausibility limits, which offsets were corrected and how. Please also provide further detail on the interpolation of soil temperature data: which variables other than air temperature and incoming radiation were used, and how. In addition, indicating the date and value of individual measurements in Fig. 1 would be helpful.
L123-124: Please clarify whether a single relationship was used (if so, which one) or whether incoming radiation-PAR relationships were adjusted by incoming radiation classes.
L139-140: I assume that not all measurements were 2 minutes long since that time is mentioned as a maximum. I would expect an arbitrary threshold in change in CO2 concentration was used to limit non-linearities from excessive CO2 buildup or uptake within the chamber, but could not find that value here or in Kwon et al 2016. Could you please clarify that part?
L149-152: Is it correct to assume that these slopes were linear fits? If so, perhaps mention it at L150 instead of “steady”, if not please clarify. In addition, a duration criteria must have been used for selecting the periods with a steady increase, please mention how long these periods had to be in order to be considered.
L152-155: I assume this is the reason for the uneven number of “utilizable” measurements in the different plots, if so please move the reference to Table 1 after this section instead of at L147. Please clarify in Table 1 what “utilizable” refers to. Please also mention somewhere why so few measurements were utilizable in UGR-2.
Appendix A: In line with my earlier comment about the number of probes used and their location, please clarify where the moisture / temperature data used for modelling comes from. Were GR-1, GR-2 and GR-3 models based on the same temperature data from the probe in GR-1? If so, was the data averaged between UGR-1 and UGR-2 or did these plots benefit from a distinct processing where they each had their own supporting data for the modelling?
L200: One of the assumptions of the Mann-Whitney test is that the observations are independent, but for both Ts and fluxes the data are time-series and are therefore not independent.
L199-201 and throughout the text: The statistical tests results mostly present P values. I could not find precise guidelines of in-house rules regarding the presentation of statistical results in Biogeosciences, but I would suggest the test statistic to be presented as well, and the degrees of freedom. If this burdens the text too much, please consider a supplementary table.
L201: default t-test may not be appropriate with unequal variances, particularly so with unequal sample sizes as well. Please mention whether the assumptions for using a t-test were checked and met.
L207 and throughout the text: When presenting mean values, please provide associated uncertainties in the form of SD, SEM or CI.
L223-224: Is there any data on this difference in air temperatures before the onset of the experiment? I.e. does this reflect initial differences or an effect of altered vegetation and soil conditions?
L229-234: There is no mention in the Methods section of how these values were computed and it is unclear what the P values presented refer to, please clarify.
L236: “clearly” is strong phrasing considering the absence of replication. Given the 100h lag at 15cm depth, effects of the change in weather pattern should only be visible in the last couple of days, at best, but one would expect deeper layers to have higher thermal inertia and therefore not to see an effect of the change in overall weather pattern by the end of the study period. In that respect, I do not think it is justified to carry out separate tests for the two weeks at this depth.
L249-248: There is no mention in the Methods section of how or when thaw depths were measured or inferred from temperature data, please correct this.
L252-277 and Figure 2: Judging by L199-200, the pairwise comparisons presented as letters in Figure 2 were computed by running 10 different Mann-Whitney tests per variable, plus one for the averaged values. If that is not the case, please describe this in the Statistics section, if that is the case, please clarify it in the Statistics section as well. In both cases, please indicate (how) were the P values adjusted for multiple comparisons. Beyond concerns about the Mann-Whitney test assumption of independence of observations expressed above, I would advise running an omnibus test prior to post-hoc pairwise comparisons. With a balanced design, a repeated-measures ANOVA could be a correct way to account for dependent observations within a plot. Considering the central role of flux data in this manuscript, their statistical treatment should be improved.
L256-258: It is unclear what “flux rates” refers to in the first part of this sentence: NEE, Reco?
L286-287: When were the collars installed at the GR site? Presumably after setting up the wooden fences to prevent trampling by the herbivores, but please mention this here or in the Methods section.
L293-295: Why not mention tussock-forming plants here? As far as small-scale heterogeneity is concerned it seems odd not to mention one of the main ecosystem engineers of these systems.
L308-309: It might be good to remind here that when comparing CVs of GR and UGR one should keep in mind GR having 50% more plots and ~100% more measurement points.
L324: See above at L108-112, this is not a reminder and this information should be stated more explicitly in the Methods section.
L329: I assume “the actual measured values” refer to Reco, but please clarify.
L351-355: It would be good to mention examples of which such operations might be confounded with the effects attributed to the increased grazing herbivore density.
L357-359: See my earlier comment about L91-93 101-103, in absence of more detailed data such photographs may be an interesting supplementary display item.
L394: “mostly likely” should be “most likely”, but the phrasing is a bit strong for an hypothetical future development, which to date is in contradiction with the observations as shown in Fig 1. While I understand the hypothesis of a cooling of the soil and grazing-induced protection of permafrost in Pleistocene Park, it is hard to ignore that Fig 1 shows an almost twice as deep active layer thickness in the grazed site. Either the hypothesis is correct but the sites differed drastically in active layer thickness prior to the experiment, or the effects observed after 22 years of manipulation contradict the expected consequences of the hypothesis. A transient regime is possible but less parsimonious, and “most probably” or “most likely” are too strong for that to my taste.
L404-406: Liquid water has a fairly high thermal conductivity, a comparison between values for a compacted soil and a water-logged soil could be useful information here.
L408-410: It is hard to say for 25cm depth since the data is not shown, but for 35cm it would be good to remind that the observed difference in soil temperature is lower or similar to the observed difference in air temperature.
L414-415: In line with the previous comment, this sentence could be complemented by starting it with “Barring differences prior to the onset of the experiment”.
L416-419: Considering that no difference was observed in growing season NEE and that in presence of grazers, a larger fraction of NPP is removed by herbivory, this argument should be substantiated with above- and below-ground plant biomass measurements or a complete C budget. In their absence, it is speculative and because this is not central to the reasoning, I would suggest removing it.
L419-420: See above at L394, this is speculative and in direct contradiction with data presented in Fig. 1.
L453 and 456-457: This is speculative, please use less strong phrasing.Technical corrections
L27: change sentence order
L57: facilitates -> allows
L116-119: I would suggest using GR and UGR rather than Pleistocene Park and Ambolikha for consistency.
L165-166 and L200: Rstudio is only a GUI software to R and does not do calculations. Please move the mention to the software used to the end of the statistics section, and provide adequate reference including R, the version number and the appropriate citation (e.g. R Core Team. R: A language and Environment for Statistical Computing. (2021)
L200: “(?)”?
L226 – Figure 1: I would recommend making two separate panels out of panel (a). I do not think the y-axis break simplifies the figure, and the factor 5 change in axis scale would be more obvious that way.
L437-439: Please consider rephrasing, the current syntax poses “increases in primary productivity” as an explanation for increased GPP.
L480: “differences in NEE were not pronounced” -> “no differences in NEE were found”Appendix A, L494-496: This sentence would be easier to understand if the information was split across several sentences, please rephrase it.
Review criteria:
Does the paper address relevant scientific questions within the scope of BG?
YesDoes the paper present novel concepts, ideas, tools, or data?
Yes, effects of a long-term herbivory experiment on C-cycling in a continuous permafrost systemAre substantial conclusions reached?
SomewhatAre the scientific methods and assumptions valid and clearly outlined?
Methods and assumptions need to be substantiatedAre the results sufficient to support the interpretations and conclusions?
There is some amount of speculation not supported by the results, but not overly muchIs the description of experiments and calculations sufficiently complete and precise to allow their reproduction by fellow scientists (traceability of results)?
No, further work on the Methods is necessaryDo the authors give proper credit to related work and clearly indicate their own new/original contribution?
YesDoes the title clearly reflect the contents of the paper?
YesDoes the abstract provide a concise and complete summary?
YesIs the overall presentation well structured and clear?
Mostly, I suggested some changesIs the language fluent and precise?
Some improvements can be made to the language, I suggested some but not allAre mathematical formulae, symbols, abbreviations, and units correctly defined and used?
YesShould any parts of the paper (text, formulae, figures, tables) be clarified, reduced, combined, or eliminated?
Some sentences should be deleted, perhaps a paragraph in the discussion. Further details should be provided in the Methods, and the statistics should be better described both in the Methods section and in their presentation throughout the Results section. Fig.1 could be improved, legend of Fig.2 should clarify the source of post-hoc / significance letters.Are the number and quality of references appropriate?
YesIs the amount and quality of supplementary material appropriate?
Yes- AC1: 'Reply on RC1', Mathias Goeckede, 16 Nov 2021
-
CC1: 'Comment on bg-2021-110', Cole Brachmann, 20 Jul 2021
The manuscript aims to determine the role of grazing in carbon cycling through CO2 and CH4 gaseous fluxes in wet tundra habitat by the means of the large-scale herbivore reintroduction experiment of Pleistocene Park. The authors measured ecosystem respiration (Reco), Net Ecosystem Exchange (NEE) and CH4 using chamber methods and a flow through gas analyzer over seventeen days in five different plots distributed over two sites, one for the grazed (GR) condition within Pleistocene Park and one for the ungrazed (UGR) condition located nearby to the park. Gross Primary Productivity (GPP) was also calculated from Reco and NEE. The fluxes were interpolated based on the chamber measurements, air and soil temperatures, and soil moisture conditions over the measurement period. There were differences in the fluxes between the site conditions, which were primarily attributed to grazing having a drying effect on the GR sites. These initial findings, if further verified with additional measurements as outlined below, could result in some important implications for the role of grazers on the tundra landscape. Overall, this paper hints at some very interesting connections between carbon cycling, environmental conditions, and grazers but require some additional measurements to support the bold claims as they are currently in the manuscript.
Major comments:
- The data are not enough to support the claims being made in the manuscript. The limited number of independent measurements and an unequal sampling design undermine the conclusions reached about the relationships. 17 days of measurements give an accurate estimate of the fluxes over that period, but do not necessarily represent the whole growing season. It is mentioned in the paper that these should be treated as a snapshot in time (especially for the GR plots), however, I do not believe the main takeaway points as they are written are properly taking that caveat into account which can result in some miscommunication of the strength of the findings. Additionally, only having two plots in the UGR condition, and only measuring those plots four times (4 days compared to 9 days for the three GR plots) makes accurate comparisons between the treatment types difficult for the full measurement period.
- The two selected UGR plots had large differences in their GPP and NEE measurements and may not be a good representation of these sites. Selecting additional plots from the 10 previously established UGR plots for measurements would help to more accurately determine average flux values. The individual UGR plots are also showing very similar fluxes as the GR plots, but not consistently (see table 3). For instance, UGR 1 have similar GPP and CH4 as the GR plots, while UGR 2 seems to bring down the average GPP in the UGR plots. In addition, the UGR plots were not measured on the same days. This clearly demonstrates how the low replications undermine their conclusions.
- Site differences between the GR and UGR plots make it difficult to determine if the differences in fluxes are actually due to grazing effects and not moisture itself. Stronger evidence of the GR plots being water-logged throughout the growing season ~30 years previous, and that the drying of the site is due to grazing, is necessary to solidify the link between grazers and fluxes. Alternatively, flux measurements on wetter areas in Pleistocene park, and dryer areas in the UGR site may help disentangle the effect of moisture from the effects of grazing.
Minor comments:
- L 21: “Based on expert assessment”, please delete.
- L 53: The drawbacks of measuring fluxes only in the growing season were mentioned, however, this study also only measured fluxes during a subset of the growing season. Consider leaving this to the discussion section as the reader expects some mention of a whole-year upscaling when it is mentioned early on in the introduction.
- In the introduction, there are multiple mentions of shrubs and the effect of shrubs on C dynamics (possibly due to a large amount of the reference studies coming from Scandinavia and focusing on reindeer browsing), but your sites are dominated by graminoids. I would suggest reframing the introduction to focus more on the effect of graminoids on C dynamics and their interaction with large herbivores. This is also not much elaborated in the discussion, and the introduction as it reads now give the wrong expectations on the manuscript.
- Suggest renaming the plots from grazed (GR) and ungrazed (UGR) to heavily grazed (HGR) and ambient grazed (AGR), respectively, unless there are no populations of grazing herbivores on the landscape at the ambient site (no information provided).
- L 200: Mann-Whitney U tests were brought up in the statistics section but I could not find the results or a figure on these tests. Since these measurements also are repeated measurements, you need to provide evidence that they are independent between days (your statistical unit) or perform statistical test considering the repeated measures.
- L 306-311: Coefficients of Variance (CV) were discussed to determine if the heterogeneity between plots were in an acceptable range. However, when compared to the paper cited as a reference for this metric (Davidson et al. 2002), the present study has half the number of total plots they are assessing over which could be a factor in the low values found. The Davidson et al. (2002) paper also suggests a formula for determining the number of measurements needed to ensure a decent variance around the mean, which could be a useful way to determine if the number of measurements taken are representative or if more measurements are needed. In addition, it is unclear what measurements the CV is calculated on. It should be the daily data, 4 measurements for UGR and 9 for GR.
- Equation 3, which corresponds to interpolating Reco from UGR plots according to section 4.2 (lines 320-322), includes the data from GR-3. The interpretation of data from the GR plots therefore differ from each other, and GR-3 is interpolated more accurately with the same formula as that for the UGR plots. This was mentioned on line 326 stating that the measurements are not representative across the GR plots, which poses problems for the final conclusions drawn regarding these plots.
- L 364-373: Is it possible to tie these vegetation changes into the differences in measured fluxes more directly? Maybe a reference on fluxes from tussocks vs. grass mats?
- L 373-375: Were the addition of CO2 and CH4 from grazers themselves factored into any calculation of total fluxes from the sites?
- Clarification of the prevalence of these wet tussock tundra sites within and outside of Pleistocene Park would be a useful addition when visualizing how these results may affect the larger arctic region.
- L 402: This sentence needs a reference at the end.
- L 403: “only very inefficiently”, consider revising.
- L 731 reference for Zimov et al. 2012, seems to have the incorrect initials for one author (F. S. Chapin).
- AC3: 'Reply on CC1', Mathias Goeckede, 16 Nov 2021
-
RC2: 'Comment on bg-2021-110', Anonymous Referee #2, 01 Sep 2021
Review of bg-2021-110
Grazing enhances carbon cycling, but reduces methane emission in the Siberian Pleistocene Park tundra siteGeneral comments
The authors provide a nice introduction into herbivory impacts on permafrost ecosystems. The study provides a very interesting insight into ecosystem changes under grazing pressure. The data set used is a measurement series of NEE and Reco, measured for two weeks at a grazed and an ungrazed site with several replicates. The observed flux changes in CO2 and CH4 are well described and put into relation with animal activity, such as soil compaction and drying, which shows a significant reduction in CH4 emissions from grazed sites.
The methods used are suitable to use for the provided explanation of these effects, however, the method description itself should provide more detail on the approach.
There are several further topics arising from this study, such as the influence of vegetation species on fluxes and how fluxes change throughout different seasons. It would be great to have more comparison to other studies regarding this.
There is a minor lack of context regarding the general hypotheses of the Pleistocene Park experiment as to why the findings from this study suggest a different effect of animal grazing than previously hypothesized by Zimov et al. (2005). The findings should also be discussed in relation to those hypotheses.Specific comments
Please consider making the data accessible via a scientific data repository.
L89: Please add a map indicating the sampling sites.
L91: There is a new paper by Reinecke et al. (https://doi.org/10.1038/s41598-021-92079-1) dealing with the Pleistocene Park vegetation in more detail, which you should consider here.
L151: Please describe the bootstrapping approach in more detail (number of iterations etc.).
L206: How did you test for significance?
Figure 2: For CH4, it should be clearly stated that these are emissions only. Using “fluxes” suggests a bi- or omnidirectional gas exchange.
Table 2: I assume “ns” means “not significant”? Please make the caption overall more clear. Also, please add something like “ungrazed sites (UGR-1 and -2) and grazed (GR-1, -2 and -3)” to the title of this table. I suggest, for uniformity, to switch axes of this table to make it similar to table 3.
L295: What about previous disturbances of the soil itself, especially in the active layer with freeze-thaw cycles? Please consider this in your manuscript
L357: These pre-existing site differences are very likely, taking the distance between the sites into account. Especially the differences in thaw depth (greater thaw depth at UGR) are opposing the general hypothesis of large animal impact on permafrost ground as a conservation mechanism, which is said
to mainly originate from snow compaction in winter. Maybe you should elaborate or highlight these a little more and discuss why your findings might differ from named hypothesis.
Figure A1: Please provide letters for each graph (e.g. as in figure A3). Also, adding the equation for each regression curve to the corresponding graph would be good.
Figure A2: Please see the comments on figure A1.
Figure A3: Please add the equations for each regression curve.
Figure A4: Please provide headlines for a), d) and g). Also, it should say somewhere in the graph (not only in the caption) that the graphs show CH4 emissions.Technical comments
Please make “C-Fluxes / C-fluxes / C fluxes” consistent throughout the paper. Maybe consider replacing flux considering my earlier comment
L99: Please put Betula nana in italics and capitalize, since it’s a species name. Also, please change “willow spec.” to “Salix sp.”
L100: Please change “lugens” to “C. lugens”.
L166: R Studio is just the main software. Please provide the used packages.
L170: Suggestion: “…not uniform across plots even at one site…”
L200: There is a leftover “?” in this line. Also, the test should be named “Mann-Whitney-U-test”.
Table 2 caption: inconsistency in * and spaces, please adjust
Line 283: please capitalizeReview criteria:
Does the paper address relevant scientific questions within the scope of BG?
Yes
Does the paper present novel concepts, ideas, tools, or data?
Yes
Are substantial conclusions reached?
Yes, to some extent
Are the scientific methods and assumptions valid and clearly outlined?
Yes, but method description could be more precise
Are the results sufficient to support the interpretations and conclusions?
Some interpretations are a little one-directional, but supported by the data
Is the description of experiments and calculations sufficiently complete and precise to allow their reproduction by fellow scientists (traceability of results)?
Somewhat, method explanation needs some more detail
Do the authors give proper credit to related work and clearly indicate their own new/original contribution?
Yes
Does the title clearly reflect the contents of the paper?
Yes
Does the abstract provide a concise and complete summary?
Yes
Is the overall presentation well structured and clear?
Yes
Is the language fluent and precise?
Yes, just very minor things indicated in the technical comments
Are mathematical formulae, symbols, abbreviations, and units correctly defined and used?
Yes
Should any parts of the paper (text, formulae, figures, tables) be clarified, reduced, combined, or eliminated?
The Methods section should receive more detailed information on the modelling approach. A figure (map) indicating the sites’ positions and relations would be nice.
Are the number and quality of references appropriate?
Yes
Is the amount and quality of supplementary material appropriate?
Yes- AC2: 'Reply on RC2', Mathias Goeckede, 16 Nov 2021
Peer review completion









