I thank the authors for taking their time to address all of my comments, which I admit were many. The impressive number of samples and analyses performed for this study is a step forward to understand the biogeochemical effects of mobile demersal fisheries, which is why I have thoroughly revised the response to my first review. I believe that there are still several aspects that are not quite clear and would like the authors to further clarify and address in their manuscript before publication.
I have separated the different comments based on the same numbering I employed in the first review, in order to avoid repetition and to indicate what comment and reply I am referring to. In this review, I only address the aspects that I believe the authors should further clarify. In my opinion, all the other several comments have been addressed.
1. Type of experimental design
In my first review, I asked the authors to clarify what type of experimental design they followed, since it was hard to identify based on the methods and results of the manuscript:
1. Control-Impact (CI), when an experimentally trawled site is compared with a control site
2. Before-After (BA), when the same site is sampled before the disturbance and after the disturbance
3. Before-After-Control-Impact (BACI), when (at least) one site is sampled before and after the disturbance (impact site), and (at least) another site is sampled at the same time (before and after) but is not disturbed. This approach also includes collecting additional samples in time after the disturbance to get a better temporal variability.
Clarifying and identifying what type of experimental design this study followed is crucial to then know what type of statistical analysis to employ. The authors clarify that their study follow a CI approach, but then mention that they collected samples in the impact site before the disturbance. Hence, it should be either a BA or a BACI approach, not a CI approach.
In their statistical analysis, the authors aggregate all their “control” (before impact?) and “impact” (after impact?) sites to assess if there are any statistically significant differences in the different parameters they studied. This aggregation is done with samples collected in different periods. The authors argue that they can aggregate this temporal data because “their results […] show that the difference due to trawling is larger than those caused by natural temporal variability in the CL areas”. What statistical analyses did the authors do to arrive to these conclusions? This should be clarified.
I would first perform a linear mixed-effects model that accounts for the disturbance (impact/control) time, and the interaction of both. This would allow the authors to properly assess if natural temporal variability is relevant. If the results of this analysis shows that temporality is not relevant, then it is justifiable to average the data in the figures. If the results of this analysis shows that temporality is actually relevant, then the authors can not average the data in the figures. Moreover, this would imply that temporality should be addressed when assessing the biogeochemical impacts of demersal fisheries, which is seldomly done.
The interpretations and conclusions of this manuscript depend on this, which is why I insist on a proper statistical analysis given the large number of samples collected.
2. Statistical analyses
Assuming that the statistical analysis employed is correct, the authors mention that TOU and TA are not significantly different among the impact and control sites. However, in several occasions in both the manuscript and the reply to my reviews, the authors then argue that there is a decrease in the TA flux in the HI site – this decrease is not statistically significant, right? Please be consistent. This is especially important in this study given the large number of proxies measured.
8. Alkalinity fluxes and their reasons
First of all, I want to re-iterate that according to the statistical analysis performed (which should also be revised), there is no statistical difference in the TA fluxes between the HI and CI sites (see my comments above). However, the authors go in great detail to explain that sediment disturbance has an effect on TA fluxes.
Assuming that this is a relevant process, the authors point that this reduction in TA fluxes is due to changes in the sulfate reduction. However, in my previous review I noted that the authors identified that there are no statistically significant changes in sulfate reduction or pyrite content. Isn’t this contradictory? Please clarify.
In addition, in my previous review, I asked about the influence of carbonate dissolution. To this, the authors replied: “We do not calculate carbonate dissolution for the HI site since here we are interested in the reduction in TA and DIC following trawling, which we ascribed to a reduction in POC degradation and carbonate dissolution (now mentioned). Denitrification (i.e. NO3 flux) is included in the mass balance (Eq. 8) to calculate carbonate dissolution.” I had to re-read these sentences several times since I was confused in the contradiction, which I point below to clarify:
- You don't calculate carbonate dissolution for the HI site because you associate the reduction in TA and DIC due to carbonate dissolution? So carbonate dissolution should be calculated, no?
- You include denitrification in the mass balance to calculate carbonate dissolution. So is carbonate dissolution calculated?
As you can see, it is not clear to me whether carbonate dissolution is calculated. If so, what are the values, and is it relevant? In their reply to my comment 7, the authors mention “Yes, the reduction in TA and DIC may be attributed to a decrease in the rate of POC remineralization and calcite dissolution.” As you can see, this is not at all clear to me.
Again, all of this is assuming that the changes in TA is statistically significant, which the authors previously say it is not.
9. Seafloor-water-air box model
I still find confusing adding two kind of disturbance (with and without changes in DIC and TA fluxes). A more thorough explanation is needed. This is beyond my area of expertise, and I believe other readers would like to understand this better.
Lines 325-336. Calculation of diffusion fluxes using FindFit function
The authors should use a fitting that provides measures of uncertainties within a specific confidence interval to be able to compare the diffusive fluxes across sites and see if they are significant while accounting for their confidence interval.
Lines 410-416. Description of POC, PON, and CaCO3 variations.
The authors provided the revised sentence with a description of POC, PON and CaCO3 variations across sites, which is exactly what I meant in my previous comment. However, this revised sentence should also include the results of the statistical analysis which are not statistically significant. |
This manuscript presents the results of a field investigation of the impact of dragging an otter trawl rope across the seafloor. This is a well-designed field study that makes great use of the unique in situ observational capacities of GEOMAR. It is overall well written and structured, and will be a valuable and nuanced addition to the current literature on the impact of mobile-bottom contact fishing (MBCF) on the marine carbon cycle. One comment on the writing is that I would urge the authors to consider tempering the tone of the text – hyperbolic words such as ‘severe’, ‘dramatic’, ‘substantial’ … are used often throughout the text, but they are not necessary given the nuanced nature of the data and main results of the study. While recent publications on MBCF impacts have tended to sensationalise, this is not aiding our understanding or helping to nuance the discussion around managing MBCF. A bit more scientific sobriety would be welcome in this specific field.
I also have a few concerns and suggestions on content that probably should be addressed before the manuscript will be ready for publication. I give some general comments here, with more detailed comments below.
I believe the authors will be able to address these, and I am looking forward for a revised version of this interesting manuscript.
Kind regards
Sebastiaan van de Velde
General comments:
The main observation is an overall reduction in benthic fluxes after disturbance, which the authors suggest is due to the erosion of the surface layer with more reactive organic matter and silicates. This is very likely correct, but throughout the discussion I feel this is sometimes forgotten (see my detailed comments on L544, L609, L694). The total impact of dragging ropes on the marine carbon cycle cannot be accounted for by only measuring before and after fluxes, as the fate of the eroded layer needs to also be considered. This should be better reflected throughout the MS.
Another effect that does not receive a lot of attention is the transient nature of the data. How much of the observed change in flux is transient due to the porewater build-up after erosion/mixing event – rather than reflecting actual changes of the biogeochemical pathways? For example, the way you calculate calculation RPOC_tot from the fluxes assumes steady-state, but if you flush out the top porewater, you will get a recovery phase where fluxes will be lower until the new steady-state is reached. So your estimation of RPOC after disturbance is an underestimation. Since the large variability in SR probably means the difference is not statistically significant can you confidentally say there is a difference?
My final comment relates to the model usage and claim of pyrite oxidation. While I think this is indeed a factor that needs to be consider, I don’t see any new evidence in the manuscript that this occurs. The modelling part of the MS, which is used to claim that pyrite oxidation is important, present essentially the same model runs (with minor variations) as previously done by (Kalapurakkal et al. 2025), and thus do not validate the conclusions of the earlier manuscript, nor do they bring much new to the table, since you are essentially getting the same results. At the very least, I would have expected the field data to be used to validate the model runs, but this is also not the case.
I would suggest that the authors reconsider the added value of the model simulations in their current form (see also my detailed comment below), and also ask them to consider how these results are presented, as the sentence in the abstract at L34 and L669 give the impression that this study is an independent validation of the earlier model results of (Kalapurakkal et al. 2025), which it is not.
Detailed comments:
Throughout: ‘bottom trawling’ – is supposedly colloquial, and a more accurate term is mobile bottom-contact fishing. Not so much for the experiment in this paper, as you are looking at otter trawl specifically, but for the more broad scope papers in the introduction.
Title: might be more appropriate to name that it was an otter trawl rope
L30: ‘supresses benthic mineralization’ – the reason that happens is because a lot of reactive POC is removed – so it does not so much supress it than displace it?
L50ff – might be worth to discuss the results of (Porz et al. 2024; Zhang et al. 2024) as well in the light of the carbon sequestration debate
L82ff – I would include the papers that show/discuss the importance for pyrite formation as an alkalinity source (Hu and Cai 2011; Reithmaier et al. 2021). And it might also be interesting to bring in our recent global estimate of the chronic impact of repeated trawling (van de Velde et al. 2025) – especially since there is a first-order estimate of alkalinity loss for the area in which you did the experiment (also see the SI for a long-term reduction in TA flux due to chronic trawling). There is also an estimate of how import each individual process is for shelf sediment alkalinity generation.
L223: so you had the instruments (nutrient analyzer, alkalinity titrator, etc.) on board the ship?
L288: unclear, is this 1 to 50 or 150 or?
Section 2.8 – Curious, how does this compare to the DIC flux?
You can also do a similar exercise by including DIC and TA fluxes and making a similar mass budget, this time including carbonate dissolution and pyrite/FeS burial
Figure 4: maybe say ‘sediment cast’, which makes it easier to directly understand the figure
L432: but higher up you assume a different oxidation state for your organic carbon? Why not be consistent?
L454: lower near the surface, as they become higher at depth in the higher impact areas?
L486: why not include this in this manuscript?
L514: why surprisingly? You just describe yourself that your site is at the threshold where bottom-water O2 is controlling the O2 flux – so removing POC should not affect the O2 flux.
L518: could it also have to do with sediment type? A sandy sediment might be more prone to porewater flushing due to the disturbance, where a muddy sediment would be less. I can then see how muddy sediments would show higher fluxes right after recovery if the porewater is mixed rather than flushed out.
L533ff: are the studies you mentioned not directly determining the denitrification rates through modelling, isotope pairing, or N2/Ar fluxes? Whereas you are comparing it to the NO3 flux alone – which is not the same?
L544: I don’t think I agree with that statement – if the loss of fluxes is due to the erosion and removal of the reactive surface layer, you need to also account for the fate of that surface layer before you can make claims about the impact. If the POC gets remineralized in the water column, you still produce the nutrients, so you don’t affect the productivity.
L575: and probably most importantly: the nature of the organic matter itself (age, origin) – which to a large extent will determine its sensitivity to environmental conditions.
L590: in what way? Coarser grain size = bigger difference?
L590: Our study from the anoxic Baltic Sea suggests that low mineral protection (high OM concentrations and low sediment accumulation) leads to high mineralization rates, even under anoxic conditions (van de Velde et al. 2023; Placitu et al. 2025). This indicates that the lack of mineral protection leads to no difference in oxic versus anoxic conditions – rather the inverse of the interpretation of the results of Kalapurukkal.
Could it be that the results of Kalapurukkal actually show the effect of desorption and the age of the organic matter? Fine-grained sediments would protect OM from mineralization, meaning more reactive fractions remain. When incubated in suspension, desorption occurs in both oxic and anoxic conditions – and since more reactive OM fractions show little difference in mineralization rate under oxic or anoxic conditions (see the earlier work of, e.g., (Kristensen et al. 1995)), you observe little difference.
With coarse-grained sediments, there is little mineral protection, and the more reactive fractions have quickly reacted away. When you then incubate the sediment in suspension, the less reactive OM fractions show differences in mineralisation in oxic versus anoxic conditions.
This would lead to a slightly different mechanistical interpretation of the results and would reconcile it with our findings. It is not the grain size that controls the response of mineralization in oxic versus anoxic conditions, but grain size that controls which OM fractions are retained in the sediment – and this eventually is reflected in the resuspension experiments.
L606: also considering including our recent global estimate (van de Velde et al. 2025), and papers that discuss that sedimentary pyrite burial is an important source of alkalinity (Hu and Cai 2011; Reithmaier et al. 2021)
L609: but what about the fate of the resuspended material?
L611: ‘dramatic’ – a bit over-the-top, since you show a temporary reduction in fluxes, how does that say anything about ecosystem functioning or structure?
L612: ‘to the best of our knowledge’ – remove sentence, this does not add to the manuscript
L634: this – interestingly – is exactly the number that comes out of our global modelling exercise (van de Velde et al. 2025), and also close to the numbers of (Krumins et al. 2013). Would also be worth referencing some earlier work on carbonate dissolution in muddy sediments (Aller 1982; Green and Aller 2001)
L650: Is there no data from trawling intensity/disturbed area for the region you are studying? Would probably be worth checking (e.g., (Amoroso et al. 2018) or (Eigaard et al. 2017; Rickwood et al. 2025)) to do more realistic simulations or some sensitivity tests.
L652: So you assume no impact on carbonate dissolution? Why? It is your biggest source, and if you reduce organic matter mineralization in the sediment, you will reduce porewater acidification and this carbonate dissolution rates? Note that our model study did not find any impact, but we did not erode the top layer, but let it settle after resuspension.
L657: The way this model runs are explained are a bit confusing to me – ‘no impact’ is still impact, right? You induce mixing and get pyrite reoxidation? So the only difference with the ‘standard run’ is that you force the benthic fluxes – but are those not a consequence of the disturbance? Should you then not use your observed fluxes to validate the model, rather than run the model to upscale something which is actually not based on your observations?
L671: the paper from Kalapurakkal is a bottle incubation experiment, so I would not really say this paper shows that it happens in reality. The paper suggests that pyrite oxidation is more important that the organic matter impact – and this study seems to be a validation – or at least should be, because at the moment it seems you are using their paper to claim pyrite oxidation happens, without actually showing any data that backs that claim.
L673: but what drives that reduction in alkalinity fluxes? Pyrite oxidation should be reflected in these fluxes as well.
L694: only because you do not account for the fate of the resuspended material
L699: but you do not present evidence for the oxidation of pyrite ?