the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Dissolved organic matter signatures in urban surface waters: spatio-temporal patterns and drivers
Clara Romero González-Quijano
Sonia Herrero Ortega
Peter Casper
Mark O. Gessner
Gabriel A. Singer
Download
- Final revised paper (published on 10 Jun 2022)
- Supplement to the final revised paper
- Preprint (discussion started on 05 Jan 2022)
- Supplement to the preprint
Interactive discussion
Status: closed
-
RC1: 'Comment on bg-2021-340', Anonymous Referee #1, 16 Jan 2022
Overall, this is an interesting dataset and questions.
The results and discussion are somewhat challenging. The discussion and results lack a clear organizing structure. Some of the relationships claimed in the analysis have no clear/singular interpretation. For example, the authors say that C2 and C8 are associated with wastewater because authors have said they were associated with wastewater in other study in other regions of the world and because they are associated with TrOCs. A similar statement is made about nutrients. The problem, however, is that all of the components ordinate in the same direction. So all of the components are correlated with wastewater and all of the components are correlated with nutrients (at least on RDA 1). What then makes you focus on a few components over others?
Some more methodological details are needed on the PARAFAC process. Which software and how did you handle validation. An overlay of the split halves would be nice to see on the plot of the PARAFAC model. It helps the reader evaluate the quality of the model.
A number of times DOM diversity is equated with functional diversity ‘in the aquatic system’ and I don’t seem much evidence of this or a framework built for it. Connection DOM composition to ecosystem functioning is still pretty speculative (to be clear it is speculation I support, just still feel it has a long way to go). In particular the authors point to the diversity of DOM as an indicator of diversity or functional diversity in the aquatic system, but provide little evidence why it should be so. Clearly, DOM diversity is an indicator of the diversity of watershed processes both natural and anthropogenic – source diversity if you will. Is that function ‘in the aquatic system’ or function in the watershed? I would argue that it is the latter. However, with respect to the watershed you only ever look at a 50 M buffer (see detailed comments on this below).
In several places hydrology and runoff are presented as the cause of an observed relationship, but there is no mention of any aspect of the study design that evaluates hydrology. E.g. “…for example, was formerly connected to a sewage farm and appeared to be influenced by previously unrecognized storm water runoff that likely delivered inputs during heavy rain. “ No storm sampling was ever discussed, no pre-post sampling that would disambiguate this. There are just a lot of instances of statements and conclusions that are not or are not unambiguously supported by collected data.
All that said. This is an interesting dataset and general question, I do encourage the authors to develop it further and focus on the clear and well-supported interpretations of the data.
Specific comments
39 ‘failure of citizens’ … inappropriate and subjective statement. You blame the public, but have scientists properly communicated the issue to the public? Rather adversarial language that will only function to pit the general public against science. Why make an enemy?
47-48 Subjective. What is the purpose of monitoring? What is the endpoint. Often it is something much larger like ensuring healthy available habitat for human or animal use. If a primary driver of healthy habitat for animals is the availability of oxygen in the water, is that really a ‘narrow focus’ or is it the focus that is appropriate for monitoring given the monitoring goals. I think you would be better arguing that high resolution approaches can expand the suite of bigger picture ecosystems states that can be monitored with DOM.
78-80 More info on this. What about these sites, what type of pollution do they represent. All the same type/intensity, different types?
84-86 Why only a 50 buffer? Why not a series of buffers to determine what the spatial scale is that is most relevant. The water interconnections in an urban ecosystem are complex, I doubt 50m captures the reality of the source areas. See Kaushal and Belt 2012.
105-110 Did you collect and process any blanks?
105-110 Was iron measured in any of these samples? This can have significant effects on optical DOM determination and is often elevated as it runs through urban infrastructure.
123 A few things here. This is almost universally abbreviated FI and not FIX. You are using the wavelengths for you calculations for samples corrected for instrumental bias. This is appropriate. However, the citation you reference here was based on FI values calculated from a the old wavelengths that were not corrected for instrumental bias. McKnight updated this in Corey et al. 2010 and it makes a significant difference in the reference values of allochthonous and autochthonous endpoints. Lastly in heavily impacted urban systems, the classical interpretation of FI as developed by McKnight may simply not be applicable. You may be getting a 1.2 or a 1.9, but it may not mean the same thing as it would in a more natural system.
126-127 Would like to see the split half validation overlaid on this PARAFAC model (Fig A1). Overall more details on the PARAFAC modelling process used would be nice.
286-288 Does it reflect high functional diversity across the ‘aquatic network’? So far it would seem to suggest a variety of inputs or a diversity of input. I don’t know if it says anything about what is going on in terms of fucntional/metabolic processes in the aquatic network. Also consider what 'functional diversity' means and what is 'desirable' vs. 'undesirable.' High functional diversity might be due to the wide range of degradation states that stream in an urban landscape may be experiencing.
306-308 Could be, but you have provided no information on the hydrologic conditions at the time of sampling. Also within a season you haven’t sampled during runoff conditions and during ‘base flow’ conditions to determine if there is a difference.
313-315 weak inference. All of your components ordinate in the same direction of TrOCs. Also how did you establish the link to WWTPs. Is it just based on what other people said who found similar looking components?
316-317 I would think that the greater abundance of light might be as big or a bigger factor than nutrients.
320-321 Again, not sure I see where that statement comes from. All of your DOM components ordinate in the same direction not just C2 and C8. C1,2,4,5,7,6,8 (what happened to C3?) are all pretty well correlated with elevated nutrients on the primary RDA axis. It just seems like increased fluorescence is associated with increased nutrients.
340-341 why would you propose green space as a proxy for paved surfaces when you said you measured paved surfaces earlier?
353 I don't know if your map is showing urban heterogeneity or not. I mean, none of this is clearly linked to urban influences (clearly some of it has to be). I just don't think the data and analyses you have presented lead to strong support for this statement.
374-375 what do you mean by that? This study is based on single grab samples and average data? Most monitoring is part of a broader survey. This needs to be clarified.
384 How are you coming to this conclusion? You have presented no information that you ever sampled storm runoff?
385 This is the first time it is mentioned. You should talk about this up in the sites section of the methods. Overall, a map showing the location of WWTPs would be very helpful. The WWTPs are being treated as a bit of an afterthought in the analysis when I feel like you should be framing your study and analysis around them.
388-389 how do you know it “actually” received the inputs anything you have showing the hydrologic connectivity to a WWTP would be appreciated.
404-405 What was actually detected? Which optical properties? Fluorescence? All the components ordinate in the same direction. TrOCs seem to be more a function of increasing DOC fluorescence overall. In this particular case for Berlin, I would then argue that the simplest thing to do is to measure FDOM fluorescence as an aggregate value as opposed to the finer resolution.
Citation: https://doi.org/10.5194/bg-2021-340-RC1 -
AC1: 'Reply on RC1', Clara Romero, 17 Mar 2022
Overall, this is an interesting dataset and questions.
Thank you
The results and discussion are somewhat challenging. The discussion and results lack a clear organizing structure.
We will improve the links between the Methods, Results and Discussion sections to better structure the manuscript. We realized that information about some sites was first mentioned in the Discussion only, which made it difficult to follow. Therefore, we will describe the sites with greater detail in Methods. This concerns particularly information on WWTP effluents and other specific features of various sites. We will also introduce the idea of DOM monitoring in the Introduction and return to it in the Discussion.
Some of the relationships claimed in the analysis have no clear/singular interpretation. For example, the authors say that C2 and C8 are associated with wastewater because authors have said they were associated with wastewater in other study in other regions of the world and because they are associated with TrOCs. A similar statement is made about nutrients. The problem, however, is that all of the components ordinate in the same direction. So all of the components are correlated with wastewater and all of the components are correlated with nutrients (at least on RDA 1). What then makes you focus on a few components over others?
The similar behavior of the various PARAFAC components was caused by using raw absolute fluorescence values without any normalization. Prompted by another reviewer comment, we will normalize PARAFAC components by DOC concentration. As a result, compositional changes of DOM will emerge more clearly, which also alleviates the problem of non-singular interpretation. In particular, the humic components C3, C4 and C5 point in a different direction than the other PARAFAC components. Furthermore, a new correlation analysis shows that C3, C4 and C5 were not or negatively correlated with the TrOCs, whereas the other PARAFAC components were positively correlated with the TrOCs. Note that apart from their relation to the TrOCs, C2 and C8 have been previously found in WWTP effluents.
Some more methodological details are needed on the PARAFAC process. Which software and how did you handle validation.
Details of the PARAFAC process were provided in the supplement of the manuscript (see lines 1-20 there). This includes information on the software we used and how we handled validation. In the revised manuscript, we will further expand this description in the appendix. Please also see the response further below.
An overlay of the split halves would be nice to see on the plot of the PARAFAC model. It helps the reader evaluate the quality of the model.
We will add a graph showing results of the split-half validation of the final PARAFAC model to the appendix as suggested.
A number of times DOM diversity is equated with functional diversity ‘in the aquatic system’ and I don’t seem much evidence of this or a framework built for it. Connection DOM composition to ecosystem functioning is still pretty speculative (to be clear it is speculation I support, just still feel it has a long way to go). In particular the authors point to the diversity of DOM as an indicator of diversity or functional diversity in the aquatic system, but provide little evidence why it should be so. Clearly, DOM diversity is an indicator of the diversity of watershed processes both natural and anthropogenic – source diversity if you will. Is that function ‘in the aquatic system’ or function in the watershed? I would argue that it is the latter.
There appears to be agreement with the reviewer about the idea that DOM can encapsulate information about ongoing or past processes. However, it is less clear how such information may best be exploited. We investigated a set of diverse urban ecosystems to assess whether expectedly strong environmental gradients arising naturally across sites affect DOM diversity, which can refer to the diversity of DOM compounds at a given site (alpha diversity), or to the compositional turnover of DOM across sites (beta diversity). In both cases, the rationale behind the linkage of DOM diversity and functional diversity is the idea that individual processes leave an imprint on DOM composition, for instance by generating specific compounds. We will clarify these aspects in the revised manuscript.
However, with respect to the watershed you only ever look at a 50 M buffer (see detailed comments on this below).
Please see our response below.
In several places hydrology and runoff are presented as the cause of an observed relationship, but there is no mention of any aspect of the study design that evaluates hydrology. E.g. “…for example, was formerly connected to a sewage farm and appeared to be influenced by previously unrecognized storm water runoff that likely delivered inputs during heavy rain. “ No storm sampling was ever discussed, no pre-post sampling that would disambiguate this. There are just a lot of instances of statements and conclusions that are not or are not unambiguously supported by collected data.
Our objective was to characterize DOM composition in diverse water bodies within an urban setting and relate it to site characteristics. We could not directly study runoff effects during precipitation events. Nevertheless, we can relate DOM composition to specific site characteristics such as imperviousness in the surroundings or upstream WWTP that make some sites more vulnerable to stormwater runoff than others. Importantly, DOM composition has potential to provide integrative information, which could inform about runoff legacies even though we did not specifically sample before and after storm events.
To clarify these points, we will provide more site-specific information in the revised manuscript. Specifically, we will provide information why some sites are more susceptible to surface runoff than others (as inferred from large proportions of impervious surface area within the 50-m strips adjacent to our sampling sites) as well as wastewater inputs (downstream sites in receiving streams) during storm events. To do so, we will add information on WWTP outlets potentially affecting our sampling sites both in Figure 1, the main text (first in Methods) and the table that was formerly in the Supplement and will be moved to the appendix. This will also help clarify the linkage between all the measurements taken and the Discussion, and make it easier to follow. We will further provide precipitation data showing that our own sampling did not occur during or shortly after heavy precipitation events. We will show this in a new figure in the appendix. We will also refer to the idea of storm sampling in the context of (potentially continuous, real-time) DOM monitoring.
All that said. This is an interesting dataset and general question, I do encourage the authors to develop it further and focus on the clear and well-supported interpretations of the data.
Thank you very much.
Specific comments
39 ‘failure of citizens’ … inappropriate and subjective statement. You blame the public, but have scientists properly communicated the issue to the public? Rather adversarial language that will only function to pit the general public against science. Why make an enemy?
Thank you for drawing our attention to the fact that this statement could be perceived as being offensive, which was of course not our intention. We will rephrase the sentence to “This and the limited recognition of urban freshwaters as…”
47-48 Subjective. What is the purpose of monitoring? What is the endpoint. Often it is something much larger like ensuring healthy available habitat for human or animal use. If a primary driver of healthy habitat for animals is the availability of oxygen in the water, is that really a ‘narrow focus’ or is it the focus that is appropriate for monitoring given the monitoring goals. I think you would be better arguing that high resolution approaches can expand the suite of bigger picture ecosystems states that can be monitored with DOM.
We will rephrase the sentence to “These approaches do not take advantage of the extreme diversity of DOM observed in freshwaters, which potentially can provide insights into water quality status and the factors driving it, complementing established procedures”.
78-80 More info on this. What about these sites, what type of pollution do they represent. All the same type/intensity, different types?
We will add information about these sites in the table in the supplement (appendix in the revised manuscript): nature of the sites (i.e. natural vs artificial), channelization, receiving streams of WWTP effluents. As stated above, we will also add information on potential WWTP inputs in Figure 1a.
84-86 Why only a 50 buffer? Why not a series of buffers to determine what the spatial scale is that is most relevant. The water interconnections in an urban ecosystem are complex, I doubt 50m captures the reality of the source areas. See Kaushal and Belt 2012.
The 50-m strips we chose were supposed to capture influences in the vicinity of the sites, i.e. influences of the riparian zone and somewhat farther away but not from the whole watershed, which is difficult to define in urban areas. This choice enabled us particularly to distinguish between urban sites adjacent to paved surfaces and others in green spaces. Note also that Tufekcioglu (2020) and Johnson (2005) used buffer zones of similar size and a study on ponds by Declerck (2006) used a range of widths (ranging from 50-3200 m) and found 50 and 100 m to be most appropriate to assess land-cover effects.
105-110 Did you collect and process any blanks?
We used ultra-pure water as blank. This information is given in the former supplement (appendix in the revised manuscript) where method details are described.
105-110 Was iron measured in any of these samples? This can have significant effects on optical DOM determination and is often elevated as it runs through urban infrastructure.
We did not measure iron in our water samples, but a recent study on the River Spree in Berlin indicates that total iron concentrations are <0.3 mg/l (Friedland et al., 2021). Concentrations in the other water bodies we sampled are likely to be lower. Nevertheless, since we cannot rule out Fe-DOM associations in our samples, we will add a short discussion on the potential interference of Fe with our optical DOM signatures. We will also provide a rough assessment of the potential importance of iron for water colour and absorption-based data by calculating abs420/DOC and compare these data with (i) literature values and (ii) available from the Senate database iron concentrations in Berlin´s surface waters.
123 A few things here. This is almost universally abbreviated FI and not FIX. You are using the wavelengths for you calculations for samples corrected for instrumental bias. This is appropriate. However, the citation you reference here was based on FI values calculated from a the old wavelengths that were not corrected for instrumental bias. McKnight updated this in Corey et al. 2010 and it makes a significant difference in the reference values of allochthonous and autochthonous endpoints. Lastly in heavily impacted urban systems, the classical interpretation of FI as developed by McKnight may simply not be applicable. You may be getting a 1.2 or a 1.9, but it may not mean the same thing as it would in a more natural system.
Thank you for sharing these insights. We will replace FIX by FI, use Cory et al. (2010) as the reference, and revise the discussion to point to the limited applicability of McKnight’s framework to urban waters.
126-127 Would like to see the split half validation overlaid on this PARAFAC model (Figure A1). Overall more details on the PARAFAC modelling process used would be nice.
We will add the split-half validation as suggested in the appendix. The PARAFAC modelling was described in the supplement (now moved to the appendix), but it will be expanded particularly to provide information on cross-validation. See also the response above.
286-288 Does it reflect high functional diversity across the ‘aquatic network’? So far it would seem to suggest a variety of inputs or a diversity of input. I don’t know if it says anything about what is going on in terms of fucntional/metabolic processes in the aquatic network. Also consider what 'functional diversity' means and what is 'desirable' vs. 'undesirable.' High functional diversity might be due to the wide range of degradation states that stream in an urban landscape may be experiencing.
We agree with the reviewer that the processes we think of as leaving an imprint on DOM composition include internal processes (such as intense primary productivity) as well as watershed processes (such as terrestrial-aquatic coupling through allochthonous inputs). Both types of processes shape functioning of the various ecosystems, specifically as our study encompasses small as well as large lentic and lotic ecosystems. Our point here is simply that high diversity of DOM translates to potentially high information content about these various processes. We agree that the phrase “across the aquatic network” may be misleading and we will replace it by “across the urban landscape”. We note that a value judgment (‘desirable’ vs ‘undesirable’) is not needed, indeed degraded ecosystems to some extent add to the ecosystem-level diversity of urban environments.
306-308 Could be, but you have provided no information on the hydrologic conditions at the time of sampling. Also within a season you haven’t sampled during runoff conditions and during ‘base flow’ conditions to determine if there is a difference.
All samples were taken during base flow conditions, which we will clarify by including a graph on precipitation overlain on our sampling periods. We will also clarify that we do not refer to immediate effects of high flow, WWTP overflow etc. but to legacy effects that we expect to differ among sites depending on impervious surface area in the surroundings of the sites and other factors. See also the response above.
313-315 weak inference. All of your components ordinate in the same direction of TrOCs. Also how did you establish the link to WWTPs. Is it just based on what other people said who found similar looking components?
We will strengthen the discussion based on two changes in the expression of our data that we will make. First, we will replace the PCA based on TrOC data by an aggregate measure of TrOC abundance, which we interpret as an indicator of the influence of WWTP effluents. Secondly, we will normalize the PARAFAC data by expressing them as a proportion of total DOC, which will enhance the information value of the PARAFAC components as indicators by emphasizing qualitative differences in DOM composition (see also above).
316-317 I would think that the greater abundance of light might be as big or a bigger factor than nutrients.
High nutrient levels would be expected to correlate with a DOM signature reflecting autochthonous primary production as the source. That, however, was not the case in our study, where high nutrient levels were rather related to WWTP effluents. We will revise the paragraph to clarify this point.
320-321 Again, not sure I see where that statement comes from. All of your DOM components ordinate in the same direction not just C2 and C8. C1,2,4,5,7,6,8 (what happened to C3?) are all pretty well correlated with elevated nutrients on the primary RDA axis. It just seems like increased fluorescence is associated with increased nutrients.
The C3 label was inadvertently omitted in the graph. We will include it in the revised manuscript. The problem of the lack of differentiation will be alleviated by normalizing the PARAFAC data, as explained above.
340-341 why would you propose green space as a proxy for paved surfaces when you said you measured paved surfaces earlier?
While paved surfaces and green space are to some extent (negatively) correlated and may thus be hard to separate as controls, it was the variable green space that was identified as significant in the RDA. In the revised manuscript, we will directly discuss the meaning of green space in a more straightforward way and bring up important mechanisms such as facilitated soil infiltration and input of allochthonous organic matter.
353 I don't know if your map is showing urban heterogeneity or not. I mean, none of this is clearly linked to urban influences (clearly some of it has to be). I just don't think the data and analyses you have presented lead to strong support for this statement.
Different colours in the Figures 1b and 1c translate to different DOM compositions. That these colours do not cluster indicates that the maps illustrate the spatial turnover of DOM composition across the urban area. We will include this rationale in the legend of Figure 1 or the main text.
374-375 what do you mean by that? This study is based on single grab samples and average data? Most monitoring is part of a broader survey. This needs to be clarified.
We will revise the paragraph to clarify that we refer to changes in DOM composition over time (or compositional turnover) and the potential of using analyses of DOM composition as a complementary integrative measure for urban freshwater monitoring.
384 How are you coming to this conclusion? You have presented no information that you ever sampled storm runoff?
This conclusion is based on very high levels of various variables we measured, especially NH4+ concentrations, and historical information that the site used to receive stormwater runoff in the past. We will provide more information about this site in the Methods section.
385 This is the first time it is mentioned. You should talk about this up in the sites section of the methods. Overall, a map showing the location of WWTPs would be very helpful. The WWTPs are being treated as a bit of an afterthought in the analysis when I feel like you should be framing your study and analysis around them.
As suggested, we will add information on potential WWTP influences to Figure 1a and the table in the Appendix describing the sites.
388-389 how do you know it “actually” received the inputs anything you have showing the hydrologic connectivity to a WWTP would be appreciated.
As stated just above, we will add information about which sites could have received WWTP effluents upstream.
404-405 What was actually detected? Which optical properties? Fluorescence? All the components ordinate in the same direction. TrOCs seem to be more a function of increasing DOC fluorescence overall. In this particular case for Berlin, I would then argue that the simplest thing to do is to measure FDOM fluorescence as an aggregate value as opposed to the finer resolution.
We will clarify that we refer here to the PARAFAC components indicative of WWTP effluents. As explained above, with the new PCA, using normalized components, all the components do not ordinate in the same direction.
References
Cory, R. M., Miller, M. P., McKnight, D. M., Guerard, J. J., and Miller, P. L.: Effect of instrument-specific response on the analysis of fulvic acid fluorescence spectra, Limnology and Oceanography: Methods, 8, 67-78, https://doi.org/10.4319/lom.2010.8.67, 2010.
Declerck, S., De Bie, T., Ercken, D., Hampel, H., Schrijvers, S., Van Wichelen, J., Gillard, V., Mandiki, R., Losson, B., Bauwens, D., Keijers, S., Vyverman, W., Goddeeris, B., De meester, L., Brendonck, L., and Martens, K.: Ecological characteristics of small farmland ponds: Associations with land use practices at multiple spatial scales, Biological Conservation, 131, 523-532, https://doi.org/10.1016/j.biocon.2006.02.024, 2006.
Friedland, G., Grüneberg, B., and Hupfer, M.: Geochemical signatures of lignite mining products in sediments downstream a fluvial-lacustrine system, Science of The Total Environment, 760, 143942, https://doi.org/10.1016/j.scitotenv.2020.143942, 2021.
Johnson, M.R., and Zelt, R.B., Protocols for Mapping and Characterizing Land Use/Land Cover in RiparianZones: U.S. Geological Survey Open-File Report 2005-1302, 22 p. 2005
Tufekcioglu, M., Schultz, R. C., Isenhart, T. M., Kovar, J. L., and Russell, J. R.: Riparian Land-Use, Stream Morphology and Streambank Erosion within Grazed Pastures in Southern Iowa, USA: A Catchment-Wide Perspective, Sustainability, 12, 6461, 2020.
Citation: https://doi.org/10.5194/bg-2021-340-AC1
-
AC1: 'Reply on RC1', Clara Romero, 17 Mar 2022
-
RC2: 'Comment on bg-2021-340', Clayton Williams, 29 Jan 2022
General Comments
González-Quijano et al.’s manuscript seeks to understand how dissolved organic matter (DOM) pools are structured in an urban ecosystem. The manuscript further describes how well DOM quality relates to conventional water quality monitoring measurements and asks if DOM would be a useful indicator of water quality. The study used three different approaches to characterize the organic matter pool. I think the manuscript would be of interest to a broad audience and provides a useful and complex dataset. The results did well at describing the main multidimensional patterns in the data without focusing to heavily on individual data specifics. I think the multivariate approach used in the paper has merits, but I think adjustments to the approach would be useful. First, the trace organic compounds (TrOC) PCA overcomplicated the manuscript and made it challenging to connection microcontaminant loads with urban pollution. Second, I think the DOM PCA could be focused by reducing the number of variables used and by presenting the parallel factor analysis (PARAFAC) results as percent of Fmax or relative to DOC. I think that both of these changes to the DOM PCA would allow the PCA and RDA to better highlight the data and connections between urban water quality markers. These adjustments will likely also meaningfully influence how DOM optical properties related to mass spectrometry results. Finally, I think the manuscript highlights an important topic, using DOM optical properties as a management tool. The current framing of the manuscript could be altered to better bring out this point. I think the "DOM as a monitoring tool" argument would be strengthened by adding broader statements explaining what makes DOM ideal for management, adding a hypothesis around the RDA between urban pollution drivers, and more fully explaining what basic knowledge is missing. Below I provide more detail around these main points for the author’s consideration as well as other specific suggestions
Specific Suggestions
Abstract & manuscript framing – I think the study would be better set up if “basic” was explained in more detail. I am curious to know what connections are missing and how or why this information is needed to better understand DOM composition in urban ecosystems. I think it would be useful if the abstract reconnects the expected high DOM diversity to “filling in the basics”
Discussion section 4.3 & manuscript framing – this seems like the main objective of the study and is a valuable argument to be made and supported. I think some of this framing is lost in the methods and results. It would be useful for the reader if perhaps more direct statements of hypotheses were made that connect more traditional water quality measures to the potential use of DOM in water quality monitoring.
Methods, 2.1 study sites – I think it would be useful to provide the size cutoff for streams vs rivers as was done for lakes and ponds.
Methods - Given the differences in habitat and the location of primary producers in Rivers, Streams, Ponds, and Lakes, I am not certain CHL should be used as a proxy for trophic state. Benthic algae are often abundant in urban streams. I think CHL should be removed from RDA because I don’t think it’s a comparable measure of eutrophication between lotic and lentic systems.
Methods - I did not understand what the hypotheses were in RDA around DOM drivers. It was unclear what possible drivers were measured and then how they were applied to see changes in DOM. Perhaps more explanation is needed for readers like me with less experience using RDA and also to strengthen this analysis' connection to using DOM in urban monitoring.
Methods & Result - There are an impressive number of variables determined in this study. Many of which correlate or are a proxy for the same type of measure. For example, molecular weight is approximated through three optical indices and measured more directly with liquid size-exclusion chromatography. Given the complexity of the manuscript’s dataset, it might be easier for the reader if only one variable that measures or estimates a DOM property was used. In my experience, especially given the inherent correlation between DOM characteristics, redundancy of multiple variables targeting the same DOM attribute are not needed. For example, I suggest only using S275-295 or SR as the optical indicator of DOM molecular weight. For comparison, its fine to keep all measures in the appendix but, for the main body of the paper and multivariate analysis, I think it would be easier for the reader to understand the results if only S275-295 or SR was used as the optical indictor of size. One final note: Short slope (S275-295) should be positive (see Fichot & Benner 2012 L&O 57(5):1453).
Methods & Results – I am not certain the TrOC PCA is necessary. The purpose is to show micropollutant load. I worry about the below detection limit impact of ponds on the PCA loadings and scores. It seems like an easier and more staightforward metric that would also show micropollutant load is to sum all TrOCs and report a total TrOC concentration. This way, the reader does not need to remember three different PCAs and use a relative measure of load, when the sum of TrOCs would provide an understandable indicator of micropollutant load.
Methods & Results – I think the DOM PCA would structure better around DOM quality if PARAFAC components were set as percentages of Fmax or relative to DOC rather than RU. RU tends to follow concentration rather than clearly line up to quality measures and I think this is why all PARAFAC components point in the same direction in the DOM PCA. For example, DOC was significantly higher in streams. The DOM PCA water body type clusters follow fairly well this DOC concentration gradient, with all the PARAFAC components increasing in intensity toward places that had higher DOC. This would make quantity rather than quality the main driving force behind the PCA loadings. I agree that there is some separation of allochthonous to autochthonous sourced DOM along PC1 but the PARAFAC components did not follow the expected pattern based on quality. I am guessing that making the PARAFAC components relative, will cause them to line up much better across the source gradient. Using PARAFAC components as a percent of total Fmax or standardized to DOC (RU/DOC) might also provide better overlap between optic and FT-ICR-MS properties of DOM and help DOM optical estimates of similar compositional properties better align in the PCA space. The last paragraph of the results describes the FT-ICR-MS results clearly and in summarized way the read can understand. However, the PC comparisons don’t always match with the optical multivariate space. For example, C6 & C8 are negative on PC1 and positive on PC2 suggesting there is more protein in that quadrant, but the comparison with FT-ICR-MS indicates that N containing compounds are positively related to PC1. Similar conflicts arise with humics. I think the reason for this is the PARAFAC components are being driven by quantity rather than quality. By making PARAFAC relative, then the resulting re-analyzed PCA might track more expectedly with FT-ICR-MS patterns.
Discussion, around line 320 - These are interesting ideas and useful points. I wonder if the lower B:A in streams reflects that the WWTP degrades DOM and the effluent is highly processed, while in lakes and ponds there is more new production of DOM resulting in a higher index score.
Figure 1 – I like the style of this figure. I think it shows the sites and results well. For the caption, the reader needs more information. At this stage in the manuscript, it is unclear what variables are included in the PCA scores and what the patterns mean. It would be useful for the reader if a plan language interpretation of each PC axis was given so the reader could quickly understand the pattern
Supplemental Information – I am not certain why the SI is needed given the large set of appendices. Why not move table S1 to the appendix and incorporate the DOM analysis information into the main methods. At least for, FT-ICR-MS the SI methods are very similar to the main methods.
Table A4 - Should the FIX for rivers be 1.68 rather than 0.68? 0.68 is well outside the normal range for FIX and could indicate an issue with how EEMs were processed, contamination or scanning error, or a coding error for the calculations
Citation: https://doi.org/10.5194/bg-2021-340-RC2 -
AC2: 'Reply on RC2', Clara Romero, 17 Mar 2022
General Comments
González-Quijano et al.’s manuscript seeks to understand how dissolved organic matter (DOM) pools are structured in an urban ecosystem. The manuscript further describes how well DOM quality relates to conventional water quality monitoring measurements and asks if DOM would be a useful indicator of water quality. The study used three different approaches to characterize the organic matter pool. I think the manuscript would be of interest to a broad audience and provides a useful and complex dataset. The results did well at describing the main multidimensional patterns in the data without focusing to heavily on individual data specifics. I think the multivariate approach used in the paper has merits, but I think adjustments to the approach would be useful. First, the trace organic compounds (TrOC) PCA overcomplicated the manuscript and made it challenging to connection microcontaminant loads with urban pollution. Second, I think the DOM PCA could be focused by reducing the number of variables used and by presenting the parallel factor analysis (PARAFAC) results as percent of Fmax or relative to DOC. I think that both of these changes to the DOM PCA would allow the PCA and RDA to better highlight the data and connections between urban water quality markers. These adjustments will likely also meaningfully influence how DOM optical properties related to mass spectrometry results. Finally, I think the manuscript highlights an important topic, using DOM optical properties as a management tool. The current framing of the manuscript could be altered to better bring out this point. I think the "DOM as a monitoring tool" argument would be strengthened by adding broader statements explaining what makes DOM ideal for management, adding a hypothesis around the RDA between urban pollution drivers, and more fully explaining what basic knowledge is missing. Below I provide more detail around these main points for the author’s consideration as well as other specific suggestions
Thank you for these excellent suggestions, which we will consider as follows:
- We will compute a mean of all the TrOC compounds after standardizing the values to ensure equal weighting. This aggregation is justified by the strongly positive correlations between all TrOCs.
- We will slightly reduce the complexity of the PCA by removing obviously redundant variables. This eases the interpretation of the PCA biplot without a major loss of information.
- We will present a new PCA using PARAFAC components normalized by DOC, replacing the PCA of the raw PARAFAC data.
- We side with the reviewer that an important implication of our study is the suggestion to develop DOM optical properties for monitoring purposes that complement established approaches. In particular, DOM “fingerprints” could yield valuable information about functional impairment. We will further develop this in the Discussion.
Specific Suggestions
Abstract & manuscript framing – I think the study would be better set up if “basic” was explained in more detail. I am curious to know what connections are missing and how or why this information is needed to better understand DOM composition in urban ecosystems. I think it would be useful if the abstract reconnects the expected high DOM diversity to “filling in the basics”
We will reframe the manuscript and also revise the abstract as suggested:
- Explain that basic information refers to the unknown causal relationships between processes (or functions) in urban settings and their imprints on DOM composition and dynamics.
- Clarify that “high DOM diversity” is expected to result from the putatively high diversity of processes in urban settings, owing to the combination of natural and human-facilitated processes in these environments, both within the aquatic systems and in the watershed that supply much of the DOM.
- Link both aspects to the use of DOM composition for water quality monitoring.
Discussion section 4.3 & manuscript framing – this seems like the main objective of the study and is a valuable argument to be made and supported. I think some of this framing is lost in the methods and results. It would be useful for the reader if perhaps more direct statements of hypotheses were made that connect more traditional water quality measures to the potential use of DOM in water quality monitoring.
We will improve the rationale behind our hypotheses, focusing on the value of monitoring and the use of DOM composition to infer processes, rather than exploring correlations with established water quality measures.
Methods, 2.1 study sites – I think it would be useful to provide the size cutoff for streams vs rivers as was done for lakes and ponds.
Rivers and streams were classified according to a width cutoff of 5 m.
Methods - Given the differences in habitat and the location of primary producers in Rivers, Streams, Ponds, and Lakes, I am not certain CHL should be used as a proxy for trophic state. Benthic algae are often abundant in urban streams. I think CHL should be removed from RDA because I don’t think it’s a comparable measure of eutrophication between lotic and lentic systems.
Most rivers in Berlin are relatively deep and slow-flowing. In contrast, although streams are also typically slow-flowing, much of their autotrophic biomass could indeed be benthic. This limits the value of chl concentration in the water column as a proxy of eutrophication. However, in the absence of more suitable indicators and because high chl values do indicate high productivity (while the inverse is not necessarily true), we still prefer retaining the variable - along with ammonium, nitrate and total phosphorus concentrations.
Methods - I did not understand what the hypotheses were in RDA around DOM drivers. It was unclear what possible drivers were measured and then how they were applied to see changes in DOM. Perhaps more explanation is needed for readers like me with less experience using RDA and also to strengthen this analysis' connection to using DOM in urban monitoring.
The goal of the RDA was to identify potential drivers of DOM composition and to test whether a series of selected predictors provide useful information on DOM composition. We will expand the RDA paragraph in Methods (formerly lines 181-188) to better explain the rationale behind using the RDA (besides the PCA and the Procrustes test) and the associated hypotheses. We will also clarify the use of the selected predictors.
Methods & Result - There are an impressive number of variables determined in this study. Many of which correlate or are a proxy for the same type of measure. For example, molecular weight is approximated through three optical indices and measured more directly with liquid size-exclusion chromatography. Given the complexity of the manuscript’s dataset, it might be easier for the reader if only one variable that measures or estimates a DOM property was used. In my experience, especially given the inherent correlation between DOM characteristics, redundancy of multiple variables targeting the same DOM attribute are not needed. For example, I suggest only using S275-295 or SR as the optical indicator of DOM molecular weight. For comparison, its fine to keep all measures in the appendix but, for the main body of the paper and multivariate analysis, I think it would be easier for the reader to understand the results if only S275-295 or SR was used as the optical indictor of size. One final note: Short slope (S275-295) should be positive (see Fichot & Benner 2012 L&O 57(5):1453).
The very purpose of a PCA is to reduce the number of dimensions in a dataset, so that prior selection of variables is not critically needed, even when variables are redundant. However, highly correlated variables can clutter PCA biplots and could thus hamper discerning clear patterns. Thus, we will remove a small number of strongly correlated and obviously redundant variables for the PCA. This changes the PCA biplot only slightly. We will report the correlations between the discarded variables and the variables that were included (or the principal components) in the appendix.
The sign of the short slope will be corrected.
Methods & Results – I am not certain the TrOC PCA is necessary. The purpose is to show micropollutant load. I worry about the below detection limit impact of ponds on the PCA loadings and scores. It seems like an easier and more staightforward metric that would also show micropollutant load is to sum all TrOCs and report a total TrOC concentration. This way, the reader does not need to remember three different PCAs and use a relative measure of load, when the sum of TrOCs would provide an understandable indicator of micropollutant load.
We will standardize all TrOC concentrations to zero mean and unit variance, compute a mean for each site as an aggregate indicator of pollution by TrOCs, and use this indicator in the RDA analysis. This standardization step ensures equal weightings of the variables. The approach is justified by the strong positive correlations between all TrOCs, which we will show by the (old) PCA results in the appendix.
Methods & Results – I think the DOM PCA would structure better around DOM quality if PARAFAC components were set as percentages of Fmax or relative to DOC rather than RU. RU tends to follow concentration rather than clearly line up to quality measures and I think this is why all PARAFAC components point in the same direction in the DOM PCA. For example, DOC was significantly higher in streams. The DOM PCA water body type clusters follow fairly well this DOC concentration gradient, with all the PARAFAC components increasing in intensity toward places that had higher DOC. This would make quantity rather than quality the main driving force behind the PCA loadings. I agree that there is some separation of allochthonous to autochthonous sourced DOM along PC1 but the PARAFAC components did not follow the expected pattern based on quality. I am guessing that making the PARAFAC components relative, will cause them to line up much better across the source gradient. Using PARAFAC components as a percent of total Fmax or standardized to DOC (RU/DOC) might also provide better overlap between optic and FT-ICR-MS properties of DOM and help DOM optical estimates of similar compositional properties better align in the PCA space. The last paragraph of the results describes the FT-ICR-MS results clearly and in summarized way the read can understand. However, the PC comparisons don’t always match with the optical multivariate space. For example, C6 & C8 are negative on PC1 and positive on PC2 suggesting there is more protein in that quadrant, but the comparison with FT-ICR-MS indicates that N containing compounds are positively related to PC1. Similar conflicts arise with humics. I think the reason for this is the PARAFAC components are being driven by quantity rather than quality. By making PARAFAC relative, then the resulting re-analyzed PCA might track more expectedly with FT-ICR-MS patterns.
We will include a new PCA with the DOC-normalized PARAFAC components, which will reduce the influence of DOC concentration and emphasize instead qualitative differences among DOM from different sites. In the new PCA, three of the PARAFAC components are now distinctly different from the others on the biplot, while the rest of the variables ordinate almost like in the old PCA.
Discussion, around line 320 - These are interesting ideas and useful points. I wonder if the lower B:A in streams reflects that the WWTP degrades DOM and the effluent is highly processed, while in lakes and ponds there is more new production of DOM resulting in a higher index score.
Thank you for the idea. We will add the point to the Discussion.
Figure 1 – I like the style of this figure. I think it shows the sites and results well. For the caption, the reader needs more information. At this stage in the manuscript, it is unclear what variables are included in the PCA scores and what the patterns mean. It would be useful for the reader if a plan language interpretation of each PC axis was given so the reader could quickly understand the pattern
We will expand the caption as follows:
Figure 1 (a) Map of 32 sampling sites in the city of Berlin, including 7 lakes (dark green), 7 ponds (light green), 9 streams (light blue), and 9 rivers (dark blue), including two heavily polluted stream sites and two heavily polluted river sites. (b, c) Scores of a Principal Component Analysis (PCA) of DOM characteristics for all sites sampled in four different seasons. The PCA includes the measured DOC concentrations, all absorbance and fluorescence data, absolute component-specific fluorescence intensities from PARAFAC, and the results from size-exclusion chromatography. Different colours indicate differences in DOM composition. Site codes are given in Table S1. Sites marked by asterisks (*) were restricted to three seasons and were hence excluded from the PCA.
Supplemental Information – I am not certain why the SI is needed given the large set of appendices. Why not move table S1 to the appendix and incorporate the DOM analysis information into the main methods. At least for, FT-ICR-MS the SI methods are very similar to the main methods.
We will move this information in the supplement to the main text or the appendix.
Table A4 - Should the FIX for rivers be 1.68 rather than 0.68? 0.68 is well outside the normal range for FIX and could indicate an issue with how EEMs were processed, contamination or scanning error, or a coding error for the calculations
Thank you for spotting this mistake. We checked the value and found that it must indeed read 1.68. We will correct the number.
Citation: https://doi.org/10.5194/bg-2021-340-AC2
-
AC2: 'Reply on RC2', Clara Romero, 17 Mar 2022