Refining marine net primary production estimates: advanced uncertainty quantification through probability prediction models

Niu, Jie; Xie, Mengyu; Lu, Yanqun; Sun, Liwei; Liu, Na; Qiu, Han; Liu, Dongdong; Wu, Chuanhao; Wu, Pan

doi:https://doi.org/10.5194/bg-22-5463-2025

Articles | Volume 22, issue 19

https://doi.org/10.5194/bg-22-5463-2025

© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.

https://doi.org/10.5194/bg-22-5463-2025

© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.

Articles | Volume 22, issue 19

Research article

|

09 Oct 2025

Research article |

| 09 Oct 2025

Refining marine net primary production estimates: advanced uncertainty quantification through probability prediction models

Jie Niu, Mengyu Xie, Yanqun Lu, Liwei Sun, Na Liu, Han Qiu, Dongdong Liu, Chuanhao Wu, and Pan Wu

Download

Final revised paper (published on 09 Oct 2025)
Supplement to the final revised paper
Preprint (discussion started on 02 Dec 2024)

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2024-3221', Anonymous Referee #1, 17 Dec 2024

Please find the comments in the attached PDF.

Citation: https://doi.org/10.5194/egusphere-2024-3221-RC1
- AC4: 'Reply on RC1', Mengyu Xie, 31 Jan 2025
  
  Dear anonymous Reviewer,
  We express our sincere gratitude for the insightful comments and constructive criticisms on our manuscript titled "Refining marine net primary production estimates: Advanced uncertainty quantification through probability prediction models" (MS No.: egusphere-2024-3221). In response to your valuable feedback, we have meticulously revised our manuscript to enhance its clarity, coherence, and overall scientific contribution. The previous response to the review comments did not contain details of the changes made to the article, this new attachment contains the response to the review comments and the specific changes made to the article in response to the review comments, which will allow you to have a clearer picture of the changes. Reviews’ comments are in normal text, whereas our responses are in blue. We hope you will read this latest response and attachment.
  
  With kind regards,
  Mengyu Xie (on behalf of all co-authors)
  
  Citation: https://doi.org/10.5194/egusphere-2024-3221-AC4
RC2:
'Comment on egusphere-2024-3221', Anonymous Referee #2, 18 Dec 2024

The manuscript presents a comparative analysis of Bayesian and neural network-based probability prediction models for estimating Net Primary Production (NPP) at a location near Weizhou Island (though this spatial focus is not clearly stated in the abstract or introduction). While the study demonstrates interesting methodological approaches to uncertainty quantification, it requires major revisions and clarifications.
general comments
The spatial scope and context of the study need to be clearly defined in the abstract and introduction. The location or spatial extent of the study is not mentioned in the title, abstract or introduction, suggesting a global analysis of marine NPP, when in fact the study focuses on a specific (point) location near Weizhou Island off the Chinese coast. Given the large number of inputs required for the Neural Network (NN) and Bayesian technique used in the study, it would not be easy to scale the approach to a larger region.
A critical limitation of the study is the data used for training the NN and the Bayesian model. The models are trained on outputs from existing NPP models (VGPM, CbPM, and CAFE) rather than directly on NPP data. Effectively, the NN and Bayesian model serve as emulators of the NPP models, inheriting their underlying errors and biases. Thus, the uncertainty estimates reported in the manuscript reflect the uncertainty in emulating the output, but not the uncertainty in estimating actual NPP. Furthermore, as shown in Fig. 3, estimates from VGPM, CbPM, and CAFE differ strongly, and it is not clear which output is more accurate. These points need to be explicitly acknowledged in the manuscript, as it means the reported uncertainty estimates do not represent true NPP estimation uncertainty.
The differences between VGPM, CbPM, and CAFE output raise questions about which model provides the best NPP estimates and the most reliable training data. The current version of the manuscript initially does not mention which of the 3 models provided the output used to generate the full time series of NPP estimates near Weizhou Island in Section 3.3. Section 4 finally reveals that CAFE was used to generate the NPP training data, but that choice appears to have been motivated by results showing that the NN and the Bayesian model can emulate CAFE output well and not that CAFE output best represents true NPP.
In the context of the above comments, it would be interesting for the reader to know what inputs VGPM, CbPM, and CAFE used to generate their results. If the NN or the Bayesian model require more or more difficult to measure input data than VGPM, CbPM, or CAFE, why use them at all? Similarly, it would be interesting to investigate which of the inputs to the NN or the Bayesian model are actually required to obtain good performance.
The manuscript's writing style suggests the use of AI-assisted writing, which, while not problematic in itself, has led to the use of emphatic language and filler words (such as "pivotal", "integral", "advanced", "comprehensive", "indispensable", "paramount", etc.). The manuscript would benefit from removing these words in places and rewording passages.
A few passages in the manuscript appear to suggest surprise in discovering periodicity in NPP values: "Upon visualizing the values of the three NPP products (VGPM, CbPM, and CAFE) (Fig. 3), it became evident that each exhibits a distinct periodicity" (l 198). "The analysis of the annual change of NPP shows a clear periodicity, which means that the change of NPP is not random, but follows certain laws and patterns." (l 571). Even at 21 degrees north, one can expect seasonal patterns in marine primary production - this context should be provided in the text.

specific comments
L 117: What are "stochastic optimization" and "advanced chance constraints"? They are only used here and nowhere else in the manuscript. It would be useful to describe relevant new concepts to the reader right away, or not mention them when they are not used or described in the manuscript.
L 149: What does "sea accumulation" mean?
L 149: "Surrounded by the sea on all sides, Weizhou Island ...": I think this is the definition of an island.
L 168: "For the analysis of three NPP algorithms - namely, VGPM, CbPM, and CAFE - we acquired datasets at an eight-day temporal resolution ...": Here it is unclear to the reader if the "acquired datasets" are the input required to run the algorithms or their output. I assume it is the latter, but that should be made more explicit.
L 177/Table 2: Just listing the numbers of missing entries is not very informative. At which frequency were they recorded?
L 198: "Upon visualizing the values of the three NPP products (VGPM, CbPM, and CAFE) (Fig. 3), it became evident that each exhibits a distinct periodicity, with the fluctuation ranges remaining stable yet markedly varied among them." What exactly does this mean? Do the signals not have an underlying annual periodicity?
L 311: Samples are mentioned here for the first time and need a better introduction.
Eq. 3: This looks like a recursive definition of CRPS, I would suggest using different names for the "CRPS" used in Eq. 2 and 3.
Eq. 4: The notation is inconsistent: In Eq. 2 and 3, x denotes the observed value and y the predicted value, but in Eq. 4 and 5, y is used for the actual/observed value and y-hat for the predicted value.
L 501: The test data distribution for CAFE NPP does not look similar to that of the train data distribution, suggesting that the test data may not be well-represented by the train data.
L 503: What is the difference between the values shown in Table 5 and Fig. 4? Why not combine the two?
Fig. 2 and 3: The date label locations 2007/1/1, 2008/3/13, 2009/5/25, ... make it difficult to interpret the plot and detect seasonality.
Fig. 4: The caption mentions "input variables". Are these inputs to VGPM, CAFE, and CbPM?
Fig. 5: Why does the y-axis go past 0.8 in panels a, b, d and e, when the values all stay below 0.4? Also, the units are missing.
Fig. 8 and 9: The NPP units here are incorrect. The data appears to have been normalized, but why? Without normalization, it would be easier to interpret for which NPP ranges the NN and the Bayes model over- or underestimate VGPM NPP.

Citation: https://doi.org/10.5194/egusphere-2024-3221-RC2
- AC2: 'Reply on RC2', Mengyu Xie, 13 Jan 2025
  
  Dear Editor and anonymous Reviewer,
  We express our sincere gratitude for the insightful comments and constructive criticisms on our manuscript titled "Refining marine net primary production estimates: Advanced uncertainty quantification through probability prediction models" (MS No.: egusphere-2024-3221). In response to your valuable feedback, we have meticulously revised our manuscript to enhance its clarity, coherence, and overall scientific contribution. Specific modifications have been made to address each point raised by the reviewers, and these are detailed in the subsequent pages, where we provide a point-by-point response to your comments. Reviews’ comments are in normal text, whereas our responses are in blue.
  This revision process has been a collaborative effort among all co-authors, and we believe that the adjustments made significantly improve the manuscript. We are confident that these changes have addressed your concerns and enriched the manuscript.
  With kind regards,
  Mengyu Xie (on behalf of all co-authors)
  
  Citation: https://doi.org/10.5194/egusphere-2024-3221-AC2
- AC3: 'Reply on RC2', Mengyu Xie, 31 Jan 2025
  
  Dear anonymous Reviewer,
  We express our sincere gratitude for the insightful comments and constructive criticisms on our manuscript titled "Refining marine net primary production estimates: Advanced uncertainty quantification through probability prediction models" (MS No.: egusphere-2024-3221). In response to your valuable feedback, we have meticulously revised our manuscript to enhance its clarity, coherence, and overall scientific contribution. The previous response to the review comments did not contain details of the changes made to the article, this new attachment contains the response to the review comments and the specific changes made to the article in response to the review comments, which will allow you to have a clearer picture of the changes. Reviews’ comments are in normal text, whereas our responses are in blue. We hope you will read this latest response and attachment.
  
  With kind regards,
  Mengyu Xie (on behalf of all co-authors)
  
  Citation: https://doi.org/10.5194/egusphere-2024-3221-AC3
AC1: 'Reply on RC1', Mengyu Xie, 13 Jan 2025

Dear Editor and anonymous Reviewer,
We express our sincere gratitude for the insightful comments and constructive criticisms on our manuscript titled "Refining marine net primary production estimates: Advanced uncertainty quantification through probability prediction models" (MS No.: egusphere-2024-3221). In response to your valuable feedback, we have meticulously revised our manuscript to enhance its clarity, coherence, and overall scientific contribution. Specific modifications have been made to address each point raised by the reviewers, and these are detailed in the subsequent pages, where we provide a point-by-point response to your comments. Reviews’ comments are in normal text, whereas our responses are in blue.
This revision process has been a collaborative effort among all co-authors, and we believe that the adjustments made significantly improve the manuscript. We are confident that these changes have addressed your concerns and enriched the manuscript.
With kind regards,
Mengyu Xie (on behalf of all co-authors)

Citation: https://doi.org/10.5194/egusphere-2024-3221-AC1

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload

ED: Reconsider after major revisions (05 Feb 2025) by Stefano Ciavatta

AR by Mengyu Xie on behalf of the Authors (17 Feb 2025) Author's response Author's tracked changes Manuscript

ED: Referee Nomination & Report Request started (11 Mar 2025) by Stefano Ciavatta

RR by Anonymous Referee #1 (20 Mar 2025)

RR by Anonymous Referee #2 (27 Mar 2025)

Suggestions for revision or reasons for rejection

The revised version of the manuscript is a much better read, and the authors have spent considerable effort in addressing my comments. However, I still have some reservations about the methodology and framing in the revised version.

general comments

The authors have incorporated feedback from my previous comments in the manuscript, and importantly, they acknowledge that the uncertainty estimates do not reflect the full model uncertainty. However, the first such acknowledgment appears late in the manuscript, in line 476 in the results and discussion section. Later, the authors still claim that "Our objective extends beyond merely reproducing satellite NPP products. We aim to improve the overall accuracy and uncertainty quantification of NPP estimates by incorporating a robust probabilistic framework." (l 697). But the uncertainty is not fully qualified, in particular, this approach does not capture structural uncertainty, i.e. model bias or inadequacy. The estimates of CAFE may be heavily biased, but we do not know, and the uncertainty analysis conducted here would not show it. A more careful language is needed.

The authors claim that "The results reveal that both models are competent in quantifying CAFE uncertainty." (l. 726). Beyond the problem mentioned in my comment above, it remains unclear if the two methods actually capture main parts of the CAFE signal. Based on Fig. 7 and 10, the NN and Bayes model can capture the seasonal dynamics of the CAFE output. But is there a trend in the CAFE data, and do the two models capture that trend?

Furthermore, what evidence is there that the NN and Bayes model perform better than climatology? My concern is that one could build a simple climatological NPP model for Weizhou Island with uncertainty that would produce very similar output to the NN or Bayes model. For example, one could use
a + b * sin((c + time)/d) + epsilon
where epsilon ~ Normal(0, sigma) is a random variable. After estimating the model parameters (a, b, c, d, sigma) from CAFE data, it would require only time input and produce NPP estimates with uncertainty. Of course, this a very simple model and every year is the same, there is no trend, and the uncertainty does not vary with time. But then the NN and Bayes model seem to produce nearly identical output for each year as well, and the uncertainty envelope in Fig. 7 and 10 are very similar from year to year. Thus, it is important to show that NN and Bayes model perform better than a simple climatology model.

An aspect that is important but not described well in the manuscript is the required model input compared to that of VGPM, CbPM, and CAFE. In one statement, the authors write: "These inputs overlap substantially with those used in VGPM, CbPM, and CAFE, demonstrating that the NN and Bayesian models do not require additional or more complex inputs." (l. 315). Later the manuscript states: "These probabilistic models do not require additional input variables beyond those used by VGPM, CbPM, and CAFE." (l. 720) Are really all 11 inputs listed in Table 1 used in VGPM, CbPM, and CAFE? Did the authors perform any experiments limiting the inputs to the NN and Bayes model further to examine which inputs are actually required to produce the output?

When the data used for training a NN or model is very limited, a common thing to do is bootstrapping, i.e. dividing the data into different training and testing datasets repeatedly. Did the authors try different testing and training data configurations? It may shed more light on the differences in the CDF curves that are discussed in Section 3.2.2.

Overall, the manuscript reads much better than the initial version. However, the discussion of the results is quite long and feels repetitive at times. I would recommend tightening up Section 3 and removing repetitive statements.

specific comments

L 54: "Conventional methods of NPP measurement, such as ship-based sampling and bottle incubations, are beset with challenges like human errors and inadequacies in capturing spatial and temporal dynamics. This underscores the necessity for more sophisticated and comprehensive methods (Yang et al., 2021; Li et al., 2020)." True, but this study relies very much on monitoring data from a station and thus does not capture spatial dynamics -- it further relies on continuous measurements to capture the temporal dynamics. The authors mention this later: "Due to factors such as equipment malfunctions and adverse weather conditions, some data for the eleven variables were incomplete." (l 198).

L 79: "Currently, the most widely utilized models for estimating NPP include the Vertically Generalized Production Model (VGPM), [...], have been proposed.": This sentence needs to be rephrased.

L 156: "The proportion of excellent water quality in Guangxi's near-shore waters reaches more than 90% all year round": It is not clear what this means. What is this measure of water quality, and is this based on a study or survey that could be cited? Similarly, what does "the quality of the marine ecological environment has remained at the forefront of the country" imply? More specific language and references would be useful here.

L 163: "Weizhou Island, located in the southern subtropical monsoon zone, experiences a pleasant climate with abundant heat and precipitation throughout the year." Phrases like "pleasant climate" or "abundant heat and precipitation" are not specific or quantitative. The next sentence already specifies average (air?) temperatures, so the "pleasant climate" is not necessary here.

Eq. 1: Mention right away what theta and D represent in the equation.

L 367: "In probabilistic forecasting, the focus extends beyond mere point estimates to encompass the shape and dispersion of the probability distribution.": This sentence and the next could go to the beginning of the section to give a better motivation for the use of CRPS.

L 382: "y the predicted value, x the observed value". This works, but is not conventional. Typically, x are the predicted values and y denotes observations.

L 393: The CDF is introduced here, but it has already been used above in the definition of CRPS. I would suggest switching the section order.

L 483: "On using CAFE as a prediction target, both models show more consistent performance.": The term model has now been used to describe VGPM, CbPM, and CAFE, but also the NN and Bayesian model. Please ensure that the reader always knows what models are referenced in the text. Furthermore, this statement about consistent performance for both models seems to contradict a later one: "In addition, for NN model's MAPD index value for CAFE is lower than that for Bayes model" (l 487).

L 490: "Overall evaluation indicates that under both models' assessment criteria, CAFE demonstrates superior accuracy in predicting effects compared to VGPM and CbPM.": This paragraph is not very helpful. What are the two assessment criteria used here? (Fig. 5 uses three metrics, not two.) What does "predicting effects" mean? It is not helpful to the reader that the remaining paragraph discuss VGPM and CbPM results and not CAFE.

L 499: "(1) prior research indicating that CAFE provides relatively accurate estimates of NPP in marine ecosystems with characteristics similar to the Weizhou Island area, due to its advanced parameterization of phytoplankton dynamics". Please cite this prior research or provide some evidence for this statement.

L 520: Is this analysis based on the testing data or the full CAFE-based dataset?

L 523: Are these confidence intervals credible intervals for the Bayesian model?

L 590: "Fig. 8 demonstrates the CDF curves of the predicted mean values after the normalization process and the CDF curves of the CAFE." This sentence and the next are difficult to understand. Are they meant to emphasize the advantages of normalizing the values? Why make this point right after stating that divergence between these two CDFs should be minimal? Please rephrase.

L 671: Is the only difference between the estimates in this section and previous ones the daily resolution?

L 722: "By prioritizing variables such as SST and AP, the models can be optimized to reduce reliance on less influential inputs, improving efficiency without compromising accuracy." Was this actually shown? Did the authors try to run the NN or Bayes model with fewer input variables?

Hide

ED: Reconsider after major revisions (03 Apr 2025) by Stefano Ciavatta

AR by Mengyu Xie on behalf of the Authors (07 May 2025) Author's response Author's tracked changes

EF by Katja Gänger (08 May 2025) Manuscript

ED: Referee Nomination & Report Request started (16 May 2025) by Stefano Ciavatta

RR by Anonymous Referee #2 (10 Jun 2025)

Suggestions for revision or reasons for rejection

In the second revision of the manuscript, the authors have addressed my most recent feedback, and the manuscript now better characterizes the uncertainty quantification and the missing structural uncertainty. At this point, I have only a few minor comments.

general comments

There are still a few phrases and wording that suggest the help of AI in the writing process. As I have stated before, I do not see the use of AI as a writing aid as problematic, but it would be good for the reader to remove overly elaborate phrases such as "efficaciously elucidating" (line 42) or "more pronounced capability to capture the nuances of uncertainty" (line 650) just to point out two examples.

In their response to my last comments, the authors state that "While a sinusoidal climatology model [...] could indeed replicate seasonal dynamics, it does not incorporate external drivers or respond to changes in environmental conditions. In contrast, both the NN and Bayesian models in our study utilize real-time environmental inputs (e.g., temperature, precipitation, radiation), enabling them to adapt to interannual variability and capture ecosystem responses under non-stationary conditions." Indeed, that was precisely the point of my comment: even though the NN and Bayesian models utilize real-time environmental inputs, their output looks very much climatological and appears to repeat from one year to the next. Here, it would have been very revealing to have an event in the data that departed further from the annual mean. But I understand that the time series does not appear to contain such an event to test the models' abilities more thoroughly. No changes to the manuscript are required based on this comment.

specific comments (line numbers are based on the "tracked changes" version)

L 42: "in specific marine regions": Please change to "in a specific marine region", or be more specific and mention Weizhou Island.

L 559: "the CAFE model explains the most variance and has the lowest model bias, and also reproduces the magnitude and seasonality of field-measured NPP better than other satellite remote sensing models (Silsbe et al., 2016).": This statement is a bit misleading, as it may appear to the reader that Silsbe et al. (2016) used data from Weizhou Island in their study and assessment of CAFE.

L 795: "The results reveal that both models aDue to factors such as eqre competent in quantifying CAFE uncertainty.": Something went wrong in this sentence.

Hide

ED: Publish subject to technical corrections (24 Jun 2025) by Stefano Ciavatta

AR by Mengyu Xie on behalf of the Authors (01 Jul 2025) Manuscript

Download

Article (3357 KB)
Full-text XML

Short summary

This study employs two probabilistic methods – the Bayesian model and a deep-learning-based neural network – to estimate net primary production (NPP) and quantify its uncertainties. Results indicate that both models effectively capture NPP dynamics, with the neural network model outperforming the Bayesian approach in predictive accuracy. Furthermore, these models successfully predict interannual trends in NPP variation across the study area.