Articles | Volume 23, issue 3
https://doi.org/10.5194/bg-23-1043-2026
© Author(s) 2026. This work is distributed under the Creative Commons Attribution 4.0 License.
Machine learning for estimating phytoplankton size structure from satellite ocean color imagery in optically complex Pacific Arctic waters
Download
- Final revised paper (published on 04 Feb 2026)
- Supplement to the final revised paper
- Preprint (discussion started on 20 Mar 2025)
Interactive discussion
Status: closed
Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
| : Report abuse
-
RC1: 'Comment on egusphere-2025-799', Anonymous Referee #1, 28 Aug 2025
- AC1: 'Reply on RC1', Hisatomo Waga, 16 Oct 2025
- AC4: 'Reply on RC1', Hisatomo Waga, 16 Oct 2025
-
RC2: 'Comment on egusphere-2025-799', Anonymous Referee #2, 27 Sep 2025
- AC2: 'Reply on RC2', Hisatomo Waga, 16 Oct 2025
- AC3: 'Reply on RC2', Hisatomo Waga, 16 Oct 2025
Peer review completion
AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload
ED: Reconsider after major revisions (25 Oct 2025) by Jamie Shutler
AR by Hisatomo Waga on behalf of the Authors (26 Oct 2025)
Author's response
Author's tracked changes
EF by Mario Ebel (28 Oct 2025)
Manuscript
ED: Referee Nomination & Report Request started (02 Dec 2025) by Jamie Shutler
RR by Anonymous Referee #2 (04 Jan 2026)
ED: Publish subject to minor revisions (review by editor) (13 Jan 2026) by Jamie Shutler
AR by Hisatomo Waga on behalf of the Authors (13 Jan 2026)
Author's response
Author's tracked changes
Manuscript
ED: Publish as is (14 Jan 2026) by Jamie Shutler
AR by Hisatomo Waga on behalf of the Authors (15 Jan 2026)
Manuscript
The authors develop and compare chlorophyll-a size-distribution (CSD) models to retrieve η (as an indicator of phytoplankton size structure) for an optically complex sector of the Pacific Arctic. They use an in-situ dataset (>150 stations, 2007–2021) to show that machine learning models outperform the commonly used PCA approach, although a simple linear regression on normalized Rrs appears to perform best for satellite applications.
The study is sound and offers novel and relevant insights. The main conclusions are also supported by the analyses. However, several aspects require clarification and strengthening before publication.
Major comments:
1. The dataset (N=177) is rather heterogeneous, encompassing different decades, methodologies, and water masses. With fewer than 200 samples and a random 70/30 split, there is a clear risk of bias during validation.
Before 2012, the cruises used different filter pore-size schemes. While the 5 µm vs. 2 µm cutoff for nanophytoplankton may not introduce major differences, the 20 µm vs. 10 µm cutoff applied in 2009 and 2010 could significantly affect the microphytoplankton fraction. These three cruises alone account for ~1/3 of the dataset.
I would also be cautious about merging fluorometer-derived Chl-a with HPLC-derived values in such a complex region. Is this necessary, particularly when the latter include only 10 samples? Typically, unless the two methods have been explicitly compared and shown to agree for this dataset, it may be better to exclude the HPLC samples.
Furthermore, the in-situ stations span from ~50°N to 78°N, covering the Bering, Chukchi, and Beaufort seas. This spatial heterogeneity thus likely introduces substantial variability. It is also unclear which cruises and years correspond to which regions, but it is likely that different regions were sampled in different years.
Therefore, I recommend the following:
2. For a paper focused on estimating phytoplankton size structure from satellite data, I would have expected a comparison with established PSC/PFT algorithms applied to the in-situ dataset, even if brief.
Given that you already have both Rrs and pigment data available, this would be straightforward to implement. For example, models developed by T. Hirata and B. Brewin could be applied and compared against your results. Such a comparison would help to contextualize your findings and highlight the added value of your study.
3. I recommend reducing the number of figures and tables in the main text.
Currently, there are 12 figures and 7 tables in total in the main body of the manuscript. This makes the manuscript, although very interesting, dense for the reader. Some of these could be moved to the supplementary material. For instance, Figure 11. Also, consider moving parts of the Methods to the supplementary material to further streamline the manuscript.
4. The paper presents monthly climatologies of η from MODIS, but it is unclear why no matchup analysis was conducted to verify that the model performs reliably with satellite data
While the climatology figures are interesting, uncertainties are substantial in such a complex region. Ideally, the authors should identify in-situ matchups and compare η estimates estimated from L2 MODIS images against their dataset. If this is not feasible, a useful alternative would be to compare climatologies restricted to the period of one or two cruises to provide at least a partial validation.
Minor comments: