On the challenges of retrieving phytoplankton properties from remote-sensing observations

Prochaska, Jason Xavier; Frouin, Robert J.

doi:https://doi.org/10.5194/bg-22-4705-2025

Articles | Volume 22, issue 18

https://doi.org/10.5194/bg-22-4705-2025

Articles | Volume 22, issue 18

Research article

17 Sep 2025

Research article |

| 17 Sep 2025

On the challenges of retrieving phytoplankton properties from remote-sensing observations

Jason Xavier Prochaska and Robert J. Frouin

Abstract

Remote-sensing satellites provide the only means to observe the entire ocean at high-temporal resolution. Optical sensors assess ocean color through estimates of remote-sensing reflectance (R_rs(λ)). We emphasize a physical degeneracy in the radiative transfer equation that relates R_rs(λ) to the absorption and backscattering coefficients (a(λ), b_b(λ)) known as inherent optical properties (IOPs). This degeneracy stems from R_rs(λ) depending on the ratio $b_{b} (λ) / a (λ)$ , preventing the independent retrieval of non-water IOPs without prior knowledge. We demonstrate that multi-spectral satellite observations lack the statistical power to recover more than three parameters describing non-water absorption and backscattering. Due to exponential-like absorption by colored dissolved organic matter and detritus at shorter wavelengths, multi-spectral R_rs(λ) data cannot detect phytoplankton absorption without strict priors, leading to biased and uncertain estimates. These results challenge decades of IOP retrieval literature, including assessments of phytoplankton growth and biomass. While hyperspectral observations hold promise to recover additional parameters, significant hurdles remain in accurately quantifying IOPs and phytoplankton biomass at a global scale.

Download & links

Article (PDF, 6876 KB)

Download & links

How to cite.

Received: 27 Feb 2025 – Discussion started: 14 Mar 2025 – Revised: 11 Jun 2025 – Accepted: 02 Jul 2025 – Published: 17 Sep 2025

1 Introduction

Phytoplankton play essential roles within our ecosystem, serving as the base of the ocean food web and performing ∼50 % of all photosynthesis on Earth. Therefore, assessing phytoplankton growth and death – especially in a changing climate (Behrenfeld et al., 2016; Flombaum et al., 2020) – is critical to any effort to track and predict the health of our planet. Decades of phytoplankton research has revealed significant regional variations in these process and demonstrated that phytoplankton are highly dynamic on relatively short timescales (hours to weeks, especially in coastal areas due to tides, upwelling, pulses of freshwater inflow, and other episodic events (e.g., Cloern and Jassby, 2010)). Therefore, to identify any long-term trend, one must first develop a detailed picture of the variations on seasonal and shorter timescales.

Unfortunately, our ability to measure phytoplankton in situ is greatly hampered by the vast expanse of the ocean. Measurements with high temporal frequency can only be acquired at select, fixed stations such as OceanSITES (Boss et al., 2022). Therefore, oceanographers have turned to remote-sensing satellite observations to perform high-cadence global analyses of the ocean surface. Beginning with the Coastal Zone Color Scanner experiment (Hovis et al., 1980), multi-band observations in optical channels have enabled the inference of phytoplankton and other seawater constituent properties, such as chlorophyll-a concentration and absorption by colored dissolved organic matter (CDOM) and detritus (IOCCG, 2000; Siegel et al., 2002). These properties are obtained from satellite-derived remote-sensing reflectance:

\begin{matrix} (1) & R_{rs} (λ) \equiv \frac{L_{w} (λ)}{E_{d} (λ)}, \end{matrix}

with L_w the water-leaving radiance and E_d the incident solar irradiance. Their retrieval relies on recovering the absorption coefficient:

\begin{matrix} (2) & a (λ) \equiv \frac{d A (λ)}{d r}, \end{matrix}

with A(λ) the fraction of incident power absorbed and the backscattering coefficient of

\begin{matrix} (3) & b_{b} (λ) \equiv \frac{d B (λ)}{d r}, \end{matrix}

with B(λ) the fractional incident power scattered out of the beam, which govern R_rs(λ) and are known as inherent optical properties (IOPs).

The underlying physics for IOP retrievals is radiative transfer: the absorption and scattering of sunlight by seawater modulates and directs incident sunlight back to the satellite. While the radiative transfer physics is straightforward (but not simple; Mobley, 2022), there are many factors that complicate the calculations. These include but are not limited to the concentration of the constituents (typically the desired unknown), their variation with depth, the precise wavelength dependence of the absorption and scattering coefficients of each constituent, and the geometric factors associated with the Sun's location relative to the satellite. In addition, Earth's atmosphere attenuates the signal and introduces a dominant background radiation field that must first be estimated and subtracted (“corrected”), which fundamentally limits the precision of any space-based R_rs(λ) estimation (e.g., Frouin et al., 2022).

For decades, researchers have attacked this radiative transfer problem to attempt retrievals of scientifically valuable quantities, including an estimate of the phytoplankton biomass. There is robust and well-founded literature describing (and performing) the translation of apparent optical properties (AOPs, e.g., R_rs(λ)) into inherent optical properties (IOPs; a(λ), b_b(λ)) that depend solely on the water constituents and the water itself. Ideally, one first parameterizes and then estimates (“retrieves”) the absorption and backscattering spectra of the non-water component a_nw(λ), b_b,p(λ) and then infers concentrations or proxies for phytoplankton, CDOM, detritus, etc. From these, one may examine the geographic distribution and temporal evolution of fundamental biological processes across the global ocean (e.g., Behrenfeld et al., 2005; Siegel et al., 2014; Fox et al., 2022).

During the development of a diverse set of IOP retrieval algorithms for this purpose (see Werdell et al., 2018, for a review), the ocean optics community has acknowledged key challenges in the problem that are largely independent of those associated with radiative transfer. These include uncertainties related to the atmospheric corrections, non-uniqueness between common constituents (e.g., CDOM and detritus), and retrieving multiple unknowns from limited datasets (e.g., multi-spectral observations). A few sparsely cited works have also highlighted a more fundamental obstacle to the process: a physical “ambiguity” in the inversion of the radiative transfer equation (Sydor et al., 2004; Defoin-Platel and Chami, 2007). Unfortunately, this problem has often been confused or conflated with the statistical limitations of an insufficient number of bands measuring R_rs(λ) (Werdell et al., 2018; Cetinić et al., 2024). As such, while the community has acknowledged challenges to IOP retrievals from remote-sensing observations, rigorous assessment of the algorithms themselves has been limited and usually only performed in the context of comparisons to sparse in situ observations (e.g., Lee, 2006; Seegers et al., 2018).

Another fundamental aspect of the problem is that we do not know the optimal basis functions that describe a(λ) and b_b(λ), nor even the complete set (e.g., Garver et al., 1994). Indeed, it is an aspiration within the ocean color field to recover (or even discover) the composition of phytoplankton communities (e.g., Mouw et al., 2017). The ocean color research community has hoped that the main limitation is the sparsity of existing multi-spectral bands provided by current satellites and that hyperspectral observations will lead to a major breakthrough. Indeed, Cael et al. (2023) have demonstrated its limited information content from a data-driven analysis of R_rs(λ) spectra, i.e., only ∼2 degrees of freedom in multi-spectral satellite observations. But they also concluded that in situ hyperspectral datasets provide only 1 or 2 additional degrees of freedom to describe the seawater composition. In this paper, we examine this question from a new angle – with the standard approach of IOP retrievals – and reach similar conclusions.

Here, we introduce the Bayesian INferences with Gordon coefficients (BING) package for ocean retrievals in a Bayesian context (see Erickson et al., 2020, 2023, for a complementary Bayesian approach). Our approach follows many of the standard assumptions of widely adopted algorithms in the literature, e.g., the generalized IOP (GIOP) model (Werdell et al., 2013) and the Garver–Siegel–Maritorena (GSM) algorithm (Maritorena et al., 2002). In addition, we emphasize and expand upon the “ambiguity” problem – a physical degeneracy in the radiative transfer equation that couples reflectances to IOPs – which fundamentally limits IOP retrievals. In turn, we demonstrate that IOP retrievals from multi-spectral datasets constrain at most three parameters describing a_nw(λ) and b_b,p(λ). Consequently, if the spectral shape of CDOM absorption is allowed to vary and remains unconstrained, it becomes challenging, even impossible, to independently retrieve phytoplankton absorption with high confidence. This limitation applies to previous satellite missions equipped with multi-band sensors, emphasizing the need for additional constraints or improved observational capabilities for more accurate phytoplankton absorption retrievals. We then examine the prospects for IOP retrievals with hyperspectral observations and discuss additional opportunities to address the deep degeneracies that lurk within.

2 Methods

2.1 Bayesian formalism

At the heart of our analysis is an open-source Bayesian inference algorithm developed for the retrieval of IOPs from remote-sensing reflectances, the BING package. The primary motivations for introducing a Bayesian framework are threefold:

i.
it forces one to explicitly describe all of the priors that influence the result;
ii.
it leverages well-developed techniques to assess error and correlations in the results without requiring Gaussianity¹, i.e., the assumption that errors, uncertainties, or distributions of the retrieved parameters follow a Gaussian (normal) distribution; and
iii.
it permits well-established approaches for performing model selection, i.e., estimating the maximum number of free parameters one can use to describe the data.

BING is conceptually similar to the algorithm presented in Erickson et al. (2020) to analyze diatoms and Noctiluca scintillans, although BING adopts a Monte Carlo Markov chain sampler. In addition, the code base incorporates several previous inference algorithms (e.g., GSM, GIOP) and is extensible to any new user-driven parameterization of the IOPs. The BING package is purely Python and is available on GitHub (Prochaska, 2024).

Provided that there is a forward model (described in the following section) and a parameterization of a(λ) and b_b(λ), the Bayesian inference is straightforward, and a wealth of well-trodden approaches and software packages are available. For BING, we adopt the Monte Carlo Markov Chain (MCMC) formalism, which empirically derives the posterior probabilities for the a(λ), b_b(λ) parameterization P(X|Y), including full uncertainties and all of the cross-correlation terms. This requires the definition of a likelihood function P(Y|X), which will have the form

\begin{matrix} (4) & P (Y | X) \propto \exp \{- \frac{1}{2} [Y - H (X)]^{T} C [Y - H (X)]\}, \end{matrix}

where Y represents the measured R_rs(λ) values, C is the full covariance matrix of R_rs(λ) including correlations, and H(X) is the forward model of R_rs(λ) at the locations of Y.

It can be shown that an MCMC analysis converges to the exact solution if run for an infinitely long time; in practice, the calculations tend to converge after ≈10 000 iterations. For the analysis here, we generally run 75 000 trials with at least 2 walkers per parameter (and at least 16 walkers) and only analyze the last 7000 iterations of each. This release of BING uses the EMCEE sampler (Foreman-Mackey et al., 2013), which was developed for astrophysical applications and has seen widespread adoption in the field (over 8000 citations).

We have also implemented standard χ² minimization (Levenberg–Marquardt; L–M) as a fitting option to speed up model development and portions of the analysis. This also enables one to implement standard inference models in the literature (e.g., GSM and GIOP), which generally use L–M optimization.

Note that the Bayesian approach in BING involves using Bayes' theorem to update the probability of a hypothesis based on prior knowledge and new data. When retrieving IOPs from R_rs(λ), this approach explicitly incorporates all available prior information about the IOPs and their uncertainties into the model. By doing so, it allows for a more transparent and rigorous estimation process. The Bayesian framework considers the likelihood of the observed R_rs(λ) given the IOPs and combines it with the prior probability distributions of the IOPs to obtain a posterior distribution. This posterior distribution provides a probabilistic solution to the inverse problem, highlighting the most likely values of the IOPs while quantifying the uncertainties, leading to more reliable and informed retrievals.

In reference to the detection of phytoplankton, we consider a series of models with and without a_ph(λ) absorption. We then examine the evidence for models with a_ph(λ) in the Bayesian context by applying standard approaches for model selection, i.e., assessing the balance between model fit and complexity. Specifically, we have evaluated the Akaike and Bayesian information criteria (AIC, BIC), which are akin to χ²-difference tests (Bentler and Bonett, 1980):

\begin{matrix} (5) & AIC = 2 k - 2 \ln L, \end{matrix}

and

\begin{matrix} (6) & BIC = k \ln n - 2 \ln L, \end{matrix}

with n the number of R_rs(λ) measurements and ℒ the likelihood function calculated assuming Gaussian statistics for uncertainties σ(R_rs(λ)). The likelihood function ℒ quantifies the probability of observing the data given the specific forward model and its parameters. Since it is calculated under the assumption of Gaussian (normal) statistics for the R_rs(λ) uncertainties, this means that the observed data, i.e., the R_rs(λ) measurements, are assumed to be normally distributed around the model predictions with the standard deviation. For our hyperspectral analysis where n≫10, BIC offers a more stringent constraint, but the results with AIC are qualitatively similar. A high AIC or BIC value implies that the model is less likely to be the best model given the data, considering both fit and complexity. Quantitatively, we assess model selection by evaluating the difference in BIC for any two models:

\begin{matrix} (7) & Δ {BIC}_{i, j} = {BIC}_{i} - {BIC}_{j}, \end{matrix}

where $Δ {BIC}_{i, j} < 0$ indicates that model i is preferred and vice versa.

2.2 Radiative transfer with a physical degeneracy

To construct any such algorithm, one must have a well-defined forward model to predict the observables, here remote-sensing reflectances R_rs(λ). For IOP inversion, this means a radiative transfer model – or its approximation – which estimates R_rs(λ) from a(λ) and the backscattering coefficients b_b(λ). The majority of IOP retrieval algorithms developed by the community have used the quasi-single-scattering approximation originally introduced by Hansen (1971) and translated to ocean color by Gordon (1973) (see also Zege et al., 1991). This approach was refined further by Gordon (1986), who approximated the sub-surface remote reflectances r_rs(λ) with a Taylor series expansion:

\begin{matrix} (8) & r_{rs} (λ) = \sum_{i = 1}^{N} G_{i} u (λ)^{i}, \end{matrix}

with

\begin{matrix} (9) & u (λ) \equiv \frac{b_{b} (λ)}{a (λ) + b_{b} (λ)} . \end{matrix}

Most IOP retrieval algorithms have taken N=2 and set the coefficients to constants, G₁=0.0949 and G₂=0.0794, i.e., values independent of wavelength. In this paper and for the default mode of BING, we adopt the same prescription and coefficients and scrutinize the accuracy of this assumption in Appendix A. For the results in the main text, we assume a perfect forward model; i.e., we use Eq. (8) to generate the target r_rs(λ) and perform the fits on these values. In practice, we work with remote-sensing reflectances R_rs(λ) following a standard conversion from r_rs(λ) (Lee et al., 2002):

\begin{matrix} (10) & r_{rs} (λ) = \frac{R_{rs} (λ)}{0.52 + 1.17 R_{rs} (λ)} . \end{matrix}

Despite the approximation of Eq. (8), it does capture a salient aspect of the physics: the functional dependence of R_rs(λ) on u(λ) and thereby on the IOPs a(λ) and b_b(λ). However, this dependence reveals an especially challenging aspect of IOP retrievals: because u(λ) is a function of the ratio of $b_{b} (λ) / a (λ)$ ,

\begin{matrix} (11) & r_{rs} (λ) = Func (\frac{b_{b}}{a}), \end{matrix}

the radiative transfer solutions are physically degenerate in $b_{b} / a$ . Put succinctly, any IOP solution that recovers a set of R_rs(λ) observations can be replaced by an infinite set that preserves the $b_{b} / a$ ratio. Therefore, the retrieval is only tractable if one implements strong constraints (known as priors in Bayesian analysis) on the functional forms of a(λ) and b_b(λ). In Sect. 3.1, we examine the consequences of this physical degeneracy for IOP retrievals.

2.3 A hyperspectral IOP dataset

For the development and testing of BING, we have leveraged a hyperspectral dataset of a(λ),b_b(λ) spectra made public by Loisel et al. (2023) (hereafter L23). The spectra sample waters with both Case I and Case II properties, with chlorophyll-a concentrations varying from Chl a ≈0.01 to 10 mg m⁻³.

The IOP spectra were generated from their database of in situ measurements of phytoplankton a_ph(λ) and models of several additional constituents: CDOM a_g(λ), pure seawater a_w(λ), and detritus a_d(λ). L23 then generated estimates of the backscattering coefficients b_b,p(λ) following standard assumptions based on in situ and laboratory work (see Loisel et al., 2023, for additional details). These 3320 a(λ) and b_b(λ) spectra define our dataset, and while they range from 350 to 750 nm, we restrict analysis to λ=400–700 nm.

Although IOP retrievals are greatly challenged by the physical degeneracy in the radiative transfer described in the previous section, a positive aspect of the problem is the presence of water, which introduces an ever-present and precisely known constraint on the problem (except in the ultraviolet, λ<400 nm; Mason et al. (2016)). The absorption a_w(λ) and backscattering b_b,w(λ) spectra of pure seawater impose priors on the model that serve to partially alleviate the physical degeneracy described in the previous section. First, a_w(λ) and b_b,w(λ) span the entire spectrum and therefore couple the otherwise independent R_rs(λ) values. Second, to the extent that the shapes of a_w and b_b,w are unique relative to other constituents, this helps one avoid the $b_{b} / a$ degeneracy. Third, the strong absorption of water at λ>500 nm and the relatively high magnitude of b_b,w(λ) at λ<450 nm define regions where one may retrieve information on the non-water components (Sydor et al., 2004).

https://bg.copernicus.org/articles/22/4705/2025/bg-22-4705-2025-f01

Figure 1Comparison of the IOP spectra for water ( $a_{w}, b_{b, w}$ ; solid and dashed black curves) versus one example set of non-water spectra ( $a_{nw}, b_{b, p}$ ; solid and dashed orange curves) from L23 (their index = 170, an example representative of the open ocean with low Chl-a concentration). The red region indicates where absorption by water dominates (a_w>5a_nw). In this region, reflectance measurements constrain the non-water component of backscattering. Similarly, the blue region is where water dominates backscattering, and retrievals may constrain a_nw. In turn, for observations with noise, the inversion problem has very limited constraints on the non-water components.

Download

As for the last point, Fig. 1 compares the absorption and backscattering coefficients of water versus one example of non-water spectra a_nw(λ), b_b,p(λ) from the L23 dataset. As emphasized in the figure, at red wavelengths a≈a_w(λ) and $b_{b, w} \approx b_{b, p}$ such that the observations may constrain b_b,p. Similarly at λ<450 nm, $b_{b} \approx b_{b, w}$ and a_nw(λ)>a_w(λ) such that the observations constrain a_nw. These inferences from Fig. 1, however, rely on the strong (but frequently satisfied) prior that a_w(λ)>a_nw(λ) at λ>500 nm and $b_{b, w} (λ) > b_{b, p} (λ)$ at λ<450 nm. If this is relaxed, e.g., if a_nw(λ) and b_b,p(λ) may take on any values, then the $b_{b} / a$ degeneracy forces an infinite set of solutions (i.e., no unique retrieval is possible; see Sect. 3.1).

2.4 Simulating satellite observations

For many of the analyses presented here, we have generated simulated R_rs(λ) spectra for several multi-spectral and hyperspectral missions. For the fits presented in this paper, we ignore the R_rs(λ) spectra provided by L23 (generated with Hydrolight) and instead use Eqs. (8)–(10) to calculate R_rs(λ) from a(λ) and b_b(λ). We then resample these spectra to the bands/channels of several satellite missions.

MODerate resolution Imaging Spectroradiometer (MODIS)/Aqua. We adopt eight multi-spectral bands as listed in Table 1 corresponding to MODIS/Aqua, and we evaluate R_rs(λ) at the center of each. For uncertainties, we have estimated the RMS difference between satellite and in situ R_rs(λ) “match-up” measurements collated with the SeaBASS database (Werdell and Bailey, 2002) after iteratively clipping any 4σ outliers. Figure 2 shows an example of the data and clipping for one band. We further assumed that one-half of the variance is due to the in situ observations themselves. These RMS values are also provided in Table 1, and we find that they are in good agreement with other estimations (Zhang et al., 2022; Kudela et al., 2019).
Sea-viewing Wide Field-of-view Sensor (SeaWiFS)/SeaStar. We followed a similar procedure for SeaWiFS using six bands and the uncertainties provided in Table 2.
Ocean Color Instrument (OCI)/Plankton, Aerosol, Cloud, ocean Ecosystem (PACE) Satellite. For simulated OCI spectra, we assumed δ=5 nm sampling and limited the wavelength range between 400 and 700 nm. The lower bound is due to (i) greater uncertainties in the atmospheric corrections, (ii) greater uncertainty in water absorption and scattering, and (iii) greater uncertainty in how best to parameterize the non-water components in the UV bands. The upper wavelength bound is to avoid systematics that likely dominate the uncertainty at the lowest R_rs(λ) signals. Furthermore, we have limited measurements to the absorption of standard seawater constituents (e.g., phytoplankton) at these longer wavelengths.

For the PACE noise model, we downloaded a single granule (2 175 120 pixels) of Level 2 data, v2.0: PACE_OCI.20240413T175656.L2.OC_AOP.V2_0.NRT.nc. We then took the median uncertainty spectrum (Rrs_unc; Zhang et al., 2022) for all non-flagged data between 33–40° N and 73–78° W. This median uncertainty spectrum is plotted in Fig. 3 at the Level 2 wavelength sampling (≈2.5 nm). We also show the values adopted at our δ=5 nm sampling, and one notes that we did not adjust σ(R_rs(λ)) despite the larger sampling size. This is because OCI is over-sampled at δ=2.5 nm; i.e., neighboring data points are highly correlated. Further work should assess the degree of this correlation to obtain a better noise estimate.

https://bg.copernicus.org/articles/22/4705/2025/bg-22-4705-2025-f02

Figure 2(a) Comparison of the MODIS measurements of R_rs(λ) at λ=443 nm versus in situ observations at the same wavelength. These were taken from the SeaBASS site (Werdell and Bailey, 2002) dedicated to MODIS matchups (NASA Goddard Space Flight Center, 2024). The points follow the over-plotted one-to-one line relatively well, albeit with significant scatter, which we assess to be the RMS noise in the MODIS observations. (b) Distribution of the difference between in situ and satellite $Δ R_{rs} (λ) = R_{rs} (λ)^{insitu} - R_{rs} (λ)^{MODIS}$ . The dashed red lines show the 4σ interval beyond which we clipped the data when calculating the noise estimate (RMS).

Download

https://bg.copernicus.org/articles/22/4705/2025/bg-22-4705-2025-f03

Figure 3The blue curve is the median PACE uncertainty in R_rs(λ) estimated in the Level 2 product v2.0 for one granule (PACE_OCI.20240413T175656.L2.OC_AOP.V2_0.NRT.nc). The black dots are the values of σ(R_rs(λ)) adopted in this paper when simulating PACE spectra with noise. The red stars show the signal-to-noise (S $/$ N) ratio for an example spectrum.

Download

Table 1MODIS data.

Note that the error assumes that $1 / 2$ of the variance is due to the in situ measurements.

Download Print Version | Download XLSX

Table 2SeaWiFS data.

Note that the error assumes that $1 / 2$ of the variance is due to the in situ measurements.

Download Print Version | Download XLSX

Note that the noise is independent across spectral bands, meaning that no spectral correlation is assumed. However, in standard atmospheric correction procedures, noise is expected to be spectrally correlated due to systematic uncertainties in aerosol and Rayleigh scattering treatments, as well as instrumental effects. Ignoring these correlations could introduce additional uncertainties in the retrievals by misrepresenting the spectral structure of the observed signals. Whether treating noise as being uncorrelated maximizes or underestimates the effective retrieval uncertainty depends on the interplay between error propagation and the constraints imposed by the retrieval algorithm. A more rigorous treatment should incorporate spectral noise correlations to provide a more accurate error characterization.

2.5 IOP models

For the principal analysis of this paper, we will consider a series of increasingly complex models for the IOPs a_nw(λ) and b_b,p(λ). We generally follow common practice for models of a_nw(λ) and b_b,p(λ), which have been informed by in situ and laboratory measurements of ocean constituents. In turn, we will examine the maximum complexity that can be statistically constrained by observations designed to mimic satellite retrievals, e.g., data from multi-band and hyperspectral observations.

Consider first the simplest scenario we may conceive of: a two-parameter k=2 model with both a_nw(λ) and b_b,p(λ) taken as constant at all wavelengths.

\begin{matrix} (12) & a_{nw} (λ) = A_{cst} \\ (13) & b_{b, p} (λ) = B_{nw} \end{matrix}

This model may not have physical merit, but it serves as a baseline for comparison with other physically motivated IOP scenarios.

Now consider three additional models of increasing complexity. The k=3 model,

\begin{matrix} (14) & a_{nw} (λ) = A_{dg} \exp [- S_{dg} (λ - 400)] \\ (15) & b_{b, p} (λ) = B_{nw}, \end{matrix}

where λ is expressed in nm and where one assumes the non-water absorption is strictly an exponential function. This spectral shape is commonly used to describe the absorption by CDOM and/or detritus. In situ absorption measurements show typical values of $S_{dg} \approx 0.015 {nm}^{- 1}$ for CDOM (Roesler et al., 1989) and $S_{dg} \approx 0.010 {nm}^{- 1}$ for detritus (Stramski et al., 2001). Our fiducial models only require S_dg>0, but we also consider stricter priors on this parameter.

The k=4 model,

\begin{matrix} (16) & a_{nw} (λ) = A_{dg} \exp [- S_{dg} (λ - 400)] \\ (17) & b_{b, p} (λ) = B_{nw} (λ / 600)^{β_{nw}}, \end{matrix}

adds the commonly adopted power law for backscattering by particulate matter (Gordon and Morel, 1983). For the k= 4 and k= 5 models, we allow β_nw to vary but consider fixed exponents for the other models introduced below (and, strictly speaking, β_nw=0 for models k= 2 and k= 3).

Last in this sequence, the k= 5 model includes a phytoplankton component:

\begin{matrix} (18) & a_{nw} (λ) = A_{dg} \exp [- S_{dg} (λ - 400)] + A_{ph} a_{ph} (λ) \\ (19) & b_{b, p} (λ) = B_{nw} (λ / 600)^{β_{nw}}, \end{matrix}

with a_ph(λ) introduced to capture “typical” absorption by phytoplankton. It is expected and has been observed that this component may exhibit the greatest complexity. Indeed, scientifically, the community aims to distinguish the potentially large variations in phytoplankton families throughout the ocean and inland waters. For this k= 5 model, we adopt the parameterization of Bricaud et al. (1995):

\begin{matrix} (20) & a_{ph} (λ) = C_{ph} (λ) [Chl a]^{E_{ph} (λ)}, \end{matrix}

with Chl a as the Chlorophyll-a concentration in mg m⁻³, and the tabulation of C_ph(λ) and E_ph(λ) are provided by Bricaud et al. (1998). Furthermore, we follow L23 (and the previous literature) and assume

\begin{matrix} (21) & Chl a = a_{ph} (440 nm) / 0.05582 m^{- 1} \end{matrix}

so that phytoplankton absorption is described by one free parameter: a_ph(440 nm) (a.k.a. A_ph).

We also include a k= 2b model that has two parameters, Chl a and B_nw (we assume a constant value), and we set the A_ph using Eq. (21). This model has no CDOM or detritus absorption.

For these models, we impose the following priors (constraints) on the five parameters. For each amplitude (A_dg, A_ph, B_nw), we assume a uniform log prior from 10⁻⁶ to 10⁵ m⁻¹ in magnitude. For the shape parameters, we assume a uniform prior for S_dg in the interval $U = [0.1, 0.2]$ and that β_nw has a uniform prior with values $U = [0, 2]$ . These priors for the shape parameters are motivated by the range of measured in situ values for each (Roesler et al., 1989; Lee et al., 2002).

We have also generated a series of “flexible” models with one free parameter at each of 61 wavelengths, and the model is the linear interpolation between each parameter. We will refer to this as the arbitrary IOP model.

For a portion of the analysis, we consider the GIOP and GSM models, which have been widely adopted within the community, including operational implementations by NASA. These two models are effectively constrained versions of the k= 5 model with different priors on the parameters. In particular, both GSM and GIOP adopt a fixed S_dg – 0.018nm⁻¹ for GIOP and 0.0206 nm⁻¹ for GSM. The models also either adopt a fixed value for β_nw (1.0337 for GSM) or estimate it from the R_rs(λ) measurements (GIOP) with a separate prescription (Lee et al., 2002).

Each model also has a different approach to setting the shape of a_ph(λ) compared to our k= 5 model. The standard GIOP model estimates Chl a from a separate prescription (typically the OC4 algorithm from O'Reilly et al. (1998) and then adopts Eq. (18) for the shape of a_ph(λ). For GSM, we adopt their multi-spectral description of a_ph(λ) and interpolate to hyperspectral resolution as needed.

We consider one final scenario, a many-parameter model (k= free) with one free parameter per wavelength channel for each a_nw(λ) and b_b,p(λ) This model is used to attempt retrievals with any arbitrary shape for the IOPs.

3 Results

3.1 Failed attempts at arbitrary IOP retrievals

The physical degeneracy in the radiative transfer relating R_rs(λ) to IOPs (Sect. 2.2) implies that the greater the freedom that one allows for a_nw(λ) or b_b,p(λ), the more degenerate the solutions. This fundamentally limits our ability to retrieve arbitrary a(λ) or b_b(λ), even in the presence of perfect data (an infinite number of channels and no uncertainty). Therefore, no algorithm can retrieve arbitrary or even highly complex a(λ) and b_b(λ). To make progress, one most also impose strong constraints on a_nw(λ) and b_b,p(λ) to recover unique or most probable solutions. These priors, however, must ensure that the values of a and b_b cannot vary freely at any individual wavelength, where one seeks a retrieval in a way that holds their ratio constant.

https://bg.copernicus.org/articles/22/4705/2025/bg-22-4705-2025-f04

Figure 4A series of R_rs(λ) “fits” to an example R_rs(λ) spectrum (a) for varying a_nw(λ) (b) and b_b,p(λ) (c) using the arbitrary IOP model. We show a series of solutions with scaling factors of 0.9, 1.0, 3.0, 10.0, and 100.0 relative to the true model for a(λ). Owing to the physical degeneracy in the radiative transfer equation, $R_{rs} (λ) = F (a / b_{b})$ , there are an infinite number of such solutions, yielding infinite uncertainty in a_nw(λ) and b_b,p(λ).

Download

To demonstrate this with an example, we performed a series of IOP retrievals of b_b,p(λ) assuming a perfect forward model (Eq. 8), perfect knowledge of the uncertainties, and perfect knowledge of water (a_w, b_b,w). In this case, we adopt the arbitrary IOP model and show the fits to R_rs(λ) for the index = 170 spectra of L23 in Fig. 4, assuming the exact answer, and then a_nw(λ) with a series of assumed scale factors. We then solved for the corresponding b_b,p(λ) spectra that provide the same best fit to R_rs(λ). Indeed, there are an infinite number of a_nw(λ), b_b,p(λ) solutions; one cannot retrieve arbitrary IOPs from R_rs(λ) spectra.

This physical degeneracy limits the information content of retrievals and precludes arbitrary functional forms for a_nw(λ) and b_b,p(λ), e.g., models that strive to retrieve arbitrary a_nw(λ) are ruled out (e.g., Loisel et al., 2018). Instead, one must impose constraints (priors) on the functional forms of a_nw(λ) and b_b,p(λ) (i.e., parameterize them) and, ideally, priors on the parameters themselves.

3.2 Parameterized IOP retrievals with BING

We now perform attempted retrievals of the IOPs a_nw(λ) and b_b,p(λ) using assumed spectral shapes with a set of increasingly complex prescriptions. We begin by fitting the k= 2 model to two examples from the L23 dataset (Fig. 5): one chosen to be representative of their full dataset and the other chosen to have a higher-than-typical phytoplankton absorption a_ph(λ) relative to the combined CDOM and detritus components a_dg(λ) (i.e., a_ph(440 nm)>a_dg(440 nm). Both have relatively low Chl-a concentrations ( $\approx 0.1 mg m^{- 3}$ ). For these, we fit to the R_rs(λ) values calculated directly from Eq. (8) and assume a constant $S / N = 50$ for the R_rs(λ) values for the likelihood calculation. Despite the extreme simplicity of this k= 2 model, the R_rs(λ) fits are not too dissimilar from the true values, especially for the high- $a_{ph} (λ) / a_{dg} (λ)$ example, and we note that this a_ph-dominated spectrum has a low total non-water absorption, i.e., weak a_dg absorption. This follows from our discussion in Fig. 1: water backscattering and absorption dominate the solution at short and long wavelengths, respectively, and the non-water components have limited impact on the R_rs(λ); i.e., we primarily measure seawater from IOP retrievals, especially for ocean waters with low chlorophyll concentrations.

https://bg.copernicus.org/articles/22/4705/2025/bg-22-4705-2025-f05

Figure 5Retrievals of IOPs – (b, e) a_nw(λ) and (c, f) b_b,p(λ) – from fits to (a, d) remote-sensing reflectances R_rs(λ) assuming perfect radiative transfer and without adding noise. The black points are the true values of R_rs(λ), a_nw(λ), and b_b,p(λ) for two examples from the L23 dataset: index = 170, which we chose as representative of the full dataset, and index = 1032, which has a_ph>a_dg at 440 nm. The figure shows solutions for a series of models with increasing complexity and number of free parameters k. These models are (k= 2; red) constant a_nw(λ) and b_b,p(λ), (k= 3; green) constant b_b,p(λ) and a two-parameter exponential for a_nw(λ), (k= 4; blue) exponential a_nw(λ) and a power law for b_b,p(λ), and (k= 5; orange) power law b_b,p(λ) and a_nw(λ) modeled by the exponential and a phytoplankton function (see text for further details). It is evident that all of the k≥3 models produce excellent fits to the R_rs(λ) data at the few percent level.

Download

Figure 5 shows the best solutions derived with BING for the k= 2–5 models. While the log scaling of the R_rs(λ) panels hides differences at the few percent level, it emphasizes that distinguishing between models requires nearly perfect observations. Notably, when k=2, the model does not fit the observed a_nw(λ) and b_b,p(λ) well, suggesting that this level of complexity is insufficient to fully capture the IOP variability. However, despite these discrepancies in IOPs, the R_rs(λ) fit remains relatively good, which indicates that multiple IOP configurations can produce similar reflectance spectra due to the underlying ill-posed nature of the inversion problem. As model complexity increases, the R_rs(λ) fit improves. k= 3 reproduces the reflectance spectrum to within 10 % at all wavelengths, and the k= 4 and k= 5 models achieve even better agreement, at the several percent level or less. The k= 4 model achieves a reduced chi-squared $χ_{ν}^{2} \approx 1$ when assuming 5 % uncertainties (S $/$ N = 20) in R_rs(λ), suggesting that the model complexity is well-matched to the information content in the data. We have also considered the k= 2b model (not shown) but find that because it does not capture the exponential-like absorption of CDOM and detritus, it is generally a poorer model.

This progression implies that increasing k improves retrieval fidelity up to a certain point. If k=5 continues this trend, one might expect a reasonably good retrieval of a_nw(λ) and b_b,p(λ) under ideal conditions. However, Fig. 5 also underscores a fundamental limitation in IOP retrievals: beyond a certain level of complexity, additional parameters may not significantly enhance the retrieval unless supported by sufficiently independent spectral information. This illustrates the inherent degeneracy in the inversion problem, where different IOP parameterizations can yield similar R_rs(λ), making it difficult to extract unique IOP solutions. Thus, while a k= 5 model might provide a better fit, it remains constrained by the amount of independent spectral information available in the data, reinforcing the challenge of retrieving detailed IOP spectra from multi-spectral or even hyperspectral observations.

https://bg.copernicus.org/articles/22/4705/2025/bg-22-4705-2025-f06

Figure 6Evaluations of the difference in BIC values, ΔBIC, for fits to reflectance data derived from the 3320 spectra of L23. The curves describe the cumulative distribution function of the ΔBIC values. (a) Simulated MODIS observations (eight bands) with a series of signal-to-noise (S $/$ N) assumptions (colored curves) and for retrievals adopting actual MODIS noise estimates from NASA validation (Werdell and Bailey, 2002). The > 99 % negative ΔBIC_3,5 values for the realistic noise case indicate that the L23 spectra greatly favor models with only three parameters and without phytoplankton. (b) Similar analysis but for OCI-/PACE-simulated observations with fixed S $/$ N and for a best guess at nominal OCI/PACE performance (Sect. 2.3). We find, even with OCI, that only ≈30 % of the L23 dataset favors the model with phytoplankton.

Download

We can statistically evaluate the constraining power of the data with the BIC formalism introduced in Sect. 2.1. Figure 6 shows the ΔBIC values for simulated MODIS observations of the L23 spectra (see Sect. 2.3 for details). The cumulative distribution function (CDF) on the y axis represents the cumulative probability that the ΔBIC values (Eq. 7) for the 3320 fits to the L23 R_rs(λ) data are less than or equal to a specific value. Each ΔBIC value corresponds to a comparison between two models, specifically the BIC values for a model with and without phytoplankton parameters (k= 3 and k= 5 in the multi-spectral case (Fig. 6a) and k= 4 and k= 5 in the hyper-spectral case (Fig. 6b)). If the CDF value is y_CDF at a specific ΔBIC value, it means that the 100×y_CDF % of the ΔBIC values in the dataset are less than or equal to that specific value. Thus, the CDF curve shows the proportion of the dataset for which the simpler model is preferred as a function of the ΔBIC value. The higher the CDF value at ΔBIC=0, the higher the fraction of the dataset that favors the simpler model over the more complex one.

For the analysis with a realistic MODIS noise model, fewer than 1 % of the spectra prefer the k= 5 model with phytoplankton. This holds true even though our analysis assumed a perfect forward model and perfect knowledge of the measurement uncertainties, without correlated errors. Allowing for these uncertainties would result in zero cases with ΔBIC<0. In fact, we find that one cannot retrieve more than three parameters from MODIS observations and that even the k= 2 model is satisfactory for low-Chl-a waters (Appendix B). Without perfect knowledge of the absorption by CDOM, one cannot retrieve phytoplankton from MODIS observations alone.

The curve corresponding to $S / N = 20$ in Fig. 6a shows that while there is some support for the simpler model, indicated by the CDF values for positive ΔBIC, the more complex model, which includes additional parameters for phytoplankton, is generally preferred. This is because the CDF for negative ΔBIC values is low, indicating that the simpler model is not favored in most of the dataset. In other words, although the simpler model is supported in some cases, the overall trend indicates that the more complex model is usually favored for $S / N = 20$ . Thus, reducing noise in the R_rs(λ) data is essential when increasing model complexity. However, achieving $S / N = 20$ is challenging, even at blue wavelengths in open-ocean Case 1 waters, as demonstrated in numerous validation studies. Moreover, achieving $S / N = 20$ at λ>600 nm, where absorption by seawater alone is very high, may be impossible (Zhang et al., 2022).

We reach even stronger conclusions for simulated SeaWiFS observations that have fewer bands. Unless one identifies an approach to regularly achieve $S / N ≫ 10$ measurements in the presence of all error terms (e.g., atmospheric corrections), phytoplankton cannot be retrieved from multi-spectral observations without strong additional priors. In Appendix B, we examine the GIOP and GSM models, which assume a fixed and steep S_dg shape parameter and examine the negative outcomes of this assumption. These results are central to our broader conclusion that multi-spectral satellite observations, such as from MODIS and SeaWiFS, lack the statistical power to constrain more than three parameters describing non-water absorption and backscattering. They also emphasize that retrieval performance depends on not only the number of parameters but also how they are structured within the model. Model design must account for both physical realism and statistical identifiability, especially when working with data of limited spectral resolution and a limited signal-to-noise ratio.

Now consider an assessment with simulated OCI hyperspectral observations for the PACE satellite (see Sect. 2.3 for details). Our fiducial case uses the L23 spectral sampling, and we limit the observations to $400 nm < λ < 700$ nm, outside of which systematics of the L23 dataset and instrumentation dominate the uncertainties in R_rs(λ), and poor knowledge of the wavelength dependence of the ocean's constituents precludes confident analysis. Figure 6b shows the distribution of the difference in BIC values, ΔBIC, between the k= 4 and 5 models, assuming several choices for the S $/$ N ratio and our estimate for the OCI/PACE noise from v2.0 Level 2 products. We find that OCI/PACE may not recover an absorption signature of phytoplankton from water with properties similar to those represented by less than half of the L23 dataset (primarily those with lower Chl-a concentrations). We are led to conclude that one may retrieve four parameters for IOPs from an OCI-like observation and possibly a fifth. Two of these numbers describe the amplitude and shape of a_nw(λ) parameterized as an exponential, and two numbers describe b_b,p(λ) modeled as a power law. Absent strong priors that account for one of these four, extracting even one number describing a_ph(λ) (at all wavelengths) will be challenging. The following section explores such hyperspectral retrievals in greater depth.

https://bg.copernicus.org/articles/22/4705/2025/bg-22-4705-2025-f07

Figure 7IOP retrievals from a BING analysis of a PACE-simulated spectrum (a) for the k= 5 model with its standard priors. For this low-Chl-a example (index = 175 of the L23 dataset), we recover estimates of a_nw(λ) and b_b,p(λ) and their uncertainties (colored curves and shaded regions that encompass the 68 % confidence intervals) that are in good agreement with the true values (solid points). However, the 99 % uncertainty interval for a_ph(440 nm) includes vanishingly small values, and we would not conclude that phytoplankton is detected at even 3σ significance.

Download

3.3 Retrieving a_ph(λ) with NASA/PACE

The results in Fig. 6b indicate that hyperspectral observations with characteristics representative of data from the NASA/PACE mission should have the statistical power to infer the presence of phytoplankton in the majority of ocean waters. With BING, we may explore further the promise of such a_ph(λ) retrievals, as well as assess potential biases. This analysis complements the hyperspectral assessment of a portion of the NOMAD dataset of in situ observations by Erickson et al. (2023).

For the following analysis, we adopt the k= 5 model and compare results with the GIOP and GSM algorithms, re-emphasizing that the latter was only designed for multi-spectral observations and is only included for illustration. Figure 7 shows the results for the k= 5 model fit to the top spectrum in Fig. 5 but now with random noise included and with the 68 % confidence interval illustrated. The model provides an excellent description of the R_rs(λ) measurements, but the reduced $χ_{ν}^{2}$ being much less than 1 suggests potential overfitting of the data. Furthermore, the retrievals are well matched to the known a_nw(λ) and b_b,p(λ) spectra and are fully encompassed by the uncertainty. This includes the individual a_dg(λ) and a_ph(λ) spectra that comprise a_nw(λ).

https://bg.copernicus.org/articles/22/4705/2025/bg-22-4705-2025-f08

Figure 8A corner plot showing the ≈500 000 samples of the k= 5 model fit shown in Fig. 7. The histogram panels show the marginalized posterior distributions of each parameter, with the dotted blue lines showing the 68 % confidence interval. The contour plots describe the correlations between parameters. Note especially the correlations between A_ph and both A_dg and S_dg: models with shallower S_dg and higher A_dg can describe the data without any phytoplankton absorption. Also note the very poor constraint on β_nw.

Download

Quantitatively, on the positive side, we recover $a_{ph} (440 nm) = 0.0084 \pm 0.0033 m^{- 1}$ , which lies within 1σ of the correct value (0.0073 m⁻¹). On the negative side, the uncertainty implies less than 3σ detection; i.e., over 1 % of the MCMC samples have values 10× lower than the true a_ph(440 nm). This is due to the degeneracy between a_dg(λ) and a_ph(λ), as illustrated in Fig. 8, which shows a “corner” plot for the five-parameter model. The very low a_ph(440 nm) values ( $< 10^{- 3} m^{- 1}$ ) are correlated with lower (shallower) S_dg and higher A_dg, i.e., a degeneracy between CDOM/detritus and phytoplankton absorption. We also see from Fig. 8 that the R_rs(λ) offers effectively no constraint on β_nw; its values are almost entirely defined by the prior we have imposed.

https://bg.copernicus.org/articles/22/4705/2025/bg-22-4705-2025-f09

Figure 9Similar to Fig. 7 but for an example spectrum with high Chl-a concentration and strong detrital absorption. As with the low-Chl-a case, this fit yields $χ_{ν}^{2} < 1$ despite the overestimates of a_nw(λ) and b_b,p(λ) at λ<450 nm. This is because of the degeneracy in the radiative transfer ( $b_{b} / a$ ) and the poorer S $/$ N ratio in R_rs(λ) at these wavelengths.

Download

Now consider an example with high Chl-a concentration. Figure 9 presents the BING fit for the k= 5 model to an example representative of eutrophic (Case II) waters (index = 2773 of the L23 dataset). The resultant R_rs(λ) spectrum shows the effects of strong detrital and phytoplankton absorption at blue wavelengths, peaking at λ≈550 nm. As with the low-Chl-a example, the a_ph(λ) and a_dg(λ) retrievals are generally in agreement with their true values, aside from an excess of a_dg(λ) absorption at the shortest wavelengths. This excess is due to the combination of a lower S $/$ N ratio in the R_rs(λ) data at λ<450 nm and errors in the adopted basis functions. In particular, our model does not capture the variations in b_b,p(λ) at λ<500 nm due to phytoplankton, and these are compensated (in part) by the higher a_dg(λ): yet another manifestation of the $b_{b} / a$ degeneracy in IOP retrievals. Despite these issues, the resultant $χ_{ν}^{2}$ is approximately 1 (i.e., the model fits the data well), and a more sophisticated model introduced to capture the variations in b_b,p(λ) may result in over-fitting.

https://bg.copernicus.org/articles/22/4705/2025/bg-22-4705-2025-f10

Figure 10A series of fits and IOP retrievals for the high-Chl-a spectrum also shown in Fig. 9. The solid, dashed, and dotted curves are the results for the k= 5, GIOP, and GSM models, respectively, each with an uncertainty similar to that for the k= 5 model in Fig. 9. We find that each model provides a statistically acceptable fit to the R_rs(λ) data ( $χ_{ν}^{2} \approx 1$ ) despite the large differences in their IOP retrievals.

Download

We may explore the impacts of imposing priors on model parameters by comparing the fits from GIOP and GSM to this high-Chl-a example. Figure 10 shows the fits to R_rs(λ) and the IOP retrievals for these three models. All three yield a reduced $χ_{ν}^{2} \approx 1$ and would be considered acceptable models (GSM is marginal). These results further demonstrate the physical degeneracy within the radiative transfer; here, the GIOP model systematically underestimates both a_ph(λ) and b_b,p(λ), but these compensate to yield very similar R_rs(λ) to that of the k= 5 model. We also emphasize that the GIOP model is driven to this solution by the strong prior on S_dg, the error in the estimate of Chl a from the OC4 algorithm, and the shallower slope for β_nw estimated from the Lee et al. (2002) prescription. As is obvious from Fig. 10, the best-fit a_ph(440 nm) values vary by a factor of over 400 %, much larger than the estimated uncertainties for each. These large variations occur even though the models have statistically acceptable fits. The low $χ_{ν}^{2}$ values of the k= 5 and GIOP models further emphasize that the data have limited statistical power to retrieve any additional constraints on phytoplankton beyond what is described in the Bricaud et al. (1998) prescription. While different retrieval models yield substantially different estimates of a_ph(440 nm), the spectral information in R_rs(λ) alone is insufficient to independently resolve phytoplankton absorption without relying on empirical parameterizations. This points to the need for additional observational constraints, such as hyperspectral measurements, to break the degeneracy and improve the accuracy of phytoplankton absorption retrievals.

https://bg.copernicus.org/articles/22/4705/2025/bg-22-4705-2025-f11

Figure 11Panels (a) and (c) show the retrievals of a_ph(440 nm) for simulated PACE spectra versus the true values for the k= 5 (a, b) and GIOP models (c, d) using the BING package. When the error in a_ph(440 nm) exceeds 3 times the best value, we plot the measurement in light blue. The dashed curve is the 1:1 line. The right panels show the retrievals for the particulate backscattering at 440 nm. The bias, median absolute error (MAE), and root mean square (RMS) are indicated in each panel.

Download

We have performed BING fits with the k= 5 and GIOP models to simulated PACE spectra for the full L23 dataset to estimate a_ph(440 nm) and b_b,p(440). As above, we limit the range to 400–700 nm, adopt the PACE uncertainties (Fig. 3), and have injected random noise into each spectrum. Figure 11 compares the a_ph(440 nm) and b_b,p(440) retrievals with BING versus the true values. For a_ph(440 nm), on the positive side, the retrievals track the known values over nearly 3 orders of magnitude. At low a_ph(440 nm), however, the k= 5 model systematically underpredicts a_ph(440 nm) for many of the retrievals, leading to a negative bias, and the majority of these are formally upper limits (plotted in a lighter shade of blue). Furthermore, the scatter is higher than the PACE Level 2 requirements (35 %; Cetinić et al., 2018). If we limit to cases with true $a_{ph} (440 nm) > 0.01 m^{- 1}$ , the bias is reduced (15 %), but the MAE remains large (40 %). One of the primary reasons the k= 5 model underestimates low a_ph(440 nm) values is the spectral degeneracy between a_ph(λ) and a_dg(λ). When a_ph(440 nm) is weak, the total absorption is largely dominated by CDOM, making it difficult to separate the phytoplankton contribution from the background absorption (Houskeeper and Hooker, 2025). The k= 5 model allows for flexibility in the spectral shape of a_dg(λ), which can lead to overestimation of CDOM/detritus absorption and a corresponding underestimation of phytoplankton absorption.

The GIOP model, in contrast, constrains CDOM absorption using a fixed spectral slope (S_dg) that is typically prescribed from empirical studies such as Lee et al. (2002). This constraint prevents the retrieval from assigning excess absorption to a_dg(λ) at short wavelengths, reducing the risk of underestimating a_ph(440 nm). While this approach limits retrieval flexibility, it also helps stabilize the separation between phytoplankton and CDOM absorption, ensuring that even at low a_ph(λ) values, the retrieval does not shift excessive absorption to CDOM. The right panels of Fig. 11 show the retrievals for b_b,p(440). The k= 5 model provides better overall agreement with the true values, although some scatter persists, particularly at low b_b,p(440) values. The bias and MAE are smaller compared to the GIOP results, suggesting that the additional flexibility in the k= 5 model helps capture variations in backscattering more accurately. In contrast, the GIOP retrievals exhibit a consistent bias in b_b,p(440). This can be attributed to the model's prescribed functional form for backscattering, which lacks flexibility when representing natural variations in b_b,p(440) across different water types. The constrained power law exponent used in GIOP may not accurately reflect regional or case-specific spectral slopes, leading to systematic errors. As a result, while GIOP provides a reasonable fit to R_rs(λ), its retrieved b_b,p(440) values tend to be biased relative to the true values, particularly in optically complex waters.

To improve the Bayesian approach and reduce the underestimation of a_ph(440 nm) at low values, one may consider refining prior constraints on CDOM absorption, incorporating additional spectral information, and enhancing regularization techniques. Strengthening Bayesian priors on the CDOM spectral slope (S_dg) based on climatologies or independent datasets can help prevent over-attribution of absorption to CDOM. Incorporating near-UV bands (350–400 nm), where CDOM absorption dominates, provides an additional constraint to improve separation from phytoplankton absorption. Enhancing Bayesian regularization with priors that favor realistic a_ph(λ) spectral shapes and implementing adaptive noise weighting can help mitigate retrieval biases in low-absorption regimes. Finally, performing ensemble retrievals, where multiple runs with varied priors are averaged, can further stabilize the retrieval versus noise. These refinements will improve retrieval accuracy and reduce systematic underestimation of a_ph(440 nm) in low-chlorophyll waters.

4 Discussion and future prospects

In this paper, we have introduced BING, a Bayesian inference algorithm for IOP retrievals utilizing the Gordon coefficients for radiative transfer. We have reemphasized a known but underappreciated physical degeneracy in the radiative transfer – r_rs(λ) as a function of $b_{b} / a$ – which strictly limits one's ability to retrieve a(λ) and b_b(λ) without strong priors. Two of the priors are natural: water both absorbs and scatters light with precisely known coefficients, at least for wavelengths λ>400 nm. We demonstrated, however, that even these constraints are insufficient; indeed, water frequently dominates the model, limiting the extraction of additional information. Consequently, we found that multi-spectral observations with published uncertainties (Zhang et al., 2022) cannot reliably retrieve phytoplankton absorption and that even hyperspectral observations (e.g., OCI/PACE) will be challenged (Fig. 6).

Previously, Cael et al. (2023) reached a similar inference as one of our primary conclusions – the limited information content of remote-sensing observations. Specifically, they analyzed the degrees of freedom (DoFs) of R_rs(λ) data through a standard principal component analysis, finding that in situ R_rs(λ) data with MODIS sampling has only 3 DoFs and inferred only DoF=2 for remotely sensed R_rs(λ). Our analysis, which includes several constraints such as water absorption and scattering, yields at least one additional parameter, but the overarching implication is similar: retrievals from R_rs(λ) observations have limited information content.

On statistical grounds, retrieving a_ph(λ) from R_rs(λ) is fundamentally challenging because CDOM and detrital absorption, which are always present, exhibit strong spectral overlap with phytoplankton absorption in the blue region. In fact, this component (a_dg) tends to exceed a_ph, even in the open ocean (Siegel et al., 2013; Hooker et al., 2020; Houskeeper and Hooker, 2025). When a_dg(λ) is parameterized as an exponential function, small variations in its spectral slope can lead to compensatory shifts in a_ph(λ), making it difficult to separate their contributions. Since R_rs(λ) depends on the combined effects of absorption and backscattering rather than direct measurement of individual IOPs, this spectral degeneracy prevents a unique retrieval of a_ph(λ) without additional constraints or priors. Previous work that published estimates of a_ph(λ) required very strict priors on the shape of S_dg (Appendix B), leading to significant bias in estimates of a_ph(λ). In the cases where S_dg was allowed to vary (Boss & Roesler, Chapter 5 Lee, 2006), the errors in a_ph(λ) were severe and limited retrievals to only upper limits, consistent with this work.

Do our results therefore invalidate the past several decades of research and data products using satellite-based ocean color observations? At the very least, all previous retrievals from R_rs(λ) must be further scrutinized. We assert that uncertainties and biases were frequently (possibly always) underestimated, and substantial correlations between retrieved parameters will be present. Unfortunately, even relative analyses of a_ph(λ) may be subject to large error. There are, however, key products that are primarily (and very nearly exclusively) empirical, i.e., generated without any radiative transfer modeling (Stramski et al., 2022). These empirically based algorithms may circumvent the radiative transfer issues raised here, but, as emphasized by Cael et al. (2023), one cannot retrieve an arbitrary number of such quantities from visible-domain R_rs(λ) observations. Therefore, the suite of products generated by the community to date are highly coupled and correlated. The limited information content of R_rs(λ) measurements subject to realistic uncertainties is inherent to the problem.

While the results from Fig. 6b indicate that hyperspectral observations offer a substantial improvement over multi-spectra data, even detecting phytoplankton remains challenging. We reemphasize that the results presented here have assumed a perfect forward model (i.e., no error in the radiative transfer calculation), uncorrelated uncertainties, a perfect model for water absorption and backscattering, and homogeneous seawater (no vertical or horizontal spatial variations). Furthermore, even if we surmount these issues, we may only be able to extract a single parameter describing phytoplankton, e.g., the a_ph amplitude at λ≈440 nm. And, strictly speaking, this may be attributed to any absorption component (not solely a_ph) that does not follow the exponential description of CDOM and detritus absorption.

Note that our findings refer to the number of statistically independent parameters that can be retrieved from ocean color reflectance data under realistic noise conditions. This does not imply that the true biogeochemical variables in nature are independent. On the contrary, many of the optical components, such as phytoplankton absorption and backscattering, are physically coupled. For example, phytoplankton both absorb and scatter light, and their associated IOPs are often correlated through cell size, composition, and pigment content. Such natural correlations are reflected in datasets like those shown in Fig. B3.

However, in an inverse problem, retrieving both components independently from reflectance requires that the reflectance spectrum contain sufficient information to statistically distinguish them. The BIC and MCMC approaches quantify how many distinct degrees of freedom are supported by the measurements and not how many physically or biologically separate variables exist, which in turn determines the number of parameters that can be reliably estimated in models of a(λ) and b_b(λ). In situations where optical properties are highly correlated, as is often the case for a_ph(λ) and b_b,p(λ), the effective dimensionality of the solution space is reduced. This reinforces our conclusion that even when additional parameters are included in the forward model, they are not necessarily identifiable unless the reflectance data provide enough information to constrain them independently. Incorporating prior knowledge about natural covariation between variables is one promising strategy to improve retrievals. Future work should explore how physically informed priors can be integrated into the inversion framework without artificially inflating confidence in individual parameter estimates.

5 Conclusions

The ocean color remote-sensing community faces a fundamental challenge in retrieving IOPs from remote-sensing reflectance due to a physical degeneracy in the radiative transfer equation. Our analysis demonstrates that this degeneracy severely limits the number of parameters that can be reliably extracted from R_rs(λ) observations. For multi-spectral satellite data with realistic noise levels (e.g., MODIS, SeaWiFS), we find that only three parameters can be reliably constrained, which is insufficient to independently retrieve phytoplankton absorption without strong, potentially biasing priors on the spectral shape of CDOM/detritus absorption. Even with hyperspectral observations like those from OCI/PACE, retrievals remain limited to four or five parameters at most, and the detection of phytoplankton absorption is still challenging in many oceanic conditions. In essence, this sets a limit on the information content available in ocean color observations (Cael et al., 2023).

These findings suggest that previous IOP retrieval algorithms likely underestimated uncertainties and may have introduced systematic biases into their estimates of phytoplankton absorption. The widespread practice of fixing the spectral slope of CDOM/detritus absorption (S_dg) to a steep value in models like GSM and GIOP allows for phytoplankton detection but at the cost of potentially significant biases in a_ph(λ) estimates. Our Bayesian approach explicitly incorporates priors and their uncertainties, providing a more transparent and rigorous assessment of the retrieval problem. While hyperspectral observations offer improvement over multi-spectral data, they still cannot fully overcome the fundamental limitations imposed by the physical degeneracy in the radiative transfer equation.

How might we proceed? It is abundantly clear that we must identify the optimal way to parameterize the problem to make the most effective use of the four or five parameters that describe a_nw(λ) and b_b,p(λ). For example, if we know β_nw (i.e., fix its value as a prior), we would not “waste” a free parameter to estimate its value. In short, we must harness our knowledge of the ocean from previous in situ measurements (or current, if one can afford them) to set priors on the model. These priors should be geographically and temporally variable to reflect different oceanic conditions. Because strict and biased priors have been shown to lead to inaccurate and uncertain retrievals, one must proceed cautiously. Empirically derived priors can help mitigate these issues by providing a more accurate representation of the ocean's optical properties.

One obvious community-wide effort would be to develop and agree upon strong priors for S_dg. This is current practice in many existing algorithms (e.g., GIOP or GSM, which set S_dg to a single value), but we describe the negative consequences of this extreme approach in Appendix B. Instead, we encourage the community to generate S_dg priors as probability distribution functions that vary with geographic location and time and then revisit these in our changing climate. Additionally, we must include more observations from both space and in situ measurements. From space, we must leverage the Chl-a fluorescence signal at ≈685 nm (Wolanin et al., 2015), whose production and radiative transfer are distinct from that of IOP retrievals. From the ocean, in situ observations provide invaluable validation data and may establish priors like those for S_dg. Non-visible in situ optical observations have been shown to improve retrievals of CDOM absorption and could be retained to better partition signals related to CDOM and phytoplankton biomass. Constraints on β_nw, as a function of location and season, and physical priors on the coupling of a(λ) to b_b(λ) for individual components from the physics of absorption and scattering could be impactful.

Developing community-wide Bayesian retrieval algorithms is also recommended. The ocean color remote-sensing community should be encouraged to adopt a Bayesian framework for IOP retrievals such as BING, explicitly including all priors and their uncertainties. A Bayesian approach allows for a more transparent and rigorous incorporation of prior knowledge and uncertainties, leading to more reliable retrievals. Unlike BING, however, Bayesian algorithms must adopt an accurate forward model and its uncertainties, include correlated and systematic error in the observations, and harness new data sources. We have initiated such a project – Intensities to Hydrolight Optical Properties (IHOP) – and encourage community adoption and development. The scientists focused on atmospheric corrections have already embarked on this journey, following the original insight of Frouin and Pelletier (2015).

The development of machine learning techniques for scientific exploitation is another tempting and potentially viable path to address some of the challenges presented in this paper. Optimistically, one can hope to train models that learn correlations in the R_rs(λ) spectra to predict quantities like Chl a or even signatures of phytoplankton communities (e.g., Woo Kim et al., 2022; Kramer et al., 2022). While we appreciate the potential power of such data-driven approaches, one must be mindful of the physical degeneracy of Eq. (8), which leads to very similar or even identical R_rs(λ) spectra from distinct IOPs (e.g., Fig. 10). A machine learning model cannot learn how to distinguish between these, especially in the presence of noise, nor will most industry-developed algorithms properly assess the uncertainties in such degeneracies. In practice, they would behave only according to the datasets they were trained upon; this is an implicit prior (Gray et al., 2024). Lastly, such models will not, on their own, discover new signatures of absorption in the ocean.

Increasing the spectral resolution of satellite observations can provide more detailed information about the absorption and backscattering properties of phytoplankton, thereby reducing the impact of degeneracies. Thus, the development and deployment of hyperspectral satellites with high spectral resolution across the visible and near-infrared spectrum are recommended. The recently launched PACE satellite will lead the way. Additionally, exploring alternative remote-sensing techniques, such as lidar and fluorescence-based methods, and incorporating polarization information to complement traditional ocean color observations should be considered. It is possible that inelastic processes, such as Raman scattering and chlorophyll fluorescence, will at least partially mitigate the degeneracies emphasized here by introducing spectrally distinct signals that, when accurately modeled, will provide independent constraints on phytoplankton absorption and scattering, thereby improving retrieval accuracy. These advanced techniques may provide independent measurements that help resolve ambiguities and improve the overall accuracy of phytoplankton estimates, although information gained using these new techniques should be clearly demonstrated and defined first in the field.

Several of the strategies we proposed can be practically implemented with modest adaptation of existing workflows. For instance, incorporating near-UV bands (e.g., 350–400 nm) into standard atmospheric correction and retrieval chains could significantly improve the ability to constrain CDOM absorption. Many modern sensors (such as PACE/OCI) already acquire data in this spectral region, although it is typically excluded from ocean color retrievals due to atmospheric correction uncertainties. Operational implementation would require further validation of atmospheric correction performance in the UV bands, but the gains in constraining a_dg(λ) could justify the investments in refining these procedures. UV bands could also be incorporated into empirical and semi-analytical models by extending existing parameterizations (e.g., for S_dg) and evaluating retrieval sensitivity in this region.

Refining priors on CDOM spectral slope (S_dg) can also be made operational by leveraging existing climatology and in situ datasets to generate geographically and seasonally resolved prior distributions. These priors could be implemented in a Bayesian retrieval framework as either lookup tables or probability distributions that are dynamically assigned by location and time. NASA and other agencies already maintain in situ repositories (e.g., SeaBASS), and these could support the development of such prior climatology. Incorporating them would not necessarily require a full Bayesian inversion at the operational level but could instead inform constrained retrievals with flexible priors, much like how empirical coefficients are regionally tuned in current algorithms.

Adaptive regularization techniques, which vary the strength of constraints based on the apparent information content of the input R_rs(λ) spectrum, can be integrated into operational pipelines through data-driven diagnostic metrics. For example, spectra with low S $/$ N ratios or low total absorption could trigger stronger regularization (e.g., narrower priors or more constrained parameterizations), while higher-quality spectra would allow for more flexible models. This adaptive scheme could be encoded through decision trees or lookup-based thresholding embedded in the processing architecture. Given the modular nature of existing algorithm frameworks like GIOP, such adaptive behavior could be implemented with minimal disruption.

Promoting interdisciplinary collaboration is also essential. Fostering collaboration between oceanographers, remote-sensing experts, and radiative transfer modelers to address the complex challenges of IOP retrievals can bring together diverse expertise and perspectives, leading to more innovative and effective solutions. These should include individuals with mastery of statistics, who can rigorously assess uncertainty and help develop robust and transparent algorithms.

By implementing these recommendations, the remote-sensing community can significantly enhance the accuracy and reliability of phytoplankton IOP retrievals, leading to better-informed biogeochemical models and ecological assessments. This comprehensive approach will help ensure that remote-sensing data accurately reflect the true state of the ocean's biological and chemical processes, thereby supporting more effective environmental monitoring and management efforts.

Appendix A: Scrutinizing the Taylor series expansion approximation to the radiative transfer

To examine the use of Eq. (8) as an approximation for the radiative transfer, consider Fig. A1, which plots the Hydrolight-derived R_rs(λ) of L23 converted to r_rs(λ) with Eq. (8) versus the evaluated u(λ) values using Eq. (9) at four distinct wavelengths. These evaluations follow an approximately quadratic pattern with zero y intercept. Overplotted with a dashed line is the Gordon approximation using the standard G₁, G₂ coefficients and Eq. (8). Qualitatively, the Hydrolight outputs follow the relation yet lie systematically above the curve. At its extreme, the Taylor series approximation is offset by ≈10 % at λ=370 nm and u(λ)=0.35.

To further illustrate the difference, we have fitted the G₁, G₂ coefficients to the data at select wavelengths and recover similar G₁ values but G₂ values that vary significantly with wavelength (G₂≈0.07 at λ=370 nm, $G_{2} \approx - 1.2$ at λ=600 nm). We also find that there is significant scatter around each of the fits, with a relative RMS of ≈5 % at shorter wavelengths and of 20 % at the reddest wavelengths. We expect that this scatter is inherent to Eq. (8) and would be unavoidable if one uses this approximation, even with wavelength-dependent coefficients. An accurate retrieval algorithm would need to account for these variations or otherwise suffer from this systematic error. This is the focus of a separate algorithm that we are developing, and we also refer the readers to recent advances in approximations of the radiative transfer equation (Twardowski et al., 2018).

https://bg.copernicus.org/articles/22/4705/2025/bg-22-4705-2025-f12

Figure A1Sub-surface reflectances r_rs(λ) generated with the Hydrolight radiative transfer code by L23 (converted from R_rs(λ) using Eq. 10) versus u(λ) as defined by Eq. (9). The dashed black line shows the second-order Taylor expansion of r_rs(λ) in u(λ) (Eq. 8) with the Gordon coefficients that are most widely adopted by the community. In general, this curve underpredicts the r_rs(λ) calculated with Hydrolight, with a maximum offset of ≈10 % at u(λ)=0.35 and λ=370 nm. We also show a series of individual fits of Eq. (8) to the data, with the legend indicating the derived G₁,G₂ coefficients. Note that G₁ is largely independent of wavelength but that G₂ is strongly wavelength dependent (and anti-correlated).

Download

Appendix B: Revisiting previous models

The simple IOP models examined in Sect. 3.2 resemble prescriptions adopted previously in the literature and/or implemented operationally by NASA. In light of our results, we are motivated to further examine two such models: GSM and GIOP. As described in Sect. 2.5, both GSM and GIOP adopt strict (fixed) priors on S_dg and β_nw such that these are effectively three-parameter models.

We find that by adopting these priors, these three-parameter GSM and GIOP models are statistically favored (ΔBIC>0) relative to the k= 3 model without phytoplankton for ≈50 % (GSM on SeaWiFS) or more (GIOP on MODIS) of the simulated spectra (Figs. B1 and B2). At the same time, if we relax the prior on either S_dg or B_nw, over 99 % of the spectra have ΔBIC<0 and favor the k= 3 model.

https://bg.copernicus.org/articles/22/4705/2025/bg-22-4705-2025-f13

Figure B1These panels describe the BIC analysis assuming MODIS-like observations for two models designed to match standard GIOP configurations. Panel (a) compares our k= 3 model versus the standard k= 3 parameter configuration of GIOP (see text for full details). We find that this GIOP model is preferred, which we speculate is due to the extra freedom to fit the R_rs(λ) at blue wavelengths where the S $/$ N ratio in MODIS is highest. (b) Results for the GIOP+ model (k= 4 parameters), which lets β_nw be an additional free parameter. This model is not favored for the entire dataset, further evidence that one cannot recover four parameters from MODIS observations.

Download

https://bg.copernicus.org/articles/22/4705/2025/bg-22-4705-2025-f14

Figure B2Similar to Fig. B1 but for GSM models and using simulated SeaWiFS spectra. As with the GIOP models, we find that the GSM model is favored over our k= 3 model but that a k= 4 parameter version – GSM+, which lets β_nw be free – is highly disfavored.

Download

Furthermore, the fixed S_dg value adopted in each of these models is relatively steep, which imposes a significant bias on the a_ph(λ) retrievals. Let us scrutinize these priors, as they affect the potential to retrieve phytoplankton and any other constituents. Figure B3 shows the S_dg values derived with the k= 4 model (no phytoplankton) versus the fraction of non-water absorption associated with phytoplankton at 440 nm in the L23 spectra, $a_{ph} / a_{nw}$ . The two quantities are anti-correlated because the increased presence of phytoplankton relative to CDOM and detritus tends to give a shallower non-water absorption spectrum. We find, as anticipated, that the majority of retrieved S_dg values lie within the loci of shape parameters assumed by L23 for CDOM and detritus based on Lee (2006). There is, however, a non-negligible set of retrieved S_dg values that are lower than the lowest value assumed by L23; these are partially due to strong phytoplankton absorption. Overplotted on the figure are the fixed values of S_dg for the GSM and GIOP models, where we see that their priors lie at the upper end of S_dg values measured in the ocean (GIOP) or even beyond (GSM). This was intentional for GSM (Maritorena and Siegel, 2005), as its designers derived fixed values for S_dg and β_nw by achieving the best retrievals compared to in situ data. When one adopts a relatively steep S_dg value, the absorption at λ>450 nm cannot be correctly described by CDOM/detritus, and the model will favor a higher phytoplankton contribution. If, however, the S_dg values are too steep, then one may anticipate biased a_ph(440 nm) retrievals.

https://bg.copernicus.org/articles/22/4705/2025/bg-22-4705-2025-f15

Figure B3The dots plot the best-fit shape parameter S_dg for the L23 spectra using the k= 4 model (no phytoplankton component) versus the amplitude of a_ph to a_nw at 440 nm. The two are correlated, albeit with large scatter. The blue and yellow shaded regions indicate the ranges of exponential shapes for CDOM and detritus (S_g,S_d), respectively, adopted by L23. Not surprisingly, the majority of retrieved S_dg lie within these loci. The ones with shallower slope, however, may be attributed to the presence of phytoplankton, which effectively flattens a_nw(λ) at λ≈450 nm. The dotted/dashed black lines demarcate the fixed values of S_dg assumed by the GIOP/GSM algorithms for a_dg(λ). These are steeper values than the typical S_g (and all S_d) values adopted by L23. In addition, we show the S_dg derived from an extreme CDOM-subtracted Tara absorption spectrum collected off the coast of Africa (see Prochaska and Gray, 2024), which demonstrates at least one instance of a very low S_dg in the ocean.

Download

https://bg.copernicus.org/articles/22/4705/2025/bg-22-4705-2025-f16

Figure B4Retrievals of a_ph(440 nm) and b_b,p at 600 nm for the GIOP model, with simulated MODIS spectra and perturbations of the R_rs(λ) values by the typical noise. We find that the values are biased and scatter by 1 order of magnitude or more.

Download

https://bg.copernicus.org/articles/22/4705/2025/bg-22-4705-2025-f17

Figure B5Same as Fig. B4 but for the GSM model and the simulated SeaWiFS spectra.

Download

We then performed a new set of inferences on the entire L23 dataset, assuming MODIS- and SeaWiFS-simulated spectra for the GIOP and GSM models, respectively. In both cases, we calculated R_rs(λ) from Eqs. (8) and (10) and performed the inversion with the same model after perturbing the R_rs(λ) values due to the presence of noise. The retrievals are presented in Figs. B4 and B5. Clearly, the retrievals of a_ph and b_b,p are biased and highly uncertain at all values, with nearly 2 orders of magnitude of scatter. Therefore, the detection of a_ph(λ), if it were possible with multi-spectral observations, would be highly uncertain.

The constraints inherent within inversion algorithms like GSM and GIOP do affect the confidence when interpreting changes in maps of retrieved variables such as chlorophyll concentration, absorption coefficients, and backscattering coefficients. The spectral ambiguity in R_rs(λ) data can lead to changes influenced by variations in other optical properties not fully addressed by the models, making it difficult to attribute changes solely to biological factors. Moreover, the interdependence of retrieved parameters, such as chlorophyll, a_CDOM, and b_b,p, means that errors in one can propagate to others, complicating the interpretation of these maps. For example, inaccuracies in backscatter coefficient estimates can affect chlorophyll retrievals. Additionally, variability in environmental conditions can impact the accuracy of the retrieved variables. Algorithm performance may vary across different water types and regions, necessitating further caution when interpreting these changes.

Results from tests of IOP retrieval models have been presented in multiple publications over the years since the beginning of their development (e.g., Mouw et al., 2017; Werdell et al., 2018; Seegers et al., 2018), and a discussion of all of these lies beyond the scope of this paper. Nevertheless, we wish to highlight one in-depth effort summarized by the International Ocean Colour Coordinating Group (IOCCG) Report 5 (Lee, 2006). Similar to our work, the participants applied their IOP retrieval algorithms to a simulated (i.e., known) dataset to assess performance. The majority of these algorithms assumed an exponential term for CDOM/detritus absorption with fixed S_dg and a steep value ( $S_{dg} > 0.015 {nm}^{- 1}$ ). Similar to the results we found for GIOP and GSM (Figs. B5, B4), these consistently over-estimated a_ph(λ) at 440 nm.

Only one team (Boss and Roesler) allowed S_dg to vary (from 0.008 to 0.023 nm⁻¹) in their algorithm, which followed from the Roesler et al. (1989) publication. Referring to their Fig. 8.1, one notes less biased a_ph(λ) values than from the other algorithms and sees that they were the only group to include an error estimation. Given that the axes are a log scaling, one might miss that the uncertainties in a_ph(λ) are large enough to be consistent with zero. This implies that the retrieved values are not statistically distinguishable from zero at the given confidence level. In other words, the results from the only algorithm that allowed S_dg to vary freely indicate that a_ph(λ) could not be reliably constrained by the simulated data.

Code and data availability

All of the code and results presented here are available on GitHub in the following two repositories: https://github.com/ocean-colour/bing (https://doi.org/10.5281/zenodo.13292700, Prochaska, 2025), https://github.com/ocean-colour/ocpy (https://doi.org/10.5281/zenodo.17088615, Prochaska and Gray, 2025). The data on which this article is based are available in Loisel et al. (2023).

Author contributions

JXP conceived of the paper, generated all of the code for BING, and made all of the figures of the paper. He led the writing. RF provided scientific guidance, text, and extensive editing.

Competing interests

The contact author has declared that neither of the authors has any competing interests.

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

The authors wish to thank Brice Menard, Emmanuel Boss, Anna Windle, Henry Housekeeper, and Curtis Mobley for discussions and/or their input on an earlier draft. Last, we acknowledge deriving new insight into the problem from conversations with M. Kehrli.

Financial support

Jason Xavier Prochaska acknowledges support from a Simons Pivot Fellowship. Robert J. Frouin was supported by the National Aeronautics and Space Administration under various grants.

Review statement

This paper was edited by Koji Suzuki and reviewed by two anonymous referees.

References

Behrenfeld, M. J., Boss, E., Siegel, D. A., and Shea, D. M.: Carbon-based ocean productivity and phytoplankton physiology from space, Global Biogeochem. Cy., 19, GB1006, https://doi.org/10.1029/2004GB002299, 2005. a

Behrenfeld, M. J., O'Malley, R. T., Boss, E. S., Westberry, T. K., Graff, J. R., Halsey, K. H., Milligan, A. J., Siegel, D. A., and Brown, M. B.: Revaluating ocean warming impacts on global phytoplankton, Nat. Clim. Change, 6, 323–330, https://doi.org/10.1038/nclimate2838, 2016. a

Bentler, P. M. and Bonett, D. G.: Significance tests and goodness of fit in the analysis of covariance structures, Psychol. Bull., 88, 588–606, https://doi.org/10.1037/0033-2909.88.3.588, 1980. a

Boss, E., Waite, A. M., Karstensen, J., Trull, T., Muller-Karger, F., Sosik, H. M., Uitz, J., Acin as, S. G., Fennel, K., Berman-Frank, I., Thomalla, S., Yamazaki, H., Batten, S., Gregori, G., Richardson, A. J., and Wanninkhof, R.: Recommendations for Plankton Measurements on OceanSITES Moorings With Re levance to Other Observing Sites, Front. Mar. Sci., 9, 929436, https://doi.org/10.3389/fmars.2022.929436, 2022. a

Bricaud, A., Babin, M., Morel, A., and Claustre, H.: Variability in the chlorophyll-specific absorption coefficients of natural phytoplankton: Analysis and parameterization, J. Geophys. Res., 100, 13321–13332, https://doi.org/10.1029/95JC00463, 1995. a

Bricaud, A., Morel, A., Babin, M., Allali, K., and Claustre, H.: Variations of light absorption by suspended particles with chlorophyll a concentration in oceanic (case 1) waters: Analysis and implications for bio-optical models, J. Geophys. Res., 103, 31033–31044, https://doi.org/10.1029/98JC02712, 1998. a, b

Cael, B. B., Bisson, K., Boss, E., and Erickson, Z. r. K.: How many independent quantities can be extracted from ocean color?, Limnol. Oceanogr. Lett., 8, 603–610, https://doi.org/10.1002/lol2.10319, 2023. a, b, c, d

Cetinić, I., McClain, C. R., and Werdell, P. J.: PACE Technical Report Series, Volume 6: Data Product Requirements and Error Budgets, NASA Technical Memorandum NASA/TM-2018 – 2018-219027/Vol. 6, NASA Goddard Space Flight Center, Greenbelt, MD, https://ntrs.nasa.gov/citations/20190001726 (last access: 15 January 2025), 2018. a

Cetinić, I., Rousseaux, C. S., Carroll, I. T., Chase, A. P., Kramer, S. J., Werdell, P. J., Siegel, D. A., Dierssen, H. M., Catlett, D., Neeley, A., Soto Ramos, I. M., nnifer L. Wolny, J., Sadoff, N., Urquhart, E., Westberry, T. K., Stramski, D., Pahlevan, N., Seegers, B. N., Sirk, E., Lange, P. K., Vandermeulen, R. A., Graff, J. R., Allen, J. G. ., Gaube, P., McKinna, L. I., McKibben, S. M., aren E. Binding, C., Calzado, V. S., and Sayers, M.: Phytoplankton composition from sPACE: Requirements, opportunities, and challenges, Remote Sens. Environ., 302, 113964, https://doi.org/10.1016/j.rse.2023.113964, 2024. a

Cloern, J. E. and Jassby, A. D.: Patterns and Scales of Phytoplankton Variability in Estuarine–Coastal Ecosystems, Estuar. Coast., 33, 230–241, https://doi.org/10.1007/s12237-009-9195-3, 2010. a

Defoin-Platel, M. and Chami, M.: How ambiguous is the inverse problem of ocean color in coastal waters?, J. Geophys. Res.-Oceans, 112, C03004, https://doi.org/10.1029/2006JC003847, 2007. a

Erickson, Z. K., Werdell, P. J., and Cetinić, I.: Bayesian retrieval of optically relevant properties from hyperspectral water-leaving reflectances, Appl. Optics, 59, 6902, https://doi.org/10.1364/AO.398043, 2020. a, b

Erickson, Z. K., McKinna, L., Werdell, P. J., and Cetinić, I.: Bayesian approach to a generalized inherent optical property model, Optics Express, 31, 22790, https://doi.org/10.1364/OE.486581, 2023. a, b

Flombaum, P., Wang, W.-L., Primeau, F. W., and Martiny, A. C.: Global picophytoplankton niche partitioning predicts overall positive response to ocean warming, Nat. Geosci., 13, 116–120, https://doi.org/10.1038/s41561-019-0524-2, 2020. a

Foreman-Mackey, D., Hogg, D. W., Lang, D., and Goodman, J.: emcee: The MCMC Hammer, Publ. Astron. Soc. Pac., 125, 306–312, https://doi.org/10.1086/670067, 2013. a

Fox, J., Kramer, S. J., Graff, J. R., Behrenfeld, M. c. J., Boss, E., Tilstone, G., and Halsey, K. H.: An absorption-based approach to improved estimates of phytoplankton bio mass and net primary production, Limnol. Oceanogr. Lett., 7, 419–426, https://doi.org/10.1002/lol2.10275, 2022. a

Frouin, R. and Pelletier, B.: Bayesian methodology for inverting satellite ocean-color data, Remote Sens. Environ., 159, 332–360, https://doi.org/10.1016/j.rse.2014.12.001, 2015. a

Frouin, R., Tan, J., Compiègne, M., Ramon, D. r., Sutton, M., Murakami, H., Antoine, D., Send, U. w., Sevadjian, J., and Vellucci, V.: The NASA EPIC/DSCOVR Ocean PAR Product, Frontiers in Remote Sensing, 3, 833340, https://doi.org/10.3389/frsen.2022.833340, 2022. a

Garver, S. A., Siegel, D. A., and B. Greg, M.: Variability in near-surface particulate absorption spectra: What can a satellite ocean color imager see?, Limnol. Oceanogr., 39, 1349–1367, https://doi.org/10.4319/lo.1994.39.6.1349, 1994. a

Gordon, H. R.: Simple Calculation of the Diffuse Reflectance of the Ocean, Appl. Optics, 12, 2803, https://doi.org/10.1364/AO.12.002803, 1973. a

Gordon, H. R.: Distribution Of Irradiance On The Sea Surface Resulting From A Point Source Imbedded In The Ocean, in: Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, vol. 637 of Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, 66–71, https://doi.org/10.1117/12.964216, 1986. a

Gordon, H. R. and Morel, A. Y.: Remote Assessment of Ocean Color for Interpretation of Satellite Visible Imagery, vol. 4 of Lecture Notes on Coastal and Estuarine Studies, Springer-Verlag, ISBN 9780387909233, 9781118663707, https://doi.org/10.1029/LN004, 1983. a

Gray, P. C., Boss, E., Prochaska, J. X., Kerner, H., Begouen Demeaux, C., and Lehahn, Y.: The Promise and Pitfalls of Machine Learning in Ocean Remote Sensing, Oceanography, 37, 52–63, https://doi.org/10.5670/oceanog.2024.511, 2024. a

Hansen, J. E.: Multiple Scattering of Polarized Light in Planetary Atmospheres Part II. Sunlight Reflected by Terrestrial Water Clouds., J. Atmos. Sci., 28, 1400–1426, https://doi.org/10.1175/1520-0469(1971)028<1400:MSOPLI>2.0.CO;2, 1971. a

Hooker, S. B., Matsuoka, A., Kudela, R. M., Yamashita, Y., Suzuki, K., and Houskeeper, H. F.: A global end-member approach to derive a_CDOM(440) from near-surface optical measurements, Biogeosciences, 17, 475–497, https://doi.org/10.5194/bg-17-475-2020, 2020. a

Houskeeper, H. F. and Hooker, S. B.: The primacy of dissolved organic matter to aquatic light variability, EGUsphere [preprint], https://doi.org/10.5194/egusphere-2024-4163, 2025. a, b

Hovis, W. A., Clark, D. K., Anderson, F., Austin, R. W., Wilson, W. H., Baker, E. T., Ball, D., Gordon, H. R., Mueller, J. L., El-Sayed, S. Z., Sturm, B., Wrigley, R. C., and Yentsch, C. S.: Nimbus-7 Coastal Zone Color Scanner: System Description and Initial Imagery, Science, 210, 60–63, https://doi.org/10.1126/science.210.4465.60, 1980. a

IOCCG: Remote Sensing of Ocean Colour in Coastal, and Other Optically-Complex Waters, no. 3 in Reports of the International Ocean-Colour Coordinating Group, IOCCG, Dartmouth, Canada, ISBN 978-1-896246-54-3, 2000. a

Kramer, S. J., Siegel, D. A., Maritorena, S., and Catlett, D.: Modeling surface ocean phytoplankton pigments from hyperspectral remote sensing reflectance on global scales, Remote Sens. Environ., 270, 112879, https://doi.org/10.1016/j.rse.2021.112879, 2022. a

Kudela, R. M., Hooker, S. B., Houskeeper, H. F., and McPherson, M.: The Influence of Signal to Noise Ratio of Legacy Airborne and Satellite Sensors for Simulating Next-Generation Coastal and Inland Water Products, Remote Sensing, 11, 2071, https://doi.org/10.3390/rs11182071, 2019. a

Lee, Z.: Remote Sensing of Inherent Optical Properties: Fundamentals, Tests of Algorithms, and Applications, IOCCG Report 5, International Ocean-Colour Coordinating Group (IOCCG), Dartmouth, Canada, an Affiliated Program of the Scientific Committee on Oceanic Research (SCOR) and An Associate Member of the Committee on Earth Observation Satellites (CEOS), ISBN 978-1-896246-56-7, 2006. a, b, c, d

Lee, Z., Carder, K. L., and Arnone, R. A.: Deriving inherent optical properties from water color: a multiband qua si-analytical algorithm for optically deep waters, Appl. Optics, 41, 5755–5772, https://doi.org/10.1364/AO.41.005755, 2002. a, b, c

Lee, Z., Carder, K. L., and Arnone, R. A.: Deriving inherent optical properties from water color: a multiband quasi-analytical algorithm for optically deep waters, Appl. Optics, 41, 5755–5772, https://doi.org/10.1364/AO.41.005755, 2002. a, b

Loisel, H., Stramski, D., Dessailly, D., Jamet, C., Li, L., and Reynolds, R. A.: An Inverse Model for Estimating the Optical Absorption and Backscattering Coefficients of Seawater From Remote-Sensing Reflectance Over a Broad Range of Oceanic and Coastal Marine Environments, J. Geophys. Res.-Oceans, 123, 2141–2171, https://doi.org/10.1002/2017JC013632, 2018. a

Loisel, H., Jorge, D. S. F., Reynolds, R. A., and Stramski, D.: A synthetic optical database generated by radiative transfer simulations in support of studies in ocean optics and optical remote sensing of the global ocean, Earth Syst. Sci. Data, 15, 3711–3731, https://doi.org/10.5194/essd-15-3711-2023, 2023. a, b, c

Maritorena, S. and Siegel, D. A.: Consistent merging of satellite ocean color data sets using a bio-optical model, Remote Sens. Environ., 94, 429–440, https://doi.org/10.1016/j.rse.2004.08.014, 2005. a

Maritorena, S., Siegel, D. A., and Peterson, A. R.: Optimization of a semianalytical ocean color model for global-scale applications, Appl. Optics, 41, 2705–2714, https://doi.org/10.1364/AO.41.002705, 2002. a

Mason, J. D., Cone, M. T., and Fry, E. S.: Ultraviolet (250–550 nm) absorption spectrum of pu re water, Appl. Optics, 55, 7163–7172, https://doi.org/10.1364/AO.55.007163, 2016. a

Mobley, C. D. E.: The Oceanic Optics Book, International Ocean Colour Coordinating Group (IOCCG), Dartmouth, NS, Canada, https://www.oceanopticsbook.info/ (last access: 1 July 2025), 2022. a

Mouw, C. B., Hardman-Mountford, N. J., Alvain, S., Bracher, A., Brewin, R. J. W., Bricaud, A., Ciotti, A. M., Devred, E., Fujiwara, A., Hirata, Takafumi an d Hirawake, T., Kostadinov, T. S., Roy, S., and Uitz, J. I.: A Consumer's Guide to Satellite Remote Sensing of Multiple Phytoplankton Groups in the Global Ocean, Front. Mar. Sci., 4, 41, https://doi.org/10.3389/fmars.2017.00041, 2017. a, b

NASA Goddard Space Flight Center: SeaBASS Validation Search, https://seabass.gsfc.nasa.gov/search/?search_type=Perform Validation Search&val_sata=1&val_products=11&val_source=0 (last access: June 2024), 2024. a

O'Reilly, J. E., Maritorena, S., Mitchell, B. G., Siegel, D. A., Carder, K. L., Garver, S. A., Kahru, M., and McClain, C.: Ocean color chlorophyll algorithms for SeaWiFS, J. Geophys. Res.-Oceans, 103, 24937–24953, https://doi.org/10.1029/98JC02160, 1998. a

Prochaska, J. X.: BING: Bayesian INferences with Gordon coefficients, Zenodo [code], https://doi.org/10.5281/zenodo.13292700, 2024. a

Prochaska, J. X.: ocean-colour/bing: Final version for EGU Sphere (v0.3), Zenodo [code], https://doi.org/10.5281/zenodo.15857015, 2025. a

Prochaska, J. X. and Gray, P.: On the Fundamental Additive Modes of Ocean Color Absorption, Limnol. Oceanogr, 70, 2267–2283, https://doi.org/10.22541/essoar.171828481.15444713/v1, 2024. a

Prochaska, J. X. and Gray, P.: ocean-colour/ocpy: First DOI; coincides with publication of the BING paper (v1.0), Zenodo [code], https://doi.org/10.5281/zenodo.17088615, 2025. a

Roesler, C. S., Perry, M. J., and Carder, K. L.: Modeling in situ phytoplankton absorption from total absorption spectra in productive inland marine waters, Limnol. Oceanogr., 34, 1510–1523, https://doi.org/10.4319/lo.1989.34.8.1510, 1989. a, b, c

Seegers, B. N., Stumpf, R. P., Schaeffer, B. A., Loftin, K. A., and Werdell, P. J.: Performance metrics for the assessment of satellite data products: an ocean color case study, Opt. Express, 26, 7404, https://doi.org/10.1364/OE.26.007404, 2018. a, b

Siegel, D. A., Maritorena, S., Nelson, N. B., Hansell, D. A., and Lorenzi-Kayser, M.: Global distribution and dynamics of colored dissolved and detrital organic materials, J. Geophys. Res.-Oceans, 107, 21-1–21-14, https://doi.org/10.1029/2001JC000965, 2002. a

Siegel, D. A., Behrenfeld, M. J., Maritorena, S., McClain, C. R., Antoine, D., Bailey, S. W., Bontempi, P. S., Boss, E. S., Dierssen, H. M., Doney, S. C., Eplee, R. E., J., Evans, R. H., Feldman, G. C., Fields, E., Franz, B. A., Kuring, N. A., Mengelt, C., Nelson, N. B., Patt, F. S., Robinson, W. D., Sarmiento, J. L., Swan, C. M., Werdell, P. J., Westberry, T. K., Wilding, J. G., and Yoder, J. A.: Regional to global assessments of phytoplankton dynamics from the SeaWiFS mission, Remote Sens. Environ., 135, 77–91, https://doi.org/10.1016/j.rse.2013.03.025, 2013. a

Siegel, D. A., Buesseler, K. O., Doney, S. C., Sailley, S. F., Behrenfeld, M. J., and Boyd, P. W.: Global assessment of ocean carbon export by combining satellite observations and food-web models, Global Biogeochem. Cy., 28, 181–196, https://doi.org/10.1002/2013GB004743, 2014. a

Stramski, D., Bricaud, A., and Morel, A.: Modeling the Inherent Optical Properties of the Ocean Based on the Detailed Composition of the Planktonic Community, Appl. Optics, 40, 2929–2945, https://doi.org/10.1364/AO.40.002929, 2001. a

Stramski, D., Joshi, I., and Reynolds, R. A.: Ocean color algorithms to estimate the concentration of particulate organic carbon in surface waters of the global ocean in support of a long-term data record from multiple satellite missions, Remote Sens. Environ., 269, 112776, https://doi.org/10.1016/j.rse.2021.112776, 2022. a

Sydor, M., Gould, R. W., Arnone, R. A., Haltrin, V. I., and Goode, W.: Uniqueness in Remote Sensing of the Inherent Optical Properties of Ocean Water, Appl. Optics, 43, 2156–2162, https://doi.org/10.1364/AO.43.002156, 2004. a, b

Twardowski, M. S., Jamet, C., and Loisel, H.: Analytical model to derive suspended particulate matter concentration in natural waters by inversion of optical attenuation and backscattering, in: Ocean Sensing and Monitoring X, edited by Hou, W. W. and Arnone, R. A., vol. 10631 of Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, p. 106310H, https://doi.org/10.1117/12.2309995, 2018. a

Werdell, P. J. and Bailey, S. W.: The SeaWiFS Bio-optical Archive and Storage System (SeaBASS): Current architecture and implementation, NASA Tech. Memo. 2002-211617, NASA Goddard Space Flight Center, Greenbelt, Maryland, https://ntrs.nasa.gov/citations/20020091607 (last access: 15 January 2025), 2002. a, b, c

Werdell, P. J., Franz, B. A., Bailey, S. W., Feldman, G. C., Boss, E., Brando, V. E., Dowell, M., Hirata, T., Lavender, S. J., Lee, Z., Loisel, H., Maritorena, S., Mélin, F., Moore, T. S., Smyth, T. J., Antoine, D., Devred, E., d'Andon, O. H. F., and Mangin, A.: Generalized ocean color inversion model for retrieving marine inherent optical properties, Appl. Optics, 52, 2019, https://doi.org/10.1364/AO.52.002019, 2013. a

Werdell, P. J., McKinna, L. I. W., Boss, E., Ackleson, S. G., Craig, S. E., Gregg, W. W., Lee, Z., Maritorena, S., Roesler, C. S., Rousseaux, C. S., Stramski, D., Sullivan, J. M., Twardowski, M. S., Tzortziou, M., and Zhang, X.: An overview of approaches and challenges for retrieving marine inherent optical properties from ocean color remote sensing, Prog. Oceanogr., 160, 186–212, https://doi.org/10.1016/j.pocean.2018.01.001, 2018. a, b, c

Wolanin, A., Rozanov, V. V., Dinter, T., Noël, S., Vountas, M., Burrows, J. P., and Bracher, A.: Global retrieval of marine and terrestrial chlorophyll fluorescence at its red peak using hyperspectral top of atmosphere radiance measurements: Feasibility study and first results, Remote Sens. Environ., 166, 243–261, https://doi.org/10.1016/j.rse.2015.05.018, 2015. a

Woo Kim, Y., Kim, T., Shin, J., Lee, D.-S., Park, Y.-S., Kim, Y., and Cha, Y.: Validity evaluation of a machine-learning model for chlorophyll a retrieval using Sentinel-2 from inland and coastal waters, Ecol. Indic., 137, 108737, https://doi.org/10.1016/j.ecolind.2022.108737, 2022. a

Zege, E. P., Ivanov, A. P., and Katsev, I. L.: Image Transfer Through a Scattering Medium, Springer-Verlag, ISBN 0-387-51978-5, 1991. a

Zhang, M., Ibrahim, A., Franz, B. A., and a nd Andrew M. Sayer, Z. A.: Estimating pixel-level uncertainty in ocean color retrievals from MODI S, Opt. Express, 30, 31415–31438, https://doi.org/10.1364/OE.460735, 2022. a, b, c, d

In the following, we adopt Gaussian uncertainties but will incorporate correlated noise when it is provided by satellite missions.

Articles

Short summary

Satellites monitor ocean health globally, but we discovered a fundamental physics limitation when measuring phytoplankton – tiny plants essential to marine ecosystems. Our analysis shows that even advanced satellites cannot reliably distinguish phytoplankton from other ocean components. This challenges decades of research and suggests that existing measurements have greater uncertainties than realized. Combining satellite data with direct ocean sampling is needed for better monitoring of these vital organisms.

On the challenges of retrieving phytoplankton properties from remote-sensing observations

2.1 Bayesian formalism

2.2 Radiative transfer with a physical degeneracy

2.3 A hyperspectral IOP dataset

2.4 Simulating satellite observations

2.5 IOP models

3.1 Failed attempts at arbitrary IOP retrievals

3.2 Parameterized IOP retrievals with BING

3.3 Retrieving aph(λ) with NASA/PACE

3.3 Retrieving a_ph(λ) with NASA/PACE