Forecasting seasonal global sea surface chlorophyll <i>a</i> with a lightweight data-driven approach

Martinez Balbontin, Gabriela; Jouanno, Julien; Benshila, Rachid; Lamouroux, Julien; Perruche, Coralie; Ciavatta, Stefano

doi:10.5194/bg-23-2601-2026

Articles | Volume 23, issue 7

https://doi.org/10.5194/bg-23-2601-2026

Articles | Volume 23, issue 7

Research article

17 Apr 2026

Research article |

| 17 Apr 2026

Forecasting seasonal global sea surface chlorophyll a with a lightweight data-driven approach

Gabriela Martinez Balbontin, Julien Jouanno, Rachid Benshila, Julien Lamouroux, Coralie Perruche, and Stefano Ciavatta

Abstract

Marine chlorophyll a concentration is a key indicator of ecosystem health, and accurate seasonal forecasting has important applications for fisheries management, harmful algal bloom detection, and climate studies. Traditional approaches rely on computationally intensive numerical biogeochemical models that require extensive domain expertise and parameterization.

We propose a data-driven, resource-efficient alternative: a neural architecture based on the U-Net that reconstructs surface, near-global chlorophyll a based on four physical predictors. The model learns to emulate satellite-like chlorophyll a from mixed layer depth, sea surface height, salinity, and temperature, all of which are known to influence phytoplankton distribution and nutrient availability. By leveraging publicly available seasonal forecasts of these variables, we can generate six-month of chlorophyll a predictions in a matter of minutes with a single GPU.

We trained the model using the GLORYS12 reanalysis as input physics and GlobColour merged chlorophyll a observations as reference. When applied to seasonal forecasting by using SEAS5 ensemble forecasts as input as opposed to the reanalysis, the model maintains high skill globally and remains stable across the six-month forecast horizon. Regional analysis shows that the model accurately captures seasonal dynamics and bloom timings across diverse regimes, with performance comparable to or exceeding that of a state-of-the-art numerical biogeochemical model, while requiring orders of magnitude less computational resources.

This approach demonstrates that high-performing, seasonal chlorophyll a forecasting can be achieved through a resource-efficient, observation-driven framework, offering a practical alternative for operational applications where computational constraints limit the use of full biogeochemical models.

Download & links

How to cite.

Received: 17 Mar 2025 – Discussion started: 09 Apr 2025 – Revised: 17 Mar 2026 – Accepted: 31 Mar 2026 – Published: 17 Apr 2026

1 Introduction

Like many other natural systems, marine biogeochemical cycles are vulnerable to the effects of climate change (Bopp et al., 2013; Achterberg, 2014). Stressors such as rising temperatures (Polovina et al., 2011), ocean acidification (Orr et al., 2005), and changes in nutrient availability (Achterberg, 2014; Schneider et al., 2008) are already affecting the oceans, and these impacts are expected to intensify in the coming years.

These changes can have cascading effects on marine ecosystems, causing alterations in species distribution and biodiversity, impacting primary production, and posing risks to food security. An important indicator of changes in the health of aquatic ecosystems is chlorophyll a (chl a) concentration. This photosynthetic pigment can be used to estimate phytoplankton biomass and productivity, and it has the additional advantage of being observable from space (Groom et al., 2019), which allows for its large-scale monitoring.

Accurate seasonal chl a forecasting, even if limited to the surface level, can have a variety of important applications. It can help estimate food availability for higher trophic levels, with implications for fisheries management and biodiversity conservation (Park et al., 2019 b). Additionally, it enables the early detection of harmful algal blooms, allowing for timely warnings to mitigate their impacts on human health and coastal economies (Lin et al., 2021; Wang et al., 2018). Chl a forecasting also serves as a valuable indicator of upwelling events, which influence nutrient availability and primary production. These processes, in turn, have effects on the carbon cycle and the broader climate system (Bopp et al., 2013; Achterberg, 2014; Rousseaux et al., 2021; Park et al., 2019 b).

Traditionally, biogeochemical forecasting has been done using numerical models, which are based on differential equations that model nutrient and carbon cycles (Aumont et al., 2015; Fennel et al., 2022; Berardi, 2020; Gehlen et al., 2015). While some of these models have successfully been used for seasonal chl a forecasting (Rousseaux et al., 2021; Park et al., 2019 b), they can be very computationally prohibitive, and they require extensive knowledge of the systems they represent. Incorrect parametrization can lead to systematic biases and uncertainties, and these errors can propagate throughout the model, limiting its capabilities (Fennel et al., 2022; Berardi, 2020; Gehlen et al., 2015).

Machine learning offers the possibility of making use of the growing number of geospatial data for this purpose. While its use in marine biogeochemical modeling is not as widespread as in meteorology, its adoption is steadily growing (Sadaiappan et al., 2023). In the case of chl a, it has been used to enhance remote sensing datasets by filling observational gaps and correcting biases caused by sensor limitations or cloud cover (Keiner, 1999; Hu et al., 2021; Kolluru and Tiwari, 2022; Park et al., 2019 a; Cao et al., 2020). Martinez et al. (2020) reconstructed historical chl a patterns from surface oceanic and atmospheric predictors, demonstrating that this approach is able to capture decadal and longer trends of past periods, while Roussillon et al. (2023) further expanded on this work, highlighting the different regional modes of variability. Machine learning has also been applied to short-term chl a forecasting, primarily at regional scales. Park et al. (2015) proposed a support vector machine for forecasts within freshwater and estuarine reservoirs, while Wenxiang et al. (2022) used a neural network to generate forecasts in the Xiamen Bay. Additional examples include Chen et al. (2024), who generated forecasts for freshwater lakes, and Ly et al. (2021), who focused on the Han River. Zhu et al. (2023) and Ying et al. (2023) applied data-driven methods for forecasting in coastal waters. Nonetheless, to our knowledge, most existing efforts focus on regional-scale forecasts spanning only a few days.

Many of these studies demonstrate the feasibility of reconstructing chl a using physical data (Martinez et al., 2020; Roussillon et al., 2023; Chen et al., 2024; Sauzède et al., 2017). Based on this, we propose a machine learning-driven methodology to predict six-months of global, surface-level chl a by using forecasts for the same time period of four oceanic variables: mixed layer depth (MLD), sea surface temperature (SST), sea surface salinity (SSS), and sea surface height (SSH). These variables are selected due to their influence on nutrient and oxygen availability, as well as phytoplankton distribution, making them effective predictors of chl a (Browning and Moore, 2023; Palacios et al., 2013; Fernandez-Gonzalez et al., 2022; Xu et al., 2022; Barone et al., 2019; Uz et al., 2001; Chenillat et al., 2021). They also offer practical advantages: they are readily available from ocean reanalyses, and it is possible to obtain their seasonal forecasts.

The goal of this work is to demonstrate that we can not only estimate chl a from these four variables, but that by using publicly available forecasts of these as input, we are able to generate an ensemble of accurate chl a predictions for six months into the future. Our approach operates at a spatial resolution of approximately 25 km ( $1 / 4$ °) and two different temporal resolutions: 5 d (as a proof-of-concept of performance at higher temporal resolution) and monthly (to match the available physics forecasts used as input).

2 Proposed Methodology

The predictive task consists of reconstructing the chl a distribution over a six-month horizon using forecasts of four physical predictors. Given the intended practical utility of these predictions in operational settings, our primary objective was to develop a solution that is computationally efficient, operationally robust, and readily deployable under limited computational resources.

Given that the seasonal cycle of chl a is pronounced, we chose to decompose the forecast problem by calendar month rather than addressing it with a single, more complex model. We therefore implemented a month-dependent strategy based on twelve lightweight neural networks, one for each initialization month of the forecast. The training period spans January 1998 to May 2017. Each network was trained on time series of six consecutive months beginning at its initialization: for example, the January network used January–June data from 1998–2016, while the December network used sequences from December to May (1998–2017). Figure 1 provides an overview of the methodology.

https://bg.copernicus.org/articles/23/2601/2026/bg-23-2601-2026-f01

Figure 1Overview of the methodology: Twelve month-specific neural networks are trained on six-month time series spanning 1998–2017, each initialized on the starting month of the forecast.

Download

This strategy limits the framework's ability to generalize across initialization months and may reduce capacity to learn universal temporal relationships. Nonetheless, it offers some practical advantages: each network is substantially smaller than a unified model, conditioning on the initialization month reduces representational complexity, and the design mirrors the operational workflow, in which users generate chl a forecasts based on physics predictions available from a specific month of initialization. This approach also allows each model to focus on a narrower distribution of patterns, which can be learned effectively with fewer parameters; thus, we considered this an acceptable trade-off.

Two versions of this workflow were created: one using monthly resolution (to match the available physics forecasts that would be used as input operationally), and another using 5 d averages (i.e. 6 data points per month, to test performance at higher temporal resolution). Both architectures use a spatial resolution of $1 / 4$ °, or approximately 25 km.

2.1 Training data

The physics data were provided by the Global Ocean Physics Reanalysis (GLORYS12) (Lellouche et al., 2021; Copernicus Marine Service, Marine Data Store, 2024), and the Global Ocean Color (GlobColour) dataset (Garnesson et al., 2019) was used as the output target. GLORYS12 is a high-resolution reanalysis that provides daily and monthly data on the four predictors (MLD, SST, SSS, SSH), among other variables. It is based on the NEMO (Nucleus for European Modelling of the Ocean) platform and driven at the surface by ECMWF's ERA-Interim and ERA5 meteorological reanalyses. Satellite altimetry, SST, sea ice concentration, and in situ temperature and salinity vertical profiles are assimilated (Lellouche et al., 2021).

The GlobColour dataset provides a combination of gap-filled satellite observations of chl a from multiple remote sensors. Chl a concentrations are derived from reflectance using sensor-specific algorithm coefficients and then merged (Garnesson et al., 2019; Maritorena et al., 2010). Because the availability and quality of remotely sensed chl a degrade at high latitudes due to low solar elevation, persistent cloud cover, and sea-ice, we restrict the domain to a −60 to 80° latitude grid in order to ensure robust training targets.

Within the development dataset (January 1998–May 2017), the years 2001, 2005, 2010 and 2015 were used for validation, while the rest was used for training. The training set was used to fit the model, while the validation set served for hyperparameter optimization and selecting the most effective model architecture. After selecting the final parameters using the training and validation sets, the model is evaluated on an independent test set (2018–2023 for the 5 d model and 2020–2023 for the monthly model).

The physical ocean data were normalized using min-max scaling and the chl a data were log-transformed. Both datasets are available at a daily resolution, and the data were averaged to monthly and 5 d resolutions for training the respective versions of the model. The $1 / 12$ ° GLORYS12 grid was coarsened using bicubic interpolation to match the observational grid resolution, which was obtained at $1 / 4$ °.

2.2 Training objective

The proposed methodology reconstructs a six-month chl a grid ( $\hat{y}$ ) given a corresponding six-month input grid of four physical variables ( $X = {MLD, SST, SSS, SSH}$ ). As mentioned above, the network operates on log-transformed chl a values, where $p_{i} = \log ({\hat{y}}_{i})$ denotes the predicted values for the ith grid point, and r_i=log (y_i) denotes the corresponding log-transformed reference point. The training objective is defined as the mean squared error (MSE) between the network output and the reference:

\begin{matrix} (1) & MSE = \frac{1}{N} \sum_{i = 1}^{N} (r_{i} - p_{i})^{2} \end{matrix}

where N is the total number predicted points in the training batch.

In spite of the log-transformation, initial assessments showed that the model had a tendency to underestimate chl a concentrations, so the standard MSE loss function was modified by adding a small penalty for underestimation. This adjustment was based on empirical testing and was implemented by applying a small weight w whenever the underestimation occurred:

\begin{matrix} (2) & L = \frac{1}{N} \sum_{i = 1}^{N} [(r_{i} - p_{i})^{2} + w \cdot [max (0, r_{i} - p_{i})]^{2}] \end{matrix}

where the second term applies an additional penalty $w \cdot (r_{i} - p_{i})^{2}$ when p_i<r_i. While this approach was effective in reducing underestimation, more principled methods could provide a more robust solution.

2.3 Model architecture

The neural networks are based on the U-Net architecture (Ronneberger et al., 2015), a convolutional neural network with an encoder-decoder structure that has been widely applied to Earth science forecasting tasks due to the grid-based nature of such data (Agrawal et al., 2019; Sønderby et al., 2020; Weyn et al., 2019, 2021). The U-Net's skip connections enable it to combine low-level spatial details with high-level semantic information, allowing reconstruction of both large-scale patterns and fine-scale features simultaneously. Despite advances and increased complexity in more recent architectures, the U-Net's simplicity and efficiency make it an effective and resource-efficient choice for this task. To process both spatial and temporal dimensions simultaneously, we use 3D convolutions (Tran et al., 2015). To balance the additional computational cost of the 3D convolutions, the proposed architecture uses fewer layers and filters than the original U-Net.

The encoder consists of two blocks of convolutional and max-pooling layers, which downsample the input. Two additional convolutional layers refine feature extraction before transitioning to the decoder, which mirrors the encoder and upsamples the data back to the original resolution. Skip connections link matching layers of corresponding size in the encoder and decoder (see Table 1). A rectified linear unit (ReLU) activation function was used throughout the network, except in the output layer, where a Softplus activation function (Dugas et al., 2000) was empirically found to work best. The model architecture is summarized in Table 1.

Table 1Summary of proposed neural architecture.

Download Print Version | Download XLSX

All twelve models share a common architecture and hyperparameters (i.e. number and order of layers, number of convolutional filters and kernel size, learning rate, etc.), which were optimized empirically and using random search. The models were trained using the Adam optimizer (Kingma and Ba, 2017) and a learning rate of 0.001 for a maximum of 300 epochs. We used early stopping on the validation loss to prevent overfitting, but no explicit regularization methods were used. We also used transfer learning, where the training process of a model can be initialized with the pre-trained weights of another, to further optimize computational cost. The models were trained on a single GPU (NVIDIA A100, 40 GB).

2.4 Model evaluation and prediction generation

We evaluate our approach by using two distinct setups, each serving a different purpose. First, we perform a preliminary validation at a 5 d resolution using the GLORYS12 reanalysis as input. We generate a set of 5 d predictions covering the 2018–2023 period. This step isolates the model's ability to map physics to chl a when given accurate, observation-constrained physics, and serves to assess performance at a higher temporal resolution. We then validate seasonal forecast generation using 6 month forecasts from the ECMWF SEAS5 system (Johnson et al., 2019) as input. This is performed for the 2020–2023 period at a monthly temporal resolution to align with the available input data. This setup mimics the intended application of the model, where the chl a predictions would rely on predicted physics as opposed to a reanalysis. Both approaches use a spatial resultion of $1 / 4$ °. We describe the SEAS5 system in more detail below.

The forecast consists of an ensemble of 51 members, and it uses the IFS (Integrated Forecasting System) cycle 43r1 for the atmosphere, NEMO for the ocean, and LIM2 (Louvain-la-Neuve Sea Ice Model version 2) for sea ice. The system assimilates temperature, salinity, satellite altimetry, and snow cover (Johnson et al., 2019). The forecasts are available free of charge on the Copernicus Climate Data Store at a monthly, one-degree resolution (Copernicus Climate Change Service, Climate Data Store, 2018 a, b). A version of higher spatial resolution ( $1 / 4$ °) of the data was kindly provided by ECMWF, but it is possible to obtain similar results by downscaling with a simple interpolation method.

The SEAS5 forecast exhibits some biases relative to the reanalysis (see Fig. 2), which is expected given their different sources and purposes. To account for this, we computed the difference between the monthly climatologies of both datasets over the training period (1998–2017), then subtracted this mean bias from the SEAS5 data before using it as model input. This offset is computed once, and can be applied to all forecasts, including future ones.

https://bg.copernicus.org/articles/23/2601/2026/bg-23-2601-2026-f02

Figure 2Mean biases between SEAS5 (monthly forecast, one-month lead time) and GLORYS12 for the input variables over the 2020–2023 period. While correction is applied using historical climatological differences (see Methods), this figure is indicative of the systematic differences between the forecast and reanalysis.

We used the first 10 (out of 51) ensemble members from SEAS5 as input for the neural network to generate six-month predictions at a monthly resolution (i.e. matching the available forecast resolution), using forecasts initialized from January 2020 to December2023. To evaluate performance across different lead times (i.e., the number of months after forecast initialization), we constructed six continuous time series by concatenating the corresponding month from each six-month forecast. Specifically, we extracted the first month from each forecast to create a one-month lead time series (January 2020–December 2023), the second month from each forecast for a two-month lead time series (February 2020–January 2024), and so on. This is illustrated in Fig. 3 for clarity. We compute SEAS5-based skill metrics over January 2020–December 2023 (i.e., truncating any lead time series months that extend into 2024).

https://bg.copernicus.org/articles/23/2601/2026/bg-23-2601-2026-f03

Figure 3A summary of the prediction generation workflow: We concatenated the nth months from each prediction to generate continuous time series with lead times 1–6 (months).

Download

We analyzed performance by zooming into some open-ocean regions of interest: the North Pacific (170° E–130° W, 25–45° N), Tropical Pacific (170° E–100° W, 10° S–10° N), South Pacific (170–100° W, 35–15° S), North Atlantic (70–20° W, 25–40° N), and Indian Ocean (55–95° E, 25–0° S), which are based on Park et al. (2019 b). We also examined model skill in the Equatorial Atlantic (35° W–5° E, 5° S–5° N) and in selected large coastal ecosystems, specifically the California Current (25–45° N, 130–113° W), Humboldt Current (50–5° S, 85–69° W), South China Sea (1–25° N, 105–119° E), Gulf of Mexico (18–30° N, 100–80° W), Benguela Current (35–12° S, 5–20° E), Bay of Bengal (5–25° N, 80–98° E), Celtic-Biscay Shelf (43–55° N, 15° W–0° E), and Northeast Australian Shelf (25–10° S, 140–155° E). These regions are illustrated in Fig. 4.

https://bg.copernicus.org/articles/23/2601/2026/bg-23-2601-2026-f04

Figure 4Regions where performance is evaluated. Open-ocean (dashed lines): North (1), Tropical (2) and South Pacific (3), North (4) and Equatorial Atlantic (5), and Indian Ocean (6). Coastal large ecosystems (solid lines): California Current (A), Gulf of Mexico (B), Humboldt Current (C), Celtic-Biscay Shelf (D), Benguela Current (E), Bay of Bengal (F), South China Sea (G), and Northeast Australian Shelf (H).

We evaluated our results against BIO4, a state-of-the-art global biogeochemical product from Mercator Océan International (Copernicus Marine Service, Marine Data Store, 2023). BIO4 is based on the PISCES model (Pelagic Interactions Scheme for Carbon and Ecosystem Studies), the biogeochemical component of the NEMO platform (Lamouroux et al., 2023; Aumont et al., 2015). It simulates global cycles of carbon, oxygen, and key nutrients, includes two functional groups each of phytoplankton and zooplankton, and produces a 3D representation of nearly 30 biogeochemical variables. BIO4 provides both analyses and 10 d forecasts down to 5700 m depth. The model is driven by GLO12, a near-real-time physical reanalysis based on GLORYS12, the training input physics in our study, and it assimilates GlobColour observations on a weekly basis (Lamouroux et al., 2024).

Our evaluation focused on the surface layer of the BIO4 analysis (rather than the forecast) due to differing forecast horizons. We note that BIO4 simulates a wide range of biogeochemical variables and processes beyond surface chl a, while our neural network is specifically optimized for chl a prediction and trained directly on observational data. We therefore use BIO4 as a contextual benchmark rather than a direct point of comparison.

We evaluate model skill primarily using RMSE and Pearson correlation computed on the raw (non-log-transformed) chl a fields. All figures in this manuscript use non-log-transformed chl a. Because different diagnostics emphasize different aspects of performance, we report two complementary RMSE definitions. For the regional time series shown in Figs. 11–14, we first spatially average each field over the region to obtain a region-mean time series, then compute RMSE between the ensemble-mean forecast and the GlobColour reference over 2020–2023 (with RMSE of the BIO4 analysis computed on the same dates). For the regional RMSE summaries (Figs. 10, 16, and 17), we compute RMSE within each region using all valid grid-point samples over 2020–2023 either for a given calendar month (Figs. 16 and 17) or averaged across months (Fig. 10). For anomaly visualizations (Figs. 12 and 14), anomalies are computed relative to a monthly historical climatology based on GlobColour (1998–2017). This provides a stable observation-based baseline while retaining systematic forecast biases.

3 Results

3.1 Model validation with reanalysis physics as input

To validate the neural network architecture and isolate model performance from forecast uncertainties, we first evaluated the model using GLORYS12 reanalysis as input physics. We generated predictions for 2018–2023 at a 5 d temporal resolution. Figure 5 shows that the globally averaged (spatial mean at each time step) time series of the neural network predictions exhibits good visual agreement with the observations. We also included outputs from the BIO4 analysis (dashed lines) for comparison. The neural network captures both the amplitude and phasing of the dominant signal. Although it slightly overshoots seasonal peaks, it represents seasonal lows more accurately than BIO4. This validates the model's ability to learn the relationship between the four physical variables and biological productivity.

https://bg.copernicus.org/articles/23/2601/2026/bg-23-2601-2026-f05

Figure 5Globally averaged chl a concentration time series from 2018 to 2023. The neural network predictions using GLORYS12 reanalysis as input physics (green line) are compared against GlobColour satellite observations (black line) and outputs from the BIO4 analysis (dashed lines).

Download

To assess spatial skill, Fig. 6 presents a global map of Pearson correlation coefficients (r) computed at each grid point between the predictions and observations. Correlation values were computed for monthly-averaged data over the 2020–2023 period to facilitate comparison with the forecasting results, which have lower temporal resolution and only cover that temporal period. The correlation map shows strong spatial coherence, with positive correlation values across most of the global ocean. Regions of particularly high correlation are found in the Northern oceans, while the Tropical and South Pacific show less skill.

https://bg.copernicus.org/articles/23/2601/2026/bg-23-2601-2026-f06

Figure 6Spatial distribution of Pearson correlation coefficients between the neural network predictions (with GLORYS12 as input physics) and the observations. Correlation values are computed at each grid point for the monthly-averaged data over the 2020–2023 period for better comparison with forecasting results.

3.2 Seasonal forecast performance

We next evaluated the model in a forecasting configuration by using SEAS5 seasonal forecasts as input physics. All results presented below use the monthly model and the SEAS5 ensemble (10 of 51 members) as input. We note here that the chl a reconstruction model is deterministic, and the ensemble spread arises solely from the SEAS5 physical ensemble.

We generated 6 month forecasts at monthly resolution for 2020–2023. Figure 7 displays the globally averaged time series for forecasts initialized in March, June, September and December, with individual ensemble member trajectories shown as thin lines and initializations indicated by colored markers. The ensemble forecasts are visually consistent with the observations, with individual member trajectories continuing to follow the observed chl a concentrations throughout the 6 month forecast horizon, indicating little skill degradation with increasing lead time.

https://bg.copernicus.org/articles/23/2601/2026/bg-23-2601-2026-f07

Figure 7Globally averaged chl a concentration time series from 2020 to 2023 reconstructed from the seasonal forecasts. The forecasts were initialized in March (cyan), June (magenta), September (yellow), and December (green), and the individual member reconstructions are shown as thin lines. Initialization (lead time 1) months are indicated by large colored markers. GlobColour observations are shown in black and BIO4 outputs are shown in dashed lines.

Download

Figure 8 illustrates spatial skill using the Pearson correlation coefficient between predictions and observations over the 2020–2023 period, shown for one and six-month lead times. Correlations remain spatially coherent but a small reduction in skill is observed relative to the GLORYS12 driven predictions (Fig. 6). This slight decrease is expected from the transition to forecast inputs, since they can introduce a distributional shift relative to the training data (Fig. 2) even after the bias correction applied to SEAS5 (see Methods). The discrepancies occur mostly in the tropical and southern Pacific, the Indian Ocean, and plume-influenced regions near Brazil and to the south of the Gulf of Guinea. We also note that the southern and tropical Pacific exhibit relatively modest skill even in the GLORYS12 validation, suggesting that our particular modeling framework might be suboptimal for these regions, irrespective of the physics being used as input. Nonetheless, skill remains high in the subtropical gyre regions, where correlations are strong and spatially consistent. Additionally, the large-scale correlation patterns show little degradation from lead time one to six-months, indicating that the spatial skill of the predictions is relatively stable over the forecast horizon.

https://bg.copernicus.org/articles/23/2601/2026/bg-23-2601-2026-f08

Figure 8Spatial distribution of Pearson correlation coefficients for seasonal forecasts (SEAS5 used as input physics) at one and six-month lead times over 2020–2023. The ensemble-mean forecast is compared against GlobColour observations.

Figure 9 summarizes skill within each region from Fig. 4 using Pearson's r computed against the GlobColour reference. For each region (and lead time), r is computed using all valid grid-point pairs in the ensemble-mean forecast within the region for each calendar month (pooling 2020–2023), and then averaged across months. In the open-ocean regions (Fig. 9, top), correlations are generally higher than 0.7. The neural-network forecasts outperform BIO4 in the northern basins, consistent with the strong performance visible in Fig. 8. They also outperform BIO4 in the Equatorial Atlantic, although skill there remains weak for the neural model too, as shown in Fig. 8. Performance is lower than BIO4 in the tropics and in the South Pacific and Indian Ocean, again consistent with the poor performance visible in Fig. 8.

https://bg.copernicus.org/articles/23/2601/2026/bg-23-2601-2026-f09

Figure 9Comparative overview of Pearson correlation coefficients for seasonal forecasts at lead times 1–6 (months) over 2020–2023 for different regions. Note that the y axis is cut off at r=0.5.

Download

In the coastal regions (Fig. 9, bottom), correlations are often higher (typically r≳0.8) and exceed BIO4 in all analysed regions, which is notable given the lack of explicit biogeochemical inputs and the challenges of coastal dynamics, although Fig. 8 shows that this performance is far from uniform. Across regions, correlations show limited sensitivity to lead time, with little degradation from lead times one to six-months.

Figure 10 summarizes regional RMSE for the chl a fields. For each region and lead time, similarly to r, the RMSE is computed over all valid grid cells of the ensemble-mean for each calendar month, and then averaged to obtain a single regional RMSE per lead time. The neural model exhibits lower RMSE than BIO4 across all regions considered, including regions where its correlation is lower (see Fig. 9). This indicates that, in those regions, the neural model is closer to the reference in absolute value even if it might be missing specific patterns (such as spatial shifts in mesoscale features), which can penalize correlation more strongly than RMSE. This is consistent with Fig. 15, where the neural predictions generally match the mean level and broad seasonal evolution of GlobColour but can show small timing/position offsets in bloom development. Conversely, although correlations are often higher in coastal regions, RMSE is also higher on average (note the larger y axis scale), consistent with the larger variability and dynamic range of coastal chl a.

https://bg.copernicus.org/articles/23/2601/2026/bg-23-2601-2026-f10

Figure 10Comparative overview of RMSE for seasonal forecasts at lead times 1–6 (months) over 2020–2023 for different regions, compared to the BIO4 analysis.

Download

3.3 Further examining regional performance

Having assessed global-average skill, we now examine performance in the specific regions highlighted in Fig. 4 more in detail.

Figures 11 and 13 show regional time series for forecasts, this time initialized in January, April, July, and October. For each region, we compare the neural network predictions, the GlobColour reference observations, and the BIO4 analysis. Figures 12 and 14 show the corresponding anomalies, computed by subtracting a monthly GlobColour climatology (computed over 1998–2017) from each region to assess the ability to predict deviations from the seasonal cycle. We report the RMSE for each regional (and corresponding anomaly) time series, computed between the ensemble-mean forecast and the regional GlobColour time series over 2020–2023 (with BIO4 RMSE computed on the same dates). These time-series RMSE values quantify errors in the integrated regional signal after spatial averaging and thus complement the gridpoint-based RMSE reported in Fig. 10.

https://bg.copernicus.org/articles/23/2601/2026/bg-23-2601-2026-f11

Figure 11Regional, open-ocean forecasts. The solid black line represents the average GlobColour reference value for each region, while the dashed lines indicate the corresponding BIO4 output. The solid colored lines represent the ensemble average for the neural network-predicted values initialized on January, April, July, and October, while the shading represents the ensemble standard deviation. We report RMSE (computed after spatial averaging) for BIO4 and each group of predictions.

Download

https://bg.copernicus.org/articles/23/2601/2026/bg-23-2601-2026-f12

Figure 12Regional, open-ocean forecasted anomalies. Anomalies are computed by subtracting the monthly GlobColour climatology (seasonal cycle, computed over 1998–2017) from each region-mean time series. The solid black line shows the GlobColour anomaly, dashed lines show the corresponding BIO4 anomaly, and solid colored lines show the ensemble-mean neural-network anomaly forecasts initialized in January, April, July, and October; shading indicates the ensemble standard deviation.

Download

https://bg.copernicus.org/articles/23/2601/2026/bg-23-2601-2026-f13

Figure 13Regional, coastal forecasts. The figure shows GlobColour (black), BIO4 (dashed lines), and the ensemble average and standard deviation of the neural network predictions initialized on January, April, July, and October.

Download

https://bg.copernicus.org/articles/23/2601/2026/bg-23-2601-2026-f14

Figure 14Regional, coastal forecasted anomalies. The figure shows the GlobColour anomaly (black), corresponding BIO4 anomaly (dashed lines), and the ensemble average and standard deviation of forecasted anomalies initialized in January, April, July, and October.

Download

Across regions, the seasonal predictions generally reproduce the observed seasonal cycle and often track GlobColour more closely than BIO4 in terms of magnitude and amplitude. Nonetheless, skill is not uniform and performance varies by region and initialization month, with larger discrepancies often occurring during the peak of the bloom. Note that the RMSE displayed in Figs. 12 and 14 is identical to that of the raw-value figures (Figs. 11 and 13) because the anomalies are computed by subtracting the same monthly GlobColour climatology from both the forecasts and the reference.

The anomaly plots reveal which deviations from the seasonal cycle the model successfully reproduces. The time series demonstrate that forecasts capture departures from the seasonal cycle with varying skill across initialization months and regions, and some cases exhibit consistent offsets or systematic biases. For example, in the South Pacific, predicted anomalies align more closely with BIO4 than with observations, suggesting that shared errors may tied to limitations in the physical input set, or may be predominantly driven by processes that we are either not modeling or struggling to represent.

Notably, in coastal regions such as the Celtic-Biscay Shelf, the South China Sea, the California Current, and the Bay of Bengal, the forecasts capture the overall trend and temporal evolution of the observed anomalies, despite not always matching absolute magnitudes.

Figure 15 shows Hovmöller diagrams, which visualize the evolution of the field as a function of time and one spatial dimension (latitude in this case). The plots summarize the spatiotemporal evolution of surface chl a, making it easier to compare bloom timing, propagation, and amplitude across models. The SEAS5- and GLORYS12-driven neural-network predictions are broadly similar across regions, but the Hovmöller format makes it possible to identify subtle shifts in the timing, latitude/extent, and amplitude of peak concentrations that are likely to contribute to the regional skill degradation highlighted in Fig. 8. BIO4's tendency to overestimate chl a magnitude, is apparent as systematically elevated values relative to GlobColour. These diagrams also highlight interannual variability: despite the dominant seasonal cycle, year-to-year differences in the timing and intensity of blooms are visible. The neural-network predictions exhibit variation across years rather than repeating a purely climatological seasonal pattern, even though these do not always align with the observed deviations from the seasonal cycle.

https://bg.copernicus.org/articles/23/2601/2026/bg-23-2601-2026-f15

Figure 15Hovmöller diagrams showing time vs. latitude for each open-ocean region. Columns show (1) the GlobColour reference, (2) the ensemble-mean lead-1 prediction forced by SEAS5 physics, (3) the corresponding prediction forced by GLORYS12 physics, and (4) the BIO4 output.

Download

To investigate how error evolves through the seasonal cycle, Figs. 16 and 17 show RMSE as a function of calendar month for each region. Here, RMSE is computed separately for each calendar month using all valid grid points within the region over 2020–2023 (i.e. not averaged across months like we do in Fig. 10).

https://bg.copernicus.org/articles/23/2601/2026/bg-23-2601-2026-f16

Figure 16Seasonal evolution of the RMSE for each open-ocean region, comparing BIO4 (dashed lines) with the neural-network seasonal forecasts at lead times 1–6 (L1–L6).

Download

https://bg.copernicus.org/articles/23/2601/2026/bg-23-2601-2026-f17

Figure 17Seasonal evolution of the RMSE for each coastal region comparing BIO4 and seasonal forecasts at L1–L6.

Download

Both the neural network and BIO4 exhibit increased errors during bloom peaks. Northern-hemisphere regions show larger RMSE in the first half of the year, while southern-hemisphere regions peak later, consistent with hemispheric seasonality. This behavior is expected because bloom peaks concentrate the largest amplitudes and strongest spatiotemporal gradients, in RMSE these timing offsets or amplitude damping near the peak translate into large squared errors. The fact that both the neural and process-based models show similar seasonality in error suggests that peak-bloom conditions may be intrinsically harder to predict, potentially reflecting sensitivity to rapidly evolving states and/or drivers that are not explicitly represented, rather than model class alone. However, this seasonal structure is not uniform across regions or between the two models. For example, in the Tropical Pacific the neural-network RMSE shows little seasonal variation compared to BIO4, while in the Humboldt Current BIO4 error has a secondary late-year peak that is not mirrored by the neural model, and the Benguela Current shows a different seasonal pattern. These differences suggest different sources of uncertainty and region-specific dynamics, and that accuracy might be constrained by different representations (or omission) of key drivers.

4 Discussion

We introduced a lightweight, data-driven approach for reconstructing near-global surface chl a from only four physical predictors (MLD, SSH, SSS and SST) and for extending this capability to seasonal forecasting by using forecasted physical ocean states as input. Despite its simplicity, the method achieves high correlation and low error across many regions, while remaining computationally efficient: the network can be trained in a few hours on a single GPU and can produce $1 / 4$ °, six-month forecasts within minutes. This makes the framework attractive for rapid generation of large-scale predictions, and in contexts where computational cost is a primary constraint.

Our approach aligns with previous studies that have demonstrated the potential of using physical ocean variables and machine learning for chl a reconstruction (Sauzède et al., 2017; Martinez et al., 2020; Roussillon et al., 2023; Chen et al., 2024). By applying it to forecasted physical ocean states, we further expand the utility of this approach to forecasting, offering a simple yet useful framework for seasonal, near-global surface chl a predictions. Our model focuses solely on surface chl a, but this variable can be used as a proxy for phytoplankton biomass and variability in primary productivity, and it has the advantage of being more readily verifiable from observations. Skillful surface chl a reconstructions and forecasts can help support the study of ecosystem responses to climate variability, and resource-management use cases that depend on these large-scale indicators. At the same time, we acknowledge that important subsurface processes and vertical structure are not represented in our approach, and so this framework would fall short whenever information about the vertical distribution of biomass is required, such as in the presence of a Deep Chlorophyll Maximum (DCM).

Across the 2020–2023 evaluation period, the seasonal forecasts maintain spatially coherent skill even with a six-month lead time (Fig. 8). Regionally, Pearson correlations generally remain high and show limited degradation with lead time (Fig. 9), while RMSE (Fig. 10) is consistently lower than BIO4, a state-of-the-art process based model. The relatively weak degradation that we see with increasing forecast horizon likely reflects that much of the skill is carried by large-scale, slowly varying components, and broad physical states that retain useful information across the 1–6 month range. The time-series and Hovmöller diagnostics (Figs. 11–13 and 15) indicate that the model reproduces the dominant seasonal cycle while exhibiting year-to-year variability, and that it captures the timing and magnitude of major bloom events in many regions. Errors nevertheless peak during bloom maxima (Figs. 16 and 17), potentially due to heightened sensitivity to rapidly evolving physical conditions, or to drivers that are not explicitly represented in our input set.

Our approach retains useful skill in the large-scale evolution of surface chl a, even when driven by forecasted physics that differ from the reanalysis used for training. The similarity between SEAS5- and GLORYS12-driven predictions (Fig. 15) suggests that the method is not tightly tied to a single source of physical data, in spite of the modest skill reduction relative to reanalysis-driven predictions in certain regions (Fig. 8). While this behavior is expected due to forecast errors (Fig. 2) potentially propagating into the chl a reconstruction, a more detailed examination of these differences would be valuable in future work. Such an analysis could identify where loss of skill is dominated by limits in input predictability versus model sensitivity, providing concrete targets for future model development. These could include region or regime-specific training strategies, domain adaptation approaches, or other input corrections aimed specifically at improving robustness.

Limitations

Our framework is intentionally minimalist, and this design choice implies several limitations. First, stricter diagnostics that emphasize interannual variability and small departures from the climatological seasonal cycle are highly phase-sensitive and can penalize even modest changes in timing or small spatial shifts in bloom features. This is expected because removing the seasonal cycle isolates a weaker, noisier signal, and because the monthly evaluation record, with only 48 time steps, is relatively short. In addition, limited predictability in the forecast physical anomalies can further constrain skill. We therefore emphasize metrics (RMSE and Pearson correlation on the raw fields) that better reflect our main objective, which was reproducing realistic magnitudes and large-scale spatiotemporal evolution of surface chl a in a resource-efficient manner. Performance in more complex scenarios where accurate reproduction of anomalies and outliers is crucial would likely require enhancements beyond the minimalist setup presented here. A more systematic treatment of interannual skill would also require a more methodological assessment of SEAS5 skill with respect to anomalies, in order to evaluate how forecast errors might propagate into the chl a reconstruction. A similar decomposition of skill for the process-based benchmark would enable a more principled comparison of performance between approaches

Second, we deliberately restrict predictors to four variables (MLD, SSH, SSS, and SST) that are available in both reanalyses and seasonal forecasts, and in order to keep the framework lightweight and readily deployable, but by doing so, we neglect drivers that can be crucial. These include irradiance (PAR), a key driver in light-limited regimes, winds and mixing variability not fully summarized by monthly MLD, and biogeochemical state variables (e.g. nutrients). As a result, forecast errors will increase in regimes where chl a variability is strongly controlled by factors not captured in the input set, such as light limitation and nutrient supply. Several studies suggest that interannual predictability in chl a is closely linked to biogeochemical drivers, which can exhibit longer predictability than physical variables alone (Séférian et al., 2014; Rousseaux et al., 2021; Park et al., 2019 b; Ham et al., 2021; Frölicher et al., 2020). As with any modeling approach, trade-offs are inevitable. While biogeochemical variables can carry higher uncertainties, they may also capture important feedbacks not fully represented by physical drivers. Future versions of this approach could therefore benefit from a more principled investigation of predictor uncertainty and sensitivity analyses to systematically incorporate any additional relevant physical and biogeochemical inputs.

Third, the neural network design is intentionally lightweight, but this comes at the cost of potentially failing to capture the full intricacy of the underlying processes. We used a simplified U-Net architecture and a month-conditioned strategy implemented as twelve separately trained networks (one per initialization month) rather than a single unified model. While this choice simplifies training across the seasonal cycle, it increases system complexity and can introduce month-to-month inconsistencies because the mapping from physics to chl a is learned independently for each network. A natural extension would be to adopt a more robust architecture that explicitly conditions on month and lead time through a shared-backbone design.

Furthermore, while MSE is a standard choice for regressive neural networks such as this one, its focus on minimizing error can lead to overly smooth predictions that tend toward the mean. This can result in the loss of fine-scale detail in regions with high variability, and an overall impact on the realism of the model's predictions. Future work could mitigate this by adopting more robust learning objectives and/or by increasing architectural capacity. In addition, the forecast “ensemble” presented here is derived from the propagation of uncertainty from the SEAS5 physical ensemble through a deterministic chl a reconstruction. It does not explicitly represent chl a uncertainties, and it inherits any shared biases and limits in the physical forecast. A more complete approach would require the inclusion of more comprehensive sources of uncertainty and the perturbation of these within the predictive framework.

Finally, data-driven methods like ours, which learn directly from observations, can better reproduce observed chl a magnitudes and avoid some of the systematic biases that arise from uncertain parameterizations in process-based models (manifested as systematic overestimations relative to GlobColour in the case of BIO4). Nonetheless, this same minimalistic design, which emphasizes good skill in the raw fields in its current form, may also limit performance. Mechanistic biogeochemical models, by enforcing dynamical and biogeochemical consistency across variables, may be more robust for extrapolation beyond the observational regime, whereas purely data-driven approaches may degrade under distribution shift or in regimes that are poorly represented in the training data.

5 Conclusions

Our resource-efficient, data-driven methodology demonstrates competitive results for the estimation of seasonal surface chl a at a near-global scale. This is done by using MLD, SSH, SSS and SST data as input for a lightweight, U-Net-like neural network that is trainable in a matter of hours on a single GPU. Our method can be used to reconstruct historical chl a patterns (i.e. by using a reanalysis as input), or to generate predictions based on easily accessible forecast data. While future work is needed to better quantify and improve performance on interannual variability, these predictions show low error and high correlation to observations across different regions and lead times. The method is computationally efficient, and can be easily expanded to improve accuracy and extend its capabilities to other satellite-derived ocean-colour variables.

Code availability

The software code used in this paper are available upon request by contacting the corresponding author.

Data availability

All observational and reanalysis data used in this study are publicly available through the Copernicus Climate Change Service (C3S) and the Copernicus Marine Service (CMEMS). The following datasets were used: Copernicus Climate Change Service: Seasonal monthly ocean data (Copernicus Climate Change Service, 2023), CMEMS: Ocean Colour (GlobColour) L4 (OCEANCOLOUR_GLO_BGC_L4_MY_009_104) (Copernicus Marine Environment Monitoring Service, 2023), CMEMS: Global Ocean Physics Reanalysis (GLOBAL_MULTIYEAR_PHY_001_030) (Copernicus Marine Environment Monitoring Service, 2024 a), CMEMS: Global Ocean Biogeochemistry Analysis and Forecast (GLOBAL_ANALYSISFORECAST_BGC_001_028) (Copernicus Marine Environment Monitoring Service, 2024 b).

Author contributions

JJ, RB and GMB conceived the study. GMB collected the data, developed the model, performed the data analysis and created the figures and original draft of the manuscript. All authors discussed the results and contributed to the final manuscript.

Competing interests

At least one of the (co-)authors is a member of the editorial board of Biogeosciences. The peer-review process was guided by an independent editor, and the authors also have no other competing interests to declare.

Disclaimer

Views and opinions expressed are however those of the author(s) only and do not necessarily reflect those of the European Union or the European Climate, Infrastructure and Environment Executive Agency (CINEA). Neither the European Union nor the granting authority can be held responsible for them.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

The authors would like to thank the anonymous reviewers for their constructive comments and suggestions, which significantly helped improve the quality of this manuscript.

Financial support

This research has been supported by the European Union's Horizon Europe Programme under Grant Agreement no. 101112823 (DTO-BioFlow), by the NECCTON project (“New Copernicus Capability for Trophic Ocean Networks”), funded under Horizon Europe RIA grant no. 101081273, and by the French National Centre for Space Studies (CNES) under project TOSCA SARGAT (grant no. ANR-19-SARG-0007).

Review statement

This paper was edited by Tina Treude and reviewed by two anonymous referees.

References

Achterberg, E. P.: Grand challenges in marine biogeochemistry, Front. Marine Sci., 1, https://doi.org/10.3389/fmars.2014.00007, 2014. a, b, c

Agrawal, S., Barrington, L., Bromberg, C., Burge, J., Gazen, C., and Hickey, J.: Machine Learning for Precipitation Nowcasting from Radar Images, arXiv [preprint], https://doi.org/10.48550/arXiv.1912.12132, 2019. a

Aumont, O., Ethé, C., Tagliabue, A., Bopp, L., and Gehlen, M.: PISCES-v2: an ocean biogeochemical model for carbon and ecosystem studies, Geosci. Model Dev., 8, 2465–2513, https://doi.org/10.5194/gmd-8-2465-2015, 2015. a, b

Barone, B., Coenen, A., Beckett, S., McGillicuddy, D., Weitz, J., and Karl, D.: The ecological and biogeochemical state of the North Pacifi c Subtropical Gyre is linked to sea surface height, J. Mar. Res., 77, 215–245, 2019. a

Berardi, D.: 21st-century biogeochemical modeling: Challenges for Century-based models and where do we go from here?, GCB Bioenergy, 12, 774–788, https://doi.org/10.1111/gcbb.12730, 2020. a, b

Bopp, L., Resplandy, L., Orr, J. C., Doney, S. C., Dunne, J. P., Gehlen, M., Halloran, P., Heinze, C., Ilyina, T., Séférian, R., Tjiputra, J., and Vichi, M.: Multiple stressors of ocean ecosystems in the 21st century: projections with CMIP5 models, Biogeosciences, 10, 6225–6245, https://doi.org/10.5194/bg-10-6225-2013, 2013. a, b

Browning, T. J. and Moore, C. M.: Global analysis of ocean phytoplankton nutrient limitation reveals high prevalence of co-limitation, Nat. Commun., 14, 5014, https://doi.org/10.1038/s41467-023-40774-0, 2023. a

Cao, Z., Ma, R., Duan, H., Pahlevan, N., Melack, J., Shen, M., and Xue, K.: A machine learning approach to estimate chlorophyll a from Landsat-8 measurements in inland lakes, Remote Sens. Environ., 248, 111974, https://doi.org/10.1016/j.rse.2020.111974, 2020. a

Chen, C., Chen, Q., Yao, S., He, M., Zhang, J., Li, G., and Lin, Y.: Combining physical-based model and machine learning to forecast chlorophyll a concentration in freshwater lakes, Sci. Total Environ., 907, 168097, https://doi.org/10.1016/j.scitotenv.2023.168097, 2024. a, b, c

Chenillat, F., Illig, S., Jouanno, J., Awo, F. M., Alory, G., and Brehmer, P.: How do Climate Modes Shape the Chlorophyll a Interannual Variability in the Tropical Atlantic?, Geophys. Res. Lett., 48, e2021GL093769, https://doi.org/10.1029/2021GL093769, 2021. a

Copernicus Climate Change Service, Climate Data Store: Seasonal forecast monthly averages of ocean variables, https://doi.org/10.24381/cds.2f9be611, 2018a. a

Copernicus Climate Change Service, Climate Data Store: Seasonal forecast monthly statistics on single levels, https://doi.org/10.24381/cds.68dd14c3, 2018b. a

Copernicus Climate Change Service: Seasonal monthly ocean data, Copernicus Climate Data Store [data set], https://doi.org/10.24381/cds.2f9be611, 2023. a

Copernicus Marine Environment Monitoring Service: Ocean Colour (GlobColour) L4 (OCEANCOLOUR_GLO_BGC_L4_MY_009_104), Copernicus Marine Service [data set], https://doi.org/10.48670/moi-00281, 2023. a

Copernicus Marine Environment Monitoring Service: Global Ocean Physics Reanalysis (GLOBAL_MULTIYEAR_PHY_001_030), Copernicus Marine Service [data set], https://doi.org/10.48670/moi-00021, 2024a. a

Copernicus Marine Environment Monitoring Service: Global Ocean Biogeochemistry Analysis and Forecast (GLOBAL_ANALYSISFORECAST_BGC_001_028), Copernicus Marine Service [data set], https://doi.org/10.48670/moi-00015, 2024b. a

Copernicus Marine Service, Marine Data Store: Global Ocean Biogeochemistry Analysis and Forecast, https://doi.org/10.48670/moi-00015, 2023. a

Copernicus Marine Service, Marine Data Store: Global Ocean Physics Reanalysis (GLORYS12), https://doi.org/10.48670/moi-00021, 2024. a

Dugas, C., Bengio, Y., Bélisle, F., Nadeau, C., and Garcia, R.: Incorporating Second-Order Functional Knowledge for Better Option Pricing, https://dl.acm.org/doi/10.5555/3008751.3008817 (last access: 17 March 2025), 2000. a

Fennel, K., Mattern, J. P., Doney, S. C., Bopp, L., Moore, A. M., Wang, B., and Yu, L.: Ocean biogeochemical modelling, Nature Reviews Methods Primers, 2, 1–21, https://doi.org/10.1038/s43586-022-00154-2, 2022. a, b

Fernandez-Gonzalez, C., Tarran, G. A., Schuback, N., Woodward, E. M. S., Aristegui, J., and Maranon, E.: Phytoplankton responses to changing temperature and nutrient availability are consistent across the tropical and subtropical Atlantic, Communications Biology, 5, 1035, https://doi.org/10.1038/s42003-022-03971-z, 2022. a

Frölicher, T. L., Ramseyer, L., Raible, C. C., Rodgers, K. B., and Dunne, J.: Potential predictability of marine ecosystem drivers, Biogeosciences, 17, 2061–2083, https://doi.org/10.5194/bg-17-2061-2020, 2020. a

Garnesson, P., Mangin, A., Fanton d'Andon, O., Demaria, J., and Bretagnon, M.: The CMEMS GlobColour chlorophyll a product based on satellite observation: multi-sensor merging and flagging strategies, Ocean Sci., 15, 819–830, https://doi.org/10.5194/os-15-819-2019, 2019. a, b

Gehlen, M., Barciela, R., Bertino, L., Brasseur, P., Butenschön, M., Chai, F., Crise, A., Drillet, Y., Ford, D., Lavoie, D., Lehodey, P., Perruche, C., Samuelsen, A., and Simon, E.: Building the capacity for forecasting marine biogeochemistry and ecosystems: recent advances and future developments, J. Oper. Oceanogr., 8, s168–s187, https://doi.org/10.1080/1755876X.2015.1022350, 2015. a, b

Groom, S., Sathyendranath, S., Ban, Y., Bernard, S., Brewin, R., Brotas, V., Brockmann, C., Chauhan, P., Choi, J.-k., Chuprin, A., Ciavatta, S., Cipollini, P., Donlon, C., Franz, B., He, X., Hirata, T., Jackson, T., Kampel, M., Krasemann, H., Lavender, S., Pardo-Martinez, S., Mélin, F., Platt, T., Santoleri, R., Skakala, J., Schaeffer, B., Smith, M., Steinmetz, F., Valente, A., and Wang, M.: Satellite Ocean Colour: Current Status and Future Perspective, Front. Marine Sci., 6, https://doi.org/10.3389/fmars.2019.00485, 2019. a

Ham, Y.-G., Joo, Y.-S., and Park, J.-Y.: Mechanism of skillful seasonal surface chlorophyll prediction over the southern Pacific using a global earth system model, Clim. Dynam., 56, 45–64, https://doi.org/10.1007/s00382-020-05403-2, 2021. a

Hu, C., Feng, L., and Guan, Q.: A Machine Learning Approach to Estimate Surface Chlorophyll a Concentrations in Global Oceans From Satellite Measurements, IEEE T. Geosci. Remote, 59, 4590–4607, https://doi.org/10.1109/TGRS.2020.3016473, 2021. a

Johnson, S. J., Stockdale, T. N., Ferranti, L., Balmaseda, M. A., Molteni, F., Magnusson, L., Tietsche, S., Decremer, D., Weisheimer, A., Balsamo, G., Keeley, S. P. E., Mogensen, K., Zuo, H., and Monge-Sanz, B. M.: SEAS5: the new ECMWF seasonal forecast system, Geosci. Model Dev., 12, 1087–1117, https://doi.org/10.5194/gmd-12-1087-2019, 2019. a, b

Keiner, L. E.: Estimating oceanic chlorophyll concentrations with neural networks, Int. J. Remote Sens., 20, 189–194, https://doi.org/10.1080/014311699213695, 1999. a

Kingma, D. P. and Ba, J.: Adam: A Method for Stochastic Optimization, arXiv [preprint], https://doi.org/10.48550/arXiv.1412.6980, 2017. a

Kolluru, S. and Tiwari, S. P.: Modeling ocean surface chlorophyll a concentration from ocean color remote sensing reflectance in global waters using machine learning, Sci. Total Environ., 844, 157191, https://doi.org/10.1016/j.scitotenv.2022.157191, 2022. a

Lamouroux, J., Perruche, C., Mignot, A., Paul, J., Szczypta, C., and Gutknecht, E.: Global Ocean Biogeochemistry Analysis and Forecast, Quality Information Document, https://doi.org/10.48670/moi-00015, 2023. a

Lamouroux, J., Perruche, C., Mignot, A., Paul, J., and Szczypta, C.: Global Production Center GLOBAL_ANALYSISFORECAST_BGC_001_028 Quality Information Document, Copernicus Marine Service, https://doi.org/10.48670/moi-00015, 2024. a

Lellouche, J.-M., Greiner, E., Bourdallé-Badie, R., Garric, G., Melet, A., Drévillon, M., Bricaud, C., Hamon, M., Le Galloudec, O., Regnier, C., Candela, T., Testut, C.-E., Gasparin, F., Ruggiero, G., Benkiran, M., Drillet, Y., and Le Traon, P.-Y.: The Copernicus Global $1 / 12$ ° Oceanic and Sea Ice GLORYS12 Reanalysis, Front. Earth Sci., 9, https://doi.org/10.3389/feart.2021.698876, 2021. a, b

Lin, J., Miller, P. I., Jönsson, B. F., and Bedington, M.: Early Warning of Harmful Algal Bloom Risk Using Satellite Ocean Color and Lagrangian Particle Trajectories, Front. Marine Sci., 8, https://doi.org/10.3389/fmars.2021.736262, 2021. a

Ly, Q. V., Nguyen, X. C., Lê, N. C., Truong, T.-D., Hoang, T.-H. T., Park, T. J., Maqbool, T., Pyo, J., Cho, K. H., Lee, K.-S., and Hur, J.: Application of Machine Learning for eutrophication analysis and algal bloom prediction in an urban river: A 10 year study of the Han River, South Korea, Sci. Total Environ., 797, 149040, https://doi.org/10.1016/j.scitotenv.2021.149040, 2021. a

Maritorena, S., d'Andon, O. H. F., Mangin, A., and Siegel, D. A.: Merged satellite ocean color data products using a bio-optical model: Characteristics, benefits and issues, Remote Sens. Environ., 114, 1791–1804, https://doi.org/10.1016/j.rse.2010.04.002, 2010. a

Martinez, E., Gorgues, T., Lengaigne, M., Fontana, C., Sauzède, R., Menkes, C., Uitz, J., Di Lorenzo, E., and Fablet, R.: Reconstructing Global Chlorophyll a Variations Using a Non-linear Statistical Approach, Front. Marine Sci., 7, https://doi.org/10.3389/fmars.2020.00464, 2020. a, b, c

Orr, J. C., Fabry, V. J., Aumont, O., Bopp, L., Doney, S. C., Feely, R. A., Gnanadesikan, A., Gruber, N., Ishida, A., Joos, F., Key, R. M., Lindsay, K., Maier-Reimer, E., Matear, R., Monfray, P., Mouchet, A., Najjar, R. G., Plattner, G.-K., Rodgers, K. B., Sabine, C. L., Sarmiento, J. L., Schlitzer, R., Slater, R. D., Totterdell, I. J., Weirig, M.-F., Yamanaka, Y., and Yool, A.: Anthropogenic ocean acidification over the twenty-first century and its impact on calcifying organisms, Nature, 437, 681–686, https://doi.org/10.1038/nature04095, 2005. a

Palacios, D. M., Hazen, E. L., Schroeder, I. D., and Bograd, S. J.: Modeling the temperature-nitrate relationship in the coastal upwelling domain of the California Current, J. Geophys. Res.-Oceans, 118, 3223–3239, https://doi.org/10.1002/jgrc.20216, 2013. a

Park, J., Kim, J.-H., Kim, H.-c., Kim, B.-K., Bae, D., Jo, Y.-H., Jo, N., and Lee, S. H.: Reconstruction of Ocean Color Data Using Machine Learning Techniques in Polar Regions: Focusing on Off Cape Hallett, Ross Sea, Remote Sens.-Basel, 11, 1366, https://doi.org/10.3390/rs11111366, 2019a. a

Park, J.-Y., Stock, C. A., Dunne, J. P., Yang, X., and Rosati, A.: Seasonal to multiannual marine ecosystem prediction with a global Earth system model, Science, 365, 284–288, https://doi.org/10.1126/science.aav6634, 2019b. a, b, c, d, e

Park, Y., Cho, K. H., Park, J., Cha, S. M., and Kim, J. H.: Development of early-warning protocol for predicting chlorophyll a concentration using machine learning models in freshwater and estuarine reservoirs, Korea, Sci. Total Environ., 502, 31–41, https://doi.org/10.1016/j.scitotenv.2014.09.005, 2015. a

Polovina, J. J., Dunne, J. P., Woodworth, P. A., and Howell, E. A.: Projected expansion of the subtropical biome and contraction of the temperate and equatorial upwelling biomes in the North Pacific under global warming, ICES J. Mar. Sci., 68, 986–995, https://doi.org/10.1093/icesjms/fsq198, 2011. a

Ronneberger, O., Fischer, P., and Brox, T.: U-Net: Convolutional Networks for Biomedical Image Segmentation, arXiv [preprint], https://doi.org/10.48550/arXiv.1505.04597, 2015. a

Rousseaux, C. S., Gregg, W. W., and Ott, L.: Assessing the Skills of a Seasonal Forecast of Chlorophyll in the Global Pelagic Oceans, Remote Sens.-Basel, 13, 1051, https://doi.org/10.3390/rs13061051, 2021. a, b, c

Roussillon, J., Fablet, R., Gorgues, T., Drumetz, L., Littaye, J., and Martinez, E.: A Multi-Mode Convolutional Neural Network to reconstruct satellite-derived chlorophyll a time series in the global ocean from physical drivers, Front. Marine Sci., 10, https://doi.org/10.3389/fmars.2023.1077623, 2023. a, b, c

Sadaiappan, B., Balakrishnan, P., C. R., V., Vijayan, N. T., Subramanian, M., and Gauns, M. U.: Applications of Machine Learning in Chemical and Biological Oceanography, ACS Omega, 8, 15831–15853, https://doi.org/10.1021/acsomega.2c06441, 2023. a

Sauzède, R., Bittig, H. C., Claustre, H., Pasqueron de Fommervault, O., Gattuso, J.-P., Legendre, L., and Johnson, K. S.: Estimates of Water-Column Nutrient Concentrations and Carbonate System Parameters in the Global Ocean: A Novel Approach Based on Neural Networks, Front. Marine Sci., 4, https://doi.org/10.3389/fmars.2017.00128, 2017. a, b

Schneider, B., Bopp, L., Gehlen, M., Segschneider, J., Frölicher, T. L., Cadule, P., Friedlingstein, P., Doney, S. C., Behrenfeld, M. J., and Joos, F.: Climate-induced interannual variability of marine primary and export production in three global coupled climate carbon cycle models, Biogeosciences, 5, 597–614, https://doi.org/10.5194/bg-5-597-2008, 2008. a

Séférian, R., Bopp, L., Gehlen, M., Swingedouw, D., Mignot, J., Guilyardi, E., and Servonnat, J.: Multiyear predictability of tropical marine productivity, P. Natl. Acad. Sci. USA, 111, 11646–11651, https://doi.org/10.1073/pnas.1315855111, 2014. a

Sønderby, C. K., Espeholt, L., Heek, J., Dehghani, M., Oliver, A., Salimans, T., Agrawal, S., Hickey, J., and Kalchbrenner, N.: MetNet: A Neural Weather Model for Precipitation Forecasting, arXiv [preprint], https://doi.org/10.48550/arXiv.2003.12140, 2020. a

Tran, D., Bourdev, L., Fergus, R., Torresani, L., and Paluri, M.: Learning Spatiotemporal Features with 3D Convolutional Networks, in: 2015 IEEE International Conference on Computer Vision (ICCV), IEEE, Santiago, Chile, 4489–4497, https://doi.org/10.1109/ICCV.2015.510, 2015. a

Uz, B. M., Yoder, J. A., and Osychny, V.: Pumping of nutrients to ocean surface waters by the action of propagating planetary waves, Nature, 409, 597–600, https://doi.org/10.1038/35054527, 2001. a

Wang, H., Zhu, R., Zhang, J., Ni, L., Shen, H., and Xie, P.: A Novel and Convenient Method for Early Warning of Algal Cell Density by Chlorophyll Fluorescence Parameters and Its Application in a Highland Lake, Front. Plant Sci., 9, https://doi.org/10.3389/fpls.2018.00869, 2018. a

Wenxiang, D., Caiyun, Z., Shaoping, S., and Xueding, L.: Optimization of deep learning model for coastal chlorophyll a dynamic forecast, Ecol. Model., 467, 109913, https://doi.org/10.1016/j.ecolmodel.2022.109913, 2022. a

Weyn, J. A., Durran, D. R., and Caruana, R.: Can Machines Learn to Predict Weather? Using Deep Learning to Predict Gridded 500-hPa Geopotential Height From Historical Weather Data, J. Adv. Model. Earth Sy., 11, 2680–2693, https://doi.org/10.1029/2019MS001705, 2019. a

Weyn, J. A., Durran, D. R., Caruana, R., and Cresswell-Clay, N.: Sub-Seasonal Forecasting With a Large Ensemble of Deep-Learning Weather Prediction Models, J. Adv. Model. Earth Sy., 13, e2021MS002502, https://doi.org/10.1029/2021MS002502, 2021. a

Xu, D., Wang, T., Xing, X., and Bian, C.: The Relationship Between Nitrate and Potential Density in the Ocean South of 30° S, J. Geophys. Res.-Oceans, 127, e2022JC018948, https://doi.org/10.1029/2022JC018948, 2022. a

Ying, C., Xiao, L., Xueliang, Z., Wenyang, S., and Chongxuan, X.: Marine chlorophyll a prediction based on deep auto-encoded temporal convolutional network model, Ocean Model., 186, 102263, https://doi.org/10.1016/j.ocemod.2023.102263, 2023. a

Zhu, X., Guo, H., Huang, J. J., Tian, S., and Zhang, Z.: A hybrid decomposition and Machine learning model for forecasting Chlorophyll a and total nitrogen concentration in coastal waters, J. Hydrol., 619, 129207, https://doi.org/10.1016/j.jhydrol.2023.129207, 2023. a

Articles

Download

Article (14560 KB)
Full-text XML

Short summary

This study uses machine learning to predict global sea surface chlorophyll a, which is important for monitoring marine ecosystems and the carbon cycle. Using forecasts of sea surface temperature, salinity, height, and mixed layer depth, we generate global predictions up to six months ahead in just minutes. Our approach matches state-of-the-art numerical methods while being faster and more resource-efficient.