Estimating the variability of deep-ocean particle flux collected by sediment traps using satellite data and machine learning

Picard, Théo; Baker, Chelsey A.; Gula, Jonathan; Fablet, Ronan; Mémery, Laurent; Lampitt, Richard

doi:https://doi.org/10.5194/bg-22-4309-2025

Articles | Volume 22, issue 17

https://doi.org/10.5194/bg-22-4309-2025

Articles | Volume 22, issue 17

Research article

01 Sep 2025

Research article |

| 01 Sep 2025

Estimating the variability of deep-ocean particle flux collected by sediment traps using satellite data and machine learning

Théo Picard, Chelsey A. Baker, Jonathan Gula, Ronan Fablet, Laurent Mémery, and Richard Lampitt

Abstract

The gravitational pump plays a key role in the ocean carbon cycle by exporting sinking organic carbon from the surface to the deep ocean. Deep sediment trap time series provide unique measurements of this sequestered carbon flux. Sinking particles are influenced by physical short-term spatio-temporal variability, which inhibits the establishment of a direct link to their surface origin. In this study, we present a novel machine learning tool, designated as U-Net_SST−SSH, which is capable of predicting the catchment area of particles captured by sediment traps moored at a depth of 3000 m above the Porcupine Abyssal Plain (PAP) based solely on surface data. The machine learning tool was trained and evaluated using Lagrangian experiments in a realistic CROCO numerical simulation. The conventional approach of assuming a static 100–200 km box over the sediment trap location only yields an average prediction for ∼25 % of the source region, whilst U-Net_SST−SSH predicts ∼50 %. U-Net_SST−SSH was then applied to satellite observations to create a 20-year catchment area dataset, which demonstrates a stronger correlation between the PAP site deep particle fluxes and surface chlorophyll-a concentration compared with the conventional approach. However, predictions remain highly sensitive to the local deep dynamics which are not observed in surface ocean dynamics. The improved identification of the particle source region for deep-ocean sediment traps can facilitate a more comprehensive understanding of the mechanisms driving the export of particles from the surface to the deep ocean, a key component of the biological carbon pump.

Download & links

Article (PDF, 9708 KB)

Download & links

How to cite.

Received: 22 Oct 2024 – Discussion started: 05 Dec 2024 – Revised: 16 May 2025 – Accepted: 10 Jun 2025 – Published: 01 Sep 2025

1 Introduction

The biological carbon pump (BCP) is one mechanism that sequesters carbon from the atmosphere into the deep ocean. The BCP plays a key role in the climate system as, without it, the atmospheric CO₂ concentrations would be about twice those observed today (Parekh et al., 2006; Kwon et al., 2009). Furthermore, the BCP is a crucial source of food resources in the deep ocean (Grabowski et al., 2019). However, despite the considerable importance of the BCP, its driving mechanisms are poorly understood (Le Moigne, 2019). Given that climate-change-driven perturbations may have widescale implications for the BCP, it is of utmost importance to improve our understanding of this topic (Kwon et al., 2009; Passow and Carlson, 2012; Palevsky and Nicholson, 2018; Henson et al., 2022; Wilson et al., 2022).

One of the main processes contributing to the export of the BCP is the export of organic particles from the surface to the deep ocean, which sink due to their excess density (Siegel et al., 2016; Durkin et al., 2016; Le Moigne, 2019). This is known as the gravitational pump (Boyd et al., 2019; Siegel et al., 2023). This is a complex process modulated, on the one hand, by phytoplankton net primary production (NPP), which uses carbon dioxide, solar energy, and available nutrients for photosynthesis in the lighted upper layer of the ocean, also known as the euphotic zone (∼0–200 m), and, on the other hand, by zooplankton faecal pellets (Lampitt et al., 1990). To assess the magnitude and composition of particle sinking via the gravitational pump, long-term observations of the downward particle flux have been made using moored sediment traps (STs). These have been widely used to measure deep particle fluxes below 2000 m (Honjo et al., 2008; McDonnell et al., 2015). At this depth, the carbon can be sequestered for decades or centuries (Guidi et al., 2021; Burd et al., 2016; Siegel et al., 2021; Baker et al., 2022). However, while the time series data from the STs are crucial for estimating the amount of long-term carbon sequestration and for understanding the evolution of the global carbon cycle, fluxes from STs are often generalised over a wide spatial area despite being located in only a single data location. This spatial limitation hinders the ability of these instruments to capture the inherent variability of deep-ocean particle fluxes. Indeed, medium and small local dynamics affect the sinking-particle pathways and can have a significant impact on the ST measurements, especially over short time periods (Siegel et al., 1990; Deuser et al., 1990; Burd et al., 2010; Liu et al., 2018; Dever et al., 2021; Wang et al., 2022 a). This means that particles originate over a large area of the surface ocean, called the catchment area (Deuser et al., 1988; Waniek et al., 2000), highly dependent on the local currents throughout the water column. It therefore remains a challenge to establish a clear link between observed NPP at the surface and deep carbon fluxes (Lampitt et al., 2010, 2023). This is particularly true for 10–30 d time periods, during which time the drivers of carbon “pulses” observed in the STs remain unexplained (Smith et al., 2018).

This study focuses on the contribution of the local physics to the gravitational sinking flux. Traditionally, the sinking-particle catchment area is typically represented as a 100 or 200 km box around the ST (Armstrong et al., 2001; Lampitt et al., 2010, 2023). This simplified catchment area is based on several studies that have used Lagrangian particle backtracking with physical model fields over several years (Waniek et al., 2000; Siegel et al., 2008; Wekerle et al., 2018; Wang et al., 2022 a) to define a so-called “statistical funnel”. The statistical funnel may allow for the annual surface area that influences sediment trap measurements to be captured, but it does not capture the mesoscale spatial variability on timescales of weeks to months. So far, the only method capable of capturing this variability is that of Lagrangian backtracking experiments in reanalyses, i.e. the release of Lagrangian particles in a numerical simulation forced with observations that are supposed to represent the full 3D dynamics of the ocean (Frigstad et al., 2015; Liu et al., 2018; Ruhl et al., 2020; Ma et al., 2021). However, the practice of Lagrangian backtracking in reanalyses has a number of caveats:

Reconstruction of mesoscale and submesoscale sea surface dynamics in numerical models, especially below 150 km resolution, remains a challenge for operational systems with data assimilation schemes (Lellouche et al., 2021; Cutolo et al., 2022; Febvre et al., 2023), which can lead to significant biases in the Lagrangian transport, usually unquantified.
The deep dynamics (below 1000 m) are typically not validated due to a lack of observational data and/or understanding and are almost completely absent in some data assimilation models (Lellouche et al., 2021). Our understanding of the influence of this phenomenon and how well it is represented in models is very limited.
The process of reanalysis is typically complicated and computationally demanding, especially when used in conjunction with backtracking Lagrangian studies. This inherent complexity leads to certain constraints, such as the use of only a single particle's sinking velocity or a limited time frame for the experiments.

To address the aforementioned problems, we have developed a new tool based on machine learning to predict the catchment area of particles reaching deep-ocean STs directly from the model output surface data (Picard et al., 2024). This approach was motivated by two main advances from the literature. Firstly, Wang et al. (2022 a) showed that the monthly catchment area is closely related to the surface mesoscale dynamics and, in particular, to local eddies observed with satellite altimetry (Chelton et al., 2011). In addition, recent studies have demonstrated the benefits of machine learning in predicting ocean interior currents from surface observations (Chapman and Charantonis, 2017; Bolton and Zanna, 2019; Manucharyan et al., 2021), as well as its high performance in reconstructing Lagrangian particle trajectories (Jenkins et al., 2022). Picard et al. (2024) trained a neural network with a numerical simulation dataset at the Porcupine Abyssal Plain sustained observatory (PAP-SO) station, situated in the northeastern Atlantic Ocean (49° N, 16.5° W). The PAP-SO site has collected more than 30 years of deep-ocean particulate organic carbon flux time series (Hartman et al., 2021; Lampitt et al., 2023). Picard et al. (2024) demonstrated the ability to predict the catchment area for particles with a sinking rate of w=50 m d⁻¹, collected in a PAP-SO ST at 1000 m, using only surface numerical simulation outputs. Furthermore, a framework was presented to evaluate the prediction efficiency depending on the local physical conditions, with the best predictions being associated with low kinetic energy and the presence of mesoscale eddies above the ST.

Therefore, this study has two main objectives. The first one is to improve the methodology presented in Picard et al. (2024) by proposing an enhanced version of the machine learning model that is capable of predicting the catchment area of particles collected at 3000 m by the PAP-SO station ST, taking into account a wider range of particle sinking velocities (Sect. 2). Indeed, as previously stated by Wekerle et al. (2018), the provenance of particles can vary considerably depending on their sinking velocity. Consequently, it is imperative to consider the entire particle velocity spectrum in order to accurately represent all of the possible source areas. We also chose to focus on a 3000 m ST because the PAP-SO deep particle flux dataset is the most complete. Indeed, STs at 1000 m at the PAP-SO station do not give reliable results, likely due to hydrodynamic biases for conical traps in the upper ocean (Buesseler et al., 2007), whilst fluxes collected at 3000 m are much more reliable. Similarly to Picard et al. (2024), we will evaluate the network performance and identify the physical factors that influence the accuracy of the catchment area prediction (Sect. 3). Considering the fact that the dynamics below 1000 m at PAP-SO are weak compared to in the upper layer (Wang et al., 2022 a), we expect similar results to Picard et al. (2024), with the particle sinking velocity being the primary factor influencing the prediction score. The second objective is to investigate whether the connection between satellite-derived surface chlorophyll-a concentration, as a proxy for phytoplankton biomass, and the deep-ocean ST fluxes can be improved with the application of the trained machine learning tool (Sect. 4).

2 Methods

In this study, we follow the methodology presented in Picard et al. (2024), where we use a series of Lagrangian experiments in a numerical simulation at the PAP-SO station to train convolutional neural networks (CNNs) to predict the origin of particles collected in a deep-ocean ST. We have adapted the learning strategy to train a model that can be applied to satellite data. This section presents the experiments carried out and the characteristics of the CNNs used.

2.1 Numerical simulation and Lagrangian experiments

The North Atlantic Subpolar Gyre simulation (POLGYR), designed and validated by Le Corre et al. (2020), is used in this study. This simulation is run using the Coastal and Regional Ocean COmmunity (CROCO) model, based on the Regional Ocean Modeling System (ROMS) (Shchepetkin and McWilliams, 2005). The grid has a horizontal resolution of 2 km and 80 vertical levels, allowing the simulation to fully resolve the mesoscale processes and partly resolve those of the submesoscale. The focus of this study is the PAP-SO, represented by the black 1020 km square centred at the PAP-SO station (49° N, 16.5° W) (see Fig. 1a).

https://bg.copernicus.org/articles/22/4309/2025/bg-22-4309-2025-f01

Figure 1(a) Surface snapshot of relative vorticity in the numerical simulation. The black star represents the location of the PAP-SO station. The dashed square outlines the domain considered in this study. (b) A closer examination of the solid black square, with a focus on the vertical dimension. Relative vorticity at 200, 1000, 2000, and 3000 m depth. The location of the sediment trap is indicated by the black star. A group of particles from a single Lagrangian experiment is shown. The colours of the particles represent the time in days after the release at the ST trap. When the particles reach a depth of 200 m, their position is saved (black dots) to compute the two-dimensional probability density function (PDF). The green diamond indicates the northeast of the sub-domain, with the PAP station location as the reference point.

A series of Lagrangian backtracking experiments were performed to represent the sinking-particle pathways from the surface ocean to the PAP-SO sediment trap at 3000 m. In order to account for the wide range of particle sinking velocities observed in the region, as reported in Villa-Alfageme et al. (2016), the experiments were performed with five different sinking velocities w, namely 80, 100, 150, 200, and 300 m d⁻¹. Although slower-sinking particles (w<80 m d⁻¹) have been observed at PAP-SO (Baker et al., 2017; Villa-Alfageme et al., 2016), they are not considered in this study due to computational constraints. Slower-sinking particles present a significant challenge in terms of time taken to sink to 3000 m and dispersion in the spatial dimension, which, in turn, increases the size of our model domain and output considerably.

The Lagrangian experiment is carried out according to the general methodology presented in Picard et al. (2024) considering a deeper ST depth and several particle sinking velocities. Over a period of 10 d, representing the ST collection period, 720 particles (36 particles every 12 h) are released at the PAP-SO sediment trap, which is moored at a depth of 3000 m. During the experiment, all particles have a constant sinking velocity w. Once the particles have ascended to a depth of 200 m, which defines the depth of effective particle export (Wang et al., 2022 a), their position is recorded (Fig. 1b), and the probability density function (PDF) associated with this position is computed. The PDF represents the catchment area of the particles captured by the sediment trap during the 10 d collection period. This is also the variable predicted by the convolutional neural networks (CNNs). For each w considered in this study, a total of 10 260 independent Lagrangian experiments were performed, each providing a PDF associated with a different dynamical condition. Further details on the methodology used can be found in Picard et al. (2024).

2.2 Convolutional neural network architecture and training scheme

We have trained different CNNs to predict the catchment area, depending on the sinking velocities (w) considered here. The training methodology follows a state-of-the-art scheme with independent training, validation, and test datasets (Lecun et al., 2015). We used U-Net schemes as described in Ronneberger et al. (2015). These schemes are among the state-of-the-art neural architectures for mapping problems with n-dimensional tensors, with numerous applications in imaging science (Falk et al., 2019), as well as recent applications in ocean science (Lguensat et al., 2018; Beauchamp et al., 2023; Jenkins et al., 2022). For each training run, we use 8604 Lagrangian experiments for training, 1224 for validation, and 6800 for testing. Further details of the methodology can be found in Picard et al. (2024). To evaluate our predictions, we consider the Bhattacharyya coefficient (Bhattacharyya, 1943) to assess the similarity between the true PDF and the predicted one:

\begin{matrix} (1) & {BC}_{z} = Σ^{i \in D} \sqrt{P_{i, z} Q_{i, z}}, \end{matrix}

where D represents the PAP domain, P_i is the predicted PDF value, and Q_i is the true PDF computed from the Lagrangian experiment at point i and at depth z. The Bhattacharyya coefficient is used to evaluate the similarity between two PDFs and serves as the loss function. In the following section, we refer to this loss function as the Bhattacharyya training loss (BL).

\begin{matrix} (2) & {BL}_{200 m} = 1 - {BC}_{200 m} = 1 - Σ \sqrt{P_{i, 200 m} Q_{i, 200 m}} \end{matrix}

BL_200 m ranges from 1 to 0, with 0 representing a perfect prediction. We implement our machine learning scheme using PyTorch (Paszke et al., 2019). The training phase relies on the Adam optimiser (Kingma and Ba, 2015) with the following hyperparameters: $β = (0.5, 0.999)$ , no weight decay, and a learning rate of 0.001. The training process is performed using mini-batches of size 32. After 50 training epochs, the best model is selected based on its performance based on the validation dataset. We further improve the performance and robustness of the model by using a bootstrapping method with 10 replicates (Breiman, 1996). The final prediction is a set of PDFs computed as the median of the predictions from the 10 models, followed by a re-normalisation step.

The inputs of the U-Nets are geophysical fields for a 800 km wide square box around the sediment trap, with a 50 d time window and a 10 d time step. Three different U-Net models were used to evaluate the impact of the input type and resolution:

U-Net $_{5 V - 4 L}^{w}$ . This configuration uses five variables as inputs, namely temperature, sea surface height (SSH), horizontal velocities U and V, and vorticity at a horizontal resolution of 8 km and at four vertical levels (except for SSH) (0, 750, 1500, 2250 m).
U-Net $_{5 V - 1 L}^{w}$ . This configuration uses sea surface only fields as inputs, namely sea surface temperature (SST), SSH, and sea surface velocities at a horizontal resolution of 8 km.
U-Net $_{SST - SSH}^{w}$ . This configuration uses only SST and SSH as inputs. Its training involves spatially averaged fields to account for the effective resolution of satellite-derived products in the region (80 km for SSH (Chelton et al., 2011) and 28 km for SST level-4 product).

Of these three models, we expect ${U-Net}_{SST - SSH}^{w}$ to be more applicable to reanalysis and satellite-derived products as it has been trained under conditions consistent with observational data inputs. The other two models will allow us to explore the key drivers of Lagrangian particle trajectories from the surface to the deep ocean. In addition to these U-Net models, prediction baselines are considered in the form of 100 and 200 km boxes centred at the PAP-SO station, denoted as box_100 km and box_200 km (i.e. a PDF with uniform values within the box). These baselines represent the conventional approach that has traditionally been used in previous studies to represent the particles' surface origins (Frigstad et al., 2015; Lampitt et al., 2023) and are used here as a reference point to assess the added value of the CNNs.

2.3 Test dataset and evaluation metrics

The considered test dataset consists of 6800 independent Lagrangian experiments that are used for testing the CNNs. Based on the BL_200 m score introduced in Picard et al. (2024), we define a binary classification score as an evaluation metric:

If BL_200 m<0.3, the prediction is valid.
If BL_200 m≥0.3, the prediction is invalid.

As shown in Picard et al. (2024), the BL_200 m score is directly linked to the overlap between the two distributions defined as follows:

\begin{matrix} (3) & F_{200 m} = Σ^{i \in D} min (P_{i, 200 m}, Q_{i, 200 m}) . \end{matrix}

The criterion of BL_200 m<0.3 is arbitrarily chosen to represent a valuable prediction, such that the prediction accounts for F_200 m=45 % or more of the particles. The prediction made by ${U-Net}_{SST - SSH}^{w}$ for the simulation test dataset will be referred to as $D_{simu}^{w}$ in the following. Figure 2 shows three samples from this dataset. The predictions are compared with the PDF of the true particle origins from the Lagrangian experiments (see Sect. 2.1). In this example, predictions (a) and (b) are considered to be valid, while prediction (c) is considered invalid.

https://bg.copernicus.org/articles/22/4309/2025/bg-22-4309-2025-f02

Figure 2Examples of predictions of the probability density function (PDF) of particle origins from the $D_{simu}^{w}$ simulation-based dataset. The PDFs are represented by two contours: the solid contour represents 25 % of the integrated PDF, while the dashed contour represents 75 %. The black PDFs are the true PDFs derived from the Lagrangian experiment, and the red PDFs are the predictions using ${U-Net}_{SST - SSH}^{100}$ . We report the corresponding Bhattacharyya scores. The background represents the relative vorticity 20 d after the initial particle release, which coincides with the particles reaching the euphotic layer (z=200 m) with a sinking velocity of 100 m d⁻¹. Be advised that, in (c), the true PDF is split into two patches. This is likely to be due to divergent dynamics at the source point located at the junction of several eddies, which makes the prediction more challenging.

3 Sensitivity analysis on simulation datasets

In this section, we evaluate the performance of the different U-Net schemes. We test the robustness of the predictions while varying the horizontal resolution of the inputs, the particle sinking velocity, and the type of inputs. Our aim is to gain a deeper understanding of the key influences on sinking-particle trajectories.

3.1 Impact of input drivers and associated spatial resolutions

We first focus on a sinking velocity of w=100 m d⁻¹, which has been assumed to be the mean velocity of particles sinking to the deep ocean as observed at the PAP-SO station (Lampitt et al., 2001; Villa-Alfageme et al., 2014, 2016). To evaluate the robustness of the predictions with respect to the horizontal resolution of the input variables, we examine the evolution of the prediction score given by ${U-Net}_{SST - SSH}^{100}$ (Fig. 3) by progressively degrading the effective resolution of the inputs of SST (dashed black line) and SSH (dashed red line) fields from 8 km (effective resolution of the numerical simulation) to 200 km. The downscaling is conducted using an under-sampling method. To isolate the impact for each dataset, the SST resolution is fixed at 24 km when the SSH resolution is downscaled and vice versa, whereby, when the SST resolution is degraded, the SSH resolution is fixed at 80 km. The evaluation is performed by computing the percentage of valid predictions from the entire test dataset. The score does not change significantly with SST resolution, whereas the score decreases significantly with a coarser SSH resolution. We conclude that the information from SSH, which includes geostrophic-current information, is the main driver for particle trajectory predictions. Conversely, the information derived from SST, which provides smaller-scale features such as fronts, seems to play a secondary role. Regarding the resolution of SSH, the prediction score is not significantly affected at a resolution of 80 km compared to at a finer resolution (a loss of about 3 % of valid predictions). This result supports the potential application of the trained models with real satellite-derived products. However, it is important to note that, for SSH resolutions greater than 100 km, the network prediction score can be seriously degraded.

https://bg.copernicus.org/articles/22/4309/2025/bg-22-4309-2025-f03

Figure 3Evaluation of ${U-Net}_{SST - SSH}^{100}$ score as a percentage of the valid predictions computed with the numerical simulation test dataset. The axis represents the horizontal resolution of the inputs downscaled using an under-sampling method. When the SSH resolution is downscaled, the SST resolution is fixed at 24 km. Conversely, when the SST resolution is downscaled, the SSH resolution is fixed at 80 km.

Download

3.2 Impact of sinking velocities and type of inputs

In Fig. 4, we compare the prediction metrics in terms of BL_200 m, F_200 m and the percentage of valid predictions provided by the three CNNs: (i) ${U-Net}_{5V - 4L}^{w}$ , (ii) ${U-Net}_{5V - 1L}^{w}$ , and (iii) ${U-Net}_{SST - SSH}^{w}$ . Additionally, the scores obtained with the standard catchment areas, i.e. box_200 km and box_100 km, were computed. Overall, the scores improved with larger sinking velocities w. This is probably because particles with high w are less sensitive to subsurface dynamics and are likely to be much closer to the sediment trap location, making it easier to predict the location. Conversely, with a lower sinking velocity, the particle path is typically more complex, with a longer transit resulting in a catchment area that is typically further from the sediment trap location and spread over a larger area, as shown by Wang et al. (2022 a).

https://bg.copernicus.org/articles/22/4309/2025/bg-22-4309-2025-f04

Figure 4Evaluation of three types of U-Net depending on the sinking speed w. U-Net_SST−SSH is in light blue, U-Net_1L-5var is in beige, and U-Net_4L-5var is in red. Additionally, the scores of box_100 m (dark blue) and box_200 m (blue) have been computed. We evaluate the score using (a) BL_200 m, (b) F_200 m, and (c) the percentage of valid predictions.

Download

A comparison of ${U-Net}_{SST - SSH}^{w}$ predictions with traditional 100–200 km area baselines (Fig. 4) reveals a clear added value of the neural network scheme. The ${box}_{200 km / 100 km}$ gives, on average, between 1 %–20 % of valid predictions, with the percentage of predicted surface particles averaging about 20 %. In contrast, the ${U-Net}_{SST - SSH}^{w}$ outperforms this score, with a percentage of valid predictions ranging from 50 % (w=80 m d⁻¹) to 80 % (w=300 m d⁻¹). The average percentage of predicted particles F_200 m increases to 50 % with ${U-Net}_{SST - SSH}^{w}$ (+30 % compared to the boxes).

To gain a deeper understanding of the limitations of the ${U-Net}_{SST - SSH}^{w}$ score, we have increased the dynamical information in the region provided by the inputs using ${U-Net}_{5V - 1L}^{w}$ and ${U-Net}_{5V - 4L}^{w}$ . Unlike ${U-Net}_{SST - SSH}^{w}$ , ${U-Net}_{5V - 1L}^{w}$ includes explicit surface velocity and vorticity information at a fine resolution of 8 km. This additional information has led to a ∼5 %–10 % increase in valid predictions. As explained in Fig. 3, part of this improvement is due to the finer resolution. Thus, the addition of velocity and vorticity does not seem to significantly improve the score prediction at this resolution. We assume that ${U-Net}_{SST - SSH}^{w}$ can correctly extract the relevant features of the geostrophic velocities directly from the SSH.

The main limitation of the predictions seems to be the lack of information at deep levels. Indeed, ${U-Net}_{5V - 4L}^{w}$ outperforms all other U-Net models with a range of 78 %–99 % accuracy in predicting particle path dynamics (F_200 m=60 %–80 %), suggesting that deep dynamics are a crucial factor to consider in reconstructing the particle path. In the following section, the role of deep dynamics in particles' pathways is elucidated, showing a direct correlation between the prediction score and the intensity of deep currents.

3.3 Impact of deep dynamics

The aim of this investigation is to examine the role of the deep dynamics on particle pathways and their potential impact on the $D_{simu}^{w}$ predictions. Based on the model average kinetic energy profile ( $KE = \frac{1}{2} (u^{2} + v^{2})$ ) in the region (Fig. B1), it seems that the dynamics below 1000 m could be considered to be negligible compared to those in the mesopelagic zone (z<1000 m). Despite the low intensity of the deep currents, the particle pathways are still significantly influenced by deep structures such as deep jets or mesoscale eddies, which can originate from the surface or at depth through local bathymetric interactions (Smilenova et al., 2020). The deep currents induced by the continental shelf clearly affect the movement of the particles as soon as they are released, as shown in Fig. 5a–d by the PDF of particles (w=100 m d⁻¹) when they reach the mesopelagic layer (1000 m depth). In this example, the particles are already ∼100 km away from the source before entering the area driven by surface conditions. Moreover, based on a comparison between the two KE maps (b and d), the local currents around the PAP-SO station (see inside the black box) in the upper layer (0–1000 m) are typically not well correlated with the dynamics at depth (1000–3000 m). This leads to an incorrect prediction area (red contours vs. black contours). However, some surface eddies can have very deep coherence. If they are close to the PAP-SO station, they can lead to a coherent connection as they ensure a better correlation between surface and deep dynamics (Fig. 5f and h). They also tend to trap the particles together. These effects seem to reinforce the predictive power, as exemplified in Fig. 5e.

https://bg.copernicus.org/articles/22/4309/2025/bg-22-4309-2025-f05

Figure 5Example of predictions from the numerical simulation between 29 February–30 March 2004 (a–d) and 6 September–6 October 2004 (e–h). (a, b, e, f) Relative vorticity ( $ζ / f$ ), currents, and kinetic energy (KE) vertically averaged between 1000–3000 m and temporally averaged during the particle crossing. The “true” particle catchment area at 1000 m is indicated by the black contours, which contain 25 % and 75 % of the PDF, respectively. (c, d, g, h) Relative vorticity, currents, and kinetic energy vertically averaged between 0–1000 m and temporally averaged during the particle crossing. The “true” particle catchment area at 200 m is shown by the black contours, and the associated ${U-Net}_{SST - SSH}^{100}$ prediction is shown by the red contours.

To corroborate these observations, we analyse the link between the score of the ${U-Net}_{SST - SSH}^{100}$ model and (i) the shape of the “true” particle catchment area (i.e. the catchment area from Lagrangian experiments) when reaching the base of the mesopelagic zone (z=1000 m) (Fig. 6a) and (ii) the local deep dynamics (KE and ζ below 1000 m) (Fig. 6b).

https://bg.copernicus.org/articles/22/4309/2025/bg-22-4309-2025-f06

Figure 6Averaged bin statistics of the ${U-Net}_{SST - SSH}^{100}$ prediction score BL_200 m in the (a) particle PDF_1000 m mass centre and entropy and (b) kinetic energy (KE) and relative vorticity ( $ζ / f$ ) averaged between 1000–3000 m and in an 80 km box centred on the sediment trap location and temporally during the particle crossing.

Download

For (i), we computed the averaged bin statistics of the prediction score BL_200 m conditioned on the mass centre and the entropy of the true PDFs at 1000 m. The mass centre is defined as the average distance of the particles from the ST location. The entropy, defined as −Σp_ilog (p_i), where p_i is the PDF value at point i, describes the spread of the PDF. A high entropy is associated with a large particle spread over the domain Picard et al. (2024). This demonstrates that the final prediction score BL_200 m is directly related to the PDF state at deep depths. It can be observed that valid scores (BL_200 m<0.3) are associated with a low value of mass centre and a high value of entropy. This suggests that particles whose centre of mass remains close to the sediment trap location, even when dispersed over a large area at depth, are competently predicted. Conversely, the particles significantly affected by the deep currents reaching the mesopelagic zone too far away from the PAP-SO ST (mass centre >75–100 km) are unlikely to be competently predicted.

For (ii), the averaged bin statistics of the prediction score BL_200 m were conditioned with the local KE and relative vorticity averaged vertically between 1000–3000 m, horizontally in an 80 km box (black box in Fig. 5), and temporally during the crossing of the particle in the layer. A clear indication of a favourable prediction score (BL_200 m<0.3) can be observed when either the horizontal velocity is weak (i.e. low KE) or the absolute value of the vorticity is high (i.e. presence of a mesoscale eddy). These results corroborate the finding that the deep currents are the primary driver of the final prediction score.

4 Connection between surface- and deep-flux observations at the PAP-SO station

This section presents the application of U-Net_SST−SSH with real satellite-derived observations around the PAP-SO station and examines whether the predicted catchment areas improve the correlation between deep sediment trap fluxes and the surface chlorophyll-a concentration.

4.1 Predictions with satellite data

We focus on a 20-year period from 1 January 2000 to 1 June 2019. The data used in this study were obtained from the Global Ocean Gridded L4 Sea Surface Heights And Derived Variables Reprocessed from the Copernicus Climate Service, with a resolution of 0.25°×0.25° (https://doi.org/10.48670/moi-00148, CMEMS, 2024 a), and the Global Ocean OSTIA Sea Surface Temperature and Sea Ice Reprocessed with a resolution of 0.05°×0.05° (https://doi.org/10.48670/moi-00168, CMEMS, 2024 b). For SST and SSH, we sampled a daily dataset once every 10 d and interpolated the maps over the original CROCO grid in an 800 km box centred at the PAP-SO station using a bicubic interpolation method. To ensure coherence between the satellite dataset and the simulation dataset used to train ${U-Net}_{SST - SSH}^{w}$ , we compared the SSH and SST distributions between the two datasets, and no significant differences were observed (Fig. A1 in the Appendix). The satellite-derived SSH and SST datasets are used as inputs to generate predictions with ${U-Net}_{SST - SSH}^{w}$ . Over the 20-year period (2000–2019), a total of 815 predictions were generated for each sinking velocity w, with one PDF prediction generated every 10 d. We denote the resulting dataset of predicted PDFs as $D_{sat}^{w}$ . To ensure that the predictions produced with satellites are consistent with the predictions observed with the simulation data in Sect. 3, we compare the respective shape characteristics in $D_{sat}^{w}$ and $D_{simu}^{w}$ (mass centre and entropy distribution, Fig. A2). No significant differences were found, providing further confidence in the predictions made with real satellite-derived data.

Figure 7 shows examples of catchment area predictions from $D_{sat}^{100}$ between June–October 2016. The PDFs are associated with the corresponding chlorophyll-a images as background and the geostrophic sea surface velocities (averaged over the period). The surface chlorophyll-a images are derived from Global Ocean Colour Plankton and Reflectances MY L3 daily observations at 4 km resolution (https://doi.org/10.48670/moi-00282, CMEMS, 2024 c). The PDFs from $D_{sat}^{w}$ are associated with a date representing the mean time of particle arrival at the surface. The images illustrate a discernible coherent time continuity between the $D_{sat}^{100}$ catchment area locations. The $D_{sat}^{100}$ PDFs are often outside the box_200 m and show narrower locations that can change rapidly, usually in less than a month.

https://bg.copernicus.org/articles/22/4309/2025/bg-22-4309-2025-f07

Figure 7Visual results of the $D_{sat}^{w}$ predictions (predictions with ${U-Net}_{SST - SSH}^{100}$ based on real satellite data) represented by the red contours (25 % and 75 % of the PDF). The plots show the evolution of the PDF from June to October 2016. The 200 km box is with dashed black lines. Also shown are the corresponding chlorophyll-a images from Atlantic Ocean Colour Global Ocean Colour Plankton and Reflectances MY L3 daily observations at 4 km resolution (OCEANCOLOUR GLO BGC L3 MY 009 107) (averaged over a 10 d window). The black arrows represent the geostrophic current from Global Ocean Gridded L4 Sea Surface Heights And Derived Variables Reprocessed from the Copernicus Climate Service. White areas in the chlorophyll-a data are due to cloud cover.

4.2 Particle flux data at the PAP-SO station

All particle flux data used in this study are from PAP-SO STs (Lampitt and Pebody, 2023) deployed between 3000–3200 m, which is approximately 1800 m above the seabed (see Lampitt et al., 2010, for a detailed methodology). The collection period varies between 7–42 d, depending on the time of the year and the expected fluxes. Fluxes are integrated over the collection period and are expressed in $mg m^{- 2} d^{- 1}$ . They are further separated into different variables: dry weight, particulate organic carbon (POC), and particulate inorganic carbon (PIC). Dry weight is the dry mass of the material collected in the sediment trap, POC is the organic carbon retained on a 0.7 µm GF/F filter after acidification, and PIC content was calculated as the difference between total carbon and POC content (Lampitt et al., 2023).

Figure 8 shows the 20-year time series of fluxes measured at PAP-SO ST, with chlorophyll-a concentration being time-averaged for each 10 d period and spatially averaged over a 200 km box centred at the PAP. Irrespective of the flux type, a clear signature is observed during the spring bloom for almost every year. This is characterised by a peak in chlorophyll-a concentration, which is followed later by a peak in deep-ocean carbon fluxes. The time lag between the chlorophyll a and the carbon fluxes depends mainly on the time it takes for particles to travel from the euphotic layer to the ST (Stange et al., 2017). Due to the large range of sinking velocities w, this time lag δ_t can vary significantly from days to months.

https://bg.copernicus.org/articles/22/4309/2025/bg-22-4309-2025-f08

Figure 8Time series of carbon fluxes (dry weight, particulate organic carbon (POC), particulate inorganic carbon (PIC)) measured at the PAP-SO 3000 m sediment trap between 2000–2019. The green background is the chlorophyll a from Atlantic Ocean Colour Global Ocean Colour Plankton and Reflectances MY L3 daily observations (OCEANCOLOUR GLO BGC L3 MY 009 107). The time series are temporally averaged over a 10 d period and spatially averaged in a 200 km box around the PAP. The white area represents the data used for the cross-correlation calculation.

Download

4.3 Assessing connections between the surface ocean properties and deep-ocean particle fluxes

To assess the link between the surface ocean and ST carbon fluxes, we apply a methodology similar to that introduced by Frigstad et al. (2015). This strategy is based on the cross-correlation (CC) between the particle fluxes measured at PAP-SO and the surface net primary production (NPP) averaged over the catchment area. The CC score obtained with the predicted catchment area $D_{sat}^{w}$ can be compared with the CC references (box_100 km and box_200 km) to confirm – or not – an improved relationship between sea surface tracers and deep measurements. In this study, we have chosen to use chlorophyll-a concentration derived from ocean colour images instead of NPP to work directly with satellite-derived observations due to the large variability in derived NPP products, which depends on the choice of the algorithm used (Saba et al., 2011).

The detailed methodology used to compute the CC is described in Appendix B. In summary, the CC is calculated by determining the correlation coefficient between PAP-SO fluxes (dry weight, POC, and PIC) and the surface chlorophyll-a concentration within the catchment areas. We associate each particle flux measurement of PAP-SO ST taken at a given time t with the averaged chlorophyll-a concentration in the catchment area depending on the time lag δ_t. We compute the CC with three types of catchment area, which are box_200 km and box_100 km (baseline reference) and predictions from $D_{sat}^{w}$ . The sinking velocity considered for the prediction $D_{sat}^{w}$ depends on δ_t as defined in Table 1 to account for the variability of the duration of the particle pathways with respect to the sinking velocity. For example, for a time lag of less than 12 d, i.e. δ_t<12 d, we consider the predictions with the largest sinking velocity w=300 m d⁻¹ and use the catchment area predictions provided by the $D_{sat}^{300}$ dataset.

Table 1w(δ_t) predictions as a function of time lag in days. The time lag represents the time for a particle at w velocity to travel from the euphotic layer to the ST.

Download Print Version | Download XLSX

To test the robustness of the results, the CC was also computed using random catchment area predictions from the $D_{sat}^{w}$ dataset. This random process was repeated 100 times to compute the 10th and 90th score percentiles for each δ_t, representing the range of uncertainty.

As a considerable number of ST data points were missing prior to 2009 (Fig. 8), we only compute the CC between 2009–2019 (white area in Fig. 8), which is the period where the time series is continuous and considered to be valid by Lampitt et al. (2023). A second period between 2009–2019 but excluding the years 2011 and 2013 is also examined. The years 2011 and 2013 are associated with the deep fluxes that occur before the chlorophyll-a bloom. This pronounced anomaly has been observed before, and a possible explanation is that rapid re-stratification and/or intense events such as storms isolate pre-bloom particles at depth, leading to an intense carbon export that is not associated with surface data (Giering et al., 2016). Therefore, these years should be filtered out as they are not consistent with the hypothesis of biological processes at the sea surface as the main drivers of deep-ocean particle fluxes.

4.4 Results

The cross-correlation (CC) was computed at 3 d intervals between δ_t=0 d and δ_t=110 d, which is the range in which a non-zero correlation signal can be observed (Fig. 9). For both periods, the signal generally peaks at δ_t∼30–20 d (w=100–150 m d⁻¹) for dry weight and PIC, whereas POC shows a maximum at δ_t∼70 d (w=45 m d⁻¹). The correlations are generally weak for the three particle flux variables, and the reasons for this are discussed in the next section. Overall, the CC with the three catchment areas considered has a higher score than the random catchment area zone (blue area), confirming its relevance. However, the score appears to be generally higher for ${U-Net}_{SST - SSH}^{w}$ predictions, particularly for time lag values of $15 < δ_{t} < 50$ , which is associated with the particle velocities considered in this study. Note that, for δ_t>50, we choose to continue using ${U-Net}_{sat}^{80}$ . However, the associated particle sinking velocity should be slower than the values considered here (w≤80 m d⁻¹). Consequently, the ${U-Net}_{sat}^{80}$ may not be optimally suited to this context, which may partly explain why the correlation improvement is less pronounced here compared to box_100 km and box_200 km. The period without 2011 and 2013 leads to a higher global correlation score for all variables. Interestingly, a significantly higher score with ${U-Net}_{SST - SSH}^{w}$ is observed for PIC at about δ_t=30 d (w=100 m d⁻¹), corresponding to the average particle sinking velocity observed in the region. This seems to confirm that the model improves our ability to link surface data with deep carbon fluxes, especially for PIC fluxes.

https://bg.copernicus.org/articles/22/4309/2025/bg-22-4309-2025-f09

Figure 9Cross-correlations (CCs) computed with dry weight, particulate organic carbon (POC), and particulate inorganic carbon (PIC) considering the period 2009–2019 (a) and the period 2009–2019 excluding the years 2011 and 2013 (b). CC is computed considering catchment areas from box_200 km (green), box_100 km (black), and $D_{sat}^{w}$ (red). The $D_{sat}^{w}$ catchment areas depend on the time lag δ_t as delimited by the dashed lines. The blue area represents the zone between the 90–10th-percentile CC score computed with 100 random catchment areas from $D_{sat}^{w}$ .

Download

5 Discussion

5.1 Comparison with previous studies

There are still significant gaps in the understanding of the PAP-SO ST fluxes, and no discernible link with the surface ocean data (i.e. NPP) has been established (Lampitt et al., 2023). A crucial missing piece of information that may limit such a link is the source location of the particles, which can vary rapidly and be located hundreds of kilometres from the trap, depending on the local surface mesoscale dynamics (Wang et al., 2022 a). Conventionally, this dynamic effect has been addressed by considering a fixed zone of influence, typically represented by a 100 or 200 km box surrounding the sediment trap (Lampitt et al., 2023). However, this approach is limited in its ability to handle the spatial and temporal variability of the source area, which can vary on a weekly basis due to local mesoscale dynamics. The objective of this study is to examine the potential of using machine learning and surface ocean mesoscale dynamics data to establish a more robust relationship between the deep-ocean carbon fluxes from the PAP-SO ST and surface ocean dynamics. This approach has the capacity to make effective predictions, thereby improving the source area location compared to a simple box. The strategy is based on a machine learning framework described in Picard et al. (2024), where convolutional neural networks were trained with a series of Lagrangian experiments in a numerical simulation to predict the catchment area of a PAP-SO ST. This approach suggested effective predictions using only surface data. Consequently, we developed an extended version of the network, called ${U-Net}_{SST - SSH}^{w}$ , to identify catchment areas at PAP-SO with remote sensing observations. The cross-correlation methodology, based on Frigstad et al. (2015), was used to determine the relationship between surface and deep fluxes. Despite notable differences in methodology compared to the Frigstad et al. (2015) study, which obtained catchment areas by using particle backtracking in a reanalysis model with a constant sinking rate of w=100 m d⁻¹ and computed the cross-correlation score with the NPP during the period 2006–2016, we found some coherence with our results. Indeed, the observation of a correlation peak for dry weight and PIC at $δ_{t} \sim - 20 / 30$ d is in line with Frigstad et al. (2015), who also identified a maximum correlation at PAP-SO for dry weight at δ_t=1 month. Furthermore, the POC correlation peak occurred at a greater time lag (δ_t=70 d), which is also supported by Frigstad et al. (2015), who observed a δ_t=2–3 months. Nevertheless, while the results of this study demonstrate the advantages of ${U-Net}_{SST - SSH}^{w}$ , the correlation signal remains weak (R²<0.3). This can be partly explained by the fact that not all biological surface drivers have been fully captured by the methodology employed. Additionally, the ${U-Net}_{SST - SSH}^{w}$ prediction presents limitations due to the absence of information at depth.

5.2 Other biological surface drivers of deep-ocean particle fluxes

A notable constraint of the study lies in the simplified representation of the organic particles within the numerical simulation. As mentioned in Picard et al. (2024), the size of the particles and their sinking rate vary during their descent through the water column due to aggregation and/or disaggregation, grazing, and remineralisation by bacterial activity (Alldredge and Gotschalk, 1988; Berelson, 2001; Fischer and Karakaå”u, 2009; Villa-Alfageme et al., 2016). These processes have not been taken into account in the presented Lagrangian experiments, and it is clear that they must be considered in future experiments. One potential approach to achieve this would be to use a Lagrangian framework that incorporates the parameters of particle biological interactions, as proposed by Jokulsdottir and Archer (2016).

Another major limitation of this study, particularly with respect to the cross-correlation method, is the simplified assumption that the sinking particles captured by the STs are directly derived from the chlorophyll a observed at the surface. First, sinking particles do not systematically originate from chlorophyll-a concentration footprints, a proxy for phytoplankton biomass. According to Nowicki (2022) and Siegel et al. (2023), zooplankton contributes a significant fraction of the sinking export in the region (>50 %). In particular Lampitt et al. (2009, 2023) also propose that deep carbon sequestration at the PAP-SO site could be controlled by Rhizaria, which includes two main classes, namely Radiolaria (mixotrophs) and Foraminifera (heterotrophs). Following the occurrence of phytoplankton blooms, zooplankton converts phytoplankton biomass and detritus into faecal pellets, which facilitate the export and rapid sinking of POC (Steinberg and Landry, 2017). However, zooplankton dynamics were not explicitly addressed in this study. The CC methodology focuses on the linkage between surface chlorophyll-a concentration and deep-ocean particle fluxes and does not account for the contribution of the zooplankton-mediated particle transformation of deep-ocean particle flux (Briggs et al., 2020). Zooplankton dynamics likely introduce an additional time lag into carbon export, which may account for the delayed correlation peak observed for POC. Conversely, PIC may be more directly driven by the sinking of phytoplankton-derived calcite incorporated into aggregates or zooplankton-derived calcium carbonate shells which can sink rapidly (up to 700 m d⁻¹) from the surface and may explain why the $D_{sat}^{w}$ CC score is the most optimal with this flux (Schmidt et al., 2014). To more accurately explain the drivers of POC pulses to the deep ocean, it would be beneficial in the future to have a more comprehensive representation of zooplankton dynamics in the upper ocean. While being challenging, this topic has been addressed by recent studies in the California Current System, where the zooplankton growth evolution and their surface 2D advection have been accurately depicted (Messié and Chavez, 2017; Messié et al., 2022).

A further limitation of focusing only on chlorophyll-a concentration is that it represents the production of phytoplankton organic matter without any species information. However, numerous studies have indicated that the carbon export efficiency is linked to particle characteristics such as the size, density, and sinking velocity, which are primarily determined by phytoplankton communities (Henson et al., 2012, 2015). In the future, it would be necessary to refine our analysis by taking into account the local plankton communities observed in the catchment areas to include further information such as the sinking rate and the export ratio. For instance, the use of OC-CCI micro-, nano-, and/or pico-phytoplankton data (Copernicus Marine Service, daily, 4 km resolution) could facilitate a more comprehensive assessment of the impact of community composition on fluxes, thereby improving our interpretation of the ST data. Moreover, pigment signatures (anomalies in the sea colour signal) are now beginning to be used to map the distributions of dominant phytoplankton groups (Alvain et al., 2005, 2006; Cetinić et al., 2024). More recently, machine learning products used a data-driven approach to extrapolate surface plankton communities (El Hourany et al., 2019) and biological properties (Sauzède et al., 2017) in the water column from surface conditions and in situ profiles (BioGeoChemical-Argo; Claustre et al., 2020). Products now available globally and at a high resolution (e.g. 8 d product and 4 km for El Hourany et al., 2019, products) may support the expansion of the variables considered in future work.

A final limitation lies in the limits of the satellite product itself, which only provides an estimate of phytoplankton chlorophyll-a concentration to a maximum depth of 10 m (Wang et al., 2022 b). However, the deep chlorophyll-a maximum (DCM) can differ significantly from the surface state, particularly in oligotrophic conditions with a shallow mixed layer, where the DCM is typically observed down to a maximum depth of 200 m (Mignot et al., 2014). However, this can also be a challenge during the pre-bloom phase. The rapid deepening of the mixed-layer depth, followed by a rapid re-stratification (typically during a storm event), can result in the isolation of a significant amount of carbon from the surface (Giering et al., 2016), a phenomenon known as the mixed-layer pump (Dall'Olmo et al., 2016). This phenomenon may explain the anomalies observed in 2011 and 2013, where the peak of ST fluxes occurred before the onset of the chlorophyll-a bloom, resulting in a disruption in the global CC score. Further research is therefore required to gain a deeper understanding of the impact of these mechanisms. Emerging technologies, especially BGC-Argo in situ observations and machine-learning-based products that can be used to estimate the carbon vertical distribution of organic carbon from satellites (Sauzède et al., 2016), are likely to be of key interest. Some of these products (i.e. 3D fields of particulate organic carbon, particulate backscattering coefficient and chlorophyll-a concentration) are already available (https://doi.org/10.48670/moi-00046, CMEMS, 2024 d). They could also provide a more comprehensive assessment of the missing NPP obtained from a surface-only perspective.

In the future, it would be beneficial to extend the catchment area reconstruction to other long-term ST observation sites which cover different regions and systems in the global ocean, e.g. BATS (Bates and Johnson, 2023), Station M (Smith et al., 2018), DYFAMED (Miquel et al., 2011), and ALOHA (Howe et al., 2011). Hence, the integration of the aforementioned processes into the proposed machine learning methodology seems to be a relevant research avenue to generalise beyond station-specific characteristics and to provide a more comprehensive record of deep-ocean carbon fluxes.

5.3 The importance of representing deep-ocean dynamics

It is important to consider this analysis in the context of the previous study by Picard et al. (2024), which considered PAP-SO ST at a depth of 1000 m, with a particle sinking velocity of w=50 m d⁻¹. Despite the relatively low particle velocities, the scores of U-Net_5V−1L obtained in the aforementioned study were considerably higher (85 % of valid predictions, i.e. BL_200 m>0.3) than those observed in the present study. It was originally hypothesised that the weak deep-ocean dynamics at the PAP would result in the particle sinking velocity being the primary factor influencing the prediction score. However, we can hypothesise that the use of a comparable sinking speed of 50 m d⁻¹ in this study would result in less than 50 % of valid predictions (considering the fact that the score decreases with lower w and that, at the slowest sinking rate of 80 m d⁻¹, we only reached ∼50 % of valid predictions with U-Net_5V−1L). Hence, this study seems to indicate that the vertical distance from the upper ocean and the resulting influence of local deep dynamics may be more important than initially hypothesised. Indeed, our results show that the prediction score is significantly driven by the local deep dynamics below 1000 m (Fig. 6). As noted by Bolton and Zanna (2019), machine learning faces challenges in reconstructing currents below a certain depth, even in the absence of topography, largely due to the influence of bottom drag. These difficulties are exacerbated when topography is present as geostrophic currents interacting with the seafloor generate strong bottom-intensified currents that can extend thousands of metres into the water column without leaving a detectable surface signature (e.g. Carli et al., 2024). In addition, submesoscale coherent vortices generated on nearby seamounts, ridges, and continental slopes (Smilenova et al., 2020) can generate anomalous mid-water column currents, contributing to the complexity of current structures that cannot be captured without local measurements. As a result, the primary limitations of ${U-Net}_{SST - SSH}^{w}$ can be attributed to the lack of comprehensive data on deep currents: the comparison between U-Net_5V−1L and U-Net_5V−4L outlines a potential F_200 m score increase of $\sim + 20$ % with the addition of information at depth (Fig. 4).

Hence, the predictive capabilities of ${U-Net}_{SST - SSH}^{w}$ could be improved by incorporating in situ observational data into the inputs. To achieve this, data on deep currents will need to be provided, for example, by incorporating data from current meters deployed at the PAP-SO ST mooring. An alternative approach would be to focus on the specific sampling period of the recent PAP observation campaigns, during which in situ drifting-sediment traps were released into the mesopelagic (i.e. during the APERO campaign, where 10 drifters were released between the surface and 1000 m for 5 d; the data are presented in Baker et al., 2020). However, the limited spatial resolution of the data in the region may prove to be insufficient to achieve the desired improvement in prediction score. It would be beneficial to conduct a prior study to evaluate the sensitivity score with deep data to determine if such data could improve ${U-Net}_{SST - SSH}^{w}$ . If this approach proves to be ineffective, an alternative idea would be to consider deploying sediment traps at shallower water depths but deeper than 1000 m. Previously, STs have been deployed at the PAP-SO site at 1000 m, but the measurements are more susceptible to under-collection due to hydrodynamic biases associated with conical STs, as highlighted in previous studies (Buesseler et al., 2007; Baker et al., 2020). It is also possible to consider the deployment of sediment traps in a region where the deep dynamics are even weaker and unaffected by the nearby topography, which is typically the source of deep eddies and instabilities (Smilenova et al., 2020). A numerical simulation such as the one used here can be used to identify the weakest dynamical regions, where particle pathways below the mesopelagic layer are unlikely to be affected. Nevertheless, 3D numerical simulation remains one of the most effective methods for studying deep-ocean dynamics, and further efforts are required to validate the accuracy of simulations of deep-ocean currents. In addition, given that SSH is the main driver of the network score (see Fig. 3), we hypothesise that the network relies predominantly on geostrophic currents to perform its prediction. Consequently, it would be worthwhile to compare the efficiency of the model in regions with varying degrees of geostrophic-current dominance.

Finally, questions remain about the uncertainties associated with the Lagrangian method. It is clear that the uncertainty associated with the Lagrangian method has a direct impact on the predictions since the network is trained directly with the backtracked particles. Although sensitivity tests have been carried out (changing the number of particles and the size of the released patch) to ensure that the particle sources are not affected, some diffusion processes are not represented in the numerical simulation and, consequently, in the propagation of the particles. To evaluate potential biases, it would be necessary in the future to adopt a stochastic approach (Mínguez et al., 2012), where random noise is introduced into the particle trajectories to account for subgrid-scale diffusion processes. These processes have the potential to influence the results of the Lagrangian analysis (see Appendix D for an example). Consequently, the diffusion parameterisation should be carefully defined, taking into account local dynamics. This approach would facilitate the establishment of a confidence interval for the source areas.

6 Conclusions

This study presents a novel machine learning tool, named U-Net_SST−SSH, which is capable of predicting the catchment area of particles trapped at the PAP-SO station ST moored at 3000 m depth, based solely on remote sensing data, namely SST and SSH. The study considers five sinking velocities, ranging from 80 to 300 m d⁻¹. The results of our method are compared with the direct use of a 100–200 km box around the trap location, representing the conventional approach of using catchment area. The results show that the prediction score increases with w, and, while the 100–200 km boxes predict only 20 %–30 % of the particle catchment area (w=80–300 m d⁻¹), the U-Net_SST−SSH predictions enhance this score to 40 %–60 %. We applied U-Net_SST−SSH to real satellite observations at PAP-SO, resulting in the generation of a 20-year catchment area dataset available at a 10 d resolution. The dataset demonstrated a stronger correlation – and, therefore, connection – between the deep-ocean particle fluxes measured at PAP-SO and surface chlorophyll-a concentration compared to the traditional catchment area method. The presence of deep-ocean energetic dynamics that are uncorrelated with the surface appears to be the main reason for the invalid predictions. Future improvements to the U-Net_SST−SSH method would entail a more comprehensive consideration of these deep currents. Ultimately, the improved identification of the surface catchment area of particles collected in deep-ocean sediment traps would facilitate the identification of the surface drivers of deep-ocean carbon sequestration, thereby improving our understanding of the biological carbon pump.

Appendix A: Statistical comparison between predictions with satellite and simulation database

https://bg.copernicus.org/articles/22/4309/2025/bg-22-4309-2025-f10

Figure A1Comparison of U-Net_SST−SSH inputs (SST and SSH distribution) from the satellite data (in black) and from the training dataset from the CROCO numerical simulation (in red).

Download

https://bg.copernicus.org/articles/22/4309/2025/bg-22-4309-2025-f11

Figure A2Statistical comparison of the catchment area PDF's mass centre and entropy between predictions with numerical simulation inputs $D_{simu}^{w}$ (red) and predictions with satellite inputs $D_{sat}^{w}$ (blue) for different vertical sinking velocities w. The boxplot represents the first and the third quartiles.

Download

Appendix B: Dynamics profile at the PAP-SO station

https://bg.copernicus.org/articles/22/4309/2025/bg-22-4309-2025-f12

Figure B1Profiles of the average kinetic energy $\overline{KE}$ , absolute vertical velocities $\overline{| w |}$ , and vorticity standard deviation (SD) (ζ) between the surface and 3000 m depth. The profiles are spatially averaged in an 80 km box around the PAP-SO station and temporally averaged during the 8 years in the POLGYR numerical simulation.

Download

Appendix C: Methodology for the cross-correlation calculation

We associate each measurement taken at the PAP-SO ST at a middle time t (in days) and between t_start and t_end with a corresponding surface chlorophyll-a product depending on (i) the collection period $cp = t_{end} - t_{start}$ and (ii) a time lag δ_t which represents the time of the particle's travelling from the euphotic zone to the ST depth. Since this travelling time depends on particle sinking speed, we decide to assign a time lag range corresponding to each of the velocities considered in this study (Table 1). As a result, the final PDF prediction area for the time $tl = t - δ_{t}$ , called $\overline{D_{sat}^{w (tl)} (tl)}$ , will also depend on t_start and t_end:

\begin{matrix} (C1) & \overline{D_{sat}^{w (δ_{t})} (tl)} = {〈D_{sat}^{w (δ_{t})}〉}_{{tl}_{end}}^{{tl}_{start}}, \end{matrix}

where $〈 . 〉_{{tl}_{end}}^{{tl}_{start}}$ is the average of all of the predicted catchment areas between the time ${tl}_{start} = t_{start} - δ_{t}$ and the time ${tl}_{end} = t_{end} - δ_{t}$ .

Similarly, we compute a chlorophyll-a background based on a level-3 daily product from Atlantic Ocean Colour Global Ocean Colour Plankton and Reflectances MY L3 daily observations (OCEANCOLOUR GLO BGC L3 MY 009 107). The ocean colour images have been also interpolated over the CROCO grid using linear interpolation. The associated weighted averaged surface chlorophyll-a background $\overline{Chl (tl)}$ is computed such as follows:

\begin{matrix} (C2) & \overline{Chl (tl)} = 〈 Chl 〉_{{tl}_{end} + 5}^{{tl}_{start} - 5} . \end{matrix}

To avoid important cloud coverage, particularly during short collection times, we consider 10 additional days during the averaging process (5 d before tl_start and 5 d after tl_end). Finally, we compute the average chlorophyll a inside the catchment area PDF predicted for time tl:

\begin{matrix} (C3) & \overline{{Chl}_{D} (tl)} = \overline{Chl (tl)} \times \overline{D_{sat}^{w (δ_{t})} (tl)} . \end{matrix}

The comparison is made by computing the average chlorophyll a inside the reference catchment area boxes box_100 km and box_200 km as follows:

\begin{array}{l} (C4) & \overline{{Chl}_{box200} (tl)} = \overline{Chl (tl)} \cdot {box}_{200 km}, \\ (C5) & \overline{{Chl}_{box100} (tl)} = \overline{Chl (tl)} \cdot {box}_{100 km} . \end{array}

Appendix D: The impact of the diffusion process on Lagrangian experiments

To illustrate the effects of subgrid-scale diffusion processes on particle trajectories, we have implemented a simplified Markov model (Berloff and McWilliams, 2003) of order 0. The computation of the particle trajectory x_n can thus be described as follows:

\begin{matrix} (D1) & x_{n + 1} = x_{n} + Δ t \cdot u (x_{n}, t_{n}) + R \sqrt{(2 \cdot K_{diff} \cdot Δ t)} . \end{matrix}

Here, the u function is employed to compute the advection of the particles. The final term is related to the stochastic implementation, where $R = N (0, 1)$ denotes a random number selected according to a normal distribution, K_diff represents the diffusivity coefficient, and Δt is the online step time set to 120 s. The following illustrative examples demonstrate how the catchment area can be affected by adding a constant diffusivity term. The examples present a period of unfavourable conditions in winter, characterised by chaotic flows. We focus on the catchment area PDF observed for two distinct values of K_diff=0.1 m² s⁻¹ (Fig. D1) and K_diff=1 m² s⁻¹ (Fig. D2), which correspond roughly to horizontal diffusivities associated with internal waves and submesoscale processes at scales of 0.1–10 km (Garrett, 1983; Ledwell et al., 1998; Nencioli et al., 2013). For each value of K_diff, 10 Lagrangian experiments have been conducted, and an averaged PDF of these experiments has been computed and compared with the Gaussian-filtered PDF that has been used in this study.

https://bg.copernicus.org/articles/22/4309/2025/bg-22-4309-2025-f13

Figure D1First row: four examples of PDF with K_diff=0.1 m² s⁻¹. Second row, from left to right: PDF for the same period with K_diff=0 m² s⁻¹, PDF with K_diff=0 m² s⁻¹ after Gaussian filter, average of 10 PDFs with K_diff=0.1 m² s⁻¹, and absolute error between the two previous PDFs.

Download

https://bg.copernicus.org/articles/22/4309/2025/bg-22-4309-2025-f14

Figure D2First row: four examples of PDF with K_diff=1 m² s⁻¹. Second row, from left to right: PDF for the same period with K_diff=0 m² s⁻¹, PDF with K_diff=0 m² s⁻¹ after Gaussian filter, average of 10 PDFs with K_diff = 1 m² s⁻¹, and absolute error between the two previous PDFs.

Download

As demonstrated by the example, when K_diff is set to 0.1, the high-density areas of the averaged PDF appear to be included in the Gaussian-filtered PDF (Fig. D1). However, when increasing the K_diff to 1, the averaged PDF is distributed over a larger domain, and new potential source areas outside the Gaussian-filtered PDF can be revealed (see the new particle patch in the top right of Fig. D2).

Code availability

The codes used in this study are available online at https://github.com/TheoPcrd/SPARO (last access: 7 October 2024; https://doi.org/10.5281/zenodo.13899396, Picard, 2024 b).

Data availability

The dataset of the predicted catchment area at the PAP-SO station ( $D_{sat}^{w}$ ) is available online at https://doi.org/10.17882/102535 (Picard, 2024 a).

Video supplement

A video abstract is available at https://doi.org/10.5281/zenodo.10261827 (Picard, 2023).

Author contributions

TP conducted the analysis and prepared the paper, with contributions from all of the co-authors.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

We would like to thank Corinne Pebody for her support with the access to and understanding of the PAP-SO sediment trap data, Mathieu Le Corre for providing the CROCO simulation outputs, and Monique Messié for the valuable discussions.

Financial support

Théo Picard received a PhD grant from École Normale Superieure Paris‐Saclay. This paper contributes to the APERO project funded by the National Research Agency (grant no. ANR-21-CE01-0027). Théo Picard was supported by a CLASS ECR fellowship, and Chelsey A. Baker and Richard Lampitt were funded by the CLASS project (NERC grant no. NE/R015953/1). The authors received support from the French National Agency for Research (ANR) through the project DEEPER (grant no. ANR‐19‐CE01‐0002‐01) and AI chair OceaniX (grant no. ANR-19-CHIA-0016). Simulations were performed using HPC resources from GENCI-TGCC (grant no. 2022-A0090112051) and from HPC facilities DATARMOR of “Pôle de Calcul Intensif pour la Mer” at Ifremer in Brest, France. Théo Picard was supported by a CLASS ECR fellowship, and Chelsey A. Baker and Richard Lampitt were funded by the CLASS project (NERC grant no. NE/R015953/1).

Review statement

This paper was edited by Peter Landschützer and reviewed by two anonymous referees.

References

Alldredge, A. L. and Gotschalk, C.: In situ settling behavior of marine snow, Limnol. Oceanogr., 33, 339–351 https://doi.org/10.4319/lo.1988.33.3.0339, 1988. a

Alvain, S., Moulin, C., Dandonneau, Y., and Bréon, F. M.: Remote sensing of phytoplankton groups in case 1 waters from global SeaWiFS imagery, Deep-Sea Res. Pt. I, 52, 1989–2004, https://doi.org/10.1016/j.dsr.2005.06.015, 2005. a

Alvain, S., Moulin, C., Dandonneau, Y., Loisel, H., and Bréon, F. M.: A species-dependent bio-optical model of case I waters for global ocean color processing, Deep-Sea Res. Pt. I, 53, 917–925, https://doi.org/10.1016/j.dsr.2006.01.011, 2006. a

Armstrong, R. A., Lee, C., Hedges, J. I., Honjo, S., and Wakeham, S. G.: A new, mechanistic model for organic carbon fluxes in the ocean based on the quantitative association of POC with ballast minerals, Deep-Sea Res. Pt. II, 49, 219–236, https://doi.org/10.1016/S0967-0645(01)00101-1, 2001. a

Baker, C. A., Henson, S. A., Cavan, E. L., Giering, S. L., Yool, A., Gehlen, M., Belcher, A., Riley, J. S., Smith, H. E., and Sanders, R.: Slow-sinking particulate organic carbon in the Atlantic Ocean: magnitude, flux, and potential controls, Global Biogeochem. Cy., 31, 1051–1065, https://doi.org/10.1002/2017GB005638, 2017. a

Baker, C. A., Estapa, M. L., Iversen, M., Lampitt, R., and Buesseler, K.: Are all sediment traps created equal? An intercomparison study of carbon export methodologies at the PAP-SO site, Prog. Oceanogr., 184, 102317, https://doi.org/10.1016/j.pocean.2020.102317, 2020. a, b

Baker, C. A., Martin, A. P., Yool, A., and Popova, E.: Biological carbon pump sequestration efficiency in the North Atlantic: a leaky or a long term sink?, Global Biogeochem. Cy., 36, e2021GB007286, https://doi.org/10.1029/2021GB007286, 2022. a

Bates, N. R. and Johnson, R. J.: Forty years of ocean acidification observations (1983–2023) in the Sargasso Sea at the Bermuda Atlantic Time-series Study site, Frontiers in Marine Science, 10, 1289931, https://doi.org/10.3389/fmars.2023.1289931, 2023. a

Beauchamp, M., Amar, M. M., Febvre, Q., and Fablet, R.: End-to-end learning of variational interpolation schemes for satellite-derived SSH data, in: 2021 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Brussels, Belgium, 7418–7421, https://doi.org/10.1109/IGARSS47720.2021.9554800, 2021.

Beauchamp, M., Febvre, Q., Georgenthum, H., and Fablet, R.: 4DVarNet-SSH: end-to-end learning of variational interpolation schemes for nadir and wide-swath satellite altimetry, Geosci. Model Dev., 16, 2119–2147, https://doi.org/10.5194/gmd-16-2119-2023, 2023. a

Berelson, W. M.: Particle settling rates increase with depth in the ocean, Deep-Sea Res. Pt. II, 49, 237–251, https://doi.org/10.1016/S0967-0645(01)00102-3, 2001. a

Berloff, P. S. and McWilliams, J. C.: Material transport in oceanic gyres. Part III: Randomized stochastic models, J. Phys. Oceanogr., 33, 1416–1445, https://doi.org/10.1175/1520-0485(2003)033<1416:MTIOGP>2.0.CO;2, 2003. a

Bhattacharyya, A.: On a measure of divergence between two statistical populations defined by their probability distribution, Bull. Calcutta Math. S., 35, 99–110, 1943. a

Bolton, T. and Zanna, L.: Applications of deep learning to ocean data inference and subgrid parameterization, J. Adv. Model. Earth Sy., 11, 376–399, https://doi.org/10.1029/2018MS001472, 2019. a, b

Boyd, P. W., Claustre, H., Levy, M., Siegel, D. A., and Weber, T.: Multi-faceted particle pumps drive carbon sequestration in the ocean, Nature, 568, 327–335, https://doi.org/10.1038/s41586-019-1098-2, 2019. a

Breiman, L.: Bagging predictors, Mach. Learn., 24, 123–140, https://doi.org/10.1007/bf00058655, 1996. a

Briggs, N., Dall'Olmo, G., and Claustre, H.: Major role of particle fragmentation in regulating biological sequestration of CO₂ by the oceans, Science, 367, 791–793, https://doi.org/10.1126/science.aay1790, 2020. a

Buesseler, K. O., Lamborg, C. H., Boyd, P. W., Lam, P. J., Trull, T. W., Bidigare, R. R., Bishop, J. K., Casciotti, K. L., Dehairs, F., Elskens, M., Honda, M., Karl, D. M., Siegel, D. A., Silver, M. W., Steinberg, D. K., Valdes, J., Van Mooy, B., and Wilson, S.: Revisiting carbon flux through the ocean's twilight zone, Science, 316, 567–570, https://doi.org/10.1126/science.1137959, 2007. a, b

Burd, A., Buchan, A., Church, M. J., Landry, M. R., McDonnell, A. M. P., Passow, U., Steinberg, D. K., and Benway, H. M.: Towards a transformative understanding of the oceans biological pump: Priorities for future research – Report on the NSF Biology of the Biological Pump Workshop, NSF Biology of the Biological Pump Workshop, Hyatt Place New Orleans, New Orleans, LA, 19–20 February 2016, https://doi.org/10.1575/1912/8263, 2016. a

Burd, A. B., Hansell, D. A., Steinberg, D. K., Anderson, T. R., Arístegui, J., Baltar, F., Beaupré, S. R., Buesseler, K. O., DeHairs, F., Jackson, G. A., Kadko, D. C., Koppelmann, R., Lampitt, R. S., Nagata, T., Reinthaler, T., Robinson, C., Robison, B. H., Tamburini, C., and Tanaka, T.: Assessing the apparent imbalance between geochemical and biochemical indicators of meso- and bathypelagic biological activity: What the @$#! is wrong with present calculations of carbon budgets?, Deep-Sea Res. Pt. II, 57, 1557–1571, https://doi.org/10.1016/j.dsr2.2010.02.022, 2010. a

Carli, E., Siegelman, L., Morrow, R., and Vergara, O.: Surface quasi geostrophic reconstruction of vertical velocities and vertical heat fluxes in the Southern Ocean: perspectives for SWOT, J. Geophys. Res.-Oceans, 129, e2024JC021216, https://doi.org/10.1029/2024JC021216, 2024. a

Cetinić, I., Rousseaux, C. S., Carroll, I. T., Chase, A. P., Kramer, S. J., Werdell, P. J., Siegel, D. A., Dierssen, H. M., Catlett, D., Neeley, A., Soto Ramos, I. M., Wolny, J. L., Sadoff, N., Urquhart, E., Westberry, T. K., Stramski, D., Pahlevan, N., Seegers, B. N., Sirk, E., Lange, P. K., Vandermeulen, R. A., Graff, J. R., Allen, J. G., Gaube, P., McKinna, L. I., McKibben, S. M., Binding, C. E., Calzado, V. S., and Sayers, M.: Phytoplankton composition from sPACE: requirements, opportunities, and challenges, Remote Sens. Environ., 302, 113964, https://doi.org/10.1016/j.rse.2023.113964, 2024. a

Chapman, C. and Charantonis, A. A.: Reconstruction of subsurface velocities from satellite observations using iterative self-organizing maps, IEEE Geosci. Remote S., 14, 617–620, https://doi.org/10.1109/LGRS.2017.2665603, 2017. a

Chelton, D. B., Schlax, M. G., and Samelson, R. M.: Global observations of nonlinear mesoscale eddies, Prog. Oceanogr., 91, 167–216, https://doi.org/10.1016/j.pocean.2011.01.002, 2011. a, b

Claustre, H., Johnson, K. S., and Takeshita, Y.: Observing the global ocean with Biogeochemical-Argo, Annu. Rev. Mar. Sci., 12, 23–48, https://doi.org/10.1146/annurev-marine-010419-010956, 2020. a

Cutolo, E., Pascual, A., Ruiz, S., Zarokanellos, N., and Fablet, R.: CLOINet: Ocean state reconstructions through remote-sensing, in-situ sparse observations and Deep Learning, arXiv [preprint], https://doi.org/10.48550/arXiv.2210.10767 2022. a

Dall'Olmo, G., Dingle, J., Polimene, L., Brewin, R. J., and Claustre, H.: Substantial energy input to the mesopelagic ecosystem from the seasonal mixed-layer pump, Nat. Geosci., 9, 820–823, https://doi.org/10.1038/ngeo2818, 2016. a

Deuser, W. G., Muller-Karger, F. E., and Hemleben, C.: Temporal variations of particle fluxes in the deep subtropical and tropical North Atlantic: Eulerian versus Lagrangian effects, J. Geophys. Res., 93, 6857–6862, https://doi.org/10.1029/JC093iC06p06857, 1988. a

Deuser, W. G., Muller-Karger, F. E., Evans, R. H., Brown, O. B., Esaias, W. E., and Feldman, G. C.: Surface-ocean color and deep-ocean carbon flux: how close a connection?, Deep-Sea Res., 37, 1331–1343, https://doi.org/10.1016/0198-0149(90)90046-X, 1990. a

Dever, M., Nicholson, D., Omand, M. M., and Mahadevan, A.: Size differentiated export flux in different dynamical regimes in the ocean, Global Biogeochem. Cy., 35, e2020GB006764, https://doi.org/10.1029/2020GB006764, 2021. a

Durkin, C. A., Mooy, B. A. S. V., Dyhrman, S. T., and Buesseler, K. O.: Sinking phytoplankton associated with carbon flux in the Atlantic Ocean, Limnol. Oceanogr., 61, 1172–1187, https://doi.org/10.1002/lno.10253, 2016. a

El Hourany, R., Abboud-Abi Saab, M., Faour, G., Aumont, O., Crépon, M., and Thiria, S.: Estimation of secondary phytoplankton pigments from satellite observations using Self-Organizing Maps (SOMs), J. Geophys. Res.-Oceans, 124, 1357–1378, https://doi.org/10.1029/2018JC014450, 2019. a, b

E.U. Copernicus Marine Service Information (CMEMS): Global Ocean Gridded L4 Sea Surface Heights And Derived Variables Reprocessed 1993 Ongoing, Marine Data Store (MDS) [data set], https://doi.org/10.48670/moi-00148, 2024a. a

E.U. Copernicus Marine Service Information (CMEMS): Global Ocean OSTIA Sea Surface Temperature and Sea Ice Reprocessed, Marine Data Store (MDS) [data set], https://doi.org/10.48670/moi-00168, 2024b. a

E.U. Copernicus Marine Service Information (CMEMS): Global Ocean Colour Plankton and Reflectances MY L3 daily observations, Marine Data Store (MDS) [data set], https://doi.org/10.48670/moi-00282, 2024c. a

E.U. Copernicus Marine Service Information (CMEMS): Global Ocean 3D Chlorophyll-a concentration, Particulate Backscattering coefficient and Particulate Organic Carbon, Marine Data Store (MDS) [data set], https://doi.org/10.48670/moi-00046, 2024d. a

Falk, T., Mai, D., Bensch, R., Çiçek, Ö., Abdulkadir, A., Marrakchi, Y., Böhm, A., Deubner, J., Jäckel, Z., Seiwald, K., Dovzhenko, A., Tietz, O., Dal Bosco, C., Walsh, S., Saltukoglu, D., Tay, T. L., Prinz, M., Palme, K., Simons, M., Diester, I., Brox, T., and Ronneberger, O.: U-Net: deep learning for cell counting, detection, and morphometry, Nat. Methods, 16, 67–70, https://doi.org/10.1038/s41592-018-0261-2, 2019. a

Febvre, Q., Sommer, J. L., Ubelmann, C., and Fablet, R.: Training neural mapping schemes for satellite altimetry with simulation data, arXiv [preprint], https://doi.org/10.48550/arXiv.2309.14350, 2023. a

Fischer, G. and Karakaş, G.: Sinking rates and ballast composition of particles in the Atlantic Ocean: implications for the organic carbon fluxes to the deep ocean, Biogeosciences, 6, 85–102, https://doi.org/10.5194/bg-6-85-2009, 2009. a

Frigstad, H., Henson, S. A., Hartman, S. E., Omar, A. M., Jeansson, E., Cole, H., Pebody, C., and Lampitt, R. S.: Links between surface productivity and deep ocean particle flux at the Porcupine Abyssal Plain sustained observatory, Biogeosciences, 12, 5885–5897, https://doi.org/10.5194/bg-12-5885-2015, 2015. a, b, c, d, e, f, g

Garrett, C.: On the initial streakness of a dispersing tracer in two- and three-dimensional turbulence, Dynam. Atmos. Oceans, 7, 265–277, https://doi.org/10.1016/0377-0265(83)90008-8, 1983. a

Giering, S. L., Sanders, R., Martin, A. P., Lindemann, C., Möller, K. O., Daniels, C. J., Mayor, D. J., and St. John, M. A.: High export via small particles before the onset of the North Atlantic spring bloom, J. Geophys. Res.-Oceans, 121, 6929–6945, https://doi.org/10.1002/2016JC012048, 2016. a, b

Grabowski, E., Letelier, R. M., Laws, E. A., and Karl, D. M.: Coupling carbon and energy fluxes in the North Pacific Subtropical Gyre, Nat. Commun., 10, 1895, https://doi.org/10.1038/s41467-019-09772-z, 2019. a

Guidi, L., Legendre, L., Reygondeau, G., Uitz, J., Stemmann, L., and Henson, S. A.: A new look at ocean carbon remineralization for estimating deepwater sequestration, Global Biogeochem. Cy., 29, 1044–1059, https://doi.org/10.1002/2014GB005063, 2015. a

Hartman, S. E., Bett, B. J., Durden, J. M., Henson, S. A., Iversen, M., Jeffreys, R. M., Horton, T., Lampitt, R., and Gates, A. R.: Enduring science: three decades of observing the Northeast Atlantic from the Porcupine Abyssal Plain Sustained Observatory (PAP-SO), Prog. Oceanogr., 191, 102508, https://doi.org/10.1016/j.pocean.2020.102508, 2021. a

Henson, S. A., Sanders, R., and Madsen, E.: Global patterns in efficiency of particulate organic carbon export and transfer to the deep ocean, Global Biogeochem. Cy., 26, 1–14, https://doi.org/10.1029/2011GB004099, 2012. a

Henson, S. A., Yool, A., and Sanders, R.: Global Biogeochemical Cycles carbon export: A model study, Global Biogeochem. Cy., 29, 33–45, https://doi.org/10.1002/2014GB004965. a

Henson, S. A., Laufkötter, C., Leung, S., Giering, S. L., Palevsky, H. I., and Cavan, E. L.: Uncertain response of ocean biological carbon export in a changing world, Nat. Geosci., 15, 248–254, https://doi.org/10.1038/s41561-022-00927-0, 2022. a

Honjo, S., Manganini, S. J., Krishfield, R. A., and Francois, R.: Particulate organic carbon fluxes to the ocean interior and factors controlling the biological pump: a synthesis of global sediment trap programs since 1983, Prog. Oceanogr., 76, 217–285, https://doi.org/10.1016/j.pocean.2007.11.003, 2008. a

Howe, B. M., Lukas, R., Duennebier, F., and Karl, D.: ALOHA cabled observatory installation, in: OCEANS'11 MTS/IEEE KONA, Waikoloa, HI, USA, 19–22 September 2011, https://doi.org/10.23919/OCEANS.2011.6107301, 1–11, 2011. a

Jenkins, J., Paiement, A., Ourmières, Y., Le Sommer, J., Verron, J., Ubelmann, C., and Glotin, H.: A DNN Framework for Learning Lagrangian Drift With Uncertainty, Appl. Intell., 53, 23729–23739, https://doi.org/10.1007/s10489-023-04625-1, 2023. a, b

Jenkins, J., Paiement, A., Ourmières, Y., Sommer, J. L., Verron, J., Ubelmann, C., and Glotin, H.: A DNN Framework for Learning Lagrangian Drift With Uncertainty, Appl Intell., 53, 23729–23739, https://doi.org/10.1007/s10489-023-04625-1, 2023.

Jokulsdottir, T. and Archer, D.: A stochastic, Lagrangian model of sinking biogenic aggregates in the ocean (SLAMS 1.0): model formulation, validation and sensitivity, Geosci. Model Dev., 9, 1455–1476, https://doi.org/10.5194/gmd-9-1455-2016, 2016. a

Kingma, D. P. and Ba, J. L.: Adam: A method for stochastic optimization, in: 3rd International Conference on Learning Representations, ICLR 2015 – Conference Track Proceedings, San Diego, 30 January 2017, arXiv, https://doi.org/10.48550/arXiv.1412.6980, 2015. a

Kwon, E. Y., Primeau, F., and Sarmiento, J. L.: The impact of remineralization depth on the air-sea carbon balance, Nat. Geosci., 2, 630–635, https://doi.org/10.1038/ngeo612, 2009. a, b

Lampitt, R. and Pebody, C.: Sediment Trap data from the Porcupine Abyssal Plain Sustained Observatory (PAP-SO) site on PAP3 mooring at 3000 metres April 1989–June 2019 Version 2,NERC EDS British Oceanographic Data Centre NOC [data set], https://doi.org/10.5285/06bd25d5-fcd3-0f63-e063-6c86abc0481e, 2023. a

Lampitt, R. S., Noji, T., and von Bodungen, B.: What happens to zooplankton faecal pellets? Implications for material flux, Mar. Biol., 104, 15–23, https://doi.org/10.1007/BF01313152, 1990. a

Lampitt, R. S., Bett, B. J., Kiriakoulakis, K., Popova, E. E., Ragueneau, O., Vangriesheim, A., and Wolff, G. A.: Material supply to the abyssal seafloor in the northeast Atlantic, Prog. Oceanogr., 50, 27–63, https://doi.org/10.1016/S0079-6611(01)00047-7, 2001. a

Lampitt, R. S., Salter, I., and Johns, D.: Radiolaria: major exporters of organic carbon to the deep ocean, Global Biogeochem. Cy., 23, 1–9, https://doi.org/10.1029/2008GB003221, 2009. a

Lampitt, R., Salter, I., de Cuevas, B., Hartman, S., Larkin, K., and Pebody, C.: Long-term variability of downward particle flux in the deep northeast Atlantic: causes and trends, Deep-Sea Res. Pt. II, 57, 1346–1361, https://doi.org/10.1016/j.dsr2.2010.01.011, 2010. a, b, c

Lampitt, R. S., Briggs, N., Cael, B. B., Espinola, B., Hélaouët, P., Henson, S. A., Norrbin, F., Pebody, C. A., and Smeed, D.: Deep ocean particle flux in the Northeast Atlantic over the past 30 years: carbon sequestration is controlled by ecosystem structure in the upper ocean, Front. Earth Sci., 11, 1–19, https://doi.org/10.3389/feart.2023.1176196, 2023. a, b, c, d, e, f, g, h, i

Le Corre, M., Gula, J., and Tréguier, A.-M.: Barotropic vorticity balance of the North Atlantic subpolar gyre in an eddy-resolving model, Ocean Sci., 16, 451–468, https://doi.org/10.5194/os-16-451-2020, 2020. a

Le Moigne, F. A. C.: Pathways of organic carbon downward transport by the oceanic biological carbon pump, Frontiers in Marine Science, 6, 482488, https://doi.org/10.3389/fmars.2019.00634, 2019. a, b

Lecun, Y., Bengio, Y., and Hinton, G.: Deep learning, Nature, 521, 436–444, https://doi.org/10.1038/nature14539, 2015. a

Ledwell, J. R., Watson, A. J., and Law, C. S.: Mixing of a tracer in the pycnocline, J. Geophys. Res.-Oceans, 103, 21499–21529, https://doi.org/10.1029/98JC01738, 1998. a

Lellouche, J.-M., Greiner, E., Bourdallé Badie, R., Gilles, G., Angélique, M., Marie, D., Clément, B., Mathieu, H., Olivier, L. G., Charly, R., Tony, C., Charles-Emmanuel, T., Florent, G., Giovanni, R., Mounir, B., Yann, D., and Pierre-Yves, L. T.: The Copernicus global $1 / 12$ ° oceanic and sea ice GLORYS12 reanalysis, Front. Earth Sci., 9, 1–27, https://doi.org/10.3389/feart.2021.698876, 2021. a, b

Lguensat, R., Sun, M., Fablet, R., Tandeo, P., Mason, E., and Chen, G.: EddyNet: A Deep Neural Network For Pixel-Wise Classification of Oceanic Eddies, in: IGARSS 2018–2018 IEEE International Geoscience and Remote Sensing Symposium, Valencia, Spain, 22–27 July 2018, https://doi.org/10.1109/IGARSS.2018.8518411, 1764–1767, 2018. a

Liu, G., Bracco, A., and Passow, U.: The influence of mesoscale and submesoscale circulation on sinking particles in the northern Gulf of Mexico, Elementa, 6, 36, https://doi.org/10.1525/elementa.292, 2018. a, b

Ma, W., Xiu, P., Chai, F., Ran, L., Wiesner, M. G., Xi, J., Yan, Y., and Fredj, E.: Impact of mesoscale eddies on the source funnel of sediment trap measurements in the South China Sea, Prog. Oceanogr., 194, 102566, https://doi.org/10.1016/j.pocean.2021.102566, 2021. a

Manucharyan, G. E., Siegelman, L., and Klein, P.: A deep learning approach to spatiotemporal sea surface height interpolation and estimation of deep currents in geostrophic ocean turbulence, J. Adv. Model. Earth Sy., 13, 1–17, https://doi.org/10.1029/2019MS001965, 2021. a

McDonnell, A. M., Lam, P. J., Lamborg, C. H., Buesseler, K. O., Sanders, R., Riley, J. S., Marsay, C., Smith, H. E., Sargent, E. C., Lampitt, R. S., and Bishop, J. K.: The oceanographic toolbox for the collection of sinking and suspended marine particles, Prog. Oceanogr., 133, 17–31, https://doi.org/10.1016/j.pocean.2015.01.007, 2015. a

Messié, M. and Chavez, F. P.: Nutrient supply, surface currents, and plankton dynamics predict zooplankton hotspots in coastal upwelling systems, Geophys. Res. Lett., 44, 8979–8986, https://doi.org/10.1002/2017GL074322, 2017. a

Messié, M., Sancho-Gallegos, D. A., Fiechter, J., Santora, J. A., and Chavez, F. P.: Satellite-based Lagrangian model reveals how upwelling and oceanic circulation shape krill hotspots in the California current system, Frontiers in Marine Science, 9, 1–19, https://doi.org/10.3389/fmars.2022.835813, 2022. a

Mignot, A., Claustre, H., Uitz, J., Poteau, A., D'Ortenzio, F., and Xing, X.: Understanding the seasonal dynamics of phytoplankton biomass and the deep chlorophyll maximum in oligotrophic environments: a Bio-Argo float investigation, Global Biogeochem. Cy., 28, 856–879, https://doi.org/10.1002/2013GB004781, 2014. a

Mínguez, R., Abascal, A. J., Castanedo, S., and Medina, R.: Stochastic Lagrangian trajectory model for drifting objects in the ocean, Stoch. Env. Res. Risk. A., 26, 1081–1093, https://doi.org/10.1007/s00477-011-0548-7, 2012. a

Miquel, J.-C., Martín, J., Gasser, B., Rodriguez-y Baena, A., Toubal, T., and Fowler, S. W.: Dynamics of particle flux and carbon export in the northwestern Mediterranean Sea: a two decade time-series study at the DYFAMED site, Prog. Oceanogr., 91, 461–481, https://doi.org/10.1016/j.pocean.2011.07.018, 2011. a

Nencioli, F., d'Ovidio, F., Doglioli, A. M., and Petrenko, A. A.: In situ estimates of submesoscale horizontal eddy diffusivity across an ocean front, J. Geophys. Res.-Oceans, 118, 7066–7080, https://doi.org/10.1002/2013JC009252, 2013. a

Nowicki, M.: Quantifying the carbon export and sequestration pathways of the ocean's biological carbon pump, Global Biogeochem. Cy., 3, 1–22, https://doi.org/10.1029/2021GB007083, 2022. a

Palevsky, H. I. and Nicholson, D. P.: The North Atlantic biological pump: insights from the ocean observatories initiative irminger sea array, Oceanography, 31, 42–49, https://doi.org/10.5670/oceanog.2018.108, 2018. a

Parekh, P., Dutkiewicz, S., Follows, M. J., and Ito, T.: Atmospheric carbon dioxide in a less duty world, Geophys. Res. Lett., 33, 2–5, https://doi.org/10.1029/2005GL025098, 2006. a

Passow, U. and Carlson, C. A.: The biological pump in a high CO₂ world, Mar. Ecol. Prog. Ser., 470, 249–271, https://doi.org/10.3354/meps09985, 2012. a

Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury Google, J., Chanan, G., Killeen, T., Lin, Z., Gimelshein, N., Antiga, L., Desmaison, A., Xamla, A. K., Yang, E., Devito, Z., Raison Nabla, M., Tejani, A., Chilamkurthy, S., Ai, Q., Steiner, B., Facebook, L. F., Facebook, J. B., and Chintala, S.: PyTorch: An imperative style, high-performance deep learning library, in: Advances in Neural Information Processing Systems, edited by: Wallach, H., Larochelle, H., Beygelzimer, A., d'Alché-Buc, F., Fox, E., and Garnett, R., NeurIPS, 8026–8037, arXiv, https://doi.org/10.48550/arXiv.1912.01703, 2019. a

Picard, T.: Video for learning-based prediction of the particles catchment area of PAP sediment traps, Zenodo [video], https://doi.org/10.5281/zenodo.10261827, 2023. a

Picard, T.: Catchment areas of PAP sediment traps at 3000m depth from 2000 to 2022, Seanoe [data set], https://doi.org/10.17882/102535, 2024a. a

Picard, T.: SPARO: v2.0.0, Zenodo [code], https://doi.org/10.5281/zenodo.13899396, 2024b. a

Picard, T., Gula, J., Fablet, R., Collin, J., and Mémery, L.: Predicting particle catchment areas of deep-ocean sediment traps using machine learning, Ocean Sci., 20, 1149–1165, https://doi.org/10.5194/os-20-1149-2024, 2024. a, b, c, d, e, f, g, h, i, j, k, l, m, n, o, p

Ronneberger, O., Fischer, P., and Brox, T.: U-Net: Convolutional Networks for Biomedical Image Segmentation, arXiv [preprint], https://doi.org/10.48550/arXiv.1505.04597 2015. a

Ruhl, H. A., Bahr, F. L., Henson, S. A., Hosking, W. B., Espinola, B., Kahru, M., Daniel, P., Drake, P., and Edwards, C. A.: Understanding the remote influences of ocean weather on the episodic pulses of particulate organic carbon flux, Deep-Sea Res. Pt. II, 173, 104741, https://doi.org/10.1016/j.dsr2.2020.104741, 2020. a

Saba, V. S., Friedrichs, M. A. M., Antoine, D., Armstrong, R. A., Asanuma, I., Behrenfeld, M. J., Ciotti, A. M., Dowell, M., Hoepffner, N., Hyde, K. J. W., Ishizaka, J., Kameda, T., Marra, J., Mélin, F., Morel, A., O'Reilly, J., Scardi, M., Smith Jr., W. O., Smyth, T. J., Tang, S., Uitz, J., Waters, K., and Westberry, T. K.: An evaluation of ocean color model estimates of marine primary productivity in coastal and pelagic regions across the globe, Biogeosciences, 8, 489–503, https://doi.org/10.5194/bg-8-489-2011, 2011. a

Sauzède, R., Claustre, H., Uitz, J., Jamet, C., Dall'Olmo, G., D'Ortenzio, F., Gentili, B., Poteau, A., and Schmechtig, C.: A neural network-based method for merging ocean color and Argo data to extend surface bio-optical properties to depth: retrieval of the particulate backscattering coefficient, J. Geophys. Res.-Oceans, 121, 2552–2571, https://doi.org/10.1002/2015JC011408, 2016. a

Sauzède, R., Bittig, H. C., Claustre, H., de Fommervault, O. P., Gattuso, J. P., Legendre, L., and Johnson, K. S.: Estimates of water-column nutrient concentrations and carbonate system parameters in the global ocean: a novel approach based on neural networks, Frontiers in Marine Science, 4, 1–17, https://doi.org/10.3389/fmars.2017.00128, 2017. a

Schmidt, K., De La Rocha, C. L., Gallinari, M., and Cortese, G.: Not all calcite ballast is created equal: differing effects of foraminiferan and coccolith calcite on the formation and sinking of aggregates, Biogeosciences, 11, 135–145, https://doi.org/10.5194/bg-11-135-2014, 2014. a

Shchepetkin, A. F. and McWilliams, J. C.: The regional oceanic modeling system (ROMS): a split-explicit, free-surface, topography-following-coordinate oceanic model, Ocean Model., 9, 347–404, https://doi.org/10.1016/j.ocemod.2004.08.002, 2005. a

Siegel, D. A., Granata, T. C., Michaels, A. F., and Dickey, T. D.: Mesoscale eddy diffusion, particle sinking, and the interpretation of sediment trap data, J. Geophys. Res., 95, 5305–5311, https://doi.org/10.1029/JC095iC04p05305, 1990. a

Siegel, D. A., Fields, E., and Buesseler, K. O.: A bottom-up view of the biological pump: modeling source funnels above ocean sediment traps, Deep-Sea Res. Pt. I, 55,108–127, https://doi.org/10.1016/j.dsr.2007.10.006, 2008. a

Siegel, D. A., Buesseler, K. O., Behrenfeld, M. J., Benitez-Nelson, C. R., Boss, E., Brzezinski, M. A., Burd, A., Carlson, C. A., D'Asaro, E. A., Doney, S. C., Perry, M. J., Stanley, R. H., and Steinberg, D. K.: Prediction of the export and fate of global ocean net primary production: the exports science plan, Frontiers in Marine Science, 3, 1–10, https://doi.org/10.3389/fmars.2016.00022, 2016. a

Siegel, D. A., Devries, T., Doney, S. C., and Bell, T.: Assessing the sequestration time scales of some ocean-based carbon dioxide reduction strategies, Environ. Res. Lett., 16, 10, https://doi.org/10.1088/1748-9326/ac0be0, 2021. a

Siegel, D. A., Devries, T., Cetinić, I., and Bisson, K. M.: Quantifying the ocean's biological pump and its carbon cycle impacts on global scales, Annu. Rev. Mar. Sci., 15, 329–356, https://doi.org/10.1146/annurev-marine-040722-115226, 2023. a, b

Smilenova, A., Gula, J., Le Corre, M., Houpert, L., and Reecht, Y.: A persistent deep anticyclonic vortex in the rockall trough sustained by anticyclonic vortices shed from the slope current and wintertime convection, J. Geophys. Res.-Oceans, 125, e2019JC015905, https://doi.org/10.1029/2019JC015905, 2020. a, b, c

Smith, K. L., Ruhl, H. A., Huffard, C. L., Messié, M., and Kahru, M.: Episodic organic carbon fluxes from surface ocean to abyssal depths during long-term monitoring in NE Pacific, P. Natl. Acad. Sci. USA, 115, 12235–12240, https://doi.org/10.1073/pnas.1814559115, 2018. a, b

Stange, P., Bach, L. T., Le Moigne, F. A., Taucher, J., Boxhammer, T., and Riebesell, U.: Quantifying the time lag between organic matter production and export in the surface ocean: implications for estimates of export efficiency, Geophys. Res. Lett., 44, 268–276, https://doi.org/10.1002/2016GL070875, 2017. a

Steinberg, D. K. and Landry, M. R.: Zooplankton and the ocean carbon cycle, Annu. Rev. Mar. Sci., 9, 413–444, https://doi.org/10.1146/annurev-marine-010814-015924, 2017. a

Villa-Alfageme, M., de Soto, F., Le Moigne, F. A. C., Giering, S. L. C., Sanders, R., and García-Tenorio, R.: Observations and modeling of slow-sinking particles in the twilight zone, Global Biogeochem. Cy., 28, 1327–1342, https://doi.org/10.1002/2014GB004981, 2014. a

Villa-Alfageme, M., de Soto, F. C., Ceballos, E., Giering, S. L. C., Le Moigne, F. A. C., Henson, S., Mas, J. L., and Sanders, R. J.: Geographical, seasonal, and depth variation in sinking particle speeds in the North Atlantic, Geophys. Res. Lett., 43, 8609–8616, https://doi.org/10.1002/2016GL069233, 2016. a, b, c, d

Wang, L., Gula, J., Collin, J., and Mémery, L.: Effects of mesoscale dynamics on the path of fast-sinking particles to the deep ocean: a modeling study, J. Geophys. Res.-Oceans, 127, 1–30, https://doi.org/10.1029/2022JC018799, 2022a. a, b, c, d, e, f, g

Wang, Y., He, X., Bai, Y., Li, T., Wang, D., Zhu, Q., and Gong, F.: Satellite-derived bottom depth for optically shallow waters based on hydrolight simulations, Remote Sens.-Basel, 14, 4590, https://doi.org/10.3390/rs14184590, 2022b. a

Waniek, J., Koeve, W., and Prien, R. D.: Trajectories sinking particles and the catchment areas above sediment traps in the northeast atlantic, J. Mar. Res., 58, 983–1006, https://doi.org/10.1016/S0967-0637(97)00028-9, 2000. a, b

Wekerle, C., Krumpen, T., Dinter, T., von Appen, W. J., Iversen, M. H., and Salter, I.: Properties of sediment trap catchment areas in fram strait: results from Lagrangian modeling and remote sensing, Frontiers in Marine Science, 5, 407, https://doi.org/10.3389/fmars.2018.00407, 2018. a, b

Wilson, J. D., Andrews, O., Katavouta, A., de Melo Viríssimo, F., Death, R. M., Adloff, M., Baker, C. A., Blackledge, B., Goldsworth, F. W., Kennedy-Asser, A. T., Liu, Q., Sieradzan, K. R., Vosper, E., and Ying, R.: The biological carbon pump in CMIP6 models: 21st century trends and uncertainties, P. Natl. Acad. Sci. USA, 119, 11–13, https://doi.org/10.1073/pnas.2204369119, 2022. a

Worsfold, M., Good, S., Atkinson, C., and Embury, O.: Presenting a long-term, reprocessed dataset of global sea surface temperature produced using the OSTIA system, Remote Sens., 16, 3358, https://doi.org/10.3390/rs16183358, 2024.