Alpha and beta diversity patterns of polychaete assemblages across the nodule province of the eastern Clarion-Clipperton Fracture Zone (equatorial Pacific)

Abstract. In the abyssal equatorial Pacific Ocean, most of the seafloor of the Clarion-Clipperton Fracture Zone (CCFZ), a 6 million km² polymetallic nodule province, has been preempted for future mining. In light of the large environmental footprint that mining would leave and given the diversity and the vulnerability of the abyssal fauna, the International Seabed Authority has implemented a regional management plan that includes the creation of nine Areas of Particular Environmental Interest (APEIs) located at the periphery of the CCFZ. The scientific principles for the design of the APEIs were based on the best, albeit very limited, knowledge of the area. The fauna and habitats in the APEIs are unknown, as are species' ranges and the extent of biodiversity across the CCFZ.
As part of the Joint Programming Initiative Healthy and Productive Seas and Oceans (JPI Oceans) pilot action "Ecological aspects of deep-sea mining", the SO239 cruise provided data to improve species inventories, determine species ranges, identify the drivers of beta diversity patterns and assess the representativeness of an APEI. Four exploration contract areas and an APEI (APEI no. 3) were sampled along a gradient of sea surface primary productivity that spanned a distance of 1440 km in the eastern CCFZ. Between three and eight quantitative box cores (0.25 m²; 0-10 cm) were sampled in each study area, resulting in a large collection of polychaetes that were analyzed morphologically and molecularly (cytochrome c oxidase subunit I and 16S genes).
A total of 275 polychaete morphospecies were identified. Only one morphospecies was shared among all five study areas and 49 % were singletons. The patterns in community structure and composition were mainly attributed to variations in organic carbon fluxes to the seafloor at the regional scale and nodule density at the local scale, thus supporting the main assumptions underlying the design of the APEIs. However, APEI no. 3, which is located in an oligotrophic province and separated from the CCFZ by the Clarion Fracture Zone, showed the lowest densities, the lowest diversity, and a very low and distance-independent similarity in community composition compared to the contract areas, thus questioning the representativeness and the appropriateness of APEI no. 3 to meet its purpose of diversity preservation. Among the four exploration contracts, which belong to a mesotrophic province, the distance decay of similarity provided a species turnover of 0.04 species km⁻¹, an average species range of 25 km and an extrapolated richness of up to 240 000 polychaete species in the CCFZ. By contrast, nonparametric estimators of diversity predict a regional richness of up to 498 species. Both estimates are biased by the high frequency of singletons in the dataset, which likely result from undersampling and merely reflect our level of uncertainty. The assessment of potential risks and scales of biodiversity loss due to nodule mining thus requires an appropriate inventory of species richness in the CCFZ.

Introduction
The abyssal depths, typically located between 3000 and 6000 m, are vast, covering 54 % of the Earth's surface and 75 % of the ocean floor; they generally feature low-temperature, low-current and well-oxygenated oligotrophic waters (Gage and Tyler, 1991; Smith and Demopoulos, 2003; Ramirez-Llodra et al., 2010). Only about 1 % of abyssal depths have been investigated to date: much remains to be discovered. Polymetallic nodule fields are one of the unique habitats in the abyss (Ramirez-Llodra et al., 2010; Vanreusel et al., 2016). Nodules are potato-shaped, variably sized aggregations of minerals, mainly manganese and iron but also copper, nickel, and cobalt, that are patchily distributed (Hein and Petersen, 2013; Morgan, 2000). Polymetallic nodules were discovered during the Challenger expedition in the 1870s at depths below 4000 m in the Pacific, Atlantic and Indian oceans (Murray and Renard, 1891). In the equatorial Pacific Ocean, the Clarion-Clipperton Fracture Zone (CCFZ) harbors the largest polymetallic nodule field, with nodule densities as high as 75 kg m⁻² (average 15 kg m⁻²) and possibly containing 34 billion metric tons of manganese nodules (Hein and Petersen, 2013; Morgan, 2000), which may represent a minimum sale value of USD 25 trillion. The presence of abundant metal resources has attracted the interest of industries. Established by the United Nations Convention on the Law of the Sea (UNCLOS), the International Seabed Authority (ISA) manages the deep-sea mineral resources in international waters and is in charge of protecting fauna against any harm (Articles 145, 156, UNCLOS, 1982; Lodge et al., 2014). Currently, the ISA has granted 16 nodule exploration contracts and approved nine Areas of Particular Environmental Interest (APEIs) for preservation (Lodge et al., 2014) in the CCFZ.
Among the current seabed mining technologies, the hydraulic collector seems the most effective for commercial utilization. The mining pre-prototype vehicle (4.7 × 12 m) presented by GSR is a pickup system based on four nodule collector heads (1 m wide each) using a jet water pump and suction to collect nodules down to 15 cm depth (Global Sea Mineral Resources NV, 2018). A mining operation is anticipated to directly affect over 100 km² yr⁻¹ of the seabed and create sediment plumes that can indirectly increase the footprint of mining by a factor of 2 to 5 (Oebius et al., 2001; Glover and Smith, 2003). Nodule mining will clearly have detrimental effects on the benthic ecosystem, but the severity of the impacts is difficult to predict. Long-term surveys of small-scale disturbances or mining tests have shown that the direct impacts of seafloor disturbances may last for over 30 years in the CCFZ (Jones et al., 2017). However, such small experiments hardly mimic the cumulative impacts of any single nodule mining operation that could last for 20 years (Glover and Smith, 2003; Jones et al., 2017). Beyond the unpredictable effects of full-scale mining, the extent of biodiversity and species' ranges in the CCFZ are two major unknowns that prevent the assessment of potential biodiversity loss due to nodule mining. The few biodiversity studies undertaken so far in the CCFZ have revealed a high diversity of communities of megafauna (over 130 morphospecies; Amon et al., 2016; Simon-Lledó et al., 2019), polychaetes (over 180 morphospecies; Paterson et al., 1998; Glover et al., 2002; Wilson, 2017), isopods (over 160 morphospecies; Wilson, 2017), tanaids (over 100 morphospecies; Wilson, 2017; Błażewicz et al., 2019) and nematodes (over 300 morphospecies; Miljutina et al., 2010). Overall, over 870 morphospecies are already known in the CCFZ, but almost none have been named and 90 % are likely new to science (Miljutina et al., 2010).
Therefore, new inventories of CCFZ biodiversity cannot be compared with these previous ones. To overcome this bias, DNA taxonomy is increasingly used. In the CCFZ, in two exploration contract areas separated by 1300 km, the first assessment of macrofaunal diversity based on DNA taxonomy already increased the number of known polychaetes to 233 Molecular Operational Taxonomic Units (MOTUs; Janssen et al., 2015). This study further highlighted three characteristics of abyssal biodiversity: (i) high rates of species turnover (i.e., species replacement), with only 12 % of polychaete MOTUs and 1 % of isopod MOTUs shared between the two areas; (ii) high frequencies of singletons (MOTUs known from a single DNA sequence), amounting to 60 % for polychaetes and 70 % for isopods; and (iii) cryptic diversity within polychaete and isopod morphospecies, suggesting that previous surveys have underestimated alpha and beta diversity of these two taxa.
Considering the large environmental footprint of nodule mining disturbances on the seafloor, as well as the diversity and vulnerability of the abyssal fauna, the need for marine spatial planning to preserve species, habitats and functions in the CCFZ has emerged, concomitant to a renewed interest in deep-sea mineral resources (Wedding et al., 2013). Due to the paucity of biological data in the CCFZ, the recommendations issued by Wedding et al. (2013) for the design of a network of protected areas were mainly based on nitrogen flux at 100 m depth (a proxy for trophic inputs to the seafloor), modeled nodule densities, the distribution of large seamounts and the dispersal distances of shallow water taxa. One of the main assumptions underlying the management plan is that longitudinal and latitudinal productivity-driven gradients shape the community structure and species distribution of abyssal communities. As a result, Wedding et al. (2013) divided the spatial domain of the CCFZ into 3 × 3 subregions and suggested creating one large no-mining area in each subregion. The size of the no-mining areas was defined with the aim of maintaining viable population sizes for species potentially restricted to a subregion, taking into account the inferred dispersal distances of species and of the plumes created by nodule mining (Wedding et al., 2013). Those principles were implemented in the regional management plan for the CCFZ, which resulted in the designation of nine APEIs (Lodge et al., 2014). However, most of the CCFZ had already been preempted by current exploration contracts and areas reserved for future exploration. Consequently, the APEIs were located at the periphery of the CCFZ, thus deviating from an optimal design.
The European project Managing Impacts of Deep-seA reSource exploitation (MIDAS) and the Joint Programming Initiative Healthy and Productive Seas and Oceans (JPI Oceans) pilot action "Ecological aspects of deep-sea mining" were aimed at improving the scientific basis on which to assess and manage the potential impacts arising from nodule mining. In this context, four exploration contract areas and one APEI (separated by 240 to 1440 km) were sampled along a sea surface primary productivity gradient from southeast to northwest across the eastern portion of the CCFZ nodule province (Martínez Arbizu and Haeckel, 2015). The four exploration contract areas were located within the eastern central subregion defined by Wedding et al. (2013), where an APEI did not fit. One of the nearest APEIs was thus sampled instead.
The aims of our study were (i) to test the hypotheses that support spatial conservation planning in the CCFZ, particularly the environmental drivers of alpha and beta diversity such as organic carbon fluxes to the seafloor and nodule density; (ii) to assess the representativeness of an APEI (i.e., APEI no. 3); and (iii) to improve the assessment of potential risks of biodiversity loss due to nodule mining. To tackle these issues, we focused on polychaete assemblages. Polychaetes are the dominant and most diverse group of the macrofauna; they can be quantitatively sampled and identified down to species level using a combination of morphological and molecular methods (Hessler and Jumars, 1974;Janssen et al., 2015;Wilson, 2017). Polychaetes also show a wide range of biological traits, from trophic behaviors to life history strategies, and play a major role in the functioning of benthic communities (Hutchings, 1998;Jumars et al., 2015).

Clarion-Clipperton Fracture Zone
The CCFZ is located in the equatorial Pacific Ocean between the Clarion Fracture to the north and the Clipperton Fracture to the south and between Kiribati to the west and Mexico to the east (Fig. 1). This area covers about 6 million km² and is composed of a wide variety of habitats such as abyssal hills or seamounts, as well as polymetallic nodule fields. As part of the JPI Oceans project "Ecological aspects of deep-sea mining", the EcoResponse cruise SO239 took place from 11 March to 30 April 2015 aboard the RV Sonne (Martínez Arbizu and Haeckel, 2015). Sampling during the cruise focused on four exploration contract areas in addition to APEI number 3 (APEI no. 3; Fig. 1). All five study areas had water depths between 4000 and 5000 m (Fig. 1). The four exploration areas were licensed by the ISA to the Federal Institute for Geosciences and Natural Resources of Germany (BGR); the InterOcean-Metal Joint Organization (IOM); the G-TEC Sea Mineral Resources NV (GSR); and the Institut Français de Recherche pour l'Exploitation de la Mer (Ifremer). Furthermore, the ISA administers APEI no. 3 as part of the regional environmental plan for the CCFZ. The distances separating the areas ranged from 243 km (BGR to IOM) to 1440 km (BGR to Ifremer or APEI no. 3).

Sampling strategy
The sampling strategy resulted from a combination of objectives that were unique to each area, together with the overarching aim of describing alpha and beta diversity patterns across a productivity gradient that included both contract areas for nodule exploration and an APEI (Martínez Arbizu and Haeckel, 2015). In the BGR area, two sub-areas were sampled: a Prospective Area (PA) that could be mined in the future and a Reference Area (RA) that could serve as a preservation area. In the IOM area, three sub-areas were sampled: one that had been directly disturbed by a Benthic Impact Experiment (BIE; Radziejewska, 2002), one that had been impacted by the plume and one undisturbed control area. These levels of sampling stratification are, however, beyond the scope of the present study, which focuses on variations between contract areas. After checking that there was no statistically significant difference in the abundance and richness of polychaetes between sub-areas, all samples within an area were deemed representative of that area and considered replicate samples. The level of replication within areas accordingly varied as a function of sampling stratification. The aim was to collect a minimum of five replicate samples per stratum, but due to sampling failures and time constraints this could not be systematically achieved (Table 1).
Within each area, macrofaunal samples were collected using a United States Naval Electronics Laboratory (USNEL) spade box corer of 0.25 m² (Hessler and Jumars, 1974). A total of 34 box cores were sampled, of which 30 samples were deemed quantitative (Table 1). The overlying water was siphoned and sieved using a sieve of 300 µm mesh size. The box core sample surface was photographed, and all nodules were picked up from the sediment surface, washed with cold seawater over a 300 µm mesh sieve and individually weighed. Sessile and crevice-inhabiting polychaetes, if present, remained with the nodules and were not considered in this study. According to Thiel et al. (1993), who washed and broke 26 nodules, the fraction of crevice inhabitant polychaetes has low significance and representativity (i.e., only 29 specimens belonging to six species) when compared with those living in sediments surrounding the nodules (i.e., 864 polychaetes). The upper 10 cm of each core was sliced into three layers (0-3, 3-5 and 5-10 cm) to facilitate sieving and sorting; each layer was transferred into cold seawater (4 °C) and sieved using the same mesh size. The 0-3 cm layer was immediately sieved in the cold room with cold seawater (4 °C). The sieve residues from the overlying water and nodule washing were added to the 0-3 cm layer and live sorted. All polychaete specimens were photographed, individualized and preserved in cold (−20 °C) 80 % ethanol and then kept at −20 °C (DNA-friendly). The 0-3 cm residue and 3-5 and 5-10 cm layers were fixed in formalin for 48 to 96 h, preserved in 96 % ethanol, and later sorted in the laboratory (not DNA-friendly). All layers were combined for the community analysis. In the laboratory, from each DNA-friendly polychaete specimen and from very few fragments, a small piece of tissue was dissected, fixed in cold 96 % ethanol and frozen at −20 °C for molecular studies.
DNA sequences from fragments without a head were archived in BOLD and GenBank (Bonifácio et al., 2019) but were not further used for the purpose of this paper.

DNA extraction, amplification, sequencing and alignment
The DNA of the subsampled tissues was extracted using a NucleoSpin Tissue kit (Macherey-Nagel), following the manufacturer's protocol. Approximately 450 base pairs (bp) of 16S, 700 bp of COI (cytochrome c oxidase subunit I) and 1600 bp of 18S genes were amplified using the following primers: Ann16SF and 16SbrH for 16S (Palumbi, 1996; Sjölin et al., 2005); polyLCO, polyHCO, LCO1490, and HCO2198 for COI (Folmer et al., 1994; Carr et al., 2011); and 18SA, 18SB, 620F, and 1324R for 18S (Medlin et al., 1988; Cohen et al., 1998; Nygren and Sundberg, 2003). The polymerase chain reaction (PCR) mixtures of 25 µL contained 5 µL of Green GoTaq® Flexi Buffer (final concentration of 1×), 2.5 µL of MgCl₂ solution (final concentration of 2.5 mM), 0.5 µL of PCR nucleotide mix (final concentration of 0.2 mM of each dNTP), 9.875 µL of nuclease-free water, 2.5 µL of each primer (final concentration of 1 µM), 2 µL template DNA and 0.125 U of GoTaq® G2 Flexi DNA Polymerase (Promega). The temperature profile for PCR amplification consisted of the following steps: initial denaturation at 95 °C for 240 s, 35 cycles of denaturation at 94 °C for 30 s, annealing at 52 °C for 60 s, extension at 72 °C for 75 s, and a final extension at 72 °C for 480 s. For COI, 40 cycles were run; for 18S, the extension step in each cycle lasted 180 s. PCR products, visualized after electrophoresis on 1 % agarose gel, were sent to the MacroGen Europe Laboratory in Amsterdam (the Netherlands) to obtain sequences, using the same set of primers as used for the PCR.
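As a consistency check on the reaction mixture described above, the listed volumes can be summed and the stock concentrations implied by the stated final concentrations can be derived from C1·V1 = C2·V2. The following is only a minimal sketch: it assumes two primers (forward and reverse) per reaction, and the stock concentrations it returns are inferred values that are not stated in the text.

```python
# Reagent volumes (µL) listed in the text for one 25 µL reaction.
# Assumption: "each primer" means one forward and one reverse primer.
FINAL_VOLUME_UL = 25.0
volumes_ul = {
    "buffer": 5.0,
    "MgCl2": 2.5,
    "dNTP mix": 0.5,
    "water": 9.875,
    "forward primer": 2.5,
    "reverse primer": 2.5,
    "template DNA": 2.0,
}


def implied_stock(final_conc: float, volume_ul: float,
                  final_volume_ul: float = FINAL_VOLUME_UL) -> float:
    """Stock concentration implied by C1 * V1 = C2 * V2 (same unit as final_conc)."""
    return final_conc * final_volume_ul / volume_ul


# Volume accounted for by the listed reagents, and what is left over.
listed_total_ul = sum(volumes_ul.values())
remaining_ul = FINAL_VOLUME_UL - listed_total_ul

# Inferred (not stated in the text) stock concentrations.
mgcl2_stock_mm = implied_stock(2.5, volumes_ul["MgCl2"])            # mM
dntp_stock_mm = implied_stock(0.2, volumes_ul["dNTP mix"])          # mM of each dNTP
primer_stock_um = implied_stock(1.0, volumes_ul["forward primer"])  # µM

print(listed_total_ul, remaining_ul, mgcl2_stock_mm, dntp_stock_mm, primer_stock_um)
```

Under these assumptions, the listed reagents account for 24.875 µL, which leaves 0.125 µL of the 25 µL reaction, presumably taken up by the polymerase.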
Overlapping sequence (forward and reverse) fragments were aligned into consensus sequences using Geneious Pro 8.1.7 (2005-2015; Biomatters Ltd). For COI, the sequences were translated into amino-acid alignments and checked for stop codons to avoid pseudogenes. The minimum length coverage was 207 bp for 16S, 327 bp for COI and 1615 bp for 18S.
The sequences were blasted in GenBank to check for the presence of contamination. Each set of genes was aligned separately using the following plugins: MAFFT (Katoh et al., 2002) for 16S and 18S and MUSCLE (Edgar, 2004) for COI. All sequences obtained in this study have been deposited in BOLD (http://www.boldsystems.org, last access: 12 February 2020; Ratnasingham and Hebert, 2007) or GenBank (http://www.ncbi.nlm.nih.gov/genbank/, last access: 12 February 2020).

Taxonomic identification and feeding guilds classification
Preserved specimens were examined under a Leica M125 stereomicroscope and a Nikon Eclipse E400 microscope, counted (anterior ends only) and morphologically identified using the deep-sea polychaete fauna bibliography (Fauchald, 1972, 1977; Böggemann, 2009) at the lowest taxonomic level possible (morphospecies). We separated closely related species (specimens that could not be discriminated morphologically) using the principle of phylogenetic species, whereby the genetic divergence among specimens belonging to the same species (intraspecific) is smaller than the divergence among specimens from different species (interspecific) (Hebert et al., 2003a). In the distribution of pairwise divergences among all sequences of a typical barcode dataset, a gap can be observed between intraspecific and interspecific variations. Molecular operational taxonomic units (MOTUs) were generally recognized using a similarity threshold of 97 % for COI sequences and 99 % for 16S sequences (Hebert et al., 2003a, b; Brasier et al., 2016). The similarity of sequences within species was considered when identifying morphologically similar species. As genetic data were only used to separate closely related species, the delimited taxa entities in the present study are referenced as morphospecies. Trophic guilds were determined at family level following Jumars et al. (2015).
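The threshold-based recognition of MOTUs can be illustrated with a minimal sketch. It assumes pre-aligned sequences of equal length, similarity measured as the proportion of identical positions (gaps ignored), and single-linkage grouping; none of these choices is specified in the text, and the function names and toy sequences below are hypothetical.

```python
from itertools import combinations


def similarity(a: str, b: str) -> float:
    """Proportion of identical positions between two aligned sequences (gaps skipped)."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not compared:
        return 0.0
    return sum(x == y for x, y in compared) / len(compared)


def cluster_motus(seqs: dict[str, str], threshold: float) -> list[set[str]]:
    """Single-linkage grouping: link any two sequences with similarity >= threshold."""
    parent = {name: name for name in seqs}

    def find(x: str) -> str:
        # Follow parent links to the group representative (with path compression).
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(seqs, 2):
        if similarity(seqs[a], seqs[b]) >= threshold:
            parent[find(a)] = find(b)

    groups: dict[str, set[str]] = {}
    for name in seqs:
        groups.setdefault(find(name), set()).add(name)
    return list(groups.values())


def mutate(seq: str, positions) -> str:
    """Toy helper: substitute the base at each given position with a different base."""
    chars = list(seq)
    for p in positions:
        chars[p] = "C" if chars[p] == "A" else "A"
    return "".join(chars)


# Hypothetical 100 bp toy alignment: sp_b differs from sp_a at 2 sites, sp_c at 10 sites.
base = "ACGT" * 25
toy = {"sp_a": base, "sp_b": mutate(base, [0, 50]), "sp_c": mutate(base, range(10, 20))}
clusters = cluster_motus(toy, threshold=0.97)
print(sorted(sorted(c) for c in clusters))
```

With the 97 % threshold quoted for COI, the two sequences that are 98 % similar fall into one MOTU, whereas the sequence that is 90 % similar forms its own.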

Environmental data
Environmental data were compiled from Hauquier et al. (2019) and Volz et al. (2018). Sediment samples were collected with a multi-corer or a gravity corer during the same cruise and in the same areas (see Martínez Arbizu and Haeckel, 2015 for details). The sediment characteristics studied by Hauquier et al. (2019) included a clay fraction (< 4 µm), a silt fraction (4-63 µm), total nitrogen (TN in weight %), total organic carbon (TOC in weight %) and chloroplastic pigment equivalents (CPEs in µg mL⁻¹). Nodules were weighed onboard for each box-core sample to calculate nodule density (kg m⁻²; Table 1). Particulate organic carbon flux (POC, mg C m⁻² d⁻¹) at the seafloor for our studied areas (eastern CCFZ) was provided by Volz et al. (2018). At the northeastern (NE)-Pacific-Basin scale, POC flux (mg C m⁻² d⁻¹) at the seafloor was approximated using net surface primary production provided by the ocean productivity site (Westberry et al., 2008) averaged over the years 2002 to 2018 and applying the Suess algorithm (POC at the seafloor as a function of the net primary production scaled by depth; Suess, 1980; Table 2). POC flux at the seafloor was considered a proxy for food supply to benthic communities.
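For illustration, the depth scaling can be sketched with the empirical relation commonly attributed to Suess (1980), flux(z) = NPP / (0.0238 z + 0.212), with z in metres and applicable below about 50 m. The coefficients are quoted here as an assumption, since the text does not give the exact form used, and the input values are hypothetical.

```python
def suess_poc_flux(npp: float, depth_m: float) -> float:
    """POC flux at depth, in the same unit as npp.

    Uses the relation commonly attributed to Suess (1980):
        flux(z) = npp / (0.0238 * z + 0.212)
    The coefficients are an assumption of this sketch, not taken from the text.
    """
    if depth_m < 50:
        raise ValueError("the relation is only intended for depths of about 50 m or more")
    return npp / (0.0238 * depth_m + 0.212)


# Hypothetical example: surface production of 300 mg C m-2 d-1 over a 4500 m deep seafloor.
example_flux = suess_poc_flux(300.0, 4500.0)
print(round(example_flux, 2))
```

In this hypothetical case, roughly 1 % of surface production (about 2.8 mg C m⁻² d⁻¹) reaches the seafloor, which illustrates how strongly depth attenuates the food supply to abyssal communities.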

NE-Pacific-scale polychaete community data
To put the results of our study in the larger context of the NE Pacific Basin, we compiled data from previous surveys of polychaete assemblages in the NE Pacific, including CLIMAX II sampled in 1969, DOMES A, B and C in 1977 and 1978, ECHO I in 1983, PRA in 1989, EqPac in 1992, Kaplan East in 2003, Kaplan West and Central in 2004, KR5 in 2012 in 2015 (Paterson et al., 1998; Glover et al., 2002; Wilson, 2017; Smith et al., 2008b; De Smet et al., 2017). From these studies, we compiled (when available) the mean abundance (ind. 0.25 m⁻²), total number of species, ES163 and bootstrap (Table 2).

Univariate analyses
Abundance and number of species per box core (Table 1) and averaged by area (Table 2) were used as descriptors of alpha diversity. A few cryptic or damaged specimens that could not be classified to a lower taxonomical level were included in total abundance calculations but excluded from subsequent diversity analyses. To compare diversity among the studied areas and for all samples (eastern CCFZ), rarefaction curves were computed based on the total number of individuals and the total number of box core samples (Hurlbert, 1971; Gotelli and Colwell, 2001). Based on these data the expected number of species was calculated for 12 individuals (ES12) and 163 individuals (ES163), as well as for three samples (S3). Nonparametric estimators of species richness were used to estimate the total number of species at local and regional scales. Abundance-based estimators included Chao1 and abundance-based coverage estimator (ACE; O'Hara, 2005; Chiu et al., 2014). Incidence-based estimators included Chao2 (Chao, 1984), first- and second-order Jackknife (Burnham and Overton, 1979), and bootstrap (Smith and van Belle, 1984). A Venn diagram was used to show the distribution of rare, wide and common species across the CCFZ. Univariate analyses relied on nonparametric tests. The Kruskal-Wallis rank sum test was used to test differences among areas (Hollander and Wolfe, 1973); and the Conover multiple pairs rank comparisons (adjusted p value by Holm) was used to identify the pairs showing differences (Conover and Iman, 1979; Holm, 1979). Spearman correlations were sought between biotic and abiotic variables, using data from the SO239 cruise in the CCFZ and data compiled from the literature. The latter analysis aimed to test correlations between biotic variables and POC fluxes at the regional scale.
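Two of the descriptors named above can be sketched in a few lines: the individual-based expected number of species ES(n) of Hurlbert (1971) and the bias-corrected form of the Chao1 estimator. This is only an illustrative sketch using the standard textbook formulas; the abundance vector is hypothetical and the code is not the implementation used in the study.

```python
from math import comb


def hurlbert_es(counts: list[int], n: int) -> float:
    """Expected number of species in a random draw of n individuals (Hurlbert, 1971)."""
    total_individuals = sum(counts)
    if not 0 < n <= total_individuals:
        raise ValueError("n must be between 1 and the total number of individuals")
    all_draws = comb(total_individuals, n)
    # For each species: 1 - P(species absent from the draw).
    return sum(1 - comb(total_individuals - c, n) / all_draws for c in counts if c > 0)


def chao1(counts: list[int]) -> float:
    """Bias-corrected Chao1: S_obs + F1 * (F1 - 1) / (2 * (F2 + 1))."""
    s_obs = sum(1 for c in counts if c > 0)
    f1 = sum(1 for c in counts if c == 1)  # species seen exactly once (singletons)
    f2 = sum(1 for c in counts if c == 2)  # species seen exactly twice (doubletons)
    return s_obs + f1 * (f1 - 1) / (2 * (f2 + 1))


# Hypothetical abundances of six species in one sample (21 individuals in total).
toy_counts = [10, 5, 3, 1, 1, 1]
es12 = hurlbert_es(toy_counts, 12)
chao1_estimate = chao1(toy_counts)
print(round(es12, 2), chao1_estimate)
```

Because Chao1 depends only on the numbers of singletons and doubletons, a dataset dominated by singletons pushes the estimate well above the observed richness, which is the sensitivity discussed for the regional estimates.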

Multivariate analyses
Three indices of faunal similarity were used in multivariate analyses, the Chord-Normalized Expected Species Shared (CNESS), the New Normalized Expected Species Shared (NNESS; Trueblood et al., 1994;Gallagher, 1996) and the Jaccard family indices (Baselga, 2010;Legendre, 2014). The CNESS and NNESS were computed from probabilities of species occurrence in random draws of m individuals, with low values of m giving a high weight to dominant species and high values of m giving a high weight to rare species. The best trade-off value of m is the one providing the highest Kendall correlation between the similarity matrix for m = 1 and the similarity matrix for m = m max. The value of m max was given by the total abundance of the least abundant sample considered. CNESS was the distance metric used to perform a redundancy analysis (RDA; Legendre and Legendre, 2012). The RDA is a constrained multivariate analysis that tested the influence of multiple environmental covariates on multi-specific assemblages. Species contributing significantly to the ordination were plotted out of the equilibrium circle in RDA (scaling 1). The best set of environmental variables was selected using a forward selection procedure (Borcard et al., 2011) among the environmental variables available (see Sect. 2.5): clay fraction, silt fraction, TN, TOC, CPE, nodule density and POC flux at the seafloor. Furthermore, when selected variables had more than 80 % co-correlation, they were excluded, and the selection procedure was started over again. Also, the variance inflation factor (VIF) was used to verify the possible linear dependency among variables in the RDA model.
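The two ESS-based indices can be sketched as follows, assuming the formulation usually given for Trueblood et al. (1994): for each species, the probability of appearing in a random draw of m individuals is computed per sample; NNESS is the cross-product of these probabilities normalized by the geometric mean of the within-sample products; and CNESS is the chord distance sqrt(2(1 − NNESS)). The exact formulation is an assumption of this sketch and the abundance vectors are hypothetical.

```python
from math import comb, sqrt


def draw_probs(counts: list[int], m: int) -> list[float]:
    """Probability that each species appears in a random draw of m individuals."""
    total_individuals = sum(counts)
    if not 0 < m <= total_individuals:
        raise ValueError("m must not exceed the abundance of the smallest sample")
    all_draws = comb(total_individuals, m)
    return [1 - comb(total_individuals - c, m) / all_draws for c in counts]


def nness(a: list[int], b: list[int], m: int) -> float:
    """Expected species shared, normalized so that a sample compared with itself gives 1."""
    pa, pb = draw_probs(a, m), draw_probs(b, m)
    ess_ab = sum(x * y for x, y in zip(pa, pb))
    ess_aa = sum(x * x for x in pa)
    ess_bb = sum(y * y for y in pb)
    return ess_ab / sqrt(ess_aa * ess_bb)


def cness(a: list[int], b: list[int], m: int) -> float:
    """Chord-normalized distance derived from NNESS (0 = identical, sqrt(2) = nothing shared)."""
    return sqrt(max(0.0, 2 * (1 - nness(a, b, m))))


# Hypothetical abundances of the same four species in three samples.
sample_a = [5, 3, 0, 0]
sample_b = [0, 0, 4, 4]  # shares no species with sample_a
sample_c = [4, 2, 1, 1]  # shares its dominant species with sample_a
print(nness(sample_a, sample_b, 3), round(nness(sample_a, sample_c, 3), 3))
```

Raising m increases the probability that rare species enter the draw, which is why large values of m give more weight to rare species.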
The NNESS index was used to perform a distance decay analysis in the same way as in Wilson (2017). Distance decay screens for a negative correlation between faunal similarities and geographic distances among pairs of areas. Wilson (2017) used the slope of linear regression between NNESS and distance to compute the rate of change (species km⁻¹) and the species range (kilometers per species). The rate of change is the slope of linear regression between NNESS and distance multiplied by the mean total estimated species from all areas. The species range is the inverse of the rate of change.
The Jaccard family indices were used to partition beta diversity into its three components: similarity; turnover, which is dissimilarity due to species replacement; and nestedness, which is dissimilarity due to differences in the number of species (Baselga, 2010).
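The partition can be illustrated for a single pair of sites with the Jaccard-family formulas as they are usually written: with a shared species and b and c species unique to each site, total dissimilarity is (b + c)/(a + b + c), turnover is 2 min(b, c)/(a + 2 min(b, c)), and nestedness is the difference between the two. The sketch below assumes these standard forms; the species lists are hypothetical.

```python
def jaccard_partition(site1: set, site2: set) -> dict[str, float]:
    """Split pairwise Jaccard dissimilarity into turnover and nestedness components."""
    a = len(site1 & site2)  # species shared by both sites
    b = len(site1 - site2)  # species found only at site 1
    c = len(site2 - site1)  # species found only at site 2
    if a + b + c == 0:
        raise ValueError("both sites are empty")
    dissimilarity = (b + c) / (a + b + c)
    denom = a + 2 * min(b, c)
    turnover = 2 * min(b, c) / denom if denom else 0.0
    return {
        "similarity": 1 - dissimilarity,
        "turnover": turnover,
        "nestedness": dissimilarity - turnover,
    }


# Hypothetical species lists: two shared species, two unique to site 1, one unique to site 2.
mixed = jaccard_partition({"sp1", "sp2", "sp3", "sp4"}, {"sp3", "sp4", "sp5"})
# A strictly nested pair: site 2 holds a subset of site 1, so there is no replacement.
nested = jaccard_partition({"sp1", "sp2", "sp3"}, {"sp1"})
print(mixed, nested)
```

The three returned components always sum to 1, so a high turnover share means that dissimilarity stems from species replacement and not from one assemblage being a poorer subset of another.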

Abundance and alpha diversity
During the SO239 cruise, 1233 polychaete specimens were sampled in the five study areas. Interestingly, a single large specimen identified as Bathyasychis sp. 150 was found deeper than 50 cm (bottom of the box core) and was thus not included in the analyses. The dataset has been archived in the information system PANGAEA and is available in open access.
Of the 1233 polychaetes, 1118 specimens belonging to 62 genera within 40 families were identified down to morphospecies. The 115 remaining specimens were too damaged, cryptic or doubtful to be assigned to a morphospecies and were thus not included in diversity and composition analyses. The DNA-friendly samples totaled 430 specimens, 265 of which were successfully barcoded with the COI gene, the 16S gene or both. The success rates were 17 % for COI and 60 % for 16S. The COI gene was successfully sequenced for 71 specimens totaling 45 MOTUs; for the 16S gene, 259 specimens were successfully sequenced covering 104 MOTUs; only 65 specimens were successfully sequenced using both genes and yielded 40 MOTUs. The 18S gene was sequenced for phylogenetic purposes on a restricted number of specimens. The 21 sequences of the 18S gene that have been obtained are mentioned here because they were archived concomitantly with COI and 16S sequences in GenBank and BOLD public datasets, but they are not further considered in this study.
Based on both morphological and molecular identification, a total of 275 morphospecies were recognized. The mean number of species per area tended to decrease from southeast to northwest with high variability between neighboring areas (Fig. 2b). Mean richness varied from 37 ± 10 taxa 0.25 m⁻² in BGR to 3 ± 2 taxa 0.25 m⁻² in APEI no. 3. The number of species per box core (Table 1) differed significantly among areas (Kruskal-Wallis test, p < 0.001). The pairwise comparison test (Conover-Holm) showed that the number of species per box core was (i) significantly lower at APEI no. 3 than at all other areas (p ≤ 0.01) except Ifremer, (ii) significantly lower at Ifremer (19 ± 5 taxa 0.25 m⁻²) than at BGR and GSR (35 ± 7 taxa 0.25 m⁻²) (p < 0.001), and (iii) significantly lower at IOM (25 ± 6 taxa 0.25 m⁻²) (p < 0.05) than at BGR and GSR. A total of 156 species (observed species richness, Sobs) were sampled at BGR from eight box core samples, 107 species at IOM from eight box cores, 104 species at GSR from five box cores, 73 species at Ifremer from six box cores and 9 species at APEI no. 3 from three box cores (Table 3). Species rarefaction curves, based on individuals or samples, did not reach an asymptote at the local scale (Fig. 6a, b). Individual-based rarefaction curves did not show any clear diversity patterns among study areas (Fig. 6a). Sample-based rarefaction curves followed a pattern similar to abundance (Fig. 6b). From a random draw of three box cores, BGR and GSR, with 82 and 77 species, respectively, had higher expected numbers of species than IOM and Ifremer did, with 58 and 45 species, respectively. APEI no. 3, with only 9 species, had the lowest expected number of species (Fig. 6b, Table 3). The nonparametric estimators of local diversity followed the same patterns with the highest values for BGR and the lowest for APEI no. 3 (Table 3).

Beta and gamma diversity
In the RDA, the forward selection procedure kept CPE, clay fraction and nodule density as the best explanatory variables. The model explained 13 % (adjusted R²) of the total variance in the composition of polychaete assemblages (Fig. 7a). The first axis of the RDA discriminated the eastern areas (BGR, IOM and GSR) from the western areas (Ifremer and APEI no. 3). The second axis of the RDA discriminated Ifremer from APEI no. 3 but also captured local-scale variation because replicate samples within areas were distributed along this second axis. The CPE concentrations mostly explained variance along the first axis. CPE was also positively and highly correlated with POC flux at the seafloor and TOC (Fig. 3b).
The first axis of the RDA thus illustrates the influence of food inputs on species composition. The clay fraction contributed to the first and the second axis of the RDA. Grain size distribution differentiated APEI no. 3 from all other areas in the CCFZ (see Hauquier et al., 2019, for details). In the RDA, the clay fraction accounted for the large dissimilarity in species composition of the APEI no. 3. Nodule density was the main contributor to the second axis of the RDA. Variation in nodule density likely accounted for some of the local variation in species composition. The ordination of species (Fig. 7b) showed that Lumbrinerides sp. 2107 was the species most characteristic of the eastern areas; a cirratulid (Aphelochaeta sp. 2062) and a maldanid (Maldanidae sp. 121) were characteristic of APEI no. 3; and two spionids (Aurospio sp. 249 and Laonice sp. 349), a paraonid (Levinsenia sp. 498), and an opheliid (Ammotrypanella sp. 2045) were characteristic of the Ifremer area.
The distance decay of similarity showed two different patterns (Fig. 8a, b). APEI no. 3 had very low values of NNESS compared with all other areas, irrespective of distance (Fig. 8a). There was no statistically significant correlation between NNESS and distance (R 2 adj = 18 %, p = 0.12). However, without APEI no. 3, the NNESS values among pairs of exploration contract areas (Fig. 8b) within the CCFZ per se were negatively correlated with distance (R 2 adj = 0.85, p = 0.006). The slope of the linear regression (−0.0003) multiplied by the mean of species richness estimators for each area (Table 3) provided a rate of species change that ranged from 0.04 species km −1 with the bootstrap estimator (mean species richness of 135 species) to 0.07 species km −1 for the ACE estimator (mean species richness of 234 species). The inverse of these rates of species change predicted geographic ranges of 14 to 25 km.
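The arithmetic behind these rates and ranges can be retraced in a few lines. The sketch below only uses the slope and the mean richness estimates quoted above; the variable names are ours, and the sign of the slope is dropped because only its magnitude matters here.

```python
# Magnitude of the NNESS distance-decay slope (similarity lost per km).
slope = 0.0003

# Mean species richness across areas for two estimators (values quoted in the text).
mean_richness = {"bootstrap": 135, "ACE": 234}

# Rate of species change (species per km) and its inverse (km per species).
rates = {name: slope * s for name, s in mean_richness.items()}
ranges_km = {name: 1 / r for name, r in rates.items()}

for name in mean_richness:
    print(f"{name}: {rates[name]:.4f} species/km, range of about {ranges_km[name]:.1f} km")
```

Rounded, this gives 0.04 and 0.07 species km −1 and ranges of about 25 and 14 km, matching the figures reported above.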
Beta diversity was thus high across the CCFZ, particularly between the exploration contract areas, south of the Clarion Fracture Zone, and APEI no. 3, north of the Clarion Fracture Zone. In addition, the decomposition of beta diversity showed that dissimilarity was mainly due to species turnover (91 %) and not nestedness (9 %). However, species turnover was driven by singletons. The Venn diagram (Fig. 9) showed that, in each area, at least 30 % and up to 67 % of species were unique to one area, so that overall 169 out of 275 species were unique to a given area. Of these, 134 species were singletons (i.e., morphospecies known from a single specimen). Only a single species, Aurospio sp. 249, was sampled in all five areas; 16 species (6 %) were sampled in four areas, 33 species (12 %) were shared among three areas and 56 species (20 %) were shared between two areas. When all individuals and samples were pooled together, rarefaction curves did not level off (Fig. 10a, b) and the number of singletons steadily increased with increasing sample size (Fig. 10b). At this regional scale, nonparametric estimators of species richness ranged from 334 to 498 species (Table 3).
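The species counts read from the Venn diagram can be cross-checked with a short tally. The sketch below uses only the numbers quoted above; the dictionary layout and variable names are ours.

```python
# Number of areas in which a species occurs -> number of species (from the text).
species_by_occupancy = {5: 1, 4: 16, 3: 33, 2: 56, 1: 169}

total_species = sum(species_by_occupancy.values())
percent = {k: round(100 * v / total_species) for k, v in species_by_occupancy.items()}

singletons = 134
singleton_share_all = round(100 * singletons / total_species)               # share of all species
singleton_share_unique = round(100 * singletons / species_by_occupancy[1])  # share of area-unique species

print(total_species, percent, singleton_share_all, singleton_share_unique)
```

The tally sums to 275 species, and it shows that roughly four out of five species unique to one area are known from a single specimen, which is why the turnover signal is so sensitive to singletons.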

Major forces driving local- and regional-scale patterns in community structure and composition
Food supply, sediment grain size and the density of nodules are the three main environmental factors that seem to drive the structure and composition of polychaete assemblages in the CCFZ. The abundance of polychaetes per box core was positively correlated with nodule density, which is consistent with previous studies showing that nodules enhance macrofaunal densities and polychaete diversity (De Smet et al., 2017; Yu et al., 2018). Nodules may have antagonistic influences on different size groups of benthic communities. Meiofaunal assemblages are less abundant in nodule-rich sediments than in nodule-free sediments, which may be due to the lower volume of sediment available in nodule areas (Miljutina et al., 2010; Hauquier et al., 2019). In our study, the volume and surface occupied by nodules were not quantified but the positive relationship between nodule density and polychaete abundance shows that space is not a limiting factor for polychaetes. Nodules also increase habitat heterogeneity, providing hard substrate for sessile organisms and generally enhancing the standing stocks of both sessile and vagile megafauna (Vanreusel et al., 2016; Simon-Lledó et al., 2019). Nodules increase seafloor roughness, thereby increasing friction (Sternberg, 1970; Boudreau and Scott, 1978) and potentially sediment deposition rates. The large sessile suspension feeders may similarly enhance biodeposition (Graf and Rosenberg, 1997). Both processes may decelerate water current, stabilizing sediments and thus increasing organic carbon supply in the same ways that polychaete tube lawns do, for example (Friedrichs et al., 2000). An increase in food supply may explain the higher densities of polychaetes in nodule-rich areas.
At regional to global scales, food input is among the main forcing factors of the structure and function of the abyssal ecosystem, which mainly relies upon 0.5 %-2 % of the organic carbon derived from sea surface primary production (Rowe et al., 1991; Smith et al., 1997; Smith et al., 2008a). Variations in sea surface primary productivity divide the NE Pacific abyss into three main areas (Sokolova, 1997; Hannides and Smith, 2003; Smith and Demopoulos, 2003): the eutrophic abyss in the equatorial upwelling zone (5° S-5° N), with a POC flux of about 1-2 g C m −2 yr −1; the mesotrophic abyss in the equatorial northern Pacific (5-15° N), with a POC flux of about 0.5-1.5 g C m −2 yr −1; and the oligotrophic abyss underlying the North Pacific Subtropical Gyre (15-35° N), with a POC flux lower than 0.5 g C m −2 yr −1. Our meta-analysis confirmed that polychaete abundance was significantly and positively correlated with POC flux at the seafloor, distinguishing areas in the oligotrophic abyss (APEI no. 3, CLIMAX II, DOMES A, EqPac 9 and Kaplan West) with low abundance (4-21 ind. 0.25 m −2) from areas in the mesotrophic abyss (Kaplan Central, Ifremer, PRA, ECHO 1, GSRNOD15A, GSR, IOM, Kaplan East and BGR) with average to high abundance (14-85 ind. 0.25 m −2) and areas in the eutrophic abyss (EqPac 0, 2 and 5) with abundance in the highest range (60-84 ind. 0.25 m −2; see Table 2). The exploration areas sampled in our study all lie within the mesotrophic zone, but APEI no. 3 lies within the oligotrophic zone. An analysis of biogeochemical processes confirmed the very low POC fluxes to the seafloor at APEI no. 3 (1 mg C m −2 d −1) and found respiration rates that were twofold lower than in the exploration areas of the mesotrophic zone (Volz et al., 2018). APEI no. 3 was also characterized by higher clay content, which may be caused by a lower sedimentation rate and a different sedimentation regime (Hauquier et al., 2019; Volz et al., 2018). Polychaete assemblages in APEI no. 3 consistently showed lower abundance, lower species richness and lower alpha diversity. Species turnover was also very high, with APEI no. 3 showing the highest rate of species unique to an area and the lowest NNESS for all pairs of comparisons. The redundancy analysis also suggested that, in addition to food supply, the higher relative proportion of clay contributed to variation in species composition at APEI no. 3. The polychaete assemblage was dominated by cirratulids, with one species significantly contributing to ordination (Aphelochaeta sp. 2062). Some cirratulids are recognized as surface deposit feeders (Jumars et al., 2015) and may prefer the smaller particles predominantly present at APEI no. 3 (D[4,3] = 15.71 µm). At least two cirratulid species can effectively select particle sizes in the clay size range using their tentacles (Magalhães and Bailey-Brock, 2017). The strong shift in community structure and composition of polychaete assemblages between APEI no. 3 and the exploration areas echoes that of megafaunal, nematode (Hauquier et al., 2019) and tanaid assemblages (Błażewicz et al., 2019). The biogeochemical settings and the biological patterns of the three size groups of the benthic fauna thus converge on the conclusion that the structure and functioning of the benthic ecosystem in APEI no. 3 is not representative of any of the four exploration contract areas included in this study.
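The POC flux at APEI no. 3 is quoted per day, whereas the trophic zones are defined per year. A minimal sketch of the unit conversion, assuming a 365-day year, is given below; the function and constant names are ours and only the 1 mg C m −2 d −1 value and the 0.5 g C m −2 yr −1 threshold come from the text.

```python
MG_PER_G = 1000
DAYS_PER_YEAR = 365  # assumption: non-leap year

def mg_per_day_to_g_per_year(flux_mg_per_m2_day):
    """Convert a POC flux from mg C m-2 d-1 to g C m-2 yr-1."""
    return flux_mg_per_m2_day * DAYS_PER_YEAR / MG_PER_G

# Upper bound of the oligotrophic abyss as defined in the text (g C m-2 yr-1).
OLIGOTROPHIC_MAX = 0.5

apei3_flux = mg_per_day_to_g_per_year(1.0)
is_oligotrophic = apei3_flux < OLIGOTROPHIC_MAX
print(f"{apei3_flux:.3f} g C m-2 yr-1, oligotrophic: {is_oligotrophic}")
```

Under this assumption the flux is about 0.37 g C m −2 yr −1, below the 0.5 g C m −2 yr −1 threshold, which is consistent with placing APEI no. 3 in the oligotrophic zone.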
Within the mesotrophic zone, the species composition of polychaete assemblages in the Ifremer exploration area differed from the other exploration areas. Differences were driven by species belonging to common deep-sea deposit feeders such as spionids, paraonids and opheliids (Jumars et al., 2015), whereas a lumbrinerid species characterized the eastern exploration areas (BGR, IOM and GSR). Furthermore, other carnivorous families were relatively more abundant in the eastern areas as well, such as paralacydoniids and sigalionids. These results agree with Smith et al. (2008b) who observed higher abundances of lumbrinerids and amphinomids, two families of carnivorous polychaetes (Jumars et al., 2015), in the eastern CCFZ (Kaplan East). The upper trophic levels indeed tended to be less represented in the Ifremer and APEI no. 3 areas than in the eastern areas. This pattern matches model predictions that food chain length is positively correlated with resource availability in very low productivity systems (< 1-10 g C m −2 yr −1 ; Moore and de Ruiter, 2000;Post, 2002). McClain and Schlacher (2015) formulated this food chain length-productivity relationship as the "one-more-trophic-level" hypothesis to account for a positive productivity-diversity relationship. Species richness and productivity were significantly correlated at eastern CCFZ scale, but no significant correlation was found between alpha diversity and productivity in the meta-analysis at the scale of the NE Pacific. The reason diversity and productivity were not correlated in the meta-analysis, which included data from the literature, could be mainly methodological. In particular, the use of integrative taxonomy in this study versus morphological taxonomy in previous works might hinder comparisons of diversity metrics.
To conclude, our study supports the assumptions behind the creation of nine large APEIs, namely that gradients of sea surface primary productivity determine large-scale patterns and that nodule densities determine local-scale patterns in community structure, species composition and functioning (Wedding et al., 2013). However, among exploration contract areas, there is a shift in community composition and trophic structure between BGR, IOM, and GSR on the one hand and Ifremer on the other hand, suggesting that these two groups do not belong to the same subregion, as hypothesized by Wedding et al. (2013). Environmental conditions at the APEI no. 3 also seem to be beyond the range of those found in exploration contract areas, which may explain why the community structure and species composition of benthic assemblages are so different.

Species turnover and geographic ranges
Species turnover was best illustrated by the distance decay of NNESS similarity, which showed two different patterns. Firstly, APEI no. 3 showed very low similarity with all other areas, irrespective of distance. Secondly, similarity decayed linearly with distance among the exploration contract areas located within the CCFZ. Beyond variation in food inputs, as discussed above, the large dissimilarity of polychaete assemblages in APEI no. 3 may suggest a major physiographic barrier between the north and south of the Clarion Fracture. The Clarion Fracture Zone is a long and narrow submarine mountain range characterized by a peak and trough exceeding 1800 m difference in elevation (Hall and Gurnis, 2005), which may be a barrier to dispersal for abyssal fauna. In the Atlantic, the Vema-TRANSIT expedition tested the influence of the Mid-Atlantic Ridge (MAR) and the Vema Fracture Zone (VFZ) on distribution and connectivity patterns of abyssal fauna with contrasting results (Riehl et al., 2018a). The MAR is not a barrier to dispersal for nematode species of the genus Acantholaimus, a pattern already found for 61 copepod species of the genus Mesocletodes (Menzel et al., 2011). However, the MAR is differently permeable to dispersal for three families of isopods, depending on their habits and swimming abilities (Bober et al., 2018). In particular, connectivity was very low for Macrostylidae species, a family of burrowing isopods with limited dispersal abilities (Riehl et al., 2018b). The species composition of the two polychaete families Spionidae and Polynoidae also differed on both sides of the VFZ, which may be due to limited dispersal and different habitat characteristics (Guggolz et al., 2018). This was, however, not the case for species of Laonice, which tended to show large ranges of up to 4000 km across the eastern and western Atlantic (Guggolz et al., 2019), or species of Aurospio and Prionospio, which could show pan-oceanic distribution (i.e., Pacific and Atlantic oceans; Guggolz et al., 2020). Our observations about Aurospio sp. 249, which was the only species sampled in all five areas, confirm the potential of some spionids to disperse across large geographic distances (Guggolz et al., 2020). In the CCFZ, 17 new species of polynoids have been described based on morphology and DNA, of which 4 species are shared between APEI no. 3 and the exploration areas. In the abyssal Pacific, the CCFZ and the Peru Basin share 9 species of scavenging amphipods, which are highly motile and thus potentially cross the Clipperton and Galapagos fracture zones (Patel et al., 2018). However, species identification was based on morphology only, although cryptic species are common among scavenging amphipods, even in abyssal lineages (Brandt et al., 2012; Havermans et al., 2013). The influence of the fracture zones on the dispersal of the abyssal fauna remains to be better understood as the Clarion and Clipperton fractures may act as a barrier for species with low dispersal abilities, such as infaunal brooders. If so, the representativeness of seven out of the nine APEIs, which lie partly beyond the fractures, may be questionable.
Moreover, the slope of the linear decay of NNESS similarity within the CCFZ suggests an average range of 14 to 25 km per species. This average range masks large variance between a small pool of widespread species, known from two or more areas, and a large pool of rare species, only known from one study area and, in most cases, only known from a single individual. This high frequency of singletons may also significantly bias the estimation of species ranges (see below for a discussion on singletons). However, based on the best knowledge we have, our study suggests that, on average, the spatial range of polychaete species in the CCFZ is on the order of 20 km. This figure can be compared with the scale of a mining operation: rounding the production rate to 1.5 Mt yr −1 and using a nodule density of 15 kg m −2, an area of 100 km 2 would be mined each year. In other words, every year nodule mining would affect an area that is equivalent to the average geographic range of polychaete species.
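The mined area per year follows directly from the production rate and the nodule density. The sketch below retraces that calculation, assuming metric megatonnes (1 Mt = 10^9 kg); the variable names are ours, and the side length of an equivalent square is added only to ease comparison with a linear range.

```python
production_kg_per_yr = 1.5e9    # 1.5 Mt per year, assuming 1 Mt = 1e9 kg
nodule_density_kg_per_m2 = 15   # nodule density used in the text

area_m2 = production_kg_per_yr / nodule_density_kg_per_m2
area_km2 = area_m2 / 1e6        # 1 km2 = 1e6 m2

# Side of a square with the same area, for comparison with a linear range in km.
side_km = area_km2 ** 0.5

print(f"{area_km2:.0f} km2 per year, i.e. a square of about {side_km:.0f} km on a side")
```

A square of 100 km 2 is 10 km on a side, the same order of magnitude as the 14 to 25 km average range estimated above.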

4.3 The under-sampling bias: how many polychaete species live in the CCFZ?
Considering that the economic feasibility of nodule mining requires, for any single operation, mining a minimum of ca. 100 km 2 of abyssal seafloor per year for a couple of decades, there is no doubt that the benthic ecosystem will be subjected to adverse environmental impacts and that recovery, if any, will take centuries (Miljutin et al., 2011; Vanreusel et al., 2016; Gollner et al., 2017; Jones et al., 2017).
The main issue that has to be addressed is how significant these adverse impacts will be: will they cause "serious harm" (Levin et al., 2016) and, in particular, what will be the magnitude of biodiversity loss (Van Dover et al., 2017)? To assess the significance of adverse impacts due to nodule mining, one of the key unknowns is whether the deep sea, including abyssal fauna, is hyper-diverse (Hessler and Jumars, 1974; Grassle and Maciolek, 1992; Paterson et al., 1998) or not (May, 1992; Rex et al., 2005). Locally, alpha diversity of polychaete assemblages is high in the CCFZ (Paterson et al., 1998; Glover et al., 2002; Wilson, 2017), particularly for the equitability component of diversity, as exemplified by the slopes of individual-based rarefaction curves and a ratio of individuals to species of two to three at a local scale. Rarefaction curves do not level off at any of the sampling areas, highlighting that species richness has been systematically under-sampled, even at DOMES A, where 41 box cores have been sampled (Wilson, 2017). At a regional scale, Glover et al. (2002) reported a total of 177 polychaete species in 2.94 m 2 along a 3260 km latitudinal gradient of productivity in the NE Pacific and a total of 183 species in 21 m 2 along a 2800 km longitudinal transect crossing the CCFZ. Janssen et al. (2015) found 233 MOTUs of polychaetes from epibenthic sledge samples of the BGR and Ifremer areas separated by 1400 km. Along this same transect, using an integrative taxonomy approach, here we report a total of 275 species from 30 quantitative box cores, covering an area of 7.5 m 2. The two latter studies, relying partly or totally on DNA barcoding, yield higher numbers of species than the two former regional assessments based on morphology only. Our observations during the identification process indeed revealed cryptic species, some of them sympatrically distributed. The presence of cryptic species has already been observed by Janssen et al. (2015), who suggested that the specific environmental conditions have already selected for the best morphological characters, resulting in convergent speciation in other aspects as well, such as behavior or physiology. Integrative taxonomy thus not only provides more accurate estimates of species diversity but also facilitates comparisons across datasets. Over 90 % of the species in the abyssal Pacific are new to science and there have been few attempts to name them, although DNA sequences can easily be matched. Sequence matching thus showed that 26 MOTUs are shared between Janssen et al. (2015) and our study. The overlap is low, but it should be noted that we had only 71 COI sequences belonging to 45 MOTUs to compare with the 556 COI sequences belonging to 233 MOTUs from Janssen et al. (2015). This highlights a shortcoming of COI-based barcoding because success rates for COI sequencing are generally low. A combination of several genetic markers associated with formal morphological descriptions is thus essential to accurately assess species diversity. In addition, Janssen et al. (2015) used an epibenthic sledge and we used a box corer. These two devices sample different components of benthic communities. During the SO239 cruise, epibenthic sledge samples provided a collection of 278 specimens and 80 MOTUs of polynoids, a family of larger epifaunal polychaetes, but in our box core samples, we only found one polynoid.
Overall, the combination of high local diversity, unsaturated rarefaction curves, high levels of cryptic diversity and high rates of species turnover suggests that polychaete diversity in the CCFZ is large and vastly under-sampled. Within the eastern CCFZ, the linear decay of NNESS similarity suggests a species turnover of 0.04 to 0.07 species km −1, and decomposition of the beta diversity shows that 90 % of dissimilarity is due to spatial turnover. This rate of species change is 1 order of magnitude higher than the rate found by Wilson (2017) for polychaetes (0.0056 species km −1) and also higher than the rate he found for isopods (0.012 species km −1). These discrepancies may again reflect a high level of cryptic diversity. Wilson (2017) acknowledged that the rates of change he found may be underestimated, particularly for polychaetes, due to the fact that identifications were based on morphology only. The rate of species turnover that we report here for a 1440 km transect across the eastern CCFZ is, however, 20 times lower than the rate of 1 species km −1 reported by Grassle and Maciolek (1992) from a 180 km transect at 2100 m in the northwestern Atlantic. This difference is roughly consistent with the hypothesis of Grassle and Maciolek (1992) that in the deepest and most oligotrophic parts of the ocean, species richness may be lower by 1 order of magnitude. Still, an extrapolation of our rate of species turnover to the 6 million km 2 of the CCFZ, as Grassle and Maciolek (1992) did for the whole deep sea, yields predictions of at least 240 000 polychaete species, i.e., a number of species equivalent to the number of accepted marine species globally (WoRMS, 2019). This prediction is in sharp contrast with the outcome of nonparametric estimators of species richness, such as Chao or Jackknife, which provide a maximum estimate of 498 species. However, such estimators implicitly assume that the number of singletons decreases with increasing sample size (Melo, 2004), but the number of singletons steadily increased with sample size in this study. In such circumstances, the nonparametric species estimators underestimate species richness (Melo, 2004; Coddington et al., 2009). In an intensive survey of spiders in 1 ha of tropical forest, Coddington et al. (2009) found 29 % of singletons and tested the null hypothesis of under-sampling against ecologically driven hypotheses to explain this "anomalously" high frequency of singletons. They concluded that under-sampling was the most parsimonious explanation for the high frequency of singletons and that it causes a systematic negative bias of species richness estimators. In the deep sea, an anomalously high rate of singletons (about one-third of the sampled species) is usually the rule in macrofaunal surveys (Gage, 2004), and the most parsimonious hypothesis that still needs to be tested is thus that the deep-sea macrofauna has been systematically under-sampled.
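To make the role of singletons concrete, the sketch below implements the classical abundance-based Chao1 estimator, one member of the "Chao" family named above. This is an illustration under our own assumptions: the text does not specify which Chao variant was used, the function name is ours, and both abundance vectors are hypothetical.

```python
from collections import Counter

def chao1(abundances):
    """Classical Chao1 richness estimate from per-species abundance counts."""
    counts = [a for a in abundances if a > 0]
    s_obs = len(counts)
    freq = Counter(counts)
    f1, f2 = freq[1], freq[2]  # numbers of singletons and doubletons
    if f2 > 0:
        return s_obs + f1 * f1 / (2 * f2)
    # Bias-corrected form, used here when there are no doubletons.
    return s_obs + f1 * (f1 - 1) / 2

# Hypothetical small and larger samples in which singletons keep accumulating.
small_sample = [5, 3, 2, 2, 1, 1, 1]
large_sample = [10, 6, 4, 4, 2, 2, 1, 1, 1, 1, 1, 1]

print(chao1(small_sample), chao1(large_sample))
```

In this toy case the estimate keeps climbing as the sample grows because new singletons appear faster than old ones are resampled, which is the situation in which such estimators are expected to fall short of true richness.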
Although under-sampling causes an underestimation of species richness, it may also lead to an overestimation of the distance decay of similarity because singletons, considered endemic to an area in the analysis, may have much wider distributions. In conclusion, our estimates of the number of polychaete species inhabiting the CCFZ and potentially threatened by nodule mining range from 498 to 240 000 species. The former estimate assumes that we have already sampled about half of the regional diversity and further suggests that most species have a large geographical range. The latter estimate assumes that we have sampled 0.1 % of the polychaete species in the CCFZ and that these species have narrow geographical ranges about the size of the area that will presumably be mined in 1 year by a single mining operation.
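The two bounds and the sampled fractions quoted above can be retraced as follows. The sketch simply reproduces the arithmetic described in the text (minimum rate of species change multiplied by the area of the CCFZ); the variable names are ours.

```python
min_rate_species_per_km = 0.04   # lower rate of species change from the NNESS decay
ccfz_area_km2 = 6e6              # area of the CCFZ nodule province

# Upper bound: extrapolation of the turnover rate to the whole CCFZ.
upper_estimate = min_rate_species_per_km * ccfz_area_km2

# Lower bound: maximum nonparametric estimate; observed richness from this study.
lower_estimate = 498
observed_species = 275

percent_sampled_if_lower = 100 * observed_species / lower_estimate
percent_sampled_if_upper = 100 * observed_species / upper_estimate

print(f"upper estimate: {upper_estimate:.0f} species")
print(f"sampled: {percent_sampled_if_lower:.0f} % (lower bound) "
      f"or {percent_sampled_if_upper:.1f} % (upper bound)")
```

The 275 observed species amount to about 55 % of the lower bound but only about 0.1 % of the upper bound, which is the spread of uncertainty described in this paragraph.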

Conclusions
Food inputs and nodule density influence the structure and composition of polychaete assemblages in the CCFZ. This is a confirmation of hypotheses underpinning the design of the APEIs. Increasingly oligotrophic conditions cause two shifts in the trophic structure and species composition of polychaete assemblages. The first shift suggests that within the eastern central subregion defined by Wedding et al. (2013) and sampled in the present study, the eastern contract areas (BGR, IOM and GSR) and the western contract area (Ifremer) belong in fact to different subregions, and if so they should be represented by two different APEIs. The most significant shift in community structure and composition was, however, found between APEI no. 3 and the nodule exploration areas. APEI no. 3 is found in oligotrophic conditions north of the Clarion Fracture Zone, whereas exploration areas experience mesotrophic conditions south of the Clarion Fracture Zone. The scantiness of food supply and a barrier to dispersal may thus compromise the representativeness of APEI no. 3 and call into question its ability to meet its purpose of preserving the biodiversity of any of the contract areas considered in this study. However, the sampling effort in both the contract areas and the APEI remains quite limited. Furthermore, there are vast gaps in knowledge regarding life cycle and population dynamics that would need to be better constrained to fully assess the risks and provide guidance in mining management. In order to ascertain that the APEIs collectively meet their goal of preserving the biodiversity of the CCFZ, an ambitious research agenda is needed, the funding of which could rely on the willingness of contractors and sponsoring states but could also become a priority of the future Environmental Compensation Fund to be created by the regulations on exploitation of mineral resources in the Area (ISBA/25/C/WP.1, 2019).
The Area is defined by the United Nations Convention on the Law of the Sea (UNCLOS, 1982) as the seabed and subsoil beyond the limits of national jurisdiction.
The efficiency of the regional environmental management plan for the CCFZ is crucial in the face of the yet uncertain but potentially adverse impacts of nodule mining. Within the CCFZ per se, the diversity of polychaete assemblages is even higher than previously thought due to a high level of cryptic diversity. Species turnover is high with a minimum estimated rate of species change of 0.04 species km −1, suggesting an average geographical range of 25 km and a number of polychaete species in the CCFZ that may equal the number of all currently known marine species. If true, the risk of species extinction is very high because the environmental footprint of nodule mining would largely exceed the range of many species. On the contrary, nonparametric estimators of species richness suggest that total species richness across the five study areas does not exceed 498 species, which likely implies a species range much larger than 25 km. Both methods of estimating species richness can, however, be severely biased by singletons. Singletons represent 49 % of the 275 species of polychaetes that were sampled. The most parsimonious hypothesis to explain such a high rate of singletons is under-sampling. Under the auspices of the ISA, the synthesis of ongoing studies from independent science and contractors in the CCFZ will certainly contribute to filling some knowledge gaps on species richness and turnover, but differences in objectives, strategies and methodologies among studies are also likely to put some limits on the usefulness of the exercise. The JPI Oceans pilot action "Ecological aspects of deep-sea mining" demonstrated how powerful such a joint and coordinated initiative can be. In the framework of a similarly ambitious and collective effort to inventory species richness in the CCFZ, a stratified random sampling at nested scales, from regional down to seascapes, would provide the scales of species turnover, while intensive sampling of selected habitats up to the point where the number of singletons decreases with sample size would provide accurate estimates of species diversity. Both strategies should consider different taxonomic and functional groups of the abyssal fauna, which are likely to show different responses to nodule mining. Such an approach, based on standardized sampling methods and statistical sampling strategies, is needed to assess the potential risks and scales of biodiversity loss due to nodule mining in the CCFZ.
Data availability. DNA sequences are available in BOLD (https://doi.org/10.5883/DS-GKG001; Bonifácio, 2019) or GenBank databases. The abundance data analyzed in the present study together with BOLD IDs (sample ID and process ID) and GenBank accession numbers are available in the PANGAEA database (https://doi.org/10.1594/PANGAEA.902860; Bonifácio et al., 2019).
Author contributions. LM and PMA conceived the project and designed the sampling. LM and PB performed the sampling and processed the samples. PB identified (morphology and DNA) the polychaetes. LM and PB analyzed and interpreted the data. All authors prepared and contributed to the manuscript.
Competing interests. The authors declare that they have no conflict of interest.
Special issue statement. This article is part of the special issue "Assessing environmental impacts of deep-sea mining – revisiting decade-old benthic disturbances in Pacific nodule areas". It is not associated with a conference.
Acknowledgements. The research leading to these results has received funding from the Ifremer program Ressources Minérales Marines (REMIMA), the JPI Oceans pilot action "Ecological aspects of deep-sea mining" and the European Union Seventh Framework Program (FP7/2007-2013) under the MIDAS project, grant agreement no. 603418. We are grateful to the crew of the RV Sonne and to all of the people involved in the field sampling and sample processing during the SO239 cruise. We would like to thank Stefanie Kaiser, Sarah Schnurr, and Ana Hilário for their expertise in washing and sieving samples and Lenka Neal for live-sorting the worms. Additionally, we thank Baptiste François for sample sorting in the laboratory. We are also grateful to all of the people who were involved in the molecular analysis: Aliou Dia, Guillaume Lannuzel, Emmanuelle Omnes, Alana Jute, Mohamed Dosoky and Gavin Campbell. Special thanks to Thomas Dahlgren for looking after and sharing the data of some families of polychaetes and to Ann Vanreusel, Freija Hauquier, and Felix Janssen for providing the abiotic data of the CCFZ. Finally, we would like to thank Tina Treude and the referees (Paul Dando and three anonymous referees) for their critical reviews and useful comments, which significantly improved the quality of the final paper.
Financial support. This research has been supported by the Seventh Framework Programme (MIDAS, grant no. 603418). The study also received funding from REMIMA and JPI Oceans.
Review statement. This paper was edited by Tina Treude and reviewed by Paul Dando and three anonymous referees.