Structural elucidation and environmental distributions of butanetriol and pentanetriol dialkyl glycerol tetraethers (BDGTs and PDGTs)

Butanetriol and pentanetriol dialkyl glycerol tetraethers (BDGTs and PDGTs) are membrane lipids recently discovered in sedimentary environments and in the methanogenic archaeon Methanomassiliicoccus luminyensis. They possess an unusual structure, which challenges fundamental assumptions in lipid biochemistry. Indeed, they bear a butanetriol or a pentanetriol backbone instead of a glycerol at one end of their core structure. In this study, we unambiguously located the additional methyl group of the BDGT compound on the C3 carbon of the lipid backbone via high-field NMR experiments. We 15 further systematically explored the abundance, distribution and isotopic composition of BDGTs and PDGTs as both intact polar and core lipid forms in marine sediments collected in contrasting environments of the Mediterranean Sea and Black Sea. High proportions of intact polar BDGTs and PDGTs in the deeper methane-laden sedimentary layers and relatively Cdepleted BDGTs, especially in the Rhone delta and in the Black Sea, are in agreement with a probable methanogenic source for these lipids. However, contributions from heterotrophic archaea to BDGTs (and PDGTs) cannot be excluded, particularly 20 in the eastern Mediterranean Sea, and contrasting BDGT and PDGT headgroup distribution patterns were observed between the different studied sites. This points to additional, non-methanogenic, archaeal sources for these lipids.


Introduction
Unique membrane lipids were a key argument to postulate the existence of Archaea as a third and independent domain of life, as distantly related to Bacteria as to Eukarya, when Woese et al. (1990) proposed their revised tree of life. Membrane lipids 25 form an envelope that separates cells from their environment and protects their interior components. Specific chemical properties define the fluidity and permeability of the membrane barrier, regulating what can enter the internal cell compartment.
On the one hand, membrane lipids from members of all domains of life share some common characteristics, such as their amphiphilic nature. That is, they all possess apolar alkyl chains and polar headgroups held together by a glycerol moiety (Lombard et al., 2012). On the other hand, membrane lipids of Archaea fundamentally differ from those of Bacteria and 30 Eukarya in that they contain (bi)phytanyl chains constituted from the condensation of several isoprenoid units and ether linkages to the sn2 and sn3 carbons of a glycerol backbone (De Rosa and Gambacorta, 1988;Koga and Morii, 2005). On the contrary, Bacteria and Eukarya generally produce fatty acyl chains linked to the sn1 and sn2 carbons of the glycerol backbone (Kates, 1977). Intensive exploratory analyses of lipid extracts from pure cultures and environmental samples over the last decades (e.g. Elling et al., 2017;Koga et al., 1993;Koga and Morii, 2005;Liu et al., 2012;Meador et al., 2014;Paściak et al., 35 2003;Schouten et al., 2000Schouten et al., , 2013Sturt et al., 2004;Weijers et al., 2006), revealed a large diversity of membrane lipids and a more complex picture than first considered by Woese et al. (1990). Several non-isoprenoid ether glycerol lipids were identified as of bacterial origin, such as the branched glycerol dialkyl glycerol tetraethers (brGDGTs, Weijers et al., 2006) or the alkyl glycerol ether lipids (AGEs; e.g. Hinrichs et al., 2000;Pancost et al., 2001;Rütters et al., 2001). To date, AGEs have been observed in a wide range of pure cultures, covering different bacterial phyla with contrasting physiologies (Vinçon-Laugier et 40 al., 2016). In addition, lipids containing both a (bi)phytanyl and a non-isoprenoidal alkyl chain were previously observed in natural environments (Liu et al., 2012;Schouten et al., 2000). Nevertheless, all the above-mentioned membrane lipids possess glycerol backbones, which appear to be a common feature shared by members of all domains of life.
The recent identification of butanetriol and pentanetriol dialkyl glycerol tetraethers (BDGTs and PDGTs, Knappy et al., 2014;Zhu et al., 2014), in which one glycerol is substituted by a butanetriol or pentanetriol, challenges this assumption. Tandem 45 mass spectrometry complemented with gas chromatography (GC) detection of butanetriol after hydrolysis (Zhu et al., 2014) demonstrated the presence of a 1,2,3 butanetriol backbone in BDGTs but did not specify its configuration within the lipid molecule, notably its linkages with the biphytanyl side chains. Subsequently, BDGTs and PDGTs were observed in diverse samples, both as intact polar lipids (IPLs, lipids with polar headgroups) and core lipids (CLs, lipids without polar headgroups), from recent organic-rich estuarine sediments (Meador et al., 2015) to old Jurassic marine shales (Knappy et al., 2014) and deep 50 subsurface sediments (Becker et al., 2016;Zhu et al., 2014). Furthermore, Becker et al. (2016) identified BDGTs as prominent membrane lipids in Methanomassiliicoccus luminyensis, currently the only cultured representative of the seventh order of methanogens; lower quantities of PDGTs were also observed in this archaeon. BDGTs accounted for up to 82% of the detected core lipids, while being absent from 25 different pure cultures spanning the main archaeal phyla and encompassing several representatives of the different orders of methanogens, thus leading to the conclusion that the capability to synthesize BDGTs 55 and PDGTs among the methanogens could potentially be restricted to the order Methanomassiliicoccales (Becker et al., 2016).
However, this exclusive chemotaxonomic relationship contrasts with the wide diversity of geochemical settings where BDGTs and PDGTs have been detected and cultured representatives for several archaea phyla remain unavailable for screening. For example, Meador et al. (2015) had suggested members of the Miscellaneous Crenarchaeotal Group, now termed Bathyarchaeota, as a source of BDGTs in the estuarine White Oak River Basin due to the similarly high relative abundances 60 and correlation of BDGTs with bathyarchaeotal 16S genes in the sediment profile.
In the present study, the exact structure of the core BDGT molecule was elucidated through high-field two-dimensional NMR analysis of BDGT purified from a pure culture of M. luminyensis. In addition, the relative abundance of BDGTs and PDGTs and stable carbon isotopic composition (δ 13 C) of BDGTs were systematically investigated in a set of 48 marine sediment samples covering a wide range of environmental and geochemical conditions. Our aim was to provide new insights into the 65 diversity and carbon metabolism of BDGT and PDGT producers and on the potential roles of these novel lipids in the cell membrane.

Sample collection
Marine sediment samples were collected with a combination of multi-corer and gravity coring at eight different sites ( Fig. 1) 70 in the Mediterranean and Black Seas during two expeditions: RV Meteor Cruise M84/1 (Zabel et al., 2011) and RV Poseidon Cruise POS450 (Heuer et al., 2014). Description of the environmental characteristics, geochemical and sedimentary conditions of the eight visited sites can be found in Schmidt et al. (2017). From these locations, 48 samples spanning different environments and geochemical conditions were selected for detailed organic geochemical investigation (Suppl. Table 1).

Marine sediment lipid extraction and quantification 75
Lipid extraction of sediment samples was performed according to a modified Bligh & Dyer method (Sturt et al., 2004). Samples (ca. 50-60 g wet weight) were lyophilized and ultrasonically extracted four times with a mixture of dichloromethane (DCM)/methanol (MeOH)/buffer (1:2:0.8; v:v:v), with a phosphate buffer at pH 7.4 for the first two steps and a trichloroacetic acid buffer at pH 2.0 for the remaining two steps. The samples were extracted in two final steps with DCM/MeOH (5:1). Each extraction step was followed by centrifuging the samples for 10 min and the resulting supernatants were combined in a 80 separatory funnel. An equal amount of DCM and Milli-Q H2O was added to the supernatants and the separatory funnel was thoroughly mixed. After phase separation the aqueous phase was washed three times with DCM and the water phase was discarded. The organic phase was washed three times with Milli-Q H2O, collected and evaporated to dryness under a stream of N2. The total lipid extracts (TLE) obtained were stored at -20 °C until further analysis.
Detection and quantification of intact polar lipids (IPLs) were carried out on a maXis plus ultra-high resolution quadrupole-85 time-of-flight mass spectrometer (Q-ToF-MS; Bruker), coupled to an Ultimate 3000RS ultra high pressure liquid chromatography instrument (UHPLC; Dionex). IPLs were chromatographically separated by an ACE3 C18 column (150 x 2.1 mm; particle size 3 µm; ACE) as described by Zhu et al. (2013) and detected in positive ionization mode scanning a mass range of 150 -2000 Da. Lipids were identified by retention time, exact mass (± 0.001 Da), and characteristic fragmentation patterns obtained by data-dependent MS² scans. IPLs were quantified by comparison of the intensity of parent ions with that 90 of a C46-glycerol trialkyl glycerol tetraether (GTGT;Huguet et al., 2006) added as an internal standard. IPL concentrations were corrected for their response factors determined from purified and commercially available standards following the procedure described by Becker et al. (2016).

Stable carbon isotope analysis
Six of the 48 marine sediment TLE from the Rhone delta, the eastern Mediterranean Sea and the Black Sea (Samples # 5,8,95 20,22,34,38 of the dataset; detailed information available in Suppl. Table 1) were selected to investigate the natural stable isotopic composition of the BDGT-derived biphytanes. Only IPL-BDGTs were analyzed as they are more likely to derive from living organisms. Before isotopic analysis via gas chromatography coupled to isotope ratio mass spectrometry (GC-IRMS), IPL-BDGTs were purified with two steps of preparative HPLC and then cleaved into biphytanes (bp), as detailed below.

Preparative HPLC 100
TLE samples were first separated into IPL and CL fractions by preparative HPLC (Agilent 1200 series) with a modified version of the protocol reported by Meador et al. (2015). TLE separation was performed on an LiChrospher Diol column (250 x 10 mm; 5 µm, Grace) with n-hexane/isopropanol (90:10; v/v) as eluent A and 100% isopropanol as eluent B. Chromatographic conditions were as follows: a gradient from 100% A at 3 ml min -1 to 24% B in 15 min and then to 100% B in 5 min at a flow rate of 2 ml min -1 . 100% B at 2 ml min -1 was maintained for 10 min before switching back to the initial conditions for 15 min. 105 During the run, the eluent flow was split (split ratio 150:1; Agilent active splitter G1968D) between the fraction collector and an online mass spectrometer (Agilent 6130 single quadrupole) allowing continuous monitoring of retention time stability.
The collected fractions were subsequently dried under a stream of N2. IPLs were then converted into CLs by acid hydrolysis with 1 M HCl in MeOH for 3 h at 70 °C (Pitcher et al., 2009;Elling et al., 2014). 110 CL-BDGTs were further separated from CL-GDGTs according to Zhu et al. (2014). The hydrolyzed IPL fraction was injected into an Agilent 1200 normal phase HPLC-system equipped with a PerfectSil CN-3 column (250 x 10 mm, 5 µm, MZ Analysentechnik). Separation was achieved at a flow rate of 2.5 ml min -1 with an elution gradient from 100% A (99:1 nhexane:isopropanol) held for 5 min, ramping to 10% B (90:10 n-hexane:isopropanol) at 12 min and then to 100% B at 30 min finally holding 100% B for 12 more min. The solvent system was then returned to initial conditions for 10 min. Retention time 115 stability was monitored via simultaneous MS detection as above. BDGTs and GDGTs were collected between 13 and 16 min and between 16 and 21 min, respectively.

Ether cleavage and compound specific stable carbon isotope analysis
Ether cleavage with BBr3 was performed on the purified CL-BDGT and CL-GDGT fractions to convert them into biphytanes (bp) following the protocol by Kellermann et al. (2012). Briefly, aliquots of the dried fractions were amended with 200 μl of 120 1 M BBr3 dissolved in DCM and incubated in sealed tubes at 60 °C for 2 h followed by reduction with 200 μl of 1 M superhydride in tetrahydrofuran. Liquid-liquid extraction with H2O and n-hexane was performed three times and the aqueous phase was discarded. The apolar phase was purified on a silica column with n-hexane as eluent. Carbon isotopic composition of biphytanes was measured on a Trace GC Ultra coupled to a GC-IsoLink ConFlow IV interface and a Delta V Plus IRMS (Thermo) equipped with a Rxi-5ms column (30 m x 250 μm x 0.25 μm, Restek). The injection temperature was set at 300 °C; 125 the initial oven temperature was held at 60 °C for 1 min, followed by an increase to 150 °C at 10 °C min -1 and then to 320 °C at 4 °C min -1 with a flow rate of 1.2 ml min -1 . Every sample was measured in duplicate and the associated error was lower or equal to 1‰.

Stable carbon isotope composition of potential carbon sources
In order to gain information on the C sources of BDGTs, the stable carbon isotope analysis of the total organic carbon (TOC), 130 the dissolved inorganic carbon (DIC) and methane (CH4) was determined. TOC content and stable carbon isotopic analysis (δ 13 CTOC) were previously described in Schmidt et al. (2017). Stable carbon isotopic analysis of DIC (δ 13 CDIC) and CH4 (δ 13 CCH4) were measured on shore within the year following each cruise. Prior to analysis, pore water aliquots of 2 ml were stored at -20 °C in vials without headspace for δ 13 CDIC analysis and ca. 3 ml of sediment were stored at 4 °C in 22 ml gas tight vials amended with 5 ml NaOH for δ 13 CCH4 analysis. δ 13 CDIC was measured using a gas bench coupled to a Finnigan MAT 252 135 mass spectrometer. Samples were prepared as follows: 100 µl of phosphoric acid were transferred to glass tubes, which were subsequently sealed with butyl septa and plastic caps and purged five times with helium. Fluid samples (0.2-1.0 ml) were injected into the purged tubes by syringe and allowed to degas CO2 from the acidified aqueous matrix for at least five hours.
Carbon isotopic compositions of CO2 were then analyzed in subsamples of the gas phase. The precision of the analysis was 0.1‰ (1σ). CH4 stable carbon isotope values were determined on a Trace GC Ultra coupled to a GC combustion III interface 140 and a Delta Plus XP IRMS (Thermo) equipped with a Carboxen 1006 Plot column (Supelco, Sigma Aldrich). Injection temperature was set at 200 °C, the oven temperature was held at 40 °C for 6 min and the flow rate at 3 ml min -1 .

M. luminyensis strain was purchased at DSMZ (Leibniz Institute DSMZ -German Collection of Microorganisms and Cell
Cultures) and grown in an anaerobic medium optimized by the DSMZ (protocol 1637). A total volume of 2 l of culture was 145 grown in two 2 l Schott bottles, inoculated with 10% (v/v) of a previous culture grown under the same conditions, i.e. at 37 °C under an atmosphere containing 80% H2 and 20% CO2. After 16 days, cells were harvested by centrifugation (25 min; 4500 rpm) and were subsequently lyophilized.
Direct acid hydrolysis of the freeze-dried biomass pellet was performed according to Becker et al. (2016) using 1 M HCl in MeOH for 16 h at 70 °C. CLs were extracted by ultra-sonication (two times 20 min) with a 5:1 DCM:MeOH solvent mix and 150 the extracts were collected in a separatory funnel. Lipids were partitioned into the organic phase following addition of Milli-Q H2O. The water phase was then extracted three times with an equal amount of DCM. The organic phases were pooled in an Erlenmeyer flask before transfer into the separation funnel and further washing (three times) with an equal amount of Milli-Q H2O. The water phase was discarded and the organic solvent was evaporated under a gentle flow of N2. CL-BDGT-0 was purified by preparative HPLC following the same preparative HPLC protocol as described in Section 2.3.1. BDGT-0 was 155 collected between 12.3 and 13 min. BDGT-0 was estimated to be pure at 99.4% via UHPLC-QToF-MS following the protocol by Becker et al. (2015).

NMR analysis
BDGT-0 (860 µg) was dissolved in CDCl3 (170 µL) and transferred to a 3 mm NMR tube. 1 H, 13 C spectra and two-dimensional 160 COSY, TOCSY, 1 H-13 C HSQC and 1 H-13 C HMBC were acquired at 600 MHz on a Varian VNMRS spectrometer equipped with triple resonance 1 H observe cryogenic probe. Spectra were processed and analyzed using VnmrJ 4.2 software provided with the spectrometer.

Statistical analysis
Principal component analysis (PCA) was performed with the R software using factominR and vegan packages. PCA requires 165 all variables to follow a normal distribution, thus all data were reduced and centered before analysis.

NMR analysis of BDGT-0 extracted from M. luminyensis
Analysis of high-resolution 1 H and 13 C one-dimensional spectra revealed a number of downfield signals (3.15-3.70 ppm) that and B1'). Similarly, A1' and B1' could be correlated to A2' and B2' respectively as both were partially resolved from A2 and B2. A3/3' and B3/3' appeared to give a single methine resonance in the 1 H-13 C HSQC spectrum but were partially resolved in the 1 H-13 C HMBC, showing that A3 and A3' could be distinguished by their 13 C chemical shifts (29.71 and 29.54 ppm 190 respectively) and were resolved from B3/3' (29.60 ppm). Aside from this, the remainder of the (highly overlapped) branched alkyl chain chemical shifts (from A4'/B4' onwards, Supplementary Figure 2A) were superimposable on those of GDGT-0 suggesting an identical arrangement of branched methyl groups (Table 1). Together these analyses confirmed the presence of a butanetriol group at the opposing end of the molecule (Fig. 2) and four resolvable ether linkages.

BDGT and PDGT abundance and diversity
Geochemical parameters of the 48 analyzed samples were previously described in Schmidt et al. (2017). Notably, TOC values ranged between 0.08 and 4.37% (Suppl. Table 1). The highest TOC content was measured in the sapropel layers of the eastern Mediterranean Basin (GeoB15103) while the basin sites, i.e. eastern Mediterranean Basin (excluding the sapropel layers), Cap de Creus Canyon and Ligurian-Provençal Basin (GeoB15103, GeoB17302, GeoB17304), exhibited the lowest TOC contents. 200 From the 48 samples, ranging in depth from surface to 635 cm and ages from modern to ~173,000 years, BDGTs and PDGTs were detected as CLs and IPLs in 45 and 37 samples, respectively. PDGTs were not detected in samples from the Black Sea (GeoB15105). Concentrations ranged from 0.04 to 9.2 µg g -1 Corg for BDGTs and from 0.04 to 3.1 µg g -1 Corg for PDGTs, which correspond, in average, to 8 and 3% of the GDGT-0 concentration in these samples, respectively (Suppl. Table 2).
The relative abundance of these IPLs within the total pool comprised of both the corresponding CLs and IPLs (on average, 74% for IPL-BDGTs and 93% for IPL-PDGTs) was much higher than the relative abundance of IPL-GDGTs (18% on average; Suppl. Table 2 and Fig. 3).

Principal Component Analysis (PCA) on BDGT and PDGT distribution 210
In order to evaluate the variability in BDGT and PDGT distribution within the dataset, a PCA was performed with the major environmental variables and indices of BDGT and PDGT relative abundances. In addition to the fractional abundance for each

BDGT and PDGT pool [f(CL-BDGTs), f(1G-BDGTs), f(2G-BDGTs), f(CL-PDGTs), f(1G-PDGTs), f(2G-PDGTs)], the relative proportion of IPLs to the total lipid content [f(IPL-BDGTs), f(IPL-PDGTs)] and the relative proportion of BDGTs and
PDGTs to the total GDGT-0 content [(sum-BDGTs)/(sum-GDGT-0), (sum-PDGTs)/(sum-GDGT-0)] were computed (Suppl. 215 Table 2). This PCA separated three groups of samples that differed from their geochemical properties and their BDGT and PDGT content (Fig. 4). The first group contained all samples from the Rhone delta (sites GeoB17306, 7, 8) and is characterized by high concentrations of dissolved Fe and CH4, shallow water depths, and relatively low values of δ 13 CTOC (Fig. 4). This sample group shows a high proportion of BDGTs and PDGTs relative to GDGT-0, and a high proportion of IPL-BDGTs and IPL-PDGTs, especially as diglycosidic lipids (2Gs; Fig. 4). The second group of samples includes all samples collected in the 220 Black Sea (GeoB15105), with high concentrations of DIC, DOC, and HS -, and a low concentration of SO4 2-. The last group comprises all samples from basin sites, which are characterized by relatively lower TOC and lower terrestrial input, i.e. from the eastern Mediterranean basin (GeoB15103), the Marmara Sea (GeoB15104) and Cap de Creus and Ligurian-Provençal Basin (GeoB17302 and 17304). These samples all exhibit high water depth, less negative δ 13 CTOC values and higher concentrations in SO4 2-. The Black Sea and basin site groups are characterized by higher contributions of CL-and 1G-BDGTs 225 than the first group of river delta samples. Moreover, the Black Sea group differs from the other groups by its high proportion of CL-BDGTs and its lack of PDGTs (both CL and IPL) while the basin sites are relatively enriched in 1G-PDGTs (Fig. 4).

Potential source organisms of BDGTs and PDGTs
δ 13 C analysis of membrane lipids in environmental samples provides insight into the carbon metabolism that promoted their biosynthesis and can thus help to assign their source organisms (Hayes, 2001;Pearson, 2010). In this study, six marine sediment samples exhibiting relatively high concentrations of BDGTs and representing contrasting geochemical settings were selected 240 for δ 13 C analysis of lipid biomarker and carbon substrate pools to investigate the potential drivers of BDGT distributions in the environment. PDGT concentrations were too low to allow determination of their δ 13 C values. BDGT-derived biphytanes (δ 13 CBDGTs) exhibited systematically more negative δ 13 C values than GDGT-derived biphytanes (δ 13 CGDGTs) at all sites, pointing to a distinct origin of BDGTs (Fig. 5). Moreover, the range of δ 13 CBDGTs (between -56 and -28‰) is suggestive of a predominant benthic rather than planktonic source for these compounds (Hoefs et al., 1997;Pearson et al., 2001). Accordingly, IPL-BDGTs 245 and IPL-PDGTs, supposedly representative of extant organisms, constituted more than 50% of the whole BDGT and PDGT pool in more than 90% of the sediment samples where they were detected ( Fig. 3 and Suppl. Table 2), further supporting a benthic origin of BDGTs and PDGTs.
In the Black Sea and Rhone delta (GeoB15105 and GeoB17306), BDGT-derived biphytanes had similarly low δ 13 C values (i.e., < -40‰), suggesting that BDGTs were derived from functionally related archaeal phyla at these two sites (Fig. 5). These 250 values are also consistent with those observed by Meador et al. (2015) for IPL-BDGTs in the White Oak River Estuary, further suggesting a common source for BDGTs in these settings influenced by high inputs of terrestrial organic matter. By contrast, δ 13 CBDGTs values in the sapropel layers of the Eastern Mediterranean Basin (GeoB15103) were up to 20‰ higher than δ 13 CBDGTs values in the sediments of the Rhone delta and Black Sea (Fig. 5).
A predominant methanotrophic origin for BDGTs is unlikely, as δ 13 CBDGTs values were generally higher than δ 13 CCH4 (Fig. 5), 255 which contrasts with the relationship found in lipids from anaerobic methane-oxidizing archaea at seeps (Hinrichs et al., 1999;2000). Pearson et al. (2010) reported fractionation effects (ε)between 7‰ and 36‰ between CO2 and the produced biomass in the case of autotrophic pathways. In this study, δ 13 C between CO2 and BDGTs was calculated considering a fractionation of -10.7‰ between dissolved CO2 and DIC in marine sediments (Mook et al., 1974;Fig. 5). In the Rhone delta and shallow Black Sea, δ 13 C ranged between 21‰ and 45‰ in agreement with an autotrophic metabolism consistent with the initial 260 suggestions by Meador et al. (2015). Alternatively, it cannot be excluded that the low δ 13 CBDGTs may be due to consumption of organic substrates that are more depleted in 13 C than bulk TOC. In addition, some methanogenic processes, based on CO2 fixation (hydrogenotrophic methanogenesis) or incorporation of methylated compounds (methylotrophic methanogenesis), induce large carbon isotopic fractionation during lipid biosynthesis (Londry et al., 2008;Summons et al., 1998) and could also explain the relatively low δ 13 CBDGTs values (i.e. < -20‰). On the contrary, in the sapropels and in the deeper horizon of the 265 Black Sea, δ 13 CCO2-BDGT ranged between 9‰ and 12‰ (Fig. 5). This suggests an alternative carbon metabolism and/or phylogenetic group of BDGT producers. In particular, the heterotrophic incorporation of organic compounds usually induces a small isotopic fractionation between the substrates and the membrane lipids (Pearson, 2010). Some methanogens can use acetate as a carbon source and a small isotopic fractionation has been observed in this case (Londry et al., 2008;Summons et al., 1998). This could hold true for the deep sample from the Black Sea. However, the relatively high concentrations of BDGTs 270 and PDGTs in the Mediterranean sapropel samples (Suppl. Table 2) combined with low rates of methanogenesis, as indicated by the low CH4 concentrations (Suppl. Table 1), strongly suggest that methanogens cannot account for the BDGT pool in this setting. It is therefore likely that other archaea with non-methanogenic metabolisms are the sources of BDGTs in the Mediterranean sapropels.
patterns. The eastern Mediterranean Basin samples clustered with the other basin site samples of the dataset (Marmara Sea, Cap de Creus and Ligurian-Provençal Basin; Fig. 4). The Black Sea samples formed a second cluster and the Rhone delta samples formed the last cluster in the PCA. The first two clusters were characterized by a predominance of 1G-BDGTs (and 1G-PDGTs; Suppl. Table 2) but the Black Sea differed from the other basin sites due to higher proportions of CL-BDGTs than in the rest of the sample set as well as the absence of PDGTs. Additional contribution of BDGTs produced within the anoxic 280 Black Sea bottom waters (Schröder, 2015) could explain the higher proportion of CL-BDGTs at this site. On the contrary, the last cluster (formed by Rhone delta samples) showed a predominance of 2G-BDGTs and 2G-PDGTs (Fig. 4). Meador et al.
(2015) similarly observed a higher relative abundance of 2G-BDGTs and 2G-PDGTs in the White Oak River Estuary, another river mouth highly influenced by terrestrial input. By contrast, only 1G bearing BDGTs were observed in the membrane of M.
Overall, our data infer that BDGT (and PDGT) producers may comprise an autotrophic, potentially methanogenic community as well as a heterotrophic, likely not methanogenic community. A methanogenic origin of BDGTs (and PDGTs) is in agreement with their prominence in M. luminyensis (Becker et al., 2016), the only cultured representative of Methanomassiliicoccales, a novel order of methanogens (Dridi et al., 2012). However, heterotrophic (acetate) C incorporation was so far suggested for 290 Methanomassiliicoccales (Borrel et al., 2014;Lang et al., 2015;Söllinger et al., 2016), which cannot explain the observed carbon isotopic composition of BDGTs in the Rhone delta and the Black Sea if we consider the δ 13 C of acetate to be similar to δ 13 CTOC. More studies on Methanomassiliicoccales carbon metabolism are needed to accurately relate the presence of BDGTs in the environment to this archaeal group. Furthermore, the similar distributions of BDGTs and 16S rRNA gene clones from Bathyarchaeota in estuarine sediments led Meador et al. (2015) to suggest that this group is a putative source of BDGTs. 295 Bathyarchaeota are widespread in marine sediments, notably in the deep subsurface horizons (e.g. Lloyd et al., 2013). They are phylogenetically highly diverse (e.g. Lazar et al., 2015) and contain diverse metabolic groups (e.g. Lazar et al., 2016;Yu et al., 2018;Zhou et al., 2018) involving both autotrophic and heterotrophic lifestyles. Methanogenic capacities have also been identified for some members of this clade (Evans et al., 2015). The contrasted BDGT carbon isotopic composition observed in the different sites of the present study could thus be explained by these versatile metabolic capabilities. It is thus conceivable 300 that the ubiquitous Bathyarchaeota, including methanogens, are an additional source for BDGTs (and PDGTs) in the environment. Zhu et al. (2014) demonstrated the 1,2,3 butanetriol structure of the BDGT backbone via gas chromatography MS, following ether cleavage from its biphytanyl chains. In the present study, 2D-NMR analysis confirmed the presence of a butanetriol 305 backbone and unequivocally determined its configuration in the tetraether molecule (Fig. 2). Additional methylations have been previously observed on the isoprenoid chains or as methoxylation on the glycerol in different lipid classes (e.g. Elling et al., 2017;Knappy et al., 2015). However, BDGTs (and PDGTs) stand out as unique archaeal membrane lipids that contain a non-glycerol moiety. This raises two fundamental questions: (i) how are these lipids biosynthesized? and (ii) why do microorganisms produce them? 310

BDGTs as a putative adaptive trait of subsurface archaea
For every domain of life, it is known that dihydroxyacetone phosphate (DHAP), an intermediate in glycolysis, serves as a precursor of the glycerol moiety in membrane lipid backbones (Koga and Morii, 2007). At the early stage of membrane lipid biosynthesis, DHAP is converted by stereospecific glycerol dehydrogenase enzymes into either a glycerol-3-phosphate (G-3-P) or a glycerol-1-phosphate (G-1-P) in Bacteria and Archaea, respectively. The existence of BDGTs (and PDGTs) imply that different precursors must be involved at the very first steps of lipid biosynthesis. Knappy et al. (2014) suggested, for example, one gene for 3-O-geranylgeranyl-sn-glyceryl-1-phosphate (GGGP) synthase was identified (Becker et al., 2016), but no second homologue that might encode a hypothetical enzyme catalyzing the formation of butanetriol-or pentanetriol-based intermediates. Alternatively, BDGTs and PDGTs could be regular GDGTs, which underwent additional methylation at the final stages of their biosynthesis. Welander et al. (2010) showed that an S-adenosylmethionine (SAM) enzyme catalyzing a 320 radical reaction was responsible for the methylation of certain bacterial hopanoids at the C2 position. We found 13 genes annotated as belonging to the radical SAM superfamily in the permanent draft genome of M. luminyensis B10 (IMG/M website, IMG Submission ID 11458; Chen et al., 2019). Very recently, another SAM enzyme was proposed to be involved in the biosynthesis of calditol based lipids in the archaeal strain Sulfolobus acidocaldarius (Zeng et al., 2018). A similar mechanism could explain the structure of BDGTs and PDGTs. The recent observation of such radical-mediated reactions on un-activated 325 carbon atoms in both bacterial and archaeal strains (Zhou et al., 2016) implies that this may be a common mechanism to adapt their lipid envelope to the surrounding environment.
The ubiquitous presence of BDGTs and PDGTs in the environment and their active biosynthesis by certain organisms, notably M. luminyensis (Becker et al., 2016), signify that they have likely offered an evolutionary advantage to their producers. Two trends emerged from the concentration and distribution of BDGTs and PDGTs in the Mediterranean and the Black Seas. First, 330 the proportion of BDGTs and PDGTs relative to GDGT-0 ((sum-BDGTs)/(sum-GDGT-0), (sum-PDGTs)/(sum-GDGT-0)) remained stable or even increased with depth (Suppl. Table 2). Moreover, unlike GDGT-0, the IPL form of BDGTs and PDGTs dominates their distribution at every depth (Fig. 3). This suggests that IPL-BDGTs and IPL-PDGTs may be specifically produced by sedimentary archaea, as Meador et al. (2015) previously proposed, and/or that they are more resistant against extracellular hydrolytic enzymes. The highest constraint for life in subsurface sediments is the lack of energy, which selects 335 for microorganisms that limit their energy requirement to the most essential functions (e.g. Bradley et al., 2019;Hoehler and Jørgensen, 2013). Hypothetically, the additional methyl or ethyl group could increase the stability of the cell membrane by sterically hindering access of extracellular enzymes responsible for the lysis of the glycosidic bond that links the mono-or disaccharide headgroups, thereby preserving the IPL form and decreasing the maintenance energy demand of these organisms for lipid repair. 340

Conclusion
The unique structure of BDGTs, here unambiguously elucidated by NMR experiments, further increases the diversity of membrane lipids observed in Archaea. BDGTs and PDGTs were detected in a large set of marine sediment samples from diverse geochemical, depth and age conditions, highlighting their widespread presence in marine sediments. Within the dataset, major differences are also observed in the BDGT and PDGT headgroup distribution patterns and 13/12 C content of BDGT-345 derived biphytanes relative to DIC. Thus, it seems likely that BDGT and PDGT biosynthesis may be present across different archaeal phyla relying on different carbon metabolisms. A common trait of the subseafloor sample set is the high contribution of BDGTs, especially in their intact polar form, to the total lipid pool. The specific 1,2,3 butanetriol structure of their backbone could then be interpreted as an adaptive trait of sedimentary archaea to energy limited environments.

Sample availability
Samples are stored at MARUM -Center for Marine Environmental Sciences, University of Bremen, Germany. Sample aliquots may be requested from Prof. Dr. Kai-Uwe Hinrichs.

Competing interests
The authors declare that they have no conflict of interest. 370 Table   Table 1: 13 C-and 1 H-NMR chemical shifts of BDGT-0 as well as key correlations from 1 H-13 C HSQC and HMBC spectra.
Carbon numbers correspond to the carbon atoms in Fig. 2. Overlapped peaks are given as ranges (i.e. 1.17-1.21 ppm).   Table 1, with C4' representing the additional carbon detected in BDGTs. . Lipid samples were measured in duplicate and are presented as averages; deviations between individual measurements were generally smaller than the symbol size. On the right panel, the difference (Δ) between δ 13 CCO2 and δ 13 CBDGT is computed for each sample. δ 13 CCO2 was derived from δ 13 CDIC based on an isotope fractionation of -10.7‰ between dissolved CO2 and DIC, considering in situ temperatures to be below 10 °C, according to Mook et al. (1974).