<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0"><?xmltex \makeatother\@nolinetrue\makeatletter?>
  <front>
    <journal-meta><journal-id journal-id-type="publisher">BG</journal-id><journal-title-group>
    <journal-title>Biogeosciences</journal-title>
    <abbrev-journal-title abbrev-type="publisher">BG</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Biogeosciences</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1726-4189</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/bg-17-6145-2020</article-id><title-group><article-title>Uncovering chemical signatures of salinity gradients through compositional
analysis of protein sequences</article-title><alt-title>Chemical compositions of proteins in salinity gradients</alt-title>
      </title-group><?xmltex \runningtitle{Chemical compositions of proteins in salinity gradients}?><?xmltex \runningauthor{J.~M.~Dick et al.}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1 aff2">
          <name><surname>Dick</surname><given-names>Jeffrey M.</given-names></name>
          <email>jeff@chnosz.net</email>
        <ext-link>https://orcid.org/0000-0002-0687-5890</ext-link></contrib>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Yu</surname><given-names>Miao</given-names></name>
          <email>yumiao1987@pku.edu.cn</email>
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Tan</surname><given-names>Jingqiang</given-names></name>
          
        </contrib>
        <aff id="aff1"><label>1</label><institution>Key Laboratory of Metallogenic Prediction of Nonferrous Metals and
Geological Environment Monitoring,<?xmltex \hack{\break}?> Ministry of Education, School of
Geosciences and Info-Physics, Central South University,<?xmltex \hack{\break}?> Changsha 410083,
China</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>State Key Laboratory of Organic Geochemistry, Guangzhou Institute
of Geochemistry,<?xmltex \hack{\break}?> Chinese Academy of Sciences, Guangzhou 510640, China</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Jeffrey M. Dick (jeff@chnosz.net) and Miao Yu (yumiao1987@pku.edu.cn)</corresp></author-notes><pub-date><day>8</day><month>December</month><year>2020</year></pub-date>
      
      <volume>17</volume>
      <issue>23</issue>
      <fpage>6145</fpage><lpage>6162</lpage>
      <history>
        <date date-type="received"><day>27</day><month>April</month><year>2020</year></date>
           <date date-type="rev-request"><day>15</day><month>May</month><year>2020</year></date>
           <date date-type="rev-recd"><day>21</day><month>October</month><year>2020</year></date>
           <date date-type="accepted"><day>23</day><month>October</month><year>2020</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2020 Jeffrey M. Dick et al.</copyright-statement>
        <copyright-year>2020</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://bg.copernicus.org/articles/17/6145/2020/bg-17-6145-2020.html">This article is available from https://bg.copernicus.org/articles/17/6145/2020/bg-17-6145-2020.html</self-uri><self-uri xlink:href="https://bg.copernicus.org/articles/17/6145/2020/bg-17-6145-2020.pdf">The full text article is available as a PDF file from https://bg.copernicus.org/articles/17/6145/2020/bg-17-6145-2020.pdf</self-uri>
      <abstract><title>Abstract</title>
    <p id="d1e112">Prediction of the direction of change of a system under specified
environmental conditions is one reason for the widespread utility
of thermodynamic models in geochemistry. However, thermodynamic influences
on the chemical compositions of proteins in nature have remained enigmatic
despite much work that demonstrates the impact of environmental conditions
on amino acid frequencies. Here, we present evidence that the dehydrating
effect of salinity is detectable as chemical differences in protein
sequences inferred from (1) metagenomes and metatranscriptomes in regional
salinity gradients and (2) differential gene and protein expression
in microbial cells under hyperosmotic stress. The stoichiometric hydration
state (<inline-formula><mml:math id="M1" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>), derived from
the number of water molecules in theoretical reactions to form proteins
from a particular set of basis species (glutamine, glutamic acid,
cysteine, O<inline-formula><mml:math id="M2" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>, H<inline-formula><mml:math id="M3" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O), decreases along
salinity gradients, including the Baltic Sea and Amazon River and ocean
plume, and decreases in particle-associated compared to free-living fractions.
However, the proposed metric does not respond as expected for hypersaline
environments. Analysis of data compiled for hyperosmotic stress experiments
under controlled laboratory conditions shows that differentially expressed
proteins are on average shifted toward lower <inline-formula><mml:math id="M4" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>.
Notably, the dehydration effect is stronger for most organic solutes
compared to NaCl. This new method of compositional analysis can be
used to identify possible thermodynamic effects in the distribution
of proteins along chemical gradients at a range of scales from microbial
mats to oceans.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d1e176">How microbial populations adapt to environmental gradients is a major
challenge at the intersection of geochemistry, microbiology, and biochemistry.
Patterns of amino acid usage in proteins are important indicators
of microbial adaptation, and amino acid composition at the genome
level is well known to depend on growth temperature <xref ref-type="bibr" rid="bib1.bibx95" id="paren.1"/>.
Furthermore, measures of evolutionary distance and community composition
based on protein sequences predicted from metagenomic sequencing are
strongly associated with environmental temperature and pH <xref ref-type="bibr" rid="bib1.bibx3" id="paren.2"/>.
It is widely acknowledged that the effect of amino acid substitutions
on the structural stability of proteins is a major factor affecting
amino acid usage in thermophiles <xref ref-type="bibr" rid="bib1.bibx84 bib1.bibx95" id="paren.3"/>. Similarly, a
large body of work has demonstrated amino acid signatures associated
with proteins from halophilic organisms <xref ref-type="bibr" rid="bib1.bibx51 bib1.bibx66 bib1.bibx65 bib1.bibx12" id="paren.4"/>.
The most common interpretation of these trends is that particular
amino acid substitutions are selected through evolution to increase
the stability and solubility of the folded conformation and enhance
other structural properties such as flexibility <xref ref-type="bibr" rid="bib1.bibx66" id="paren.5"/>.</p>
      <?pagebreak page6146?><p id="d1e194">An interrelated approach to interpreting patterns of amino acid composition
is based on the energetics of amino acid synthesis. Energetic costs
in terms of ATP (adenosine triphosphate) requirements have been used to model protein expression
levels in bacterial and yeast cells <xref ref-type="bibr" rid="bib1.bibx2 bib1.bibx89" id="paren.6"/>. Although
ATP demands depend on environmental conditions <xref ref-type="bibr" rid="bib1.bibx2" id="paren.7"/>, a limitation
of ATP-based models is that they are derived for specific biosynthetic
pathways, such as whether cells are grown in respiratory or fermentative
(i.e., aerobic or anaerobic) conditions <xref ref-type="bibr" rid="bib1.bibx89" id="paren.8"/>. A different
class of models, based on thermodynamic analysis of the overall Gibbs
energy of reactions to synthesize metabolites from inorganic precursors,
quantifies the energetics of the reactions in terms of temperature,
pressure, and chemical activities of all the species in the reactions,
including those that define pH and oxidation–reduction potential <xref ref-type="bibr" rid="bib1.bibx80" id="paren.9"/>.
Notably, the overall Gibbs energies for amino acid synthesis become
more favorable but to a different extent for each amino acid, between
cold, oxidizing seawater and hot, reducing hydrothermal solution <xref ref-type="bibr" rid="bib1.bibx5" id="paren.10"/>.
A recent systems biology study demonstrates trade-offs between Gibbs
energy of alternative pathways for amino acid synthesis and cofactor
use efficiency (which affects ATP costs) in the model organism <italic>Escherichia coli</italic> and suggests that pathway thermodynamics play a role in thermophilic
adaptation <xref ref-type="bibr" rid="bib1.bibx27" id="paren.11"/>. The oxidation state of proteins as well
as lipids has been shown to be associated with oxidation–reduction
(redox) gradients in a hot spring <xref ref-type="bibr" rid="bib1.bibx25 bib1.bibx13" id="paren.12"/>, but so far
energetic models have not been broadly adopted as a tool for relating
metagenomic and geochemical data. This may be because few studies
have asked whether specific changes in the chemical composition of
biomolecules reflect specific environmental conditions.</p>
      <p id="d1e222">To help close this gap, here we use compositional analysis of protein
sequences to identify chemical signatures of two types of environmental
conditions: redox and salinity gradients. In a previous study <xref ref-type="bibr" rid="bib1.bibx26" id="paren.13"/>,
we compared one broad class of geochemical conditions (redox gradients)
with one compositional metric for proteins (carbon oxidation state).
Here, we expand the geobiochemical framework to two dimensions by
considering another set of environments (salinity gradients) and another
compositional metric (stoichiometric hydration state). Thermodynamic
considerations predict that redox gradients supply a driving force
for changes in the oxidation state of biomolecules (similar reasoning
applies to the oxygen content of proteins; <xref ref-type="bibr" rid="bib1.bibx1" id="altparen.14"/>), while
salinity gradients, through the dehydrating potential associated with
osmotic effects, exert a force that selectively alters the hydration
state of biomolecules.</p>
      <p id="d1e231">To test these predictions, we used two compositional metrics: the
carbon oxidation state (<inline-formula><mml:math id="M5" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) and stoichiometric
hydration state (<inline-formula><mml:math id="M6" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>). <inline-formula><mml:math id="M7" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
is computed from the chemical formulas of organic molecules and takes
values between the extremes of <inline-formula><mml:math id="M8" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>4 for CH<inline-formula><mml:math id="M9" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">4</mml:mn></mml:msub></mml:math></inline-formula> and <inline-formula><mml:math id="M10" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula>4
for CO<inline-formula><mml:math id="M11" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>, although the range for particular classes
of biomolecules is much smaller <xref ref-type="bibr" rid="bib1.bibx6" id="paren.15"/>. <inline-formula><mml:math id="M12" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
is derived from the number of water molecules in theoretical formation
reactions of proteins from basis species <xref ref-type="bibr" rid="bib1.bibx20 bib1.bibx21" id="paren.16"/>. Through
the compositional analysis of representative metagenomic and metatranscriptomic
datasets, we show that <inline-formula><mml:math id="M13" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M14" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
are most closely aligned with environmental redox and salinity gradients,
respectively. These findings apply to freshwater and marine environments,
but trends for hypersaline environments deviate from the thermodynamic
predictions, most likely due to evolutionary optimizations of hydrophobicity
and isoelectric point to stabilize the structures of proteins in halophilic
organisms.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Conceptual background</title>
      <p id="d1e366">In this study we use compositional analysis to uncover environmental
imprints in protein sequences. Analysis of compositional data is used
by geochemists to study processes such as water–rock interaction and
ore deposition and is often one of the first steps in constructing
thermodynamic models, but its application to living systems is relatively
uncommon. Therefore, it is important to describe the conceptual basis
for our methods. To do this, we identified six areas of concern summarized
as (1) intracellular or environmental conditions, (2) amino acids or
atoms, (3) condensation or theoretical formation reactions, (4) chemical
composition or conformational stability, (5) oxidation and hydration
state or temperature and pH, and (6) mathematical or biosynthetic models.</p>
      <?pagebreak page6147?><p id="d1e369">A first concern is that intracellular conditions are maintained
within physiological ranges, so the influence of external conditions
on the composition of microbial biomolecules may be limited. However,
cell membranes are permeable to uncharged species such as hydrogen
<xref ref-type="bibr" rid="bib1.bibx82" id="paren.17"/>, supporting the argument that the oxidation state
of the cytoplasm and therefore the energetics of metabolic reactions
are influenced by the external environment <xref ref-type="bibr" rid="bib1.bibx67 bib1.bibx16" id="paren.18"/>. Likewise,
oxygen diffuses rapidly through lipid membranes, depending on their
composition and structure, and rates of diffusion increase with temperature
<xref ref-type="bibr" rid="bib1.bibx62" id="paren.19"/>. Cell membranes are also permeable to water <xref ref-type="bibr" rid="bib1.bibx70" id="paren.20"/>.
For <italic>E. coli</italic>, which grows most rapidly at about 0.3 Osm L<inline-formula><mml:math id="M15" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> (osmolarity),
increasing the extracellular osmotic strength from 0.1 to 1.0 Osm L<inline-formula><mml:math id="M16" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> (approximately the osmotic concentration of seawater; BioNumbers BNID
100802 <xref ref-type="bibr" rid="bib1.bibx59" id="paren.21"/>) reduces the amount of free cytoplasmic water
by more than half <xref ref-type="bibr" rid="bib1.bibx70" id="paren.22"/>. Halophiles, which thrive at even
higher salinities, accumulate inorganic salts or organic solutes to
maintain osmotic balance with the environment <xref ref-type="bibr" rid="bib1.bibx34 bib1.bibx65" id="paren.23"/>.
The result is that, with few exceptions, intracellular conditions
must be isosmotic with the environment, or somewhat higher, to maintain
turgor pressure <xref ref-type="bibr" rid="bib1.bibx37" id="paren.24"/>. Water activity is lower in more concentrated
solutions, and intracellular water activity estimated from freezing
point and cell composition data closely follows that of the growth
medium but is often offset to lower values <xref ref-type="bibr" rid="bib1.bibx17" id="paren.25"/>, perhaps
due to macromolecular crowding effects <xref ref-type="bibr" rid="bib1.bibx34" id="paren.26"/>. To summarize,
high osmotic strength causes a decrease in hydration potential, measured
as water activity, both outside and inside cells.</p>
      <p id="d1e431">This brief review suggests that oxidation and hydration potentials
in cell interiors, at least under experimental conditions, are influenced
by (but not equal to) environmental conditions. Ideally, we would
like to compare the compositions of biomolecules to conditions actually
measured inside cells or in the immediate surroundings of cells, but
these measurements are generally not available for microbial communities
in their natural environments; thus, we make comparisons with large-scale
geochemical gradients, except for different layers of the Guerrero
Negro microbial mat, where metagenomic and chemical data are available
on the scale of millimeters.</p>
      <p id="d1e434">Second, previous authors have emphasized the importance of changes
in elemental stoichiometry – that is, atomic composition – and
not only amino acid composition in the molecular evolution of proteins
<xref ref-type="bibr" rid="bib1.bibx9" id="paren.27"/>. Although stoichiometric predictions are amenable
to experimental tests, such as the long-term evolution of <italic>E. coli</italic> in the laboratory <xref ref-type="bibr" rid="bib1.bibx87" id="paren.28"/>, the omission of a major bioelement,
hydrogen, and the oxidation state of organic matter from most stoichiometric
models <xref ref-type="bibr" rid="bib1.bibx44" id="paren.29"/> means that there are also significant opportunities
for theory development. Because redox reactions are inherent in many
aspects of metabolism, while hydration and dehydration reactions are
essential for the synthesis of biomacromolecules <xref ref-type="bibr" rid="bib1.bibx14" id="paren.30"/>, our
approach is shaped by the assumption that O<inline-formula><mml:math id="M17" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> and
H<inline-formula><mml:math id="M18" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O are two primary components that link environmental
conditions to the energetics of biomolecular synthesis.</p>
      <p id="d1e472">The third point follows from the previous one. The polymerization
of amino acids is a condensation reaction that releases one H<inline-formula><mml:math id="M19" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O
per bond formed, independent of the particular amino acids that are
involved. By contrast, our analysis depends crucially on the concept
of a “formation reaction”, which in the thermodynamic literature
represents the composition of a chemical species, either in terms
of elements <xref ref-type="bibr" rid="bib1.bibx92" id="paren.31"/> or in terms of other species <xref ref-type="bibr" rid="bib1.bibx58" id="paren.32"/>.
When these other species are restricted in number to the minimum needed
to represent the composition of all possible species in the system,
they constitute a set of “basis species”, which
can be thought of as the building blocks of the system, similar to
the concept of thermodynamic components <xref ref-type="bibr" rid="bib1.bibx7" id="paren.33"/>. Therefore,
a formation reaction from basis species is a mass-balanced (but nonunique)
stoichiometric representation of the chemical composition of the protein.
This type of reaction in general does not correspond to amino acid
biosynthesis or polymerization, so to avoid confusion, we refer to
these formation reactions as “theoretical formation
reactions”; the number of water molecules in the
theoretical formation reactions, normalized by the protein length,
is the “stoichiometric hydration state”.</p>
      <p id="d1e493">From a mechanistic standpoint, an analysis using any set of basis
species is inadequate, since the number of basis species (five, corresponding
to the elements C, H, N, O, and S) is smaller than the number of biochemical
precursors and inorganic species that are actually involved in amino
acid synthesis <xref ref-type="bibr" rid="bib1.bibx27" id="paren.34"/>. The use of O<inline-formula><mml:math id="M20" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>, H<inline-formula><mml:math id="M21" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O,
and other basis species to represent the composition of proteins reflects
the hypothesis that they are conjugate to thermodynamically meaningful
descriptive variables (specifically, chemical potentials) even if
they are not directly involved in the biosynthetic mechanisms for
amino acids. The projection of amino acid composition (20-D) into
the compositional space represented by basis species (5-D) is a type
of dimensionality reduction, but the variables are chosen based on
a physicochemical hypothesis, unlike principal components analysis
(PCA) or other unsupervised methods, where the projection is determined
by the data.</p>
      <p id="d1e517">A fourth concern is that this analysis is based on the hypothesis
that thermodynamic forces affect the chemical compositions of proteins
over evolutionary time, which is different from the more common hypothesis
of optimization of structural stability. Thermodynamic models define
the “cost” of a protein as a function
of not only amino acid composition but also environmental conditions.
Conceptually, this follows from Le Chatelier's principle, in that
increasing the chemical activity of a reactant (on the left-hand side
of a reaction) drives the reaction toward the products. Stated in
more general terms, the overall Gibbs energy of a reaction depends
on the activities of species in the reaction <xref ref-type="bibr" rid="bib1.bibx80 bib1.bibx4" id="paren.35"/>.
Consider two proteins with different amino acid compositions and
therefore also different chemical compositions and theoretical formation
reactions, which should be normalized by the number of residues in
order to compare proteins of different length. The formation of the
protein with more water as a reactant is theoretically favored by
increasing the water activity, whereas the formation of the protein
with more oxygen as a reactant is favored by increasing the oxygen
activity. The water and oxygen activity are thermodynamic measures
of hydration and oxidation potential and can be converted to other
scales, such as oxidation–reduction potential (ORP).</p>
      <?pagebreak page6148?><p id="d1e523">This reasoning provides the theoretical justification for using chemical
composition as an indicator of molecular adaptation to specific environmental
conditions but does not replace interpretations based on structural
considerations. Halophilic organisms exhibit well documented patterns
of amino acid usage, including lower hydrophobicity and higher abundance
of acidic residues that impart greater stability, solubility, and
flexibility of proteins <xref ref-type="bibr" rid="bib1.bibx66" id="paren.36"/>. These adaptations are reflected
in lower values of the GRAVY hydrophobicity scale <xref ref-type="bibr" rid="bib1.bibx66 bib1.bibx12" id="paren.37"/>
and/or isoelectric point of proteins (pI) <xref ref-type="bibr" rid="bib1.bibx65" id="paren.38"/>. In Sect. <xref ref-type="sec" rid="Ch1.S4.SS3"/> and <xref ref-type="sec" rid="Ch1.S4.SS4"/>,
we compare the compositional metrics with GRAVY and pI for the same
datasets.</p>
      <p id="d1e539">Fifth, temperature, pH, and other environmental parameters besides
redox and salinity might influence the oxidation and hydration state
of proteins. For instance, the redox gradients in hydrothermal systems
are also temperature gradients, due to the mixing of seawater and
hydrothermal fluid, and we have not attempted to disentangle the effects
of temperature and redox conditions. However, our previous analysis
of other redox gradients, including stratified hypersaline lakes,
indicates that the carbon oxidation state of biomolecules can vary even
in systems where temperature changes are much smaller <xref ref-type="bibr" rid="bib1.bibx26" id="paren.39"/>.
It is an axiomatic statement that changes in oxidation state can be
associated with one thermodynamic component of a system; our objective
in the present study is to explore the differences between this and
one other component, represented by hydration state. Future work should
also account for the effects of pH and temperature, which is possible
using thermodynamic models for proteins <xref ref-type="bibr" rid="bib1.bibx25" id="paren.40"/>.</p>
      <p id="d1e548">Finally, it should be noted that the basis species used in the stoichiometric
analysis are chosen primarily for mathematical convenience and not because
of evolutionary or biosynthetic requirements. The main criterion we
consider for the choice of basis species is to reduce the covariation
between the metrics for oxidation and hydration state, which arises
as a mathematical consequence of projecting the atomic formulas of
proteins into a particular compositional space, and may not reflect
meaningful differences of chemical composition. Additional considerations
are described in Sect. <xref ref-type="sec" rid="Ch1.S3.SS2"/>.</p>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Methods</title>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Carbon oxidation state</title>
      <p id="d1e568">The most common metric used in geochemistry for the oxidation state
of organic molecules is the average oxidation state of carbon (<inline-formula><mml:math id="M22" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>),
which also goes by other names such as nominal oxidation state of
carbon (NOSC) <xref ref-type="bibr" rid="bib1.bibx53" id="paren.41"/>. This quantity measures the average degree
of oxidation of carbon atoms in organic molecules. For a protein for
which the primary sequence has the chemical formula C<inline-formula><mml:math id="M23" display="inline"><mml:msub><mml:mi/><mml:mi>c</mml:mi></mml:msub></mml:math></inline-formula>H<inline-formula><mml:math id="M24" display="inline"><mml:msub><mml:mi/><mml:mi>h</mml:mi></mml:msub></mml:math></inline-formula>N<inline-formula><mml:math id="M25" display="inline"><mml:msub><mml:mi/><mml:mi>n</mml:mi></mml:msub></mml:math></inline-formula>O<inline-formula><mml:math id="M26" display="inline"><mml:msub><mml:mi/><mml:mi>o</mml:mi></mml:msub></mml:math></inline-formula>S<inline-formula><mml:math id="M27" display="inline"><mml:msub><mml:mi/><mml:mi>s</mml:mi></mml:msub></mml:math></inline-formula>,
the value of <inline-formula><mml:math id="M28" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> can be calculated from the following <xref ref-type="bibr" rid="bib1.bibx25 bib1.bibx19" id="paren.42"/>:
            <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M29" display="block"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>-</mml:mo><mml:mi>h</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi>o</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi>s</mml:mi></mml:mrow><mml:mi>c</mml:mi></mml:mfrac></mml:mstyle><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula></p>
      <p id="d1e686">The derivation of Eq. (<xref ref-type="disp-formula" rid="Ch1.E1"/>) is based on the relative electronegativities
of the elements, expressed as oxidation numbers <xref ref-type="bibr" rid="bib1.bibx45 bib1.bibx60" id="paren.43"><named-content content-type="pre">e.g.,</named-content></xref>.
When bonded to carbon, H is assigned an oxidation number of +1, and
N, O, and S have oxidation numbers of <inline-formula><mml:math id="M30" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>3, <inline-formula><mml:math id="M31" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>2, and <inline-formula><mml:math id="M32" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>2. Equation (<xref ref-type="disp-formula" rid="Ch1.E1"/>)
gives the remaining charge that must be present on each C atom, on
average, to satisfy overall neutrality. Because of the relatively
simple structures of amino acids and the primary structure of proteins,
in which N, O, and S are bonded to only H and C, it is possible to
calculate the average oxidation state of carbon using Eq. (<xref ref-type="disp-formula" rid="Ch1.E1"/>).
However, this equation is not necessarily valid for other classes
of organic molecules or some types of post-translational modifications
of proteins, including the formation of disulfide bonds. An important
relation inherent in Eq. (<xref ref-type="disp-formula" rid="Ch1.E1"/>) is the redox neutrality of
hydration and dehydration reactions; any pair of hypothetical (or
real) proteins whose formulas differ only by some amount of H<inline-formula><mml:math id="M33" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O
have equal carbon oxidation states.</p>
</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Choice of basis species: theoretical
considerations</title>
      <p id="d1e741">A major premise of this study is that oxidation state and hydration
state are two primary variables in geobiochemical systems. Accordingly,
when choosing the basis species that can be combined to make the proteins,
O<inline-formula><mml:math id="M34" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> and H<inline-formula><mml:math id="M35" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O are the only fixed requirements.
This leaves three basis species that when combined with each other
and with O<inline-formula><mml:math id="M36" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> and H<inline-formula><mml:math id="M37" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O must be able
to give any possible formula written as C<inline-formula><mml:math id="M38" display="inline"><mml:msub><mml:mi/><mml:mi>c</mml:mi></mml:msub></mml:math></inline-formula>H<inline-formula><mml:math id="M39" display="inline"><mml:msub><mml:mi/><mml:mi>h</mml:mi></mml:msub></mml:math></inline-formula>N<inline-formula><mml:math id="M40" display="inline"><mml:msub><mml:mi/><mml:mi>n</mml:mi></mml:msub></mml:math></inline-formula>O<inline-formula><mml:math id="M41" display="inline"><mml:msub><mml:mi/><mml:mi>o</mml:mi></mml:msub></mml:math></inline-formula>S<inline-formula><mml:math id="M42" display="inline"><mml:msub><mml:mi/><mml:mi>s</mml:mi></mml:msub></mml:math></inline-formula>.
We reiterate that this analysis refers to the chemical formulas of
polypeptide sequences, that is, the primary structure of proteins,
not post-translational modifications or H<inline-formula><mml:math id="M43" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O molecules
in the hydration shell of folded proteins.</p>
      <p id="d1e835">Equation (<xref ref-type="disp-formula" rid="Ch1.E1"/>) is derived from electronegativity relations and
therefore allows for the calculation of the carbon oxidation state from
a given chemical formula, independent of any chemical reactions. In
contrast, there is no way to count the number of H<inline-formula><mml:math id="M44" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O
molecules in a chemical formula; H<inline-formula><mml:math id="M45" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O appears only
in chemical reactions. But it is important to note that any particular
reaction that involves only H<inline-formula><mml:math id="M46" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O is redox neutral.
Conversely, the coefficient of O<inline-formula><mml:math id="M47" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> in redox reactions
is closely related to the number of electrons transferred. Let us
consider the 20 protein-forming amino acids as a baseline for compositional
analysis; the numbers of H<inline-formula><mml:math id="M48" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O and O<inline-formula><mml:math id="M49" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>
in the formation reactions of the amino acids from a particular set
of basis species are denoted by <inline-formula><mml:math id="M50" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
and <inline-formula><mml:math id="M51" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">O</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>. The choice of basis
species in our study is guided by the dual objectives that (1) <inline-formula><mml:math id="M52" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
of amino acids should have very little correlation with <inline-formula><mml:math id="M53" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
and (2) <inline-formula><mml:math id="M54" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">O</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> of amino acids
should be strongly correlated with <inline-formula><mml:math id="M55" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. It
should be emphasized that these are not criteria for “correctness”,
since basis species, like thermodynamic components, only have to be
the minimum number needed to represent the chemical composition of
all the species that can be formed from them <xref ref-type="bibr" rid="bib1.bibx7" id="paren.44"/>. Instead,
basis species selected using these conditions yield a convenient mathematical
projection of elemental composition; that is, nearly horizontal or
vertical trends on <inline-formula><mml:math id="M56" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>–<inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
scatterplots for proteins specifically
reflect changes in oxidation state or hydration state, respectively.</p>
      <p id="d1e1013">An additional consideration is that a biologically meaningful set
of basis species is likely to comprise metabolites that have high
network connectivity, that is, are involved in<?pagebreak page6149?> reactions with many
other metabolites. Reactions involving glutamine and glutamic acid (or its ionized form glutamate) are major steps of nitrogen metabolism
<xref ref-type="bibr" rid="bib1.bibx61 bib1.bibx18" id="paren.45"/>, and these amino acids have been characterized
as “nodal point” metabolites <xref ref-type="bibr" rid="bib1.bibx90" id="paren.46"/>. Either methionine
or cysteine would provide the sulfur required for the system, but
cysteine is relevant as a constituent of the glutathione molecule,
which has important roles in cellular redox chemistry <xref ref-type="bibr" rid="bib1.bibx90" id="paren.47"/>.
These considerations support the proposal of the amino acids glutamine,
glutamic acid, and cysteine (collectively abbreviated QEC) together
with O<inline-formula><mml:math id="M58" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> and H<inline-formula><mml:math id="M59" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O as a biologically
relevant set of basis species for describing the chemical compositions
of proteins <xref ref-type="bibr" rid="bib1.bibx20" id="paren.48"/>. These three amino acids are among the
top eight amino acids ranked by number of reactions in a metabolic
model for <italic>E. coli</italic> <xref ref-type="bibr" rid="bib1.bibx30" id="paren.49"/> (E: 52, S: 25, D: 23, Q:
18, A: 15, G: 15, M: 15, C: 13).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1" specific-use="star"><?xmltex \currentcnt{1}?><label>Figure 1</label><caption><p id="d1e1056">Stoichiometric numbers of H<inline-formula><mml:math id="M60" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O
and O<inline-formula><mml:math id="M61" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> for theoretical formation reactions
of amino acids computed with different sets of basis species, plotted
against carbon oxidation state (<inline-formula><mml:math id="M62" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>),
which is computed from the elemental formula and does not depend on
the choice of basis species. Linear regressions and <inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>
values were calculated using the <monospace>lm</monospace> function in R <xref ref-type="bibr" rid="bib1.bibx69" id="paren.50"/>.
<bold>(a–b)</bold> CO<inline-formula><mml:math id="M64" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>, NH<inline-formula><mml:math id="M65" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>,
H<inline-formula><mml:math id="M66" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>S, H<inline-formula><mml:math id="M67" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O, O<inline-formula><mml:math id="M68" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>
(CHNOS). <bold>(c–d)</bold> Glutamine, glutamic acid, cysteine, H<inline-formula><mml:math id="M69" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O,
O<inline-formula><mml:math id="M70" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> (QEC). <bold>(e)</bold> Scatterplot of <inline-formula><mml:math id="M71" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>
values for <inline-formula><mml:math id="M72" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>–<inline-formula><mml:math id="M73" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
fits against <inline-formula><mml:math id="M74" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values for <inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">O</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>–<inline-formula><mml:math id="M76" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
fits for all combinations of basis species consisting of H<inline-formula><mml:math id="M77" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O,
O<inline-formula><mml:math id="M78" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>, and three amino acids (including the points
labeled QEC and MWY (methionine, tryptophan, tyrosine)) or CO<inline-formula><mml:math id="M79" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>,
NH<inline-formula><mml:math id="M80" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>, H<inline-formula><mml:math id="M81" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>S, H<inline-formula><mml:math id="M82" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O,
and O<inline-formula><mml:math id="M83" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> (CHNOS).</p></caption>
          <?xmltex \igopts{width=426.791339pt}?><graphic xlink:href="https://bg.copernicus.org/articles/17/6145/2020/bg-17-6145-2020-f01.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><title>Choice of basis species: stoichiometric
analysis</title>
      <p id="d1e1334">Here we compute the stoichiometric hydration state by analyzing the
compositions of the 20 proteinogenic amino acids in detail. We start
with a “default” set of basis species chosen for their common
occurrence in overall catabolic reactions <xref ref-type="bibr" rid="bib1.bibx4" id="paren.51"/>: CO<inline-formula><mml:math id="M84" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>,
NH<inline-formula><mml:math id="M85" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>, H<inline-formula><mml:math id="M86" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>S, H<inline-formula><mml:math id="M87" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O,
and O<inline-formula><mml:math id="M88" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>. Using these basis species (designated CHNOS),
the theoretical formation reaction of alanine (C<inline-formula><mml:math id="M89" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>H<inline-formula><mml:math id="M90" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">7</mml:mn></mml:msub></mml:math></inline-formula>NO<inline-formula><mml:math id="M91" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>)
is

            <disp-formula id="Ch1.R2" content-type="numbered reaction"><label>R1</label><mml:math id="M92" display="block"><mml:mrow><mml:mn mathvariant="normal">3</mml:mn><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">CO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow><mml:mo>+</mml:mo><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">NH</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow><mml:mo>→</mml:mo><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">C</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">7</mml:mn></mml:msub><mml:msub><mml:mi mathvariant="normal">NO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">O</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:mrow></mml:math></disp-formula>
          and the oxygen and water content of the amino acid (i.e., <inline-formula><mml:math id="M93" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">O</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M94" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:math></inline-formula>) are the
opposite of the coefficients on O<inline-formula><mml:math id="M95" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> and H<inline-formula><mml:math id="M96" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O
in the reaction. Analogous reactions for the other amino acids were
used to make Fig. <xref ref-type="fig" rid="Ch1.F1"/>a–b. Using glutamine (C<inline-formula><mml:math id="M97" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">5</mml:mn></mml:msub></mml:math></inline-formula>H<inline-formula><mml:math id="M98" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">10</mml:mn></mml:msub></mml:math></inline-formula>N<inline-formula><mml:math id="M99" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O<inline-formula><mml:math id="M100" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>),
glutamic acid (C<inline-formula><mml:math id="M101" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">5</mml:mn></mml:msub></mml:math></inline-formula>H<inline-formula><mml:math id="M102" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">9</mml:mn></mml:msub></mml:math></inline-formula>NO<inline-formula><mml:math id="M103" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">4</mml:mn></mml:msub></mml:math></inline-formula>),
cysteine (C<inline-formula><mml:math id="M104" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>H<inline-formula><mml:math id="M105" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">7</mml:mn></mml:msub></mml:math></inline-formula>NO<inline-formula><mml:math id="M106" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>S),
H<inline-formula><mml:math id="M107" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O, and O<inline-formula><mml:math id="M108" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> (the QEC basis species),
the theoretical formation reaction of alanine is
            <disp-formula id="Ch1.R3" content-type="numbered reaction"><label>R2</label><mml:math id="M109" display="block"><mml:mtable rowspacing="0.2ex" columnspacing="1em" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mn mathvariant="normal">0.4</mml:mn><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">C</mml:mi><mml:mn mathvariant="normal">5</mml:mn></mml:msub><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">10</mml:mn></mml:msub><mml:msub><mml:mi mathvariant="normal">N</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:msub><mml:mi mathvariant="normal">O</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">0.2</mml:mn><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">C</mml:mi><mml:mn mathvariant="normal">5</mml:mn></mml:msub><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">9</mml:mn></mml:msub><mml:msub><mml:mi mathvariant="normal">NO</mml:mi><mml:mn mathvariant="normal">4</mml:mn></mml:msub></mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">0.6</mml:mn><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>→</mml:mo><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">C</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">7</mml:mn></mml:msub><mml:msub><mml:mi mathvariant="normal">NO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">0.3</mml:mn><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">O</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
          showing that the oxygen and water content are <inline-formula><mml:math id="M110" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">O</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.3</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M111" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.6</mml:mn></mml:mrow></mml:math></inline-formula>. Calculations
for all the amino acids using the QEC basis were used to make Fig. <xref ref-type="fig" rid="Ch1.F1"/>c–d.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T1"><?xmltex \currentcnt{1}?><label>Table 1</label><caption><p id="d1e1799">Values of stoichiometric hydration state (<inline-formula><mml:math id="M112" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>)
of amino acids calculated with the QEC basis species (glutamine, glutamic
acid, cysteine, H<inline-formula><mml:math id="M113" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O, O<inline-formula><mml:math id="M114" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>), average oxidation state of carbon (<inline-formula><mml:math id="M115" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>),
and number of carbon atoms (<inline-formula><mml:math id="M116" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>). Standard
one-letter abbreviations for the amino acids (denoted AA) are used.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="9">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="left"/>
     <oasis:colspec colnum="6" colname="col6" align="left"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:colspec colnum="8" colname="col8" align="right"/>
     <oasis:colspec colnum="9" colname="col9" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">AA</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M117" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M118" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M119" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">AA</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M120" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M121" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9"><inline-formula><mml:math id="M122" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">A</oasis:entry>
         <oasis:entry colname="col2">0.6</oasis:entry>
         <oasis:entry colname="col3">0</oasis:entry>
         <oasis:entry colname="col4">3</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">M</oasis:entry>
         <oasis:entry colname="col7">0.4</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M123" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9">5</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">C</oasis:entry>
         <oasis:entry colname="col2">0.0</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M124" display="inline"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">3</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">N</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M125" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.2</oasis:entry>
         <oasis:entry colname="col8">1</oasis:entry>
         <oasis:entry colname="col9">4</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">D</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M126" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.2</oasis:entry>
         <oasis:entry colname="col3">1</oasis:entry>
         <oasis:entry colname="col4">4</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">P</oasis:entry>
         <oasis:entry colname="col7">0.0</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M127" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9">5</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">E</oasis:entry>
         <oasis:entry colname="col2">0.0</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M128" display="inline"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">5</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">Q</oasis:entry>
         <oasis:entry colname="col7">0.0</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M129" display="inline"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9">5</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">F</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M130" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>2.2</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M131" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">9</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">9</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">R</oasis:entry>
         <oasis:entry colname="col7">0.2</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M132" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9">6</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">G</oasis:entry>
         <oasis:entry colname="col2">0.4</oasis:entry>
         <oasis:entry colname="col3">1</oasis:entry>
         <oasis:entry colname="col4">2</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">S</oasis:entry>
         <oasis:entry colname="col7">0.6</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M133" display="inline"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9">3</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">H</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M134" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>1.8</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M135" display="inline"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">6</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">T</oasis:entry>
         <oasis:entry colname="col7">0.8</oasis:entry>
         <oasis:entry colname="col8">0</oasis:entry>
         <oasis:entry colname="col9">4</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">I</oasis:entry>
         <oasis:entry colname="col2">1.2</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M136" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>1</oasis:entry>
         <oasis:entry colname="col4">6</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">V</oasis:entry>
         <oasis:entry colname="col7">1.0</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M137" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9">5</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">K</oasis:entry>
         <oasis:entry colname="col2">1.2</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M138" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">6</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">W</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M139" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>3.8</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M140" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">11</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9">11</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">L</oasis:entry>
         <oasis:entry colname="col2">1.2</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M141" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>1</oasis:entry>
         <oasis:entry colname="col4">6</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">Y</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M142" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>2.2</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M143" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">9</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9">9</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d1e2479">As measured by <inline-formula><mml:math id="M144" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> in linear regressions,
the CHNOS basis yields a strong negative correlation between <inline-formula><mml:math id="M145" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
and <inline-formula><mml:math id="M146" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> for the amino acids
(Fig. <xref ref-type="fig" rid="Ch1.F1"/>a) but a relatively weak correlation between <inline-formula><mml:math id="M147" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
and <inline-formula><mml:math id="M148" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">O</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> (Fig. <xref ref-type="fig" rid="Ch1.F1"/>b).
The QEC basis provides a stronger association between <inline-formula><mml:math id="M149" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
and <inline-formula><mml:math id="M150" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">O</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> and reduces the correlation
between <inline-formula><mml:math id="M151" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M152" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
(Fig. <xref ref-type="fig" rid="Ch1.F1"/>c–d). However, there is still a small negative
correlation for amino acids (Fig. <xref ref-type="fig" rid="Ch1.F1"/>c). A plot with the
<inline-formula><mml:math id="M153" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values for all possible combinations
of H<inline-formula><mml:math id="M154" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O, O<inline-formula><mml:math id="M155" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>, and three amino acids indicates
that QEC has relatively low <inline-formula><mml:math id="M156" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> of <inline-formula><mml:math id="M157" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>–<inline-formula><mml:math id="M158" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
and high <inline-formula><mml:math id="M159" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> of <inline-formula><mml:math id="M160" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">O</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>–<inline-formula><mml:math id="M161" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
(Fig. <xref ref-type="fig" rid="Ch1.F1"/>e). Therefore, it is a suitable candidate to meet
the objectives described above. Although another combination of amino
acids – methionine, tryptophan, and tyrosine (MWY) – has even
lower <inline-formula><mml:math id="M162" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> for the <inline-formula><mml:math id="M163" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>–<inline-formula><mml:math id="M164" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
fit (Fig. <xref ref-type="fig" rid="Ch1.F1"/>e), tryptophan and tyrosine are not highly connected
metabolites and therefore are less preferable as basis species.</p>
      <p id="d1e2762">By strengthening the association between <inline-formula><mml:math id="M165" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
and <inline-formula><mml:math id="M166" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">O</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>, which represent alternative
metrics for oxidation state, and by reducing the correlation between
<inline-formula><mml:math id="M167" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M168" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>,
the QEC basis species provides a more convenient projection of elemental
composition than a default choice of inorganic species, such
as CO<inline-formula><mml:math id="M169" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>, NH<inline-formula><mml:math id="M170" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>, H<inline-formula><mml:math id="M171" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>S,
H<inline-formula><mml:math id="M172" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O, and O<inline-formula><mml:math id="M173" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>, which commonly appear
in overall catabolic reactions <xref ref-type="bibr" rid="bib1.bibx4" id="paren.52"/>. The selection of basis
species is an evolving method, and further analysis with other metabolites
may lead to a more convenient set of basis species to project the
elemental composition of proteins into chemical variables.</p>
</sec>
<sec id="Ch1.S3.SS4">
  <label>3.4</label><title>Compositional metrics for proteins and metagenomes</title>
      <p id="d1e2876">For a given protein, the stoichiometric hydration state was calculated
from
            <disp-formula id="Ch1.E4" content-type="numbered"><label>2</label><mml:math id="M174" display="block"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>∑</mml:mo><mml:msub><mml:mi>n</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi><mml:mo>,</mml:mo></mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:mfenced><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mrow><mml:mo>∑</mml:mo><mml:msub><mml:mi>n</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M175" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the frequency of the <inline-formula><mml:math id="M176" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th
amino acid (<inline-formula><mml:math id="M177" display="inline"><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> to 20) in the protein and <inline-formula><mml:math id="M178" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
is the stoichiometric hydration state of that amino acid (Table <xref ref-type="table" rid="Ch1.T1"/>).
The “<inline-formula><mml:math id="M179" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>1” in the numerator accounts for the loss of H<inline-formula><mml:math id="M180" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O
in the polymerization of amino acids, and the “<inline-formula><mml:math id="M181" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula>1” after the summation accounts for the N-terminal H and C-terminal OH of the polypeptide.</p>
      <?pagebreak page6150?><p id="d1e3022">Unlike <inline-formula><mml:math id="M182" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M183" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
for proteins must be weighted by the number of carbon atoms in each
amino acid, i.e.,
            <disp-formula id="Ch1.E5" content-type="numbered"><label>3</label><mml:math id="M184" display="block"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>∑</mml:mo><mml:msub><mml:mi>n</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:mi mathvariant="normal">C</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi>Z</mml:mi><mml:mrow><mml:mi mathvariant="normal">C</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>∑</mml:mo><mml:msub><mml:mi>n</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:mi mathvariant="normal">C</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M185" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:mrow class="chem"><mml:mi mathvariant="normal">C</mml:mi></mml:mrow><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M186" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mrow><mml:mrow class="chem"><mml:mi mathvariant="normal">C</mml:mi></mml:mrow><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
are the number of carbon atoms and carbon oxidation state of the <inline-formula><mml:math id="M187" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th
amino acid (see Table 1). For example, <inline-formula><mml:math id="M188" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
of the dipeptide Ala-Gly can be calculated as <inline-formula><mml:math id="M189" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>+</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo><mml:mo>/</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mo>+</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, where 3 and 2 are the numbers of carbon
atoms and 0 and 1 are the <inline-formula><mml:math id="M190" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> of Ala and Gly,
respectively. The result, 0.4, can be checked by applying Eq. (<xref ref-type="disp-formula" rid="Ch1.E1"/>)
to the chemical formula of alanylglycine (C<inline-formula><mml:math id="M191" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">5</mml:mn></mml:msub></mml:math></inline-formula>H<inline-formula><mml:math id="M192" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">10</mml:mn></mml:msub></mml:math></inline-formula>N<inline-formula><mml:math id="M193" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O<inline-formula><mml:math id="M194" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>).
The methods for calculating <inline-formula><mml:math id="M195" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
and <inline-formula><mml:math id="M196" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> from elemental composition and amino
acid composition are shown schematically in Fig. <xref ref-type="fig" rid="Ch1.F2"/>.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2"><?xmltex \currentcnt{2}?><label>Figure 2</label><caption><p id="d1e3288">Schematic of calculations of <inline-formula><mml:math id="M197" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
and <inline-formula><mml:math id="M198" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for a single protein. The selected
protein is chicken egg white lysozyme (UniProt ID: LYSC_CHICK), which
is historically an extensively characterized protein in the laboratory.
The protein sequence was used to tabulate the amino acid composition
(right column), which in turn was used to generate the elemental composition
(left column). The coefficients on the basis species are determined
from the elemental composition by mass-balance constraints. Dividing
the number of H<inline-formula><mml:math id="M199" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O in the basis species by
the protein length gives the stoichiometric hydration state (<inline-formula><mml:math id="M200" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>).
Independent of the basis species, the elemental composition yields
the average oxidation state of carbon (<inline-formula><mml:math id="M201" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>)
according to Eq. (<xref ref-type="disp-formula" rid="Ch1.E1"/>). To reduce computing steps, in this
study the amino acid compositions of proteins (obtained, for example, from
metagenomic sequences) were used to calculate <inline-formula><mml:math id="M202" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
and <inline-formula><mml:math id="M203" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> with Eqs. (<xref ref-type="disp-formula" rid="Ch1.E4"/>)
and (<xref ref-type="disp-formula" rid="Ch1.E5"/>) and the values for amino acids in Table 1.</p></caption>
          <?xmltex \igopts{width=241.848425pt}?><graphic xlink:href="https://bg.copernicus.org/articles/17/6145/2020/bg-17-6145-2020-f02.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS5">
  <label>3.5</label><?xmltex \opttitle{Amino acid composition of proteomes of \textit{nif}-bearing organisms}?><title>Amino acid composition of proteomes of <italic>nif</italic>-bearing organisms</title>
      <?pagebreak page6151?><p id="d1e3410">In a separate study, <xref ref-type="bibr" rid="bib1.bibx67" id="text.53"/> used carbon oxidation state as
a metric for comparing proteomes of organisms containing the nitrogenase
gene (<italic>nif</italic>). The evolution of these organisms is associated with rising
atmospheric oxygen through geological history. In order to approximately
replicate their results, amino acid compositions of all proteins for
each bacterial, archaeal, and viral taxon in the NCBI Reference Sequence
(RefSeq) database <xref ref-type="bibr" rid="bib1.bibx63" id="paren.54"/> were compiled from RefSeq release
201 (July 2020). Scripts to do this and the resulting data file of
amino acid compositions of 42 787 taxa are available in the JMDplots
R package (see “Code and data availability” section). Names of organisms
containing different nitrogenase (<italic>nif</italic>) homologs were extracted from
Supplement Table S1A of <xref ref-type="bibr" rid="bib1.bibx67" id="text.55"/>. These names were matched
to the closest organism name in RefSeq. Duplicated species (represented
by different strains) were removed, as were matching organisms with
fewer than 1000 RefSeq protein sequences. As a result, the numbers
of organisms included in the present calculations (Nif-A: 155, Nif-B:
68, Nif-C: 14, Nif-D: 7) are less than those identified in <xref ref-type="bibr" rid="bib1.bibx67" id="text.56"/>.
Note that values of <inline-formula><mml:math id="M204" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> calculated here (Fig. <xref ref-type="fig" rid="Ch1.F3"/>a) are lower than those shown in Fig. 5 of <xref ref-type="bibr" rid="bib1.bibx67" id="text.57"/>.
This difference is associated with the weighting by carbon number
(described above), which was not performed by <xref ref-type="bibr" rid="bib1.bibx67" id="text.58"/>.</p>
</sec>
<sec id="Ch1.S3.SS6">
  <label>3.6</label><title>GRAVY and pI</title>
      <p id="d1e3459">The grand average of hydropathicity (GRAVY) was calculated using published
hydropathy values for amino acids <xref ref-type="bibr" rid="bib1.bibx52" id="paren.59"/>. The isoelectric point
(pI) was calculated using published p<inline-formula><mml:math id="M205" display="inline"><mml:mrow><mml:msub><mml:mi>K</mml:mi><mml:mi mathvariant="normal">a</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> values for terminal
groups <xref ref-type="bibr" rid="bib1.bibx10" id="paren.60"/> and side chains <xref ref-type="bibr" rid="bib1.bibx11" id="paren.61"/>; however, the
calculation does not implement position-specific adjustments <xref ref-type="bibr" rid="bib1.bibx11" id="paren.62"/>.
The p<inline-formula><mml:math id="M206" display="inline"><mml:mrow><mml:msub><mml:mi>K</mml:mi><mml:mi mathvariant="normal">a</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> values used for calculating pI <xref ref-type="bibr" rid="bib1.bibx10 bib1.bibx11" id="paren.63"/>
and transfer free energies used in the derivation of the GRAVY scale
<xref ref-type="bibr" rid="bib1.bibx52" id="paren.64"/> correspond to 25 <inline-formula><mml:math id="M207" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C and 1 bar, and no attempt
was made here to account for the temperature effects on these properties.
The charge for each ionizable group was precalculated from pH 0 to
14 at intervals of 0.01, and the isoelectric point was computed as
the pH where the sum of charges of all groups in the protein is closest
to zero. These calculations were implemented as new functions in the
canprot R package <xref ref-type="bibr" rid="bib1.bibx21" id="paren.65"/> (see “Code and data availability” section).
Comparisons for selected proteins (UniProt IDs: LYSC_CHICK, RNAS1_BOVIN,
AMYA_PYRFU) show that the calculated values of GRAVY and pI are equal
to those obtained with the ProtParam tool <xref ref-type="bibr" rid="bib1.bibx35" id="paren.66"/>.</p>
</sec>
<sec id="Ch1.S3.SS7">
  <label>3.7</label><title>Prediction of protein sequences</title>
      <p id="d1e3526">Protein sequences were predicted from metagenomic reads using a previously
described workflow <xref ref-type="bibr" rid="bib1.bibx26" id="paren.67"/>. Briefly, reads were trimmed, filtered,
and dereplicated using scripts adapted from the MG-RAST pipeline <xref ref-type="bibr" rid="bib1.bibx46" id="paren.68"/>.
For metatranscriptomic datasets, ribosomal RNA sequences were removed
using SortMeRNA <xref ref-type="bibr" rid="bib1.bibx50" id="paren.69"/>. Protein-coding sequences were identified
using FragGeneScan <xref ref-type="bibr" rid="bib1.bibx73" id="paren.70"/>, and the amino acid sequences of
the predicted proteins were used in further calculations. For large
datasets, only a portion of the available reads were processed (at
least 500 000 reads; see Supplement Tables S1 and S2). This reduces
the computational requirements without noticeably affecting the calculated
average compositions <xref ref-type="bibr" rid="bib1.bibx26" id="paren.71"/>.</p>
      <p id="d1e3544">Means and standard deviations of <inline-formula><mml:math id="M208" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M209" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>,
GRAVY, and pI were calculated for 100 random subsamples of protein
sequences from each metagenomic or metatranscriptomic dataset. The
number of sequences included in each subsample was chosen to give
a total length closest to 50 000 amino acids on average. The subsample
density (or number of sequences included in each sample) depends on
the average length of the metagenomic or metatranscriptomic sequences
and is listed in Tables S1 and S2. This number ranges from 251 for
the dataset with the highest mean protein fragment length (199.1;
metagenome of hot-spring source of Bison Pool) to 1696 for the dataset
with the lowest mean protein fragment length (29.5; metatranscriptome
of site GS684 in the Baltic Sea).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F3" specific-use="star"><?xmltex \currentcnt{3}?><label>Figure 3</label><caption><p id="d1e3577">Compositional analysis of proteins in redox gradients
and the Baltic Sea salinity gradient. <bold>(a)</bold> Redox gradients.
Abbreviations and data sources: BP (Bison Pool hot spring; <xref ref-type="bibr" rid="bib1.bibx40 bib1.bibx85" id="altparen.72"/>),
DV (diffuse submarine vents; <xref ref-type="bibr" rid="bib1.bibx72 bib1.bibx33" id="altparen.73"/>), GN (Guerrero
Negro microbial mat; <xref ref-type="bibr" rid="bib1.bibx51" id="altparen.74"/>), and NF (nitrogenase-bearing organisms;
<xref ref-type="bibr" rid="bib1.bibx67" id="altparen.75"/>). The NF data are based on reference proteomes (see
Methods section); all others are for protein sequences predicted from metagenomic
data. Outlined symbols indicate samples from relatively oxidizing
conditions. <bold>(b)</bold> Surface and <bold>(c)</bold> deeper samples (Chl <inline-formula><mml:math id="M210" display="inline"><mml:mi>a</mml:mi></mml:math></inline-formula> max: chlorophyll <inline-formula><mml:math id="M211" display="inline"><mml:mi>a</mml:mi></mml:math></inline-formula> maximum, 9–30 m deep) from the Baltic Sea transect.
Metagenomes as described in <xref ref-type="bibr" rid="bib1.bibx28" id="text.76"/> were downloaded from iMicrobe
<xref ref-type="bibr" rid="bib1.bibx94" id="paren.77"/>; the plots show data for the 0.1–0.8 <inline-formula><mml:math id="M212" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>m
size fraction collected from stations along the transect at low salinity
(&lt; 6 PSU) and high salinity (&gt; 6 PSU). Background
guidelines have slopes equal to that of the <inline-formula><mml:math id="M213" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>–<inline-formula><mml:math id="M214" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
linear regression for amino acids in Fig. <xref ref-type="fig" rid="Ch1.F1"/>c.</p></caption>
          <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://bg.copernicus.org/articles/17/6145/2020/bg-17-6145-2020-f03.png"/>

        </fig>

</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Results and discussion</title>
<sec id="Ch1.S4.SS1">
  <label>4.1</label><title>Comparison of redox and salinity gradients</title>
      <p id="d1e3683">To search for the hypothesized dehydration signal in metagenomic data,
we began with redox gradients as a negative control. Submarine hydrothermal
vents are zones of complex interactions between reduced endmember
fluids and relatively oxidized seawater <xref ref-type="bibr" rid="bib1.bibx71 bib1.bibx64" id="paren.78"/>. Terrestrial
hydrothermal systems, such as the hot springs in Yellowstone National
Park, USA, provide a source of reduced fluids that are oxidized by
degassing and mixing with air and surface groundwater as well as biological
activity including sulfide oxidation <xref ref-type="bibr" rid="bib1.bibx57" id="paren.79"/>. Redox gradients
can also develop over smaller length scales. The surface of the Guerrero
Negro microbial mat (Baja California Sur, Mexico) is exposed to ca.
1 m deep hypersaline, oxygenated water (approximately 200 <inline-formula><mml:math id="M215" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>M
O<inline-formula><mml:math id="M216" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>), but in the mat, oxygen rises during the daytime
and is depleted within a few millimeters, giving way to anoxic and then
sulfidic conditions <xref ref-type="bibr" rid="bib1.bibx55" id="paren.80"/>.</p>
      <p id="d1e3712">Using metagenomic data for these redox gradients <xref ref-type="bibr" rid="bib1.bibx51 bib1.bibx40 bib1.bibx85 bib1.bibx72 bib1.bibx33" id="paren.81"/>,
<xref ref-type="bibr" rid="bib1.bibx26" id="text.82"/> showed that the carbon oxidation states of DNA, messenger
RNA, and proteins increase down the outflow channel of Bison Pool
and between fluids from diffuse hydrothermal vents and relatively
oxidizing seawater. Moreover, intact polar lipids extracted from the
microbial communities of Bison Pool and other alkaline hot springs
also exhibit downstream increases in carbon oxidation state <xref ref-type="bibr" rid="bib1.bibx13" id="paren.83"/>,
revealing that parallel compositional trends characterize many major
types of biomacromolecules in these hot springs. The <inline-formula><mml:math id="M217" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
of proteins increases more subtly toward the surface in the upper
few millimeters of the Guerrero Negro microbial mat; it also increases
at greater depths, perhaps due to heterotrophic degradation and/or
horizontal gene transfer <xref ref-type="bibr" rid="bib1.bibx26" id="paren.84"/>. Furthermore, an evolutionary
trajectory associated with the occurrence of different homologs of
nitrogenase (<italic>nif</italic>) in anaerobic and aerobic organisms is characterized
by increasing <inline-formula><mml:math id="M218" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> of the proteomes of these
organisms <xref ref-type="bibr" rid="bib1.bibx67" id="paren.85"/>.</p>
      <?pagebreak page6152?><p id="d1e3756"><?xmltex \hack{\newpage}?>The trends of carbon oxidation state described above are visible in
the scatter plot in Fig. <xref ref-type="fig" rid="Ch1.F3"/>a, with an added dimension: stoichiometric
hydration state. The guidelines in this plot are parallel to the <inline-formula><mml:math id="M219" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>–<inline-formula><mml:math id="M220" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
trend for amino acids (Fig. <xref ref-type="fig" rid="Ch1.F1"/>c); their slope represents
the background correlation between <inline-formula><mml:math id="M221" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
and <inline-formula><mml:math id="M222" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> that is associated with the choice
of basis species. Sample data for Bison Pool and the submarine vents
are distributed parallel to these guidelines. Therefore, the decrease
of <inline-formula><mml:math id="M223" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> along these redox
gradients can be attributed to the background correlation in the stoichiometric
analysis, and the differences between samples within each dataset
are specifically associated with changes in carbon oxidation state
and not stoichiometric hydration state. This is an expected outcome,
as the redox gradients considered here do not have large changes in
salinity. In particular, concentrations of Cl<inline-formula><mml:math id="M224" display="inline"><mml:msup><mml:mi/><mml:mo>-</mml:mo></mml:msup></mml:math></inline-formula>,
a conservative ion, increase by less than 10 % (6.1 to 6.6 mM) in
the outflow of Bison Pool due to evaporation <xref ref-type="bibr" rid="bib1.bibx85" id="paren.86"/>. The
diffuse vents considered here have concentrations of Cl<inline-formula><mml:math id="M225" display="inline"><mml:msup><mml:mi/><mml:mo>-</mml:mo></mml:msup></mml:math></inline-formula>
between 515 and 624 mM, not greatly different from bottom seawater
at 545 mM (Dataset S1 of <xref ref-type="bibr" rid="bib1.bibx71" id="altparen.87"/>).</p>
      <p id="d1e3862">As a well known example of a regional salinity gradient, the Baltic
Sea exhibits a freshwater to marine transition over 1800 km, but dissolved
oxygen at the surface is at or near saturation with air <xref ref-type="bibr" rid="bib1.bibx28" id="paren.88"/>,
so this transect does not represent a redox gradient. For protein
sequences derived from metagenomes in the 0.1–0.8 <inline-formula><mml:math id="M226" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>m
size fraction, there are large changes in stoichiometric hydration
state along the Baltic Sea transect but relatively small differences
in the carbon oxidation state (Fig. <xref ref-type="fig" rid="Ch1.F3"/>b). This pattern holds
for samples from both the surface and chlorophyll <inline-formula><mml:math id="M227" display="inline"><mml:mi>a</mml:mi></mml:math></inline-formula> maximum (9–30 m deep; Fig. <xref ref-type="fig" rid="Ch1.F3"/>c).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4" specific-use="star"><?xmltex \currentcnt{4}?><label>Figure 4</label><caption><p id="d1e3890">Stoichiometric hydration state of proteins in metagenomes
<xref ref-type="bibr" rid="bib1.bibx28" id="paren.89"/> and metatranscriptomes <xref ref-type="bibr" rid="bib1.bibx8" id="paren.90"/> of surface
water samples in the Baltic Sea with increasing particle size: <bold>(a)</bold>
0.1–0.8, <bold>(b)</bold> 0.8–3.0, and <bold>(c)</bold> 3.0–200 <inline-formula><mml:math id="M228" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>m. From left to right, the samples on the horizontal
axis (some IDs omitted for clarity) are arranged from freshwater to
marine conditions in the Sorcerer II Global Ocean Sampling Expedition
<xref ref-type="bibr" rid="bib1.bibx28" id="paren.91"/>; all sample IDs are GS667, GS665, GS669, GS673, GS675,
GS659, GS679, GS681, GS683, GS685, GS687, and GS694. Width of shading
represents <inline-formula><mml:math id="M229" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 1 standard deviation in subsampled sequences (see
Methods section).</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://bg.copernicus.org/articles/17/6145/2020/bg-17-6145-2020-f04.png"/>

        </fig>

</sec>
<sec id="Ch1.S4.SS2">
  <label>4.2</label><title>Multifactorial hydration effects</title>
      <p id="d1e3941">The stoichiometric hydration state of proteins can be influenced by
factors other than just salinity. Previous authors have observed large
differences in microbial community composition between free-living
and particle-associated fractions, which may be due in part to anoxic
conditions arising from limited diffusion in particles <xref ref-type="bibr" rid="bib1.bibx81" id="paren.92"/>.
As described below, we found a trend of relatively low <inline-formula><mml:math id="M230" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> in particles compared to free-living fractions in both the Baltic
Sea and Amazon River. This effect is probably associated with phylogenetic
differences among the size fractions, but reduced accessibility to
bulk water may be a contributing factor. Further support for the possible
influence of physical accessibility is the reduced <inline-formula><mml:math id="M231" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
in the interior compared to upper layers of the Guerrero Negro microbial
mat.</p>
      <?pagebreak page6153?><p id="d1e3981">For the Baltic Sea metagenomes and metatranscriptomes, the 0.1–0.8
and 0.8–3.0 <inline-formula><mml:math id="M232" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>m size fractions of particles
that do not pass through the filter, which are used for subsequent
DNA extraction and sequencing, represent free-living bacteria, while
the 3.0–200 <inline-formula><mml:math id="M233" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>m fraction contains particle-associated
bacteria with average larger genome sizes and greater inferred metabolic
and regulatory capacity <xref ref-type="bibr" rid="bib1.bibx28" id="paren.93"/>. Figure <xref ref-type="fig" rid="Ch1.F4"/>a–c shows
that proteins inferred from metagenomes for larger particles have
lower <inline-formula><mml:math id="M234" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> than those for
the smallest size fraction. The Guerrero Negro microbial mat offers
another opportunity to compare exposed and interior environments.
Unlike <inline-formula><mml:math id="M235" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, which reaches a minimum a few millimeters
into the mat, <inline-formula><mml:math id="M236" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> decreases
throughout the mat, but the changes are most pronounced in the upper
few millimeters (Fig. <xref ref-type="fig" rid="Ch1.F3"/>a).</p>
      <p id="d1e4053">One hypothesis that could explain these findings is that the interiors
of particles and the mat are sequestered to some extent from the surrounding
aqueous environment. If limited accessibility to the aqueous phase
were manifested as lower water activity, perhaps due to surface effects
associated with geological nanomaterials <xref ref-type="bibr" rid="bib1.bibx91" id="paren.94"/> and/or higher
concentrations of solutes, it would provide a thermodynamic drive
that favors lower <inline-formula><mml:math id="M237" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> of
proteins. However, it should be noted that particles are also suitable
habitats for multicellular and eukaryotic populations <xref ref-type="bibr" rid="bib1.bibx81" id="paren.95"/>.
Therefore, the trends in stoichiometric hydration state may require
an explanation in terms of both physical and phylogenetic differences,
which should be explored in future studies.</p>
      <p id="d1e4079">An important evolutionary transition is the emergence of heterotrophic
metabolism, which is a later innovation than autotrophic core metabolism
<xref ref-type="bibr" rid="bib1.bibx61 bib1.bibx14" id="paren.96"/>. It is notable that the deeper layers of the Guerrero
Negro mat show greater evidence for heterotrophic metabolism <xref ref-type="bibr" rid="bib1.bibx51" id="paren.97"/>;
likewise, heterotrophs in the “photosynthetic fringe” in Bison
Pool may outcompete the autotrophs that dominate at higher and lower
temperatures <xref ref-type="bibr" rid="bib1.bibx85" id="paren.98"/>. These putative heterotroph-rich zones
show locally lower values of <inline-formula><mml:math id="M238" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
(Fig. <xref ref-type="fig" rid="Ch1.F3"/>a). If decreasing stoichiometric hydration state
is a common theme across some evolutionary transitions, then the relatively
high <inline-formula><mml:math id="M239" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> in the proteomes
of organisms carrying the ancestral nitrogenase Nif-D (Fig. <xref ref-type="fig" rid="Ch1.F3"/>a)
is not unexpected. A better understanding of these trends would require
more extensive phylogenetically resolved comparisons of the compositional
differences as well as quantitative analyses of water fluxes in different
metabolic pathways.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F5" specific-use="star"><?xmltex \currentcnt{5}?><label>Figure 5</label><caption><p id="d1e4133">Compositional analysis and hydropathicity and isoelectric
point calculations for proteins from the Amazon River and plume and
other metagenomes. Samples representing freshwater, marine, and hypersaline
environments are indicated by the colored convex hulls. <bold>(a)</bold> Metagenomic and <bold>(b)</bold> metatranscriptomic data for particle-associated
and free-living fractions from the lower Amazon River <xref ref-type="bibr" rid="bib1.bibx77" id="paren.99"/>
and plume in the Atlantic Ocean <xref ref-type="bibr" rid="bib1.bibx76" id="paren.100"/>. <bold>(c)</bold> Freshwater
(lakes in Sweden and USA) and marine metagenomes considered in a previous
comparative study <xref ref-type="bibr" rid="bib1.bibx29" id="paren.101"/> and metagenomes from hypersaline
environments including Kulunda Steppe soda lakes in Siberia, Russia
<xref ref-type="bibr" rid="bib1.bibx88" id="paren.102"/> (KS), Santa Pola salterns in Spain <xref ref-type="bibr" rid="bib1.bibx36 bib1.bibx31" id="paren.103"/>
(SA), and salterns in the South Bay of San Francisco, CA, USA <xref ref-type="bibr" rid="bib1.bibx47" id="paren.104"/>
(SB). Plots <bold>(d–f)</bold> show values of average hydropathicity (GRAVY)
and isoelectric point (pI) of proteins for the same datasets. Background
guidelines have slopes equal to that of the <inline-formula><mml:math id="M240" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>–<inline-formula><mml:math id="M241" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
linear regression for amino acids in Fig. <xref ref-type="fig" rid="Ch1.F1"/>c.</p></caption>
          <?xmltex \igopts{width=426.791339pt}?><graphic xlink:href="https://bg.copernicus.org/articles/17/6145/2020/bg-17-6145-2020-f05.png"/>

        </fig>

</sec>
<sec id="Ch1.S4.SS3">
  <label>4.3</label><title>Compositional trends in rivers,
lakes, and hypersaline environments</title>
      <?pagebreak page6154?><p id="d1e4212">The Amazon River and ocean plume provide another example of a freshwater
to marine transition, with salinities that range from below the scale
of practical salinity units (PSU) in the river to 23–36 PSU in the
plume <xref ref-type="bibr" rid="bib1.bibx76 bib1.bibx77" id="paren.105"/>. We used published metagenomic and metatranscriptomic
data for filtered samples classified as free-living (0.2 to 2.0 <inline-formula><mml:math id="M242" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>m)
and particle-associated samples (2.0 to 156 <inline-formula><mml:math id="M243" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>m) <xref ref-type="bibr" rid="bib1.bibx76 bib1.bibx77" id="paren.106"/>.
River samples form a tight cluster on a plot of stoichiometric hydration
state against carbon oxidation state of proteins, and the plume samples
are scattered over lower <inline-formula><mml:math id="M244" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and low values
of <inline-formula><mml:math id="M245" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>, particularly for
the particle-associated fraction (Fig. <xref ref-type="fig" rid="Ch1.F5"/>a). For metatranscriptomes,
there is a noticeable decrease of <inline-formula><mml:math id="M246" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
from the river to the ocean plume but little difference in carbon
oxidation state (Fig. <xref ref-type="fig" rid="Ch1.F5"/>b), and the particle-associated
samples again exhibit a generally lower <inline-formula><mml:math id="M247" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
than the free-living samples. Together with the lower <inline-formula><mml:math id="M248" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
for proteins inferred from metagenomes and metatranscriptomes in the
larger size fractions from Baltic Sea samples, this could reflect
a lower availability of H<inline-formula><mml:math id="M249" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>O to organisms living near
the particle surface due to physical separation from the bulk aqueous
phase and associated diffusion limitation or lower water activity
<xref ref-type="bibr" rid="bib1.bibx91" id="paren.107"/>.</p>
      <p id="d1e4334">We also considered data used in a previous comparative study and data
for hypersaline environments including evaporation ponds (salterns)
and lakes in desert areas. <xref ref-type="bibr" rid="bib1.bibx29" id="text.108"/> characterized microbial
communities using metagenomic data for various freshwater samples
(lakes in the USA and Sweden) and marine locations. For hypersaline
settings, we used metagenomic data from the Santa Pola salterns in
Spain <xref ref-type="bibr" rid="bib1.bibx36 bib1.bibx31" id="paren.109"/>, natural soda lakes of the Kulunda Steppe
in Serbia <xref ref-type="bibr" rid="bib1.bibx88" id="paren.110"/>, and South Bay salterns in California, USA
<xref ref-type="bibr" rid="bib1.bibx47" id="paren.111"/>. The compositional analysis reveals a relatively low
<inline-formula><mml:math id="M250" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> of proteins inferred
from the marine metagenomes compared to freshwater samples in the
Eiler et al. (2014) dataset (Fig. <xref ref-type="fig" rid="Ch1.F5"/>c). Surprisingly, hypersaline
metagenomes have ranges of <inline-formula><mml:math id="M251" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
of proteins that are similar to marine environments but considerably
higher <inline-formula><mml:math id="M252" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (Fig. <xref ref-type="fig" rid="Ch1.F5"/>c). To interpret
these results, we considered other factors that are known to influence
the amino acid compositions of proteins in halophiles.</p>
      <p id="d1e4399">“Salt-in” halophilic organisms have proteins with relatively low
isoelectric point that remain soluble at high salt concentrations
<xref ref-type="bibr" rid="bib1.bibx36" id="paren.112"/>. It should be noted that proteins with a lower pI
also tend to have relatively high <inline-formula><mml:math id="M253" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> due to
higher abundances of aspartic acid and glutamic acid, which are relatively
oxidized (see <xref ref-type="bibr" rid="bib1.bibx5" id="altparen.113"/>; <xref ref-type="bibr" rid="bib1.bibx19" id="altparen.114"/>; and Fig. <xref ref-type="fig" rid="Ch1.F1"/>).
Consequently, the lower pI characteristic of salt-in organisms
is also associated with an increase of carbon oxidation state. Because
of the large pI differences (Fig. <xref ref-type="fig" rid="Ch1.F5"/>f), the increase of
<inline-formula><mml:math id="M254" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in hypersaline environments can not be
interpreted as an indicator of an environmental redox gradient. Some
halophilic organisms are also known to have proteins that are less
hydrophobic, with lower values of GRAVY <xref ref-type="bibr" rid="bib1.bibx66 bib1.bibx12" id="paren.115"/>. Because
hydrophobic amino acids have relatively low values of <inline-formula><mml:math id="M255" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
<xref ref-type="bibr" rid="bib1.bibx19" id="paren.116"/>, a negative correlation between GRAVY and <inline-formula><mml:math id="M256" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
is also expected.</p>
      <?pagebreak page6155?><p id="d1e4466">Consistent with these well known features of halophilic adaptation,
marine metagenomes exhibit lower hydrophobicity than most of the freshwater
samples, and hypersaline metagenomes are shifted to both lower GRAVY
and pI (Fig. <xref ref-type="fig" rid="Ch1.F5"/>f). However, there are irregular trends in
the Amazon River data. Compared to the river, the proteins in plume
metagenomes exhibit lower GRAVY and either higher or lower pI (Fig. <xref ref-type="fig" rid="Ch1.F5"/>d). Similarly, other authors have reported that although
lower pI is a signature of many hypersaline environments, it does
not clearly distinguish marine from lower-salinity environments <xref ref-type="bibr" rid="bib1.bibx74" id="paren.117"/>.
In contrast, the plume metatranscriptomes do show decreased pI but
no major difference in GRAVY compared to river samples (Fig. <xref ref-type="fig" rid="Ch1.F5"/>e).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F6"><?xmltex \currentcnt{6}?><label>Figure 6</label><caption><p id="d1e4481">Divergent trends of <inline-formula><mml:math id="M257" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
and <inline-formula><mml:math id="M258" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> of proteins from metagenomes
for <bold>(a)</bold> the Baltic Sea and <bold>(b)</bold> freshwater and higher-salinity
samples from southern California <xref ref-type="bibr" rid="bib1.bibx75" id="paren.118"/>. The datasets from
<xref ref-type="bibr" rid="bib1.bibx75" id="text.119"/> are classified according to salinity: freshwater (FW;
three samples at different times from the tilapia channel and one sample from the prebead pond), low salinity (LS; three samples at
different times from the low salinity saltern), and hypersaline (MS–HS;
four samples from a medium salinity and two from a high salinity saltern).
Plots <bold>(c)</bold> and <bold>(d)</bold> show GRAVY and pI computed for
the same datasets. Background guidelines have slopes equal to that
of the <inline-formula><mml:math id="M259" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>–<inline-formula><mml:math id="M260" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
linear regression for amino acids in Fig. <xref ref-type="fig" rid="Ch1.F1"/>c.</p></caption>
          <?xmltex \igopts{width=241.848425pt}?><graphic xlink:href="https://bg.copernicus.org/articles/17/6145/2020/bg-17-6145-2020-f06.png"/>

        </fig>

      <p id="d1e4567">There is not enough space here to comprehensively examine all the
available metagenomic data for environmental salinity gradients. However,
we have identified one dataset that gives a contradictory result
and therefore offers more perspective on the compositional relationships
of proteins coded by metagenomes in salinity gradients. This dataset
was generated in a time series study of microbial and viral community
dynamics in a freshwater aquaculture facility (“tilapia channel”
and “prebead bond”) and low-, medium-, and high-salinity salterns
in southern California <xref ref-type="bibr" rid="bib1.bibx75" id="paren.120"/>. Here, we have used only the
reported microbial sequences (not the viral dataset) and considered
all time points together. Contrary to our starting hypothesis, the
stoichiometric hydration state of proteins is lowest in the freshwater
samples, which is the reverse of the trend from the Baltic Sea (Fig. <xref ref-type="fig" rid="Ch1.F6"/>a–b). A side-by-side comparison of the Baltic Sea and the datasets by
Rodriguez-Brito et al. (2010)  shows large changes of GRAVY in the
former but pI in the latter (Fig. <xref ref-type="fig" rid="Ch1.F6"/>c–d), which is another
indication that these variables are responsive only in certain ranges
of salinity.</p>
      <p id="d1e4577">This counterexample demonstrates that the sign of differences of <inline-formula><mml:math id="M261" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
is not predictable in all environments; however, the large negative
offset in the freshwater samples may be a signal of some other influence,
perhaps related to the human control of these ponds, which are used
as fish nurseries. Specifically, the microbial communities in the
aquaculture ponds may not be responding as they would in a typical
natural system that is less nutrient rich. As noted above for putative
heterotroph-rich zones in other systems, the lower stoichiometric
hydration state could be associated with the enrichment of heterotrophic
taxa, in this case due to the addition of organic compounds to the
aquaculture ponds.</p>
      <p id="d1e4597">Considering all the datasets shown in Figs. <xref ref-type="fig" rid="Ch1.F5"/> and <xref ref-type="fig" rid="Ch1.F6"/>,
there appears to be no globally consistent metric for environmental
salinity gradients that can be derived from amino acid composition.
If we exclude the <xref ref-type="bibr" rid="bib1.bibx75" id="text.121"/> dataset, then <inline-formula><mml:math id="M262" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
exhibits a consistent decreasing trend in marine compared to freshwater
samples. However, this trend does not continue into hypersaline environments.</p>
</sec>
<sec id="Ch1.S4.SS4">
  <label>4.4</label><title>Compositional analysis of
differentially expressed proteins</title>
      <p id="d1e4632">While biomolecular data for environmental salinity gradients reflect
both ecological and evolutionary differences, laboratory experiments
provide information on the physiological effects of osmotic conditions
on protein expression in particular organisms. It is also important
to recognize that osmotic stress can be imposed by solutes other than
NaCl; the effects of organic solutes differ in relation to their ability
to permeate or depolarize cell membranes and to be sensed by cellular
osmoregulatory systems <xref ref-type="bibr" rid="bib1.bibx43 bib1.bibx79 bib1.bibx93" id="paren.122"/>. Because microbial
acclimation to changes in osmotic conditions is a dynamic process,
it is helpful to look at gene and protein expression data for a range
of times and conditions that can be controlled in the lab.</p>
      <p id="d1e4638">We searched the literature to compile data for differential gene and
protein expression in non-halophilic bacteria in NaCl or other osmotic
stress conditions. As a general rule, we only included datasets with
a minimum of 20 down-regulated and 20 up-regulated genes or proteins;
however,<?pagebreak page6156?> smaller datasets were included if they are part of a study
with larger datasets. This compilation consists of 49 transcriptomics
and 30 proteomics datasets from 36 studies (note that different time
points and treatments are considered separate datasets); descriptions
and references for all datasets are given in Figures S1 and S2. In
addition, four datasets for differential expression of proteins in
halophilic archaea in hyperosmotic stress were located <xref ref-type="bibr" rid="bib1.bibx54 bib1.bibx96 bib1.bibx56 bib1.bibx42" id="paren.123"/>
(see Fig. S3). This is a major update to an earlier compilation
of data for hyperosmotic stress experiments <xref ref-type="bibr" rid="bib1.bibx21" id="paren.124"/>, but we
have limited the present compilation to data for bacteria or archaea;
data for osmotic stress induced by NaCl or glucose in eukaryotic cells
are considered in a separate paper <xref ref-type="bibr" rid="bib1.bibx22" id="paren.125"/>.</p>
      <p id="d1e4650">We assembled the lists of up- and down-regulated proteins in each
dataset or, for gene expression studies, the proteins corresponding
to the up- and down-regulated genes and converted gene names or accession
numbers to UniProt accessions using the UniProt mapping tool <xref ref-type="bibr" rid="bib1.bibx41" id="paren.126"/>.
The compiled data are available as CSV files in R packages (see the “Code
and data availability” section). After removing genes or proteins with unavailable
or duplicated UniProt IDs and those with ambiguous differences (appearing
in both the down- and up-regulated groups), the amino acid compositions
computed for protein sequences downloaded from UniProt <xref ref-type="bibr" rid="bib1.bibx86" id="paren.127"/>
were used for the compositional analysis of carbon oxidation state
and stoichiometric hydration state. Median differences (i.e., <inline-formula><mml:math id="M263" display="inline"><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
and <inline-formula><mml:math id="M264" display="inline"><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) were calculated as the
median value for all up-regulated proteins minus the median value
for all down-regulated proteins in each dataset.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F7" specific-use="star"><?xmltex \currentcnt{7}?><label>Figure 7</label><caption><p id="d1e4694">Compositional analysis of proteins in hyperosmotic stress
experiments for non-halophilic bacteria and halophilic archaea. <bold>(a)</bold> Time-course experiments for bacteria; black circles represent datasets
for proteins coded by differentially expressed genes (transcriptomics
experiments) and blue squares represent datasets for differentially
expressed proteins (proteomics experiments). Lettered symbols represent
the progression in each experiment: <bold>(a–c)</bold> <xref ref-type="bibr" rid="bib1.bibx48" id="paren.128"><named-content content-type="pre">30, 80, 310 min;</named-content></xref>
(transcriptomes and proteomes), <bold>(d–f)</bold> <xref ref-type="bibr" rid="bib1.bibx83" id="paren.129"><named-content content-type="pre">5, 30, 60 min;</named-content></xref>,
<bold>(g–i)</bold> <xref ref-type="bibr" rid="bib1.bibx32" id="paren.130"><named-content content-type="pre">1, 6, 24 h;</named-content></xref>, <bold>(j–k)</bold> <xref ref-type="bibr" rid="bib1.bibx38" id="paren.131"><named-content content-type="pre">45 min, 14 h;</named-content></xref>, and
<bold>(l–n)</bold> <xref ref-type="bibr" rid="bib1.bibx68" id="paren.132"><named-content content-type="pre">24, 48, 72 h;</named-content></xref> (transcriptomes and proteomes;
no proteomic data available at 72 h). <bold>(b)</bold> Pairs of experiments
for bacteria under hyperosmotic stress imposed by NaCl or organic
solutes. The sources of data are <bold>(A–B)</bold> <xref ref-type="bibr" rid="bib1.bibx43" id="paren.133"><named-content content-type="pre">sorbitol;</named-content></xref>,
<bold>(C–D)</bold> <xref ref-type="bibr" rid="bib1.bibx39" id="paren.134"><named-content content-type="pre">sorbitol;</named-content></xref>, <bold>(E–F)</bold> <xref ref-type="bibr" rid="bib1.bibx49" id="paren.135"><named-content content-type="pre">sucrose;</named-content></xref>
(transcriptomes and proteomes), <bold>(G–H)</bold> <xref ref-type="bibr" rid="bib1.bibx32" id="paren.136"><named-content content-type="pre">glycerol at 1 h;</named-content></xref>,
<bold>(I–J)</bold> <xref ref-type="bibr" rid="bib1.bibx32" id="paren.137"><named-content content-type="pre">glycerol at 6 h;</named-content></xref>, <bold>(K–L)</bold> <xref ref-type="bibr" rid="bib1.bibx79" id="paren.138"><named-content content-type="pre">sucrose;</named-content></xref>,
<bold>(M–N)</bold> <xref ref-type="bibr" rid="bib1.bibx93" id="paren.139"><named-content content-type="pre">urea;</named-content></xref>, and <bold>(O–P)</bold> <xref ref-type="bibr" rid="bib1.bibx78" id="paren.140"><named-content content-type="pre">glucose;</named-content></xref>
(only proteomes). <bold>(c–f)</bold> Plots of median differences of
<inline-formula><mml:math id="M265" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M266" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
or GRAVY and pI for all compiled transcriptomic and proteomic data
for hyperosmotic stress, including datasets shown in <bold>(a)</bold> and <bold>(b)</bold> together with data for other experiments. In each
panel, open symbols represent individual datasets and filled symbols
represent the mean for all datasets. The axis labels include the <inline-formula><mml:math id="M267" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> values
for the mean difference for all datasets in each plot; <inline-formula><mml:math id="M268" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> values
less than 0.05 are shown in bold. References for all datasets are
in Fig. S1 (transcriptomics for non-halophilic bacteria), Fig. S2 (proteomics
for non-halophilic bacteria), and Fig. S3 (proteomics for halophilic archaea).</p></caption>
          <?xmltex \igopts{width=369.885827pt}?><graphic xlink:href="https://bg.copernicus.org/articles/17/6145/2020/bg-17-6145-2020-f07.png"/>

        </fig>

      <p id="d1e4869">Figure <xref ref-type="fig" rid="Ch1.F7"/>a shows results for time-course experiments for
hyperosmotic stress. Note that all values are differences calculated
relative to the same control (initial time point) in a given study.
In transcriptomic experiments for a commensal species (<italic>Enterococcus faecalis</italic>), a soil bacterium (<italic>Methylocystis</italic> sp. strain SC2),
and two pathogens (<italic>E. coli</italic> O157:H7 and <italic>Salmonella</italic> <italic>enterica</italic>
serovar Typhimurium) <xref ref-type="bibr" rid="bib1.bibx83 bib1.bibx38 bib1.bibx48 bib1.bibx32" id="paren.141"/>, there is
a marked progression toward lower <inline-formula><mml:math id="M269" display="inline"><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
of the associated proteins with time. In a transcriptomic experiment
for salt stress in <italic>Synechocystis</italic> sp. PCC 6803 <xref ref-type="bibr" rid="bib1.bibx68" id="paren.142"/>,
<inline-formula><mml:math id="M270" display="inline"><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is shifted
negatively between 24 and 48 h but rises to a slightly positive value
at 72 h. Proteomic data are available from two of these studies, indicating
that the differentially expressed proteins in <italic>E. coli</italic> <xref ref-type="bibr" rid="bib1.bibx48" id="paren.143"/>
also show decreasing <inline-formula><mml:math id="M271" display="inline"><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
with time, but in the proteomic experiment for <italic>Synechocystis</italic>
sp. PCC 6803 <xref ref-type="bibr" rid="bib1.bibx68" id="paren.144"/>, <inline-formula><mml:math id="M272" display="inline"><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
changes sign from negative to positive between 24 and 48 h (Fig. <xref ref-type="fig" rid="Ch1.F7"/>a).</p>
      <p id="d1e4991">Perhaps the most striking result to emerge from this analysis is the
strong dehydrating signal associated with osmotic stress imposed by
organic solutes. We compared pairs of datasets from the same study
for NaCl and another solute at concentrations that give similar total
osmolalities. Transcriptomic data for sorbitol <xref ref-type="bibr" rid="bib1.bibx43 bib1.bibx39" id="paren.145"/>,
sucrose <xref ref-type="bibr" rid="bib1.bibx49" id="paren.146"/>, and glycerol <xref ref-type="bibr" rid="bib1.bibx32" id="paren.147"/> compared to
controls all show a lower <inline-formula><mml:math id="M273" display="inline"><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
of the associated proteins than for NaCl compared to controls (Fig. <xref ref-type="fig" rid="Ch1.F7"/>b). Data from the study of <xref ref-type="bibr" rid="bib1.bibx32" id="text.148"/> are plotted
at 1 and 6 h in the experiment, indicating a time-dependent decrease
of <inline-formula><mml:math id="M274" display="inline"><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> under
both NaCl and glycerol treatment as well as more negative values for
glycerol than NaCl. Experiments with different strains of <italic>E. coli</italic> show a slightly more positive value for sucrose than NaCl <xref ref-type="bibr" rid="bib1.bibx79" id="paren.149"/>
and a much larger positive difference for urea compared to NaCl <xref ref-type="bibr" rid="bib1.bibx93" id="paren.150"/>.
The available proteomic data also show lower <inline-formula><mml:math id="M275" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
for sucrose <xref ref-type="bibr" rid="bib1.bibx49" id="paren.151"/> and glucose <xref ref-type="bibr" rid="bib1.bibx78" id="paren.152"/> compared to
NaCl (Fig. <xref ref-type="fig" rid="Ch1.F7"/>b). Note that the latter dataset is actually
a comparison between growth on glucose and glucose with NaCl; growth
on glucose alone produces a lower <inline-formula><mml:math id="M276" display="inline"><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
of the differentially expressed proteins.</p>
      <p id="d1e5101">The marked decrease of <inline-formula><mml:math id="M277" display="inline"><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
induced by solutes such as sorbitol, which does not permeate the plasma
membrane, could result from a higher effective osmotic pressure compared
to NaCl <xref ref-type="bibr" rid="bib1.bibx43" id="paren.153"/>. Because it permeates cells, solutions of
urea are not considered hypertonic <xref ref-type="bibr" rid="bib1.bibx15" id="paren.154"/>, which may be one
reason for the higher <inline-formula><mml:math id="M278" display="inline"><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
for urea compared to NaCl. Sucrose, which permeates but unlike NaCl
does not depolarize the plasma membrane <xref ref-type="bibr" rid="bib1.bibx79" id="paren.155"/>, produces
a slightly higher <inline-formula><mml:math id="M279" display="inline"><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
than NaCl in one transcriptomics dataset for <italic>E. coli</italic> <xref ref-type="bibr" rid="bib1.bibx79" id="paren.156"/>
but has a more marked dehydrating effect in both transcriptomics and
proteomics datasets for <italic>Caulobacter crescentus</italic> <xref ref-type="bibr" rid="bib1.bibx49" id="paren.157"/>.
The negative shift of <inline-formula><mml:math id="M280" display="inline"><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
associated with most organic solutes compared to NaCl lends support
to the notion that high organic loading could contribute to the relatively
low <inline-formula><mml:math id="M281" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> of protein sequences
from metagenomes of freshwater aquaculture systems (Fig. <xref ref-type="fig" rid="Ch1.F6"/>b).</p>
      <?pagebreak page6157?><p id="d1e5222">Considering all transcriptomic datasets together (see Fig. S1 for
references), the proteins coded by differentially expressed genes
in non-halophilic bacteria under hyperosmotic stress do not show significant
differences in <inline-formula><mml:math id="M282" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M283" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>,
pI, or GRAVY (Fig. <xref ref-type="fig" rid="Ch1.F7"/>c–d). However, the average difference
of <inline-formula><mml:math id="M284" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> would become more
negative if the early time points in individual time-course experiments
were excluded from the average (see Fig. <xref ref-type="fig" rid="Ch1.F7"/>a). Unlike the
results for transcriptomes, the average value of GRAVY for all proteomics
datasets (see Figs. S2 and S3 for references) increases significantly
(Fig. <xref ref-type="fig" rid="Ch1.F7"/>f; <inline-formula><mml:math id="M285" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.011</mml:mn></mml:mrow></mml:math></inline-formula>). The proteomic data also exhibit
a small decrease of pI (<inline-formula><mml:math id="M286" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.083</mml:mn></mml:mrow></mml:math></inline-formula>), which is expected for halophiles,
but the increase of GRAVY – that is, higher hydrophobicity – is
the opposite of the evolutionary trend for proteomes of halophilic
organisms <xref ref-type="bibr" rid="bib1.bibx66" id="paren.158"/> and the metagenomic comparisons described
above. Overall, the proteomic experiments record a significant decrease
of <inline-formula><mml:math id="M287" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> in hyperosmotic stress
(Fig. <xref ref-type="fig" rid="Ch1.F7"/>e; <inline-formula><mml:math id="M288" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.016</mml:mn></mml:mrow></mml:math></inline-formula>). We therefore conclude that
<inline-formula><mml:math id="M289" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is a metric with consistent
behavior for field and laboratory datasets, since it records decreasing
hydration state of proteins with increasing salinity in the Baltic
Sea and Amazon River and plume and of differentially expressed proteins
in microbial cells grown under hyperosmotic stress.</p>
</sec>
</sec>
<sec id="Ch1.S5" sec-type="conclusions">
  <label>5</label><title>Conclusions</title>
      <p id="d1e5363">This study was focused on describing the chemical compositions of
proteins in a geochemical context. The theoretical novelty of this
study is the derivation of a compositional metric for stoichiometric
hydration state (<inline-formula><mml:math id="M290" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>) that
is largely decoupled from changes in oxidation state (<inline-formula><mml:math id="M291" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>)
of proteins. Therefore, based on mass-action effects in thermodynamics,
<inline-formula><mml:math id="M292" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is predicted to decrease
toward higher salinity but be mostly insensitive to redox gradients.
We found that protein sequences inferred from metagenomes in regional
salinity gradients, including the Baltic Sea freshwater-marine transect
and Amazon River and plume, are characterized by changes of <inline-formula><mml:math id="M293" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
in the predicted direction. Although this trend does not continue
into hypersaline environments, the applicability of the compositional
analysis to microbial cells is supported by compilations of transcriptomic
and proteomic data, which indicate decreasing <inline-formula><mml:math id="M294" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
on average for the differentially expressed proteins in hyperosmotic
stress experiments. The dehydration signal becomes larger during many
time-course experiments and is stronger for most organic solutes than
for NaCl.</p>
      <p id="d1e5445">The central message of this study is that geochemical and laboratory
conditions can influence, but naturally do not completely determine,
the chemical compositions of proteins. As a step toward constructing
multidimensional chemical thermodynamic models of microbial communities,
the<?pagebreak page6158?> present results provide evidence that different compositional
metrics, representing the oxidation state and hydration state of molecules,
can be associated specifically with redox and salinity gradients,
respectively. The findings of this study underscore an opportunity
for the integration of hydration state into evolutionary models that
already consider changes in oxidation state or oxygen content of proteins
<xref ref-type="bibr" rid="bib1.bibx1 bib1.bibx67" id="paren.159"/>.</p>
</sec>

      
      </body>
    <back><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d1e5455">All metagenomic and metatranscriptomic data analyzed here were obtained
from public databases using the accession numbers listed in Supplement
Table S1 for salinity gradients and Table S2 for redox gradients.
The amino acid compositions of subsampled sequences from the metagenomic
and metatranscriptomic data are available in the JMDplots R package,
version 1.2.4 (<ext-link xlink:href="https://github.com/jedick/JMDplots">https://github.com/jedick/JMDplots</ext-link>),
which is archived on Zenodo <xref ref-type="bibr" rid="bib1.bibx23" id="paren.160"/>. Specifically,
the data are contained in the file <monospace>inst/extdata/gradH2O/MGP.rds</monospace>,
which can be read using the R function readRDS (minimum R version:
2.3.0).
The compilation of differential gene expression data is available
in the JMDplots package as xz-compressed CSV files in the directory
<monospace>inst/extdata/expression/osmotic/</monospace>. The compilation of differential
protein expression data is in the corresponding directory of the canprot
R package, version 1.1.0 (<ext-link xlink:href="https://cran.r-project.org/package=canprot">https://cran.r-project.org/package=canprot</ext-link>),
which is also archived on Zenodo <xref ref-type="bibr" rid="bib1.bibx24" id="paren.161"/>. The results
of the compositional analysis of differential expression data, which
are used for Fig. <xref ref-type="fig" rid="Ch1.F7"/>, are in the <monospace>inst/vignettes/</monospace>
directories of the JMDplots and canprot packages.
The code used to make all of the figures and perform statistical testing
is in the JMDplots package. The <monospace>gradH2O.Rmd</monospace> vignette in the
package demonstrates the functions used to make the figures.</p>
  </notes><app-group>
        <supplementary-material position="anchor"><p id="d1e5485">The supplement related to this article is available online at: <inline-supplementary-material xlink:href="https://doi.org/10.5194/bg-17-6145-2020-supplement" xlink:title="zip">https://doi.org/10.5194/bg-17-6145-2020-supplement</inline-supplementary-material>.</p></supplementary-material>
        </app-group><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d1e5494">JMD designed and carried out the analysis. JMD, MY, and JT interpreted
the results. JMD wrote the article with editing input from MY and
JT.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d1e5500">The authors declare that they have no conflict of interest.</p>
  </notes><ack><title>Acknowledgements</title><p id="d1e5506">We are grateful to Saroj Poudel for commenting on an earlier version
of the manuscript.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d1e5511">This research has been supported by the State Key Laboratory of Organic Geochemistry (grant no. SKLOG-201928).</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d1e5517">This paper was edited by Jack Middelburg and reviewed by two anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><label>Acquisti et al.(2007)</label><?label AKC07?><mixed-citation>Acquisti, C., Kleffe, J., and Collins, S.: Oxygen content of transmembrane
proteins over macroevolutionary time scales, Nature, 445, 47–52,
<ext-link xlink:href="https://doi.org/10.1038/nature05450" ext-link-type="DOI">10.1038/nature05450</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>Akashi and Gojobori(2002)</label><?label AG02?><mixed-citation>Akashi, H. and Gojobori, T.: Metabolic efficiency and amino acid composition
in the proteomes of <italic>Escherichia coli</italic> and <italic>Bacillus subtilis</italic>,
P. Natl. Acad. Sci., 99, 3695–3700,
<ext-link xlink:href="https://doi.org/10.1073/pnas.062526999" ext-link-type="DOI">10.1073/pnas.062526999</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Alsop et al.(2014)</label><?label ABR14?><mixed-citation>Alsop, E. B., Boyd, E. S., and Raymond, J.: Merging metagenomics and
geochemistry reveals environmental controls on biological diversity and
evolution, BMC Ecol., 14, 16, <ext-link xlink:href="https://doi.org/10.1186/1472-6785-14-16" ext-link-type="DOI">10.1186/1472-6785-14-16</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Amend and LaRowe(2019)</label><?label AL19?><mixed-citation>Amend, J. P. and LaRowe, D. E.: Mini-review: Demystifying microbial
reaction energetics, Environ. Microbiol., 21, 3539–3547,
<ext-link xlink:href="https://doi.org/10.1111/1462-2920.14778" ext-link-type="DOI">10.1111/1462-2920.14778</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx5"><label>Amend and Shock(1998)</label><?label AS98?><mixed-citation>Amend, J. P. and Shock, E. L.: Energetics of amino acid synthesis in
hydrothermal ecosystems, Science, 281, 1659–1662,
<ext-link xlink:href="https://doi.org/10.1126/science.281.5383.1659" ext-link-type="DOI">10.1126/science.281.5383.1659</ext-link>, 1998.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>Amend et al.(2013)</label><?label ALMS13?><mixed-citation>Amend, J. P., LaRowe, D. E., McCollom, T. M., and Shock, E. L.: The
energetics of organic synthesis inside and outside the cell, Philos.
T. R. Soc. B, 368, 20120255,
<ext-link xlink:href="https://doi.org/10.1098/rstb.2012.0255" ext-link-type="DOI">10.1098/rstb.2012.0255</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>Anderson(2005)</label><?label And05?><mixed-citation>Anderson, G. M.: Thermodynamics of Natural Systems, Cambridge University
Press, Cambridge, 2nd Edn., <ext-link xlink:href="https://doi.org/10.1017/CBO9780511840258" ext-link-type="DOI">10.1017/CBO9780511840258</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx8"><label>Asplund-Samuelsson et al.(2016)</label><?label ASD+16?><mixed-citation>Asplund-Samuelsson, J., Sundh, J., Dupont, C. L., Allen, A. E., McCrow, J. P.,
Celepli, N. A., Bergman, B., Ininbergs, K., and Ekman, M.: Diversity and
expression of bacterial metacaspases in an aquatic ecosystem, Front.
Microbiol., 7, 1043, <ext-link xlink:href="https://doi.org/10.3389/fmicb.2016.01043" ext-link-type="DOI">10.3389/fmicb.2016.01043</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx9"><label>Baudouin-Cornu et al.(2001)</label><?label BSMT01?><mixed-citation>Baudouin-Cornu, P., Surdin-Kerjan, Y., Marlière, P., and Thomas, D.:
Molecular evolution of protein atomic composition, Science, 293, 297–300,
<ext-link xlink:href="https://doi.org/10.1126/science.1061052" ext-link-type="DOI">10.1126/science.1061052</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>Bjellqvist et al.(1993)</label><?label BHP+93?><mixed-citation>Bjellqvist, B., Hughes, G. J., Pasquali, C., Paquet, N., Ravier, F., Sanchez,
J.-C., Frutiger, S., and Hochstrasser, D.: The focusing positions of
polypeptides in immobilized pH gradients can be predicted from their amino
acid sequences, Electrophoresis, 14, 1023–1031,
<ext-link xlink:href="https://doi.org/10.1002/elps.11501401163" ext-link-type="DOI">10.1002/elps.11501401163</ext-link>, 1993.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Bjellqvist et al.(1994)</label><?label BBOC94?><mixed-citation>Bjellqvist, B., Basse, B., Olsen, E., and Celis, J. E.: Reference points for
comparisons of two-dimensional maps of proteins from different human cell
types defined in a pH scale where isoelectric points correlate with
polypeptide compositions, Electrophoresis, 15, 529–539,
<ext-link xlink:href="https://doi.org/10.1002/elps.1150150171" ext-link-type="DOI">10.1002/elps.1150150171</ext-link>, 1994.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Boyd et al.(2014)</label><?label BHS+14?><mixed-citation>Boyd, E. S., Hamilton, T. L., Swanson, K. D., Howells, A. E., Baxter, B. K.,
Meuser, J. E., Posewitz, M. C., and Peters, J. W.: [FeFe]-hydrogenase
abundance and diversity along a vertical redox gradient in Great Salt
Lake, USA, Int. J. Mol. Sci., 15,
21947–21966, <ext-link xlink:href="https://doi.org/10.3390/ijms151221947" ext-link-type="DOI">10.3390/ijms151221947</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx13"><label>Boyer et al.(2020)</label><?label BSS+20?><mixed-citation>Boyer, G. M., Schubotz, F., Summons, R. E., Woods, J., and Shock, E. L.: Carbon
oxidation state in microbial polar lipids suggests adaptation to hot spring
temperature and redox gradients, Front. Microbiol., 11, 229,
<ext-link xlink:href="https://doi.org/10.3389/fmicb.2020.00229" ext-link-type="DOI">10.3389/fmicb.2020.00229</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx14"><label>Braakman and Smith(2013)</label><?label BS13?><mixed-citation>Braakman, R. and Smith, E.: The compositional and evolutionary logic of
metabolism, Phys. Biol., 10, 011001,
<ext-link xlink:href="https://doi.org/10.1088/1478-3975/10/1/011001" ext-link-type="DOI">10.1088/1478-3975/10/1/011001</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>Burg et al.(2007)</label><?label BFD07?><mixed-citation>Burg, M. B., Ferraris, J. D., and Dmitrieva, N. I.: Cellular response to
hyperosmotic stresses, Physiol. Rev., 87, 1441–1474,
<ext-link xlink:href="https://doi.org/10.1152/physrev.00056.2006" ext-link-type="DOI">10.1152/physrev.00056.2006</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx16"><label>Canovas and Shock(2020)</label><?label CS20?><mixed-citation>Canovas III, P. A. and Shock, E. L.: Energetics of the citric acid cycle in
the deep biosphere, in: Carbon in Earth's Interior, edited by: Manning,
C. E., Lin, J.-F., and Mao, W. L., chap. 25,   American
Geophysical Union, 303–327, <ext-link xlink:href="https://doi.org/10.1002/9781119508229.ch25" ext-link-type="DOI">10.1002/9781119508229.ch25</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>Chirife et al.(1981)</label><?label CFS81?><mixed-citation>Chirife, J., Fontan, C. F., and Scorza, O. C.: The intracellular water
activity of bacteria in relation to the water activity of the growth medium,
J. Appl. Bacteriol., 50, 475–479,
<ext-link xlink:href="https://doi.org/10.1111/j.1365-2672.1981.tb04250.x" ext-link-type="DOI">10.1111/j.1365-2672.1981.tb04250.x</ext-link>, 1981.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>DeBerardinis and Cheng(2010)</label><?label DC10a?><mixed-citation>DeBerardinis, R. J. and Cheng, T.: Q's next: The diverse functions of
glutamine in metabolism, cell biology and cancer, Oncogene, 29, 313–324,
<ext-link xlink:href="https://doi.org/10.1038/onc.2009.358" ext-link-type="DOI">10.1038/onc.2009.358</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx19"><label>Dick(2014)</label><?label Dic14?><mixed-citation>Dick, J. M.: Average oxidation state of carbon in proteins, J.
Roy. Soc. Int., 11, 20131095, <ext-link xlink:href="https://doi.org/10.1098/rsif.2013.1095" ext-link-type="DOI">10.1098/rsif.2013.1095</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx20"><label>Dick(2016)</label><?label Dic16?><mixed-citation>Dick, J. M.: Proteomic indicators of oxidation and hydration state in
colorectal cancer, PeerJ, 4, e2238, <ext-link xlink:href="https://doi.org/10.7717/peerj.2238" ext-link-type="DOI">10.7717/peerj.2238</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx21"><label>Dick(2017)</label><?label Dic17?><mixed-citation>Dick, J. M.: Chemical composition and the potential for proteomic
transformation in cancer, hypoxia, and hyperosmotic stress, PeerJ, 5, e3421,
<ext-link xlink:href="https://doi.org/10.7717/peerj.3421" ext-link-type="DOI">10.7717/peerj.3421</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx22"><label>Dick(2020a)</label><?label Dic20?><mixed-citation>Dick, J. M.: Water as a reactant in the differential expression of proteins
in cancer, bioRxiv, <ext-link xlink:href="https://doi.org/10.1101/2020.04.09.035022" ext-link-type="DOI">10.1101/2020.04.09.035022</ext-link>, 2020a.</mixed-citation></ref>
      <ref id="bib1.bibx23"><label>Dick(2020b)</label><?label JMDplots1.2.4?><mixed-citation>Dick, J. M.: JMDplots 1.2.4, Zenodo, <ext-link xlink:href="https://doi.org/10.5281/zenodo.4111016" ext-link-type="DOI">10.5281/zenodo.4111016</ext-link>,
2020b.</mixed-citation></ref>
      <ref id="bib1.bibx24"><label>Dick(2020c)</label><?label canprot1.1.0?><mixed-citation>Dick, J. M.: canprot 1.1.0, Zenodo, <ext-link xlink:href="https://doi.org/10.5281/zenodo.4105653" ext-link-type="DOI">10.5281/zenodo.4105653</ext-link>,
2020c.</mixed-citation></ref>
      <ref id="bib1.bibx25"><label>Dick and Shock(2011)</label><?label DS11?><mixed-citation>Dick, J. M. and Shock, E. L.: Calculation of the relative chemical
stabilities of proteins as a function of temperature and redox chemistry in a
hot spring, PLOS One, 6, e22782, <ext-link xlink:href="https://doi.org/10.1371/journal.pone.0022782" ext-link-type="DOI">10.1371/journal.pone.0022782</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx26"><label>Dick et al.(2019)</label><?label DYTL19?><mixed-citation>Dick, J. M., Yu, M., Tan, J., and Lu, A.: Changes in carbon oxidation state
of metagenomes along geochemical redox gradients, Front. Microbiol.,
10, 120, <ext-link xlink:href="https://doi.org/10.3389/fmicb.2019.00120" ext-link-type="DOI">10.3389/fmicb.2019.00120</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx27"><label>Du et al.(2018)</label><?label DZMP18?><mixed-citation>Du, B., Zielinski, D. C., Monk, J. M., and Palsson, B. O.: Thermodynamic
favorability and pathway yield as evolutionary tradeoffs in biosynthetic
pathway choice, P. Natl. Acad. Sci., 115,
11339–11344, <ext-link xlink:href="https://doi.org/10.1073/pnas.1805367115" ext-link-type="DOI">10.1073/pnas.1805367115</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx28"><label>Dupont et al.(2014)</label><?label DLY+14?><mixed-citation>Dupont, C. L., Larsson, J., Yooseph, S., Ininbergs, K., Goll, J.,
Asplund-Samuelsson, J., McCrow, J. P., Celepli, N., Allen, L. Z., Ekman, M.,
Lucas, A. J., Hagström, Å., Thiagarajan, M., Brindefalk, B., Richter,
A. R., Andersson, A. F., Tenney, A., Lundin, D., Tovchigrechko, A., Nylander,
J. A. A., Brami, D., Badger, J. H., Allen, A. E., Rusch, D. B., Hoffman, J.,
Norrby, E., Friedman, R., Pinhassi, J., Venter, J. C., and Bergman, B.:
Functional tradeoffs underpin salinity-driven divergence in microbial
community composition, PLOS One, 9, e89549, <ext-link xlink:href="https://doi.org/10.1371/journal.pone.0089549" ext-link-type="DOI">10.1371/journal.pone.0089549</ext-link>,
2014.</mixed-citation></ref>
      <ref id="bib1.bibx29"><label>Eiler et al.(2014)</label><?label EZM+14?><mixed-citation>Eiler, A., Zaremba-Niedzwiedzka, K., Martínez-García, M., McMahon, K. D.,
Stepanauskas, R., Andersson, S. G. E., and Bertilsson, S.: Productivity and
salinity structuring of the microplankton revealed by comparative freshwater
metagenomics, Environ. Microbiol., 16, 2682–2698,
<ext-link xlink:href="https://doi.org/10.1111/1462-2920.12301" ext-link-type="DOI">10.1111/1462-2920.12301</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx30"><label>Feist et al.(2007)</label><?label FHR+07?><mixed-citation>Feist, A. M., Henry, C. S., Reed, J. L., Krummenacker, M., Joyce, A. R., Karp,
P. D., Broadbelt, L. J., Hatzimanikatis, V., and Palsson, B. Ø.: A
genome-scale metabolic reconstruction for <italic>Escherichia coli</italic> K-12
MG1655 that accounts for 1260 ORFs and thermodynamic information,
Mol. Syst. Biol., 3, 121, <ext-link xlink:href="https://doi.org/10.1038/msb4100155" ext-link-type="DOI">10.1038/msb4100155</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx31"><label>Fernandez et al.(2013)</label><?label FGM+13?><mixed-citation>Fernandez, A. B., Ghai, R., Martin-Cuadrado, A. B., Sanchez-Porro, C.,
Rodriguez-Valera, F., and Ventosa, A.: Metagenome sequencing of prokaryotic
microbiota from two hypersaline ponds of a marine saltern in Santa Pola,
Spain, Genome Announcements, 1, e00933-13, <ext-link xlink:href="https://doi.org/10.1128/genomea.00933-13" ext-link-type="DOI">10.1128/genomea.00933-13</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx32"><label>Finn et al.(2015)</label><?label FRH+15?><mixed-citation>Finn, S., Rogers, L., Händler, K., McClure, P., Amézquita, A., Hinton,
J. C. D., and Fanning, S.: Exposure of <italic>Salmonella enterica</italic> serovar
Typhimurium to three humectants used in the food industry induces different
osmoadaptation systems, Appl. Environ. Microbiol., 81,
6800–6811, <ext-link xlink:href="https://doi.org/10.1128/AEM.01379-15" ext-link-type="DOI">10.1128/AEM.01379-15</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx33"><label>Fortunato et al.(2018)</label><?label FLBH18?><mixed-citation>Fortunato, C. S., Larson, B., Butterfield, D. A., and Huber, J. A.: Spatially
distinct, temporally stable microbial populations mediate biogeochemical
cycling at and below the seafloor in hydrothermal vent fluids, Environ.
Microbiol., 20, 769–784, <ext-link xlink:href="https://doi.org/10.1111/1462-2920.14011" ext-link-type="DOI">10.1111/1462-2920.14011</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx34"><label>Garner and Burg(1994)</label><?label GB94a?><mixed-citation>Garner, M. M. and Burg, M. B.: Macromolecular crowding and confinement in
cells exposed to hypertonicity, Am. J. Physiol., 266,
C877–C892, <ext-link xlink:href="https://doi.org/10.1152/ajpcell.1994.266.4.C877" ext-link-type="DOI">10.1152/ajpcell.1994.266.4.C877</ext-link>, 1994.</mixed-citation></ref>
      <ref id="bib1.bibx35"><label>Gasteiger et al.(2005)</label><?label GHG+05?><mixed-citation>Gasteiger, E., Hoogland, C., Gattiker, A., Duvaud, S., Wilkins, M. R., Appel,
R. D., and Bairoch, A.: Protein identification and analysis tools on the
ExPASy server, in: The Proteomics Protocols Handbook, edited by: Walker,
J. M., Humana Press, Totowa, NJ,
571–607, <ext-link xlink:href="https://doi.org/10.1385/1-59259-890-0:571" ext-link-type="DOI">10.1385/1-59259-890-0:571</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx36"><label>Ghai et al.(2011)</label><?label GPF+11?><mixed-citation>Ghai, R., Pašić, L., Fernández, A. B., Martin-Cuadrado, A.-B., Mizuno,
C. M., McMahon, K. D., Papke, R. T., Stepanauskas, R., Rodriguez-Brito, B.,
Rohwer, F., Sánchez-Porro, C., Ventosa, A., and Rodríguez-Valera, F.:
New abundant microbial groups in aquatic hypersaline environments,
Sci. Rep., 1, 135, <ext-link xlink:href="https://doi.org/10.1038/srep00135" ext-link-type="DOI">10.1038/srep00135</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx37"><label>Gunde-Cimerman et al.(2018)</label><?label GPO18?><mixed-citation>Gunde-Cimerman, N., Plemenitaš, A., and Oren, A.: Strategies of adaptation
of microorganisms of the three domains of life to high salt concentrations,
FEMS Microbiol. Rev., 42, 353–375, <ext-link xlink:href="https://doi.org/10.1093/femsre/fuy009" ext-link-type="DOI">10.1093/femsre/fuy009</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx38"><label>Han et al.(2017)</label><?label HLL17?><mixed-citation>Han, D., Link, H., and Liesack, W.: Response of <italic>Methylocystis</italic> sp.
strain SC2 to salt stress: Physiology, global transcriptome, and amino acid
profiles, Appl. Environ. Microbiol., 83, e00866,
<ext-link xlink:href="https://doi.org/10.1128/AEM.00866-17" ext-link-type="DOI">10.1128/AEM.00866-17</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx39"><label>Han et al.(2005)</label><?label HZP+05?><mixed-citation>Han, Y., Zhou, D., Pang, X., Zhang, L., Song, Y., Tong, Z., Bao, J., Dai, E.,
Wang, J., Guo, Z., Zhai, J., Du, Z., Wang, X., Wang, J., Huang, P., and Yang,
R.: Comparative transcriptome analysis of <italic>Yersinia pestis</italic> in
response to hyperosmotic and high-salinity stress, Res. Microbiol.,
156, 403–415, <ext-link xlink:href="https://doi.org/10.1016/j.resmic.2004.10.004" ext-link-type="DOI">10.1016/j.resmic.2004.10.004</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx40"><label>Havig et al.(2011)</label><?label HRM+11?><mixed-citation>Havig, J. R., Raymond, J., Meyer-Dombard, D. R., Zolotova, N., and Shock,
E. L.: Merging isotopes and community genomics in a siliceous
sinter-depositing hot spring, J. Geophys. Res., 116, G01005,
<ext-link xlink:href="https://doi.org/10.1029/2010JG001415" ext-link-type="DOI">10.1029/2010JG001415</ext-link>, 2011.</mixed-citation></ref>
      <?pagebreak page6160?><ref id="bib1.bibx41"><label>Huang et al.(2011)</label><?label HMS+11?><mixed-citation>Huang, H., McGarvey, P. B., Suzek, B. E., Mazumder, R., Zhang, J., Chen, Y.,
and Wu, C. H.: A comprehensive protein-centric ID mapping service for
molecular data integration, Bioinformatics, 27, 1190–1191,
<ext-link xlink:href="https://doi.org/10.1093/bioinformatics/btr101" ext-link-type="DOI">10.1093/bioinformatics/btr101</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx42"><?xmltex \def\ref@label{{Jevti\'{c} et~al.(2019)}}?><label>Jevtić et al.(2019)</label><?label JSP+19?><mixed-citation>Jevtić, v., Stoll, B., Pfeiffer, F., Sharma, K., Urlaub, H., Marchfelder, A.,
and Lenz, C.: The response of <italic>Haloferax volcanii</italic> to salt and
temperature stress: A proteome study by label-free mass spectrometry,
Proteomics, 19, 1800491, <ext-link xlink:href="https://doi.org/10.1002/pmic.201800491" ext-link-type="DOI">10.1002/pmic.201800491</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx43"><label>Kanesaki et al.(2002)</label><?label KSA+02?><mixed-citation>Kanesaki, Y., Suzuki, I., Allakhverdiev, S. I., Mikami, K., and Murata, N.:
Salt stress and hyperosmotic stress regulate the expression of different sets
of genes in <italic>Synechocystis</italic> sp. PCC 6803, Biochem.
Biophys. Res. Commun., 290, 339–348,
<ext-link xlink:href="https://doi.org/10.1006/bbrc.2001.6201" ext-link-type="DOI">10.1006/bbrc.2001.6201</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx44"><label>Karl and Grabowski(2017)</label><?label KG17?><mixed-citation>Karl, D. M. and Grabowski, E.: The importance of H in particulate organic
matter stoichiometry, export and energy flow, Front. Microbiol., 8,
826, <ext-link xlink:href="https://doi.org/10.3389/fmicb.2017.00826" ext-link-type="DOI">10.3389/fmicb.2017.00826</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx45"><label>Kauffman(1986)</label><?label Kau86?><mixed-citation>Kauffman, J. M.: Simple method for determination of oxidation numbers of
atoms in compounds, J. Chem. Educat., 63, 474–475,
<ext-link xlink:href="https://doi.org/10.1021/ed063p474" ext-link-type="DOI">10.1021/ed063p474</ext-link>, 1986.</mixed-citation></ref>
      <ref id="bib1.bibx46"><label>Keegan et al.(2016)</label><?label KGM16?><mixed-citation>Keegan, K. P., Glass, E. M., and Meyer, F.: MG-RAST, a metagenomics service
for analysis of microbial community structure and function, in: Microbial
Environmental Genomics (MEG), edited by: Martin, F. and Uroz, S.,
Springer, New York, 207–233, <ext-link xlink:href="https://doi.org/10.1007/978-1-4939-3369-3_13" ext-link-type="DOI">10.1007/978-1-4939-3369-3_13</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx47"><label>Kimbrel et al.(2018)</label><?label KBW+18?><mixed-citation>Kimbrel, J. A., Ballor, N., Wu, Y.-W., David, M. M., Hazen, T. C., Simmons,
B. A., Singer, S. W., and Jansson, J. K.: Microbial community structure and
functional potential along a hypersaline gradient, Front. Microbiol.,
9, 1492, <ext-link xlink:href="https://doi.org/10.3389/fmicb.2018.01492" ext-link-type="DOI">10.3389/fmicb.2018.01492</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx48"><label>Kocharunchitt et al.(2014)</label><?label KKG+14?><mixed-citation>Kocharunchitt, C., King, T., Gobius, K., Bowman, J. P., and Ross, T.: Global
genome response of <italic>Escherichia coli</italic> O157:H7 Sakai during dynamic
changes in growth kinetics induced by an abrupt downshift in water activity,
PLOS One, 9, e90422, <ext-link xlink:href="https://doi.org/10.1371/journal.pone.0090422" ext-link-type="DOI">10.1371/journal.pone.0090422</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx49"><label>Kohler et al.(2015)</label><?label KLB+15?><mixed-citation>Kohler, C., Lourenço, R. F., Bernhardt, J., Albrecht, D., Schüler, J.,
Hecker, M., and Gomes, S. L.: A comprehensive genomic, transcriptomic and
proteomic analysis of a hyperosmotic stress sensitive
<inline-formula><mml:math id="M295" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula>-proteobacterium, BMC Microbiol., 15, 71,
<ext-link xlink:href="https://doi.org/10.1186/s12866-015-0404-x" ext-link-type="DOI">10.1186/s12866-015-0404-x</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx50"><label>Kopylova et al.(2012)</label><?label KNT12?><mixed-citation>Kopylova, E., Noé, L., and Touzet, H.: SortMeRNA: Fast and accurate
filtering of ribosomal RNAs in metatranscriptomic data, Bioinformatics, 28,
3211–3217, <ext-link xlink:href="https://doi.org/10.1093/bioinformatics/bts611" ext-link-type="DOI">10.1093/bioinformatics/bts611</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx51"><label>Kunin et al.(2008)</label><?label KRH+08?><mixed-citation>Kunin, V., Raes, J., Harris, J. K., Spear, J. R., Walker, J. J., Ivanova, N.,
von Mering, C., Bebout, B. M., Pace, N. R., Bork, P., and Hugenholtz, P.:
Millimeter-scale genetic gradients and community-level molecular
convergence in a hypersaline microbial mat, Mol. Syst. Biol., 4,
198, <ext-link xlink:href="https://doi.org/10.1038/msb.2008.35" ext-link-type="DOI">10.1038/msb.2008.35</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx52"><label>Kyte and Doolittle(1982)</label><?label KD82?><mixed-citation>Kyte, J. and Doolittle, R. F.: A simple method for displaying the hydropathic
character of a protein, J. Mol. Biol., 157, 105–132,
<ext-link xlink:href="https://doi.org/10.1016/0022-2836(82)90515-0" ext-link-type="DOI">10.1016/0022-2836(82)90515-0</ext-link>, 1982.</mixed-citation></ref>
      <ref id="bib1.bibx53"><label>LaRowe and Van Cappellen(2011)</label><?label LV11?><mixed-citation>LaRowe, D. E. and Van Cappellen, P.: Degradation of natural organic matter:
A thermodynamic analysis, Geochim. Cosmochim. Ac., 75, 2030–2042,
<ext-link xlink:href="https://doi.org/10.1016/j.gca.2011.01.020" ext-link-type="DOI">10.1016/j.gca.2011.01.020</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx54"><label>Leuko et al.(2009)</label><?label LRB+09?><mixed-citation>Leuko, S., Raftery, M. J., Burns, B. P., Walter, M. R., and Neilan, B. A.:
Global protein-level responses of <italic>Halobacterium salinarum</italic> NRC-1
to prolonged changes in external sodium chloride concentrations, J.
Proteome Res., 8, 2218–2225, <ext-link xlink:href="https://doi.org/10.1021/pr800663c" ext-link-type="DOI">10.1021/pr800663c</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx55"><label>Ley et al.(2006)</label><?label LHW+06?><mixed-citation>Ley, R. E., Harris, J. K., Wilcox, J., Spear, J. R., Miller, S. R., Bebout,
B. M., Maresca, J. A., Bryant, D. A., Sogin, M. L., and Pace, N. R.:
Unexpected diversity and complexity of the Guerrero Negro hypersaline
microbial mat, Appl. Environ. Microbiol., 72, 3685–3695,
<ext-link xlink:href="https://doi.org/10.1128/AEM.72.5.3685-3695.2006" ext-link-type="DOI">10.1128/AEM.72.5.3685-3695.2006</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx56"><label>Lin et al.(2017)</label><?label LLYL17?><mixed-citation>Lin, J., Liang, H., Yan, J., and Luo, L.: The molecular mechanism and
post-transcriptional regulation characteristic of <italic>Tetragenococcus halophilus</italic> acclimation to osmotic stress revealed by quantitative
proteomics, J. Proteomics, 168, 1–14,
<ext-link xlink:href="https://doi.org/10.1016/j.jprot.2017.08.014" ext-link-type="DOI">10.1016/j.jprot.2017.08.014</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx57"><label>Lindsay et al.(2018)</label><?label LAF+18?><mixed-citation>Lindsay, M. R., Amenabar, M. J., Fecteau, K. M., Debes II, R. V.,
Fernandes Martins, M. C., Fristad, K. E., Xu, H., Hoehler, T. M., Shock,
E. L., and Boyd, E. S.: Subsurface processes influence oxidant availability
and chemoautotrophic hydrogen metabolism in Yellowstone hot springs,
Geobiology, 16, 674–692, <ext-link xlink:href="https://doi.org/10.1111/gbi.12308" ext-link-type="DOI">10.1111/gbi.12308</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx58"><label>May and Rowland(2018)</label><?label MR18?><mixed-citation>May, P. M. and Rowland, D.: JESS, a Joint Expert Speciation System –
VI: thermodynamically-consistent standard Gibbs energies of reaction for
aqueous solutions, New J. Chem., 42, 7617–7629,
<ext-link xlink:href="https://doi.org/10.1039/C7NJ03597G" ext-link-type="DOI">10.1039/C7NJ03597G</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx59"><label>Milo et al.(2010)</label><?label MJM+10?><mixed-citation>Milo, R., Jorgensen, P., Moran, U., Weber, G., and Springer, M.:
BioNumbers—the database of key numbers in molecular and cell biology,
Nucl. Acid. Res., 38, D750–D753, <ext-link xlink:href="https://doi.org/10.1093/nar/gkp889" ext-link-type="DOI">10.1093/nar/gkp889</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx60"><label>Minkiewicz et al.(2018)</label><?label MDI18?><mixed-citation>Minkiewicz, P., Darewicz, M., and Iwaniak, A.: Introducing a simple equation to
express oxidation states as an alternative to using rules associated with
words alone, J. Chem. Educat., 95, 340–342,
<ext-link xlink:href="https://doi.org/10.1021/acs.jchemed.7b00322" ext-link-type="DOI">10.1021/acs.jchemed.7b00322</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx61"><label>Morowitz(1999)</label><?label Mor99?><mixed-citation>Morowitz, H. J.: A theory of biochemical organization, metabolic pathways,
and evolution, Complexity, 4, 39–53,
<ext-link xlink:href="https://doi.org/10.1002/(SICI)1099-0526(199907/08)4:6&lt;39::AID-CPLX8&gt;3.0.CO;2-2" ext-link-type="DOI">10.1002/(SICI)1099-0526(199907/08)4:6&lt;39::AID-CPLX8&gt;3.0.CO;2-2</ext-link>, 1999.</mixed-citation></ref>
      <ref id="bib1.bibx62"><label>Möller et al.(2016)</label><?label MLC+16a?><mixed-citation>Möller, M. N., Li, Q., Chinnaraj, M., Cheung, H. C., Lancaster, J. R., and
Denicola, A.: Solubility and diffusion of oxygen in phospholipid membranes,
Biochim. Biophys. Ac., 1858, 2923–2930,
<ext-link xlink:href="https://doi.org/10.1016/j.bbamem.2016.09.003" ext-link-type="DOI">10.1016/j.bbamem.2016.09.003</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx63"><label>O'Leary et al.(2016)</label><?label OWB+16?><mixed-citation>O'Leary, N. A., Wright, M. W., Brister, J. R., Ciufo, S., Haddad, D., McVeigh,
R., Rajput, B., Robbertse, B., Smith-White, B., Ako-Adjei, D., Astashyn, A.,
Badretdin, A., Bao, Y., Blinkova, O., Brover, V., Chetvernin, V., Choi, J.,
Cox, E., Ermolaeva, O., Farrell, C. M., Goldfarb, T., Gupta, T., Haft, D.,
Hatcher, E., Hlavina, W., Joardar, V. S., Kodali, V. K., Li, W., Maglott, D.,
Masterson, P., McGarvey, K. M., Murphy, M. R., O'Neill, K., Pujar, S.,
Rangwala, S. H., Rausch, D., Riddick, L. D., Schoch, C., Shkeda, A., Storz,
S. S., Sun, H., Thibaud-Nissen, F., Tolstoy, I., Tully, R. E., Vatsan, A. R.,
Wallin, C., Webb, D., Wu, W., Landrum, M. J., Kimchi, A., Tatusova, T.,
DiCuccio, M., Kitts, P., Murphy, T. D., and Pruitt, K. D.: Reference
sequence (RefSeq) database at NCBI: current status, taxonomic
expansion, and functional annotation, Nucl. Acid. Res., 44, D733–D745,
<ext-link xlink:href="https://doi.org/10.1093/nar/gkv1189" ext-link-type="DOI">10.1093/nar/gkv1189</ext-link>, 2016.</mixed-citation></ref>
      <?pagebreak page6161?><ref id="bib1.bibx64"><label>Ooka et al.(2019)</label><?label OMN19?><mixed-citation>Ooka, H., McGlynn, S. E., and Nakamura, R.: Electrochemistry at deep-sea
hydrothermal vents: Utilization of the thermodynamic driving force towards
the autotrophic origin of life, ChemElectroChem, 6, 1316–1323,
<ext-link xlink:href="https://doi.org/10.1002/celc.201801432" ext-link-type="DOI">10.1002/celc.201801432</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx65"><label>Oren(2013)</label><?label Ore13?><mixed-citation>Oren, A.: Life at high salt concentrations, intracellular KCl
concentrations, and acidic proteomes, Front. Microbiol., 4, 315,
<ext-link xlink:href="https://doi.org/10.3389/fmicb.2013.00315" ext-link-type="DOI">10.3389/fmicb.2013.00315</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx66"><label>Paul et al.(2008)</label><?label PBD+08?><mixed-citation>Paul, S., Bag, S. K., Das, S., Harvill, E. T., and Dutta, C.: Molecular
signature of hypersaline adaptation: insights from genome and proteome
composition of halophilic prokaryotes, Genome Biol., 9, R70,
<ext-link xlink:href="https://doi.org/10.1186/gb-2008-9-4-r70" ext-link-type="DOI">10.1186/gb-2008-9-4-r70</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx67"><label>Poudel et al.(2018)</label><?label PCF+18?><mixed-citation>Poudel, S., Colman, D. R., Fixen, K. R., Ledbetter, R. N., Zheng, Y., Pence,
N., Seefeldt, L. C., Peters, J. W., Harwood, C. S., and Boyd, E. S.:
Electron transfer to nitrogenase in different genomic and metabolic
backgrounds, J. Bacteriol., 200, e00757,
<ext-link xlink:href="https://doi.org/10.1128/JB.00757-17" ext-link-type="DOI">10.1128/JB.00757-17</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx68"><label>Qiao et al.(2013)</label><?label QHT+13?><mixed-citation>Qiao, J., Huang, S., Te, R., Wang, J., Chen, L., and Zhang, W.: Integrated
proteomic and transcriptomic analysis reveals novel genes and regulatory
mechanisms involved in salt stress responses in <italic>Synechocystis</italic> sp.
PCC 6803, Appl. Microbiol. Biotechnol., 97, 8253–8264,
<ext-link xlink:href="https://doi.org/10.1007/s00253-013-5139-8" ext-link-type="DOI">10.1007/s00253-013-5139-8</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx69"><label>R Core Team(2020)</label><?label RCT20?><mixed-citation>R Core Team: R: A Language and Environment for Statistical
Computing, R Foundation for Statistical Computing, Vienna, Austria,
<uri>https://www.R-project.org</uri>, last access: 21 October 2020.</mixed-citation></ref>
      <ref id="bib1.bibx70"><label>Record et al.(1998)</label><?label RCCG98?><mixed-citation>Record, Jr., M. T., Courtenay, E. S., Cayley, D. S., and Guttman, H. J.:
Responses of <italic>E. coli</italic> to osmotic stress: Large changes in amounts
of cytoplasmic solutes and water, Trend. Biochem. Sci., 23,
143–148, <ext-link xlink:href="https://doi.org/10.1016/S0968-0004(98)01196-7" ext-link-type="DOI">10.1016/S0968-0004(98)01196-7</ext-link>, 1998.</mixed-citation></ref>
      <ref id="bib1.bibx71"><label>Reeves et al.(2014)</label><?label RMS14?><mixed-citation>Reeves, E. P., McDermott, J. M., and Seewald, J. S.: The origin of
methanethiol in midocean ridge hydrothermal fluids, P.
Natl. Acad. Sci., 111, 5474–5479, <ext-link xlink:href="https://doi.org/10.1073/pnas.1400643111" ext-link-type="DOI">10.1073/pnas.1400643111</ext-link>,
2014.</mixed-citation></ref>
      <ref id="bib1.bibx72"><label>Reveillaud et al.(2016)</label><?label RRM+16?><mixed-citation>Reveillaud, J., Reddington, E., McDermott, J., Algar, C., Meyer, J. L., Sylva,
S., Seewald, J., German, C. R., and Huber, J. A.: Subseafloor microbial
communities in hydrogen-rich vent fluids from hydrothermal systems along the
Mid-Cayman Rise, Environ. Microbiol., 18, 1970–1987,
<ext-link xlink:href="https://doi.org/10.1111/1462-2920.13173" ext-link-type="DOI">10.1111/1462-2920.13173</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx73"><label>Rho et al.(2010)</label><?label RTY10?><mixed-citation>Rho, M., Tang, H., and Ye, Y.: FragGeneScan: Predicting genes in short
and error-prone reads, Nucl. Acid. Res., 38, e191,
<ext-link xlink:href="https://doi.org/10.1093/nar/gkq747" ext-link-type="DOI">10.1093/nar/gkq747</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx74"><label>Rhodes et al.(2010)</label><?label RFOH10?><mixed-citation>Rhodes, M. E., Fitz-Gibbon, S. T., Oren, A., and House, C. H.: Amino acid
signatures of salinity on an environmental scale with a focus on the Dead
Sea, Environ. Microbiol., 12, 2613–2623,
<ext-link xlink:href="https://doi.org/10.1111/j.1462-2920.2010.02232.x" ext-link-type="DOI">10.1111/j.1462-2920.2010.02232.x</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx75"><label>Rodriguez-Brito et al.(2010)</label><?label RLW+10?><mixed-citation>Rodriguez-Brito, B., Li, L., Wegley, L., Furlan, M., Angly, F., Breitbart, M.,
Buchanan, J., Desnues, C., Dinsdale, E., Edwards, R., Felts, B., Haynes, M.,
Liu, H., Lipson, D., Mahaffy, J., Martin-Cuadrado, A. B., Mira, A., Nulton,
J., Pašić, L., Rayhawk, S., Rodriguez-Mueller, J., Rodriguez-Valera, F.,
Salamon, P., Srinagesh, S., Thingstad, T. F., Tran, T., Thurber, R. V.,
Willner, D., Youle, M., and Rohwer, F.: Viral and microbial community
dynamics in four aquatic environments, ISME J., 4, 739–751,
<ext-link xlink:href="https://doi.org/10.1038/ismej.2010.1" ext-link-type="DOI">10.1038/ismej.2010.1</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx76"><label>Satinsky et al.(2014)</label><?label SZD+14?><mixed-citation>Satinsky, B. M., Zielinski, B. L., Doherty, M., Smith, C. B., Sharma, S., Paul,
J. H., Crump, B. C., and Moran, M. A.: The Amazon continuum dataset:
quantitative metagenomic and metatranscriptomic inventories of the Amazon
River plume, June 2010, Microbiome, 2, 17, <ext-link xlink:href="https://doi.org/10.1186/2049-2618-2-17" ext-link-type="DOI">10.1186/2049-2618-2-17</ext-link>,
2014.</mixed-citation></ref>
      <ref id="bib1.bibx77"><label>Satinsky et al.(2015)</label><?label SFD+15?><mixed-citation>Satinsky, B. M., Fortunato, C. S., Doherty, M., Smith, C. B., Sharma, S., Ward,
N. D., Krusche, A. V., Yager, P. L., Richey, J. E., Moran, M. A., and Crump,
B. C.: Metagenomic and metatranscriptomic inventories of the lower Amazon
River, May 2011, Microbiome, 3, 39, <ext-link xlink:href="https://doi.org/10.1186/s40168-015-0099-0" ext-link-type="DOI">10.1186/s40168-015-0099-0</ext-link>,
2015.</mixed-citation></ref>
      <ref id="bib1.bibx78"><label>Schmidt et al.(2016)</label><?label SKV+16?><mixed-citation>Schmidt, A., Kochanowski, K., Vedelaar, S., Ahrné, E., Volkmer, B., Callipo,
L., Knoops, K., Bauer, M., Aebersold, R., and Heinemann, M.: The quantitative
and condition-dependent <italic>Escherichia coli</italic> proteome, Nat.
Biotechnol., 34, 104–110, <ext-link xlink:href="https://doi.org/10.1038/nbt.3418" ext-link-type="DOI">10.1038/nbt.3418</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx79"><label>Shabala et al.(2009)</label><?label SBB+09?><mixed-citation>Shabala, L., Bowman, J., Brown, J., Ross, T., McMeekin, T., and Shabala, S.:
Ion transport and osmotic adjustment in <italic>Escherichia coli</italic> in response
to ionic and non-ionic osmotica, Environ. Microbiol., 11, 137–148,
<ext-link xlink:href="https://doi.org/10.1111/j.1462-2920.2008.01748.x" ext-link-type="DOI">10.1111/j.1462-2920.2008.01748.x</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx80"><label>Shock et al.(2010)</label><?label SHM+10?><mixed-citation>Shock, E. L., Holland, M., Meyer-Dombard, D. R., Amend, J. P., Osburn, G. R.,
and Fischer, T. P.: Quantifying inorganic sources of geochemical energy in
hydrothermal ecosystems, Yellowstone National Park, USA, Geochim.
Cosmochim. Ac., 74, 4005–4043, <ext-link xlink:href="https://doi.org/10.1016/j.gca.2009.08.036" ext-link-type="DOI">10.1016/j.gca.2009.08.036</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx81"><label>Simon et al.(2014)</label><?label SSH14d?><mixed-citation>Simon, H. M., Smith, M. W., and Herfort, L.: Metagenomic insights into
particles and their associated microbiota in a coastal margin ecosystem,
Front. Microbiol., 5, 466, <ext-link xlink:href="https://doi.org/10.3389/fmicb.2014.00466" ext-link-type="DOI">10.3389/fmicb.2014.00466</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx82"><label>Slonczewski et al.(2009)</label><?label SFDK09?><mixed-citation>Slonczewski, J. L., Fujisawa, M., Dopson, M., and Krulwich, T. A.:
Cytoplasmic pH measurement and homeostasis in bacteria and archaea, in:
Advances in Microbial Physiology, edited by: Poole, R. K., Vol. 55,
Academic Press, New York, 1–79, <ext-link xlink:href="https://doi.org/10.1016/S0065-2911(09)05501-5" ext-link-type="DOI">10.1016/S0065-2911(09)05501-5</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx83"><label>Solheim et al.(2014)</label><?label SLM+14?><mixed-citation>Solheim, M., La Rosa, S. L., Mathisen, T., Snipen, L. G., Nes, I. F., and
Brede, D. A.: Transcriptomic and functional analysis of NaCl-induced stress
in <italic>Enterococcus faecalis</italic>, PLOS One, 9, e94571,
<ext-link xlink:href="https://doi.org/10.1371/journal.pone.0094571" ext-link-type="DOI">10.1371/journal.pone.0094571</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx84"><label>Sterner and Liebl(2001)</label><?label SL01?><mixed-citation>Sterner, R. and Liebl, W.: Thermophilic adaptation of proteins, Crit.
Rev. Biochem. Mol., 36, 39–106,
<ext-link xlink:href="https://doi.org/10.1080/20014091074174" ext-link-type="DOI">10.1080/20014091074174</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bibx85"><label>Swingley et al.(2012)</label><?label SMS+12?><mixed-citation>Swingley, W. D., Meyer-Dombard, D. R., Shock, E. L., Alsop, E. B., Falenski,
H. D., Havig, J. R., and Raymond, J.: Coordinating environmental genomics
and geochemistry reveals metabolic transitions in a hot spring ecosystem,
PLOS One, 7, e38108, <ext-link xlink:href="https://doi.org/10.1371/journal.pone.0038108" ext-link-type="DOI">10.1371/journal.pone.0038108</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx86"><label>The UniProt Consortium(2019)</label><?label Uni19?><mixed-citation>The UniProt Consortium: UniProt: A worldwide hub of protein knowledge,
Nucl. Acid. Res., 47, D506–D515, <ext-link xlink:href="https://doi.org/10.1093/nar/gky1049" ext-link-type="DOI">10.1093/nar/gky1049</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx87"><label>Turner et al.(2017)</label><?label TWM+17?><mixed-citation>Turner, C. B., Wade, B. D., Meyer, J. R., Sommerfeld, B. A., and Lenski, R. E.:
Evolution of organismal stoichiometry in a long-term experiment with
<italic>Escherichia coli</italic>, Roy. Soc. Open Sci., 4, 170497,
<ext-link xlink:href="https://doi.org/10.1098/rsos.170497" ext-link-type="DOI">10.1098/rsos.170497</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx88"><label>Vavourakis et al.(2016)</label><?label VGR+16?><mixed-citation>Vavourakis, C. D., Ghai, R., Rodriguez-Valera, F., Sorokin, D. Y., Tringe,
S. G., Hugenholtz, P., and Muyzer, G.: Metagenomic insights into the
uncultured diversity and physiology of microbes in four hypersaline soda lake
brines, Front. Microbiol., 7, 211, <ext-link xlink:href="https://doi.org/10.3389/fmicb.2016.00211" ext-link-type="DOI">10.3389/fmicb.2016.00211</ext-link>,
2016.</mixed-citation></ref>
      <ref id="bib1.bibx89"><label>Wagner(2005)</label><?label Wag05?><mixed-citation>Wagner, A.: Energy constraints on the evolution of gene expression, Mol.
Biol. Evol., 22, 1365–1374, <ext-link xlink:href="https://doi.org/10.1093/molbev/msi126" ext-link-type="DOI">10.1093/molbev/msi126</ext-link>, 2005.</mixed-citation></ref>
      <?pagebreak page6162?><ref id="bib1.bibx90"><label>Walsh et al.(2018)</label><?label WTT18?><mixed-citation>Walsh, C. T., Tu, B. P., and Tang, Y.: Eight kinetically stable but
thermodynamically activated molecules that power cell metabolism, Chem.
Rev., 118, 1460–1494, <ext-link xlink:href="https://doi.org/10.1021/acs.chemrev.7b00510" ext-link-type="DOI">10.1021/acs.chemrev.7b00510</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx91"><label>Wang et al.(2003)</label><?label WBXG03?><mixed-citation>Wang, Y., Bryan, C., Xu, H., and Gao, H.: Nanogeochemistry: Geochemical
reactions and mass transfers in nanopores, Geology, 31, 387–390,
<ext-link xlink:href="https://doi.org/10.1130/0091-7613(2003)031&lt;0387:NGRAMT&gt;2.0.CO;2" ext-link-type="DOI">10.1130/0091-7613(2003)031&lt;0387:NGRAMT&gt;2.0.CO;2</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bibx92"><label>Warn and Peters(1996)</label><?label WP96?><mixed-citation>
Warn, J. R. W. and Peters, A. P. H.: Concise Chemical Thermodynamics, CRC
Press, 2nd Edn.,  1996.</mixed-citation></ref>
      <ref id="bib1.bibx93"><label>Withman et al.(2013)</label><?label WGB+13?><mixed-citation>Withman, B., Gunasekera, T. S., Beesetty, P., Agans, R., and Paliy, O.:
Transcriptional responses of uropathogenic <italic>Escherichia coli</italic> to
increased environmental osmolality caused by salt or urea, Infect.
Immun., 81, 80–89, <ext-link xlink:href="https://doi.org/10.1128/IAI.01049-12" ext-link-type="DOI">10.1128/IAI.01049-12</ext-link>, 2013.
</mixed-citation></ref><?xmltex \hack{\newpage}?>
      <ref id="bib1.bibx94"><label>Youens-Clark et al.(2019)</label><?label YBP+19?><mixed-citation>Youens-Clark, K., Bomhoff, M., Ponsero, A. J., Wood-Charlson, E. M., Lynch, J.,
Choi, I., Hartman, J. H., and Hurwitz, B. L.: iMicrobe: Tools and
data-driven discovery platform for the microbiome sciences, GigaScience, 8,
giz083, <ext-link xlink:href="https://doi.org/10.1093/gigascience/giz083" ext-link-type="DOI">10.1093/gigascience/giz083</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx95"><label>Zeldovich et al.(2007)</label><?label ZBS07?><mixed-citation>Zeldovich, K. B., Berezovsky, I. N., and Shakhnovich, E. I.: Protein and
DNA sequence determinants of thermophilic adaptation, PLOS Comput.
Biol., 3, e5, <ext-link xlink:href="https://doi.org/10.1371/journal.pcbi.0030005" ext-link-type="DOI">10.1371/journal.pcbi.0030005</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx96"><label>Zhang et al.(2016)</label><?label ZLZ+16?><mixed-citation>Zhang, Y., Li, Y., Zhang, Y., Wang, Z., Zhao, M., Su, N., Zhang, T., Chen, L.,
Wei, W., Luo, J., Zhou, Y., Xu, Y., Xu, P., Li, W., and Tao, Y.:
Quantitative proteomics reveals membrane protein-mediated hypersaline
sensitivity and adaptation in halophilic <italic>Nocardiopsis xinjiangensis</italic>,
J. Proteome Res., 15, 68–85,
<ext-link xlink:href="https://doi.org/10.1021/acs.jproteome.5b00526" ext-link-type="DOI">10.1021/acs.jproteome.5b00526</ext-link>, 2016.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Uncovering chemical signatures of salinity gradients through compositional analysis of protein sequences</article-title-html>
<abstract-html><p>Prediction of the direction of change of a system under specified
environmental conditions is one reason for the widespread utility
of thermodynamic models in geochemistry. However, thermodynamic influences
on the chemical compositions of proteins in nature have remained enigmatic
despite much work that demonstrates the impact of environmental conditions
on amino acid frequencies. Here, we present evidence that the dehydrating
effect of salinity is detectable as chemical differences in protein
sequences inferred from (1) metagenomes and metatranscriptomes in regional
salinity gradients and (2) differential gene and protein expression
in microbial cells under hyperosmotic stress. The stoichiometric hydration
state (<i>n</i><sub>H<sub>2</sub>O</sub>), derived from
the number of water molecules in theoretical reactions to form proteins
from a particular set of basis species (glutamine, glutamic acid,
cysteine, O<sub>2</sub>, H<sub>2</sub>O), decreases along
salinity gradients, including the Baltic Sea and Amazon River and ocean
plume, and decreases in particle-associated compared to free-living fractions.
However, the proposed metric does not respond as expected for hypersaline
environments. Analysis of data compiled for hyperosmotic stress experiments
under controlled laboratory conditions shows that differentially expressed
proteins are on average shifted toward lower <i>n</i><sub>H<sub>2</sub>O</sub>.
Notably, the dehydration effect is stronger for most organic solutes
compared to NaCl. This new method of compositional analysis can be
used to identify possible thermodynamic effects in the distribution
of proteins along chemical gradients at a range of scales from microbial
mats to oceans.</p></abstract-html>
<ref-html id="bib1.bib1"><label>Acquisti et al.(2007)</label><mixed-citation>
Acquisti, C., Kleffe, J., and Collins, S.: Oxygen content of transmembrane
proteins over macroevolutionary time scales, Nature, 445, 47–52,
<a href="https://doi.org/10.1038/nature05450" target="_blank">https://doi.org/10.1038/nature05450</a>, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Akashi and Gojobori(2002)</label><mixed-citation>
Akashi, H. and Gojobori, T.: Metabolic efficiency and amino acid composition
in the proteomes of <i>Escherichia coli</i> and <i>Bacillus subtilis</i>,
P. Natl. Acad. Sci., 99, 3695–3700,
<a href="https://doi.org/10.1073/pnas.062526999" target="_blank">https://doi.org/10.1073/pnas.062526999</a>, 2002.
</mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Alsop et al.(2014)</label><mixed-citation>
Alsop, E. B., Boyd, E. S., and Raymond, J.: Merging metagenomics and
geochemistry reveals environmental controls on biological diversity and
evolution, BMC Ecol., 14, 16, <a href="https://doi.org/10.1186/1472-6785-14-16" target="_blank">https://doi.org/10.1186/1472-6785-14-16</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Amend and LaRowe(2019)</label><mixed-citation>
Amend, J. P. and LaRowe, D. E.: Mini-review: Demystifying microbial
reaction energetics, Environ. Microbiol., 21, 3539–3547,
<a href="https://doi.org/10.1111/1462-2920.14778" target="_blank">https://doi.org/10.1111/1462-2920.14778</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Amend and Shock(1998)</label><mixed-citation>
Amend, J. P. and Shock, E. L.: Energetics of amino acid synthesis in
hydrothermal ecosystems, Science, 281, 1659–1662,
<a href="https://doi.org/10.1126/science.281.5383.1659" target="_blank">https://doi.org/10.1126/science.281.5383.1659</a>, 1998.
</mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Amend et al.(2013)</label><mixed-citation>
Amend, J. P., LaRowe, D. E., McCollom, T. M., and Shock, E. L.: The
energetics of organic synthesis inside and outside the cell, Philos.
T. R. Soc. B, 368, 20120255,
<a href="https://doi.org/10.1098/rstb.2012.0255" target="_blank">https://doi.org/10.1098/rstb.2012.0255</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Anderson(2005)</label><mixed-citation>
Anderson, G. M.: Thermodynamics of Natural Systems, Cambridge University
Press, Cambridge, 2nd Edn., <a href="https://doi.org/10.1017/CBO9780511840258" target="_blank">https://doi.org/10.1017/CBO9780511840258</a>, 2005.
</mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Asplund-Samuelsson et al.(2016)</label><mixed-citation>
Asplund-Samuelsson, J., Sundh, J., Dupont, C. L., Allen, A. E., McCrow, J. P.,
Celepli, N. A., Bergman, B., Ininbergs, K., and Ekman, M.: Diversity and
expression of bacterial metacaspases in an aquatic ecosystem, Front.
Microbiol., 7, 1043, <a href="https://doi.org/10.3389/fmicb.2016.01043" target="_blank">https://doi.org/10.3389/fmicb.2016.01043</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Baudouin-Cornu et al.(2001)</label><mixed-citation>
Baudouin-Cornu, P., Surdin-Kerjan, Y., Marlière, P., and Thomas, D.:
Molecular evolution of protein atomic composition, Science, 293, 297–300,
<a href="https://doi.org/10.1126/science.1061052" target="_blank">https://doi.org/10.1126/science.1061052</a>, 2001.
</mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Bjellqvist et al.(1993)</label><mixed-citation>
Bjellqvist, B., Hughes, G. J., Pasquali, C., Paquet, N., Ravier, F., Sanchez,
J.-C., Frutiger, S., and Hochstrasser, D.: The focusing positions of
polypeptides in immobilized pH gradients can be predicted from their amino
acid sequences, Electrophoresis, 14, 1023–1031,
<a href="https://doi.org/10.1002/elps.11501401163" target="_blank">https://doi.org/10.1002/elps.11501401163</a>, 1993.
</mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Bjellqvist et al.(1994)</label><mixed-citation>
Bjellqvist, B., Basse, B., Olsen, E., and Celis, J. E.: Reference points for
comparisons of two-dimensional maps of proteins from different human cell
types defined in a pH scale where isoelectric points correlate with
polypeptide compositions, Electrophoresis, 15, 529–539,
<a href="https://doi.org/10.1002/elps.1150150171" target="_blank">https://doi.org/10.1002/elps.1150150171</a>, 1994.
</mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Boyd et al.(2014)</label><mixed-citation>
Boyd, E. S., Hamilton, T. L., Swanson, K. D., Howells, A. E., Baxter, B. K.,
Meuser, J. E., Posewitz, M. C., and Peters, J. W.: [FeFe]-hydrogenase
abundance and diversity along a vertical redox gradient in Great Salt
Lake, USA, Int. J. Mol. Sci., 15,
21947–21966, <a href="https://doi.org/10.3390/ijms151221947" target="_blank">https://doi.org/10.3390/ijms151221947</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Boyer et al.(2020)</label><mixed-citation>
Boyer, G. M., Schubotz, F., Summons, R. E., Woods, J., and Shock, E. L.: Carbon
oxidation state in microbial polar lipids suggests adaptation to hot spring
temperature and redox gradients, Front. Microbiol., 11, 229,
<a href="https://doi.org/10.3389/fmicb.2020.00229" target="_blank">https://doi.org/10.3389/fmicb.2020.00229</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Braakman and Smith(2013)</label><mixed-citation>
Braakman, R. and Smith, E.: The compositional and evolutionary logic of
metabolism, Phys. Biol., 10, 011001,
<a href="https://doi.org/10.1088/1478-3975/10/1/011001" target="_blank">https://doi.org/10.1088/1478-3975/10/1/011001</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Burg et al.(2007)</label><mixed-citation>
Burg, M. B., Ferraris, J. D., and Dmitrieva, N. I.: Cellular response to
hyperosmotic stresses, Physiol. Rev., 87, 1441–1474,
<a href="https://doi.org/10.1152/physrev.00056.2006" target="_blank">https://doi.org/10.1152/physrev.00056.2006</a>, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Canovas and Shock(2020)</label><mixed-citation>
Canovas III, P. A. and Shock, E. L.: Energetics of the citric acid cycle in
the deep biosphere, in: Carbon in Earth's Interior, edited by: Manning,
C. E., Lin, J.-F., and Mao, W. L., chap. 25,   American
Geophysical Union, 303–327, <a href="https://doi.org/10.1002/9781119508229.ch25" target="_blank">https://doi.org/10.1002/9781119508229.ch25</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Chirife et al.(1981)</label><mixed-citation>
Chirife, J., Fontan, C. F., and Scorza, O. C.: The intracellular water
activity of bacteria in relation to the water activity of the growth medium,
J. Appl. Bacteriol., 50, 475–479,
<a href="https://doi.org/10.1111/j.1365-2672.1981.tb04250.x" target="_blank">https://doi.org/10.1111/j.1365-2672.1981.tb04250.x</a>, 1981.
</mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>DeBerardinis and Cheng(2010)</label><mixed-citation>
DeBerardinis, R. J. and Cheng, T.: Q's next: The diverse functions of
glutamine in metabolism, cell biology and cancer, Oncogene, 29, 313–324,
<a href="https://doi.org/10.1038/onc.2009.358" target="_blank">https://doi.org/10.1038/onc.2009.358</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Dick(2014)</label><mixed-citation>
Dick, J. M.: Average oxidation state of carbon in proteins, J.
Roy. Soc. Int., 11, 20131095, <a href="https://doi.org/10.1098/rsif.2013.1095" target="_blank">https://doi.org/10.1098/rsif.2013.1095</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Dick(2016)</label><mixed-citation>
Dick, J. M.: Proteomic indicators of oxidation and hydration state in
colorectal cancer, PeerJ, 4, e2238, <a href="https://doi.org/10.7717/peerj.2238" target="_blank">https://doi.org/10.7717/peerj.2238</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Dick(2017)</label><mixed-citation>
Dick, J. M.: Chemical composition and the potential for proteomic
transformation in cancer, hypoxia, and hyperosmotic stress, PeerJ, 5, e3421,
<a href="https://doi.org/10.7717/peerj.3421" target="_blank">https://doi.org/10.7717/peerj.3421</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Dick(2020a)</label><mixed-citation>
Dick, J. M.: Water as a reactant in the differential expression of proteins
in cancer, bioRxiv, <a href="https://doi.org/10.1101/2020.04.09.035022" target="_blank">https://doi.org/10.1101/2020.04.09.035022</a>, 2020a.
</mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Dick(2020b)</label><mixed-citation>
Dick, J. M.: JMDplots 1.2.4, Zenodo, <a href="https://doi.org/10.5281/zenodo.4111016" target="_blank">https://doi.org/10.5281/zenodo.4111016</a>,
2020b.
</mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Dick(2020c)</label><mixed-citation>
Dick, J. M.: canprot 1.1.0, Zenodo, <a href="https://doi.org/10.5281/zenodo.4105653" target="_blank">https://doi.org/10.5281/zenodo.4105653</a>,
2020c.
</mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Dick and Shock(2011)</label><mixed-citation>
Dick, J. M. and Shock, E. L.: Calculation of the relative chemical
stabilities of proteins as a function of temperature and redox chemistry in a
hot spring, PLOS One, 6, e22782, <a href="https://doi.org/10.1371/journal.pone.0022782" target="_blank">https://doi.org/10.1371/journal.pone.0022782</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Dick et al.(2019)</label><mixed-citation>
Dick, J. M., Yu, M., Tan, J., and Lu, A.: Changes in carbon oxidation state
of metagenomes along geochemical redox gradients, Front. Microbiol.,
10, 120, <a href="https://doi.org/10.3389/fmicb.2019.00120" target="_blank">https://doi.org/10.3389/fmicb.2019.00120</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Du et al.(2018)</label><mixed-citation>
Du, B., Zielinski, D. C., Monk, J. M., and Palsson, B. O.: Thermodynamic
favorability and pathway yield as evolutionary tradeoffs in biosynthetic
pathway choice, P. Natl. Acad. Sci., 115,
11339–11344, <a href="https://doi.org/10.1073/pnas.1805367115" target="_blank">https://doi.org/10.1073/pnas.1805367115</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Dupont et al.(2014)</label><mixed-citation>
Dupont, C. L., Larsson, J., Yooseph, S., Ininbergs, K., Goll, J.,
Asplund-Samuelsson, J., McCrow, J. P., Celepli, N., Allen, L. Z., Ekman, M.,
Lucas, A. J., Hagström, Å., Thiagarajan, M., Brindefalk, B., Richter,
A. R., Andersson, A. F., Tenney, A., Lundin, D., Tovchigrechko, A., Nylander,
J. A. A., Brami, D., Badger, J. H., Allen, A. E., Rusch, D. B., Hoffman, J.,
Norrby, E., Friedman, R., Pinhassi, J., Venter, J. C., and Bergman, B.:
Functional tradeoffs underpin salinity-driven divergence in microbial
community composition, PLOS One, 9, e89549, <a href="https://doi.org/10.1371/journal.pone.0089549" target="_blank">https://doi.org/10.1371/journal.pone.0089549</a>,
2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Eiler et al.(2014)</label><mixed-citation>
Eiler, A., Zaremba-Niedzwiedzka, K., Martínez-García, M., McMahon, K. D.,
Stepanauskas, R., Andersson, S. G. E., and Bertilsson, S.: Productivity and
salinity structuring of the microplankton revealed by comparative freshwater
metagenomics, Environ. Microbiol., 16, 2682–2698,
<a href="https://doi.org/10.1111/1462-2920.12301" target="_blank">https://doi.org/10.1111/1462-2920.12301</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Feist et al.(2007)</label><mixed-citation>
Feist, A. M., Henry, C. S., Reed, J. L., Krummenacker, M., Joyce, A. R., Karp,
P. D., Broadbelt, L. J., Hatzimanikatis, V., and Palsson, B. Ø.: A
genome-scale metabolic reconstruction for <i>Escherichia coli</i> K-12
MG1655 that accounts for 1260 ORFs and thermodynamic information,
Mol. Syst. Biol., 3, 121, <a href="https://doi.org/10.1038/msb4100155" target="_blank">https://doi.org/10.1038/msb4100155</a>, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Fernandez et al.(2013)</label><mixed-citation>
Fernandez, A. B., Ghai, R., Martin-Cuadrado, A. B., Sanchez-Porro, C.,
Rodriguez-Valera, F., and Ventosa, A.: Metagenome sequencing of prokaryotic
microbiota from two hypersaline ponds of a marine saltern in Santa Pola,
Spain, Genome Announcements, 1, e00933-13, <a href="https://doi.org/10.1128/genomea.00933-13" target="_blank">https://doi.org/10.1128/genomea.00933-13</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Finn et al.(2015)</label><mixed-citation>
Finn, S., Rogers, L., Händler, K., McClure, P., Amézquita, A., Hinton,
J. C. D., and Fanning, S.: Exposure of <i>Salmonella enterica</i> serovar
Typhimurium to three humectants used in the food industry induces different
osmoadaptation systems, Appl. Environ. Microbiol., 81,
6800–6811, <a href="https://doi.org/10.1128/AEM.01379-15" target="_blank">https://doi.org/10.1128/AEM.01379-15</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>Fortunato et al.(2018)</label><mixed-citation>
Fortunato, C. S., Larson, B., Butterfield, D. A., and Huber, J. A.: Spatially
distinct, temporally stable microbial populations mediate biogeochemical
cycling at and below the seafloor in hydrothermal vent fluids, Environ.
Microbiol., 20, 769–784, <a href="https://doi.org/10.1111/1462-2920.14011" target="_blank">https://doi.org/10.1111/1462-2920.14011</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Garner and Burg(1994)</label><mixed-citation>
Garner, M. M. and Burg, M. B.: Macromolecular crowding and confinement in
cells exposed to hypertonicity, Am. J. Physiol., 266,
C877–C892, <a href="https://doi.org/10.1152/ajpcell.1994.266.4.C877" target="_blank">https://doi.org/10.1152/ajpcell.1994.266.4.C877</a>, 1994.
</mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Gasteiger et al.(2005)</label><mixed-citation>
Gasteiger, E., Hoogland, C., Gattiker, A., Duvaud, S., Wilkins, M. R., Appel,
R. D., and Bairoch, A.: Protein identification and analysis tools on the
ExPASy server, in: The Proteomics Protocols Handbook, edited by: Walker,
J. M., Humana Press, Totowa, NJ,
571–607, <a href="https://doi.org/10.1385/1-59259-890-0:571" target="_blank">https://doi.org/10.1385/1-59259-890-0:571</a>, 2005.
</mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>Ghai et al.(2011)</label><mixed-citation>
Ghai, R., Pašić, L., Fernández, A. B., Martin-Cuadrado, A.-B., Mizuno,
C. M., McMahon, K. D., Papke, R. T., Stepanauskas, R., Rodriguez-Brito, B.,
Rohwer, F., Sánchez-Porro, C., Ventosa, A., and Rodríguez-Valera, F.:
New abundant microbial groups in aquatic hypersaline environments,
Sci. Rep., 1, 135, <a href="https://doi.org/10.1038/srep00135" target="_blank">https://doi.org/10.1038/srep00135</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>Gunde-Cimerman et al.(2018)</label><mixed-citation>
Gunde-Cimerman, N., Plemenitaš, A., and Oren, A.: Strategies of adaptation
of microorganisms of the three domains of life to high salt concentrations,
FEMS Microbiol. Rev., 42, 353–375, <a href="https://doi.org/10.1093/femsre/fuy009" target="_blank">https://doi.org/10.1093/femsre/fuy009</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>Han et al.(2017)</label><mixed-citation>
Han, D., Link, H., and Liesack, W.: Response of <i>Methylocystis</i> sp.
strain SC2 to salt stress: Physiology, global transcriptome, and amino acid
profiles, Appl. Environ. Microbiol., 83, e00866,
<a href="https://doi.org/10.1128/AEM.00866-17" target="_blank">https://doi.org/10.1128/AEM.00866-17</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>Han et al.(2005)</label><mixed-citation>
Han, Y., Zhou, D., Pang, X., Zhang, L., Song, Y., Tong, Z., Bao, J., Dai, E.,
Wang, J., Guo, Z., Zhai, J., Du, Z., Wang, X., Wang, J., Huang, P., and Yang,
R.: Comparative transcriptome analysis of <i>Yersinia pestis</i> in
response to hyperosmotic and high-salinity stress, Res. Microbiol.,
156, 403–415, <a href="https://doi.org/10.1016/j.resmic.2004.10.004" target="_blank">https://doi.org/10.1016/j.resmic.2004.10.004</a>, 2005.
</mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>Havig et al.(2011)</label><mixed-citation>
Havig, J. R., Raymond, J., Meyer-Dombard, D. R., Zolotova, N., and Shock,
E. L.: Merging isotopes and community genomics in a siliceous
sinter-depositing hot spring, J. Geophys. Res., 116, G01005,
<a href="https://doi.org/10.1029/2010JG001415" target="_blank">https://doi.org/10.1029/2010JG001415</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>Huang et al.(2011)</label><mixed-citation>
Huang, H., McGarvey, P. B., Suzek, B. E., Mazumder, R., Zhang, J., Chen, Y.,
and Wu, C. H.: A comprehensive protein-centric ID mapping service for
molecular data integration, Bioinformatics, 27, 1190–1191,
<a href="https://doi.org/10.1093/bioinformatics/btr101" target="_blank">https://doi.org/10.1093/bioinformatics/btr101</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>Jevtić et al.(2019)</label><mixed-citation>
Jevtić, v., Stoll, B., Pfeiffer, F., Sharma, K., Urlaub, H., Marchfelder, A.,
and Lenz, C.: The response of <i>Haloferax volcanii</i> to salt and
temperature stress: A proteome study by label-free mass spectrometry,
Proteomics, 19, 1800491, <a href="https://doi.org/10.1002/pmic.201800491" target="_blank">https://doi.org/10.1002/pmic.201800491</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>Kanesaki et al.(2002)</label><mixed-citation>
Kanesaki, Y., Suzuki, I., Allakhverdiev, S. I., Mikami, K., and Murata, N.:
Salt stress and hyperosmotic stress regulate the expression of different sets
of genes in <i>Synechocystis</i> sp. PCC 6803, Biochem.
Biophys. Res. Commun., 290, 339–348,
<a href="https://doi.org/10.1006/bbrc.2001.6201" target="_blank">https://doi.org/10.1006/bbrc.2001.6201</a>, 2002.
</mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>Karl and Grabowski(2017)</label><mixed-citation>
Karl, D. M. and Grabowski, E.: The importance of H in particulate organic
matter stoichiometry, export and energy flow, Front. Microbiol., 8,
826, <a href="https://doi.org/10.3389/fmicb.2017.00826" target="_blank">https://doi.org/10.3389/fmicb.2017.00826</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>Kauffman(1986)</label><mixed-citation>
Kauffman, J. M.: Simple method for determination of oxidation numbers of
atoms in compounds, J. Chem. Educat., 63, 474–475,
<a href="https://doi.org/10.1021/ed063p474" target="_blank">https://doi.org/10.1021/ed063p474</a>, 1986.
</mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>Keegan et al.(2016)</label><mixed-citation>
Keegan, K. P., Glass, E. M., and Meyer, F.: MG-RAST, a metagenomics service
for analysis of microbial community structure and function, in: Microbial
Environmental Genomics (MEG), edited by: Martin, F. and Uroz, S.,
Springer, New York, 207–233, <a href="https://doi.org/10.1007/978-1-4939-3369-3_13" target="_blank">https://doi.org/10.1007/978-1-4939-3369-3_13</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>Kimbrel et al.(2018)</label><mixed-citation>
Kimbrel, J. A., Ballor, N., Wu, Y.-W., David, M. M., Hazen, T. C., Simmons,
B. A., Singer, S. W., and Jansson, J. K.: Microbial community structure and
functional potential along a hypersaline gradient, Front. Microbiol.,
9, 1492, <a href="https://doi.org/10.3389/fmicb.2018.01492" target="_blank">https://doi.org/10.3389/fmicb.2018.01492</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>Kocharunchitt et al.(2014)</label><mixed-citation>
Kocharunchitt, C., King, T., Gobius, K., Bowman, J. P., and Ross, T.: Global
genome response of <i>Escherichia coli</i> O157:H7 Sakai during dynamic
changes in growth kinetics induced by an abrupt downshift in water activity,
PLOS One, 9, e90422, <a href="https://doi.org/10.1371/journal.pone.0090422" target="_blank">https://doi.org/10.1371/journal.pone.0090422</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>Kohler et al.(2015)</label><mixed-citation>
Kohler, C., Lourenço, R. F., Bernhardt, J., Albrecht, D., Schüler, J.,
Hecker, M., and Gomes, S. L.: A comprehensive genomic, transcriptomic and
proteomic analysis of a hyperosmotic stress sensitive
<i>α</i>-proteobacterium, BMC Microbiol., 15, 71,
<a href="https://doi.org/10.1186/s12866-015-0404-x" target="_blank">https://doi.org/10.1186/s12866-015-0404-x</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>Kopylova et al.(2012)</label><mixed-citation>
Kopylova, E., Noé, L., and Touzet, H.: SortMeRNA: Fast and accurate
filtering of ribosomal RNAs in metatranscriptomic data, Bioinformatics, 28,
3211–3217, <a href="https://doi.org/10.1093/bioinformatics/bts611" target="_blank">https://doi.org/10.1093/bioinformatics/bts611</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>Kunin et al.(2008)</label><mixed-citation>
Kunin, V., Raes, J., Harris, J. K., Spear, J. R., Walker, J. J., Ivanova, N.,
von Mering, C., Bebout, B. M., Pace, N. R., Bork, P., and Hugenholtz, P.:
Millimeter-scale genetic gradients and community-level molecular
convergence in a hypersaline microbial mat, Mol. Syst. Biol., 4,
198, <a href="https://doi.org/10.1038/msb.2008.35" target="_blank">https://doi.org/10.1038/msb.2008.35</a>, 2008.
</mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>Kyte and Doolittle(1982)</label><mixed-citation>
Kyte, J. and Doolittle, R. F.: A simple method for displaying the hydropathic
character of a protein, J. Mol. Biol., 157, 105–132,
<a href="https://doi.org/10.1016/0022-2836(82)90515-0" target="_blank">https://doi.org/10.1016/0022-2836(82)90515-0</a>, 1982.
</mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>LaRowe and Van Cappellen(2011)</label><mixed-citation>
LaRowe, D. E. and Van Cappellen, P.: Degradation of natural organic matter:
A thermodynamic analysis, Geochim. Cosmochim. Ac., 75, 2030–2042,
<a href="https://doi.org/10.1016/j.gca.2011.01.020" target="_blank">https://doi.org/10.1016/j.gca.2011.01.020</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>Leuko et al.(2009)</label><mixed-citation>
Leuko, S., Raftery, M. J., Burns, B. P., Walter, M. R., and Neilan, B. A.:
Global protein-level responses of <i>Halobacterium salinarum</i> NRC-1
to prolonged changes in external sodium chloride concentrations, J.
Proteome Res., 8, 2218–2225, <a href="https://doi.org/10.1021/pr800663c" target="_blank">https://doi.org/10.1021/pr800663c</a>, 2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>Ley et al.(2006)</label><mixed-citation>
Ley, R. E., Harris, J. K., Wilcox, J., Spear, J. R., Miller, S. R., Bebout,
B. M., Maresca, J. A., Bryant, D. A., Sogin, M. L., and Pace, N. R.:
Unexpected diversity and complexity of the Guerrero Negro hypersaline
microbial mat, Appl. Environ. Microbiol., 72, 3685–3695,
<a href="https://doi.org/10.1128/AEM.72.5.3685-3695.2006" target="_blank">https://doi.org/10.1128/AEM.72.5.3685-3695.2006</a>, 2006.
</mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>Lin et al.(2017)</label><mixed-citation>
Lin, J., Liang, H., Yan, J., and Luo, L.: The molecular mechanism and
post-transcriptional regulation characteristic of <i>Tetragenococcus
halophilus</i> acclimation to osmotic stress revealed by quantitative
proteomics, J. Proteomics, 168, 1–14,
<a href="https://doi.org/10.1016/j.jprot.2017.08.014" target="_blank">https://doi.org/10.1016/j.jprot.2017.08.014</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>Lindsay et al.(2018)</label><mixed-citation>
Lindsay, M. R., Amenabar, M. J., Fecteau, K. M., Debes II, R. V.,
Fernandes Martins, M. C., Fristad, K. E., Xu, H., Hoehler, T. M., Shock,
E. L., and Boyd, E. S.: Subsurface processes influence oxidant availability
and chemoautotrophic hydrogen metabolism in Yellowstone hot springs,
Geobiology, 16, 674–692, <a href="https://doi.org/10.1111/gbi.12308" target="_blank">https://doi.org/10.1111/gbi.12308</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib58"><label>May and Rowland(2018)</label><mixed-citation>
May, P. M. and Rowland, D.: JESS, a Joint Expert Speciation System –
VI: thermodynamically-consistent standard Gibbs energies of reaction for
aqueous solutions, New J. Chem., 42, 7617–7629,
<a href="https://doi.org/10.1039/C7NJ03597G" target="_blank">https://doi.org/10.1039/C7NJ03597G</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib59"><label>Milo et al.(2010)</label><mixed-citation>
Milo, R., Jorgensen, P., Moran, U., Weber, G., and Springer, M.:
BioNumbers—the database of key numbers in molecular and cell biology,
Nucl. Acid. Res., 38, D750–D753, <a href="https://doi.org/10.1093/nar/gkp889" target="_blank">https://doi.org/10.1093/nar/gkp889</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib60"><label>Minkiewicz et al.(2018)</label><mixed-citation>
Minkiewicz, P., Darewicz, M., and Iwaniak, A.: Introducing a simple equation to
express oxidation states as an alternative to using rules associated with
words alone, J. Chem. Educat., 95, 340–342,
<a href="https://doi.org/10.1021/acs.jchemed.7b00322" target="_blank">https://doi.org/10.1021/acs.jchemed.7b00322</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib61"><label>Morowitz(1999)</label><mixed-citation>
Morowitz, H. J.: A theory of biochemical organization, metabolic pathways,
and evolution, Complexity, 4, 39–53,
<a href="https://doi.org/10.1002/(SICI)1099-0526(199907/08)4:6&lt;39::AID-CPLX8&gt;3.0.CO;2-2" target="_blank">https://doi.org/10.1002/(SICI)1099-0526(199907/08)4:6&lt;39::AID-CPLX8&gt;3.0.CO;2-2</a>, 1999.
</mixed-citation></ref-html>
<ref-html id="bib1.bib62"><label>Möller et al.(2016)</label><mixed-citation>
Möller, M. N., Li, Q., Chinnaraj, M., Cheung, H. C., Lancaster, J. R., and
Denicola, A.: Solubility and diffusion of oxygen in phospholipid membranes,
Biochim. Biophys. Ac., 1858, 2923–2930,
<a href="https://doi.org/10.1016/j.bbamem.2016.09.003" target="_blank">https://doi.org/10.1016/j.bbamem.2016.09.003</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib63"><label>O'Leary et al.(2016)</label><mixed-citation>
O'Leary, N. A., Wright, M. W., Brister, J. R., Ciufo, S., Haddad, D., McVeigh,
R., Rajput, B., Robbertse, B., Smith-White, B., Ako-Adjei, D., Astashyn, A.,
Badretdin, A., Bao, Y., Blinkova, O., Brover, V., Chetvernin, V., Choi, J.,
Cox, E., Ermolaeva, O., Farrell, C. M., Goldfarb, T., Gupta, T., Haft, D.,
Hatcher, E., Hlavina, W., Joardar, V. S., Kodali, V. K., Li, W., Maglott, D.,
Masterson, P., McGarvey, K. M., Murphy, M. R., O'Neill, K., Pujar, S.,
Rangwala, S. H., Rausch, D., Riddick, L. D., Schoch, C., Shkeda, A., Storz,
S. S., Sun, H., Thibaud-Nissen, F., Tolstoy, I., Tully, R. E., Vatsan, A. R.,
Wallin, C., Webb, D., Wu, W., Landrum, M. J., Kimchi, A., Tatusova, T.,
DiCuccio, M., Kitts, P., Murphy, T. D., and Pruitt, K. D.: Reference
sequence (RefSeq) database at NCBI: current status, taxonomic
expansion, and functional annotation, Nucl. Acid. Res., 44, D733–D745,
<a href="https://doi.org/10.1093/nar/gkv1189" target="_blank">https://doi.org/10.1093/nar/gkv1189</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib64"><label>Ooka et al.(2019)</label><mixed-citation>
Ooka, H., McGlynn, S. E., and Nakamura, R.: Electrochemistry at deep-sea
hydrothermal vents: Utilization of the thermodynamic driving force towards
the autotrophic origin of life, ChemElectroChem, 6, 1316–1323,
<a href="https://doi.org/10.1002/celc.201801432" target="_blank">https://doi.org/10.1002/celc.201801432</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib65"><label>Oren(2013)</label><mixed-citation>
Oren, A.: Life at high salt concentrations, intracellular KCl
concentrations, and acidic proteomes, Front. Microbiol., 4, 315,
<a href="https://doi.org/10.3389/fmicb.2013.00315" target="_blank">https://doi.org/10.3389/fmicb.2013.00315</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib66"><label>Paul et al.(2008)</label><mixed-citation>
Paul, S., Bag, S. K., Das, S., Harvill, E. T., and Dutta, C.: Molecular
signature of hypersaline adaptation: insights from genome and proteome
composition of halophilic prokaryotes, Genome Biol., 9, R70,
<a href="https://doi.org/10.1186/gb-2008-9-4-r70" target="_blank">https://doi.org/10.1186/gb-2008-9-4-r70</a>, 2008.
</mixed-citation></ref-html>
<ref-html id="bib1.bib67"><label>Poudel et al.(2018)</label><mixed-citation>
Poudel, S., Colman, D. R., Fixen, K. R., Ledbetter, R. N., Zheng, Y., Pence,
N., Seefeldt, L. C., Peters, J. W., Harwood, C. S., and Boyd, E. S.:
Electron transfer to nitrogenase in different genomic and metabolic
backgrounds, J. Bacteriol., 200, e00757,
<a href="https://doi.org/10.1128/JB.00757-17" target="_blank">https://doi.org/10.1128/JB.00757-17</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib68"><label>Qiao et al.(2013)</label><mixed-citation>
Qiao, J., Huang, S., Te, R., Wang, J., Chen, L., and Zhang, W.: Integrated
proteomic and transcriptomic analysis reveals novel genes and regulatory
mechanisms involved in salt stress responses in <i>Synechocystis</i> sp.
PCC 6803, Appl. Microbiol. Biotechnol., 97, 8253–8264,
<a href="https://doi.org/10.1007/s00253-013-5139-8" target="_blank">https://doi.org/10.1007/s00253-013-5139-8</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib69"><label>R Core Team(2020)</label><mixed-citation>
R Core Team: R: A Language and Environment for Statistical
Computing, R Foundation for Statistical Computing, Vienna, Austria,
<a href="https://www.R-project.org" target="_blank"/>, last access: 21 October 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib70"><label>Record et al.(1998)</label><mixed-citation>
Record, Jr., M. T., Courtenay, E. S., Cayley, D. S., and Guttman, H. J.:
Responses of <i>E. coli</i> to osmotic stress: Large changes in amounts
of cytoplasmic solutes and water, Trend. Biochem. Sci., 23,
143–148, <a href="https://doi.org/10.1016/S0968-0004(98)01196-7" target="_blank">https://doi.org/10.1016/S0968-0004(98)01196-7</a>, 1998.
</mixed-citation></ref-html>
<ref-html id="bib1.bib71"><label>Reeves et al.(2014)</label><mixed-citation>
Reeves, E. P., McDermott, J. M., and Seewald, J. S.: The origin of
methanethiol in midocean ridge hydrothermal fluids, P.
Natl. Acad. Sci., 111, 5474–5479, <a href="https://doi.org/10.1073/pnas.1400643111" target="_blank">https://doi.org/10.1073/pnas.1400643111</a>,
2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib72"><label>Reveillaud et al.(2016)</label><mixed-citation>
Reveillaud, J., Reddington, E., McDermott, J., Algar, C., Meyer, J. L., Sylva,
S., Seewald, J., German, C. R., and Huber, J. A.: Subseafloor microbial
communities in hydrogen-rich vent fluids from hydrothermal systems along the
Mid-Cayman Rise, Environ. Microbiol., 18, 1970–1987,
<a href="https://doi.org/10.1111/1462-2920.13173" target="_blank">https://doi.org/10.1111/1462-2920.13173</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib73"><label>Rho et al.(2010)</label><mixed-citation>
Rho, M., Tang, H., and Ye, Y.: FragGeneScan: Predicting genes in short
and error-prone reads, Nucl. Acid. Res., 38, e191,
<a href="https://doi.org/10.1093/nar/gkq747" target="_blank">https://doi.org/10.1093/nar/gkq747</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib74"><label>Rhodes et al.(2010)</label><mixed-citation>
Rhodes, M. E., Fitz-Gibbon, S. T., Oren, A., and House, C. H.: Amino acid
signatures of salinity on an environmental scale with a focus on the Dead
Sea, Environ. Microbiol., 12, 2613–2623,
<a href="https://doi.org/10.1111/j.1462-2920.2010.02232.x" target="_blank">https://doi.org/10.1111/j.1462-2920.2010.02232.x</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib75"><label>Rodriguez-Brito et al.(2010)</label><mixed-citation>
Rodriguez-Brito, B., Li, L., Wegley, L., Furlan, M., Angly, F., Breitbart, M.,
Buchanan, J., Desnues, C., Dinsdale, E., Edwards, R., Felts, B., Haynes, M.,
Liu, H., Lipson, D., Mahaffy, J., Martin-Cuadrado, A. B., Mira, A., Nulton,
J., Pašić, L., Rayhawk, S., Rodriguez-Mueller, J., Rodriguez-Valera, F.,
Salamon, P., Srinagesh, S., Thingstad, T. F., Tran, T., Thurber, R. V.,
Willner, D., Youle, M., and Rohwer, F.: Viral and microbial community
dynamics in four aquatic environments, ISME J., 4, 739–751,
<a href="https://doi.org/10.1038/ismej.2010.1" target="_blank">https://doi.org/10.1038/ismej.2010.1</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib76"><label>Satinsky et al.(2014)</label><mixed-citation>
Satinsky, B. M., Zielinski, B. L., Doherty, M., Smith, C. B., Sharma, S., Paul,
J. H., Crump, B. C., and Moran, M. A.: The Amazon continuum dataset:
quantitative metagenomic and metatranscriptomic inventories of the Amazon
River plume, June 2010, Microbiome, 2, 17, <a href="https://doi.org/10.1186/2049-2618-2-17" target="_blank">https://doi.org/10.1186/2049-2618-2-17</a>,
2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib77"><label>Satinsky et al.(2015)</label><mixed-citation>
Satinsky, B. M., Fortunato, C. S., Doherty, M., Smith, C. B., Sharma, S., Ward,
N. D., Krusche, A. V., Yager, P. L., Richey, J. E., Moran, M. A., and Crump,
B. C.: Metagenomic and metatranscriptomic inventories of the lower Amazon
River, May 2011, Microbiome, 3, 39, <a href="https://doi.org/10.1186/s40168-015-0099-0" target="_blank">https://doi.org/10.1186/s40168-015-0099-0</a>,
2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib78"><label>Schmidt et al.(2016)</label><mixed-citation>
Schmidt, A., Kochanowski, K., Vedelaar, S., Ahrné, E., Volkmer, B., Callipo,
L., Knoops, K., Bauer, M., Aebersold, R., and Heinemann, M.: The quantitative
and condition-dependent <i>Escherichia coli</i> proteome, Nat.
Biotechnol., 34, 104–110, <a href="https://doi.org/10.1038/nbt.3418" target="_blank">https://doi.org/10.1038/nbt.3418</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib79"><label>Shabala et al.(2009)</label><mixed-citation>
Shabala, L., Bowman, J., Brown, J., Ross, T., McMeekin, T., and Shabala, S.:
Ion transport and osmotic adjustment in <i>Escherichia coli</i> in response
to ionic and non-ionic osmotica, Environ. Microbiol., 11, 137–148,
<a href="https://doi.org/10.1111/j.1462-2920.2008.01748.x" target="_blank">https://doi.org/10.1111/j.1462-2920.2008.01748.x</a>, 2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib80"><label>Shock et al.(2010)</label><mixed-citation>
Shock, E. L., Holland, M., Meyer-Dombard, D. R., Amend, J. P., Osburn, G. R.,
and Fischer, T. P.: Quantifying inorganic sources of geochemical energy in
hydrothermal ecosystems, Yellowstone National Park, USA, Geochim.
Cosmochim. Ac., 74, 4005–4043, <a href="https://doi.org/10.1016/j.gca.2009.08.036" target="_blank">https://doi.org/10.1016/j.gca.2009.08.036</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib81"><label>Simon et al.(2014)</label><mixed-citation>
Simon, H. M., Smith, M. W., and Herfort, L.: Metagenomic insights into
particles and their associated microbiota in a coastal margin ecosystem,
Front. Microbiol., 5, 466, <a href="https://doi.org/10.3389/fmicb.2014.00466" target="_blank">https://doi.org/10.3389/fmicb.2014.00466</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib82"><label>Slonczewski et al.(2009)</label><mixed-citation>
Slonczewski, J. L., Fujisawa, M., Dopson, M., and Krulwich, T. A.:
Cytoplasmic pH measurement and homeostasis in bacteria and archaea, in:
Advances in Microbial Physiology, edited by: Poole, R. K., Vol. 55,
Academic Press, New York, 1–79, <a href="https://doi.org/10.1016/S0065-2911(09)05501-5" target="_blank">https://doi.org/10.1016/S0065-2911(09)05501-5</a>, 2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib83"><label>Solheim et al.(2014)</label><mixed-citation>
Solheim, M., La Rosa, S. L., Mathisen, T., Snipen, L. G., Nes, I. F., and
Brede, D. A.: Transcriptomic and functional analysis of NaCl-induced stress
in <i>Enterococcus faecalis</i>, PLOS One, 9, e94571,
<a href="https://doi.org/10.1371/journal.pone.0094571" target="_blank">https://doi.org/10.1371/journal.pone.0094571</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib84"><label>Sterner and Liebl(2001)</label><mixed-citation>
Sterner, R. and Liebl, W.: Thermophilic adaptation of proteins, Crit.
Rev. Biochem. Mol., 36, 39–106,
<a href="https://doi.org/10.1080/20014091074174" target="_blank">https://doi.org/10.1080/20014091074174</a>, 2001.
</mixed-citation></ref-html>
<ref-html id="bib1.bib85"><label>Swingley et al.(2012)</label><mixed-citation>
Swingley, W. D., Meyer-Dombard, D. R., Shock, E. L., Alsop, E. B., Falenski,
H. D., Havig, J. R., and Raymond, J.: Coordinating environmental genomics
and geochemistry reveals metabolic transitions in a hot spring ecosystem,
PLOS One, 7, e38108, <a href="https://doi.org/10.1371/journal.pone.0038108" target="_blank">https://doi.org/10.1371/journal.pone.0038108</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib86"><label>The UniProt Consortium(2019)</label><mixed-citation>
The UniProt Consortium: UniProt: A worldwide hub of protein knowledge,
Nucl. Acid. Res., 47, D506–D515, <a href="https://doi.org/10.1093/nar/gky1049" target="_blank">https://doi.org/10.1093/nar/gky1049</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib87"><label>Turner et al.(2017)</label><mixed-citation>
Turner, C. B., Wade, B. D., Meyer, J. R., Sommerfeld, B. A., and Lenski, R. E.:
Evolution of organismal stoichiometry in a long-term experiment with
<i>Escherichia coli</i>, Roy. Soc. Open Sci., 4, 170497,
<a href="https://doi.org/10.1098/rsos.170497" target="_blank">https://doi.org/10.1098/rsos.170497</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib88"><label>Vavourakis et al.(2016)</label><mixed-citation>
Vavourakis, C. D., Ghai, R., Rodriguez-Valera, F., Sorokin, D. Y., Tringe,
S. G., Hugenholtz, P., and Muyzer, G.: Metagenomic insights into the
uncultured diversity and physiology of microbes in four hypersaline soda lake
brines, Front. Microbiol., 7, 211, <a href="https://doi.org/10.3389/fmicb.2016.00211" target="_blank">https://doi.org/10.3389/fmicb.2016.00211</a>,
2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib89"><label>Wagner(2005)</label><mixed-citation>
Wagner, A.: Energy constraints on the evolution of gene expression, Mol.
Biol. Evol., 22, 1365–1374, <a href="https://doi.org/10.1093/molbev/msi126" target="_blank">https://doi.org/10.1093/molbev/msi126</a>, 2005.
</mixed-citation></ref-html>
<ref-html id="bib1.bib90"><label>Walsh et al.(2018)</label><mixed-citation>
Walsh, C. T., Tu, B. P., and Tang, Y.: Eight kinetically stable but
thermodynamically activated molecules that power cell metabolism, Chem.
Rev., 118, 1460–1494, <a href="https://doi.org/10.1021/acs.chemrev.7b00510" target="_blank">https://doi.org/10.1021/acs.chemrev.7b00510</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib91"><label>Wang et al.(2003)</label><mixed-citation>
Wang, Y., Bryan, C., Xu, H., and Gao, H.: Nanogeochemistry: Geochemical
reactions and mass transfers in nanopores, Geology, 31, 387–390,
<a href="https://doi.org/10.1130/0091-7613(2003)031&lt;0387:NGRAMT&gt;2.0.CO;2" target="_blank">https://doi.org/10.1130/0091-7613(2003)031&lt;0387:NGRAMT&gt;2.0.CO;2</a>, 2003.
</mixed-citation></ref-html>
<ref-html id="bib1.bib92"><label>Warn and Peters(1996)</label><mixed-citation>
Warn, J. R. W. and Peters, A. P. H.: Concise Chemical Thermodynamics, CRC
Press, 2nd Edn.,  1996.
</mixed-citation></ref-html>
<ref-html id="bib1.bib93"><label>Withman et al.(2013)</label><mixed-citation>
Withman, B., Gunasekera, T. S., Beesetty, P., Agans, R., and Paliy, O.:
Transcriptional responses of uropathogenic <i>Escherichia coli</i> to
increased environmental osmolality caused by salt or urea, Infect.
Immun., 81, 80–89, <a href="https://doi.org/10.1128/IAI.01049-12" target="_blank">https://doi.org/10.1128/IAI.01049-12</a>, 2013.

</mixed-citation></ref-html>
<ref-html id="bib1.bib94"><label>Youens-Clark et al.(2019)</label><mixed-citation>
Youens-Clark, K., Bomhoff, M., Ponsero, A. J., Wood-Charlson, E. M., Lynch, J.,
Choi, I., Hartman, J. H., and Hurwitz, B. L.: iMicrobe: Tools and
data-driven discovery platform for the microbiome sciences, GigaScience, 8,
giz083, <a href="https://doi.org/10.1093/gigascience/giz083" target="_blank">https://doi.org/10.1093/gigascience/giz083</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib95"><label>Zeldovich et al.(2007)</label><mixed-citation>
Zeldovich, K. B., Berezovsky, I. N., and Shakhnovich, E. I.: Protein and
DNA sequence determinants of thermophilic adaptation, PLOS Comput.
Biol., 3, e5, <a href="https://doi.org/10.1371/journal.pcbi.0030005" target="_blank">https://doi.org/10.1371/journal.pcbi.0030005</a>, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib96"><label>Zhang et al.(2016)</label><mixed-citation>
Zhang, Y., Li, Y., Zhang, Y., Wang, Z., Zhao, M., Su, N., Zhang, T., Chen, L.,
Wei, W., Luo, J., Zhou, Y., Xu, Y., Xu, P., Li, W., and Tao, Y.:
Quantitative proteomics reveals membrane protein-mediated hypersaline
sensitivity and adaptation in halophilic <i>Nocardiopsis xinjiangensis</i>,
J. Proteome Res., 15, 68–85,
<a href="https://doi.org/10.1021/acs.jproteome.5b00526" target="_blank">https://doi.org/10.1021/acs.jproteome.5b00526</a>, 2016.
</mixed-citation></ref-html>--></article>
