<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpublishing3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" specific-use="SMUR" dtd-version="3.0" xml:lang="en">
<front>
<journal-meta>
<journal-id journal-id-type="publisher">BGD</journal-id>
<journal-title-group>
<journal-title>Biogeosciences Discussions</journal-title>
<abbrev-journal-title abbrev-type="publisher">BGD</abbrev-journal-title>
<abbrev-journal-title abbrev-type="nlm-ta">Biogeosciences Discuss.</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub">1810-6285</issn>
<publisher><publisher-name></publisher-name>
<publisher-loc>Göttingen, Germany</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.5194/bg-2023-106</article-id>
<title-group>
<article-title>Predicting dominant terrestrial biomes at a global scale using machine learning algorithms, climate variable indices, and extreme climate indices</article-title>
</title-group>
<contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Sato</surname>
<given-names>Hisashi</given-names>
<ext-link>https://orcid.org/0000-0002-6510-4914</ext-link>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
</contrib-group><aff id="aff1">
<label>1</label>
<addr-line>Research Institute for Global Change, Japan Agency for Marine-Earth Science and Technology (JAMSTEC), Yokohama, 236-0001, Japan</addr-line>
</aff>
<aff id="aff2">
<label>2</label>
<addr-line>Department of Ecosystem Studies, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, 113-8657, Japan</addr-line>
</aff>
<pub-date pub-type="epub">
<day>25</day>
<month>01</month>
<year>2024</year>
</pub-date>
<volume>2024</volume>
<fpage>1</fpage>
<lpage>25</lpage>
<permissions>
<copyright-statement>Copyright: &#x000a9; 2024 Hisashi Sato</copyright-statement>
<copyright-year>2024</copyright-year>
<license license-type="open-access">
<license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri"  xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p>
</license>
</permissions>
<self-uri xlink:href="https://bg.copernicus.org/preprints/bg-2023-106/">This article is available from https://bg.copernicus.org/preprints/bg-2023-106/</self-uri>
<self-uri xlink:href="https://bg.copernicus.org/preprints/bg-2023-106/bg-2023-106.pdf">The full text article is available as a PDF file from https://bg.copernicus.org/preprints/bg-2023-106/bg-2023-106.pdf</self-uri>
<abstract>
<p>Several methods have been proposed for modelling global biome distribution. Climate data are typically summarised in terms of a few climate indices. However, with the recent advancement of machine learning algorithms, such summarisation is no longer required. Extreme climate events such as intense droughts and very low temperatures cannot be captured by monthly mean climate data, which may limit the applicability of biome boundaries. In this study, I assessed the influences of machine learning algorithms, climate variable indices, and extreme climate indices on the accuracy and robustness of global biome modelling. I found that the random forest and convolutional neural network algorithms produced highly accurate models for reconstructing the global biome distribution. However, the convolutional neural network algorithm was preferable, because the random forest algorithm substantially overfit the training data relative to the other machine learning algorithms examined. Including indexed climate data slightly reduced model accuracy, whereas including extreme climate data slightly improved it. However, there were significant deviations in the distribution of values between the observed and predicted climate when extreme climate data was included; this fatally reduced the robustness of the models, which were evaluated in terms of prediction consistency. Therefore, I recommend that extreme climate data not be considered in global-scale biome prediction applications.</p>
</abstract>
<counts><page-count count="25"/></counts>
<funding-group>
<award-group id="gs1">
<funding-source>National Institute of Polar Research</funding-source>
<award-id>JPMXD1420318865</award-id>
</award-group>
</funding-group>
</article-meta>
</front>
<body/>
<back>
</back>
</article>