Response of simulated burned area to historical changes in environmental and anthropogenic factors: a comparison of seven fire models

. Understanding how ﬁre regimes change over time is of major importance for understanding their future impact on the Earth system, including society. Large differences in simulated burned area between ﬁre models show that there is substantial uncertainty associated with modelling global change impacts on ﬁre regimes. We draw here on sensitivity modelling. Both improvements in process understanding and observational constraints reduce uncertainties in modelling burned area trends.

Abstract. Understanding how fire regimes change over time is of major importance for understanding their future impact on the Earth system, including society. Large differences in simulated burned area between fire models show that there is substantial uncertainty associated with modelling global change impacts on fire regimes. We draw here on sensitivity simulations made by seven global dynamic vegetation models participating in the Fire Model Intercomparison Project (FireMIP) to understand how differences in models translate into differences in fire regime projections. The sensitivity experiments isolate the impact of the individual drivers on simulated burned area, which are prescribed in the simulations. Specifically these drivers are atmospheric CO 2 concentration, population density, land-use change, lightning and climate.
The seven models capture spatial patterns in burned area. However, they show considerable differences in the burned area trends since 1921. We analyse the trajectories of differences between the sensitivity and reference simulation to improve our understanding of what drives the global trends in burned area. Where it is possible, we link the inter-model differences to model assumptions.
Overall, these analyses reveal that the largest uncertainties in simulating global historical burned area are related to the representation of anthropogenic ignitions and suppression and effects of land use on vegetation and fire. In line with previous studies this highlights the need to improve our understanding and model representation of the relationship between human activities and fire to improve our abilities to model fire within Earth system model applications. Only two models show a strong response to atmospheric CO 2 concentration. The effects of changes in atmospheric CO 2 concentration on fire are complex and quantitative information of how fuel loads and how flammability changes due to this factor is missing. The response to lightning on global scale is low. The response of burned area to climate is spatially heterogeneous and has a strong inter-annual variation. Climate is therefore likely more important than the other factors for short-term variations and extremes in burned area. This study provides a basis to understand the uncertainties in global fire

Introduction
Wildfires are an important driver of vegetation distribution and regulate ecosystem functioning, biodiversity and carbon storage over large parts of the world (Bond et al., 2005;Hantson et al., 2016a). Fire has strong impacts on climate through changing land surface properties, through atmospheric chemistry and hence radiative forcing and through biogeochemical cycling (Bowman et al., 2009;Randerson et al., 2012;Ward et al., 2012;Yue et al., 2016;Lasslop et al., 2019). Estimates of the net effect of fire on the Earth system vary. Analyses based on observations of the pre-industrial period suggest that the contribution of fire to the overall climatecarbon-cycle feedback is substantial with 5.6 ± 3.2 ppm K −1 CO 2 (Harrison et al., 2018) while the strength of the global land climate-carbon-cycle feedback estimated from Earth system simulations (Arora et al., 2013) is 17.5 ppm K −1 (Harrison et al., 2018). However, comparing potential fireinduced losses from terrestrial carbon pools and stocks of solid pyrogenic carbon in soils and ocean, fire may also be a net sink of carbon, and Earth system simulations show a negative effect of fire on radiative forcing (Lasslop et al., 2019). In addition to these consequences for the Earth system, wildfires directly impact society and economy (Gauthier et al., 2015), and human health can be seriously impaired (Johnston et al., 2012;Finlay et al., 2012).
Given the various impacts of fire on natural and human systems, and the large uncertainties, it is important to improve the understanding of what controls the occurrence of wildfires and to know how fire regimes might change in the future.
Based on current process understanding, the following drivers influenced burned area over the last decades to centuries. Increasing atmospheric CO 2 concentration leads to increases in net primary production (Hickler et al., 2008), and decreased stomatal conductance reduces the plant transpiration and enhances water conservation in plants (Morison, 1985). It can lead to an increase in the abundance of woody plants ("woody thickening"; Wigley et al., 2010;Bond and Midgley, 2012;Buitenwerf et al., 2012) because C 3 plants are generally more competitive than C 4 plants under higher atmospheric CO 2 concentration (e.g. Ehleringer and Björkman, 1977;Ehleringer et al., 1997;Wand et al., 2001;Sage and Kubien, 2007). The impact of these various changes on burned area is complex. Increased productivity can lead to increased fuel availability, which can lead to increased burned area in water-and fuel-limited regions (Kelley and Harrison, 2014). On the other hand, decreased stomatal conductance and lower transpiration can lead to enhanced water conservation in plants. This increases the moisture content of soil, as well as vegetation moisture content, and consequently increases live and dead fuel moisture contents, which decreases flammability and reduces burned area. Woody thickening can lead to a reduction in burned area through changing the nature of fuel loads (Kelley and Harrison, 2014).
There is still controversy about whether humans increase or decrease fire overall. Although there is broad agreement that humans suppress fires in regions with high population density, observational studies are less clear about what happens in areas of low population density and show both increases or decreases due to human activities (see for instance Marlon et al., 2008;Bowman et al., 2011;Marlon et al., 2013;Vannière et al., 2016;Andela et al., 2017;Balch et al., 2017). Studies of the covariation between population density and number of fires have shown that increasing population density leads to an increase in the number of ignitions or in the number of individual fires until peaking at intermediate population densities and then subsequently dropping (Syphard et al., 2009;Archibald et al., 2010). Burned area can be expressed as the number of fires multiplied by their fire size. The increase in burned area due to changes in ignitions is expected to differ between regions with varying population density as the largest fires occur in unpopulated areas (Hantson et al., 2015a). Global analyses find that the net effect of population density is a decrease in burned area (Bistinas et al., 2014;Knorr et al., 2014), with high uncertainties for low population density if the method allows for non-monotonic relationships (Knorr et al., 2014). Regional analyses tend to confirm this, but positive relationships between burned area and population density have been shown, for instance, for the least disturbed areas in the USA (Parisien et al., 2016).
Fire was used to manage croplands in pre-industrial times (e.g. Dumond, 1961;Otto and Anderson, 1982;Johnston, 2003) and is still common practice mainly in nonindustrialized areas (i.e. sub-Saharan Africa, parts of Southeast Asia, Indonesia and Latin America; e.g. Conklin, 1961;Rasul and Thapa, 2003). However fires in agricultural areas are common all over the world (Korontzi et al., 2006). Global analyses indicate a decrease in burned area (Bistinas et al., 2014;Andela and van der Werf, 2014) and fire size (Hantson et al., 2015b) with increases in cropland fraction. Fires on pasturelands have been estimated to contribute over 40 % of the global burned area (Rabin et al., 2015). Analyses of global datasets have found an increase in burned area with increases in grazing land cover (Bistinas et al., 2014) but found reduced burned area on intensely grazed areas (Andela et al., 2017). Despite these analyses, the severe data gaps limit our level of understanding on how humans use fire in land management (Erb et al., 2017).
Lightning is the main source of natural ignitions (Scott et al., 2014). It is connected to convective activity and is therefore expected to change with global warming (Krause et al., 2014). Most of the burned area in boreal regions results from a few large fires (Stocks et al., 2002); these large fires are frequently ignited by lightning (Peterson et al., 2010). Veraverbeke et al. (2017) have shown that lightning ignitions drive the inter-annual variability as well as the longterm trends of ignitions in boreal regions.
Climate influences burned area through weather conditions and through its influence on vegetation (Bistinas et al., 2014;Forkel et al., 2017). Weather conditions (precedent precipitation, temperature and wind speed) influence fuel drying, and wind speed additionally affects the rate of fire spread (Harrison et al., 2010;Scott et al., 2014). Vegetation type and fuel load are driven by climate, and both strongly influence fire occurrence (Chuvieco et al., 2008;Pettinari and Chuvieco, 2016). Fires are limited under dry conditions due to low vegetation productivity and therefore insufficient fuel, and are limited under wet conditions because the fuel is too wet to burn. The highest burned areas are therefore found in areas with intermediate moisture conditions . There is no obvious disagreement in literature about how specific climatic factors influence fire. However, the relative importance of each factor, e.g. weather vs. vegetation, is still uncertain and varies spatially (Forkel et al., 2017). Fire models are sensitive to meteorological forcing, and different forcing datasets already lead to large differences in simulated burned area (Rabin et al., 2017a;Lasslop et al., 2014). The importance of factors also varies between small and large scales. Wind speed is an obvious driver of fire spread on the local scale, but it is difficult to extract this influence on the spatial resolution of global models .
Fire-enabled vegetation models simulate fire regimes in response to the combination of individual forcings, including atmospheric CO 2 concentration, population density, land-use change, lightning and climate. Individual fire-enabled vegetation models have been shown to simulate observed global patterns of burned area and fire emissions reasonably well (Kloster et al., 2010;Prentice et al., 2011;Li et al., 2012;Lasslop et al., 2014;Yue et al., 2014), but there are large differences between models in terms of regional patterns, fire seasonality and inter-annual variability, historical trends (Kelley et al., 2013;Andela et al., 2017) and responses to individual factors (Kloster et al., 2010;Knorr et al., 2014Knorr et al., , 2016Kloster, 2017, 2015). The Fire Model Intercomparison Project (FireMIP, Hantson et al., 2016a;Rabin et al., 2017a) provides a systematic framework to consistently analyse and understand the causes of these differences and to relate them to differences in the treatment of key drivers of fire in individual models. FireMIP provides simulations for a systematic comparison of fire model behaviour based on outputs of a large range of models with identical forcing inputs. In addition to a reference historical simulation, sensitivity simulations were conducted for individual forcings, specifically atmospheric CO 2 concentration, population density, land-use change, lightning and climate. A re-cent evaluation of the FireMIP models indicates that the relationship with climatic parameters is captured well by models, the response to human factors is captured by some models and the response to vegetation productivity or the allocation of carbon to fuels needs refinement for most models (Forkel et al., 2019a). Comparisons of the FireMIP historical simulations found differences in transient model behaviour in the 20th century (Andela et al., 2017;van Marle et al., 2017). The causes of the differences and the reasons why different models show different responses are not yet understood.
In this multi-model study we use the historical simulation to show the overall modelled response of burned area to changes in environmental and human factors. We then compare the sensitivity experiments of the five most commonly used driving factors to document how simulated burned area responds to the individual forcing factors and relate intermodel differences of the burned area response to differences in model assumptions or parametrization. We finally suggest implications of our results for model development and application.

Methods
The baseline FireMIP experiment (SF1) is a transient simulation from 1700 to 2013, in which atmospheric CO 2 concentration, population density, land use, lightning and climate change through time according to prescribed datasets. The baseline and sensitivity simulations start from the end of a spin-up simulation with equilibrated carbon pools (see Rabin et al., 2017a, for details of the experimental protocol). The five sensitivity experiments (SF2) are designed to isolate differences in model behaviour associated with individual forcing factors. The model inputs and setup are the same as in SF1, but one of the forcings is kept constant at the value used in the spin-up throughout the experiment (see Table 1). Thus, for example in SF2_CO2, population density, land use, lightning and climate inputs change each year, but atmospheric CO 2 concentration is held constant at 277.33 ppm for the whole of the simulation. The resulting difference in burned area between the simulations is then a combination of the changes in the forcing and the sensitivity of the model to that forcing factor. Not all models performed every sensitivity experiment due to limitations in model structure (see Table 2). Detailed model descriptions can be found in the corresponding literature listed in Table A1. Two of the models (CLASS-CTEM and CLM) started the simulations later than the others (1861 and 1850, respectively), and due to limitations in data availability the reference year of the forcings used in the spin-up varies (see Table 1). We account for these differences in starting years between models and in the forcing factors by limiting our analysis to the period where all factors are different from the ones used in the spin-up (after 1921). These differences still influence the absolute differences, and we therefore quantify the strength of the impact through the slope of a regression line and do not interpret the offset.

Data processing and analysis of simulation results
Our analyses of the SF1 and SF2 simulations focus on the simulation of burned area but are complemented by effects on vegetation carbon pools for the SF2_CO2 simulation. We focus on the time series of global burned area over the historical simulation and the spatial patterns of differences in burned area between 1921 and 2013, as in this period all forcings are transient and different from the values used in the spin-up. Annual global values are an area weighted average using the grid cell area. We quantify the response of the models to each driving factor using the absolute difference in burned area between the baseline and the respective sensitivity experiment (SF1-SF2_i, with i in CO 2 , FPO, FLA, FLI, and CLI; see Table 1 for details). Positive differences mean that the transient change of the factor leads to an increase in burned area. We use the Climate Data Operators (CDO version 2018: Climate Data Operators; available at: http://www.mpimet.mpg.de/cdo, last access: 30 January 2019) to process and remap the simulated outputs. We test the difference time series for trends over the period from 1921 to 2013 using the Mann-Kendall test, implemented in the R package Kendall (McLeod, 2011). We quantify the global trend as the slope of a linear regression and summarize the spatial distribution of trends by quantifying the area with significant positive trends and the area with significant negative trends.
Due to a postprocessing error, INFERNO lacks 2 years in SF2_CO2 (2001 and 2002).

Model-data comparison
To evaluate the simulations of burned area, we compare the simulated burned area with remote sensing data products. Global burned area observations from satellites still suffer from substantial uncertainty, as reflected by the considerable differences in spatial and temporal patterns between different data products (Humber et al., 2018;Hantson et al., 2016a;Chuvieco et al., 2018;. Using multiple satellite products in model benchmarking is one approach which takes into account these observational uncertainties (Rabin et al., 2017a). In this study, we use three satellite products: GFED4 (Giglio et al., 2013), GFED4s (van der Werf et al., 2017 and FireCCI50 (Chuvieco et al., 2018). GFED4 is a gridded version of the MODIS Collection 5.1 MCD64 burned area product. It is known that this product strongly underestimates small fires, including cropland fires (e.g. Hall et al., 2016). In GFED4s, burned area due to small fires is estimated based on MODIS active fire (AF) detections and added to GFED4 burned area. However, this methodology may introduce significant errors related to erroneous AF detections (Zhang et al., 2018). As a comple-mentary product, FireCCI50 was developed using MODIS spectral bands with higher spatial resolution than MCD64. A higher resolution enhances the ability to detect smaller fires; however, this improvement is partially offset by suboptimal spectral properties of the bands. Both GFED4s and FireCCI50 have a larger burned area than GFED4. Since all three products are based on MODIS data, the inter-product differences probably underestimate uncertainties associated with these products. A recent mapping of burned area for Africa using higher-resolution Sentinel-2 observations indicates that all three products substantially underestimate burned area (Roteta et al., 2019). For the model evaluation we use temporally averaged burned area fraction for the years 2001-2013, which is the interval common to all three satellite products and the model simulations. We resample the model outputs to the lowest model resolution (CLASS-CTEM: 2.8125 • × 2.8125 • ) with first-order conservative remapping. We quantify the agreement between models and observations by providing the global burned area and the Pearson correlation coefficient for the between grid cell variation (see Table 3). We choose the Pearson correlation as it quantifies the covariation of the spatial patterns and is less sensitive to the highly uncertain absolute burned area values. Burned area has a strongly skewed distribution, with few high values and many small values close to, or equal to, zero. These few high values have a much higher contribution to the overall correlation (see Fig. A9 in Appendix), and therefore the metric is strongly determined by the performance of the model in areas with high burning. Square root or logarithmic transformation leads to more normally distributed values that reduce this bias (see Fig. A9). As the logarithm transformation excludes grid cells with zero burned area, we adopt the square root transformation.
In spite of major advances in mapping burned area based on satellite data, these data products include major uncertainties. GFED4 and FireCCI50 provide uncertainty estimates for the burned area. Applying Gaussian error propagation, which assumes that errors are independent and normally distributed, yields uncertainty estimates of 0.01 % (GFED4) and 0.2 % (FireCCI50) of the global burned area, which is certainly an underestimation. The assumptions of normal distribution and independence are likely violated. The spread between global burned area datasets is probably a more realistic estimate. Since all the products rely on the MODIS sensor, this approach will not capture the full uncertainty. Nevertheless, to investigate the effect of data quality in the observations on the model-data comparison we use the burned area product uncertainty estimates (aggregated to model resolution assuming independence) to group the observations into points with low, medium and high uncertainty (low: within the 0-33rd percentile, medium: within the 33rd-66th percentile and high: within the 66th-99th percentile of the relative uncertainty; estimates = uncertainty / burned area). We then compute the correlations for datapoints with low, medium and high uncertainty separately. The models show magnitudes of annual global burned area between 354 and 531 Mha yr −1 for present day. This is comparable to the estimates obtained from the satellite products, which range from 345 to 480 Mha yr −1 (see Fig. 1, Table 3). The correlation coefficients between all of the simulations and the satellite observations are reasonable, with values ranging from 0.51 (CLASS-CTEM and GFED4s) to 0.8 (ORCHIDEE-SPITFIRE and GFED4; see Table 3). In general, the correlations with GFED4 are highest and with GFED4s being the lowest for almost all models -which may reflect the fact that most models do not explicitly simulate agricultural fires or may indicate inaccuracies in the mapping of agricultural fires in the GFED4s dataset. The correlation coefficients strongly decrease with increasing observational relative uncertainty (see Table A2 in Appendix). This shows that part of the mismatch in the spatial patterns between simulations and observations is a consequence of uncertainties in the satellite products themselves. The FireMIP models simulate the broad-scale patterns in burned area reasonably well (see Fig. A1), with maxima in the major fireaffected regions of the Sahel, southern Africa, northern Aus-tralia and the western USA. All of the models tend to overestimate the burned area in South America and also in the temperate regions of the USA. For a more detailed evaluation of the burned area see Forkel et al. (2019a). The simulated trend in burned area in the historical simulation differs between the models (see Fig. 1). All models show a significant trend over the time series from 1921 to 2013 (see Table 4). Models that have a relatively high total burned area initially (LPJ-GUESS-SIMFIRE-BLAZE and CLASS-CTEM) show a decline in burned area over the 20th century. Most models that have a low burned area (INFERNO, ORCHIDEE-SPITFIRE and LPJ-GUESS-SPITFIRE) show an increasing trend. JSBACH-SPITFIRE and CLM have intermediate levels in burned area and show a weak decreasing trend over the 20th century.
Satellite records show a decline in global burned area since 1996 (Andela et al., 2017). However, as Forkel et al. (2019b) have shown, the significance of the observed global decline is strongly affected by the length of the sampled interval because of the high inter-annual variability in burned area and trends between products show only a low correlation (Forkel et al., 2019b).
No observations document the longer-term trends in burned area. Charcoal records (Marlon et al., 2008(Marlon et al., , 2016 and Table 3. Global burned area averaged over 2001-2013 in megahectare per year (Mha yr −1 ) and the Pearson correlation coefficients between the baseline experiment SF1 for all FireMIP models and the respective observation data. We use a square root transformation on both model and observations. All correlation coefficients are significant (p value < 0.05).

Model
Burned carbon monoxide data from ice-core records (Wang et al., 2010) are a proxy for biomass burning and show a global decrease in biomass burning over most of the 20th century. However, the charcoal records show an increase in burning since 2000 CE, but this discrepancy might reflect regional undersampling (for instance in Africa) or taphonomic issues of the charcoal record. A recent fire emission dataset (van Marle et al., 2017) merges information from satellites, charcoal records, airport visibility records and if no other information was available uses simulation results of the FireMIP models. This dataset is not included to evaluate the models here as it is partly based on the simulations of the FireMIP models and as it provides only estimates for emissions not burned area.
The understanding of the drivers on simulated trends that we give below provides insights on what causes the simulated trends and which assumptions control the trend. These insights will help to understand which observational constraints and process understanding is required to improve global fire models.

Response of simulated burned area to individual drivers
The response of burned area to the individual factors is determined by the changes in the driving factors and the sensitivity of the model to these changes. The population density forcing dataset has the strongest trend in the relative differences between the transient forcing and the year 1920 value followed by the land-use and land-cover change dataset. The trend in atmospheric CO 2 concentration is higher than the trend in the lightning dataset, which is more than twice as strong as in the air temperature. Wind speed shows the lowest trend of all investigated driving factors (see Table 4). Population density (SF2_FPO) and land-use change (SF2_FLA) cause the largest divergence between models in trends of burned area (slope between −1.05 and 1.345 Mha yr −1 and between −1.485 and 1.845 Mha yr −1 , respectively). All models have a statistically significant trend in burned area for SF2_FPO as well as for SF2_FLA, except for CLM for SF2_FLA (see Table 4, Fig. 2b and c). For SF2_CO2 all models have a significant trend, however, the magnitude of the trend is much smaller compared to the trend due to anthropogenic factors. LPJ-GUESS-SPITFIRE and JSBACH-SPITFIRE have strong trends (> 0.5 Mha yr −1 ), for all other models the slope is close to zero (< 0.15 Mha yr −1 ; see Table 4, Fig. 2a). The differences between models are increasing over the 20th century for these first three experiments. The response to changes in lightning and climate generally shows much smaller trends but high inter-annual variability: none of the models has a significant trend for climate. Three models show significant (but inconsistent 0.014, 0.334 and −0.074 Mha yr −1 ) trends for lightning (see Table 4). The inter-annual variability is stronger for climate. The mean standard deviation of the absolute differences averaged over all models is 30 Mha for climate and 7 Mha for lightning (only 3 Mha if the model with the strongest response is excluded; see Fig. 2d and e). The spatial patterns of trends in burned area are mostly heterogeneous (see Figs. A3-A7). The global trend can be dominated by changes in limited areas of the world, while the lack of a global trend can reflect opposing trends in different regions. A detailed regional analysis is beyond the scope of this study, but we provide an alternative global view by quantifying the area affected by positive or negative trends (see Fig. 3). This comparison shows that for most models larger areas show significant positive trends for the reference simulation (5 models), increasing atmospheric CO 2 concentration (5 models) and varying climate (5 models and 1 equal areas). There is no clear signal of either positive or negative trends across the models for the other simulations. For climate and lightning smaller areas have significant trends (see Fig. 3). For ORCHIDEE-SPITFIRE and LPJ-GUESS- Table 4. Trends (slope and standard error of a linear regression, megahectare per year, Mha yr −1 ) in annual global burned area for the years 1921-2013 for the baseline experiment SF1 and absolute difference time series of annual burned area. The trends for the forcing datasets are based on the relative difference between the transient forcing and year 1920 values for SF2_CO2, SF2_FPO and SF2_FLA and are based on the relative difference between the transient and the recycled forcing for SF2_FLI and SF2_FCL for the years 1921-2013 (%) (see Table 1). Bold values indicate significance based on a Mann-Kendall test (p value < 0.05). Experiments that are not available for specific models are indicated with NA.

Model
Sensitivity experiments  SPITFIRE all factors but climate cause a significant positive trend globally (see Table 4), and larger areas have positive trends for all factors, with the exception of lightning for LPJ-GUESS-SPITFIRE (see Fig. 3). On the other end of the model range, LPJ-GUESS-SIMFIRE-BLAZE only shows a positive global trend for climate and shows positive trends induced by atmospheric CO 2 concentration in larger areas (see Fig. 3).
In the following paragraphs we detail the inter-model differences and their causes for each sensitivity experiment.

Response of simulated burned area to atmospheric CO 2 concentration
The overall changes in burned area in individual simulations as a result of atmospheric CO 2 concentration changes are a complex response to multiple changes in vegetation, changes in land cover, fuel load, fuel characteristics and fuel moisture. Burned area can either increase due to higher availability of fuel loads or decrease due to changes in flammabil-  Table 1).

Figure 3.
Area with a significant positive trend (red bar) or with a significant (Mann-Kendall test p < 0.05) negative trend (blue bar) in burned area fraction averaged over the years 1921-2013 for the baseline experiment SF1 and for the absolute differences in burned area fraction between the sensitivity experiments SF2 and SF1 (see Table 1). See Figs. A2-A7 for comparison.
ity caused by different fuel properties. The FireMIP models react to increasing atmospheric CO 2 concentration in different ways: some models (JSBACH-SPITFIRE and LPJ-GUESS-SPITFIRE) show a strong increase in burned area, some (CLM and INFERNO) show a moderate increase, CLASS-CTEM shows a slight decrease, and LPJ-GUESS-SIMFIRE-BLAZE and ORCHIDEE-SPITFIRE show a non-monotonic response (see Fig. 2a). For all models, the trends over the 20th century are significant (see Table 4). We use changes in vegetation carbon to understand changes in fuel load and composition because information on the amount of fuel used within the fire models was not available for individual plant functional types (PFTs). All models show an increase in total vegetation biomass ("total" is indicated by solid lines; see Fig. 4) as expected because of higher productivity (Farquhar et al., 1980;Hickler et al., 2008) and increased water use efficiency (De Kauwe et al., 2013). The response of specific types of vegetation carbon to increasing atmospheric CO 2 concentration varies between the vegetation models. The biomass of C 3 vegetation (trees and C 3 grasses) increases in all of the models. The biomass of C 4 grasses increases in CLASS-CTEM, IN-FERNO and JSBACH-SPITFIRE, but it does not change in ORCHIDEE-SPITFIRE. Since ORCHIDEE-SPITFIRE was run with fixed vegetation distribution, changes in the extent of different PFTs can be ruled out as a cause of changes in vegetation carbon. There is a decrease in burned area in regions with abundant C 4 grasses (Sahel and north Australia) in this model, suggesting that changes in fuel type (increased C 3 tree biomass) result in changes in flammability in these regions. The carbon stored in C 4 grasses is reduced in response to increasing atmospheric CO 2 concentration in CLM and LPJ-GUESS-SIMFIRE-BLAZE and is fairly constant in LPJ-GUESS-SPITFIRE. This can be a result of a decrease in C 4 grass cover in LPJ-GUESS-SIMFIRE-BLAZE and LPJ-GUESS-SPITFIRE. However, since CLM was run with prescribed vegetation cover, the reduction in C 4 carbon must reflect the fact that any increase in C 4 grass biomass due to higher atmospheric CO 2 concentration is offset by greater losses through burning due to the increased total fuel load.
CLM and LPJ-GUESS-SIMFIRE-BLAZE include an interactive nitrogen cycle, and CLASS-CTEM includes a noninteractive nitrogen downregulation. Effects of atmospheric CO 2 concentration on vegetation biomass for these three models are therefore at the lower end of the model ensemble. The strength of atmospheric CO 2 concentration effects on productivity is still uncertain and quantitative information about effects on fuel loads is not available. Comparisons with experimental data suggest that models that do not include the nitrogen cycle overestimate the effect on productivity (Hickler et al., 2015). However, an analysis using an observation-based emergent constraint on the long-term sensitivity of land carbon storage shows that models from the Coupled Climate Model Intercomparison Project (CMIP5) ensemble, which includes an interactive nitrogen cycle, un-derestimate the impact of atmospheric CO 2 concentration on productivity (Wenzel et al., 2016).
Soil moisture is used by several models to compute fuel moisture (see Fig. 5). Soil moisture can be influenced by different atmospheric CO 2 concentrations as reductions in stomatal conductance can lead to increases in soil moisture, whereas increases in the leaf area index (LAI), caused by increased biomass of increased tree cover, lead to higher transpiration and therefore lower soil moisture. Soil moisture increases slightly in four models (INFERNO, CLASS-CTEM and CLM, JSBACH-SPITFIRE) and decreases slightly in ORCHIDEE-SPITFIRE. Only LPJ-GUESS-SPITFIRE shows a strong decrease (5 % in global average) in soil moisture (see Fig. 6).
Models which include fuel load and moisture effects through threshold functions (see Fig. 5; CLASS-CTEM, IN-FERNO and CLM) tend to show muted responses. Decreases in burned area appear to be largely caused by increases in soil moisture or tree cover. Increases associated with increasing fuel load are limited to regions with low biomass. The balance between these effects differs between the models. CLASS-CTEM shows a small decrease in burned area globally, and the spatial pattern is dominated by areas with negative trends in burned area, but there are positive trends in dry regions (see Fig. A3). The small global increase in burned area in INFERNO is likely related to increased fuel loads, while negative trends in burned area only occur in the tropical regions (see Fig. A3). INFERNO uses a constant burned area per PFT that is set to 0.6, 1.4 and 1.2 km 2 for trees, grass and shrubs, respectively. CLM shows increased global burned area, but increases are located in dry areas while the boreal regions show decreases. JSBACH-SPITFIRE and LPJ-GUESS-SPITFIRE respond to elevated atmospheric CO 2 concentrations with a strong increase in burned area, likely driven by increases in fuel load. LPJ-GUESS-SPITFIRE additionally shows a strong decrease in soil moisture, which might explain why this model shows the strongest increase in burned area. ORCHIDEE-SPITFIRE shows lower burned area in response to elevated atmospheric CO 2 concentrations but the decreases are mainly localized in the regions with very high burned area (Sahel and northern Australia; see Fig. A3) and are likely driven by the increase in C 3 woody biomass (see Fig. 4) as SPITFIRE is very sensitive to this type of fuel (Lasslop et al., 2014). LPJ-GUESS-SIMFIRE-BLAZE shows an initial increase and then a decrease in burned area at the end of the simulation. The spatial pattern is mixed, and the decrease in C 4 grass biomass indicates that woody thickening, either due to changes in land-cover fraction or fuel composition is the reason for this reduction in burned area. An increase in woody plants with higher atmospheric CO 2 concentration is expected (Wigley et al., 2010;Buitenwerf et al., 2012;Bond and Midgley, 2012). Their coarser and less flammable fuel can lead to reduced burned area. A recent study using an optimized empirical model indicates that increases in biomass lead to decreases in burned area in regions with high fuel loads, which is likely due to increases in coarser fuels and to increases in burned area in fuel-limited regions (Forkel et al., 2019b).

Response of simulated burned area to population density
The population density forcing used for FireMIP increases in every region of the globe over time as well as in annual global values (Goldewijk et al., 2010). This increasing population density is associated with a monotonic increase in global burned area for LPJ-GUESS-SPITFIRE, and monotonic decreases for LPJ-GUESS-SIMFIRE-BLAZE and CLM. The remaining models show a peak in the impact of population density on burned area around 1950 and a subsequent decline (see Fig. 2b). Models, however, largely agree on a decreasing trend due to the impact of population density since 1921 (see Table 4), and the ones that show a positive trend did not reproduce the relationship between population density and burned area in a multivariate model evaluation (Forkel et al., 2019a). Changes in population density therefore, very likely, contributed to a decrease in global burned area since 1921. All the models, except LPJ-GUESS-SIMFIRE-BLAZE, include the number of anthropogenic ignitions (I A ) or the probability of fire due to anthropogenic ignitions (P i,h in CLASS-CTEM) in the calculation of burned area. Most of the models represent the number of anthropogenic ignitions with an increase up to a certain threshold number and then a decline, implicitly assuming that for high population densities humans suppress fires (SPITFIRE-models, INFERNO and CLM; see Fig. 7). CLASS-CTEM, JSBACH-SPITFIRE and CLM include explicit terms to account for the effects of suppression not only on ignitions but also on fire size, or duration or both (see Fig. 8). The combination of the ignition and suppression terms in CLASS-CTEM leads to a maximum impact of humans on burned area at intermediate population density. The combination of ignition and suppression mechanisms dependant on population thresholds explains why most of the models have non-monotonic changes in burned area as population increases during the 20th century. LPJ-GUESS-SPITFIRE is the only model that shows a monotonic increase in burned area in response to increasing population density; other models that include the SPITFIRE fire module (JSBACH and ORCHIDEE) show the non-monotonic trajectory that results from the shift from the dominance of ignitions to that of suppression on burned area. ORCHIDEE-SPITFIRE has a much lower contribution from anthropogenic ignitions than LPJ-GUESS-SPITFIRE and therefore different spatial patterns of burned area (see Fig. A1); JSBACH-SPITFIRE has an additional suppression term based on fire size data (Hantson et al., 2015a). The inclusion of additional suppression mechanisms may also explain the behaviour of CLM, which shows a monotonic decrease in burned area over the 20th century.
LPJ-GUESS-SIMFIRE-BLAZE does not include anthropogenic ignitions explicitly but rather treats the net effect of changes in population density, which was optimized using burned area satellite data (Knorr et al., 2014). This optimized net effect is a monotonic decrease in burned area with increases in population density. This explains why this model shows a monotonic decrease overall (see Fig. A4).
The models all agree that at high population density fire is suppressed. This leads to similarities in the spatial patterns of the effect of population changes (see Fig. A4), but they differ in their assumptions for low population density, for the threshold where humans start to suppress fire and whether explicit suppression is included. The net or emerging effect of humans on burned area in models, however, also depends on the presence of lightning ignitions. The presence of lightning ignitions reduces the limiting effect of a lack of human ignitions on burned area. For the CLASS-CTEM model as soon as lightning ignitions are present, the net effect of humans is to suppress fires, even though the underlying relationship assumes an increase in ignitions with population density (Arora and Melton, 2018, Supplement). This may explain why global models assuming an increase in ignitions with increases in population density are able to capture the burned area variation along population density gradients Arora and Melton, 2018) and why global statistical analyses find a net human suppression also for low population density (Bistinas et al., 2014).

Response of simulated burned area to land-use change
The land-use change imposed in SF2_FLA is characterized by a strong decrease in forested areas and an increase in pastures and croplands (Hurtt et al., 2011). The FireMIP models do not show a uniform response of burned area to land-use change. LPJ-GUESS-SPITFIRE shows the strongest reaction with a monotonic increase in burned area with land-use change. INFERNO and ORCHIDEE-SPITFIRE also show an increasing trend but of lower magnitude. CLASS-CTEM, JSBACH-SPITFIRE and LPJ-GUESS-SIMFIRE-BLAZE show a decreased burned area due to increased land use. CLM also shows a decrease in burned area, but this change is not significant (see Fig. 2c). The FireMIP models handle land-cover dynamics, the expansion of agricultural areas and fire in agricultural areas differently. Some of the models (CLASS-CTEM, CLM, JSBACH-SPITFIRE and ORCHIDEE-SPITFIRE) prescribe the vegetation distribution so that the land-cover fraction for all PFTs does not change through time in SF2_FLA, while in the SF1 simulation the cover fractions of natural PFTs are reduced according to the expansion of agricultural areas. The other models simulate the distribution of the natural vegetation dynamically but prescribe the agricultural areas. All models decrease the tree cover to represent the expansion of croplands over time. Land conversion due to the expansion Figure 4. Relative difference in global carbon stored in C 4 grasses (dashed lines), in C 3 trees (dotted lines), in C 3 grasses (dash-dotted lines) and in total global carbon stored in vegetation (solid lines) between the baseline experiment SF1 and the sensitivity experiment SF2_CO2 (see Table 1; C V,CO 2 ) for 1950-2013 in percent (annual averages). C 4 and C 3 grasses, as well as C 3 trees, only include natural PFTs (pastures and croplands excluded). Note that the y axis limits differ between the panels. Due to a postprocessing error, INFERNO lacks 2 years (2001 and 2002).  CLM (a, b, c). Impact of soil moisture content and soil wetness on fire for CLASS-CTEM, CLM and INFERNO (d, e, f). In order to facilitate comparability, the soil moisture function for CLM is scaled to the value range (0,1). Figure 6. Annual average of the relative difference in volumetric soil moisture (CLM) and total soil moisture content (remaining models) between the baseline experiment SF1 and the sensitivity experiment SF2_CO2 (see Table 1 of pasture is not represented in CLASS-CTEM. Only CLM includes cropland fires, INFERNO treats croplands as natural grasslands, and all the other models exclude croplands from burning (see Table 5). Therefore for all models, except CLM and INFERNO, increases in cropland area lead to a reduction in burned area, and the reasons for the divergence between the other models must be caused by the treatment of pastures.
In LPJ-GUESS-SIMFIRE-BLAZE pastures are harvested; this reduction in biomass leads to a decrease in burned area in addition to the decrease caused by exclusion of fire in croplands. In JSBACH-SPITFIRE, the expansion of pastures occurs preferentially at the expense of natural grassland and does not affect tree cover until all the natural grassland has been replaced (Reick et al., 2013). This assumption decreases the effect of land-cover conversion on tree cover. Additionally, in JSBACH-SPITFIRE the fuel bulk density of pastures is higher than that of natural grass by a factor of 2, which decreases fire spread and thus burned area (Rabin et al., 2017b). This difference reduces burned area in pastures compared to natural grassland. In CLASS-CTEM, which also shows a decline, pastures are not included, and the only land conversion is due to the expansion of croplands.
LPJ-GUESS-SPITFIRE and ORCHIDEE-SPITFIRE react with an increase in burned area to the expansion of land use since they treat pastures as natural grasslands. The SPIT-FIRE fire module is very sensitive to the vegetation type with very high burned area for natural grasslands due to higher flammability compared to woody PFTs (Lasslop et al., 2014. Fuel bulk density is an important parameter, but additionally grass fuels dry out faster leading to an increase in flammability. Therefore an increase in burned area is observed if forested areas are converted to grasslands. LPJ-GUESS-SPITFIRE computes the vegetation cover dynamically, so that an increase in burned area reduces the cover fraction of woody types, which might explain the stronger response compared to that of ORCHIDEE-SPITFIRE. In CLM, pastures are represented by increased grass cover. The biomass scaling function does not distinguish fuel types (see Fig. 5); therefore the lower fuel amount of grasslands could lead to a decrease in fire probability, while the maximum fire spread rate depends on the vegetation type and is higher for grasslands (Rabin et al., 2017b). The inclusion of cropland and deforestation fires dampen the effect of land-cover change on global burned area. In INFERNO, agricultural regions are not defined explicitly. Instead, woody PFT types are excluded from the agricultural area (Clark et al., 2011). INFERNO includes an average burned area for each PFT in the calculation of the burned area per PFT, which leads directly to increasing grass cover and results in higher burned area (Mangeon et al., 2016;Rabin et al., 2017b).
Land use was already identified as a main reason for intermodel spread in the CMIP5 ensemble . We show that this largely reflects the way pastures are treated, as most models used here (except CLM and IN-FERNO) simply exclude croplands from burning.

Response of simulated burned area to lightning
Most of the models show a low response of burned area to lightning (see Fig. 2), although lightning rates increase by 20 % over the simulation period -an increase that is much larger than the 3.3 % change, between pre-industrial times and the present, estimated from a recent modelling study (Krause et al., 2014). ORCHIDEE-SPITFIRE shows an increase in burned area between 1940 and 1960 and towards the end of the simulation. In comparison to the other SPITFIREmodels the differences seem to be related to two points. Firstly, ORCHIDEE-SPITFIRE uses a 12-times higher factor to convert lightning strikes to actual ignitions and anthropogenic ignitions that are 100-times lower than for the other models (see Rabin et al., 2017b). Secondly, although a partitioning factor (SGFED) varies regionally, the per capita ignition frequency is constant; in JSBACH-SPITFIRE and LPJ-GUESS-SPITFIRE, the per capita ignition frequency varies regionally. This results in strong differences in the spatial patterns of burned area (see Fig. A1). Consequently, the strength of regions contributing to the global burned area varies between the models; ORCHIDEE-SPITFIRE shows much more burning in the tropical and far less burning in the temperate region. Whether a lightning turns into a fire depends on the local conditions at the time of the lightning strike. Differences in the spatial distribution and timing of fires can therefore lead to different responses between models even if lightning is used in the same way within the model. Our results show that even a substantial increase (20 %) in lightning has little influence on simulated global burned area. This is consistent with Krause et al. (2014), who  found that the pre-industrial-to-present increase in lightning, although this increase is much smaller, had little impact on burned area.

Response of simulated burned area to climate
Simulated burned area in FireMIP responds to changes in climate with strong inter-annual variability but only weak trends in burned area (see Fig. 2e). Only three models show a statistically significant trend in the global burned area according to a Mann-Kendall test (CLM, LPJ-GUESS-SIMFIRE-BLAZE and ORCHIDEE-SPITFIRE; see Table 4). However, in all models the area showing an increased burned area in response to climate is higher than the area with decreased burned area (see Fig. 3). Agreement in spatial patterns of trends between the models is however low (see Fig. A7).
The influence of climate on burned area is complex: it influences burned area through the meteorological conditions and through effects on vegetation conditions that influence fuel load and fuel characteristics (Scott et al., 2014). We therefore correlated for each grid cell changes in physical parameters (precipitation, temperature, wind speed and soil moisture) and vegetation parameters (litter, vegetation carbon and grass biomass) with changes in burned area. We find that the correlation between the individual parameters and burned area is low (see Fig. A8). The absolute rank correlations are lower at the monthly scale than at the annual scale. However, at the monthly scale the number of grid cells showing significant correlations with physical parameters is higher than the number showing significant correlations with vegetation parameters, indicating that changes in physical parameters have more influence at shorter timescales than changes in vegetation parameters. This difference disappears with the aggregation to annual timescale. On the annual timescale, however, the mean absolute rank correlation is slightly higher for the vegetation parameters. Soil moisture which is also influenced by vegetation has a slightly higher correlation compared to precipitation, temperature and wind speed. This indicates that vegetation parameters are more influential on the longer annual time step and physical parameters on the monthly time step. The relationship between precipitation or soil moisture and burned area is expected to be negative, while the impact of temperature is expected to be positive. This is clearly reflected in the percentage of positively significant correlations at the annual scale but is less clear at the monthly time step. This might reflect that the seasonality of temperature, precipitation and vegetation parameters is often synchronized, and therefore the effects of the parameters cannot be separated. The low correlation between individual parameters and burned area reflects the complex interactions between the climatic drivers, vegetation conditions and fire weather.
The impact of climate on the inter-annual variability, however, is strongly expressed in the simulated burned area. This is consistent with the finding that recent precipitation changes influence inter-annual variability in fire but have lit- Table 5. Treatment of agricultural fires (Rabin et al., 2017b). "None" indicates the vegetation type does not burn or that deforestation fires are not represented in the model. The models treating pasture fire the same as grassland do not treat pasture as a specific PFT. The indication "no pasture" means that there is no land-cover change due to pastures.
To fully understand the impact of the changes in climate, a number of simulations would be necessary, in which only individual climate parameters change while the others are kept constant. In addition, simulations in which combinations of variables change might give further insights into the synergies between the variables. An alternative approach, given the complex interactions between climate and vegetation parameters, might be to disentangle the model signals using multivariate analysis (see e.g. Forkel et al., 2019a;Lasslop et al., 2018).

Implications for model development and applications
Global vegetation models are an important tool for examining the impacts of climate change and are used in policyrelevant contexts (IPCC, 2014;Schellnhuber et al., 2014;IPBES, 2016). Given the various influences of fire on the ecosystems (Bond et al., 2005), the carbon cycle and climate (Lasslop et al., 2019) improvements of global fire models are particularly important.
The main concern for model applications is the large spread of the historical simulated burned area. It remains difficult to evaluate and optimize the transient burned area simulations as the period observed by satellites is still short, and the trends are not robust (Forkel et al., 2019b). Fire proxies (charcoal and ice cores) give information on biomass burning over longer timescales. They do not confirm the recent decrease in burned area detected by satellites but also only contain very few datapoints for that period (Marlon et al., 2016). For a valid comparison with the long-term fire proxies, the inclusion of estimates of deforestation fires in the models will be crucial as land-use change fire emissions will likely have a strong contribution to the signal (Marlon et al., 2008). An improved understanding of uncertainties in observed trends of fire regimes is therefore necessary. Only robust information should be included in models.
Our analysis shows which parts of the models are particularly important to simulate changes in burned area and need additional observational constraints or improved process un-derstanding. In line with previous research (Bistinas et al., 2014;Hantson et al., 2016a, b;Andela et al., 2017), the large divergence in the response to human activities between the FireMIP models shows that the human impact on fires is still insufficiently understood and therefore not constrained in current models.
We identify land-use change as the major cause of intermodel spread. Only one model explicitly includes fires associated with land-use and land-cover change (cropland and deforestation fires), and all the other models only include such effects through changes in vegetation parameters and structure. The inclusion of cropland fires is certainly important to understand and project changes in emissions, air pollution and the carbon cycle (Li et al., 2018;Arora and Melton, 2018). Cropland fires are, due to their small extent and low intensity, still a major uncertainty in our current understanding of global burned area . Biases in the spatial patterns of burned area and the relationship between cropland fraction and burned area can therefore be expected. High-resolution remote sensing may help to improve the detection (Hall et al., 2016). Moreover, understanding why and when humans burn croplands on a regional scale may help to find an adequate representation of cropland fires within models and help avoid overfitting to observational datasets. As croplands are simply excluded from burning in most models (except two), the spread of the other models is likely related to the treatment of pastures. Fires on pasturelands have been estimated to contribute to over 40 % of the global burned area (Rabin et al., 2015). Pasture fires are not treated explicitly in any of the models, although some models slightly modify the vegetation on pastures by harvesting or changing the fuel bulk density (see Table 5). Expansion of pastures is mostly implemented by simply increasing the area of grasslands. Information on how fuel properties differ between pastures and natural grasslands could therefore help to improve model parameterizations. Prescribing fires on anthropogenic land covers can be a solution for certain applications of fire models (Rabin et al., 2018). Grazing intensity was found to be related to decreases in burned area (Andela et al., 2017). Models so far represent the area that is converted due to land-cover change but not the intensity of land use. This was partly due to the lack of global data regarding land-use intensity, which is now becoming available and provides new opportunities for fire model development (e.g. the LUH2 dataset; Hurtt et al., 2017). In the sensitivity simulations shown here, even models that decrease burned area due to land-use and land-cover change do not show a further decrease over the last decade. This indicates that model input datasets, explicit in time and space, for land-use intensity and grazing intensity are necessary for fire projections. The level of socioeconomic development also modifies the relationship between humans and burned area (Andela et al., 2017;Forkel et al., 2017). Regional analysis of remote sensing data could be highly useful, as a global relationship between burned area and individual human factors, as assumed in many models and also statistical analyses, is not likely. Assumptions on how different human groups (hunter-gatherers, pastoralists and farmers) use fire have been included in a paleofire model (Pfeiffer et al., 2013). The development of such an approach for modern times would be highly valuable for fire models that aim to model the recent decades and future decades. Deforestation fires are only included in one model (CLM). As deforestation fires are likely a strong source of biomass burning over the longer timescales, accounting for deforestation fires will be crucial for a model comparison with the charcoal record.
We also find inter-model agreement for certain aspects. For instance, burned area is suppressed at high population densities, which leads to a similar spatial response to population density (see Fig. A4). Moreover, most models show a reduction of the global burned area due to changes in population density. The response functions of burned area to population density of the two models that increase burned area is less in line with response functions derived from global datasets (Forkel et al., 2019a). As a strong human suppressive effect is well supported by satellite observations (Andela et al., 2017;Hantson et al., 2015b), a reparametrization of these responses would be reasonable.
We show that, although all models show an overall increase in biomass as a consequence of increasing atmospheric CO 2 concentration, models disagree about whether this results in an increase or decrease in burned area. The disagreement reflects the complex ways in which changes in atmospheric CO 2 concentration influence vegetation properties, which results in different responses in different ecosystems. For LPJ-GUESS-SPITFIRE and JSBACH-SPITFIRE, the CO 2 fertilization effect considerably contributed to an increase in burned area. Such an effect is so far only supported for fuel-limited areas (Forkel et al., 2019b). Limiting the effect of increasing fuel load on burned area in regions with high fuel load as used in other models could help to reduce the increase in burned area simulated by JSBACH-and LPJ-GUESS-SPITFIRE.
Climate and lightning have a much lower effect on the trends than the other factors. While this study focuses on the trends, research on the short-term variability and extreme events will be highly useful to investigate fire risks. The influence of climate and lightning on fire are therefore important research topics even if we find a comparably low influence on the long-term trends. Moreover the trends in climate parameters may increase for the future, and therefore the influence on burned area might increase.
In contrast to many model simulations that use a lightning climatology based on satellite observations, the FireMIP experiments were driven by a transient dataset of lightning activity created by scaling a mean monthly climatology of lightning activity using convective available potential energy (CAPE) anomalies of a global numerical weather prediction model. Since climate changes can be expected to cause changes in lightning, it will be important to develop transient lightning datasets for climate change studies on fire. The use of present-day lightning patterns, for example, will certainly lead to an overestimation of lightning strikes in regions with drier climate projected in the future. But not only spatial patterns of lightning are important, the covariation with climate, as well as the temporal resolution of the input dataset, determines the influence on burned area (Felsberg et al., 2018). Although we do not detect large signals in global burned area due to changes in lightning, lightning is known to be an important cause of ignitions regionally and is potentially involved in more complex interactions between fire, vegetation and climate, which can speed up the northward expansion of trees to the north in boreal regions (Veraverbeke et al., 2017). Thus, although our results suggest that the influence of increasing lightning is negligible at a global scale, it is a potentially important factor for process-based models that aim to model interactions between fire, vegetation and climate.
Recent advances in remote sensing products have high potential to support model development. However, remotely sensed burned area datasets alone are not a sufficient basis to evaluate fire models as many model structures can lead to reasonable burned area patterns. The emergence of longer records of burned area and the increasing availability of information on other aspects of the fire regime considerably improve opportunities to evaluate and improve our models. The FRY database  and the global fire atlas (Andela et al., 2019), for example, provide information on fire size, numbers of fire, rate of spread and the characteristics of fire patches. These datasets will be useful to, for instance, separate effects of ignition and suppression. Rate of spread equations in global fire models are at present either very simple empirical representations tuned to improve burned area or based on laboratory experiments (Hantson et al., 2016a). The mentioned datasets now offer the opportunity to derive parameters for rate of spread equations at the spatial scales these models operate on. Fire size and rate of spread are important target variables besides burned area that can determine the impacts of fire. The effects on vegetation (combustion of biomass and tree mortality; Williams et al., 1999;Wooster et al., 2005) and on the atmosphere (Veira et al., 2016) are a function of fire intensity, which is also included in the FRY database . A better evaluation of such parameters can enhance the usability of fire model simulations.
The specific model application has a strong influence on judging the validity of a model. Our analyses of the controls on the variability of fire suggest that human activities drive the long-term (decadal to centennial) trajectories, while considering climate variability may be sufficient for short-term projections. Changes in the trends of the driving factors may change this balance. For instance, stronger changes in climate into the future may increase the relative importance of climate for long-term fire projections in the future.

Summary and conclusions
This comprehensive analysis of the influences of climate, lightning, atmospheric CO 2 concentration, population density and land-use and land-cover change provides improved understanding of the relation between simulated historical trends in burned area and process representations in the models. It shows in detail which model responses of burned area to environmental factors can be understood, how these are related to the model equations, and how these translate into trends of burned area for the historical period.
The analysis of the sensitivity experiments shows that the increase in atmospheric CO 2 concentration over the 20th century leads to increased burned area in regions where fuel loads increase, but it leads to decreased burned area in regions where tree density or coarse fuels with lower flammability increase, or in regions where elevations in soil moisture decrease flammability. Although models agree that the amount of available fuel increases, the type of fuel and vegetation composition are critical for understanding the influence of atmospheric CO 2 concentration on simulated burned area.
Most models agree on a decrease in burned area due to increases in population density. Most models link the number of ignitions to population in a way that ignitions increase initially at low population densities. In densely populated regions, all models assume that the effect of anthropogenic ignitions is outweighed by fire suppression and the increased fragmentation of the landscape by anthropogenic land use. It would be useful to develop an approach that represents local human-fire relationships, but this will likely remain a longterm challenge and requires the synthesis of knowledge from various research fields.
The simulated response of burned area to land-use and land-cover change depends on how fires in cropland and pastureland are treated in each model. Most models simply exclude croplands from the burnable area; therefore the treatment of pastures causes the largest part of the model spread. Models that do not allow fire in croplands, and either harvest biomass in pastures or assume specific vegetation pa-rameters, show a reduction in burned area. Models that treat pastures as natural grasslands and distinguish different fuel types or strongly increase burned area for grasslands show an increase in burned area. Improved knowledge on the effects of land-use intensity on burned area and the development of appropriate forcing datasets could strongly support model development.
The models are comparatively insensitive to changes in lightning, likely because lightning ignitions are not a limiting factor in many regions with very high burning activity. Previous studies however show the importance of lightning and changes in lightning for burned area in the boreal region. Therefore especially regional studies should pay attention to this factor.
None of the models shows a strong trend due to changing climate but all of them show a strong influence of climate on the inter-annual variability. Climatic and ecosystem parameters are only able to explain a rather small part of this variation, with stronger correlations for the ecosystem parameters on the longer annual timescale and a stronger relationship with climatic parameters on the monthly timescale.
Different drivers of burned area affect different timescales: the anthropogenic factors influence long-term variability, while climate and lightning affect short-term variability. Understanding the influence of climate and lightning is especially important for inter-annual variability and extreme events. On the other hand, understanding the impact of anthropogenic drivers is likely more important for the longerterm changes of fire, which is for instance needed in Earth system models. Changes in the trends of the forcing parameters might however affect the balance between them.
The uncertainties in global fire models need to be taken into account in model applications, for instance if model simulations are to be used to support climate adaptation strategies. Model ensemble simulations can give indications of such uncertainties. Therefore the results of this study provide a basis to interpret uncertainties in global fire modelling studies. The information content on the spatial variability of burned area has been well exploited in previous studies, and models reproduce the spatial patterns in a reasonable way. The temporal information of the satellite data is increasing with the increasing length of the record and has a higher potential to contain new information to support the improvement and evaluation of global fire models. Here we provide a summary of which model assumptions need additional constraints to efficiently reduce the uncertainty in temporal trends.
Code and data availability. Processed data and processing scripts are available upon request to publications@mpimet.mpg.de.  . Spatial distribution of regression slopes for the difference between the baseline experiment SF1 and the sensitivity experiment SF2_CO2 (SF1-SF2_CO2; see Table 1) over 1921-2013. Figure A4. Spatial distribution of regression slopes for the difference between the baseline experiment SF1 and the sensitivity experiment SF2_FPO (SF1-SF2_FPO; see Table 1) over 1921-2013. Figure A5. Spatial distribution of regression slopes for the difference between the baseline experiment SF1 and the sensitivity experiment SF2_FLA (SF1-SF2_FLA; see Table 1) over 1921-2013. Figure A6. Spatial distribution or regression slopes for the difference between the baseline experiment SF1 and the sensitivity experiment SF2_FLI (SF1-SF2_FLI; see Table 1) over 1921-2013. Figure A7. Spatial distribution of regression slopes for the difference between the baseline experiment SF1 and the sensitivity experiment SF2_CLI (SF1-SF2_CLI; see Table 1) over 1921-2013. Figure A8. Spearman rank-order correlation coefficient for each grid cell over 1921-2013 for the difference between the baseline experiment SF1 and the sensitivity experiment SF2_CLI (see Table 1) for annual burned area fraction, precipitation, temperature, wind speed, carbon stored in litter, carbon stored in vegetation, carbon stored in grass and in soil moisture, respectively. Panels (a) and (b) show the mean absolute rank correlation, i.e. the spatial average over the absolute and significant (p value < 0.05) Spearman rank-order correlation coefficients, in which the relative difference in burned area fraction is > 0.1. Panels (c) and (d) show the proportion of grid cells with a significant correlation. Panels (e) and (f) indicate the percentage of significant grid cells with a positive correlation. Figure A9. Scatter plots for the GFED4 and FireCCI50 dataset without transformation, square root transformation and log transformation (a). The colour indicates the influence of individual datapoints on the correlation (computed as the difference in the correlation with and without that datapoint). Cumulative influence of datapoints in the dataset on the correlation (b). Without transformation a very small fraction has a strong influence on the correlation; these are grid cells with high burned area fraction (as can be seen in a).  (2016) Melton and Arora (2016) Table A2. Correlation coefficients between burned area simulated by the FireMIP models within the baseline experiment SF1 and the respective observation data. Due to the very skewed distribution of burned area, we use a square root transformation on both the models and the observations. Numbers in brackets show the Pearson correlation coefficients for not-transformed data. Only GFED4 and FireCCI50 provide uncertainty estimates; therefore GFED4s is not included. Correlation coefficients for 33 % show the correlation between all grid points that lie within the 0 % and 33 % percentile of the relative standard error. Values for 66 % lie within the 33 %-66 % percentile of the relative standard error, and values for 99 % lie within the 66 %-99 % percentile. Bold numbers indicate correlation coefficients that are significant (p value < 0.05).