the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
High-resolution remote sensing and machine-learning-based upscaling of methane fluxes: a case study in the Western Canadian tundra
Anna-Maria Virkkala
Victor Brovkin
Tobias Stacke
Barbara Widhalm
Annett Bartsch
Carolina Voigt
Oliver Sonnentag
Mathias Göckede
Arctic methane (CH4) budgets are uncertain because field measurements often capture only fragments of the wet-to-dry gradient that control tundra CH4 fluxes. Wet hotspots are over-represented, while dry, net-sink sites are under-sampled. We paired over 13 000 chamber flux measurements during peak growing season in July (2019–2024) from Trail Valley Creek in the western Canadian Arctic with co-registered remotely sensed predictor variables to test how spatial resolution (1 m vs. 10 m) and choice of machine-learning algorithm shape upscaled CH4 flux maps over our 3.1 km2 study domain. Four algorithms for CH4 flux scaling (Random Forest (RF), Gradient Boosting Machine (GBM), Generalised Additive Model (GAM), and Support Vector Regression (SVR)) were tuned using the same stack of multispectral indices, terrain derivatives and a six-class landscape classification. Tree-based models such as RF and GBM offered the best balance of 10-fold cross-validated R2 (≤0.75) and errors, so RF and GBM were used in a subsequent step for upscaling to the study area. With 1 m resolution, GBM captured the full range of microtopographic extremes and predicted a mean July flux of 99 mg CH4 m−2 per month. In contrast, RF, which smoothed local extremes, yielded an average flux of 519 mg CH4 m−2 per month. The disagreement between flux estimates using GBM and RF correlated mainly with the Normalized Difference Water Index (NDWI), a moisture proxy, and was most pronounced in waterlogged, low-lying areas. Aggregating predictors to 10 m averaged the sharp metre-scale flux highs in hollows and lows on ridges, narrowing the GBM-RF difference to ∼75 mg CH4 m−2 per month while broadening the overall flux distribution with more intermediate values. At 1 m, microtopography was the main driver. At 10 m, moisture proxies explained about half of the variance. Our results demonstrate that: (i) metre predictors are indispensable for capturing the wet-dry microtopography and its CH4 signals, (ii) upscaling algorithm selection strongly controls prediction spread and uncertainty once that microrelief is resolved, and (iii) coarser grids smooth local microtopographic details, resulting in flattened CH4 flux peaks and wider distribution. At 10 m, however, flux estimates became more consistent between models and better represented broad moisture-driven patterns, suggesting improved generalisability despite some loss of detail. This is supported by findings for remote sensing derived seasonal subsidence which reflects moisture gradients. All factors combined lead to potentially large differences in scaled CH4 flux budgets, calling for a careful selection of scaling approaches, spatial predictor layers (e.g., vegetation, moisture, topography), and grid resolution. Future work should couple ultra-high-resolution imagery with temporally dynamic indices to reduce upscaling bias along Arctic wetness gradients.
- Article
(7926 KB) - Full-text XML
-
Supplement
(928 KB) - BibTeX
- EndNote
The Arctic is warming nearly four times faster than the global average due to Arctic amplification feedbacks (Previdi et al., 2021; AMAP, 2021; Rantanen et al., 2022; Ballinger et al., 2020). This rapid warming is of particular concern due to the substantial quantities of organic carbon stored in wetland ecosystems of the circumpolar permafrost region (Hugelius et al., 2014; Schuur et al., 2015; Turetsky et al., 2020; Olefeldt et al., 2016). Thaw exposure may mobilize part of the previously frozen carbon as methane (CH4), a greenhouse gas 28–34 times more potent than CO2 over 100 years (Koven et al., 2011; Etminan et al., 2016; Nisbet et al., 2019; Saunois et al., 2020). Rising temperatures, therefore, risk to trigger a positive feedback in which permafrost degradation elevates CH4 emissions, further intensifying warming (Schuur et al., 2015; Walter Anthony et al., 2018; Turetsky et al., 2020; Natali et al., 2021).
High-resolution CH4 flux measurements in tundra ecosystems remain sparse even during the growing season due to the Arctic's remoteness, harsh climate, and logistical challenges (e.g., lengthy travel times, high fieldwork costs, sparse infrastructure, and challenging equipment maintenance), which limits the number of long-term monitoring sites. The primary tools for plot- to ecosystem scale CH4 flux observations are flux chambers (Subke et al., 2021) and eddy covariance techniques, respectively (Matthes et al., 2014; Baldocchi, 2003); however, the time window to conduct growing season chamber campaigns is usually limited to a few months between June and September, and locations in the Arctic featuring eddy covariance towers are few (Vogt et al., 2025). As a consequence, most synthesis studies aiming at constraining CH4 budgets in the high northern latitudes must rely on a limited database biased toward high-emitting sites near research stations and often overlooking areas with net CH4 uptake (Mastepanov et al., 2013; Varner et al., 2022; Kuhn et al., 2021; Voigt et al., 2023c). Most tundra chamber campaigns collect data only for short intervals, typically from a single day up to a few weeks during the growing season, and many are conducted in just one growing season without repeated multi-year sampling or covering winter fluxes, which limits their value for model benchmarking and interannual analysis (Varner et al., 2022; Kuhn et al., 2021; Räsänen et al., 2021; Mastepanov et al., 2013; Treat et al., 2018).
Even where flux data exist, CH4 fluxes can shift within metres because the relative position and seasonal movement of the water table and the frost table create mosaics of anoxic (CH4 – producing) and oxic (CH4 – oxidising) soil (Frolking et al., 2011). These redox contrasts are further modulated by microtopography, plant functional type, and surface moisture (Mastepanov et al., 2013; Pirk et al., 2015; Olefeldt et al., 2021). Because the water table and frost table rarely coincide at the same depth across tundra microtopography, neighbouring microsites can experience very different oxic–anoxic conditions. Across the Arctic tundra, surface types range from water-saturated zones, such as sedge fens, polygon centres, troughs and thaw slumps, to better-drained features like hummocky ridges, palsas and gravelly uplands. These elements cover the entire CH4 flux range, with microtopographically lower, wetter zones acting as strong sources and microtopographically elevated, better-aerated zones often functioning as net sinks (Räsänen et al., 2021; Bao et al., 2021; Yuan et al., 2024). Such small-scale heterogeneity frequently occurs within a single 10 m pixel, so coarse maps or remote-sensing data products can combine zones of strong CH4 emission and neighbouring areas that act as net CH4 sinks (Knox et al., 2019; Treat et al., 2018). Without spatially explicit methods that resolve this fine-scale heterogeneity, upscaling can introduce systematic biases. It may overestimate CH4 emissions when dry areas that act as sinks are overlooked or underestimate them when narrow wet trenches surrounding dry patches are missed (Räsänen et al., 2021; Treat et al., 2018).
Ultra-high-resolution (<1–2 m) imagery from drones or commercial satellites can directly resolve fine-scale vegetation patterns and microtopographic features (e.g., hummocks and hollows) in heterogeneous tundra landscapes, for example mapping plant communities on dry polygon rims versus wet sedge hollows and other microrelief features that correspond to CH4 “hotspots” and “cold spots”, respectively. Studies using sub-metre to metre-scale imagery and plot-based observations have shown that fine spatial resolution is essential to capture local flux heterogeneity and microtopographic controls (Lehmann et al., 2016; Becker et al., 2008; Ström et al., 2005; Ludwig et al., 2024; Davidson et al., 2017). However, working with spatially ultra-high-resolution data presents significant challenges. The acquisition and processing of sub-metre or metre imagery through drones or advanced satellites and LiDAR are both costly and labour-intensive; such datasets are rarely available as dense, multi-date image stacks and cannot be easily collected over large areas (Scheller et al., 2022; Karim et al., 2024; Anderson and Gaston, 2013). Moreover, ultra-high resolution can introduce noise from small-scale elevation artefacts and micro-relief features that do not represent real hydrological connectivity, and thus may not lead to a better representation of environmental conditions (Riihimäki et al., 2021).
By contrast, high resolution (∼10 m) predictors such as Sentinel-2 multispectral imagery and ArcticDEM terrain products are freely available and cover the entire Arctic with regular revisits with a standardised approach (Drusch et al., 2012; Porter et al., 2023). However, the coarse 10 m resolution has a clear disadvantage because individual microtopographic features (e.g., hummocks, hollows) and landforms (e.g., dry palsas, wet trenches, etc.) that control small-scale variability in CH4 fluxes are aggregated into single pixels, blurring the fine-scale patterns of emission and uptake (Räsänen and Virtanen, 2019).
Data-driven approaches, including the machine-learning (ML) algorithms Random Forest (RF), Gradient Boosting Machine (GBM), and Support Vector Regression (SVR), as well as the semi-parametric statistical model Generalised Additive Model (GAM), can integrate predictors derived from remote sensing products with flux measurements to upscale CH4 from plot- and ecosystem- to landscape scales (Yuan et al., 2024; Chen et al., 2024; Zhang et al., 2020, Ying et al., 2025). Tree ensembles (RF, GBM) are particularly well suited for capturing complex interactions and handle multicollinearity, while GAMs have the advantage of yielding interpretable smooth functions, and SVR excels with limited nonlinear data (Wood, 2017; Smola and Schölkopf, 2004). Model choice, predictor resolution and limited training data still generate large spreads in upscaled Arctic tundra CH4 fluxes, with ensemble estimates differing by roughly 25 %–50 % of the mean depending on the study (Peltola et al., 2019; McNicol et al., 2023; Chen et al., 2024; Räsänen et al., 2021). Quantifying and reducing these uncertainties are essential for robust CH4 budgets.
Here, we address these methodological challenges in a study aiming at upscaling CH4 fluxes in a heterogeneous tundra landscape in the western Canadian Arctic by pairing >13 000 peak growing season (July) chamber measurements collected over five years with matched 1 and 10 m remote sensing predictors and training three machine-learning algorithms (RF, GBM, SVR) and one semi-parametric statistical model (GAM). Our overarching aim is to reduce uncertainties in peak-season (July) CH4 budgets for the 3.1 km2 heterogeneous tundra around the Trail Valley Creek Research Station. We address this aim through four specific questions:
-
Which remotely-sensed vegetation, moisture, and topographic characteristics best explain July CH4 fluxes across a wet-to-dry micro-site gradient?
-
Does replacing freely available 10 m data (Sentinel-2, ArcticDEM) with metre imagery from drones and airborne lidar lead to a detectable improvement in prediction accuracy and spatial detail?
-
How do the four modelling approaches differ in predicted net flux magnitudes and spatial patterns?
-
How do model choice, grid resolution, and their interaction shape the spatial patterns and uncertainty of our upscaled CH4 flux maps?
Optimising a data-driven upscaling approach based on these questions allows us to produce July CH4 flux maps with pixel-level uncertainty, improving peak-season emission estimates and guiding where additional measurements or higher-resolution imagery would most reduce prediction error.
2.1 Study site
The study site is the undulating tundra landscape of the Trail Valley Creek (TVC) Research Station, about 55 km north of the town of Inuvik, NT, in the western Canadian Arctic east of the Mackenzie River Delta (Fig. 1). TVC lies in the Southern Arctic ecozone and contains continuous permafrost, with thickness ranging from 100 to 150 m (Marsh et al., 2008). Our analyses focus on a ∼3.1 km2 section of this 57 km2 basin with elevations ranging from 41 to 102 m a.s.l. The 1991–2020 climate normals for Inuvik are a mean annual air temperature of −7 °C, mean annual precipitation of ∼250 mm, and a frost-free period (the interval with minimum air temperatures above 0 °C) of roughly 78 d (Environment and Climate Change Canada, 2024). The soils are classified as organic cryosols, with an upper peat horizon approximately 0.2–0.5 m thick overlying mineral silty-clay subsoil (Petrone et al., 2000). The vegetation at TVC is highly diverse, reflecting the microtopography and moisture gradients. Isolated patches of white and black spruce (Picea glauca, P. mariana) occur in valley bottoms and on slopes. Tall shrub tundra, dominated by green alder (Alnus alnobetula) and featuring scattered willows and dwarf birch, can be found on hill slopes and alongside streams. Riparian zones feature dense willow thickets reaching up to 2 m in height. Upland areas support dwarf shrub tundra with dense stands of dwarf birch (Betula glandulosa), Labrador tea (Ledum palustre) and mountain cranberry (Vaccinium vitis-idaea), interspersed with mosses and lichens. Flat, poorly drained areas are dominated by tussock-forming sedges (Eriophorum and Carex), alongside moss and scattered shrubs. Exposed uplands and polygon rims are covered by lichen mats and low dwarf shrubs. Mosses, especially Sphagnum and Polytrichum species, are prevalent in wetter microhabitats. Snow depth and winter soil temperatures are highest in the tall shrub and tussock zones and lowest in the lichen tundra (Grünberg et al., 2020; Marsh et al., 2010). Although TVC represents a single site, its strong microtopographic and vegetation heterogeneity reflects the wet-dry gradients typical of Arctic continuous-permafrost lowlands. Similar mosaics of sedge wetlands, dwarf-shrub uplands, and lichen tundra occur across large parts of the western Canadian Arctic and other low-relief tundra landscapes, suggesting that the scale effects we document are broadly transferable.
Figure 1Map of the study area showing the location of the area of interest (outlined with white polygon), with CH4 flux measurement locations marked with yellow circles. The inset map in the upper left corner highlights the region in which the Trail Valley Creek (TVC) research station is located (marked with a yellow triangle on the overview map and a white triangle on the detailed map). Background satellite imagery sources: © 2025 Maxar, provided by Esri, acquired on 12 July 2024. Area of interest aerial imagery: Rettelbach et al. (2024).
2.2 Data sources
This study combines field-based CH4 flux measurements with remotely sensed and meteorological data to build and evaluate spatially explicit models of CH4 exchange. The chamber flux data provide the response variable for model training, while the meteorological records include air temperature (AT), photosynthetically active radiation (PAR), and thawing degree days (TDD) as dynamic atmospheric drivers. Remotely sensing datasets supply spatial predictors describing vegetation, surface moisture, terrain structure, and landscape classification at two spatial resolutions (1 and 10 m). The resulting predictor stacks were then used to train and compare the four modelling approaches described in Sect. 2.3.
2.2.1 CH4 flux data
We used a combination of continuous and campaign-based CH4 flux measurements to capture spatial and temporal variability in CH4. The dataset includes previously published automated chamber observations made in 2019 and 2021 (Voigt et al., 2023a), and campaign-based manual chamber observations made in 2019 (Voigt et al., 2023b) and in 2022 to 2024 (Ivanova et al., 2025a). Manual chamber measurements from 2022 to 2024 were collected as part of this study. The main measurement protocols, chamber specifications, instrumentation, and flux calculation methods for each campaign are summarized in Table 1.
The complete dataset included 13 384 CH4 flux measurements collected between 1 July and 31 July (2019–2024) under both light and dark conditions. Our chamber measurements cover the spatial heterogeneity of the ∼3.1 km2 study area, ensuring representation of key CH4 controlling gradients.
Flux measurements were collected across the full range of microtopographic and vegetation conditions within the AOI. Observations were distributed across tussock tundra, dwarf shrubs, lichen-dominated uplands, and sedge wetlands at both spatial resolutions. The sampling distribution closely matched the mapped area fractions of these classes in the AOI (Fig. S1 in the Supplement), confirming robust ecological representativeness. Detailed percentages for both map area and flux sampling are provided in Fig. S1. Repeated measurements under different meteorological conditions also provide independent temporal variability for model training. On average, each microsite was measured 50–450 times depending on year and instrument type, resulting in a total of 13 384 individual chamber observations across 68 unique locations (microsites). Of these, 1093 fluxes were measured manually using closed chambers, while 12 291 were collected using an automated chamber system (Fig. 1). Manual chambers were installed directly on the ground surface without boardwalk contact. Automated chamber plots were accessed via short boardwalks located adjacent to chamber collars. These boardwalks did not overlap with chamber footprints and therefore did not influence the spectral signal of the exact measurement location. For each flux measurement, ancillary data recorded include coordinates, PAR (measured as photosynthetic photon flux density (PPFD; )), air temperature, land cover type, and time of day (when available).
2.2.2 Climatic data
AT data were obtained from the Trail Valley Creek meteorological station operated by Environment and Climate Change Canada – Meteorological Service of Canada (ECCC, 2024). The station is located within the study area at N, W, at an elevation of 85 m a.s.l. (Climate ID: 220N005; WMO ID: 71683; TC ID: XTV). The original data were recorded at hourly resolution and were downsampled to 3 h intervals to match the temporal resolution of the model predictions. PAR data were obtained from the NASA POWER dataset (NASA Langley Research Center, 2024) at a spatial resolution of 1 km. These data provided temporally dynamic inputs for model training and prediction.
2.2.3 Remotely sensed data
We assembled two separate but equivalent predictor stacks, one with a cell size of 1 m and one with 10 m. Both cover the same area of interest (AOI, Fig. 1), use the same map projection, and pass through the same preprocessing workflow (Ivanova et al., 2025b). The AOI was delineated along natural drainage lines on three sides and the Inuvik-Tuktoyaktuk Highway on the west. An image stack refers to a set of co-registered raster layers (multispectral indices and terrain derivatives) that share the same grid and extent. To facilitate comparison between datasets of different spatial resolutions, we summarized all predictors in Table A1.
It lists each variable with its data source, spatial resolution (1 m, 10 m, or constant), and whether it is static or dynamic. Variables derived from UAV imagery are used at 1 m resolution, while Sentinel-2 and ArcticDEM products are used at 10 m.
The 1 m stack is based on the RGB + NIR drone orthomosaic captured on 22 August 2018 by Rettelbach et al. (2024) and the 1 m LiDAR-derived digital terrain model (DTM) from Lange et al. (2021). From these layers, we derived the Normalised Difference Vegetation Index (NDVI; Rouse et al., 1974) and the Normalised Difference Water Index (NDWI; Gao, 1996; McFeeters, 1996) as proxies for biomass and surface moisture, respectively. Topographic derivatives including slope, aspect, the Topographic Position Index (TPI, 30 m window), and the Topographic Wetness Index (TWI) were calculated with Whitebox Tools (Lindsay, 2016). A 30 m neighbourhood was used for TPI, as this scale best captured local elevation contrasts typical of heterogeneous microtopography.
The 10 m stack contains the same set of variables but at coarser spatial resolution. It combines multispectral information from Sentinel-2 Level-2A scenes collected between 2015 and 2024 using the Copernicus Browser with topographic derivatives derived from the 2 m ArcticDEM, resampled to 10 m to match the Sentinel grid. Cloud-, shadow-, and snow-masked for AOI Sentinel-2 Level-2A scenes from July–August 2018 (n=6 cloud-free scenes) were composited using the mean to align with the 2018 drone campaign in Google Earth Engine (Gorelick et al., 2017). For the time-specific analysis, NDVI and NDWI were extracted from the nearest cloud-free scene within ±10 d of each chamber measurement, with no temporal averaging and only cloud-free pixels accepted. NDVI and NDWI were extracted from this composite, and the same set of terrain derivatives (slope, aspect, TPI, TWI) was computed for consistency. A complete overview of all predictor variables, including data descriptions, resolution, temporal variability, and references, is provided in Appendix Table A1. To link chamber measurements with remote sensing inputs, predictor values were extracted directly from the raster cell covering the chamber footprint, without spatial buffering. No spatial averaging or neighbourhood smoothing was applied to the pixel values at extraction. All chamber measurements were kept as individual records, even when multiple chambers or repeated measurements fell within the same 1 m or 10 m grid cell, to preserve sub-pixel heterogeneity in vegetation and soil conditions.
In a separate workflow, we used multispectral, terrain, and texture features to produce a site-specific landscape classification map at both 1 and 10 m resolution using a Random Forest approach (Breiman, 2001). The 1 m dataset was derived from RGB + NIR orthomosaic drone imagery collected by drone on 22 August 2018 (Rettelbach et al., 2024) and a co-registered 1 m LiDAR-based digital terrain model (Lange et al., 2021). The 10 m dataset was based on Sentinel-2 multispectral imagery and ArcticDEM-derived terrain parameters, representing the same area of interest. Six landscape classes were defined following Grünberg et al. (2020): Water, Lichens, Tussock, Dwarf Shrubs, Tall Shrubs + Trees, and Sedges.
Training and validation points (n=140 in total) were manually delineated from the drone orthomosaic. Eighty percent of the points were used for model training and 20 % for accuracy assessment. The same training polygons were used for both the 1 and 10 m classifications in terms of geographic location and class label, while predictor values were extracted from the respective remote-sensing datasets (drone + LiDAR for 1 m; Sentinel-2 + ArcticDEM for 10 m). This approach ensured that the two classifications were comparable while reflecting the characteristics of their respective input data. Because the spatial resolution and input data differ, the resulting landscape maps do not show identical boundaries or class proportions, but instead reflect the surface characteristics captured at each scale. Both maps contained the same six land-cover classes. However, for the 1 m model training, the Tall shrubs + trees class was merged with Dwarf shrubs because no chamber flux measurements overlapped that class. Thus, five classes were used for flux modeling at 1 m, whereas all six were retained at 10 m. Water pixels were masked prior to classification using a threshold of NDWI>0 and manually checked against the drone orthomosaic to ensure the exclusion of ponds and streams. All subsequent statistical analyses were restricted to terrestrial classes. A detailed description of the classification workflow, feature set, and accuracy assessment is provided in Appendix A1 and Table A2.
For broader application, the 10 m predictor stack is directly reproducible across the Arctic (Sentinel-2 Level-2A + ArcticDEM). In contrast, the 1 m stack depends on site-specific drone orthomosaics and LiDAR, which limits immediate circumpolar scaling but is valuable for local calibration and bias assessment.
In addition, we also explored the potential of two datasets that are particularly relevant for Arctic-scale applications. The Circumarctic Land cover Units (CALU) (Bartsch et al., 2024) provides a 10 m classification of vegetation physiognomy and soil moisture regimes across the circumpolar Arctic tundra. The data product is based on the fusion of Sentinel-1 and Sentinel-2 imagery and was calibrated using over 3500 field samples of soil and vegetation properties. One of the key strengths of CALU is that it captures spatial gradients in surface wetness while using a consistent classification scheme across all Arctic regions. This makes it possible to directly compare classes between distant sites across the Arctic, which is rarely achievable with site-specific classifications. 20 of 23 land cover units are found across the AOI, but only 5 of those were covered by CH4 measurements. The complete legend of CALU classes used in this study, including definitions, their occurrence within the AOI, and whether CH4 flux measurements are available for each class, is presented in Table A3. Additionally, we considered a radar interferometric (InSAR) dataset derived from Sentinel-1 data for 2018–2023 (Widhalm et al., 2025), which captures seasonal ground subsidence rates in thawing degree days domain associated with thaw table (the uppermost soil that freezes and thaws each year). The magnitude of the subsidence rates reflects soil moisture gradients (Widhalm et al., 2025).
Finally, we assessed the benefit of incorporating time-specific spectral indices (NDVI and NDWI) extracted from Sentinel-2 scenes close to each chamber measurement. We compared the effect of using these time-matched indices versus a seasonal composite (July–August 2018) to test whether short-term variability in vegetation and moisture status improves model skill. Although this approach relies on satellite scenes taken within a limited time window and may not align perfectly with the exact in situ measurement date, it still offers a more detailed representation of changes in surface conditions than seasonal averages.
All four additional predictors (CALU, InSAR, and temporally dynamic NDVI and NDWI) were tested in separate model runs to assess how much they improved predictive performance (R2, RMSE) when added to the main predictor set. These sensitivity analyses allowed us to evaluate their explanatory value without altering the resolution comparison, as each of these variables is available only at a single spatial scale.
To specifically evaluate how spatial resolution influences model performance and predicted CH4 fluxes, we designed the analysis so that the only factor differing between the two datasets was grid size (1 m vs. 10 m). All other parameters (predictors, preprocessing steps, algorithms, and training data) were kept identical. This approach allowed us to isolate the effect of scale from other potential sources of variation. As shown in Fig. 2, narrow, wet features such as polygonal trenches are captured at 1 m but blended at 10 m, which alters both the NDVI and the landscape classification. Since many high CH4 fluxes originate from these small wet zones, aggregation at a coarser resolution obscures their contribution. Some of the remaining model disagreement may also be due to the limited representation of extremely wet or complex terrain in the training data, which reduces the model's generalisability.
Figure 2Site-specific landscape classification (LC) and Normalized Difference Vegetation Index (NDVI) at two spatial resolutions: 1 m (panels A and C) and 10 m (panels B and D). Panels (A) and (B) show LC maps, while panels (C) and (D) show NDVI. Each panel includes a black-framed inset highlighting a representative polygonal mire. Narrow, waterlogged microtopographic features such as wet trenches remain distinct at 1 m resolution but blend into mixed pixels at 10 m. Background imagery: © Google Earth (Maxar) 2025.
2.3 Statistical analyses
The statistical analysis was structured into five sequential stages: (1) data preparation, (2) model training and evaluation, (3) spatial prediction, (4) temporal aggregation and interpretation, and (5) variable importance analysis (Fig. 3). All steps were applied identically to the 1 and 10 m datasets to enable direct comparison of model behaviour and prediction outcomes across spatial resolutions. The analysis was implemented in R 4.3.2 (R Core Team, 2024).
Figure 3Workflow for modelling and upscaling CH4 fluxes in the designated study area. The analysis was performed separately for 1 and 10 m spatial resolutions and comprised five primary stages: (1) Predictor preparation. (2) The training and tuning of models. (3) Spatial prediction. (4) Temporal aggregation and evaluation. (5) Variable importance.
2.3.1 Data preparation
The first step consisted in the preparation of the predictor datasets to explain spatio-temporal variability in CH4 fluxes. In total ten predictors were used: AT, PAR, TDD, NDVI, NDWI, slope, aspect, TPI, TWI, and a six-class landscape classification (see Appendix Table A1 for details). The three meteorological variables (AT, PAR, and TDD) were treated as spatially uniform across the ∼3 km2 study area, as it is covered by a single meteorological station. Their values varied only temporally, while all other predictors were spatially distributed and static during each model run. To assess potential multicollinearity, pairwise correlations among predictors were calculated separately for the 1 and 10 m datasets using Spearman's rank correlation.
2.3.2 Model training and evaluation
Second, we evaluated four modelling families for their ability to predict CH4 fluxes: random forests (RF), gradient-boosting machines (GBM), generalized additive models (GAM), and support-vector regression (SVR). RF is a ML algorithm that builds multiple decision trees on bootstrapped data. The mean of their outputs is then calculated. The averaging reduces noise and the method reports easy-to-read variable-importance scores (Breiman, 2001; Prasad et al., 2006). GBM also uses trees but adds them one after another. Each new tree learns from the errors of the current ensemble, which often reduces bias but requires careful tuning to avoid over-fitting (Friedman, 2001; Elith et al., 2008). Similar RF, GBM handles mixed predictor types, outliers, missing values, and nonlinear relationships without preprocessing (Elith et al., 2008). GAM is a statistical technique that fits a smooth curve to each predictor and then combines these curves to create a composite curve. The curves demonstrate how CH4 changes with each driver and provide reliable predictions beyond the training range (Hastie and Tibshirani, 1986; Wood, 2017). SVR is a ML algorithm that fits a flexible line or surface that best follows the data while allowing small errors within a defined range. It uses a mathematical function called a kernel to handle weak non-linear patterns, and is particularly effective when the dataset is small or the relationships are not strongly linear (Cortes and Vapnik, 1995; Smola and Schölkopf, 2004). Each model was implemented using the caret package in R (Kuhn, 2008) for hyperparameter tuning via stratified 10-fold cross-validation on individual measurements. We used the R-packages ranger for RF (Wright and Ziegler, 2017), gbm for GBM (Greenwell et al., 2022), kernlab for SVR (Karatzoglou et al., 2004), and mgcv for GAM (Wood, 2017). Model performance was assessed using five-fold cross-validation based on out-of-fold predictions, without grouping by site or year of measurement. Three complementary metrics were used: the coefficient of determination (R2), root mean square error (RMSE), and mean absolute error (MAE). R2 describes how well model predictions capture the variability of observed CH4 fluxes, RMSE emphasises large deviations, and MAE quantifies the average absolute difference between observed and predicted values. R2 and RMSE were the main criteria for evaluating predictive performance and selecting the best model configurations, while MAE was reported as an additional indicator of absolute error, given the low mean CH4 fluxes. All metrics were computed from cross-validated predictions using the yardstick package (Kuhn et al., 2025) to ensure consistent implementation across all model types.
We tuned the key parameters of RF, GBM, SVR, and GAM using five-fold cross-validation with RMSE as the evaluation metric (Sect. S1 in the Supplement). For SVR, several kernel functions were tested and the radial basis function kernel provided the best performance. GAMs were fitted using thin-plate regression splines for numeric predictors and penalization of uninformative smooth terms. Multicollinearity among predictors was assessed using Variance Inflation Factors (VIF/GVIF) and GAM concurvity diagnostics, and no predictors exceeded commonly used concern thresholds (Sect. S2 and Table S2 in the Supplement). Therefore, the full predictor set was retained at both spatial resolutions.
In addition to the main predictor set, models were also trained with additional variables available only at 10 m spatial resolution, including CALU land cover, InSAR-derived surface subsidence, and temporally dynamic NDVI and NDWI extracted for dates closest to each CH4 flux measurement. These variables were tested in separate model runs to evaluate their explanatory power but were not included in the main inter-resolution comparison, as they were unavailable at 1 m resolution and would otherwise bias scale-related analyses.
To disentangle the effects of spatial resolution and data source, we additionally aggregated the 1 m input dataset to 10 m resolution using the same workflow. Continuous predictors were averaged within each 10 m grid cell, and categorical variables (LC) were assigned based on the majority class. These aggregated data were then used to train and evaluate all models using the same hyperparameter settings and cross-validation strategy as for the main analysis.
2.3.3 Spatial prediction
Third, two best-performing models (RF and GBM) were applied to a complete spatial predictor stack, a multi-layer raster covering the entire study area without gaps. The stack included two types of layers. Static layers, such as NDVI, NDWI, slope, aspect, TPI 30 m, TWI, and land cover, remained unchanged throughout July. In contrast, the meteorological layers (AT, TDD, PAR) were spatially uniform but temporally dynamic. A temporal loop progressed from 1 July at 00:00 to 31 July at 23:59 LT in three-hour steps. At each time step, the corresponding values of AT, PAR, and TDD were inserted into their respective layers in the stack. The model then generated an instantaneous CH4 flux raster in using the terra package (Hijmans, 2023). This routine resulted in 248 flux rasters for the whole month of July, produced per year and spatial resolution. To ensure consistency, areas with missing input values (e.g., water bodies) were excluded from predictions. In total, 5952 CH4 flux rasters were generated (248 time steps × 2 models × 2 resolutions × 6 years).
2.3.4 Temporal aggregation and interpretation
Fourth, the predicted raster time series was aggregated using arithmetic operations in the terra package. Averaging over all time steps resulted in July mean flux maps, while summing and multiplying by three converted instantaneous rates into cumulative monthly fluxes. Six-year means and interannual variation (2019 to 2024) were calculated. To assess spatial mismatches related to scale, the 1 m predictions were aggregated to 10 m resolution, and the aggregated 1 m maps were subtracted from the 10 m maps pixel-by-pixel to compute spatial differences. In addition, differences between the two tree-based model families, RF and GBM, were mapped to quantify structural uncertainty. To interpret these mismatches, Pearson correlations were calculated between the difference maps (resolution- or model-based) and individual predictor layers.
2.3.5 Variable importance analysis
Fifth, we conducted a separate variable importance analysis to identify the most influential predictors in each model. Variable importance scores were extracted from the cross-validated, hyperparameter-optimised RF and GBM models using permutation importance (ranger package; Wright and Ziegler, 2017) and relative influence (gbm package; Greenwell et al., 2022), respectively. These scores were used to assess the consistency of predictor relevance across models and spatial resolutions. Variable importance scores for each model were normalized by dividing by the sum of all importance values within that model, resulting in relative importance values ranging from 0 to 1.
3.1 Correlation between observed CH4 fluxes and single remote sensing parameters
This exploratory analysis examined how observed July CH4 fluxes correlate with individual environmental variables at two spatial scales (1 and 10 m) to identify significant controls of CH4 flux and how spatial resolution affects their predictive power. Although the observed associations were generally weak, several clear patterns emerged across landscape classes and environmental gradients.
Seasonal subsidence showed the strongest positive correlation, underscoring the explanatory power of this parameter for moisture availability and related enhancements in CH4 fluxes (Table 2). This is in line with observations linking InSAR-derived subsidence to elevated CH4 fluxes in Arctic ecosystems (Sjögersten et al., 2023). Several moisture-related indices (NDWI, TWI, TPI) show higher correlations at 10 m than at 1 m, because 10 m aggregation smooths microtopographic noise while 1 m retains over-detailed, heterogeneous signals. This indicates that coarser resolution better captures landscape-scale hydrological gradients. This finding is supported by Ruhoff et al. (2011), who demonstrated that TWI values stabilise and become more spatially coherent at coarser resolutions, and by Riihimäki et al. (2021), who showed that TWI's ability to predict soil moisture improves when derived from coarser DEMs (e.g., 10–30 m). Conversely, the correlation with aspect weakened at 10 m, compared to 1 m resolution, likely due to the loss of microtopographic detail when pixels are aggregated, as shown previously (Schoorl et al., 2000; Vaze et al., 2010).
Table 2Spearman rank-correlation coefficients (ρ) between July CH4 flux and environmental predictors at 1 and 10 m spatial resolution. Positive values indicate that higher predictor values coincide with higher CH4 emissions, negative values indicate the opposite. All correlations were computed using the full dataset. For vegetation and surface wetness predictors (NDVI, NDWI), both static (July 2018) and temporally matched values (Sentinel-2 scenes within ±10 d of each chamber measurement) are shown. The column “10 m, temporal” reflects those temporally matched predictors. For predictors derived from static landscape characteristics (e.g., TWI, Slope, TPI, Subsidence), 10 m and 10 m-temporal columns are merged as they do not vary in time. Significance levels: p<0.01, p<0.001.
Temporally matched NDVI and NDWI show weaker correlation with CH4 fluxes compared to static indices. The reason may be the limited effective temporal resolution of Sentinel-2: although the constellation has a nominal 5 d revisit, persistent Arctic cloud cover often stretches the cloud-free gap well beyond 10 d (Runge and Grosse, 2019), producing a temporal mismatch with chamber measurements.
We examined CH4 flux variation across the landscape classes and CALU units (Fig. 4a and Table B1). For example, sedge-dominated landscape classes had the highest mean CH4 flux (0.87–0.94 ). Elevated fluxes in these systems are likely driven by plant transport through aerenchymatous tissue during which CH4 produced at depth bypasses the oxic zones, and enhanced CH4 production resulting from high plant productivity and increased substrate availability via root exudates (Olefeldt et al., 2013; Kwon et al., 2017). Tussock areas displayed the lowest flux values, with on average minor uptake of CH4 (−0.02 ). These patterns were consistent with observations by Voigt et al. (2023c).
Figure 4Comparison of observed CH4 fluxes across site-specific landscape classification at two spatial resolutions and CALU (Circumarctic Land cover Units) classes. The numbers above the boxplots indicate the total number of unique measurements followed by the number of Sites of measurements in parentheses Panel (A): CH4 fluxes across five site-specific landscape classes with existing CH4 flux measurements. Measurements were aggregated separately for 1 and 10 m spatial resolution. Panel (B): CH4 fluxes grouped by CALU classes. CH4 fluxes differed significantly between most CALU classes (p<0.001, pairwise Wilcoxon test), except classes 10 and 3 (wetlands), where no significant difference was observed (p=0.054). Boxplots show the distribution of fluxes for each group. horizontal lines represent medians, boxes indicate the interquartile range, and whiskers extend to 1.5× the IQR. The red dashed lines indicate zero fluxes.
All pairwise differences between CH4 flux distributions for the 1 and 10 m products were statistically significant (Wilcoxon rank-sum test, p<0.0001). However, this result should be interpreted with caution due to the large sample sizes (even subtle differences can appear significant). In some cases, the differences in median fluxes were small (e.g., sedges), while in others, the resolution shift results in more substantial changes (e.g., dwarf shrubs: median increased from 0.05 to 0.19 ). In some cases, the flux sign even changed, for instance, lichen-dominated areas shift from weak uptake to weak emission. These shifts likely reflect the effects of aggregation, where coarser resolution mixes surface types or blends microsites with different flux patterns.
To assess how well a pan-Arctic land-cover scheme captures CH4 flux variation, we aligned our measurements with the CALU map (Fig. 4b). CALU vegetation classes differed significantly in CH4 flux, except between moist moss tundra, abundant moss, prostrate to low shrubs (class 10) and permanent wetlands (class 3) (Fig. 4b, Table B2). Within CALU classes, average CH4 fluxes ranged from slight uptake in wetland class (−0.09 ) to moderate emissions in moist tundra, abundant moss, dwarf and low shrubs (CALU 11) (0.46 ). Unexpectedly, the permanent wetland class showed CH4 uptake. This category only included one area, where dry lichen areas dominate most of the area. Moreover, the 10 m resolution of CALU likely leads to mixed pixels, where wetter spots were averaged with drier surroundings, reducing the apparent CH4 emissions. In contrast, many wet areas at our site were too small to be resolved as wetlands in CALU and were instead classified into other categories.
Overlay analysis between our site-specific landscape classification and the CALU (Fig. 5) showed that each of our landscape classes included 6–11 CALU classes (with coverage >1 %), typically dominated by moist tundra, abundant moss, dwarf and low shrubs (CALU 11). This reflects differences in classification approaches: CALU aimed at representing vegetation diversity and wetness gradients across the entire Arctic (Bartsch et al., 2024), whereas the site-specific landscape classification was explicitly built for CH4 flux modelling and therefore integrates fine-scale microtopography, surface-moisture patterns, and local vegetation. A similar degree of cross-class mixing was observed for the 1 m classification (Fig. S2 in the Supplement), indicating that these differences are primarily driven by conceptual distinctions between classification schemes rather than spatial resolution.
Figure 5Pixel-wise cross-comparison between two 10 m land-cover products for the TVC study area. LC 10 m (this study): a site-specific Sentinel-2 + ArcticDEM classification built (see Appendix A1). CALU (Circumarctic Land cover Units): published pan-Arctic landcover units (full legend in Table A3). Each tile shows the fraction of pixels of a given CALU class that fall into that LC 10 m class; row totals, therefore, equal 100 %. Values ≥0.5 % are printed inside the tiles. Tiles that are coloured but unlabelled occur (<0.5 %), while blank tiles indicate class pairs that do not intersect within the AOI.
While both CALU and our site-specific classifications captured broad vegetation and wetness gradients, tall-shrub areas were clearly underrepresented in the flux dataset (<1.5 % of points vs. ∼20 %–30 % of the mapped area; Fig. S1). These zones often coincide with wetter micro-depressions and drainage areas (Grünberg et al., 2020), suggesting that the wettest conditions are not fully captured in the current sampling. Increasing coverage of these habitats would improve the robustness of flux comparisons and reduce residual variability in future models.
However, even within each CALU or LC class, flux variance remained high, underlining that vegetation type alone cannot capture the full pattern of CH4 fluxes without considering microtopography and moisture indices. Similar to the pan-Arctic synthesis by Olefeldt et al. (2013), our findings support the view that the effects of key environmental parameters on CH4 flux should be considered jointly rather than independently. Additionally, soil temperature and soil moisture, key controls of CH4 production and oxidation (Wille et al., 2008; Mastepanov et al., 2013), were not included as predictors in the present analysis due to limited spatial coverage but are planned for integration in future model development.
3.2 Evaluation of Model Accuracy
Our cross-validated modelling framework achieved predictive performance (R2 from 0.53 to 0.87, Table 2) comparable to recent CH4 upscaling studies in the Arctic-boreal region, including both chamber- (e.g., Virkkala et al., 2024; Räsänen et al., 2021) and eddy covariance-based studies (e.g., McNicol et al., 2023; Chen et al., 2024; Peltola et al., 2019; Tramontana et al., 2016).
Model evaluation at 1 m resolution revealed that SVR achieved the highest R2 of 0.87, indicating strong predictive power. However, this was accompanied by substantial errors (RMSE=0.078, MAE=0.019 of mean CH4 flux), suggesting high sensitivity to skewed distributions and outliers, a known limitation of SVR when modelling non-Gaussian ecological data (Smola and Schölkopf, 2004). In contrast, RF showed both high accuracy and robustness, combining high R2 with the lowest errors among tested algorithms. This confirms the algorithm's strength in capturing nonlinear interactions while being less sensitive to noise and overfitting, as highlighted in ecological applications (Belgiu and Drăguţ, 2016; Räsänen et al., 2021; Cutler et al., 2007). GBM also showed strong performance, with low errors and consistent R2 values, reflecting its capability to efficiently leverage key predictors (Kämäräinen et al., 2023; Natekin and Knoll, 2013). GAM, in contrast, had the weakest performance among all models at 1 m resolution, with the lowest R2 (0.62), highest RMSE (0.077), and highest MAE (0.025). This likely reflects the model's limited ability to capture sharp spatial variability in CH4 fluxes when localized structure is strong. GAMs rely on detecting smooth nonlinear effects, but when predictors become noisy or spatially complex, the fitted splines lack the detail needed for accurate prediction (Wood, 2017).
At 10 m resolution, RF not only achieved the lowest mean absolute and root-mean-square errors, but its R2 and error metrics also changed the least when we varied resolution or added temporally dynamic predictors, indicating the most consistent performance in our experiments (Table 3).
Table 3Performance of four models at 1 and 10 m spatial resolutions. Metrics include R2 (coefficient of determination), MAE (mean absolute error), and RMSE (root mean square error). Bold values represent the best score for each metric within each resolution. The “10 m” scenario includes models with temporally stable normalized difference vegetation index (NDVI) and normalized difference water index (NDWI), while “10 m_temporal” refers to models using temporally dynamic indices, matched to the closest available date of in-field CH4 flux measurements.
GBM showed similarly low errors but a slightly lower R2 (0.57). SVR achieved the highest R2 (0.68), but this was offset by much higher prediction errors, indicating poor generalisation despite high apparent fit. GAM performed worst, with the lowest R2 (0.53) and the highest RMSE (0.13).
The decrease in SVR and GAM performance at 10 m resolution likely reflects the loss of fine-scale spatial detail when data are aggregated to coarser grids. At coarser resolution, each pixel represents a mixture of surface types and microtopographic conditions, which reduces local variability in the predictors and weakens the model's ability to capture small-scale relationships with CH4 fluxes. SVR models, which depend on detailed nonlinear patterns, become less stable when this localised structure is smoothed out. Similarly, GAM performance declines when predictors become more homogeneous, since spline functions can no longer represent fine spatial gradients. In contrast, RF and GBM were more robust to this loss of detail because their ensemble structure allows them to generalise better under coarser input conditions. Based on these results, we selected RF and GBM for further analysis as the most reliable combination of accuracy and cross-resolution stability. When cross-validation was grouped by site or year, R2 values dropped to ∼0.1–0.2 and RMSE increased compared to the standard stratified 5-fold CV (Sect. S5 and Fig. S7 in the Supplement). This decline might reflect the heterogeneous sampling structure at Trail Valley Creek: because environmental conditions are highly heterogeneous, many combinations of vegetation, moisture and microtopography are represented by only one or a few sites. Leaving such a site out removes these conditions from the training data and forces the models to extrapolate, rather than revealing instability of the general models.
Including the temporal variability of the NDVI and NDWI values led to an average increase in R2 of approximately 0.11 for the GAM, GBM and RF models at a resolution of 10 m (Table 3). SVR was the exception, showing a slight decrease in R2 with no reduction in errors. For the GAM model, this increase in explanatory power was accompanied by lower RMSE and MAE values, indicating more accurate and robust performance. In contrast, both RF and GBM showed a higher R2, but also exhibited increased absolute errors, which may indicate overfitting to temporally dynamic predictors. This likely reflects the tendency of ensemble models to capture noise in dynamic inputs when training data is limited (Barry and Elith, 2006; Chollet Ramampiandra et al., 2023; Reichstein et al., 2019). Similar behaviour has been observed in other ecosystem carbon flux modelling studies, for example, in neural network models that overfit to lagged meteorological inputs (Papale and Valentini, 2003). The GAM model likely benefited from its ability to represent gradual ecological shifts through penalised smoothers, which reduces sensitivity to noise (Berbesi and Pritchard, 2023). The limited improvement in performance for SVR may be due to its sensitivity to data structure and lower flexibility when modelling smooth temporal trends in ecological datasets (Smola and Schölkopf, 2004).
3.3 Impact of model and resolution selection on CH4 flux predictions
Because only meteorological variables changed over time, the interannual variation in the predicted maps arises from interactions between the static landscape predictors and varying atmospheric conditions, rather than from spatial changes in surface characteristics. Different data-driven models can produce distinct spatial predictions even when trained on the same input data. Although well documented, most machine-learning algorithms are not easily interpretable, whereas statistical approaches such as GAMs provide more transparent relationships between predictors and fluxes. We therefore compare their spatial predictions and simple diagnostics to assess reliability and guide model choice for CH4 upscaling. Our comparison of upscaled CH4 flux fields produced by the RF and GBM models showed that algorithm choice remained an important influence on spatial variability in predicted CH4 fluxes (Fig. 6). The GBM model generated higher local contrast and more pronounced extremes, especially at 1 m resolution, with pronounced peaks in wet, topographically complex areas, reflecting its greater sensitivity to extreme values and local predictor variation. RF produced smoother, noise-resistant distributions, aligning with its known strength in generalising across heterogeneous landscapes (Räsänen et al., 2021; Cutler et al., 2007). While RF remains a robust and widely applied method for spatial upscaling (Cutler et al., 2007), our findings demonstrate that algorithm choice still affects spatial outcomes, with each model emphasising different aspects of landscape variability. This highlights the value of including multiple model types, not only for optimising performance, but also for quantifying model-driven uncertainty in CH4 flux upscaling.
Figure 6Predicted mean monthly CH4 fluxes (mg CH4 m−2 per month) for July (averaged over 2019–2024), generated by two machine-learning models: Random Forest (RF, panels A and B) and Gradient Boosting Machine (GBM, panels C and D). The panels (A) and (C) shows predictions at 1 m resolution, and panels (B) and (D) right column at 10 m resolution. Each panel contains a black-framed zoom window, which enlarges a representative section of the polygonal mire. Visual comparison of the two insets illustrates how the fine wet-to-dry microtopography resolved at 1 m is smoothed when aggregated to 10 m. Background imagery: © Google Earth (Maxar) 2025.
Interestingly, although GBM exhibited more spatial flux variability, the mean fluxes predicted by GBM were consistently lower than those of RF. At 1 m, GBM averages 98.7 mg CH4 m−2 per month, whereas RF averages 518.6 mg; at 10 m the values rise to 608.8 and 683.4 mg, respectively (Fig. 7a). This more than fivefold difference at 1 m resolution underscores the substantial structural uncertainty that arises purely from algorithm selection, even when all predictors and training data are identical. At 10 m resolution, this discrepancy largely disappears because spatial aggregation smooths microtopographic extremes and reduces the influence of local outliers, making both models converge toward similar mean fluxes. Net-sink pixels accounted for 10.0 % (RF) and 9.5 % (GBM) of the 1 m domain, but only 4.9 % (RF) and 4.4 % (GBM) at 10 m. CH4 sink areas were spatially limited and highly sensitive to scale. Pixels acting as net CH4 sinks (i.e. with negative monthly fluxes) were located on well-drained polygon rims and other lichen-dominated uplands where oxygen remained available throughout the summer. This allowed highly efficient methanotrophs to oxidise CH4 faster than it was produced (Biasi et al., 2008). Resolving these units at a scale of 1 m showed that they covered around 10 % of the scene and significantly reduced the landscape-mean flux. However, coarsening to 10 m mixed the aerobic patches with adjacent wet hollows, reducing their mapped extent to approximately 4.5 % and erasing many uptake pixels. A comparable effect has been observed when chamber data were averaged across broader physiographic units, shifting site-level balances from weak sinks to slight sources (Zona et al., 2016). This pronounced scale effect is consistent with pan-Arctic syntheses, indicating that, although they cover only a small fraction of the surface, aerated uplands can offset a significant proportion of wetland emissions, yet they are often obscured in coarse products and regional budgets (Olefeldt et al., 2013; Kuhn et al., 2021). Our findings support recent assessments that retaining metre-scale information on microtopography, vegetation, and soil moisture is essential for capturing sink behaviour and ultimately for refining carbon budgets in permafrost regions, which currently indicate a small terrestrial CO2 sink and a wetland CH4 source (Treat et al., 2024).
Figure 7CH4 fluxes and their interannual variability across the study area. (A) Pixel-wise monthly mean CH4 flux predicted by RF and GBM models for July 2019–2024. Each point in the boxplots represents the mean July CH4 flux of a single pixel, illustrating the spatial distribution of fluxes across the study area. (B) Pixel-wise interannual standard deviation of predicted CH4 fluxes for July months from 2019 to 2024, calculated separately for each model and resolution. The box spans the interquartile range (IQR, 25th–75th percentile), the horizontal line indicates the median, whiskers extend to 1.5× IQR, and points beyond the whiskers represent potential outliers.
Both models predict a comparable overall flux range, but the main disagreement occurs in the intensity and spatial extent of intermediate values. GBM also tends to produce stronger negative extremes, indicating higher sensitivity to localized sink conditions. The residual model disagreement is driven less by the number of sink pixels than by their intensity. Minimum fluxes predicted by GBM were consistently more negative than those from RF, with extremes of −147 mg CH4 m−2 per month (1 m) and −330 mg CH4 m−2 per month (10 m), compared to −45 and −33 mg CH4 m−2 per month in RF, respectively. This suggests that GBM may emphasise CH4 sink strength more than RF, even though the spatial extent of sinks is similar across models.
At 1 m, GBM often responds more strongly to localized environmental extremes. These include areas with much higher soil moisture, surface temperature spikes, or abrupt changes in microtopography that may only occur at the metre scale. This is due to its sequential learning process, which can emphasize subtle but high-impact predictors. RF, in contrast, smooths local extremes and yields more conservative area means. Because GBM-1 m produced a markedly lower AOI mean than RF, we treat this behaviour as a potential systematic bias toward stronger sinks and hotspots. We therefore use RF-1 m as the reference budget estimate and retain GBM-1 m as a sensitivity case to bracket structural uncertainty. At 10 m, aggregation reduces fine-scale contrasts and the RF-GBM predictions converge. Pixel-wise standard deviations (Fig. 7b) reveal that RF is temporally more stable, while GBM is more sensitive to inter-annual variation, particularly in wet or geomorphically complex areas.
Spatial differences between models and resolutions were calculated as pixel-wise subtraction (RF−GBM and 1 m−10 m), ensuring consistent direction of comparison across all analyses. Additional analysis of spatial differences between models (Fig. B2) showed that several predictors were moderately to strongly correlated with the differences between RF and GBM predictions. At 1 m resolution, the strongest correlation was observed for NDWI (−0.53), indicating that model disagreement was most pronounced in wetter areas. NDVI (0.49) and landscape type (0.41) also showed strong positive correlations with model differences, suggesting greater divergence in vegetated zones and across cover transitions. For the 10 m products, aspect (0.43) became the only predictor for model differences above 0.4, implying that model choice matters most on directionally exposed terrain once fine micro-relief is lost. Across both resolutions, NDWI exhibited consistent negative correlations, implying that divergences are magnified in wetter and concave landforms that tend to accumulate water or thaw differently. These findings are in line with Tagesson et al. (2013), who showed that adding satellite-derived NDWI improves CH4 flux modelling by capturing moisture-driven variability.
The full spatial difference maps for each model and resolution are provided in the supplementary Zotero dataset (Ivanova et al., 2025a) to enable direct comparison with environmental layers and visual exploration of model- and scale-driven patterns.
Part of the disagreement between the two models, particularly at 10 m resolution, can be attributed to limited training data in certain landscape types such as tall-shrub and complex wetland zones, which were sampled less intensively due to access constraints. These classes show higher prediction uncertainty and stronger divergence between RF and GBM, as GBM amplifies local extremes while RF tends to smooth them.
Model performance based on the aggregated 1 m data (10 m from 1 m) was nearly identical to that of the original 10 m models, with only small differences across algorithms. GBM and SVR showed slightly improved accuracy after aggregation, while RF performed marginally worse and GAM remained nearly unchanged. These results indicate that the performance differences between the 1 and 10 m models reported above are mainly attributable to spatial resolution rather than to differences between sensor-based and aggregated input data (see Sect. S4, Figs. S5 and S6 in the Supplement).
3.4 Parameters importance in CH4 flux prediction
Analysis of the relative importance of the predictors revealed fundamental differences between the RF and GBM models, and how these differences change when moving from 1 m to 10 m resolution (Table B3). Significance was assessed using the permutation method for each model and scale combination.
At the 1 m resolution, RF distributed importance fairly evenly across the topographic parameters. TPI (∼22 %), Aspect (∼21 %), and Slope (∼18 %) showed comparably high influence, followed by landscape class (∼16 %). All other predictors contributed less than 10 %, and meteorological drivers collectively stayed below that level. This topography-centred profile is consistent with the moderate intercorrelation among terrain metrics such as TPI, Slope, and TWI (Fig. B1), which share a common DEM origin and partly capture overlapping relief and moisture patterns. Such behaviour aligns with the known tendency of random forests to distribute importance across correlated terrain drivers due to their random feature-selection mechanism (Räsänen et al., 2021; Cutler et al., 2007).
GBM showed a different pattern: again, no single parameter dominates, but five drivers spread across different input categories (Slope, Landscape class, AT, TDD and NDWI) each explained about 14 %–16 % of the total, and none exceeds 20 %. This flatter profile is based on the boosting process. Each new tree fixes the errors left by the previous one, so different predictors take turns improving the model (Friedman, 2001). When several drivers reduce error by a similar amount, the model splits importance among them (Kämäräinen et al., 2023).
Figure 8Mean relative importance (± SD) of environmental predictors for CH4 fluxes across two machine-learning models – Random Forest (RF) and Gradient Boosting Machine (GBM) – evaluated at 1 and 10 m spatial resolutions. Importance was estimated by bootstrap resampling (n=100) and is expressed as a percentage of total importance within each model. Predictors are grouped into four categories: Meteorological drivers (thawing degree days, air temperature, photosynthetically active radiation), Vegetation/Terrain (Normalized Difference Vegetation Index, landscape class), Topography (Topographic Position Index, aspect, slope), and Hydrology/Moisture (Topographic Wetness Index, Normalized Difference Water Index). Abbreviations: TDD – thawing degree days; PAR – photosynthetically active radiation; NDVI – Normalized Difference Vegetation Index; TPI – Topographic Position Index; TWI – Topographic Wetness Index; NDWI – Normalized Difference Water Index.
When the resolution was coarsened to 10 m, pixel aggregation smoothed micro-relief, and both algorithms shifted toward moisture-integrating drivers as primary explanatory influences. In RF, NDWI (∼25 %) and Landscape class (∼25 %) emerge as joint leaders, NDVI rises to ∼12 %, and all topographic parameters drop below 8 %. In GBM, the re-organisation is even stronger: the moisture indicators NDWI (∼25 %) and TWI (∼19 %) together explained almost half of the total importance, while landscape class follows at ∼11 % and Slope and Aspect fall below 7 %. This pattern agrees with field evidence that moisture proxies dominate CH4-flux prediction at coarser resolution, where fine-scale topographic details are lost (Tagesson et al., 2013; Gachibu Wangari et al., 2023). NDWI and TWI both integrate water content over several pixels, making them potential surrogates for local water-table height and the extent of anoxic microsites that drive methanogenesis. NDWI is also sensitive to vegetation water and phenology, allowing it to track water-table depth in peatlands (Meingast et al., 2014; Kalacska et al., 2018). TWI, which maps landscape-scale water accumulation and thus redox and gas-diffusion controls, aligns with syntheses showing that water-table fluctuations set the size of anoxic zones and largely govern CH4 production and emission (Kaiser et al., 2018; Cui et al., 2024). Landscape class and NDVI contributed complementary information on vegetation type and biomass, which modulate both substrate supply and methane oxidation. In practical terms, upscaling to 10 m can still capture landscape-scale CH4 patterns, but only if robust moisture indices such as NDWI and TWI were included; purely geometric terrain drivers lose most of their explanatory power once microtopography is averaged out.
The potential influence of CALU, subsidence, and temporally matched NDVI/NDWI indices was further examined in a separate 10 m model experiment (Sect. S3, Figs. S3 and S4 in the Supplement).
This study aimed to identify the key environmental and spectral drivers of CH4 fluxes in heterogeneous Arctic tundra, evaluate how both model performance and predictor importance change with spatial resolution and across different data-driven models, and assess the implications for upscaling CH4 fluxes.
Subsidence, derived from InSAR, showed the highest correlation with observed CH4 fluxes of all the tested predictors, emphasising its value as a spatial proxy for soil moisture. It should therefore be included directly in CH4 upscaling workflows, particularly in permafrost landscapes where moisture conditions were key drivers of fluxes.
Although different models varied significantly in their estimates, RF and GBM provided the most consistent and reliable upscaling results. At the highest spatial resolution, the two algorithms produced notably different flux magnitudes, reflecting structural uncertainty linked to how each model handles local extremes. However, their robustness should be verified through targeted sensitivity analyses, including tests with modified predictor sets, varied hyperparameters, and bootstrapped subsampling, to assess the stability of variable importance and model performance. Significance of model predictors was found to be strongly scale-dependent. At a resolution of 1 m, the models derived most of their explanatory power from microtopographic metrics, which capture the detailed elevation contrasts that distinguish between hummocks and hollows, as well as localising CH4 hotspots. However, after aggregation to 10 m, these relief cues were diluted, causing a change in ranking: moisture proxies NDWI and TWI became the principal drivers, together accounting for almost half of the explained variance. This transition from terrain- to moisture-controlled importance highlights the fact that fine-scale mapping requires detailed topographic data, whereas regional upscaling must prioritize robust hydrological indices. For AOI budgets we report RF at 1 m resolution as the reference and use GBM at 1 m resolution as a sensitivity bound due to its amplification of metre-scale extremes.
Spatial resolution emerged as the important factor determining the predictive power of data-driven upscaled CH4 flux patterns, exerting a stronger influence than model choice. At a resolution of 1 m, fine-scale heterogeneity was captured at a high degree of detail, making it possible for models to distinguish between local sources and sinks of CH4. At 10 m, micro features merge into mixed pixels, boosting mean fluxes and variability. This resulted in fine-scale sinks and hotspots disappearing, and in some cases, fluxes being misclassified as a source of CH4 in dry areas. Consequently, 10 m models produced higher mean fluxes and broader flux distributions. However, some of these high values may be due to mixed-pixel artefacts rather than true local emissions.
Our study findings imply that resolution is not simply a case of “the higher, the better”, and similarly, more complex ML methods may not necessarily yield better predictions. Although 1 m models captured fine-scale heterogeneity, 10 m models with temporally dynamic predictors improve explanatory power but increase prediction errors, likely due to overfitting to short-term fluctuations. This suggests that, in some cases, 10 m resolution models can outperform 1 m resolution ones, particularly when enhanced with well-timed spectral information, though caution is needed to balance fine-scale accuracy with broader spatial generalisability.
Although this study focuses on a single Arctic wetland complex at Trail Valley Creek, the workflow and findings are broadly transferable to other tundra environments. Ten-metre inputs from Sentinel-2 and ArcticDEM reproduce dominant moisture-control patterns typical of Arctic lowlands, while metre-scale (drone + LiDAR) layers reveal fine sink–source contrasts but require intensive data collection. Scale effects may vary across Arctic landscapes depending on topographic and vegetation complexity, and could differ in more homogeneous or highly dissected terrain. Because the models remain correlative and July-specific, extending the workflow across seasons and additional sites would strengthen generality and test the stability of the observed scale effects. Future work should expand sampling into underrepresented landscape and vegetation classes, high-emission zones, methane uptake regions, and winter fluxes, and incorporate temporally dynamic predictors. Integrating theory-guided time-series modelling approaches informed by ecological theory could enhance both the interpretability and accuracy of CH4 forecasts under complex seasonal dynamics, particularly when data availability is limited.
Table A1Overview of predictor variables used in the CH4 flux models. This table lists all environmental predictor variables considered in the modelling framework. For each parameter, the spatial resolution (for remote sensing layers), source, short description, and formulas for calculations are presented (where applicable). Parameters are grouped into six thematic categories: Meteorological Drivers (e.g., PAR, AT, TDD), Vegetation / Land Cover (e.g., NDVI, landscape classification, CALU), Hydrology / Moisture Indicators (e.g., NDWI, TWI), Topography (e.g., slope, aspect, TPI), and Surface Deformation (subsidence). Each variable is marked as either static (unchanging during the study period) or dynamic (time-specific).
A1 Landscape classification
To classify land cover in the TVC area, we employed a supervised classification approach using multi-source remote sensing data at 1 and 10 m resolutions. The classification process was implemented in Google Earth Engine (GEE), enabling large-scale data processing. A Random Forest (RF) classifier was chosen due to its ability to handle high-dimensional data, its resistance to overfitting, and its suitability for land cover mapping. By applying a consistent classification framework at both 1 and 10 m resolutions, this study enables direct comparisons of classification performance across spatial scales,
Training and Validation Data
The classification was trained using manually collected validation points that were assigned to six distinct land cover classes: Dwarf Shrub, Tall Shrub, Sedges, Tussock, Lichen, and Water. To ensure statistical robustness, 80 % of the validation points were used for model training, while the remaining 20 % were reserved for accuracy assessment.
Remote Sensing Data and Feature Extraction
To optimise classification accuracy, we integrated spectral, texture, and topographic features derived from multiple remote sensing sources. Sentinel-2 optical imagery at 10 m resolution was used for broad-scale classification, with images acquired during the 2018 growing season (25 June–4 September 2018) to ensure that differences in land cover classification were due to spatial resolution rather than changing environmental conditions, matching the same summer period as the 1 m drone survey. Topographic features were extracted from ArcticDEM (2 m resolution) (Porter et al., 2023). At finer spatial scales, we incorporated ultra-high resolution drone imagery (1 m and 10 cm) from Rettelbach et al. (2024) and a digital terrain model (DTM) (Lange et al., 2021).
To further enhance classification accuracy, we performed a Gray-Level Co-occurrence Matrix (GLCM) texture analysis of NDVI, allowing us to incorporate information on vegetation heterogeneity. A 2×2 kernel was used for 10 m classification, while a 20×20 kernel was applied at 1 m resolution to capture 20 m spatial patterns.
Table A2Parameters used for the landscape classification. Abbreviations in the table: NDVI – Normalized Difference Vegetation Index, NDWI – Normalized Difference Water Index, EVI – Enhanced Vegetation Index, SAVI – Soil-Adjusted Vegetation Index, GLCM – Gray-Level Co-occurrence Matrix, TPI – Topographic Position Index, TWI – Topographic Wetness Index, DEM – Digital Elevation Model. Spectral indices were derived from Sentinel-2 (10 m spatial resolution) and drone imagery (1 m spatial resolution) using the visible and near-infrared bands (Blue, Green, Red, NIR).
Classification Model and Accuracy Assessment
The Random Forest classifier was trained separately for 10 m Sentinel-2 data and 1 m drone-based data, with 200 decision trees used in both cases. The trained models were then applied to classify the entire dataset. The overall accuracy was 0.76 for 1 m resolution and 0.71 for 10 m resolution. Class-specific accuracies are provided in Table S1 in the Supplement.
Export
Final classified maps at 10 m and 1 m resolutions were exported as GeoTIFF files for further analysis and comparison.
Table A3Description of Circumarctic Land Cover Units (CALU) present in the study area. Class names and definitions are taken from Bartsch et al. (2024). Additional columns indicate (i) whether the class is present within the area of interest (AOI), and (ii) whether CH4 flux measurements are available for this class.
Table B1Summary statistics of observed CH4 fluxes () across site-specific landscape classes at 1 and 10 m spatial resolutions. The table reports the number of observations (N Obs), number of sites, where measurements were done (N sites), mean, first quartile (Q1), third quartile (Q3), minimum, and maximum CH4 flux values for each landscape class at both resolutions.
Table B2Summary statistics of observed CH4 fluxes () across CALU classes. The table reports the number of observations (n), mean, first quartile (Q1), third quartile (Q3), minimum, and maximum CH4 flux values for each landscape class at both resolutions. Class descriptions are available in Bartsch et al. (2024).
Figure B1Spearman rank correlations between environmental predictors used in the CH4 flux models at (left) 1 m and (right) 10 m resolution. Only statistically significant relationships (p<0.05) with absolute correlation strength are shown; non-significant and weak correlations are blanked. Positive correlations are shown in blue and negative correlations in red, with color intensity proportional to correlation strength (see scale bar, −1 to +1).
Figure B2Pearson correlation between spatial differences in CH4 flux predictions and selected environmental predictors. “Model influence” (left block) shows differences between RF and GBM predictions at the same resolution (RF−GBM). “Resolution influence” (middle and right blocks) show differences between 1 and 10 m predictions (1 m−10 m), calculated using predictors derived from (i) the 10 m products downscaled to 1 m (middle) and (ii) the 1 m predictors aggregated to 10 m (right). Positive correlations indicate that higher predictor values coincide with stronger CH4 flux mismatches between models or resolutions. Each cell represents Pearson's r across 30 402 pixels (10 m) and 3 050 788 pixels (1 m).
Table B3Relative importance [%] of environmental predictors for CH4 flux models across spatial resolutions and algorithms. The table shows the variable importance (in %) for each predictor derived from Random Forest (RF) and Gradient Boosted Machine (GBM) models at 1 and 10 m spatial resolution. Predictors are grouped by thematic category (e.g., Meteorological, Topographic). Importance values reflect the mean contribution of each predictor to the model performance and standard deviations (± SD).
The code used for CH4 flux modelling, resolution comparison, and upscaling across Arctic wetland landscapes is publicly available on Zenodo (Ivanova et al., 2025b): https://doi.org/10.5281/zenodo.15399084. The code used for processing the manual chamber flux measurements (2022–2024) is available at https://doi.org/10.5281/zenodo.16732354 (Ivanova and Göckede, 2025).
The CH4 flux predictions, spatial difference maps, and input dataset used in this study are publicly available on Zenodo (Ivanova et al., 2025a): https://doi.org/10.5281/zenodo.15753253.
The supplement related to this article is available online at https://doi.org/10.5194/bg-23-233-2026-supplement.
K.I., M.G., A.M.V., V.B., and T.S. developed the conceptual research design. K.I., M.G., A.B., and O.S. were responsible for the planning of research activities. Data collection was performed by K.I., B.W., and C.V. Data analysis and interpretation were conducted by K.I., M.G., and A.M.V. K.I. wrote the manuscript, with review and input from all co-authors.
The contact author has declared that none of the authors has any competing interests.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
We are grateful for the support of all research teams and the assistance provided at Trail Valley Creek Research Station in organizing the fieldwork. We are grateful to Dr. Theresia Yazbeck for insightful feedback that helped improve the manuscript. Parts of the text were language-edited with DeepL, and some analysis and plotting scripts were revised for clarity with assistance from ChatGPT (OpenAI; accessed January 2025). All outputs were reviewed and verified by the authors, who take full responsibility for the content.
We acknowledge the Trail Valley Creek orthophotos, digital surface models, and 3D point clouds provided by the MOSES airborne campaign and published by Rettelbach et al. (2024) on PANGAEA (DOI: https://doi.org/10.1594/PANGAEA.961942). ArcticDEM provided by the Polar Geospatial Center under NSF-OPP awards 1043681, 1559691, 1542736, 1810976, and 2129685. We acknowledge the use of Copernicus Sentinel-2 data (Contains modified Copernicus Sentinel data [2018–2024]). We acknowledge the use of NASA POWER data, provided by the NASA Langley Research Center (LaRC) through the POWER Project, funded by the NASA Earth Science/Applied Science Program.
KI, MG, AB, BW, VB, and TS received funding from the European Researc Council Synergy Grant QArctic (grant no. 951288). MG further acknowledges financial support from the EU HORIZON EUROPE programme (project Green-FeedBack, grant agreement no. 101056921), the European Space Agency (AMPAC-net project, contract no. 4000137912/22/I-DT) and the German Ministry for Research, Technology and Space (MOMENT project, Grant no. 03F0931G). AMV received support from the Gordon and Betty Moore Foundation (grant no. 8414), and funding catalyzed by the TED Audacious Project (Permafrost Pathways). CV received financial support from the Research Council of Finland project MUFFIN (no. 332196) and the European Research Council Starting Grant COLDSPOT (no. 101163177). OS acknowledges financial support through the Canada Research Chair (CRC-2018-00259) and NSERC Discovery Grants program (DGPIN-2018-05743 awarded, ArcticNet, a Network of Centres of Excellence Canada, grant no. P216), Canada First Research Excellence Fund's Global Water Futures program (Northern Water Futures), and the Polar Continental Shelf Program (608-20 and 602-21).
The article processing charges for this open-access publication were covered by the Max Planck Society.
This paper was edited by Paul Stoy and reviewed by two anonymous referees.
AMAP: Arctic climate change update 2021: Key trends and impacts. Summary for policy-makers, Arctic Monitoring and Assessment Programme (AMAP), Tromsø, Norway, https://www.amap.no/documents/download/6759/inline (last access: 7 January 2026), 2021.
Anderson, K. and Gaston, K. J.: Lightweight unmanned aerial vehicles will revolutionize spatial ecology, Frontiers in Ecol & Environ, 11, 138–146, https://doi.org/10.1890/120150, 2013.
Baldocchi, D. D.: Assessing the eddy covariance technique for evaluating carbon dioxide exchange rates of ecosystems: past, present and future, Global Change Biology, 9, 479–492, https://doi.org/10.1046/j.1365-2486.2003.00629.x, 2003.
Ballinger, T. J., Overland, J. E., Wang, M., Bhatt, U. S., Hanna, E., Hanssen-Bauer, I., Kim, S.-J., Thoman, R. L., and Walsh, J. E.: Arctic Report Card 2020: Surface Air Temperature, https://doi.org/10.25923/GCW8-2Z06, 2020.
Bao, T., Xu, X., Jia, G., Billesbach, D. P., and Sullivan, R. C.: Much stronger tundra methane emissions during autumn‐freeze than spring‐thaw, Global Change Biology, 27, 376–387, https://doi.org/10.1111/gcb.15421, 2021.
Barry, S. and Elith, J.: Error and uncertainty in habitat models, Journal of Applied Ecology, 43, 413–423, https://doi.org/10.1111/j.1365-2664.2006.01136.x, 2006.
Bartsch, A., Efimova, A., Widhalm, B., Muri, X., von Baeckmann, C., Bergstedt, H., Ermokhina, K., Hugelius, G., Heim, B., and Leibman, M.: Circumarctic land cover diversity considering wetness gradients, Hydrol. Earth Syst. Sci., 28, 2421–2481, https://doi.org/10.5194/hess-28-2421-2024, 2024.
Becker, T., Kutzbach, L., Forbrich, I., Schneider, J., Jager, D., Thees, B., and Wilmking, M.: Do we miss the hot spots? – The use of very high resolution aerial photographs to quantify carbon fluxes in peatlands, Biogeosciences, 5, 1387–1393, https://doi.org/10.5194/bg-5-1387-2008, 2008.
Belgiu, M. and Drăguţ, L.: Random forest in remote sensing: A review of applications and future directions, ISPRS Journal of Photogrammetry and Remote Sensing, 114, 24–31, https://doi.org/10.1016/j.isprsjprs.2016.01.011, 2016.
Berbesi, L. and Pritchard, G.: Modelling Energy Data in a Generalized Additive Model – A Case Study of Colombia, Energies, 16, 1929, https://doi.org/10.3390/en16041929, 2023.
Biasi, C., Meyer, H., Rusalimova, O., Hämmerle, R., Kaiser, C., Baranyi, C., Daims, H., Lashchinsky, N., Barsukov, P., and Richter, A.: Initial effects of experimental warming on carbon exchange rates, plant growth and microbial dynamics of a lichen-rich dwarf shrub tundra in Siberia, Plant Soil, 307, 191–205, https://doi.org/10.1007/s11104-008-9596-2, 2008.
Breiman, L.: Random Forests, Machine Learning, 45, 5–32, https://doi.org/10.1023/A:1010933404324, 2001.
Chen, S., Liu, L., Ma, Y., Zhuang, Q., and Shurpali, N. J.: Quantifying Global Wetland Methane Emissions With In Situ Methane Flux Data and Machine Learning Approaches, Earth's Future, 12, e2023EF004330, https://doi.org/10.1029/2023EF004330, 2024.
Chollet Ramampiandra, E., Scheidegger, A., Wydler, J., and Schuwirth, N.: A comparison of machine learning and statistical species distribution models: Quantifying overfitting supports model interpretation, Ecological Modelling, 481, 110353, https://doi.org/10.1016/j.ecolmodel.2023.110353, 2023.
Cortes, C. and Vapnik, V.: Support-vector networks, Mach Learn, 20, 273–297, https://doi.org/10.1007/BF00994018, 1995.
Cui, S., Liu, P., Guo, H., Nielsen, C. K., Pullens, J. W. M., Chen, Q., Pugliese, L., and Wu, S.: Wetland hydrological dynamics and methane emissions, Commun Earth Environ, 5, 470, https://doi.org/10.1038/s43247-024-01635-w, 2024.
Cutler, D. R., Edwards, T. C., Beard, K. H., Cutler, A., Hess, K. T., Gibson, J., and Lawler, J. J.: Random Forests for Classification in Ecology, Ecology, 88, 2783–2792, https://doi.org/10.1890/07-0539.1, 2007.
Davidson, S., Santos, M., Sloan, V., Reuss-Schmidt, K., Phoenix, G., Oechel, W., and Zona, D.: Upscaling CH4 Fluxes Using High-Resolution Imagery in Arctic Tundra Ecosystems, Remote Sensing, 9, 1227, https://doi.org/10.3390/rs9121227, 2017.
Drusch, M., Del Bello, U., Carlier, S., Colin, O., Fernandez, V., Gascon, F., Hoersch, B., Isola, C., Laberinti, P., Martimort, P., Meygret, A., Spoto, F., Sy, O., Marchese, F., and Bargellini, P.: Sentinel-2: ESA's Optical High-Resolution Mission for GMES Operational Services, Remote Sensing of Environment, 120, 25–36, https://doi.org/10.1016/j.rse.2011.11.026, 2012.
Elith, J., Leathwick, J. R., and Hastie, T.: A working guide to boosted regression trees, Journal of Animal Ecology, 77, 802–813, https://doi.org/10.1111/j.1365-2656.2008.01390.x, 2008.
Environment and Climate Change Canada (ECCC): Weather station data for trail valley creek (climate ID: 220N005, WMO ID: 71683), northwest territories, canada, 2003–2024, 2024.
Etminan, M., Myhre, G., Highwood, E. J., and Shine, K. P.: Radiative forcing of carbon dioxide, methane, and nitrous oxide: A significant revision of the methane radiative forcing, Geophysical Research Letters, 43, https://doi.org/10.1002/2016GL071930, 2016.
Friedman, J. H.: Greedy function approximation: A gradient boosting machine., Ann. Statist., 29, https://doi.org/10.1214/aos/1013203451, 2001.
Frolking, S., Talbot, J., Jones, M. C., Treat, C. C., Kauffman, J. B., Tuittila, E.-S., and Roulet, N.: Peatlands in the Earth's 21st century climate system, Environ. Rev., 19, 371–396, https://doi.org/10.1139/a11-014, 2011.
Gachibu Wangari, E., Mwangada Mwanake, R., Houska, T., Kraus, D., Gettel, G. M., Kiese, R., Breuer, L., and Butterbach-Bahl, K.: Identifying landscape hot and cold spots of soil greenhouse gas fluxes by combining field measurements and remote sensing data, Biogeosciences, 20, 5029–5067, https://doi.org/10.5194/bg-20-5029-2023, 2023.
Gao, B.: NDWI – A normalized difference water index for remote sensing of vegetation liquid water from space, Remote Sensing of Environment, 58, 257–266, https://doi.org/10.1016/S0034-4257(96)00067-3, 1996.
Gorelick, N., Hancher, M., Dixon, M., Ilyushchenko, S., Thau, D., and Moore, R.: Google Earth Engine: Planetary-scale geospatial analysis for everyone, Remote Sensing of Environment, 202, 18–27, https://doi.org/10.1016/j.rse.2017.06.031, 2017.
Greenwell, B., Boehmke, B., Cunningham, J., and GBM Developers: gbm: Generalized Boosted Regression Models, R package version 2.1.8.1, CRAN [code], https://CRAN.R-project.org/package=gbm (last access: 6 January 2026), 2022.
Grünberg, I., Wilcox, E. J., Zwieback, S., Marsh, P., and Boike, J.: Linking tundra vegetation, snow, soil temperature, and permafrost, Biogeosciences, 17, 4261–4279, https://doi.org/10.5194/bg-17-4261-2020, 2020.
Hastie, T. and Tibshirani, R.: Generalized Additive Models, Statist. Sci., 1, https://doi.org/10.1214/ss/1177013604, 1986.
Hijmans, R. J.: terra: Spatial Data Analysis, R package version 1.7-39, CRAN [code], https://CRAN.R-project.org/package=terra (last access: 6 January 2026), 2023.
Hugelius, G., Strauss, J., Zubrzycki, S., Harden, J. W., Schuur, E. A. G., Ping, C.-L., Schirrmeister, L., Grosse, G., Michaelson, G. J., Koven, C. D., O'Donnell, J. A., Elberling, B., Mishra, U., Camill, P., Yu, Z., Palmtag, J., and Kuhry, P.: Estimated stocks of circumpolar permafrost carbon with quantified uncertainty ranges and identified data gaps, Biogeosciences, 11, 6573–6593, https://doi.org/10.5194/bg-11-6573-2014, 2014.
Ivanova, K. and Göckede, M.: Processing of the carbon gas chamber flux, with automatic window detection and manual improvement, Zenodo [code], https://doi.org/10.5281/zenodo.16732354, 2025.
Ivanova, K., Virkkala, A.-M., Voigt, C., Sonnentag, O., Brovkin, V., Stacke, T., Bartsch, A., Widhalm, B., and Goeckede, M.: CH4 Flux Dataset and Upscaling Maps for TVC, Canada, 2019–2024, Zenodo [data set], https://doi.org/10.5281/zenodo.15753253, 2025a.
Ivanova, K., Virkkala, A.-M., Brovkin, V., Stacke, T., Widhalm, B., Bartsch, A., Voigt, C., Sonnentag, O., and Goeckede, M.: Modelling and Comparing Methane Flux Upscaling at 1 and 10 m Resolution in Trail Valley Creek, Zenodo [code], https://doi.org/10.5281/zenodo.15399084, 2025b.
Kaiser, K. E., McGlynn, B. L., and Dore, J. E.: Landscape analysis of soil methane flux across complex terrain, Biogeosciences, 15, 3143–3167, https://doi.org/10.5194/bg-15-3143-2018, 2018.
Kalacska, M., Arroyo-Mora, J. P., Soffer, R. J., Roulet, N. T., Moore, T. R., Humphreys, E., Leblanc, G., Lucanus, O., and Inamdar, D.: Estimating Peatland Water Table Depth and Net Ecosystem Exchange: A Comparison between Satellite and Airborne Imagery, Remote Sensing, 10, 687, https://doi.org/10.3390/rs10050687, 2018.
Kämäräinen, M., Tuovinen, J.-P., Kulmala, M., Mammarella, I., Aalto, J., Vekuri, H., Lohila, A., and Lintunen, A.: Spatiotemporal lagging of predictors improves machine learning estimates of atmosphere–forest CO2 exchange, Biogeosciences, 20, 897–909, https://doi.org/10.5194/bg-20-897-2023, 2023.
Karatzoglou, A., Smola, A., Hornik, K., and Zeileis, A.: kernlab – An S4 package for kernel methods in R, Journal of Machine Learning Research, 5, 1457–1460, 2004.
Karim, M. R., Reza, M. N., Jin, H., Haque, M. A., Lee, K.-H., Sung, J., and Chung, S.-O.: Application of LiDAR Sensors for Crop and Working Environment Recognition in Agriculture: A Review, Remote Sensing, 16, 4623, https://doi.org/10.3390/rs16244623, 2024.
Knox, S. H., Jackson, R. B., Poulter, B., McNicol, G., Fluet-Chouinard, E., Zhang, Z., Hugelius, G., Bousquet, P., Canadell, J. G., Saunois, M., Papale, D., Chu, H., Keenan, T. F., Baldocchi, D., Torn, M. S., Mammarella, I., Trotta, C., Aurela, M., Bohrer, G., Campbell, D. I., Cescatti, A., Chamberlain, S., Chen, J., Chen, W., Dengel, S., Desai, A. R., Euskirchen, E., Friborg, T., Gasbarra, D., Goded, I., Goeckede, M., Heimann, M., Helbig, M., Hirano, T., Hollinger, D. Y., Iwata, H., Kang, M., Klatt, J., Krauss, K. W., Kutzbach, L., Lohila, A., Mitra, B., Morin, T. H., Nilsson, M. B., Niu, S., Noormets, A., Oechel, W. C., Peichl, M., Peltola, O., Reba, M. L., Richardson, A. D., Runkle, B. R. K., Ryu, Y., Sachs, T., Schäfer, K. V. R., Schmid, H. P., Shurpali, N., Sonnentag, O., Tang, A. C. I., Ueyama, M., Vargas, R., Vesala, T., Ward, E. J., Windham-Myers, L., Wohlfahrt, G., and Zona, D.: FLUXNET-CH4 Synthesis Activity: Objectives, Observations, and Future Directions, Bulletin of the American Meteorological Society, 100, 2607–2632, https://doi.org/10.1175/BAMS-D-18-0268.1, 2019.
Koven, C. D., Ringeval, B., Friedlingstein, P., Ciais, P., Cadule, P., Khvorostyanov, D., Krinner, G., and Tarnocai, C.: Permafrost carbon-climate feedbacks accelerate global warming, Proc. Natl. Acad. Sci. U. S. A., 108, 14769–14774, https://doi.org/10.1073/pnas.1103910108, 2011.
Kuhn, M.: Building Predictive Models in R Using the caret Package, J. Stat. Soft., 28, https://doi.org/10.18637/jss.v028.i05, 2008.
Kuhn, M., Vaughan, D., and Hvitfeldt, E.: yardstick: Tidy Characterizations of Model Performance, R package version 1.3.2, CRAN [code], https://CRAN.R-project.org/package=yardstick (last access: 6 January 2026), 2025.
Kuhn, M. A., Varner, R. K., Bastviken, D., Crill, P., MacIntyre, S., Turetsky, M., Walter Anthony, K., McGuire, A. D., and Olefeldt, D.: BAWLD-CH4: a comprehensive dataset of methane fluxes from boreal and arctic ecosystems, Earth Syst. Sci. Data, 13, 5151–5189, https://doi.org/10.5194/essd-13-5151-2021, 2021.
Kwon, M. J., Beulig, F., Ilie, I., Wildner, M., Küsel, K., Merbold, L., Mahecha, M. D., Zimov, N., Zimov, S. A., Heimann, M., Schuur, E. A. G., Kostka, J. E., Kolle, O., Hilke, I., and Göckede, M.: Plants, microorganisms, and soil temperatures contribute to a decrease in methane fluxes on a drained Arctic floodplain, Glob. Change Biol., 23, 2396–2412, https://doi.org/10.1111/gcb.13558, 2017.
Lange, S., Grünberg, I., Anders, K., Hartmann, J., Helm, V., and Boike, J.: Airborne Laser Scanning (ALS) Point Clouds of Trail Valley Creek, NWT, Canada (2018), https://doi.org/10.1594/PANGAEA.934387, 2021.
Lehmann, J., Münchberger, W., Knoth, C., Blodau, C., Nieberding, F., Prinz, T., Pancotto, V., and Kleinebecker, T.: High-Resolution Classification of South Patagonian Peat Bog Microforms Reveals Potential Gaps in Up-Scaled CH4 Fluxes by use of Unmanned Aerial System (UAS) and CIR Imagery, Remote Sensing, 8, 173, https://doi.org/10.3390/rs8030173, 2016.
Lindsay, J. B.: Whitebox GAT: A case study in geomorphometric analysis, Computers & Geosciences, 95, 75–84, https://doi.org/10.1016/j.cageo.2016.07.003, 2016.
Ludwig, S. M., Schiferl, L., Hung, J., Natali, S. M., and Commane, R.: Resolving heterogeneous fluxes from tundra halves the growing season carbon budget, Biogeosciences, 21, 1301–1321, https://doi.org/10.5194/bg-21-1301-2024, 2024.
Marsh, P., Pomeroy, J., Pohl, S., Quinton, W., Onclin, C., Russell, M., Neumann, N., Pietroniro, A., Davison, B., and McCartney, S.: Snowmelt Processes and Runoff at the Arctic Treeline: Ten Years of MAGS Research, Springer Berlin Heidelberg, Berlin, Heidelberg, 97–123, https://doi.org/10.1007/978-3-540-75136-6_6, 2008.
Marsh, P., Bartlett, P., MacKay, M., Pohl, S., and Lantz, T.: Snowmelt energetics at a shrub tundra site in the western Canadian Arctic, Hydrological Processes, 24, 3603–3620, https://doi.org/10.1002/hyp.7786, 2010.
Mastepanov, M., Sigsgaard, C., Tagesson, T., Ström, L., Tamstorf, M. P., Lund, M., and Christensen, T. R.: Revisiting factors controlling methane emissions from high-Arctic tundra, Biogeosciences, 10, 5139–5158, https://doi.org/10.5194/bg-10-5139-2013, 2013.
Matthes, J. H., Sturtevant, C., Verfaillie, J., Knox, S., and Baldocchi, D.: Parsing the variability in CH4 flux at a spatially heterogeneous wetland: Integrating multiple eddy covariance towers with high-resolution flux footprint analysis, JGR Biogeosciences, 119, 1322–1339, https://doi.org/10.1002/2014JG002642, 2014.
McFeeters, S. K.: The use of the Normalized Difference Water Index (NDWI) in the delineation of open water features, International Journal of Remote Sensing, 17, 1425–1432, https://doi.org/10.1080/01431169608948714, 1996.
McNicol, G., Fluet-Chouinard, E., Ouyang, Z., Knox, S., Zhang, Z., Aalto, T., Bansal, S., Chang, K., Chen, M., Delwiche, K., Feron, S., Goeckede, M., Liu, J., Malhotra, A., Melton, J. R., Riley, W., Vargas, R., Yuan, K., Ying, Q., Zhu, Q., Alekseychik, P., Aurela, M., Billesbach, D. P., Campbell, D. I., Chen, J., Chu, H., Desai, A. R., Euskirchen, E., Goodrich, J., Griffis, T., Helbig, M., Hirano, T., Iwata, H., Jurasinski, G., King, J., Koebsch, F., Kolka, R., Krauss, K., Lohila, A., Mammarella, I., Nilson, M., Noormets, A., Oechel, W., Peichl, M., Sachs, T., Sakabe, A., Schulze, C., Sonnentag, O., Sullivan, R. C., Tuittila, E., Ueyama, M., Vesala, T., Ward, E., Wille, C., Wong, G. X., Zona, D., Windham-Myers, L., Poulter, B., and Jackson, R. B.: Upscaling Wetland Methane Emissions From the FLUXNET-CH4 Eddy Covariance Network (UpCH4 v1.0): Model Development, Network Assessment, and Budget Comparison, AGU Advances, 4, e2023AV000956, https://doi.org/10.1029/2023AV000956, 2023.
Meingast, K. M., Falkowski, M. J., Kane, E. S., Potvin, L. R., Benscoter, B. W., Smith, A. M. S., Bourgeau-Chavez, L. L., and Miller, M. E.: Spectral detection of near-surface moisture content and water-table position in northern peatland ecosystems, Remote Sensing of Environment, 152, 536–546, https://doi.org/10.1016/j.rse.2014.07.014, 2014.
NASA Langley Research Center: Prediction Of Worldwide Energy Resources (POWER) Project, POWER Project's Hourly 2.5.1 version, https://power.larc.nasa.gov (last access: 28 January 2025), 2024.
Natali, S. M., Holdren, J. P., Rogers, B. M., Treharne, R., Duffy, P. B., Pomerance, R., and MacDonald, E.: Permafrost carbon feedbacks threaten global climate goals, Proc. Natl. Acad. Sci. U.S.A., 118, e2100163118, https://doi.org/10.1073/pnas.2100163118, 2021.
Natekin, A. and Knoll, A.: Gradient boosting machines, a tutorial, Front. Neurorobot., 7, https://doi.org/10.3389/fnbot.2013.00021, 2013.
Nisbet, E. G., Manning, M. R., Dlugokencky, E. J., Fisher, R. E., Lowry, D., Michel, S. E., Myhre, C. L., Platt, S. M., Allen, G., Bousquet, P., Brownlow, R., Cain, M., France, J. L., Hermansen, O., Hossaini, R., Jones, A. E., Levin, I., Manning, A. C., Myhre, G., Pyle, J. A., Vaughn, B. H., Warwick, N. J., and White, J. W. C.: Very Strong Atmospheric Methane Growth in the 4 Years 2014–2017: Implications for the Paris Agreement, Global Biogeochemical Cycles, 33, 318–342, https://doi.org/10.1029/2018GB006009, 2019.
Olefeldt, D., Turetsky, M. R., Crill, P. M., and McGuire, A. D.: Environmental and physical controls on northern terrestrial methane emissions across permafrost zones, Global Change Biology, 19, 589–603, https://doi.org/10.1111/gcb.12071, 2013.
Olefeldt, D., Goswami, S., Grosse, G., Hayes, D., Hugelius, G., Kuhry, P., McGuire, A. D., Romanovsky, V. E., Sannel, A. B. K., Schuur, E. A. G., and Turetsky, M. R.: Circumpolar distribution and carbon storage of thermokarst landscapes, Nat Commun, 7, 13043, https://doi.org/10.1038/ncomms13043, 2016.
Olefeldt, D., Hovemyr, M., Kuhn, M. A., Bastviken, D., Bohn, T. J., Connolly, J., Crill, P., Euskirchen, E. S., Finkelstein, S. A., Genet, H., Grosse, G., Harris, L. I., Heffernan, L., Helbig, M., Hugelius, G., Hutchins, R., Juutinen, S., Lara, M. J., Malhotra, A., Manies, K., McGuire, A. D., Natali, S. M., O'Donnell, J. A., Parmentier, F.-J. W., Räsänen, A., Schädel, C., Sonnentag, O., Strack, M., Tank, S. E., Treat, C., Varner, R. K., Virtanen, T., Warren, R. K., and Watts, J. D.: The Boreal–Arctic Wetland and Lake Dataset (BAWLD), Earth Syst. Sci. Data, 13, 5127–5149, https://doi.org/10.5194/essd-13-5127-2021, 2021.
Papale, D. and Valentini, R.: A new assessment of European forests carbon exchanges by eddy fluxes and artificial neural network spatialization, Global Change Biology, 9, 525–535, https://doi.org/10.1046/j.1365-2486.2003.00609.x, 2003.
Peltola, O., Vesala, T., Gao, Y., Räty, O., Alekseychik, P., Aurela, M., Chojnicki, B., Desai, A. R., Dolman, A. J., Euskirchen, E. S., Friborg, T., Göckede, M., Helbig, M., Humphreys, E., Jackson, R. B., Jocher, G., Joos, F., Klatt, J., Knox, S. H., Kowalska, N., Kutzbach, L., Lienert, S., Lohila, A., Mammarella, I., Nadeau, D. F., Nilsson, M. B., Oechel, W. C., Peichl, M., Pypker, T., Quinton, W., Rinne, J., Sachs, T., Samson, M., Schmid, H. P., Sonnentag, O., Wille, C., Zona, D., and Aalto, T.: Monthly gridded data product of northern wetland methane emissions based on upscaling eddy covariance observations, Earth Syst. Sci. Data, 11, 1263–1289, https://doi.org/10.5194/essd-11-1263-2019, 2019.
Petrone, R. M., Rouse, W. R., and Marsh, P.: Comparative surface energy budgets in western and central subarctic regions of Canada, International Journal of Climatology, 20, 1131–1148, https://doi.org/10.1002/1097-0088(200008)20:10<1131::AID-JOC524>3.0.CO;2-J, 2000.
Pirk, N., Santos, T., Gustafson, C., Johansson, A. J., Tufvesson, F., Parmentier, F. W., Mastepanov, M., and Christensen, T. R.: Methane emission bursts from permafrost environments during autumn freeze‐in: New insights from ground‐penetrating radar, Geophysical Research Letters, 42, 6732–6738, https://doi.org/10.1002/2015GL065034, 2015.
Porter, C., Howat, I., Noh, M.-J., Husby, E., Khuvis, S., Danish, E., Tomko, K., Gardiner, J., Negrete, A., Yadav, B., Klassen, J., Kelleher, C., Cloutier, M., Bakker, J., Enos, J., Arnold, G., Bauer, G., Morin, P., and Polar Geospatial Center: ArcticDEM – Mosaics, Version 4.1 (1.0), https://doi.org/10.7910/DVN/3VDC4W, 2023.
Prasad, A. M., Iverson, L. R., and Liaw, A.: Newer Classification and Regression Tree Techniques: Bagging and Random Forests for Ecological Prediction, Ecosystems, 9, 181–199, https://doi.org/10.1007/s10021-005-0054-1, 2006.
Previdi, M., Smith, K. L., and Polvani, L. M.: Arctic amplification of climate change: a review of underlying mechanisms, Environ. Res. Lett., 16, 093003, https://doi.org/10.1088/1748-9326/ac1c29, 2021.
R Core Team: R: A language and environment for statistical computing, R Foundation for Statistical Computing, Vienna, Austria, https://www.R-project.org/ (last access: 6 January 2026), 2024.
Rantanen, M., Karpechko, A. Yu., Lipponen, A., Nordling, K., Hyvärinen, O., Ruosteenoja, K., Vihma, T., and Laaksonen, A.: The Arctic has warmed nearly four times faster than the globe since 1979, Commun Earth Environ, 3, 168, https://doi.org/10.1038/s43247-022-00498-3, 2022.
Räsänen, A. and Virtanen, T.: Data and resolution requirements in mapping vegetation in spatially heterogeneous landscapes, Remote Sensing of Environment, 230, 111207, https://doi.org/10.1016/j.rse.2019.05.026, 2019.
Räsänen, A., Manninen, T., Korkiakoski, M., Lohila, A., and Virtanen, T.: Predicting catchment-scale methane fluxes with multi-source remote sensing, Landscape Ecol, 36, 1177–1195, https://doi.org/10.1007/s10980-021-01194-x, 2021.
Reichstein, M., Camps-Valls, G., Stevens, B., Jung, M., Denzler, J., Carvalhais, N., and Prabhat: Deep learning and process understanding for data-driven Earth system science, Nature, 566, 195–204, https://doi.org/10.1038/s41586-019-0912-1, 2019.
Rettelbach, T., Nitze, I., Grünberg, I., Hammar, J., Schäffler, S., Hein, D., Gessner, M., Bucher, T., Brauchle, J., Hartmann, J., Sachs, T., Boike, J., and Grosse, G.: Super-high-resolution aerial imagery, digital surface models and 3D point clouds of Trail Valley Creek, Canada, https://doi.org/10.1594/PANGAEA.961942, 2024.
Riihimäki, H., Kemppinen, J., Kopecký, M., and Luoto, M.: Topographic Wetness Index as a Proxy for Soil Moisture: The Importance of Flow-Routing Algorithm and Grid Resolution, Water Resources Research, 57, e2021WR029871, https://doi.org/10.1029/2021WR029871, 2021.
Rouse, J. W., Haas, R. H., Schell, J. A., and Deering, D. W.: Monitoring vegetation systems in the Great Plains with ERTS, in: Third Earth Resources Technology Satellite-1 Symposium, NASA SP-351, NASA, Washington, D.C., 309–317, https://ntrs.nasa.gov/api/citations/19740022614/downloads/19740022614.pdf (last access: 7 January 2026), 1974.
Ruhoff, A. L., Castro, N. M. R., and Risso, A.: Numerical modelling of the topographic wetness index: An analysis at different scales, International Journal of Geosciences, 2, 476–483, 2011.
Runge, A. and Grosse, G.: Comparing Spectral Characteristics of Landsat-8 and Sentinel-2 Same-Day Data for Arctic-Boreal Regions, Remote Sensing, 11, 1730, https://doi.org/10.3390/rs11141730, 2019.
Saunois, M., Stavert, A. R., Poulter, B., Bousquet, P., Canadell, J. G., Jackson, R. B., Raymond, P. A., Dlugokencky, E. J., Houweling, S., Patra, P. K., Ciais, P., Arora, V. K., Bastviken, D., Bergamaschi, P., Blake, D. R., Brailsford, G., Bruhwiler, L., Carlson, K. M., Carrol, M., Castaldi, S., Chandra, N., Crevoisier, C., Crill, P. M., Covey, K., Curry, C. L., Etiope, G., Frankenberg, C., Gedney, N., Hegglin, M. I., Höglund-Isaksson, L., Hugelius, G., Ishizawa, M., Ito, A., Janssens-Maenhout, G., Jensen, K. M., Joos, F., Kleinen, T., Krummel, P. B., Langenfelds, R. L., Laruelle, G. G., Liu, L., Machida, T., Maksyutov, S., McDonald, K. C., McNorton, J., Miller, P. A., Melton, J. R., Morino, I., Müller, J., Murguia-Flores, F., Naik, V., Niwa, Y., Noce, S., O'Doherty, S., Parker, R. J., Peng, C., Peng, S., Peters, G. P., Prigent, C., Prinn, R., Ramonet, M., Regnier, P., Riley, W. J., Rosentreter, J. A., Segers, A., Simpson, I. J., Shi, H., Smith, S. J., Steele, L. P., Thornton, B. F., Tian, H., Tohjima, Y., Tubiello, F. N., Tsuruta, A., Viovy, N., Voulgarakis, A., Weber, T. S., van Weele, M., van der Werf, G. R., Weiss, R. F., Worthy, D., Wunch, D., Yin, Y., Yoshida, Y., Zhang, W., Zhang, Z., Zhao, Y., Zheng, B., Zhu, Q., Zhu, Q., and Zhuang, Q.: The Global Methane Budget 2000–2017, Earth Syst. Sci. Data, 12, 1561–1623, https://doi.org/10.5194/essd-12-1561-2020, 2020.
Scheller, J. H., Mastepanov, M., and Christensen, T. R.: Toward UAV-based methane emission mapping of Arctic terrestrial ecosystems, Science of The Total Environment, 819, 153161, https://doi.org/10.1016/j.scitotenv.2022.153161, 2022.
Schoorl, J. M., Sonneveld, M. P. W., and Veldkamp, A.: Three-dimensional landscape process modelling: the effect of DEM resolution, Earth Surf. Process. Landforms, 25, 1025–1034, https://doi.org/10.1002/1096-9837(200008)25:9<1025::AID-ESP116>3.0.CO;2-Z, 2000.
Schuur, E. A. G., McGuire, A. D., Schädel, C., Grosse, G., Harden, J. W., Hayes, D. J., Hugelius, G., Koven, C. D., Kuhry, P., Lawrence, D. M., Natali, S. M., Olefeldt, D., Romanovsky, V. E., Schaefer, K., Turetsky, M. R., Treat, C. C., and Vonk, J. E.: Climate change and the permafrost carbon feedback, Nature, 520, 171–179, https://doi.org/10.1038/nature14338, 2015.
Sjögersten, S., Ledger, M., Siewert, M., de la Barreda-Bautista, B., Sowter, A., Gee, D., Foody, G., and Boyd, D. S.: Optical and radar Earth observation data for upscaling methane emissions linked to permafrost degradation in sub-Arctic peatlands in northern Sweden, Biogeosciences, 20, 4221–4239, https://doi.org/10.5194/bg-20-4221-2023, 2023.
Smola, A. J. and Schölkopf, B.: A tutorial on support vector regression, Statistics and Computing, 14, 199–222, https://doi.org/10.1023/B:STCO.0000035301.49549.88, 2004.
Ström, L., Mastepanov, M., and Christensen, T. R.: Species-specific Effects of Vascular Plants on Carbon Turnover and Methane Emissions from Wetlands, Biogeochemistry, 75, 65–82, https://doi.org/10.1007/s10533-004-6124-1, 2005.
Subke, J.-A., Kutzbach, L., and Risk, D.: Soil chamber measurements, in: Springer handbook of atmospheric measurements, Springer, 1603–1624, https://doi.org/10.1007/978-3-030-52171-4_60, 2021.
Tagesson, T., Mastepanov, M., Mölder, M., Tamstorf, M. P., Eklundh, L., Smith, B., Sigsgaard, C., Lund, M., Ekberg, A., Falk, J. M., Friborg, T., Christensen, T. R., and Ström, L.: Modelling of growing season methane fluxes in a high-Arctic wet tundra ecosystem 1997–2010 using in situ and high-resolution satellite data, Tellus B: Chemical and Physical Meteorology, 65, 19722, https://doi.org/10.3402/tellusb.v65i0.19722, 2013.
The MathWorks Inc.: MATLAB version: 9.9 (R2020b), The MathWorks Inc., Natick, Massachusetts, https://www.mathworks.com (last access: 6 January 2026), 2020.
Tramontana, G., Jung, M., Schwalm, C. R., Ichii, K., Camps-Valls, G., Ráduly, B., Reichstein, M., Arain, M. A., Cescatti, A., Kiely, G., Merbold, L., Serrano-Ortiz, P., Sickert, S., Wolf, S., and Papale, D.: Predicting carbon dioxide and energy fluxes across global FLUXNET sites with regression algorithms, Biogeosciences, 13, 4291–4313, https://doi.org/10.5194/bg-13-4291-2016, 2016.
Treat, C. C., Marushchak, M. E., Voigt, C., Zhang, Y., Tan, Z., Zhuang, Q., Virtanen, T. A., Räsänen, A., Biasi, C., Hugelius, G., Kaverin, D., Miller, P. A., Stendel, M., Romanovsky, V., Rivkin, F., Martikainen, P. J., and Shurpali, N. J.: Tundra landscape heterogeneity, not interannual variability, controls the decadal regional carbon balance in the Western Russian Arctic, Global Change Biology, 24, 5188–5204, https://doi.org/10.1111/gcb.14421, 2018.
Treat, C. C., Virkkala, A., Burke, E., Bruhwiler, L., Chatterjee, A., Fisher, J. B., Hashemi, J., Parmentier, F. W., Rogers, B. M., Westermann, S., Watts, J. D., Blanc-Betes, E., Fuchs, M., Kruse, S., Malhotra, A., Miner, K., Strauss, J., Armstrong, A., Epstein, H. E., Gay, B., Goeckede, M., Kalhori, A., Kou, D., Miller, C. E., Natali, S. M., Oh, Y., Shakil, S., Sonnentag, O., Varner, R. K., Zolkos, S., Schuur, E. A. G., and Hugelius, G.: Permafrost Carbon: Progress on Understanding Stocks and Fluxes Across Northern Terrestrial Ecosystems, JGR Biogeosciences, 129, e2023JG007638, https://doi.org/10.1029/2023JG007638, 2024.
Turetsky, M. R., Abbott, B. W., Jones, M. C., Anthony, K. W., Olefeldt, D., Schuur, E. A. G., Grosse, G., Kuhry, P., Hugelius, G., Koven, C., Lawrence, D. M., Gibson, C., Sannel, A. B. K., and McGuire, A. D.: Carbon release through abrupt permafrost thaw, Nat. Geosci., 13, 138–143, https://doi.org/10.1038/s41561-019-0526-0, 2020.
Varner, R. K., Crill, P. M., Frolking, S., McCalley, C. K., Burke, S. A., Chanton, J. P., Holmes, M. E., Isogenie Project Coordinators, Saleska, S., and Palace, M. W.: Permafrost thaw driven changes in hydrology and vegetation cover increase trace gas emissions and climate forcing in Stordalen Mire from 1970 to 2014, Phil. Trans. R. Soc. A., 380, 20210022, https://doi.org/10.1098/rsta.2021.0022, 2022.
Vaze, J., Teng, J., and Spencer, G.: Impact of DEM accuracy and resolution on topographic indices, Environmental Modelling & Software, 25, 1086–1098, https://doi.org/10.1016/j.envsoft.2010.03.014, 2010.
Virkkala, A.-M., Niittynen, P., Kemppinen, J., Marushchak, M. E., Voigt, C., Hensgens, G., Kerttula, J., Happonen, K., Tyystjärvi, V., Biasi, C., Hultman, J., Rinne, J., and Luoto, M.: High-resolution spatial patterns and drivers of terrestrial ecosystem carbon dioxide, methane, and nitrous oxide fluxes in the tundra, Biogeosciences, 21, 335–355, https://doi.org/10.5194/bg-21-335-2024, 2024.
Vogt, J., Pallandt, M. M. T. A., Basso, L. S., Bolek, A., Ivanova, K., Schlutow, M., Celis, G., Kuhn, M., Mauritz, M., Schuur, E. A. G., Arndt, K., Virkkala, A.-M., Wargowsky, I., and Göckede, M.: ARGO: ARctic greenhouse Gas Observation metadata version 1, Earth Syst. Sci. Data, 17, 2553–2573, https://doi.org/10.5194/essd-17-2553-2025, 2025.
Voigt, C., Sonnentag, O., Nesic, Z., and Hould Gosselin, G.: Atmospheric methane consumption by upland soils – dataset 1: high temporal resolution methane fluxes, https://doi.org/10.1594/PANGAEA.953117, 2023a.
Voigt, C., Sonnentag, O., Virkkala, A.-M., Bennett, K. A., Marushchak, M. E., Saarela, T., Wilcox, E. J., Chevrier-Dion, C., and Marquis, C.: Atmospheric methane consumption by upland soils – dataset 2: campaign-based manual chamber measurements, https://doi.org/10.1594/PANGAEA.953119, 2023b.
Voigt, C., Virkkala, A.-M., Hould Gosselin, G., Bennett, K. A., Black, T. A., Detto, M., Chevrier-Dion, C., Guggenberger, G., Hashmi, W., Kohl, L., Kou, D., Marquis, C., Marsh, P., Marushchak, M. E., Nesic, Z., Nykänen, H., Saarela, T., Sauheitl, L., Walker, B., Weiss, N., Wilcox, E. J., and Sonnentag, O.: Arctic soil methane sink increases with drier conditions and higher ecosystem respiration, Nat. Clim. Chang., 13, 1095–1104, https://doi.org/10.1038/s41558-023-01785-3, 2023c.
Walter Anthony, K., Schneider von Deimling, T., Nitze, I., Frolking, S., Emond, A., Daanen, R., Anthony, P., Lindgren, P., Jones, B., and Grosse, G.: 21st-century modeled permafrost carbon emissions accelerated by abrupt thaw beneath lakes, Nature Communications, 9, 1–11, https://doi.org/10.1038/s41467-018-05738-9, 2018.
Widhalm, B., Bartsch, A., Strozzi, T., Jones, N., Khomutov, A., Babkina, E., Leibman, M., Khairullin, R., Göckede, M., Bergstedt, H., von Baeckmann, C., and Muri, X.: InSAR-derived seasonal subsidence reflects spatial soil moisture patterns in Arctic lowland permafrost regions, The Cryosphere, 19, 1103–1133, https://doi.org/10.5194/tc-19-1103-2025, 2025.
Wille, C., Kutzbach, L., Sachs, T., Wagner, D., and Pfeiffer, E.: Methane emission from Siberian arctic polygonal tundra: eddy covariance measurements and modelling, Global Change Biology, 14, 1395–1408, https://doi.org/10.1111/j.1365-2486.2008.01586.x, 2008.
Wood, S. N.: Generalized Additive Models: An Introduction with R, 2nd edn., Chapman and Hall/CRC, https://doi.org/10.1201/9781315370279, 2017.
Wright, M. N. and Ziegler, A.: ranger: A Fast Implementation of Random Forests for High Dimensional Data in C and R, J. Stat. Soft., 77, https://doi.org/10.18637/jss.v077.i01, 2017.
Ying, Q., Poulter, B., Watts, J. D., Arndt, K. A., Virkkala, A.-M., Bruhwiler, L., Oh, Y., Rogers, B. M., Natali, S. M., Sullivan, H., Armstrong, A., Ward, E. J., Schiferl, L. D., Elder, C. D., Peltola, O., Bartsch, A., Desai, A. R., Euskirchen, E., Göckede, M., Lehner, B., Nilsson, M. B., Peichl, M., Sonnentag, O., Tuittila, E.-S., Sachs, T., Kalhori, A., Ueyama, M., and Zhang, Z.: WetCH4: a machine-learning-based upscaling of methane fluxes of northern wetlands during 2016–2022, Earth Syst. Sci. Data, 17, 2507–2534, https://doi.org/10.5194/essd-17-2507-2025, 2025.
Yuan, K., Li, F., McNicol, G., Chen, M., Hoyt, A., Knox, S., Riley, W. J., Jackson, R., and Zhu, Q.: Boreal–Arctic wetland methane emissions modulated by warming and vegetation activity, Nat. Clim. Chang., 14, 282–288, https://doi.org/10.1038/s41558-024-01933-3, 2024.
Zhang, C., Comas, X., and Brodylo, D.: A Remote Sensing Technique to Upscale Methane Emission Flux in a Subtropical Peatland, JGR Biogeosciences, 125, e2020JG006002, https://doi.org/10.1029/2020JG006002, 2020.
Zona, D., Gioli, B., Commane, R., Lindaas, J., Wofsy, S. C., Miller, C. E., Dinardo, S. J., Dengel, S., Sweeney, C., Karion, A., Chang, R. Y.-W., Henderson, J. M., Murphy, P. C., Goodrich, J. P., Moreaux, V., Liljedahl, A., Watts, J. D., Kimball, J. S., Lipson, D. A., and Oechel, W. C.: Cold season emissions dominate the Arctic tundra methane budget, Proc. Natl. Acad. Sci. U. S. A., 113, 40–45, https://doi.org/10.1073/pnas.1516017113, 2016.
- Abstract
- Introduction
- Materials and Methods
- Results and Discussion
- Conclusion
- Appendix A: Predictors from remote sensing and meteorological data
- Appendix B: Results
- Code availability
- Data availability
- Author contributions
- Competing interests
- Disclaimer
- Acknowledgements
- Financial support
- Review statement
- References
- Supplement
- Abstract
- Introduction
- Materials and Methods
- Results and Discussion
- Conclusion
- Appendix A: Predictors from remote sensing and meteorological data
- Appendix B: Results
- Code availability
- Data availability
- Author contributions
- Competing interests
- Disclaimer
- Acknowledgements
- Financial support
- Review statement
- References
- Supplement