Eddy covariance flux errors due to random and systematic timing errors during data acquisition
- 1LI-COR Biosciences Inc., Lincoln, Nebraska 68504, USA
- 2Dipartimento per la Innovazione nei sistemi Biologici, Agroalimentari e Forestali – DIBAF, Università degli Studi della Tuscia, Viterbo, 01100, Italy
- 3R. B. Daugherty Water for Food Institute, School of Natural Resources, University of Nebraska, Lincoln, Nebraska 68583, USA
- 4Centro Euro-Mediterraneo sui Cambiamenti Climatici – CMCC, Lecce, 73100, Italy
Correspondence: Gerardo Fratini (firstname.lastname@example.org)
Modern eddy covariance (EC) systems collect high-frequency data (10–20 Hz) via digital outputs of instruments. This is an important evolution with respect to the traditional and widely used mixed analog/digital systems, as fully digital systems help overcome the traditional limitations of transmission reliability, data quality, and completeness of the datasets.
However, fully digital acquisition introduces a new problem for guaranteeing data synchronicity when the clocks of the involved devices themselves cannot be synchronized, which is often the case with instruments providing data via serial or Ethernet connectivity in a streaming mode. In this paper, we suggest that, when assembling EC systems “in-house”, aspects related to timing issues need to be carefully considered to avoid significant flux biases.
By means of a simulation study, we found that, in most cases, random timing errors can safely be neglected, as they do not impact fluxes significantly. At the same time, systematic timing errors potentially arising in asynchronous systems can effectively act as filters leading to significant flux underestimations, as large as 10 %, by means of attenuation of high-frequency flux contributions. We characterized the transfer function of such “filters” as a function of the error magnitude and found cutoff frequencies as low as 1 Hz, implying that synchronization errors can dominate high-frequency attenuations in open- and enclosed-path EC systems. In most cases, such timing errors neither be detected nor characterized a posteriori. Therefore, it is important to test the ability of traditional and prospective EC data logging systems to assure the required synchronicity and propose a procedure to implement such a test relying on readily available equipment.
Eddy covariance (EC) is the most direct and defensible technique to measure atmosphere–biosphere exchange fluxes of energy and matter to date (e.g., see Aubinet et al., 2000, 2012; Baldocchi et al., 2001). The method is based on the Navier–Stokes equations for mass and momentum conservation and relies on simplifying assumptions to describe the vertical turbulent flux in terms of the covariance of the vertical wind component (w) and of the scalar of interest.
Calculating EC fluxes of a gaseous species requires collecting synchronous data of w and of the concentration c of the gas, which is typically performed using a 3-D ultrasonic anemometer and a gas analyzer operating at suitable frequencies of 10 to 20 Hz. After proper data treatment and time alignment, the covariance of the two time series is calculated, from which the flux is derived (e.g., Foken et al., 2012). In this context, synchronicity means that w and c values for any given timestamp (i.e., the data that are multiplied together in the covariance) describe the properties of the same air parcel.
Regardless of the level of integration and physical configuration of the instruments within an EC system, wind and concentration data are measured by two different instruments, an anemometer relying on the speed of sound measurements between transducer pairs, and a gas analyzer relying on the light transformation measurements in the sampling path. In addition, data collection is performed by means of a variety of more or less engineered data acquisition systems. Delays in the data flows, digital clock drifts, required separation of the measuring devices, and artifacts in the data acquisition strategy can lead to poor synchronicity, i.e., to misalignments of the time series, such that w and c values assigned to a given timestamp refer to properties of fully or partially different air parcels. If not addressed, such misalignments can lead to significant flux errors of both a random and systematic nature.
Commercial solutions implementing sound engineering practices do exist for well-established EC measurements of CO2 and H2O fluxes to assure a sufficient level of data synchronicity as per the requirements of the EC method, for a select few anemometer–analyzer pairs. However, most of such solutions are not scalable to other hardware models or gas species because the required instrumentation does not necessarily support the same connectivity technology and specifications. Therefore, it is generally very challenging, for example, to simply replace a gas analyzer with another one from another manufacturer and keep the same synchronization performance. Furthermore, it is customary for many research groups to assemble EC systems “in-house”, especially when addressing gas species that have not been popular enough to grow strong commercial interest. Typically, in these systems, data collection is performed with industrial data loggers or computers via serial or Ethernet connectivity, using custom-built logging software. In such cases, it is particularly important to verify that various types of data misalignment are not being introduced by the data logging system and data collection strategy to assure minimal or no bias in resulting fluxes.
In this paper, we discuss the types and sources of misalignment that can arise in poorly designed fully digital EC systems and quantify their effects on resulting fluxes. In conjunction with site-specific characteristics, such as the typical co-spectral shapes, this information can help design appropriate data collection scheme for EC systems assembled “in-house”. Users of most commercially available industrial-grade EC systems can generally assume their systems to not be affected by significant timing errors, although this can and should be verified case by case. We also propose a strategy for evaluating prospective EC data collection systems from the point of view of data synchronicity before they are used in routine field activities.
1.1 Analog vs. digital EC systems
Traditionally, a combination of analog and digital transmission systems has been used to collect EC data (Eugster and Plüss, 2010). For example, analog signals from the gas analyzer were sent to an interface unit responsible for digitizing the data before merging it with anemometric data, itself coming from an A/D (analog/digital) converter into the interface unit, typically from a sonic anemometer-thermometer (SAT). More recently, specular solutions, with the analog data from the SAT sent to an interface unit residing in the gas analyzer system, became available and were widely adopted. With both of these approaches, the data were presented to the user for the flux calculation as single files with wind and gas time series merged and synchronized by the interface unit.
Analog data output allows the data to easily cross clock domains. The clock that is used to sample the original signal does not need to be synchronized to the clock that samples the analog output. This makes it very convenient to merge data from systems with unsynchronized clocks and risks of misalignments are limited to small random errors that, as we will see later, have no significant effects on fluxes.
However, collecting data in analog form has several limitations and risks. First, the number of analog channels available either as outputs from the instruments or as inputs to the interface unit is typically limited to four or six, which dramatically reduces the number of variables that can be collected. In fact, historic EC raw datasets are comprised of six or maximum seven variables: the three wind components (u, v, w), the sonic temperature (Ts), and the concentration of the gases of interest (c, traditionally CO2 and H2O); more rarely, a diagnostic variable for the anemometric data was also collected. Critical information such as the full diagnostics of both instruments and their status (e.g., the temperature and pressure in the gas analyzer cell or the signal strength) or the original raw measurement (speed of sound, raw data counts etc.) are not collected in most analog systems, limiting the means for quality screening and limiting the possibility of future recomputation of the most fundamental raw measurements. Another problem with analog data collection is that signals are subject to degradation due to dissipation, electromagnetic noise, and ageing of cables and connectors, which reduces the quality of collected data (Barnes, 1987). In addition, although all raw measurements are analog in nature, they are typically immediately digitized (native digital format provided by the manufacturer) and then – in the case of analog data collection – they are re-converted to analog, sent to the interface unit and there converted back to digital; these A/D–D/A conversions potentially degrade the signal adding noise and dampening high-frequency signal components (Eugster and Plüss, 2010). For these reasons, analog connectivity should nowadays be avoided whenever possible in favor of fully digital solutions.
In fully digital EC acquisition systems, both data streams are collected in their respective native digital format, i.e., without additional A/D conversions other than those implemented by the manufacturer to provide digital outputs. Fully digital systems largely or completely overcome both problems with mixed analog/digital systems, using more robust and less corruptible data transmission protocols, and providing the possibility of collecting all variables available from the individual instrument. However, combining digital data streams from different instruments brings new challenges, most notably with respect to data synchronization. While moving between clock domains is trivial in an analog system, it can be much more challenging with digital data when the involved clocks can be completely asynchronous to each other.
1.2 The problem of clock synchronization in digital systems
Different strategies exist for collecting data digitally. First, instruments can perform the measurements according to their own scheduler or a trigger. In the scheduler case, data can then be collected by polling the instrument for the latest available data (polling mode) or by keeping an open channel where data are streamed (streaming mode). In the trigger case, data are usually made available after a fixed or somewhat variable delay to the logging device. This delay is due to the acquisition time and could also include a delay due to filtering. However, the timestamp for the acquired data is assigned based on the occurrence of the trigger, therefore removing any timing error due to that delay. All modes have advantages and disadvantages. As described later, triggering and polling modes are less susceptible to timing errors, but they require the instrumentation to be designed for the particular triggering or polling system adopted. They are therefore best suited for EC systems built with all components from the same manufacturer. For the same reason, such systems are commonly not flexible enough to accommodate third-party instrumentation. EC systems commercialized by Campbell Scientific Inc. (Logan, UT, USA; “CSI” hereafter) are examples of integrated systems using a data triggering strategy to collect data from instruments designed ad hoc. Data communication in these systems is realized via the synchronous devices for measurement (SDM) protocol (or its evolutions), which is a CSI proprietary protocol, implemented only in CSI instruments and some CO2∕H2O gas analyzers by LI-COR Biosciences Inc. (Lincoln, NE, USA; “LI-COR” hereafter). By contrast, most instrumentation available for fast wind and gas measurements only provide data transmission options in streaming mode. As a consequence, most data logging solutions developed by the scientific community or by commercial entities are designed to handle data provided in streaming mode and are therefore flexible to accommodate a wide variety of instrumentation. Examples of such logging systems developed by the community are PC-based software such as Huskerflux (https://github.com/Flux-Dave/HuskerFlux, last access: 26 June 2018), EddyMeas (Kolle and Rebmann, 2007), EdiSol (EdiSol User Guide V0.39b https://epic.awi.de/29686/1/Mon2005d.pdf, last access: 26 June 2018), or the already referenced system proposed by Eugster and Plüss (2010) for EC measurement of methane. As for commercial solutions, LI-COR provides industrial-grade EC systems based on the SmartFlux® system, that can accommodate a wide variety of instrumentation using the data-streaming approach.
However, collecting data in streaming mode exposes the risk of introducing significant timing errors, because of the number of asynchronous digital clocks involved.
Digital clocks are electronic oscillator circuits that use the mechanical resonance of a vibrating crystal of piezoelectric material to create an electrical signal with a precise frequency, which is then used to keep track of time. The number and quality of clocks involved in an EC system vary with the data collection strategy and technology. In systems based on data triggering or polling, there is only one critical clock (the one responsible for the timing of the triggering or polling signal); therefore, there is no significant risk of introducing systematic timing misalignments between data from different instruments (see later). With these systems, the risk is limited to random and/or constant misalignments. As we will see later, random or constant misalignments do not entail large errors. For this reason, in the remainder of this section we consider in more detail the situation with systems based on data streaming.
In such systems the potentially relevant clocks are, in general, the following.
The sampling clocks of the sensing instruments (the SAT and the gas analyzer in a typical EC system), responsible for sampling data at the prescribed rate with sufficient precision and accuracy;
For systems that transmit data serially (RS-232 or RS-485), the serial clock of the same sensing instrument, which may or may not be correlated to its sampling clock;
The clocks of the logging device(s) (data logger, PC, etc.), responsible for attaching a timestamp to the data. If a single logging device is used, this is usually also responsible for merging data streams from the different instruments; in case dedicated logging systems are used for different instruments, merging is performed in post-processing and the clocks of the different loggers must, therefore, be aligned sufficiently frequently (e.g., every second, using a GPS signal).
Typical open digital communication protocols used for EC instruments with data-streaming instrumentation are serial (RS-232, RS-485) and packet-based data protocols (Ethernet). In devices that transmit data via serial communication, such as SATs, there are no means to synchronize the sampling clock of the device to that of the data logger. With such devices, the best that can be done is to assign a timestamp after transmission, based on the clock of the data logger (this last step should be performed carefully to avoid large inaccuracies due to serial port latencies, especially in PC-based systems). In addition, devices implementing serial communication have an asynchronous clock that drives those protocols (e.g., Dobkin et al., 2010). If this clock is correlated with the device's sampling clock, the receiving data logger can – at least in principle – reconstruct the sampling clock. However, in devices that do not correlate sampling and serial clocks (such as those that output data in a software thread that is independent of an acquisition thread), the system scheduler then determines when data are transmitted, thereby completely isolating the serial clock from the sampling clock and making it impossible for the data logger to reconstruct the sampling clock.
Packet-based data communications such as Ethernet even further isolate the sampling clock from the transmission clock. In devices using this protocol, it is therefore impossible to reconstruct a sampling clock. However, for Ethernet-based systems additional protocols are available, such as Network Time Protocol (NTP) or Precision Time Protocol (PTP), to actually synchronize all system clocks. The synchronized system clocks then allow the data to be correctly timestamped before transmission, eliminating any synchronization issue, provided that downstream software can align the various data streams based on their timestamps (e.g., Mahmood et al., 2014).
1.3 Types of timing errors
In typical EC data acquisition setups, time series collected by different instruments can show three distinct types of misalignments (Fig. 1).
Time lags: these are constant offsets in otherwise perfectly aligned time series. They can be the result of constant electronic delays or of fixed delays due to digital signal processing. More frequently, they result from a physical separation of the sampling volumes or from the delay due to the time needed for the passage of air in a sampling line.
Random timing errors (RTEs in the following): occur when the timestamps assigned to the data differ from the exact time dictated by the nominal sampling frequency, and such differences are randomly distributed so that, on average, the actual frequency is equal to the nominal one. In practice, in the EC context, it is more useful to consider the random differences in the timestamps assigned to data from one instrument with respect to that of the paired instrument. In fact, in the hypothetical case in which the two instruments would have the exact same sequence of random errors, that would not introduce any misalignment and hence no flux bias.
Systematic timing errors (STEs): occur when the timestamps assigned to the data differ from the exact time dictated by the nominal sampling frequency, and such differences are systematic, e.g., the actual time step is slightly longer or shorter than the nominal one for time spans on the order of the flux averaging interval. Again, in EC we are only concerned with systematic relative errors, for identical errors in the two concerned instruments would entail no misalignment and hence no flux bias.
Instances of each type of misalignment can, and typically will, be present at the same time to various degrees.
1.4 Sources of misalignment and their effects on time series
1.4.1 Spatial separation between sampling volumes
In a SAT, the sampling volume is the volume of air between the upper and lower sets of transducers. Similarly, in an open-path gas analyzer such as the LI-7500 CO2/H2O analyzer and the LI-7700 CH4 analyzer (LI-COR Biosciences Inc., Lincoln, NE, USA), the sampling volume is the volume of air between the upper and lower mirrors. In a closed- or enclosed-path gas analyzer such as the LI-7000, the LI-7200 (LI-COR Biosciences Inc.), and the EC155 (Campbell Scientific Inc., Logan, UT, USA), instead, the sampling volume can be identified with the volume of the intake device, e.g., a rain cup.
Even in the hypothetical situation of perfectly synchronized timestamps for wind and gas data, if the respective instruments' sampling volumes have to be spatially separated to avoid presently intractable flow distortion issues in the anemometer, as is notably the case with open-path setups (see, for example, Wyngaard, 1988; Frank et al., 2016; Grare et al., 2016; Horst et al., 2016; and Huq et al., 2017), the corresponding time series will be affected by misalignment, possibly to varying degrees. Indeed, assuming the validity of Taylor's hypothesis of frozen turbulence, wind and concentration data will be affected by a time lag (the time air takes to travel between the two sampling volumes), which will be further modulated by wind intensity and direction. Additionally, modification of turbulence structure intervening while air parcels transit through the dislocated instrument volumes may introduce further uncertainty in flux estimates (Cheng et al., 2017). In the case of co-located sensors (e.g., Hydra-IV, CEH; IRGASON, Campbell Scientific Inc.) this problem is not present but is replaced by the flow distortion issues mentioned above and not addressed in the present study.
1.4.2 Spatial separation between measuring volumes
In a SAT, the sampling volume coincides with the measuring volume, i.e., wind velocity is measured exactly where it is sampled. The same is true for open-path gas analyzers. However, closed- and enclosed-path analyzers take the sampled air into a measuring cell via a sampling line that can be anywhere between 0.5 and 50 m long, with its inlet usually placed very close to the SAT sampling volume. This implies a delay of the time series of gas concentrations with respect to the wind time series. Such a delay can be more or less constant in time depending on the possibility of actively controlling the sampling line flow rate. In systems without flow controllers, the flow rate may vary significantly in response to power fluctuations or tube clogging and so would the corresponding time lags.
1.4.3 Clock errors
Quartz crystal clocks universally used in electronic devices are subject to two main types of error: periodic jitter and frequency drift.
Period jitter in clock signals is the random error of the clock with respect to its nominal frequency. It is typically caused by thermal noise, power supply variations, loading conditions, device noise, and interference coupled from nearby circuits. Jitter is a source of RTE in time series.
The oscillation frequency of a clock varies with temperature, leading to drifts of the measured time and hence to STEs. The drift of a clock can be expressed as the amount of time gained (or lost) as a result of the drift per unit of time, with suitable units being µs s−1. For example, a drift of −30 µs s−1 means that a clock accumulates 30 µs of delay per second, or about 2.6 s over the course of 1 day (. The dependence of a crystal oscillation frequency on temperature varies, even dramatically, with the type and angle of crystal cut and can be modeled as quadratic (BT, CT, DT cuts) or cubic (AT cuts) (Hewlett Packard, 1997). Figure 2 shows exemplary drift curves for different crystal cuts. Typically, the nominal frequency (e.g., 32 kHz) is specified at 20 or 25 ∘C. Apart from that temperature, the frequency can vary, for example, according to (for a BT cut):
where and typical values of α range between 0.035 and 0.040.
Clocks in EC systems can be exposed to large variations in temperature (day–night, seasonal cycles). Because we are concerned with relative drifts, we are interested in differences in the temperatures experienced by the instruments' sampling/logging clock as well as with differences in their temperature responses. Clocks experiencing similar temperatures and with similar temperature responses, would minimize relative drift. On the contrary, clocks with opposite responses to temperature will result in relative drifts that are close to the sum of the individual drifts, e.g., in the case of AT-cut crystals with different angles of rotation at relatively high temperatures (i.e., above 30 ∘C, see Fig. 2)
It is also to be noted that temperature-compensated clocks do exist, which have accuracies of around ±2 µs s−1. As we will show later, such drifts can be safely neglected, as long as clocks are synced sufficiently often (e.g., once a day). For completeness, we note that clock drifts also occur due to the ageing of components. However, the absolute values of typical ageing rates (< 1 µs year−1) are of no concern in EC applications. Because STEs in EC systems are caused primarily or exclusively by clock drifts, in the rest of the paper we will use the terms STE and drift interchangeably.
1.4.4 Further sources of timing errors in digital asynchronous systems
Ethernet connectivity available in commercial loggers and industrial PCs (e.g., SmartFlux 2 and 3 by LI-COR Biosciences Inc., CR3000 and CR6 by Campbell Scientific Inc.) can be used for data acquisition in EC systems. The acquisition is usually done using the Transmission Control Protocol (TCP), a packet-based protocol specifically designed to preserve the accuracy of the data during transmission. TCP, however, is not designed to preserve the temporal aspect of the packets. The TCP receiving system must buffer up packets and signal the sender if an error occurs in any packet. This can cause packets to even arrive out of order, even though they are always delivered to the application in order. Therefore, TCP-based systems are subject to significant RTEs and, potentially, to STEs.
Serial communication devices typically use a first-in-first-out (FIFO) policy to buffer data, on both the sending and the receiving sides. The FIFO increases the efficiency and throughput by reducing the number of interrupts the CPU has to handle (e.g., Park et al., 2003). Without a FIFO buffer, the CPU has to interrupt for every data unit. With a FIFO, the CPU is interrupted only when a FIFO is full, or a programmed amount of data is ready. However, the FIFO can become a problem on a system where it's desired to correlate the serial clock to the sampling clock. If not properly handled, the FIFO introduces timing jitter on both the transmitter and the receiver, hence inducing RTEs in the system.
In a streaming-based system, an instrument with a time response (irrespective of the supported output rate) slower than the sampling rate will lead to RTEs even in the absence of any clock errors, because the measurement cannot in general be performed at the required moment in time. In the general case, such an instrument will be oversampled (i.e., the same measured value will appear multiple times in the final time series). Eugster and Plüss (2010) discuss in detail the consequences of such an occurrence with a CH4 gas analyzer with a time response of 5.7 Hz in a system sampling data at 20 Hz, concluding that the flux errors are negligible in most applications.
For completeness, we note that in virtually all EC instruments the native measurement is time-discrete. For example, in a nondispersive infrared (NDIR) gas analyzer, the presence of a rotating filter used to multiplex the desired infrared bands makes the gas concentration measurement frequency dependent on the wheel rotational frequency, which leads to RTEs. Nonetheless, if the rotational speed is high enough (e.g., > 100 Hz), the resulting errors are minimal.
1.5 Dealing with timing errors in EC practice
The fundamental difference between time lags on one side and RTE/STE on the other side is that constant time lags can, at least in principle, be addressed a posteriori during data processing. The topic of correctly estimating and compensating time lags has long been discussed in the EC literature (Vickers and Mahrt, 1997; Ibrom et al., 2007; Massman and Ibrom, 2008; Langford et al., 2015), and corresponding algorithms are available in EC processing software. We will therefore not further discuss time lags in this paper.
Random and systematic timing errors, instead, are not identifiable and therefore it is not possible to correct them. However, their effect on flux estimates, as we will see, can become significant. For this reason, the only viable strategy to reduce flux biases is to design the data acquisition system in a way that prevents or minimizes the possibility of their occurrence.
The focus of this paper is, therefore, the quantification of flux underestimations as a function of RTE and STE, so as to derive quantitative specifications for a data acquisition system that minimizes EC flux losses. We further propose a simple scheme for evaluating existing data acquisition systems with respect to data synchronization.
2.1 Simulation design
In order to accurately quantify how time-alignment errors affect flux estimates, we performed a simulation study. As a reference, we used the covariance estimated from high-frequency data of vertical wind speed (w) and sonic temperature (Ts) which are by definition perfectly synchronized since they are computed starting from the same raw data (the trave ling time of sound signals between pairs of transducers in a SAT). We also assumed that high-frequency time series are provided at perfectly constant time steps of 0.1 (10 Hz) or 0.05 (20 Hz) seconds. Subsequently, we manipulated the array of timestamps at which the sonic temperature data were sampled in order to simulate realistic ranges of RTEs and STEs. Values of sonic temperature in correspondence to the new simulated timestamps were estimated by linearly interpolating the closest data points in the original series (Fig. 3).
Before calculating covariances, standard EC processing steps were applied such as spike removal (Vickers and Mahrt, 1997), tilt correction by the double rotation method (Wilczak et al., 2001), and fluctuation estimation via block-averaging. Covariance estimates obtained with the new versions of Ts were then compared with the reference to quantify the effect of simulated timing errors. Flux biases would be almost identical to biases in covariances, weakly modulated by corrections intervening between covariance and flux computation, such as spectral corrections and consideration of air density fluctuation effects (e.g., Fratini et al., 2012). The simulation study was implemented in the source code of EddyPro v6.2.1 (LI-COR Biosciences Inc, Lincoln, NE; Fratini and Mauder, 2014).
For the present analysis, we simulated RTEs ranging ±1 to ±100 ms; that is, up to the same order of the sampling interval. As an example, with a simulated ±10 ms RTE using 10 Hz data (nominal time step = 100 ms), simulated time steps for Ts varied randomly between 90 and 110 ms, with an average of 100 ms. We note that RTEs of 10–100 ms will not usually be caused by clock jitter, which is typically several orders of magnitude smaller but can easily be caused by acquisition systems based on serial or Ethernet communication not specifically designed to collect synchronous time series, as described above.
For STEs, we simulated relative drifts ranging from 10 to 180 µs s−1 (specifically 10, 30, 60, 90, 120, 150, and 180 µs s−1). For example, to simulate a STE of 60 µs s−1 we kept the original w time series (time step equal to 100 ms) and modified the time step of Ts to be 100.006 ms. This may seem a negligible difference, which accumulates a difference of 108 ms between w and Ts within 30 minutes and manifests itself as a difference of one row in the length of the time series (i.e., 18 000 value for w and 17 999 for Ts at 10 Hz). Similarly, systematic errors of 120 and 180 µs s−1 would lead to two and three row differences, respectively.
Systematic timing errors such as frequency drifts effectively act as low-pass filters, which can be described by characterizing their transfer function, provided that the drift is known, as in our simulation. Here, for each 30 min period and for each STE amount, we calculated an in situ transfer function as the frequency-wise ratio of drifted and original w-Ts co-spectra:
where (K) is the simulated sonic temperature for each STE value, fn (Hz) is the natural frequency, and TFEXP(STE|fn) is the in situ transfer function for STE. We repeated this calculation for a number of co-spectra ranging from 1000 to 2000 (depending on data availability). The ensemble of all transfer functions so obtained and for each STE amount was then fitted with the following function, which was found to reasonably approximate the data obtained for all drifts at all sites in the most relevant frequency range (0.01–5 Hz):
where fco (Hz) is the transfer function cutoff frequency and α and β are fitting parameters whose values were found to vary very little around α=2.65 and β=0.25.
We performed simulations on four datasets acquired from EC sites representative of various ecosystem types and climatic regimes and characterized by the different height of measurement and height of the canopy:
IT-Ro2: a deciduous forest of Turkey Oak (Quercus cerris L.) in Italy. Eddy covariance measurements were carried out from 2002 to 2013 and the period used for the simulations was May 2013, when the canopy height was 15 m and the measurement height 18 m (Rey et al., 2002).
IT-Ro4: located at about 1 km from IT-Ro2, is a rotation crop site where EC measurements have been carried out from 2008 to 2014. Data used for the simulations include 43 days in 2012 when crimson clover (Trifolium incarnatum L.) was cultivated (maximum canopy height 60 cm, measurement height of 3.7 m).
DK-Sor: evergreen forest near Sorø, Denmark. EC measurements are performed since 1997: during the period used for the simulation (the entire 2015) the forest was 25 m tall and the EC system placed at 60 m a.g.l. (Pilegaard et al., 2011).
IT-CA3: fast-growing short-rotation coppice of poplar clones planted in 2010, located in Castel d'Asso, Viterbo, Italy. The EC tower was installed at the end of 2011, and measurements were taken until mid-2015. The period used for the simulation included 9 months over the period 2012–2015, with a canopy height ranging 0–5.3 m, and the measurement height between 3 and 5.5 m (Sabbatini et al., 2016).
2.3 Validation of the simulation design
Although the proposed simulation design enables the evaluation of resulting errors using readily available EC data, we note that interpolating data sampled at 10 or 20 Hz frequency can potentially introduce artifacts (due to the lack of information at higher frequencies) such as, for example, an undue reduction of the sonic temperature variance, which would result in artificial reduction of the w-Ts covariance. In order to detect any such effects, we preliminarily implemented a validation procedure, making use of 1 week of sonic data from a Gill HS-100 (Gill Instruments Ltd., Lymington, UK) collected at 100 Hz. The validation involved the following steps:
Subsampling at 10 Hz and simulating timing errors as described above, i.e., interpolating starting from the subsampled data.
Subsampling at 10 Hz and simulating timing errors by interpolating the original 100 Hz data.
Comparing w-Ts covariances obtained in steps 1 and 2.
The timing errors simulated interpolating the original 100 Hz measurements (option 2 above) are much less prone to artifacts because interpolation occurs between data that are 0.01 s apart, an interval too short for any significant flux signal to occur. Using this procedure, we could verify that there is no detectable difference between results obtained with 100 and 10 Hz data (not shown), which implies that the interpolation procedure is not introducing significant artifacts in the estimation of variances and covariances and therefore the simulation can be performed with virtually any historic EC dataset using the available code.
Figure 4 compares covariances w-Ts obtained with increasing amounts of RTE against the reference covariance obtained with the original, perfectly synced, time series. Reduction in covariance estimates is fairly negligible provided that RTE is of the same order of magnitude of the sampling interval or less. Largest discrepancies were observed for the IT-CA3 and IT-Ro4 sites with a covariance underestimation of 3 % for RTE of amplitude 100 ms. As mentioned earlier, such large timing errors are never the result of electronic clock jitter and may instead be caused by a data transmission system not designed for time synchronization, such as TCP.
Conversely, flux biases induced by systematic timing errors are both more significant and more variable. Figure 5 shows that a STE of 60 µs s−1 (1 row of difference in a 30 min file with data collected at 10 Hz) can lead to errors anywhere between 0 % and 4 %, increasing to 1 %–8 % for a STE of 120 µs s−1 (2 rows of difference) and to 1 %–11 % for a STE of 180 µs s−1 (3 rows of difference).
Figure 6a shows an example of the transfer functions derived using the procedure described in Sect. 2.1. The Figure refers to the site IT-CA3, but the filters obtained for the other sites had very similar characteristics, as illustrated in Fig. 6b using the mean cutoff frequencies computed for all sites at each STE amount: the tight ±3.5σ range merely demonstrates that the low-pass filter properties of the STE are independent from the data used to derive it, and only vary with the error amount.
Nonetheless, in Fig. 5 we showed how the same STE leads to very different flux underestimations at a different site. For example, a systematic error of 180 µs s−1 led to flux biases of 1 % and 11 % at IT-Ro2 and IT-Ro4, respectively.
The reason is related to the distribution of the flux contributions across the frequency domain. The more the flux co-spectrum is shifted towards higher frequencies, the more it will be dampened by any given low-pass filter and the higher the resulting flux bias will be. In other words, systematic timing errors are a source of high-frequency spectral losses, not dissimilar to the ones traditionally considered in EC (Moncrieff et al., 1997; Massmann, 2000; Ibrom et al., 2007). Figure 7 depicts the low-pass filtering effects of several STEs as applied to three different hypothetical EC systems, characterized by different “initial” cutoff frequencies (caused by other sources of attenuation such as, for example, length of the sampling line) deployed in two contrasting scenarios (high vs. low measurement height). It is evidenced that at high measurement heights effects are negligible, irrespective of the “original” cutoff frequency of the system (a–c). The reason is that the STE filters act on co-spectra that are shifted to low frequencies and have therefore very low high-frequency content. At low measurement height, instead, STEs significantly increase spectral losses if the system has a high initial cutoff frequency (e–f), while if the system as a poor initial spectral response (d), STEs are irrelevant because, again, high-frequency co-spectral content is minimal to start with.
To put this new source of high-frequency losses in perspective quantitatively, we note that for EC systems based on an enclosed-path gas analyzer (LI-7200), cutoff frequencies ranging from 1.1 Hz (for less optimized) up to 7–8 Hz (for systems with optimized intake rain cup and heated sampling line) were reported in the literature (e.g., Fratini et al., 2012; Aubinet et al., 2016; Metzger et al., 2016). Similar values are usually found in systems based on open-path setups. Significant STEs can thus easily become leading sources of flux biases in modern EC systems deployed at low measurement heights and/or very limited spectral losses due to other causes, with the additional complication that they are hard to detect and quantify. In fact, once acquired and stored in files, it is generally not possible to establish whether a drift between data streams occurred. Missing lines in one data stream could be either filled in by the data acquisition software (e.g., by means of the “last observation carried forward” technique) or could be compensated by dropping one line in the paired, longer series. In both cases, one would be unable to detect the problem, which is, however, obviously not solved by these solutions, meant only to build a complete rectangular dataset. On the contrary, a mismatch of one or two lines in the length of the time series is not necessarily the sign of an occurring STE, as it could also be the result of an imperfect timing in opening/closing a data stream, or some combination of the two factors. For these reasons, it is very difficult, if not practically impossible, to detect STEs, distinguish them from other timing errors or artifacts, and, more importantly, to infer the type and amount of error that is being introduced in the covariances. The only sign of a potential timing problem is an attenuated co-spectrum, as evaluated with respect to an available reference or model. But from the co-spectra attenuation alone, it is impossible to establish the presence of a timing error and, even more, disentangle it from other sources of attenuation. The only possibility is thus to estimate an ensemble spectral correction based on co-spectra, which would correct only sources of errors without the ability to discriminate them, which is less than ideal (e.g., Ibrom et al., 2007). It is therefore advised to evaluate the performance of a data acquisition system before it is put in operation.
Evaluating synchronicity of an EC data acquisition system
If both EC instruments can receive analog inputs, a possible way to evaluate synchronicity in the data logging system is to connect a signal generator to both EC instruments (SAT and gas analyzer) and collect the data via the data logging system in the configuration that would be adopted in normal operation (Fig. 8). The result is two replicates of the known signal data, whose timestamps will in general not be synchronized, in the sense that the same nominal timestamp will be attached to two different pieces of data. The two datasets can then be compared to calculate the phase difference between the clocks and hence assess RTEs and STEs. For example, a cross correlation of the time series yields the phase difference. It's important to test the time series at different time intervals, e.g., 0.5 h for several days, in actual field conditions that undergo significant temperature variations. The two instruments are synchronized if the cross correlation yields the same result every time. If this constant phase offset (time lag) is different from zero, this measurement quantifies signal delays in a system, which can be addressed either by optimizing the data logging system or by taking this offset into account while setting up the time-lag automatic computation in post-processing. A cross correlation that changes over time, instead, is a strong indication of occurring STEs. It is very difficult to anticipate the evolution in time of the phase change as it depends on the clock's crystal cuts, quality, and temperature sensitivity. In general, we may expect a linear trend in the phase if temperatures do not vary strongly (see also later) or a trend modulated by a diurnal pattern if temperature plays a role.
A simpler (though less controlled) option, in case a signal generator is not accessible, is to use the analog outputs from at least one of the EC instruments. In this case the test involves transmitting (at least) one of the analog outputs to the analog input of the companion instrument (e.g., w sent via analog output of the SAT to an analog input channel of the gas analyzer). In this way the raw-data files contain two replicates of that variable, each collected according to the timing of the respective instrument: the timestamps of the digital version are logged according to timing of the sending instrument (SAT, in the example), while those of its analog version are logged according to the timing of the receiver (gas analyzer). The same cross-correlation analyses described above can then be performed.
In both versions of the tests, results may be affected by minor RTEs if the various A/D or D/A tasks are not accurately synchronized with the measurement and serial output tasks. However, such RTEs should not affect the ability to detect and quantify occurring STEs.
Note also that, in both versions of the test, the way raw data are stored may have a strong impact on how to interpret the results. For instance, depending on the specifics of the data acquisition system, collecting a unique file with 3 days worth of data or collecting 30 min files for 3 days can provide different results, e.g., because the act of closing a file and opening a new one can cause the data streams to be partially or completely “realigned”.
To exemplify the test, we collected about 3 days of 20 Hz wind data from a SAT (HS-100, Gill Instruments Ltd., Lymington, UK) both in native digital format (via a RS-232 port, indicated with the subscript d in the following) and in analog format via the A/D of a LI-7550 analyzer interface unit (LI-COR Biosciences Inc.) which was then collected via a second RS-232 port (indicated with subscript a), using an industrial-grade PC running Windows XP. Thus, the data logging system under testing was “a Windows PC collecting EC data via RS-232, which was setup to transmit data in streaming mode”. The two data streams were completely independent to each other, and we attached timestamps to the records based on the operating system clock as the data were made available from the serial port to the application collecting the data. We then merged the two datasets based on timestamps and split the resulting 3-day file into 30 min. Finally, we calculated time lags between pairs of homologous variables (e.g., ud vs. ua, but results were identical for all anemometric variables), which are shown in Fig. 9. The linearity of the data suggests that the system is affected by a fairly constant STE of about 50 µs s−1, as quantified by the slope of the line.
Using the same setup, we further collected 2 days of data directly stored as 30 min files and again computed time lags between homologous variables. The acquisition system was able to realign the two series at the beginning of each half hour resetting the time lag between them to roughly zero. Nevertheless, calculating time lags on overlapping 5 min periods, we found that within each half-hourly period the time lags increased by 0.05 s (32 % of the times) and of 0.1 s (68 % of the times), which again indicates an STE ranging from 30 to 60 µs s−1 which, as shown above (Fig. 5), can lead to detectable flux biases. We stress that the system used in this experiment was not optimized for data acquisition and the aim was solely that of evaluating the proposed test.
Undoubtedly, modern EC systems should log high-frequency data in a native digital format, so as to collect all possible measurement, diagnostic, and status information from each instrument and assure the creation of robust, self-documented datasets, which are essential to the long-term research goals of climate and greenhouse gas science.
Commercial data acquisition solutions exist, that one can legitimately expect to ensure a proper data synchronization, such as the SmartFlux® system by LI-COR or the SDM-based system by CSI. There also exist applications developed by research institutions that specifically address the synchronization issue. In all these cases it is, however, possible to test the synchronization in order to confirm the expected performances.
When dealing with novel gas species, however, assembling EC systems from instrumentation that is not necessarily designed to be integrated is often the only choice available to the researcher and in-house solutions become necessary. In such cases, extreme care and expertise must be used in the handling of different digital data formats and transmission modes, in a context where data synchronicity is essential. We have shown that failure to do so can result in significant biases for the resulting fluxes, which depend on the type of timing error (random or systematic) and its amplitude, as well as on the co-spectral characteristics at the site. We have also explained how such errors are virtually impossible to detect and quantify in historic time series. It is, therefore, necessary to avoid them upfront, via proper design and evaluation of the data logging system.
Deploying a simple testing setup that makes use of equipment usually available to the EC experimentalists, we demonstrated how, for example, a naïve data collection performed asynchronously on a Windows XP industrial PC leads to significant relative drifts among the time series, which is bound to generate flux underestimations. With minor ad hoc adjustments, the same testing setup can be used to evaluate any EC data logging system. While evaluation of existing systems was beyond the scope of our work and we do expect synchronization issues to be more of a risk for in-house solutions, the proposed testing setup for evaluating data synchronization applies equally to in-house and to commercial solutions and we do invite researchers and companies to test their systems.
With this in mind, we recommend the scientific community to promote collaboration and synergy among manufacturers of EC equipment, technological solutions that guarantee sufficient synchronicity do exist – such as Ethernet connectivity deploying the PTP protocol – but, in order to be utilized, they require all instrumentation to be compatible with those technologies, which is not yet the case.
A final note on the data collected until now and largely shared and used in publications. As stated above, it is impossible to detect the presence of a synchronization issue on archived dataset. However, fully digital acquisition in streaming mode started to be largely adopted only recently and this limits the potential impact of the issue on historical data. In addition, as also explained in the results, the effect of a STE acts as a spectral loss and hence it may be (at least partially) compensated for and corrected by in situ spectral corrections based on co-spectra.
The source code for performing the simulation is available in the following public repository: https://github.com/geryatejina/ec_timing_errors_simulation (Fratini, 2018).
The authors declare that they have no conflict of interest.
GF and DP conceived and designed the work. GF carried out all the simulations and data analysis and wrote most of the paper. DP participated in data interpretation, wrote parts of the paper, and provided supervision during preparation of the paper. SS performed the data acquisition test and analyzed the corresponding data of Fig. 9. KE and BR provided the know-how on DSP and data acquisition theoretical aspects and wrote the corresponding sections. GN and DV participated in the conception of the paper and extensively reviewed the paper during preparation. GB provided major contributions and extensively reviewed the paper during the review process.
The authors thank the referees for valuable input that helped improve the focus
and rigor of the paper. Dario Papale, Giacomo Nicolini, and Domenico Vitale thank the ENVRIplus project funded by
the European Union's Horizon 2020 Research and Innovation Programme under
grant agreement 654182 and the RINGO project funded under the same program
under grant agreement 730944. Simone Sabbatini thanks the COOP+ project funded by the
European Union's Horizon 2020 Research and Innovation Programme under grant
agreement no. 654131.
Edited by: Trevor Keenan
Reviewed by: two anonymous referees
Aubinet, M., Grelle, A., Ibrom, A., Rannik, Ü., Moncrieff, J., Foken, T., Kowalski, A., Martin, P., Berbigier, P., Bernhofer, C., Clement, R., Elbers, J., Granier, A., Grünwald, T., Morgenstern, K., Pilegaard, K., Rebmann, C., Snijders, W., Valentini, R., and Vesala, T.: Estimates of the Annual Net Carbon and Water Exchange of Forests: The EUROFLUX Methodology, Adv. Ecol. Res. 30, 113–175, https://doi.org/10.1016/S0065-2504(08)60018-5, 2000.
Aubinet, M., Vesala, T., and Papale, D.: Eddy Covariance: A Practical Guide to Measurement and Data Analysis, Springer, Dordrecht, the Netherlands, Heidelberg, Germany, London, UK, New York, USA, 460 pp., 2012.
Baldocchi, D., Falge, E., Gu, L., Olson, R., Hollinger, D., Running, S., Anthoni, P., Bernhofer, C., Davis, K., Evans, R., Fuentes, J., Goldstein, A., Katul, G., Law, B., Lee, X., Malhi, Y., Meyers, T., Munger, W., Oechel, W., Paw U, K. T., Pilegaard, K., Schmid, H. P., Valentini, R., Verma, S., Vesala, T., Wilson, K., and Wofsy, S.: FLUXNET: A New Tool to Study the Temporal and Spatial Variability of Ecosystem-Scale Carbon Dioxide, Water Vapor, and Energy Flux Densities, B. Am. Meteorol. Soc., 82, 2415–2434, https://doi.org/10.1175/1520-0477(2001)082<2415:FANTTS>2.3.CO;2, 2001.
Barnes, J. R.: Electronic System Design: Interference and Noise Control Techniques, Prentice-Hall Inc., Upper Saddle River, New Jersey, USA, 1987.
Cheng, Y., Sayde, C., Li, Q., Basara, J., Selker, J., Tanner, E., and Gentine, P.: Failure of Taylor's hypothesis in the atmospheric surface layer and its correction for eddy-covariance measurements, Geophys. Res. Lett., 44, 4287–4295, 2017.
Aubinet, M., Joly, L., Loustau, D., De Ligne, A., Chopin, H., Cousin, J., Chauvin, N., Decarpenterie, T., and Gross, P.: Dimensioning IRGA gas sampling systems: laboratory and field experiments, Atmos. Meas. Tech., 9, 1361–1367, https://doi.org/10.5194/amt-9-1361-2016, 2016.
Dobkin, R., Moyal, M., Kolodny A., and Ginosar, R.: Asynchronous Current Mode Serial Communication, IEEE T. VLSI Syst., 18, 1107–1117, 2010.
Eugster, W. and Plüss, P.: A fault-tolerant eddy covariance system for measuring CH4 fluxes, Agr. Forest Meteorol., 150, 841–851, https://doi.org/10.1016/j.agrformet.2009.12.008, 2010.
Foken, T., Aubinet, M., and Leuning, R.: The Eddy Covariance Method, in: Eddy Covariance: A Practical Guide to Measurement and Data Analysis, edited by: Aubinet, M., Vesala, T., and Papale, D., Springer, Dordrecht, the Netherlands, 1–20, https://doi.org/10.1007/978-94-007-2351-1, 2012.
Frank, J. M., Massman, W. J., Swiatek, E., Zimmerman, H. A., and Ewers, B. E.: All sonic anemometers need to correct for transducer and structural shadowing in their velocity measurements, J. Atmos. Ocean. Tech., 33, 149–167, 2016.
Fratini, G.: EC timing errors simulation, available at: https://github.com/geryatejina/ec_timing_errors_simulation, last access: 4 September 2018.
Fratini, G. and Mauder, M.: Towards a consistent eddy-covariance processing: an intercomparison of EddyPro and TK3, Atmos. Meas. Tech., 7, 2273–2281, https://doi.org/10.5194/amt-7-2273-2014, 2014.
Fratini, G., Ibrom, A., Arriga, N., Burba, G., and Papale, D.: Relative humidity effects on water vapour fluxes measured with closed-path eddy-covariance systems with short sampling lines, Agr. Forest Meteorol. 165, 53–63, https://doi.org/10.1016/j.agrformet.2012.05.018, 2012.
Grare, L., Lenain, L., and Melville, W.K.: The influence of wind direction on Campbell Scientific CSAT3 and Gill R3-50 sonic anemometer measurements. J. Atmos. Ocean. Tech., 33, 2477–2497, 2016.
Hewlett Packard: Fundamentals of Quartz Oscillators, Electronic Counters Series, Application Note 200-2, 1997.
Horst, T. W., Vogt, R., and Oncley, S. P.: Measurements of flow distortion within the IRGASON integrated sonic anemometer and CO2∕H2O gas analyser, Bound.-Lay. Meteorol., 160, 1–15, 2016.
Huq, S., De Roo, F., Foken, T., and Mauder, M.: Evaluation of probe-induced flow distortion of Campbell CSAT3 sonic anemometers by numerical simulation, Bound.-Lay. Meteorol., 165, 9–28, 2017.
Ibrom, A., Dellwik, E., Flyvbjerg, H., Jensen, N. O., and Pilegaard, K.: Strong low-pass filtering effects on water vapour flux measurements with closed-path eddy correlation systems, Agr. Forest Meteorol., 147, 140–156, https://doi.org/10.1016/j.agrformet.2007.07.007, 2007.
Kolle, O. and Rebmann, C.: EddySoft Documentation of a Software Package to Acquire and Process Eddy Covariance Data., Technical Reports – Max-Planck-Institut für Biogeochemie, 10, ISSN 1615-7400, 2007.
Langford, B., Acton, W., Ammann, C., Valach, A., and Nemitz, E.: Eddy-covariance data with low signal-to-noise ratio: time-lag determination, uncertainties and limit of detection, Atmos. Meas. Tech., 8, 4197–4213, https://doi.org/10.5194/amt-8-4197-2015, 2015.
Mahmood, A., Exel, R., and Sauter, T.: Delay and Jitter Characterization for Software-Based Clock Synchronization Over WLAN Using PTP, IEEE T. Ind. Inform., 10, 1198–1206, 2014.
Massman, W. J.: A simple method for estimating frequency response corrections for eddy covariance systems, Agr. Forest Meteorol., 104, 185–198, 2000.
Massman, W. J. and Ibrom, A.: Attenuation of concentration fluctuations of water vapor and other trace gases in turbulent tube flow, Atmos. Chem. Phys., 8, 6245–6259, https://doi.org/10.5194/acp-8-6245-2008, 2008.
Metzger, S., Burba, G., Burns, S. P., Blanken, P. D., Li, J., Luo, H., and Zulueta, R. C.: Optimization of an enclosed gas analyzer sampling system for measuring eddy covariance fluxes of H2O and CO2, Atmos. Meas. Tech., 9, 1341–1359, https://doi.org/10.5194/amt-9-1341-2016, 2016.
Moncrieff, J. B., Massheder, J. M., de Bruin, H., Elbers, J., Friborg, T., Heusinkveld, B., Kabat, P., Scott, S., Soegaard, H., and Verhoef, A.: A system to measure surface fluxes of momentum, sensible heat, water vapour and carbon dioxide, J. Hydrol., 188–189, 589–611. https://doi.org/10.1016/S0022-1694(96)03194-0, 1997.
Park J., Mackay S., and Wright E.: Practical Data Communications for Instrumentation and Control, Newnes, Oxford, UK, 400 pp., ISBN 9780750657976, 2003.
Pilegaard, K., Ibrom, A., Courtney, M. S., Hummelshøj, P., and Jensen, N. O.: Increasing net CO2 uptake by a Danish beech forest during the period from 1996 to 2009, Agr. Forest Meteorol. 151, 934–946, https://doi.org/10.1016/j.agrformet.2011.02.013, 2011.
Rey, A., Pegoraro, E., Tedeschi, V., De Parri, I., Jarvis, P. G., and Valentini, R.: Annual variation in soil respiration and its components in a coppice oak forest in Central Italy, Glob. Change Biol., 8, 851–866, 2002.
Sabbatini, S., Arriga, N., Bertolini, T., Castaldi, S., Chiti, T., Consalvo, C., Njakou Djomo, S., Gioli, B., Matteucci, G., and Papale, D.: Greenhouse gas balance of cropland conversion to bioenergy poplar short-rotation coppice, Biogeosciences, 13, 95–113, https://doi.org/10.5194/bg-13-95-2016, 2016.
Vickers, D. and Mahrt, L.: Quality control and flux sampling problems for tower and aircraft data, J. Atmos. Ocean. Tech., 14, 512–526, 1997.
Wilczak, J. M., Oncley, S. P., and Stage, S. A.: Sonic anemometer tilt correction algorithms, Bound.-Lay. Meteorol., 99, 127–150, 2001.
Wyngaard, J. C.: Flow-distortion effects on scalar flux measurements in the surface layer: Implications for sensor design, in: Topics in Micrometeorology. A Festschrift for Arch Dyer, edited by: Hicks, B. B., Springer, Dordrecht, the Netherlands, 1998.