Last update: 10/09/2025

Introduction

The collection of field data is an important element of the HydroCORE project. It allows to make an assessment of the state of the Paute river system and to identify patterns in both space and time. It is expected that these conditions change along the path of the Paute river due to a variety of human and natural influences. For instance, the city of Cuenca is situated in the upstream part of the basin, while two reservoirs are present in the downstream part prior to entering the Amazon region.

To characterise the Paute river system, the HydroCORE project foresees a total of eight sampling campaigns over a period of two years. Abiotic conditions are recorded every three months and supplemented with the collection of macroinvertebrates every six months. At the spatial level, the project aims to include 20 reservoir sites and 30 river sites (totaling 50 sampling sites). These also include sites that are part of a reference system in which no artificial reservoirs are present, being the Jubones river system.

The fourth sampling campaign was performed from 06 May 2024 until 29 May 2024. During this period, a total of 53 sampling sites were visited. The results of the abiotic analyses performed both on-site (observations of the field conditions, probe readings) as off-site (nutrient analyses) are reported here in a concise manner.

Map of the sampling sites

A total of 53 sampling sites were visited and distributed as follows: (1) 21 river sites, (2) 21 reservoir sites, and (3) 11 reference system sites. The majority of the river sites (N = 14) is located upstream of the two reservoirs, while the majority of the reservoir sites (N = 15) is located in the largest reservoir (called Mazar). Relatively few reservoir sites (N = 6) were visited in the smaller reservoir (called Amaluza) due to accessibility reasons. There are hardly roads leading to the reservoir due to the steep mountains and a large fraction of the water surface is overcrowded by water hyacinth (E. crassipes). The map shows where the sampling sites are situated:

Figure 1: Map of the sampling locations.

Physicochemical conditions

At every sampling site, a series of water quality parameters was recorded. In addition, water samples were collected in plastic bottles for nutrient analysis in the lab at UCuenca. Within this section, the results of these analyses are presented in a concise manner and further explored through principal component analysis and hierarchical clustering. Through this exploration, similarities and dissimilarities between sampling sites can be observed.

Statistical summary

A first step in the data exploration is looking at the characteristic statistics of the considered variables. This provides an idea of the range of the obtained values and indicates if (and where) data is missing. For this overview, a distinction is made between the parameters measured on-site and off-site.

An overall summary of the parameters that were measured on-site is given in Table 1, showing an altitude range between 915 m and 3105 m above sea level. Dissolved oxygen levels ranged from 4.40 mg/L up to 13.51 mg/L, averaging at 9.44 mg/L (SD = 2.25 mg/L). Also turbidity and chlorophyll a displayed a rather wide range (1.10 NTU up to 203 NTU and 8.17 µg/L up to 69.37 µg/L, respectively), while salinity showed to be more constant.

In addition, Table 1 also contains a summary of the parameters that were measured off-site. Nutrient concentrations below the quantification limit were replaced by the quantification limit (worst-case approach), resulting in similar values for both the minimum first quartile for most nutrients. The highest nutrient levels were 0.94 mg/L nitrate-nitrogen, 0.14 mg/L nitrite-nitrogen, 1.74 mg/L ammonium-nitrogen, 3.28 mg/L total nitrogen, 0.17 mg/L orthophosphate-phosphorus, and 0.54 mg/L total phosphorus.

Table 1: Summary of the on- and off-site variables
Min Q1 Median Q3 Max Mean SD #NA
Altitude [m a.s.l.] 915.00 1985.00 2137.00 2199.00 3105.00 2053.36 436.00 0
Distance 0.35 30.06 45.33 61.36 143.08 53.13 33.67 0
Temperature [oC] 9.52 15.74 17.26 18.89 20.87 17.01 2.40 0
pH [-] 6.10 6.76 7.17 7.40 9.02 7.17 0.56 0
Conductivity [mS/cm] 0.02 0.10 0.16 0.17 0.58 0.14 0.08 0
Salinity [ppt] 0.00 0.00 0.10 0.10 0.30 0.07 0.06 0
Oxygen saturation [%] 47.40 86.40 101.00 116.80 143.60 100.37 23.01 0
Oxygen [mg/L] 4.40 7.79 10.22 11.00 13.51 9.44 2.25 0
Total dissolved solids [g/L] 0.01 0.06 0.11 0.11 0.37 0.09 0.05 0
Oxidation reduction potential [ORPmV] 226.00 278.00 306.00 360.00 436.00 315.49 46.29 0
Turbidity [NTU] 1.10 2.90 6.00 16.40 203.00 22.25 42.71 0
Chlorophyll a [ug/L] 8.17 19.68 29.32 38.97 69.37 29.76 14.25 0
Average velocity [m/s] 0.00 0.00 0.16 0.40 1.17 0.24 0.28 2
TOC [mg/L] 1.29 2.77 3.28 3.65 9.49 3.49 1.44 1
TIC [mg/L] 2.10 6.76 8.83 10.29 28.76 8.73 3.88 1
Total carbon [mg/L] 4.03 10.03 12.29 13.85 35.82 12.22 4.49 1
Nitrate-N [mg/L] 0.50 0.50 0.50 0.50 0.94 0.54 0.09 0
Nitrite-N [mg/L] 0.00 0.00 0.01 0.02 0.14 0.02 0.03 0
Ammonium-N [mg/L] 0.00 0.01 0.30 0.61 1.74 0.35 0.39 0
Orthophosphate-P [mg/L] 0.00 0.00 0.02 0.04 0.17 0.03 0.04 0
Total-N [mg/L] 0.50 0.50 0.99 1.30 3.28 1.01 0.59 0
Total-P [mg/L] 0.01 0.05 0.07 0.13 0.54 0.10 0.10 0

Correlation analysis

The different variables of the previous sections were used as a basis for a correlation analysis (see Figure 2). From this analysis, it is clear that some variables are more correlated than others (as is expected). Salinity, conductivity, and total dissolved solids show a higher-than-average correlation as they more or less reflect the amount of dissolved chemicals in the water column. Also the different nutrients display a higher correlation, especially regarding ammonia-nitrogen and total nitrogen. Nitrate-nitrogen showed a lower correlation with the other nitrogen compounds, which might be related to the relatively low variability in the observed value due to the quantification limit of 0.5 mgN/L used during the analysis (see Table 1).

Figure 2: Correlation analysis of the various variables.

Figure 2: Correlation analysis of the various variables.

Principal component analysis

From previous sections it is clear that various water quality parameters were assessed in the selected sampling sites. These parameters can be analysed individually to identify (dis)similarities between sampling sites, though that will lead to an overly extensive list of observations. As an alternative, dimensionality reduction can be applied on the data by extracting the most informative patterns and representing them as part of a new set of variables. Principal component analysis (PCA) allows to perform such a dimensionality reduction and shows that the first component is mostly determined by the flow and the presence of pollution (nutrients and higher conductivity). The second principal component is mostly determined by altitude and turbidity.

The PCA shows that there is no clear clustering of all sampling sites (see Figure 3), which is partially in line with the observed nutrient levels (see Table 1). Sites that showed discrepancy from the majority of the sites included the Cuenca (code ‘CU’) and Mazar reservoir (code ‘MA’), displaying relatively higher nutrient levels. Also, location CU05 shows a clear difference, mostly due to the relatively high conductivity that was observed.

Figure 3: PCA of the physicochemical data.

Figure 3: PCA of the physicochemical data.

Supervised clustering

In addition to the unsupervised analysis provided by PCA, a supervised cluster analysis can be performed to identify (dis)similarities between sampling sites. These (dis)similarities can be calculated through distances, with the Euclidean distance being the most frequently used. Subsequently, clusters are created based on these distances by using a specific linkage technique. Here, Euclidean distances and a complete linkage is being used.

The resulting hierarchical tree displays the closeness of the different sampling sites and shows that spatially close sites do not always cluster together (see Figure 4). Some closely situated sites do cluster together (e.g., PA08/09, GI03/04, AM05/06), while others end up further away in the tree (e.g., CU02/03, AM04/05). Some overlap with the PCA result can also be seen, including the similarity of the sites located upstream from the city of Cuenca (i.e. MC07, TA08, TA24, TO18, and YA12).

Figure 4: Hierarchical clustering of the sample sites.

Figure 4: Hierarchical clustering of the sample sites.

Greenhouse gases

Aside from the abovementioned physicochemical conditions, additional samples were collected to determine (1) the presence of greenhouse gases in a dissolved state within the water column and (2) the emission of greenhouse gases to the atmosphere. Samples were collected in 12 mL glass vials that were pre-rinsed with Helium and subsequently analysed at ETH Zürich. Within this section, the results of these analyses are presented in a concise manner and further explored through principal component analysis and hierarchical clustering. Through this exploration, similarities and dissimilarities between sampling sites can be observed.

Statistical summary

Similar to the section on the physicochemical conditions, we start with a look at the characteristic statistics of the considered variables. For this overview, a distinction is made between dissolved greenhouse gases and the calculated emissions.

An overall summary is given in Table 2, showing that the level of carbon dioxide in the water column ranged from 20.25 µM up to 291.06 µM, averaging at 77.15 µM (SD = 59.82 µM). Methane levels ranged from 0.07 µM up to 1.49 µM and averaged at 0.22 µM (SD = 0.25 µM). Nitrous oxide levels showed to be relatively uniform and clearly lower than the carbon-based gases.

In addition, Table 2 also contains a summary of the emitted greenhouse gases. Especially interesting here is the negative flux for carbon dioxide (at -460 mg/m²/d), suggesting a local net uptake of carbon dioxide. Also the maximum emission rate for both carbon dioxide (13018 mg/m²/d) and methane (26074 mg/m²/d) are worth mentioning, though they might be caused by ebullition taking place right under the collecting chamber.

Table 2: Summary of the dissolved and emitted greenhouse gases.
Min Q1 Median Q3 Max Mean SD #NA
Nitrous oxide [uM] 0.02 0.02 0.02 0.03 0.06 0.03 0.01 0
Methane [uM] 0.07 0.08 0.10 0.25 1.49 0.22 0.25 0
Carbon dioxide [uM] 20.25 38.39 52.63 94.62 291.06 77.15 59.82 0
Nitrous oxide [mg/m²/d] -0.02 0.03 0.23 0.47 2.23 0.44 0.58 11
Carbon dioxide [mg/m²/d] -460.45 306.78 1213.12 2354.39 13017.76 1865.09 2602.49 8
Methane [mg/m²/d] -21.42 0.24 0.58 5.80 26073.89 560.15 3802.49 6

Correlation analysis

The different variables of the previous sections were used as a basis for a correlation analysis (see Figure 5). As is expected, strong correlations are observed between the dissolved and emitted quantities of methane, though the relationships between the oxygen-based gases are clearly different from previous campaigns. This time, a strong correlation is observed for dissolved nitrous oxide and carbon dioxide, as well as a correlation between the emitted quantities of both gases.

Figure 5: Correlation analysis of the greenhouse gases.

Figure 5: Correlation analysis of the greenhouse gases.

Principal component analysis

In alignment with the physicochemical conditions, dimensionality reduction can be applied. Principal component analysis (PCA) shows that the first component is mostly determined by all greenhouse gases and that the second principal component is additionally determined by methane and the emissions of the oxygen-based gases.

The PCA shows that the majority of sampling sites cluster together (see Figure 6). Sites that showed discrepancy from the majority of the sites included the Amaluza (code ‘AM’) and Mazar (code ‘MA’) reservoir. These sites are characterised by a relatively high emission of the considered greenhouse gases and might mask any further clustering of the other sites.

Figure 6: PCA of the greenhouse gas data.

Figure 6: PCA of the greenhouse gas data.

Supervised clustering

In addition to the unsupervised analysis provided by PCA, the supervised cluster analysis displays the closeness of the different sampling sites and shows that spatially close sites do not always cluster together (see Figure 7). Some closely situated sites do cluster together (e.g., GI04/06, MA21/22, RI01/02), while others end up further away in the tree (e.g., MA11/13, AM04/05, TA08/24). Some overlap with the PCA result can also be seen, including the discrepancy of AM04 and AM41 with the majority of sampling sites.

Figure 7: Hierarchical clustering of the sample sites.

Figure 7: Hierarchical clustering of the sample sites.

Summary

A total of 53 sampling sites were selected and assessed for a variety of water quality parameters and a set of greenhouse gases. Surprisingly, nutrient levels were higher than observed during the first and third sampling campaign (July-August 2023 and February 2024, respectively) and lower than observed during the second campaign (October-November 2023), though nitrate levels were often too low to be quantified. Several sampling sites seem to show different conditions, most of them located near or downstream of urbanisation. For example, the Cuenca (CU02/03) and Paute (PA01/02/03) sites are situated downstream of Cuenca, while the Giron sites are situated in and downstream of the town of Giron. Still, these sites do not show the most extreme levels of greenhouse gases, as several reservoir sites showed relatively high emission rates.

Acknowledgement

We would like to thank D.J. Vimos Lojano and K.P. Ramirez Pozo for their help in collecting the samples in the field as well as their subsequent processing in the lab. We also thank D.G. Zuñiga Villegas for providing us with the necessary transport and M. Barthel for the analyses of the greenhouse gas samples.

Reference

Van Echelpoel, W. (2025). Short report on campaign 04. HydroCORE project. Online: https://wvechelp.github.io/HydroCORE/Reports/ShortReportCampaign04.html

Read more on the HydroCORE webpage.