We're sorry but this page doesn't work properly without JavaScript enabled. Please enable it to continue.
Feedback

A Network Approach for Multiscale Catchment Classification using Traits

00:00

Formal Metadata

Title
A Network Approach for Multiscale Catchment Classification using Traits
Author
License
CC Attribution 4.0 International:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.
Identifiers
Publisher
Release Date
Language
Production Year2024
Production PlaceSan Francisco, CA

Content Metadata

Subject Area
Genre
Abstract
The classification of river catchments into groups with similar biophysical characteristics is useful to understand and predict their hydrological behavior. The increasing availability of remote sensing and other large-scale geospatial datasets have enabled the use of advanced data-driven approaches to classify catchments using traits such as topography, geology, climate, land cover, land use, and human influence. Unsupervised clustering algorithms based on the Euclidean distance are commonly used for trait-based classification, but are not suitable for high dimensional data. In this study we present a new network-based method for multi-scale catchment classification, which can be applied to large datasets and used to determine the traits associated with different catchment groups. In this framework two networks are analyzed in parallel; the first where the nodes are traits, and the second where the nodes are catchments. In both cases, edges represent pairwise similarity and a network cluster detection algorithm is used for the classification. The traits network is used to investigate redundancy in the trait data and to condense this information into a small number of interpretable categories. The catchments network is used to classify the catchments into clusters, and to identify representative catchments for the different groups using the degree centrality metric. We apply this method to classify 9067 river catchments across the contiguous United States at both regional and continental scales using 274 non-categorical traits. At the continental scale, we identify 25 interpretable trait categories and 34 catchment clusters of size greater than 50. We find that catchments with similar trait categories are typically located in the same region, with different spatial patterns emerging among clusters dominated by natural and anthropogenic traits. We also find that the catchment clusters exhibit distinct hydrological behavior based on an analysis of streamflow indices. This network approach provides several advantages over traditional means of classification including better separation of clusters, the use of alternate similarity metrics that are more suitable for high dimensional data, and reducing redundancy in the trait information. The paired catchment-trait networks enables analysis of hydrological behavior using the dominant trait categories for each catchment cluster. The approach can be used at multiple spatial scales, since the network topologies adjust automatically to reflect the trait patterns at the scale of investigation. Finally, the representative catchments identified as hub nodes in the network can be used to guide transferable observational and modeling strategies. The method is broadly applicable beyond hydrology for classification of other complex systems that utilize different types of trait datasets.
Keywords
Computer animationMap
Transcript: English(auto-generated)
In this study, we classify nearly 10,000 river catchments using hundreds of physical and environmental traits. This is a high-dimensional problem where traditional and supervised classification methods based on Euclidean distance, like k-means or hierarchical clustering, degrade in performance.
We developed a new method that uses network science to analyze such high -dimensional data by building two parallel networks after a PCA-based dimensionality reduction. One advantage of this network approach is that we can use alternative metrics, like cosine similarity, which are more suitable than Euclidean distance for this kind of problems.
In the first network, the nodes are 274 traits, which are connected by edges that represent their pairwise similarity. The resulting clusters condense the information into only 25 interpretable trait categories, such as those related to agriculture or human development.
This eliminates the redundancy in the trait information. In the second network, more than 9,000 catchments are split into 34 major classes using the reduced traits categories. We found that the majority of the classes are geographically clustered, which naturally
emerges from the trait-based classification without being provided with any spatial information. By the simultaneous analysis of the two networks, we can characterize each of the catchment classes with a small number of interpretable trait categories. For this, we compute z-scores that represent how different the traits are
for a particular catchment class relative to all the catchments in the dataset. Traits that are over-expressed have positive z-scores, while those that are under-expressed have negative z-scores. In this example, catchment near urban environments are associated with an over-expression of traits corresponding to developed areas. We found that catchments within different classes display distinct hydrological behaviors, which now can be associated with their traits.
Here, for instance, the propensity for catchment to have moderate flooding is associated with the presence of developed areas. This approach can be used on multiple spatial scales since the network topologies are just to reflect the trait patterns at the investigation scale.
And finally, the representative catchments identified as hub nodes in the network can be used to guide transferable, observational, and modeling strengths.