A Network Approach for Multiscale Catchment Classification using Traits
This is a modal window.
The media could not be loaded, either because the server or network failed or because the format is not supported.
Formal Metadata
Title |
| |
Author | ||
License | CC Attribution 4.0 International: You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor. | |
Identifiers | 10.5446/66892 (DOI) | |
Publisher | ||
Release Date | ||
Language | ||
Production Year | 2024 | |
Production Place | San Francisco, CA |
Content Metadata
Subject Area | ||
Genre | ||
Abstract |
| |
Keywords |
00:00
Computer animationMap
Transcript: English(auto-generated)
00:03
In this study, we classify nearly 10,000 river catchments using hundreds of physical and environmental traits. This is a high-dimensional problem where traditional and supervised classification methods based on Euclidean distance, like k-means or hierarchical clustering, degrade in performance.
00:21
We developed a new method that uses network science to analyze such high -dimensional data by building two parallel networks after a PCA-based dimensionality reduction. One advantage of this network approach is that we can use alternative metrics, like cosine similarity, which are more suitable than Euclidean distance for this kind of problems.
00:43
In the first network, the nodes are 274 traits, which are connected by edges that represent their pairwise similarity. The resulting clusters condense the information into only 25 interpretable trait categories, such as those related to agriculture or human development.
01:03
This eliminates the redundancy in the trait information. In the second network, more than 9,000 catchments are split into 34 major classes using the reduced traits categories. We found that the majority of the classes are geographically clustered, which naturally
01:20
emerges from the trait-based classification without being provided with any spatial information. By the simultaneous analysis of the two networks, we can characterize each of the catchment classes with a small number of interpretable trait categories. For this, we compute z-scores that represent how different the traits are
01:40
for a particular catchment class relative to all the catchments in the dataset. Traits that are over-expressed have positive z-scores, while those that are under-expressed have negative z-scores. In this example, catchment near urban environments are associated with an over-expression of traits corresponding to developed areas. We found that catchments within different classes display distinct hydrological behaviors, which now can be associated with their traits.
02:09
Here, for instance, the propensity for catchment to have moderate flooding is associated with the presence of developed areas. This approach can be used on multiple spatial scales since the network topologies are just to reflect the trait patterns at the investigation scale.
02:25
And finally, the representative catchments identified as hub nodes in the network can be used to guide transferable, observational, and modeling strengths.