IIASA. Laxenburg, Austria
2023-09-28
Website: https://jakubnowosad.com/
Discovering and describing patterns is a vital part of many spatial analysis However, spatial data is gathered in many ways and stored in forms, which requires different approaches to understanding spatial patterns
Discovering and describing spatial patterns is an important part of many geographical studies, and spatial patterns are linked to natural and social processes.
Evaluation of the susceptibility of forest landscapes to agricultural expansion
Bourgoin et al., 2020, 10.1016/j.jag.2019.101958
Reinterpretation of histological images as categorized rasters and their use for disease classification (e.g., liver cancer)
Kendall et al., 2020, 10.1038/s41598-020-74691-9
Spatial patterns can be quantified using landscape metrics (O’Neill et al. 1988; Turner and Gardner 1991; Li and Reynolds 1993; He et al. 2000; Jaeger 2000; Kot i in. 2006; McGarigal 2014).
Software such as FRAGSTATS, GuidosToolbox, or landscapemetrics has proven useful in many scientific studies (> 12,000 citations).
There is a relationship between an area’s pattern composition and configuration and ecosystem characteristics, such as vegetation diversity, animal distributions, and water quality within this area (Hunsaker i Levine, 1995; Fahrig i Nuttle, 2005; Klingbeil i Willig, 2009; Holzschuh et al., 2010; Fahrig et al., 2011; Carrara et al., 2015; Arroyo-Rodŕıguez et al. 2016; Duflot et al., 2017, many others..)
I randomely selected 16 rasters with different proportions of forest (green) areas:
SHDI:
AI:
Type | Landscape-level metrics |
---|---|
Shape | PAFRAG; CONTIG AM; CONTIG RA |
Aggregation | AI; CONTAG; IJI; PLATJ; PD; DIVISION; LPI |
Connectivity | COHESION |
Diversity | SHDI; SIDI; MSIDI; SHEI; SIEI; MSIEI |
PC1:
PC2:
The result allows to distinguish between:
However, there are still some problems here…
PC1:
PC2:
Issues with the PCA approach:
Entropy:
Relative mutual information:
2D parametrization of categorical rasters’ configurations based on two weakly correlated IT metrics groups similar patterns into distinct regions of the parameters space
Land cover data:
Parametrization using two IT metrics:
These metrics still leave some questions open…
Parametrization using two IT metrics:
In recent years, the ideas of analyzing spatial patterns have been extended through an approach called pattern-based spatial analysis (Long in in. 2010; Cardille in in. 2010; Cardille in in. 2012; Jasiewicz i in. 2013; Jasiewicz i in. 2015).
The fundamental idea is to divide data into a large number of smaller areas (local landscapes).
Next, represent each area using a statistical description of the spatial pattern - a spatial signature.
Spatial signatures can be compared using a large number of existing distance or dissimilarity measures (Lin 1991; Cha 2007).
This approach enables spatial analyses such as searching, change detection, clustering, or segmentation.
Most landscape metrics are single numbers representing specific features of a local landscape.
Spatial signatures, on the other hand, are multi-element representations of landscape composition and configuration.
The basic signature is the co-occurrence matrix:
agriculture | forest | grassland | water | |
---|---|---|---|---|
agriculture | 272 | 218 | 4 | 0 |
forest | 218 | 38778 | 32 | 12 |
grassland | 4 | 32 | 16 | 0 |
water | 0 | 12 | 0 | 2 |
A spatial signature should allow simplification to the form of a normalized vector.
272 | 218 | 4 | 0 | 218 | 38778 | 32 | 12 | 4 | 32 | 16 | 0 | 0 | 12 | 0 | 2 |
136 | 218 | 19389 | 4 | 32 | 8 | 0 | 12 | 0 | 1 |
0.0069 | 0.011 | 0.9792 | 0.0002 | 0.0016 | 0.0004 | 0 | 0.0006 | 0 | 0.0001 |
Measuring the distance between two signatures in the form of normalized vectors allows determining dissimilarity between spatial structures.
0.0069 | 0.011 | 0.9792 | 0.0002 | 0.0016 | 0.0004 | 0 | 0.0006 | 0 | 0.0001 |
0.1282 | 0.0609 | 0.8105 | 0.0002 | 0.0002 | 0.0001 | 0 | 0 | 0 | 0 |
\[JSD(A, B) = H(\frac{A + B}{2}) - \frac{1}{2}[H(A) + H(B)]\]
Jensen-Shannon distance between the above rasters: 0.0684
Measuring the distance between two signatures in the form of normalized vectors allows determining dissimilarity between spatial structures.
0.0069 | 0.011 | 0.9792 | 0.0002 | 0.0016 | 0.0004 | 0 | 0 | 0 | 0 | 0 | 0.0006 | 0 | 0 | 0.0001 |
0.2033 | 0.1335 | 0.2944 | 0.1747 | 0.0562 | 0.1307 | 0.0035 | 0.0002 | 0.0004 | 0.0015 | 0.0007 | 0.0005 | 0 | 0 | 0.0005 |
\[JSD(A, B) = H(\frac{A + B}{2}) - \frac{1}{2}[H(A) + H(B)]\]
Jensen-Shannon distance between the above rasters: 0.444
Knowing the distance between spatial signatures can be used in several contexts (Nowosad, 2021, 10.1007/s10980-020-01135-0):
one-to-many
finding similar spatial structures
one-to-one
quantitative assessment of changes in spatial structures
many-to-many
clustering similar spatial structures
Finding areas with similar topography to the Suwalski Landscape Park.
The map above shows that many areas in the Amazon have undergone significant land cover changes between 1992 and 2018.
The challenge now is to determine which areas have changed the most.
Areas with the greatest change have the highest dissimilarity values.
Importantly, changes in both category and spatial configuration are measured.
Areas in Africa with similar spatial structures for two themes have been identified - land cover and landforms.
The quality of each cluster can be assessed using metrics:
Using spatial signatures to compare and evaluate digital soil maps (Rossiter et al., 2022, 10.5194/soil-8-559-2022)
Changes in spatial patterns resulting from the inclusion of small woody elements in land use maps (Golicz et al., 2021, 10.3390/land10101028)
Depending on the problem:
Challenges:
WorldClim version 2.1 climate data
for 1970-2000
CMIP6 downscaled future climate projection for 2061-2080 [model: CNRM-ESM2-1; ssp: “585”]
Minimum temperature (°C)
https://jakubnowosad.com/spquery/
How to find and compare areas with similar spatial patterns in non-categorical rasters (e.g., raster time-series)?
Search
Compare
https://jakubnowosad.com/patternogram/, https://jakubnowosad.com/ecem-2023
How to detect and describe a range of spatial similarity (spatial autocorrelation) for multiple variables?
It can be used to:
https://jakubnowosad.com/supercells/, https://jakubnowosad.com/foss4g-2022/
supercells: an extension of SLIC (Simple Linear Iterative Clustering; Achanta et al. (2012), doi:10.1109/TPAMI.2012.120) that can be applied to non-imagery geospatial rasters that carry:
Segmentation/regionalization: partitioning space into smaller segments while minimizing internal inhomogeneity and maximizing external isolation
A way to improve the output and reduce the cost of segmentation.
Great Britain. WorldClim gridded climate data was normalized to be between 0 and 1.
The goal: to regionalize Great Britain’s climates
Extended SLIC workflow uses the dynamic time warping (DTW) distance function rather than the Euclidean distance.
Extended SLIC: a more homogeneous regionalization.
Original SLIC: more isolated regions.
SLIC | Inhomogeneity | Isolation |
---|---|---|
extended | 0.30 | 0.59 |
original | 0.37 | 0.75 |
The raster of time series compressed from 24 dimensions to three principal components preserving 99% of variability.
Mastodon: fosstodon.org/@nowosad
Website: https://jakubnowosad.com