Threatened plant species distribution in Andalusia: per-species GLMs and species richness mapping
Source: vignettes/articles/andalusia.Rmd
Overview
The andalusia dataset contains presence records of
threatened and endemic plant species from Andalusia, Spain — a
biodiversity hotspot in south-western Europe renowned for its
exceptionally high rates of endemism within the Mediterranean Basin.
Records were collected at 400 m resolution in the ETRS89 / UTM zone 30N
coordinate system (EPSG:25830), with species presences spatially thinned
at the raster cell level to remove spatial redundancy. Species with
fewer than 30 records after thinning were excluded.
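Cell-level thinning of this kind can be sketched in base R as follows. This is a minimal illustration on made-up coordinates, not the procedure used to build the dataset: the `records` table, the grid origin at (0, 0), and the lowered `min_records` threshold are all assumptions made for the example.

```r
# Toy occurrence table with hypothetical coordinates in metres.
records <- data.frame(
  species = c("sp_a", "sp_a", "sp_a", "sp_b", "sp_b"),
  x = c(125777.2, 125790.0, 126300.0, 125780.0, 125785.0),
  y = c(4145725, 4145730, 4145725, 4145725, 4145728)
)

cell_size <- 400  # raster resolution in metres, as stated for this dataset

# Index of the grid cell containing each point (grid origin assumed at 0, 0).
records$cell_x <- floor(records$x / cell_size)
records$cell_y <- floor(records$y / cell_size)

# Keep the first record per species and cell; later duplicates are dropped.
is_duplicate <- duplicated(records[, c("species", "cell_x", "cell_y")])
thinned <- records[!is_duplicate, c("species", "x", "y")]

# Drop species with fewer than `min_records` presences after thinning.
min_records <- 2  # the dataset uses 30; lowered here to suit the toy data
counts <- table(thinned$species)
thinned <- thinned[thinned$species %in% names(counts[counts >= min_records]), ]
```

Thinning is done per species, so two different species may still share a cell; only repeated records of the same species within one cell are treated as redundant.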
The dataset pairs occurrence points for these species with randomly sampled background points and includes 20 environmental predictors derived from Landsat imagery, climate surfaces, and a digital elevation model. This structure is well suited to per-species species distribution models (SDMs), multi-species richness mapping, and spatial conservation prioritisation.
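As a rough sketch of the per-species modelling this layout allows, the example below fits a presence/background binomial GLM for one species. It uses a synthetic stand-in table rather than the real data: the `toy` data frame, the `fit_species_glm()` helper, the choice of two predictors, and the coding of background rows as `presence = 0` are all assumptions made for illustration.

```r
set.seed(1)

# Synthetic stand-in mirroring the dataset's layout: a `species` label,
# a 0/1 `presence` flag, and numeric predictors. All values are made up.
n_background <- 200
n_presence <- 60
toy <- data.frame(
  species = c(rep("background", n_background), rep("species_a", n_presence)),
  presence = c(rep(0L, n_background), rep(1L, n_presence)),
  topography_elevation = c(
    runif(n_background, 0, 2000),
    rnorm(n_presence, mean = 1400, sd = 200)
  ),
  rainfall_annual = c(
    runif(n_background, 300, 1200),
    rnorm(n_presence, mean = 900, sd = 100)
  )
)

# Hypothetical helper: fit one presence/background GLM for a single species.
fit_species_glm <- function(data, target) {
  # Keep the target species' presences plus the shared background points.
  subset_data <- data[data$species %in% c(target, "background"), ]
  stats::glm(
    presence ~ topography_elevation + rainfall_annual,
    data = subset_data,
    family = stats::binomial(link = "logit")
  )
}

model <- fit_species_glm(toy, "species_a")

# Fitted values on the 0-1 response scale for the training rows.
suitability <- stats::predict(model, type = "response")
```

Because every species is modelled against the same background points, the same helper could be applied to each species name in turn, which is one possible route to a stacked richness estimate.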
Data Structure
The dataset is an sf data frame with 8666 rows and 23
columns.
andalusia |> dplyr::glimpse()
#> Rows: 8,666
#> Columns: 23
#> $ species <chr> "Adenocarpus_gibbsianus", "Adenocarpus_gibbsian…
#> $ presence <int> 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,…
#> $ landsat_band_1 <dbl> 61.25444, 65.91648, 82.49583, 85.40617, 74.6444…
#> $ landsat_band_2 <dbl> 77.78752, 83.29051, 99.06973, 107.76321, 97.494…
#> $ landsat_band_3 <dbl> 81.18155, 85.66795, 97.28080, 109.72297, 91.556…
#> $ landsat_band_4 <dbl> 141.2575, 144.9496, 158.4905, 130.6584, 102.595…
#> $ landsat_band_5 <dbl> 123.6921, 125.1109, 133.4070, 121.9335, 95.1965…
#> $ landsat_band_6 <dbl> 109.41866, 112.08743, 117.67980, 114.50157, 93.…
#> $ landsat_ndvi <dbl> 27.735855, 27.032965, 24.912889, 7.063870, -3.6…
#> $ rainfall_annual <dbl> 559.7617, 553.3437, 532.9059, 348.0289, 325.503…
#> $ rainfall_summer <dbl> 24.53417, 24.47948, 22.63235, 21.57755, 21.2542…
#> $ solar_radiation_summer <dbl> 7685.383, 7684.105, 7663.054, 7643.000, 7642.30…
#> $ solar_radiation_winter <dbl> 2214.927, 2189.019, 2195.299, 2192.499, 2180.77…
#> $ temperature_summer_max <dbl> 31.98804, 31.98711, 31.96779, 30.20288, 29.4001…
#> $ temperature_summer_min <dbl> 18.30772, 18.30594, 18.53732, 18.83981, 18.8364…
#> $ temperature_winter_max <dbl> 15.64293, 15.62554, 15.99173, 16.47264, 16.4793…
#> $ temperature_winter_min <dbl> 6.451786, 6.395597, 6.865064, 8.208864, 8.53619…
#> $ topography_eastness <dbl> 38.95396, 53.94389, 18.68366, 50.00000, 45.0372…
#> $ topography_elevation <dbl> 159.015488, 159.414963, 89.350151, 0.000000, 0.…
#> $ topography_northness <dbl> 44.40618, 50.61015, 45.43943, 50.00000, 57.4226…
#> $ topography_position <dbl> 9.46080685, 12.75258732, 1.77814519, 0.00000000…
#> $ topography_slope <dbl> 1.0978156, 1.1250243, 2.0473621, 0.0000000, 0.2…
#> $ geometry <POINT [m]> POINT (125777.2 4145725), POINT (125377.2…
The species column identifies each record as a named
species or "background". The table below shows the number
of records per species.
andalusia |>
sf::st_drop_geometry() |>
dplyr::group_by(species) |>
dplyr::summarise(n = dplyr::n()) |>
dplyr::arrange(dplyr::desc(n))
#> # A tibble: 28 × 2
#> species n
#> <chr> <int>
#> 1 background 5598
#> 2 Maytenus_senegalensis_europaea 871
#> 3 Abies_pinsapo 500
#> 4 Linaria_nigricans 344
#> 5 Juniperus_oxycedrus_macrocarpa 165
#> 6 Narcissus_longispathus 98
#> 7 Viola_cazorlensis 98
#> 8 Cynomorium_coccineum 73
#> 9 Quercus_alpestris 73
#> 10 Thymus_albicans 64
#> # ℹ 18 more rows
The map below shows all species presences and background points, distinguished by colour.