
Raster and vector data cubes in R

Formal Metadata

Title
Raster and vector data cubes in R
Title of Series
Number of Parts
17
Author
License
CC Attribution 3.0 Germany:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.
Identifiers
Publisher
Release Date
2023
Language
English
Producer
Production Place
Wageningen

Content Metadata

Subject Area
Genre
Abstract
A common challenge with raster datasets is not only that they come in large files (single Sentinel-2 tiles are around 1 GB), but that many of these files, potentially thousands or millions, are needed to cover the area and time period of interest. In 2022, Copernicus, the program that runs all Sentinel satellites, published 160 TB of images per day. This means that the classic pattern of using R, consisting of downloading data to a local disk, loading it into memory, and analysing it, is not going to work. This lecture describes how large spatial and spatiotemporal datasets can be handled with R, with a focus on the packages sf and stars. For practical use, we classify large datasets as too large:
- to fit in working memory,
- to fit on the local hard drive, or
- to download to locally managed infrastructure (such as network attached storage).
These three categories may (today) correspond very roughly to Gigabyte-, Terabyte- and Petabyte-sized datasets. Besides size considerations, access and processing speed also play a role, in particular for larger datasets or interactive applications. Cloud-native geospatial formats are formats optimised for processing on cloud infrastructure, where the costs of computing and storage need to be considered and optimised.
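
As a minimal sketch of the out-of-memory workflow the abstract alludes to, the stars package can open a raster lazily as a proxy object, so that only metadata is read until pixel values are actually needed. The file name below is hypothetical, and this is only one possible approach, not necessarily the exact workflow demonstrated in the lecture.

library(stars)

# proxy = TRUE returns a stars_proxy object: only metadata is read,
# pixel values stay on disk until they are actually needed
# ("sentinel2_tile.tif" is a placeholder file name)
r <- read_stars("sentinel2_tile.tif", proxy = TRUE)

# Data are fetched only when explicitly materialised; downsample reads
# the raster at reduced resolution so it fits in working memory
r_small <- st_as_stars(r, downsample = 10)
plot(r_small)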