General usage

Input dataset

As a starting point, lets assume we have the following dataset, which was generated based on a harmonic model with some random component:

NetCDF format
One file per day
Each file includes two variables “VAR1” and “VAR2”
Half of the files are from the sensor “X1” and the other half from the sensor “X2”
“X1” files have the data version “V1”, “X2” files have the data version “V5”
The CRS of the data is the Equi7Grid projection of the European continent with a sampling of 24km
The files are tiled into four adjacent tiles, each with a coverage of 600x600km

First, we need collect all file paths (around 3000) we want to put into our datacube. How to achieve this is up to you, but you can also use geopathfinder to conveniently gather files matching a certain file naming convention. Since our data is stored in one folder and we want to create a datacube of the whole dataset, we can use a simpler approach:

[2]:

import os
import glob
import pprint
import numpy as np

ds_path = r"D:\data\code\yeoda\2022_08__docs\general_usage"
filepaths = glob.glob(os.path.join(ds_path, "*"))
np.array(filepaths)

[2]:

array(['D:\\data\\code\\yeoda\\2022_08__docs\\general_usage\\DVAR_20000101T000000____E042N012T6_EU024KM_V1_X1.nc',
       'D:\\data\\code\\yeoda\\2022_08__docs\\general_usage\\DVAR_20000101T000000____E042N012T6_EU024KM_V5_X2.nc',
       'D:\\data\\code\\yeoda\\2022_08__docs\\general_usage\\DVAR_20000101T000000____E042N018T6_EU024KM_V1_X1.nc',
       ...,
       'D:\\data\\code\\yeoda\\2022_08__docs\\general_usage\\DVAR_20001230T000000____E048N012T6_EU024KM_V5_X2.nc',
       'D:\\data\\code\\yeoda\\2022_08__docs\\general_usage\\DVAR_20001230T000000____E048N018T6_EU024KM_V1_X1.nc',
       'D:\\data\\code\\yeoda\\2022_08__docs\\general_usage\\DVAR_20001230T000000____E048N018T6_EU024KM_V5_X2.nc'],
      dtype='<U98')

Datacube generation

As already brought up in the general package description, yeoda offers two basic datacube classes, DataCubeReader and DataCubeWriter. Both inherit from DataCube, which consists of the following essential components:

File register: A data frame managing a stack/list of files containing the following columns:
- “filepath”: Full system paths to the files.
- stack_dimension (defaults to “layer_id”): Specifies an ID to which layer a file belongs to, e.g. a layer counter or a timestamp.
- tile_dimension (defaults to “tile_id”): Tile name or ID to which tile a file belongs to.
Mosaic geometry: An instance of MosaicGeometry (or a child class) managing the spatial properties or representing the actual mosaic/grid of the files
Name of the tile (tile_dimension) and stack dimension (stack_dimension)

Initialising a DataCubeReader or DataCubeWriter object with a file register is most flexible, but can get tedious if one needs to create a complex data frame by hand. Therefore, both classes offer several class methods to quickly create a datacube instance covering most aspects when working with geospatial data. In the background, a datacube instance keeps a reference to a data format specific reader or writer class defined in veranda, which actually do all the magic in terms of data IO.

To be able to start playing around with a datacube object representing our prepared dataset, we can use the from_filepaths() method of DataCubeReader. This method tries to extract file-specific dimensions based on a certain filenaming convention (defined by the user or already pre-defined in geopathfinder) from the files and generates a file register, on-the-fly. Fortunately, the names of our files follow an existing file naming convention, namely YeodaFilename, which represents a generic file naming convention for EO data. If this would not have been the case, one can use geopathfinder’s SmartFilename class to create a new naming convention from scratch. According to this naming convention we can also define a set of dimension names we are interested in, i.e. the name of the stack/temporal and tile dimension, the variable, sensor, and data version dimension:

[3]:

from yeoda.datacube import DataCubeReader
from geopathfinder.naming_conventions.yeoda_naming import YeodaFilename

dimensions = ["time", "tile_name", "var_name", "sensor_field", "data_version"]
dc_reader = DataCubeReader.from_filepaths(filepaths, fn_class=YeodaFilename, dimensions=dimensions,
                                          stack_dimension="time", tile_dimension="tile_name")
dc_reader

[3]:

DataCubeReader -> NetCdfReader(time, MosaicGeometry):

                                               filepath   tile_name var_name  \
0     D:\data\code\yeoda\2022_08__docs\general_usage...  E042N012T6     DVAR
1     D:\data\code\yeoda\2022_08__docs\general_usage...  E042N012T6     DVAR
2     D:\data\code\yeoda\2022_08__docs\general_usage...  E042N018T6     DVAR
3     D:\data\code\yeoda\2022_08__docs\general_usage...  E042N018T6     DVAR
4     D:\data\code\yeoda\2022_08__docs\general_usage...  E048N012T6     DVAR
...                                                 ...         ...      ...
2915  D:\data\code\yeoda\2022_08__docs\general_usage...  E042N018T6     DVAR
2916  D:\data\code\yeoda\2022_08__docs\general_usage...  E048N012T6     DVAR
2917  D:\data\code\yeoda\2022_08__docs\general_usage...  E048N012T6     DVAR
2918  D:\data\code\yeoda\2022_08__docs\general_usage...  E048N018T6     DVAR
2919  D:\data\code\yeoda\2022_08__docs\general_usage...  E048N018T6     DVAR

     data_version sensor_field       time
0              V1           X1 2000-01-01
1              V5           X2 2000-01-01
2              V1           X1 2000-01-01
3              V5           X2 2000-01-01
4              V1           X1 2000-01-01
...           ...          ...        ...
2915           V5           X2 2000-12-30
2916           V1           X1 2000-12-30
2917           V5           X2 2000-12-30
2918           V1           X1 2000-12-30
2919           V5           X2 2000-12-30

[2920 rows x 6 columns]

What we can already directly see from the datacube’s print representation is the chosen reader class from veranda (i.e. NetCDFReader) with its dependent stack dimension (“time”) and associated mosaic class (MosaicGeometry). Additionally, the file register is shown, which reveals the decoded file naming parts of each file.

Datacube properties

Our created datacube object has several properties to inspect. For instance, we can take a look at the mosaic to get an impression what spatial extent is covered by the dataset.

[4]:

plot_extent = dc_reader.mosaic.outer_extent
extent_bfr = 200e3
plot_extent = [plot_extent[0] - extent_bfr, plot_extent[1] - extent_bfr,
               plot_extent[2] + extent_bfr, plot_extent[3] + extent_bfr]
dc_reader.mosaic.plot(label_tiles=True, extent=plot_extent)

[4]:

<GeoAxesSubplot:>

../_images/notebooks_general_usage_6_1.png

For further details about the mosaic, please take a look at geospade’s documentation.

It is also possible to directly access the file register,

[5]:

dc_reader.file_register

[5]:

	filepath	tile_name	var_name	data_version	sensor_field	time
0	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-01-01
1	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V5	X2	2000-01-01
2	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V1	X1	2000-01-01
3	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-01-01
4	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-01-01
...	...	...	...	...	...	...
2915	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-12-30
2916	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-12-30
2917	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V5	X2	2000-12-30
2918	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N018T6	DVAR	V1	X1	2000-12-30
2919	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N018T6	DVAR	V5	X2	2000-12-30

2920 rows × 6 columns

the number of tiles,

[6]:

dc_reader.n_tiles

[6]:

the file paths,

[7]:

np.array(dc_reader.filepaths)

[7]:

array(['D:\\data\\code\\yeoda\\2022_08__docs\\general_usage\\DVAR_20000503T000000____E048N012T6_EU024KM_V1_X1.nc',
       'D:\\data\\code\\yeoda\\2022_08__docs\\general_usage\\DVAR_20000826T000000____E048N012T6_EU024KM_V1_X1.nc',
       'D:\\data\\code\\yeoda\\2022_08__docs\\general_usage\\DVAR_20000625T000000____E042N012T6_EU024KM_V5_X2.nc',
       ...,
       'D:\\data\\code\\yeoda\\2022_08__docs\\general_usage\\DVAR_20001028T000000____E048N018T6_EU024KM_V5_X2.nc',
       'D:\\data\\code\\yeoda\\2022_08__docs\\general_usage\\DVAR_20000516T000000____E048N012T6_EU024KM_V5_X2.nc',
       'D:\\data\\code\\yeoda\\2022_08__docs\\general_usage\\DVAR_20000329T000000____E048N018T6_EU024KM_V5_X2.nc'],
      dtype='<U98')

the datacube dimensions,

[8]:

dc_reader.dimensions

[8]:

['tile_name', 'var_name', 'data_version', 'sensor_field', 'time']

and finally the actual (loaded) xarray data of the files on disk.

[9]:

dc_reader.data_view

Since we did not read any data so far, this class attribute is none. The same holds true for a RasterGeometry instance associated with loaded data:

[10]:

dc_reader.data_geom

To access actual coordinates along a certain dimension, a datacube instance supports indexing by means of the dimension name:

[11]:

dc_reader['time']

[11]:

0      2000-01-01
1      2000-01-01
2      2000-01-01
3      2000-01-01
4      2000-01-01
          ...
2915   2000-12-30
2916   2000-12-30
2917   2000-12-30
2918   2000-12-30
2919   2000-12-30
Name: time, Length: 2920, dtype: datetime64[ns]

In the next section you will learn how to select a subset of a datacube based on the file naming convention of the files. After performing such operations it is always a good idea to check the content of the file register. To quickly check if your selections led to an empty datacube, you can make use of another class property:

[12]:

dc_reader.is_empty

[12]:

False

File-specific selections

The most fundamental function for selecting a subset of the datacube is select_by_dimension(). It requires an expression, i.e. a function with one input argument (which will be replaced by a pd.Series containing the coordinate values along the specific dimension) linked to formula returning a boolean value if a row/coordinate should be selected or not, and the name of the dimension of interest.

Most datacube methods have a boolean key-word inplace defining if the current datacube instance should be modified or or a new one should be returned.

In the example below we are only interested in data from the sensor “X1”.

[13]:

dc_sel = dc_reader.select_by_dimension(lambda s: s == "X1", name="sensor_field")
dc_sel.file_register

[13]:

	filepath	tile_name	var_name	data_version	sensor_field	time
0	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-01-01
2	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V1	X1	2000-01-01
4	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-01-01
6	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N018T6	DVAR	V1	X1	2000-01-01
8	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-01-02
...	...	...	...	...	...	...
2910	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N018T6	DVAR	V1	X1	2000-12-29
2912	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-12-30
2914	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V1	X1	2000-12-30
2916	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-12-30
2918	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N018T6	DVAR	V1	X1	2000-12-30

1460 rows × 6 columns

More complex expressions are also possible, e.g. when selecting data/files within a specific date range.

[14]:

import datetime
start_time, end_time = datetime.datetime(2000, 4, 1), datetime.datetime(2000, 5, 1)
dc_sel = dc_reader.select_by_dimension(lambda t: (t >= start_time) & (t <= end_time))
dc_sel.file_register

[14]:

	filepath	tile_name	var_name	data_version	sensor_field	time
728	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-01
729	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V5	X2	2000-04-01
730	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V1	X1	2000-04-01
731	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-04-01
732	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-04-01
...	...	...	...	...	...	...
971	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-05-01
972	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-05-01
973	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V5	X2	2000-05-01
974	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N018T6	DVAR	V1	X1	2000-05-01
975	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N018T6	DVAR	V5	X2	2000-05-01

248 rows × 6 columns

Another way to filter certain file paths is select_files_with_pattern(), which applies a regex on the file names of the file register.

[15]:

dc_sel = dc_reader.select_files_with_pattern(".*E042N018T6.*X2.*")
dc_sel.file_register

[15]:

	filepath	tile_name	var_name	data_version	sensor_field	time
3	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-01-01
11	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-01-02
19	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-01-03
27	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-01-04
35	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-01-05
...	...	...	...	...	...	...
2883	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-12-26
2891	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-12-27
2899	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-12-28
2907	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-12-29
2915	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-12-30

365 rows × 6 columns

[16]:

dc_sel = dc_reader.select_tiles(["E042N018T6", "E048N012T6"])
dc_sel.file_register

[16]:

	filepath	tile_name	var_name	data_version	sensor_field	time
2	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V1	X1	2000-01-01
3	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-01-01
4	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-01-01
5	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V5	X2	2000-01-01
10	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V1	X1	2000-01-02
...	...	...	...	...	...	...
2909	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V5	X2	2000-12-29
2914	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V1	X1	2000-12-30
2915	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-12-30
2916	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-12-30
2917	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V5	X2	2000-12-30

1460 rows × 6 columns

Note that also the mosaic was filtered in the background:

[17]:

dc_sel.mosaic.plot(label_tiles=True, extent=plot_extent)

[17]:

<GeoAxesSubplot:>

../_images/notebooks_general_usage_31_1.png

Spatial selections

In the previous section we have seen how we can subset our datacube to select the data we actually want to work with. The last example introduced a first way how to perform a spatial selection of certain tiles, also affecting the properties of the mosaic.

Yet, one is most often interested in regions being much smaller than a tile or having a different spatial setting, e.g. a certain location, a bounding box, a non-rectangular region delineated by a polygon, or even an area crossing tile boundaries. yeoda’s datacube instances allow to perform all of those operations in a fluent manner, without touching any data. The following spatial selection methods expect the actual geometry of selection as positional arguments, and a SpatialRef instance as an optional key-word if the CRS of the geometry differs from the mosaic of the datacube.

Coordinate selection

If one is interested to retrieve a single time-series from the datacube, one can execute the following command.

[18]:

from geospade.crs import SpatialRef

lat, lon = 48.21, 16.37 # centre of Vienna
sref = SpatialRef(4326)
dc_sel = dc_reader.select_xy(lat, lon, sref=sref)
dc_sel.file_register

[18]:

	filepath	tile_name	var_name	data_version	sensor_field	time
4	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-01-01
5	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V5	X2	2000-01-01
12	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-01-02
13	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V5	X2	2000-01-02
20	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-01-03
...	...	...	...	...	...	...
2901	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V5	X2	2000-12-28
2908	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-12-29
2909	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V5	X2	2000-12-29
2916	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-12-30
2917	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V5	X2	2000-12-30

730 rows × 6 columns

As we can see, the file register only contains a reference to one tile, which covers our point of interest.

Bounding box selection

If we are interested in data covering a larger area, we can extend our single-point location to a bounding box.

[19]:

bbox = [(48.22, 7.1), (49.36, 9,15)] # bounding box around Strasbourg and Karlsruhe
dc_sel = dc_reader.select_bbox(bbox, sref=sref)
dc_sel.file_register

[19]:

	filepath	tile_name	var_name	data_version	sensor_field	time
0	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-01-01
1	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V5	X2	2000-01-01
2	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V1	X1	2000-01-01
3	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-01-01
8	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-01-02
...	...	...	...	...	...	...
2907	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-12-29
2912	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-12-30
2913	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V5	X2	2000-12-30
2914	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V1	X1	2000-12-30
2915	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-12-30

1460 rows × 6 columns

From the file register we can identify, that the bounding box crosses the border of the two western tiles.

Pixel window selection

Another possibility to read from a rectangular region of interest is to use the select_px_window() method, which accepts the upper-left row and column index as positional arguments and the extent of the pixel window with the keywords height and width.

The mosaic will only be sliced if it contains one tile to prevent ambiguities in terms of the definition of the pixel window. Otherwise the original datacube/mosaic will be returned.

[20]:

dc_sel = dc_reader.select_tiles(["E042N018T6"])
dc_sel.select_px_window(10, 5, height=6, width=5, inplace=True)
dc_sel.mosaic.tiles[0].shape

[20]:

(6, 5)

Polygon selection

As a final and most complex spatial selection example, one can also define a polygon (shapely or ogr polygon) as a region of interest.

[21]:

from shapely.geometry import Polygon
polygon = Polygon(((4878945, 1835736),
                   (4927809, 1723051),
                   (4694460, 1730031),
                   (4776232, 1888589))) # given in native units; covers a region in southern Germany intersecting with all tiles
dc_roi = dc_reader.select_polygon(polygon)

ax = dc_reader.mosaic.plot(label_tiles=True, extent=plot_extent, alpha=0.3)
dc_roi.mosaic.plot(label_tiles=True, ax=ax, extent=plot_extent)
ax.plot(*polygon.exterior.coords.xy, color='b', alpha=0.7, label='Region of interest')

# this is how the actual data window would look like, i.e. the spatial coverage of the data read from disk
data_window_xs = dc_roi.mosaic.outer_extent[:1] * 2 + dc_roi.mosaic.outer_extent[2:3] * 2
data_window_ys = dc_roi.mosaic.outer_extent[1],  dc_roi.mosaic.outer_extent[3], dc_roi.mosaic.outer_extent[3], dc_roi.mosaic.outer_extent[1]
ax.plot(data_window_xs, data_window_ys, color='g', alpha=0.7, label='Data window')

ax.legend(ncol=2)

[21]:

<matplotlib.legend.Legend at 0x2d1664a1670>

../_images/notebooks_general_usage_39_1.png

The plot above shows our region of interest defined as a polygon intersecting with all four tiles. The mosaic of dc_sel is now decoupled from the naming scheme of the original mosaic and contains four irregularly sliced tiles correspoding to the bounding box of the intersection figure of the polygon and each tile. Later on we will learn how to actually read the data from disk, whose outer pixel extent would be in alignment with the green bounding box.

Other datacube operations

In addition to the selection methods, a yeoda datacube has also a rich set of functions to manage and modify the datacube as needed.

Renaming a dimension

If one needs to work with a pre-defined naming convention, but is not happy with the name of a certain dimension/column of the file register, one can use the rename_dimensions() function mapping old to new dimension names.

[22]:

dc_rnmd = dc_reader.rename_dimensions({'sensor_field': 'sensor'})
dc_rnmd.file_register

[22]:

	filepath	tile_name	var_name	data_version	sensor	time
0	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-01-01
1	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V5	X2	2000-01-01
2	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V1	X1	2000-01-01
3	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-01-01
4	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-01-01
...	...	...	...	...	...	...
2915	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-12-30
2916	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-12-30
2917	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V5	X2	2000-12-30
2918	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N018T6	DVAR	V1	X1	2000-12-30
2919	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N018T6	DVAR	V5	X2	2000-12-30

2920 rows × 6 columns

Adding a new dimension

Sometimes it can be helpful to add one’s own dimension plus coordinate values to a datacube, e.g. file-specific properties like file size, quality flags, etc.

[23]:

dim_values = np.random.rand(len(dc_reader))
dc_ext = dc_reader.add_dimension("value", dim_values)
dc_ext.file_register

[23]:

	filepath	tile_name	var_name	data_version	sensor_field	time	value
0	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-01-01	0.923070
1	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V5	X2	2000-01-01	0.784735
2	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V1	X1	2000-01-01	0.526505
3	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-01-01	0.091429
4	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-01-01	0.696233
...	...	...	...	...	...	...	...
2915	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-12-30	0.983319
2916	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-12-30	0.622559
2917	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V5	X2	2000-12-30	0.416412
2918	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N018T6	DVAR	V1	X1	2000-12-30	0.774230
2919	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N018T6	DVAR	V5	X2	2000-12-30	0.375876

2920 rows × 7 columns

Sorting a dimension

As soon as you read data, each file will be accessed in the order given by file register. If you want to change this, i.e. to sort the files along a specific dimension, then you can use sort_by_dimension().

[24]:

dc_srtd = dc_ext.sort_by_dimension('value', ascending=True)
dc_srtd.file_register

[24]:

	filepath	tile_name	var_name	data_version	sensor_field	time	value
1732	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-08-04	0.001752
219	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-01-28	0.002179
2750	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N018T6	DVAR	V1	X1	2000-12-09	0.002296
291	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-02-06	0.002862
1913	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V5	X2	2000-08-27	0.003018
...	...	...	...	...	...	...	...
2851	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-12-22	0.998389
805	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V5	X2	2000-04-10	0.998591
1309	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V5	X2	2000-06-12	0.998791
2758	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N018T6	DVAR	V1	X1	2000-12-10	0.998919
1349	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V5	X2	2000-06-17	0.999556

2920 rows × 7 columns

Split up datacube by a dimension

Similar to the file-based selections, one can also use yeoda’s split functions to retrieve multiple datacubes from pre-defined coordinate intervals instead of writing multiple select statements. One function to do this is split_by_dimension.

[25]:

dc_sensors = dc_reader.split_by_dimension([lambda s: s == 'X1', lambda s: s == 'X2'], name='sensor_field')
dc_sensors[1]

[25]:

DataCubeReader -> NetCdfReader(time, MosaicGeometry):

                                               filepath   tile_name var_name  \
1     D:\data\code\yeoda\2022_08__docs\general_usage...  E042N012T6     DVAR
3     D:\data\code\yeoda\2022_08__docs\general_usage...  E042N018T6     DVAR
5     D:\data\code\yeoda\2022_08__docs\general_usage...  E048N012T6     DVAR
7     D:\data\code\yeoda\2022_08__docs\general_usage...  E048N018T6     DVAR
9     D:\data\code\yeoda\2022_08__docs\general_usage...  E042N012T6     DVAR
...                                                 ...         ...      ...
2911  D:\data\code\yeoda\2022_08__docs\general_usage...  E048N018T6     DVAR
2913  D:\data\code\yeoda\2022_08__docs\general_usage...  E042N012T6     DVAR
2915  D:\data\code\yeoda\2022_08__docs\general_usage...  E042N018T6     DVAR
2917  D:\data\code\yeoda\2022_08__docs\general_usage...  E048N012T6     DVAR
2919  D:\data\code\yeoda\2022_08__docs\general_usage...  E048N018T6     DVAR

     data_version sensor_field       time
1              V5           X2 2000-01-01
3              V5           X2 2000-01-01
5              V5           X2 2000-01-01
7              V5           X2 2000-01-01
9              V5           X2 2000-01-02
...           ...          ...        ...
2911           V5           X2 2000-12-29
2913           V5           X2 2000-12-30
2915           V5           X2 2000-12-30
2917           V5           X2 2000-12-30
2919           V5           X2 2000-12-30

[1460 rows x 6 columns]

Now we have two datacubes, one for each sensor.

Split up datacube temporally

Another handy function is split_by_temporal_freq(), which splits up the datacube into several smaller ones according to the specified temporal frequency identifier (see pandas DateOffset objects).

By default the stack dimension is assumed to be temporal. If not, you need to specify a different dimension with the key-word name.

[26]:

dc_months = dc_reader.split_by_temporal_freq('M')
dc_months[1]

[26]:

DataCubeReader -> NetCdfReader(time, MosaicGeometry):

                                              filepath   tile_name var_name  \
248  D:\data\code\yeoda\2022_08__docs\general_usage...  E042N012T6     DVAR
249  D:\data\code\yeoda\2022_08__docs\general_usage...  E042N012T6     DVAR
250  D:\data\code\yeoda\2022_08__docs\general_usage...  E042N018T6     DVAR
251  D:\data\code\yeoda\2022_08__docs\general_usage...  E042N018T6     DVAR
252  D:\data\code\yeoda\2022_08__docs\general_usage...  E048N012T6     DVAR
..                                                 ...         ...      ...
475  D:\data\code\yeoda\2022_08__docs\general_usage...  E042N018T6     DVAR
476  D:\data\code\yeoda\2022_08__docs\general_usage...  E048N012T6     DVAR
477  D:\data\code\yeoda\2022_08__docs\general_usage...  E048N012T6     DVAR
478  D:\data\code\yeoda\2022_08__docs\general_usage...  E048N018T6     DVAR
479  D:\data\code\yeoda\2022_08__docs\general_usage...  E048N018T6     DVAR

    data_version sensor_field       time
248           V1           X1 2000-02-01
249           V5           X2 2000-02-01
250           V1           X1 2000-02-01
251           V5           X2 2000-02-01
252           V1           X1 2000-02-01
..           ...          ...        ...
475           V5           X2 2000-02-29
476           V1           X1 2000-02-29
477           V5           X2 2000-02-29
478           V1           X1 2000-02-29
479           V5           X2 2000-02-29

[232 rows x 6 columns]

Copying a datacube

Sometimes it might be useful to create a (deep-)copy of a datacube object.

[27]:

dc_cloned = dc_reader.clone()
dc_cloned.select_by_dimension(lambda v: v == 'V1', name='data_version', inplace=True)
print(f"Length of original datacube vs. cloned one: {len(dc_reader)} vs. {len(dc_cloned)}")

Length of original datacube vs. cloned one: 2920 vs. 1460

Multi-datacube operations

All aforementioned operations have only concerned one datacube object so far, but it is also possible to interact with two or more datacube instances.

Datacube union

If you have two datacube objects with a different set of dimensions or entries and want to unite (outer join) them, you can call unite. The following example demonstrates a union operation of two datacubes having different entries but the same dimensions.

[28]:

dc_apr = dc_months[3]
dc_sept = dc_months[8]
dc_united = dc_apr.unite(dc_sept)
dc_united.file_register

[28]:

	filepath	tile_name	var_name	data_version	sensor_field	time
0	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-01
1	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V5	X2	2000-04-01
2	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V1	X1	2000-04-01
3	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-04-01
4	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-04-01
...	...	...	...	...	...	...
475	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-09-30
476	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-09-30
477	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V5	X2	2000-09-30
478	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N018T6	DVAR	V1	X1	2000-09-30
479	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N018T6	DVAR	V5	X2	2000-09-30

480 rows × 6 columns

Another example would be if two datacubes differ in dimensions.

[29]:

dc_1 = dc_reader.add_dimension('1', [1] * len(dc_reader))
dc_2 = dc_reader.add_dimension('2', [2] * len(dc_reader))
dc_united = dc_1.unite(dc_2)
dc_united.file_register

[29]:

	filepath	tile_name	var_name	data_version	sensor_field	time	1	2
0	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-01-01	1.0	NaN
1	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V5	X2	2000-01-01	1.0	NaN
2	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V1	X1	2000-01-01	1.0	NaN
3	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-01-01	1.0	NaN
4	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-01-01	1.0	NaN
...	...	...	...	...	...	...	...	...
5835	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-12-30	NaN	2.0
5836	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-12-30	NaN	2.0
5837	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V5	X2	2000-12-30	NaN	2.0
5838	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N018T6	DVAR	V1	X1	2000-12-30	NaN	2.0
5839	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N018T6	DVAR	V5	X2	2000-12-30	NaN	2.0

5840 rows × 8 columns

Datacube intersection

Intersecting two datacubes works similarly, except that this time a inner join operation along the dimensions/columns of the file register takes place. As an optional argument, one can also define a specific dimension to operate the intersection along to perform an intersection also for the entries of the file register. The example below demonstrates an intersection to retrieve the common parts of two datacubes.

[30]:

dc_apr_may = dc_months[3].unite(dc_months[4])
dc_may_jun = dc_months[4].unite(dc_months[5])
dc_intersct = dc_apr_may.intersect(dc_may_jun, on_dimension='time')
dc_intersct.file_register

[30]:

	filepath	tile_name	var_name	data_version	sensor_field	time
0	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-05-01
1	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V5	X2	2000-05-01
2	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V1	X1	2000-05-01
3	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-05-01
4	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-05-01
...	...	...	...	...	...	...
243	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-05-31
244	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-05-31
245	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V5	X2	2000-05-31
246	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N018T6	DVAR	V1	X1	2000-05-31
247	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N018T6	DVAR	V5	X2	2000-05-31

248 rows × 6 columns

We can also take the two datacubes from before and intersect them to retrieve the original one.

[31]:

dc_intersct = dc_1.intersect(dc_2)
dc_intersct.file_register

[31]:

	filepath	tile_name	var_name	data_version	sensor_field	time
0	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-01-01
1	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V5	X2	2000-01-01
2	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V1	X1	2000-01-01
3	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-01-01
4	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-01-01
...	...	...	...	...	...	...
2915	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N018T6	DVAR	V5	X2	2000-12-30
2916	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V1	X1	2000-12-30
2917	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N012T6	DVAR	V5	X2	2000-12-30
2918	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N018T6	DVAR	V1	X1	2000-12-30
2919	D:\data\code\yeoda\2022_08__docs\general_usage...	E048N018T6	DVAR	V5	X2	2000-12-30

2920 rows × 6 columns

Datacube alignment

As a last multi-datacube operation align_dimension allows to reduce or duplicate the number of entries in a datacube with respect to another datacube. However, it is only possible to resolve one-to-many or many-to-one relations with this method. The following example replicates the behaviour of the previously executed intersect operation. First, we need to ensure to only have unique entries along our dimension of interest.

[32]:

dc_apr_tuni = dc_apr.select_tiles(["E042N012T6"])
_ = dc_apr_tuni.select_by_dimension(lambda s: s == 'X1', name='sensor_field', inplace=True)

Now we can try to align a large datacube to this smaller datacube along the temporal dimension by taking the many-to-one relation into account.

[33]:

dc_1_apr = dc_1.align_dimension(dc_apr_tuni, 'time')
dc_1_apr.file_register

[33]:

	filepath	tile_name	var_name	data_version	sensor_field	time	1
0	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-01	1
1	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-02	1
2	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-03	1
3	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-04	1
4	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-05	1
5	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-06	1
6	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-07	1
7	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-08	1
8	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-09	1
9	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-10	1
10	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-11	1
11	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-12	1
12	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-13	1
13	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-14	1
14	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-15	1
15	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-16	1
16	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-17	1
17	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-18	1
18	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-19	1
19	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-20	1
20	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-21	1
21	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-22	1
22	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-23	1
23	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-24	1
24	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-25	1
25	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-26	1
26	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-27	1
27	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-28	1
28	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-29	1
29	D:\data\code\yeoda\2022_08__docs\general_usage...	E042N012T6	DVAR	V1	X1	2000-04-30	1

Another example would be if one wants to duplicate entries in a file register of a datacube so that both have the same length along the specified dimension. To demonstrate this, we can create a fake dataset, containing one file representing one observation of a sensor “X3” in a whole month.

[34]:

fake_filepaths = [r"D:\data\code\yeoda\2022_08__docs\fake_data\DVAR_20000415T000000____E042N012T6_EU024KM_V1_X3.nc"]
dc_fake = DataCubeReader.from_filepaths(fake_filepaths, fn_class=YeodaFilename, dimensions=dimensions,
                                        stack_dimension="time", tile_dimension="tile_name")
dc_fake.file_register

[34]:

	filepath	tile_name	var_name	data_version	sensor_field	time
0	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15

By calling align_dimension on this datacube with respect to a different datacube along a common dimension, we can duplicate the file register entries so that both datacubes match in length. In this case, we choose the “var_name” dimension as a common dimension, since its already available.

[35]:

_ = dc_fake.align_dimension(dc_apr_tuni, "var_name", inplace=True)
dc_fake.file_register

[35]:

	filepath	tile_name	var_name	data_version	sensor_field	time
0	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
1	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
2	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
3	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
4	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
5	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
6	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
7	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
8	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
9	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
10	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
11	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
12	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
13	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
14	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
15	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
16	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
17	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
18	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
19	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
20	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
21	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
22	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
23	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
24	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
25	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
26	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
27	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
28	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15
29	D:\data\code\yeoda\2022_08__docs\fake_data\DVA...	E042N012T6	DVAR	V1	X3	2000-04-15

General usage

Input dataset

Datacube generation

Datacube properties

File-specific selections

Spatial selections

Coordinate selection

Bounding box selection

Pixel window selection

Polygon selection

Other datacube operations

Renaming a dimension

Adding a new dimension

Sorting a dimension

Split up datacube by a dimension

Split up datacube temporally

Copying a datacube

Multi-datacube operations

Datacube union

Datacube intersection

Datacube alignment

Reading data

Writing data

Data export

Data tiling

Data streaming

Data format conversion