Geospatial

Dask and Xarray have revolutionized the analysis of large scale geospatial data. They make it easy for researchers to intuitively query large multi-dimensional datasets, and then run those queries at large scale.

import xarray as xr
ds = xr.open_dataset(
    "s3://era5-pds/zarr/2020/01/data/eastward_wind_at_100_metres.zarr",
)
hourly_wind = (
    ds['eastward_wind_at_100_metres']
    .sel(lon=slice(180, None))
    .coarsen(lat=2, lon=2, boundary='trim')
    .mean()
)

More Examples

For more in-depth examples consider the following: