Glossary#

A glossary of common terms used throughout this Jupyter Book.

Chunk#

A smaller, more manageable piece of a larger dataset.

Chunking#

The process of breaking down large amounts of data into smaller, more manageable pieces.

Chunk shape#

The actual shape of a chunk, specifying the number of elements in each dimension.

Chunk size#

The memory footprint of a chunk, which depends on the chunk shape and the data type of its elements.
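
As a rough illustration of how chunk size follows from chunk shape, the sketch below multiplies the number of elements in a chunk by the number of bytes per element. The function name `chunk_size_bytes`, the example shape, and the 4-byte element size are assumptions chosen for illustration; the result is the uncompressed size.

```python
from math import prod


def chunk_size_bytes(chunk_shape, itemsize):
    """Return the uncompressed size of one chunk in bytes.

    chunk_shape: number of elements along each dimension
    itemsize: bytes per element (e.g., 4 for a 32-bit float)
    """
    return prod(chunk_shape) * itemsize


# Hypothetical chunk of 365 x 100 x 100 elements stored as 4-byte floats
size = chunk_size_bytes((365, 100, 100), itemsize=4)
print(f"{size / 1e6:.1f} MB")
```

Doubling the chunk shape along any one dimension doubles the chunk size, so the two properties are best chosen together.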

Coordinate Reference System#

A framework used to precisely measure locations on the surface of Earth as coordinates.

Larger-than-memory#

A dataset whose memory footprint is too large to fit into memory all at once.

Partial Chunk#

The final chunk along a dimension of a dataset that is not completely full of data because the chunk length along that dimension does not evenly divide the dataset's length along it.
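
A minimal sketch of when a partial chunk appears, assuming a single dimension split into fixed-length chunks. The helper `chunk_lengths` and the example lengths are hypothetical and only illustrate the arithmetic.

```python
def chunk_lengths(dim_length, chunk_length):
    """Return the length of every chunk along one dimension."""
    full_chunks, remainder = divmod(dim_length, chunk_length)
    lengths = [chunk_length] * full_chunks
    if remainder:
        # The leftover elements form a final, partial chunk
        lengths.append(remainder)
    return lengths


# A dimension of length 1000 with a chunk length of 300:
# three full chunks plus one partial chunk holding the remainder
print(chunk_lengths(1000, 300))

# A chunk length of 250 divides 1000 evenly, so no partial chunk appears
print(chunk_lengths(1000, 250))
```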

Rechunking#

The process of changing the current chunk shape of a dataset to another chunk shape.

Stored chunks#

The chunks that are physically stored on disk.

Virtual Zarr Store#

A virtual representation of a Zarr store, generated by mapping any number of real datasets stored in individual files (e.g., NetCDF/HDF5, GRIB2, TIFF) into a single, sliceable dataset. The mapping is done through an interface layer that records information about the original files, such as their chunking and compression.