Glossary - HyTEST Project

A glossary of common terms used throughout Jupyter Book.

Chunk: Smaller, more manageable pieces of a larger dataset.
Chunking: The process of breaking down large amounts of data into smaller, more manageable pieces.
Chunk shape: The actual shape of a chunk, specifying the number of elements in each dimension.
Chunk size: The size of the chunk in terms of memory, which depends on the chunk shape.
Coordinate Reference System: A framework used to precisely measure locations on the surface of Earth as coordinates.
Larger-than-memory: A dataset whose memory footprint is too large to fit into memory all at once.
Partial Chunk: The final chunk along a dimensions of a dataset that is not completely full of data due to the chosen chunk shape not being an integer divisor of the dataset’s dimensions.
Rechunking: The process of changing the current chunk shape of a dataset to another chunk shape.
Stored chunks: The chunks that are physically stored on disk.
Virtual Zarr Store: A virtual representation of a Zarr store generated by mapping any number of real datasets in individual files (e.g., NetCDF/HDF5, GRIB2, TIFF) together into a single, sliceable dataset via an interface layer, which contains information about the original files (e.g., chunking, compression, etc.).