Glossary

Glossary#

A glossary of common terms used throughout Jupyter Book.

Chunk#: Smaller, more manageable pieces of a larger dataset.
Chunking#: The process of breaking down large amounts of data into smaller, more manageable pieces.
Chunk shape#: The actual shape of a chunk, specifying the number of elements in each dimension.
Chunk size#: The size of the chunk in terms of memory, which depends on the chunk shape.
Coordinate Reference System#: A framework used to precisely measure locations on the surface of Earth as coordinates.
Larger-than-memory#: A dataset whose memory footprint is too large to fit into memory all at once.
Partial Chunk#: The final chunk along a dimensions of a dataset that is not completely full of data due to the chosen chunk shape not being an integer divisor of the dataset’s dimensions.
Rechunking#: The process of changing the current chunk shape of a dataset to another chunk shape.
Stored chunks#: The chunks that are physically stored on disk.
Virtual Zarr Store#: A virtual representation of a Zarr store generated by mapping any number of real datasets in individual files (e.g., NetCDF/HDF5, GRIB2, TIFF) together into a single, sliceable dataset via an interface layer, which contains information about the original files (e.g., chunking, compression, etc.).