Skip to content

Hash Checksum of a Delivery

Jiří Kadlec edited this page Aug 30, 2024 · 2 revisions

For every checked delivery, QC tool provides a hash (checksum) of the delivery files. The hash and the list of files that the hash was computed from (hash_files) are is reported in QC job JSON and PDF reports. The hash is computed as follows:

  • Delivery from uploaded ZIP file: sha256 hash of the ZIP file (Linux command sha256sum, e.g. sha256sum clc2012_mt.gdb.zip)
  • Delivery from S3: a combined sha256 hash of all of the files that the delivery consists of (e.g. *.tif file, *.xml file, *.tif.aux.xml file), computed using the checksumdir python package / command. Calculating the checksum of an S3 delivery can be reproduced by:
    • Install checksumdir python package e.g. pip install checksumdir
    • Create an empty directory e.g download_dir
    • Download all S3 objects specified in the hash_files property of the delivery report to the download_dir
    • Run the command (checksumdir -a sha256 download_dir). This should return the same hash (checksum) of the S3 delivery as the hash specified in the report.

Clone this wiki locally