Organising a photo collection

Sorting and viewing

It would be nice to have some kind of smart annotation and viewing system which used AI to tag the photos without depending upon Google’s creepy data mining infrastructure. ownphotos aims for this.

Digikam is a whole photo library and metadata management system.

Tropy is an open source photo manager targeting researchers.

Tropy is free, open-source software that allows you to organize and describe photographs of research material. Once you have imported your photos into Tropy, you can combine photos into items (e.g., photos of the three pages of a letter into a single item), and group photos into lists. You can also describe the content of a photograph. Tropy uses customizable metadata templates with multiple fields for different properties of the content of your photo, for example, title, date, author, box, folder, collection, archive.

Metadata

exiftool and exiv2 seem to be popular media manipulation libraries. pyexiv2 is a python binding. Mediainfo is “a convenient unified display of the most relevant technical and tag data for video and audio files.”

Erase all (or most) of the explicit metadata from an image:

exiftool -all= filename.jpg

NB:

ExifTool is not guaranteed to remove metadata completely from a file when attempting to delete all metadata. For JPEG images, all APP segments (except Adobe APP14, which is not removed by default) and trailers are removed which effectively removes all metadata, but for other formats the results are less complete:

  • JPEG – APP segments (except Adobe APP14) and trailers are removed.
  • TIFF – XMP, IPTC and the ExifIFD are removed, but some EXIF may remain in IFD0.
  • PNG – Only iTXt, tEXt and zTXt chunks (including XMP) are removed.
  • PDF – The original metadata is never actually removed.
  • PS – Only some PostScript and XMP may be deleted.
  • MOV/MP4 – Only XMP is deleted.
  • RAW formats – It is not recommended to remove all metadata from RAW images because this will likely remove some proprietary information that is necessary for proper rendering of the image.

mat attempts to erase all metadata from everything it can handle.