Processing configuration for digitization project files
Introduction
This page outlines the processing configuration for transferring and ingesting digitization project files.
The processing configuration assumes that Archivematica will send a Dissemination Information Package (DIP) to the Archives Catalogue.
Configuration
Field | Description | Setting |
---|---|---|
Name | The name of the processingMCP.xml that you are editing. | Digitization |
Assign UUIDs to directories | Directories are given an entry in the fileSec and assigned a unique universal identifier (UUID). Note that the digital objects in the transfer are always assigned a UUID. | Yes |
Generate transfer structure report | A text file is generated showing a directory tree of the original transfer structure. | Yes |
Perform file format identification (Transfer) | Choose whether or not to identify the format of the files in your transfer. | Yes |
Extract packages | Packages (such as .zip files) are unzipped and extracted into a directory. | No |
Delete packages after extraction | Packages that have been extracted in the previous step can be deleted after extraction. | No |
Perform policy checks on originals | If you have created policies using MediaConch, Archivematica runs the original transfer materials against the policies to assess conformance. | No |
Examine contents | Run Bulk Extractor, a forensics tool that can recognize credit card numbers, social security numbers, and other patterns in data. For more information on reviewing Bulk Extractor logs, see the Analysis pane on the Appraisal tab. | Skip examine contents |
Create SIP(s) | Create a formal SIP out of the transfer or send it to the backlog. | Create single SIP and continue processing |
Perform file format identification (Ingest) | Choose to identify the format of files in your SIP. | Yes |
Normalize | Convert ingested digital objects to preservation and/or access formats. | Do not normalize |
Approve normalization | The dashboard allows users to review the normalization output and the normalization report. | Yes |
Generate thumbnails | This gives the option of generating thumbnails for use in the AIP and DIP. | Yes |
Perform policy checks on preservation derivatives | If you create policies using MediaConch, run the policies against the newly-created preservation derivatives to ensure conformation. | No |
Perform policy checks on access derivatives | If you create policies using MediaConch, run the policies against the newly-created access derivatives to ensure conformation. | No |
Bind PIDs | Assign persistent identifiers and send the information to a Handle.Net server. | No |
Document empty directories | By default, Archivematica removes empty directories and does not document that they existed. | No |
Reminder: add metadata if desired | Archivematica allows users to see add metadata to a SIP through the user interface. This reminder occurs at the last moment that it is possible to add metadata; once the ingest proceeds past this point, it is no longer possible to add metadata to the SIP. | Continue |
Transcribe files (OCR) | Users can elect to run Tesseract, an OCR tool that is included in Archivematica, to produce text files containing file transcripts. | No |
Perform file format identification (Submission documentation & metadata) | Choose a tool to identify the format of any submission documentation and/or metadata files that were included in your transfer. | Yes |
Select compression algorithm | AIPs created by Archivematica can be stored as compressed packages or uncompressed, depending on your storage requirements. | 7z using bzip2 |
Select compression level | If you selected a compression choice in the step above, you can determine how compressed you would like your AIP to be. Selecting a higher compression level means that the resulting AIP is smaller, but compression also takes longer. Lower compression levels mean quicker compression, but a larger AIP. | 5 - normal compression |
Store AIP | Pausing at the Store AIP microservice allows users to review the AIP contents prior to storage. If you do not want to manually review AIPs prior to storage, this can be set to bypass that review step. | Yes |
Store AIP location | Once the previous step is approved, the AIP can be automatically sent to a specified storage location by setting the preferred location. | Store AIP in standard Archivematica directory |
Upload DIP | If a DIP was created, it can be automatically sent to an access system for which there is an Archivematica integration. | Upload DIP to AtoM/Binder |
Store DIP | If a DIP was created, it can be stored without interrupting the workflow in the dashboard. Note that DIP storage is not required, and that DIPs can be created on demand by re-ingesting the AIP. | Do not store |
Store DIP location | Â | None |