Appendix B - Guiding principles for file naming and file organization
Introduction
- These guidelines outline file naming conventions and file organization conventions for digitization projects.
- Guidelines support /wiki/spaces/DR/pages/1950449665 tasks required to transfer files into the /wiki/spaces/DR/overview.
File naming – guiding principles
Use the following rules when creating file names for digital objects produced during archival digitization projects.
- Reference codes and physical storage locations are required for all analog archival material prior to digitization. Consult the Digital Archivist before digitizing archival material that does not have a reference code.
- File names mimic reference codes or other unique identifiers.
- File names include the following elements:
- Collection ID (retrieved from the /wiki/spaces/APM/pages/90538072)
- Box number
- Folder number
- Item number (where applicable)
- Automatically generated three-digit or four-digit sequential number
- File names are as short as possible.
- File names have no spaces. Use underscores to fill in spaces:
- ms-#-###_box#_folder#_item#_sequential#
- ua-#_box#_folder#_item#_sequential#
- File names for University theses should have the author's name and year with no spaces or underscores (e.g.,FirstNameLastNameYear.pdf)
- Refer to the following table for guidance creating file names for digitized archival material:
Reference Code | Filename |
MS-2-744, Box 20, Folder 30 | MS-2-744_20_30_001.tif |
MS-2-41, SF Box 45, Folder 23 | MS-2-41_SF45_23_001.tif |
UA-11, Box 1, Folder 11 | UA-11-1_11.pdf |
MS-13-1, PB Box 1, Folder 103 | MS-13-1_PB1_103_001.tif |
2011-006, OS Box 3, Folder 4 | 2011-006_OS3_4_001.jpeg |
2013-023, Reel 231 | 2013-023_Reel231.mp3 |
MS-3-46, Box 54, Folder 13, Item 3 | MS-3-46_54_13_3_001.tif |
File organization - guiding principles
Use the following rules when organizing digital objects produced through digitization projects.
- File folder names (i.e., directory structure) mimic reference codes or other unique identifiers.
- File paths organize digital files according to provenance.
- File paths enable browsing archival storage by reference code or other unique identifiers.
- File paths are as short as possible.
- Folder names have no spaces. Use underscores to fill in spaces.
- Store "access copies" in a sub-folder called "access"
- When the "access copy" is a concatenated PDF, the PDF must be stored in the top-level folder and the "access" sub-folder.
- Refer to the following table for guidance organizing files:
Top-level folder (Collection) | Sub-folder (Box) | Sub-folder (File) | Sub-folder (access copy) | Digital file names |
MS-3-35 | MS-3-35_18 | MS-3-35_18_1 | MS-3-35_18_1_001.tiff MS-3-35_18_1_002.tiff | |
access | MS-3-35_18_1.pdf | |||
MS-3-35_18_2 | MS-3-35_18_2_001.tiff MS-3-35_18_2_002.tiff | |||
access | MS-3-35_18_2_001.jpeg MS-3-35_18_2_002.jpeg | |||
MS-3-35_19 | MS-3-35_19_1 | MS-3-35_19_1_001.tiff MS-3-35_19_1_002.tiff | ||
access | MS-3-35_19_1.pdf | |||
MS-3-35_19_2 | MS-3-35_19_2_001.tiff MS-3-35_19_2_002.tiff | |||
access | MS-3-35_19_2_001.jpeg MS-3-35_19_2_002.jpeg | |||
MS-2-201 | MS-3-35_SF30 | MS-2-201_SF30_45 | MS-2-201_SF30_45_001.tiff MS-2-201_SF30_45_002.tiff | |
access | MS-2-201_SF30_45.pdf |