FamilyHistory with Permanent.org Part 3 of 4: Digitizing Your Collection

Tia Stenson

October 4, 2021
Image

Digitizing Your Collection

Welcome back! We are on step three of our journey to get our family collections organized, digitized, and accessible to our friends and family. Step number one was to organize and arrange our documents to best reflect the collection and those using it. Step number two was describing the collections to get them ready for upload to Permanent.org. Now, onto step number three: It is finally time to digitize those documents. 

Digitizing Your Collection

When it comes to digitizing your collection, you should think about whether everything is worth being digitized and available for family or the public. Some materials might be worthy to keep, but not something everyone will want to see, such as receipts and various financial documents. This is especially important if you want to make materials available quickly. You can always go back and digitize the other materials later.

The methods of digitization will depend on the quality of the material being digitized. More delicate materials will need extra care. Today there are two types of scanners for the everyday person: a flatbed scanner, and a portable scanner. I personally have both. I have an Epson Perfection V370 Photo, which has the capabilities to scan slides and negatives along with documents and images. Using a flatbed scanner is slow, however, because the material remains stationary, and will be best suited for fragile documents that might be damaged with a portable scanner where the document moves through the scanner. In addition, I have a Brother DSmobile 920dw, portable scanner. This scanner is faster and good for stable documents that can withstand moving through a scanner. 

When scanning, you want to think about the resolution and the bit depth of the scan. Resolution is the amount of detail captured in the scan and bit depth is the number of pixels captured. Using the standard resolution and bit depth numbers will set you up to have high-quality scans where you will be able to see as much detail in your documents as possible.

For documents, it is recommended to have a resolution between 300 to 400 dpi and 24-bit color (see image 1). Documents should be scanned as PDFs, the recommended file type. As scanning the documents has the potential to damage them due to the light from the scanner, it is ideal to scan your document only once, which is why you want it at the highest quality scanThen, use that digital copy, or the preservation copy, to make derivative digital copies, instead of rescanning at a lower resolution. With this I recommend using conversion software, to convert the PDF file into image file formats such as TIFF and JPG. I use a software called PDF to JPG, to make my PDF files into image files (see image 2). I do this because some systems will only allow an image file, such as JPG, and do not accept PDF files (such as Flickr, and the Internet Archive). 

Remember to scan all the pages of a document, front and back. 

Image 1: Recommended settings for digitizing documents. 

Image 2: Recommended setting for making PDF files into image files. 

Naming Your Files

When naming your files, it will be similar to how we labeled our folders. Ask yourself, what is the document, who is the document related to, and when was the document created? For example: “Birthday Card for Alfred Thum from Bud, Vera, Joel, Myra, Michael, and Keith DeSha, 1964-11-09”. This type of file name will provide you the most information when browsing your digital items and help you to find what you are looking for when using the search function. If you do not like the length of the file name you could create a type of code, such as “RT001.F001.D001”. For myself, RT001 would represent the Richard Thum Document Collection, F001 would be folder 1, and D001 would be the first document in the folder. This filename gives you context of where the document is located in your collection but does not help with browsing or searching for a specific type of document. When naming the image file of your documents, take your typical file name and add a page number if the document has more than one page. Example: “Birthday Card for Alfred Thum from Bud, Vera, Joel, Myra, Michael, and Keith DeSha, 1964-11-09_page 3 of 4”.

Organizing Your Digital Files

Now how do you organize all the files that you have digitized? Remember all the work we did to organize and arrange the physical items into series, sub-series, and folder order? You can now just replicate all that work in the digital world. 

When using Permanent.org, this would create a folder hierarchy (see images 3-5):

  1. Collection Folder
    1. Series Folder
      1. Sub-Series Folder (Optional)
        1. Folder 
          1. Digital Item

Image 3: Series Folders 

Image 4: the Folders within a series

Image 5: The individual digital files within the folder.

Now you are ready to digitize your documents and upload those digital files to Permanent.org, but we are not quite done with the physical documents just yet. The next blog in the series will help guide you in care and storage of your physical documents. 

How to Describe your Digital Materials on Permanent.org

We have gone through all the steps to creating a digital collection, we organized, we described, we digitized, but now how do we connect our digital objects to the descriptions we provided? Here’s how I make the connection on Permanent.org.

Let’s begin with our Collection level of description. Starting with the folder, I title it with the Reference Code and Title. In the Description, I will put the Creators, Language, Administrative Biographical History, Scope and Content, and Custodial History. In the tags, I would put my access points/subject terms/tags. The start date would have when the collection began, and the end date would have when the collection ended.  

Example: https://www.permanent.org/p/archive/00h3-0000/00h3-0756/574563 

For a Series or Sub-Series Folder, I would put in the title either “series” or “sub-series”, the number of that series or sub-series, plus the name of that series and sub-series. In the Description, I would put the Scope and Content, and System of Arrangement. The tags and dates would be the same as it would be for the collection level. 

Example: https://www.permanent.org/p/archive/00h3-0000/00h3-0757/574564

When it’s the folder level, I just label the folder with the folder title. 

For the Item level, the title receives the document title. The description will have the creator/author, language, and description. The tags and dates would be the same as the collection and series folders.