Metadata, Taxonomy & OCR

An important step in creating a digital archive is defining a metadata and taxonomy strategy to ensure the most accurate retrieval of digital files. The Digital Archive Group customizes this strategy for each client by working closely with the organization and the potential users of the digital archive.

If you are not familiar with metadata and taxonomy, let’s take a moment to highlight the differences:

Metadata is the key information that makes an asset searchable. It goes beyond the basic file name to further describe and define the asset. Sometimes that information is written on the document or image itself; other times it must be derived by viewing the item. When scanning or digitizing an asset, make sure you “embed” the metadata in the newly created digital file so that it is searchable.

Taxonomy is the formal structure that organizes your content and assets into a specialized filing system, either physically or digitally. Assigning a taxonomy to your content makes it easier to find within a DAM (digital asset management) system when you are not sure which keywords to use.
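For readers who think in code, the difference can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the Asset record, the sample file names, and the two helper functions are hypothetical and do not represent any particular DAM product. Metadata answers "what is this asset about?", while taxonomy answers "where does it sit in the structure?".

```python
from dataclasses import dataclass, field


@dataclass
class Asset:
    """Hypothetical archive record: a file plus its taxonomy and metadata."""
    filename: str
    taxonomy_path: tuple  # position in the hierarchy, broadest category first
    metadata: dict = field(default_factory=dict)


# Made-up sample records, for illustration only.
ARCHIVE = [
    Asset("img_0001.tif", ("Photographs", "Events", "1985"),
          {"title": "Annual picnic", "keywords": ["picnic", "staff"]}),
    Asset("doc_0042.pdf", ("Documents", "Reports", "1985"),
          {"title": "Annual report", "keywords": ["finance", "report"]}),
    Asset("img_0002.tif", ("Photographs", "Portraits"),
          {"title": "Founder portrait", "keywords": ["founder"]}),
]


def search_by_keyword(assets, keyword):
    """Metadata search: match the keyword against titles and keyword lists."""
    kw = keyword.lower()
    return [
        a.filename for a in assets
        if kw in a.metadata.get("title", "").lower()
        or kw in (k.lower() for k in a.metadata.get("keywords", []))
    ]


def browse_taxonomy(assets, *prefix):
    """Taxonomy browse: list every asset filed under the given category path."""
    return [a.filename for a in assets
            if a.taxonomy_path[:len(prefix)] == prefix]
```

Calling search_by_keyword(ARCHIVE, "picnic") finds an asset by what it describes, while browse_taxonomy(ARCHIVE, "Photographs") lists everything filed under a category, which is the route a user takes when no obvious keyword comes to mind.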

The Digital Archive Group will create an efficient taxonomy and metadata structure that increases the searchability of your assets across a variety of access-based or DAM software platforms.

OCR (optical character recognition) is a technology that enables you to digitize and convert different types of text-based documents into editable and searchable data. If you are creating a digital archive that includes documents, make sure that OCR is used to enable keyword search.
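To show why OCR output matters for keyword search, here is a minimal Python sketch. It assumes the text has already been extracted by an OCR engine; the file names, sample text, and function names are hypothetical, and a real archive platform would handle indexing in its own way.

```python
import re
from collections import defaultdict


def build_index(ocr_text_by_file):
    """Map each word to the set of files whose OCR text contains it."""
    index = defaultdict(set)
    for filename, text in ocr_text_by_file.items():
        for word in re.findall(r"[a-z0-9]+", text.lower()):
            index[word].add(filename)
    return index


def keyword_search(index, query):
    """Return the files that contain every word in the query, sorted by name."""
    words = re.findall(r"[a-z0-9]+", query.lower())
    if not words:
        return []
    hits = set.intersection(*(index.get(w, set()) for w in words))
    return sorted(hits)


# Made-up OCR output, for illustration only.
OCR_TEXT = {
    "minutes_1972.pdf": "Minutes of the board meeting, March 1972",
    "letter_1980.pdf": "Letter to the board regarding the new building",
}
INDEX = build_index(OCR_TEXT)
```

Without the OCR step there would be no text to index, and a scanned document could be found only through its file name or manually entered metadata.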