Importing OCR Text

 

Sanction II imports individual pages of OCR text based on each item's unique ID and the full path to its text file. To import text, use or create a load file in the following format, with the fields separated by a pipe ("|") character:

 

Category|ID|Full File Path

 

For example, the following sample OCI file would load five pages of OCR text, one for each of the Item IDs DOC001 through DOC005:

 

DOCUMENTS|DOC001|D:\OCRTEXT\001.txt
DOCUMENTS|DOC002|D:\OCRTEXT\002.txt
DOCUMENTS|DOC003|D:\OCRTEXT\003.txt
DOCUMENTS|DOC004|D:\OCRTEXT\004.txt
DOCUMENTS|DOC005|D:\OCRTEXT\005.txt
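
If the load file does not already exist, it can be generated with a short script. The following Python sketch builds lines in the Category|ID|Full File Path format shown above. The function names, the 001.txt-style numbering of the text files, and the UTF-8 encoding with Windows line endings are assumptions made for illustration; they are not requirements stated by Sanction II.

```python
from pathlib import PureWindowsPath


def build_load_lines(category, item_ids, text_dir):
    """Return one 'Category|ID|Full File Path' line per item ID."""
    lines = []
    for number, item_id in enumerate(item_ids, start=1):
        # Assumed naming scheme: 001.txt, 002.txt, ... as in the sample above.
        text_path = PureWindowsPath(text_dir) / f"{number:03d}.txt"
        lines.append("|".join([category, item_id, str(text_path)]))
    return lines


def write_load_file(load_path, lines):
    """Write the lines to a load file, one entry per line."""
    # Encoding and line endings are assumptions; adjust if the import fails.
    with open(load_path, "w", encoding="utf-8", newline="\r\n") as handle:
        handle.write("\n".join(lines) + "\n")


item_ids = [f"DOC{n:03d}" for n in range(1, 6)]
load_lines = build_load_lines("DOCUMENTS", item_ids, r"D:\OCRTEXT")
for line in load_lines:
    print(line)
```

Calling write_load_file("ocr_import.oci", load_lines) would then save the entries to a file (the file name here is hypothetical).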

 

Note: The text file containing the OCR text can have any valid file name. By default, Sanction II recognizes a load file with the file extension "oci"; however, any valid file extension can be used.
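
Before importing, it can help to confirm that every entry in a load file has the three pipe-separated fields described above. The following Python sketch is one way to do that. The function names are hypothetical, and skipping blank lines and trimming surrounding spaces are assumptions of this check, not documented Sanction II behavior.

```python
def parse_load_line(line):
    """Split one load-file line into (category, item_id, file_path)."""
    fields = line.rstrip("\r\n").split("|")
    if len(fields) != 3:
        raise ValueError(
            f"expected 3 pipe-separated fields, got {len(fields)}: {line!r}"
        )
    category, item_id, file_path = (field.strip() for field in fields)
    if not (category and item_id and file_path):
        raise ValueError(f"empty field in line: {line!r}")
    return category, item_id, file_path


def check_load_file(lines):
    """Return (line_number, message) pairs for every malformed entry."""
    problems = []
    for number, line in enumerate(lines, start=1):
        if not line.strip():
            continue  # assumption: blank lines are ignored by this check
        try:
            parse_load_line(line)
        except ValueError as error:
            problems.append((number, str(error)))
    return problems


sample = [
    r"DOCUMENTS|DOC001|D:\OCRTEXT\001.txt",
    r"DOCUMENTS|DOC002",  # deliberately missing the file path
]
for number, message in check_load_file(sample):
    print(f"line {number}: {message}")
```

The check only looks at the structure of each entry. It does not verify that the text files exist at the listed paths or that the Item IDs exist in the case.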