Re: Search Pages

Posted by BH On 2010/11/1 21:22:42
The PDF files that contain the parts catalogs are made up of pages scanned as images. You would have to reprocess each image through some sort of OCR software (else rescan each original piece of paper directly into the OCR), but I've yet to see results that don't require proofreading - even if you can train the OCR software with a test scan. Then, you'd have to somehow weed out the page headers and footers and group headings prior to running your program code.

Seems like a lot of work for just that one purpose.

In trying to resolve some gray areas when indexing SCs, STBs, and other bulletins, I often have to track a part number to a parts book. I've recently used a free, local WiFi hot-spot to download the available parts books back to 1935. To see if a number exists in a particular catalog, I simply check the Numerical Index PDF for that edition, then go to the appropriate Group.

This Post was from: https://packardinfo.com/xoops/html/modules/newbb/viewtopic.php?post_id=63299