RL Vision Knowledge Base
Support questions and answers for software by RL Vision.
Note: This is an archived discussion. Any bug, problem or suggestion mentioned here is likely to have been fixed since it was written.
Subject: Re: PDF Image Extraction Wizard
Date: Thu, 13 Mar 2008 20:08:09 +0100There are many ways to write a pdf file, and each pdf-writer software employs their own way of doing ot. I have noticed that some split the images into strips, just like you explained, don't ask me why. My program can only read the raw image data contained inside the pdf files. So I'm afraid I can't help you there.
There are two ways to store bitmap images inside a pdf: jpg compressed (lossy) and non-lossy ("bmp"). By default my program will extract into these formats since it is the least destructive way to handle the images, but you can change the output format from one of the pages within the program.
I don't want you to pay for something you can use, so if you decide that the program is not of use to you, let me know and I can give you your money back.
// Dan
Roy wrote:
> Dan,
>
> I attempted to extract three different pdf files with these results:
>
> The first pdf file was was a 20 page scanned report that converted to 152 pages, each containing a narrow strip of data. The data is all there, just fragmented.
>
> The next pdf file contained one page of text and two scanned images. The text file didn't extract and the two pages of graphics converted into 11 fragmented pages.
>
> The third pdf file was a two page scanned document that converted properly as complete pages although the first page converted as jpg and the second as bmp.
>
> I can't understand why one file converted correctly and the other two won't. The first file was from one source and the second and third files were from the same source. I am unable to print pdf files so it is necessary that they be converted to a more user friendly format.
>
> For the two files that converted from pdf to jpg as fragments, I would have to print the fragmented pages, cut them apart and paste onto sheets of paper and then scan these sheets as jpg files to get the full page. Too much work. Thanks.
>
> Roy
>
>
> ----- Original Message -----
> From: RL
> To: Roy
> Sent: Wednesday, March 12, 2008 3:20 PM
> Subject: Re: PDF Image Extraction Wizard
>
>
> It is important to understand what the program does. PDF documents can contains text, bitmap images and vector images. My program can extract the bitmap images, but nothing else. It sounds like you expect the entire document to be extracted, or am I wrong?
>
> You are welcome to send me a pdf that I can test to make sure there are no bugs or such.
>
> // Dan
>
>
> Roy wrote:
>> Dan,
>>
>> Yes, it does say registered version on the first page.
>>
>> The extraction is partial in that only a small fraction of the page is
>> extracted, perhaps 5-10%. Thanks.
>>
>> Roy
>>
>> ----- Original Message -----
>> From: RL
>> To: Roy
>> Sent: Tuesday, March 11, 2008 12:38 PM
>> Subject: Re: PDF Image Extraction Wizard
>>
>>
>>
>>> Hi,
>>>
>>> It should say "registered version" on the first page after you have
>>> restarted the program. Does it do that?
>>>
>>> Can you also please describe in what way the extraction is "partial".
>>>
>>> thanks,
>>>
>>> // Dan
>>>
>>>
>>>
>>>
>>> Roy wrote:
>>>
>>>> I purchased, installed, registered and entered the product registration
>>>>
>> code
>>
>>>> but I'm still only getting partial extractions. Help!
>>>>
>>>> Roy
>>>>
>>>>
>>>>
>>>>
>>>>
>>
>>
>>
>
There are two ways to store bitmap images inside a pdf: jpg compressed (lossy) and non-lossy ("bmp"). By default my program will extract into these formats since it is the least destructive way to handle the images, but you can change the output format from one of the pages within the program.
I don't want you to pay for something you can use, so if you decide that the program is not of use to you, let me know and I can give you your money back.
// Dan
Roy wrote:
> Dan,
>
> I attempted to extract three different pdf files with these results:
>
> The first pdf file was was a 20 page scanned report that converted to 152 pages, each containing a narrow strip of data. The data is all there, just fragmented.
>
> The next pdf file contained one page of text and two scanned images. The text file didn't extract and the two pages of graphics converted into 11 fragmented pages.
>
> The third pdf file was a two page scanned document that converted properly as complete pages although the first page converted as jpg and the second as bmp.
>
> I can't understand why one file converted correctly and the other two won't. The first file was from one source and the second and third files were from the same source. I am unable to print pdf files so it is necessary that they be converted to a more user friendly format.
>
> For the two files that converted from pdf to jpg as fragments, I would have to print the fragmented pages, cut them apart and paste onto sheets of paper and then scan these sheets as jpg files to get the full page. Too much work. Thanks.
>
> Roy
>
>
> ----- Original Message -----
> From: RL
> To: Roy
> Sent: Wednesday, March 12, 2008 3:20 PM
> Subject: Re: PDF Image Extraction Wizard
>
>
> It is important to understand what the program does. PDF documents can contains text, bitmap images and vector images. My program can extract the bitmap images, but nothing else. It sounds like you expect the entire document to be extracted, or am I wrong?
>
> You are welcome to send me a pdf that I can test to make sure there are no bugs or such.
>
> // Dan
>
>
> Roy wrote:
>> Dan,
>>
>> Yes, it does say registered version on the first page.
>>
>> The extraction is partial in that only a small fraction of the page is
>> extracted, perhaps 5-10%. Thanks.
>>
>> Roy
>>
>> ----- Original Message -----
>> From: RL
>> To: Roy
>> Sent: Tuesday, March 11, 2008 12:38 PM
>> Subject: Re: PDF Image Extraction Wizard
>>
>>
>>
>>> Hi,
>>>
>>> It should say "registered version" on the first page after you have
>>> restarted the program. Does it do that?
>>>
>>> Can you also please describe in what way the extraction is "partial".
>>>
>>> thanks,
>>>
>>> // Dan
>>>
>>>
>>>
>>>
>>> Roy wrote:
>>>
>>>> I purchased, installed, registered and entered the product registration
>>>>
>> code
>>
>>>> but I'm still only getting partial extractions. Help!
>>>>
>>>> Roy
>>>>
>>>>
>>>>
>>>>
>>>>
>>
>>
>>
>