i want to run acrobat ocr from the command line

I need the ability to run existing PDF file through the Acrobat OCR engine and get out a searchable PDF on the command line. What product(s) does Adobe have that would have this capability?

I'm envisioning something like this:

acrobat.exe -OCR c:\path\to\input.pdf c:\path\to\output.pdf

Does anything like this exist?

Tom


Tom P


12 Answers


Gilad D (try67)   

Yep, already seen that.

I called Adobe and they said that they didn't know of any products that can do OCR from the command line for certain, but they're pretty sure that Acrobat X Pro has the capability. For the size of company they are, Adobe seems to have really awful phone and on-line chat reps.

I'll call them again and see if I can't talk to someone that doesn't have his head up his ass.

Tom


Tom P   

Alright, could someone give me some guidance on what LiveCycle PDF Generator trial products I need to download to do OCR processing from the command line? Adobe has hung up on me and told me again that they don't support command line OCR processing.

Adobe, your customer representatives are completely inept.


Tom P   

This is not a good way of letting Adobe know what you think about them... It's a User-to-User forum.
Have you tried calling the number on this page?
http://www.adobe.com/products/livecyc...


Gilad D (try67)   

PS - Are you aware that you can run OCR on multiple files using an Action in Acrobat Pro?
Seems like a much less painful way than trying to do it via command-line...


Gilad D (try67)   

Yep. That's the number where I was told twice that Adobe doesn't have products that support command line processing.

I know it's a user-user forum, but I suspect Adobe looks here; and even if they don't, other users will see how bad their customer support is.


Tom P   

Yes, I'm aware I can do it through the Acrobat GUI, but the point of the question is that I don't want to do that. I want the ability to programmatically send a PDF file to <name the program> and have it spit out a new PDF that's searchable.

Tom


Tom P   

Well, I guess you'll have to look further, then.

I must say I don't know for sure if that product can do that, either. But I would expect Adobe Reps to know...
Did you see that you can download a trial version?
http://www.adobe.com/devnet/livecycle...


Gilad D (try67)   

I did notice that, but there are a number of downloads that I have to choose from, and I just don't know which ones I need. I was hoping Adobe's reps would be able to help with that.

Here's the download page: http://www.adobe.com/devnet/enterpris...


Tom P   

From the referenced post in the Acrobat forums, you find you need to use the LiveCycle PDF Generator. On the download page you found, look for PDF Generator, and then you will find the down load and the the documentation. That should get you started.


George Kaiser   

bump .. please advise if you found a good solution to this problem, bcos I'm facing something very similar. I have a large a mix of scans (picture PDFs) and regular print-to-PDF files; the former need OCR, the latter do not.

I've been thinking it might be relatively straighforward to build a tiny VisualBasic program that reads command-line arguments and passes them along to an instance of the AdobeAcrobat control (using COM/OLE features).

Thanks.


Chris Lott   

You can download the documentation form the Acrobat Developer Center, http://www.adobe.com/devnet/acrobat.html?view=home, but not all features of Acrobat can be controled by the JSO or API.


George Kaiser   


Please specify a reason: