Answers / Scan & OCR
Hello.I have a scanned document and I need to change the background to specific color as if it were printed on a paper of that color. I tried adding background but the document stays white because the white background is part of the picture and my colorful background is under it. I tried OCR and copy&paste the boxes but it didn't work out. When I try to delete, the whole text dissapears and I can't paste the copied text back. The pdf is not protected. Thank you for any advice.
Jana Preslickova 693 days ago
There is one more thing you will have to do:
Have you seen the "Content" pane on the left side? Most users will never open this, but here is your chance to dig a little deeper into your PDF file :) Here is what the Contents pane will look like for a document that you just scanned, without doing any optimizations:
When you run the optimize scanned document function, select the ClearScan output format:
Once you've done, return to the Contents pane and expand the tree until you see the actual content, including the background image:
Highlight the first image and hit the delete key. This should get rid of the background. Now you an add your own background color. However, keep in mind that this will only work for text: Anything that is an image will retain it's white background color.
Karl Heinz Kremer
PDF Acrobatics Without a Net
PDF Software Development, Training and More...
Thank you. This helped a lot. Now my only problem is that I have one picture that includes one image in heading, several words on different lines and one picture at the bottom. The pictures need to stay and a bit of white background around it is no problem, but the white rectangle around random words needs to go as well. Is there a way to cut them apart or tell Acrobat X something like "Hey, this is also a text/line and you should treat it as such."?
Jana Preslickova Commented 693 days ago | Report
I assume that the OCR algorithm did not recognize those words as text - maybe the scan quality is not good enough, or there is something different with how these words are printed. You can look for "OCR Suspects" - you will find this under Tools>Text Recognition>OCR Suspects - go through every suspect and provide more information about the text in question. This page has more information: http://help.adobe.com/en_US/acrobat/X/pro/using/WS58a04a822e3e50102bd615109794195ff-7f6f.w.html
Karl Heinz Kremer Commented 689 days ago | Report
I tried that, but those words are not even suspected. It's mostly standalone underlined word. (the document has 2 collumns in sort of a table)
Jana Preslickova Commented 687 days ago | Report
This means you are out of luck with Acrobat's OCR. You may want to look at a 3rd party OCR application. I've always had good luck with Abbyy's FineReader, which does nothing but OCR, is it's much more capable than Acrobat.
Karl Heinz Kremer Commented 686 days ago | Report
Jana Preslickova Commented 686 days ago | Report
When you OCR the document, select ClearScan as the output format. Then you should be able to add a background to the document.
Acrobat XI trial
Get the trial now
Last 30 Days
The Acrobat User Community is where you can connect with Acrobat and Adobe Document Services peers and experts. If you use Acrobat, Reader or EchoSign, you'll want to take advantage of the many free community resources like tutorials, tips, and Q&A to learn more about Acrobat and document services.