Convert an Arabic PDF to Excel using the free trial

Using Windows Vista


H Taylor


6 Answers

What is your question? Are you running into problems? Do you need to know how to do it?

In Acrobat, open your PDF file, then select File>Save as other>Spreadsheet>...

If the file is correctly tagged, you will get a corresponding Excel document. If not, then you may end up with something that you cannot use, or something in between these two extremes.

Karl Heinz Kremer
PDF Acrobatics Without a Net
PDF Software Development, Training and More...
http://www.khkonsulting.com


Karl Heinz Kremer   

I need to keep the Arabic characters, but only gibberish comes out.


H Taylor   

Converting from PDF to Word, Excel or any other format is one of the most complex things you can try to do with a PDF file. It works very well in some cases; in other cases the output has very little to do with the original file. The key to success is that the PDF file needs to be "tagged", which means that it contains structural information about the content that is displayed in the file. The best way to make sure that a PDF file is tagged correctly is to use the PDFMaker in Acrobat to create the PDF file from Word or Excel (that's the Acrobat ribbon or toolbar).
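
As a rough illustration of what "tagged" means at the file level: a tagged PDF has a structure tree, which its catalog references through a `/StructTreeRoot` entry. The sketch below is only a crude heuristic, not how Acrobat checks tagging. It assumes the catalog is stored uncompressed; in many newer files it sits inside a compressed object stream, so a `False` result is inconclusive. The function name `looks_tagged` and the file name in the comment are hypothetical.

```python
def looks_tagged(pdf_bytes: bytes) -> bool:
    """Heuristic: True if the raw PDF bytes mention a structure tree root.

    Assumption: the document catalog is not hidden inside a compressed
    object stream. If it is, this returns False even for a tagged file.
    """
    return b"/StructTreeRoot" in pdf_bytes


# Example usage (hypothetical file name):
# with open("input.pdf", "rb") as f:
#     print(looks_tagged(f.read()))
```

A `True` result only says that a structure tree exists; it does not say the tags are good enough for a clean export.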

Unfortunately there is not much you can do to improve the output without spending a lot of time (e.g. by manually tagging the file). Also, if you are using Adobe's ExportPDF service and don't have access to Acrobat, that is not even an option.

The only thing you can do is complain to the original author of the file and tell them that they used a bad PDF generator to create the PDF file.


Karl Heinz Kremer   

I am able to convert it to Excel, but the problem is that the document is in Arabic, which I need to keep, and when it is converted I only get nonsense characters. I need to know whether Acrobat Pro supports converting documents that are in other languages, as I have a lot of international documents to convert and they have to stay in their original language.


H Taylor   

Acrobat definitely does support other languages, but the problem is that your PDF file does not contain all the information that Acrobat needs to extract the data. The PDF file may look good when displayed in Acrobat or Reader, but it does not contain the data needed to extract the text. This is usually caused by a missing "ToUnicode" table, which maps the glyph codes stored in the file back to Unicode characters.
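
To make the role of that table concrete, here is a toy model in Python. It is not Acrobat's code and it does not parse a real PDF; the glyph codes and the mapping are made up for illustration. The point is only that the page content stores font-specific glyph codes, and without a mapping back to Unicode an extractor can do no better than guess.

```python
# Toy model: the page content stores glyph codes, not characters.
# These codes and the table below are invented for illustration.
glyph_codes = [0x01, 0x02, 0x03, 0x04]

# Hypothetical ToUnicode table for an embedded Arabic font subset.
to_unicode = {
    0x01: "\u0633",  # ARABIC LETTER SEEN
    0x02: "\u0644",  # ARABIC LETTER LAM
    0x03: "\u0627",  # ARABIC LETTER ALEF
    0x04: "\u0645",  # ARABIC LETTER MEEM
}


def extract_text(codes, cmap=None):
    """Turn glyph codes into text, with or without a ToUnicode-style map."""
    if cmap is None:
        # No table: fall back to treating each code as a character code.
        # This guess is what produces gibberish in the exported file.
        return "".join(chr(c) for c in codes)
    # With a table: look each code up, marking unknown codes with U+FFFD.
    return "".join(cmap.get(c, "\ufffd") for c in codes)


with_table = extract_text(glyph_codes, to_unicode)   # readable Arabic
without_table = extract_text(glyph_codes)            # control characters
```

The glyphs are drawn correctly on screen in both cases, because rendering only needs the glyph codes and the font; only extraction depends on the table.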

Again, it's a bad PDF file, and you need to complain to the author of the file.

There is one workaround that you have access to with Adobe Acrobat: You can save the complete document as TIFF images (File>Save as Other>Image>TIFF). Then import these images into Acrobat again (File>Create>Combine...). Now export again and, if necessary, choose to run OCR in the settings dialog. You may end up with better results. This definitely works for Western languages if the ToUnicode table is missing, but it may not work correctly for Arabic; I don't have any first-hand experience with this.
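
If you have many documents to convert, it may help to check the exported text automatically rather than by eye. The sketch below is one possible sanity check, not an Acrobat feature: it measures what share of the letters in a string are Arabic, using the Unicode character names from Python's standard library. The function name `arabic_ratio` is made up, and what ratio counts as "good" is something you would have to decide for your own files.

```python
import unicodedata


def arabic_ratio(text: str) -> float:
    """Return the fraction of alphabetic characters whose Unicode name
    starts with "ARABIC" (this includes Arabic presentation forms)."""
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        return 0.0
    arabic = [
        ch for ch in letters
        if unicodedata.name(ch, "").startswith("ARABIC")
    ]
    return len(arabic) / len(letters)
```

Cell text copied from a successful export of an Arabic document should give a ratio close to 1.0, while gibberish made of Latin or symbol characters should give a ratio close to 0.0.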



Karl Heinz Kremer   

I just tried what you suggested, saving as TIFF and then exporting again, and it still did not work. Is it supposed to be automatic, or is there some place where I choose the language?


H Taylor   

