Convert/Import from PDF and Keep the Formatting
April 10, 2007
Posted by Carthik in applications, guides, microsoft, office, ubuntu.
I have often wanted to convert a PDF file to an MS Word (.doc) file or an OpenOffice.org file. Usually I just copy the text from the PDF file and paste it into a new Word document, but this gets tiring quickly.
Recently I found a way to convert a PDF file to other formats, including .doc and .odt, that preserves the formatting of the text pretty well. The formatting is not perfectly preserved, but it is far better than having no formatting at all.
The secret goes by the name KWord. KWord is a KDE application with a PDF “import” feature that lets you import either an entire PDF document or just a few pages from one while preserving the formatting! Of course, this only works for PDF documents that are not scanned images of pages. I tried it out on files created using MS Word and OpenOffice. The font sizes in the imported document are larger than they need to be, but at least the headings are headings, the normal text is normal text, and the bullets are bullets!
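One way to tell up front whether a PDF is just scanned page images is to check whether any text can be extracted from it. The sketch below assumes the pdftotext tool (from the poppler-utils package), which is not part of KWord; has_text is a made-up helper name used only for illustration.

```shell
# has_text: succeeds if standard input contains at least one
# non-whitespace character. (Hypothetical helper, not part of KWord.)
has_text() {
    [ -n "$(tr -d '[:space:]')" ]
}

# Usage, assuming pdftotext is installed and document.pdf exists:
#   if pdftotext document.pdf - | has_text; then
#       echo "PDF has a text layer; importing should keep the formatting"
#   else
#       echo "PDF looks like scanned images; importing will not help"
#   fi
```

A PDF that passes this check is not guaranteed to import cleanly, but one that fails it almost certainly contains no text for KWord to import.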
After you select the PDF file to be imported, you will see a window like the one above where you can specify the pages you want to import. I did not change the default options; changing them and seeing what happens is an exercise left to the reader. 🙂
Of course, if you want to install KWord on your Ubuntu system, you can run the following command from the terminal window:
$ sudo apt-get install kword
Then you can launch KWord using:
$ kword
or by clicking the entry for KWord in the menu on your desktop.
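If you are not sure whether KWord is already installed, a small check like the following can save a step. This is only a sketch: check_tool is a hypothetical helper name, and it assumes the package and the command share the same name, as they do for kword.

```shell
# check_tool NAME: report whether a command called NAME is on the PATH.
# (Hypothetical helper; assumes the package is named like the command.)
check_tool() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "$1 is installed"
        return 0
    else
        echo "$1 not found; try: sudo apt-get install $1"
        return 1
    fi
}

# Usage: launch KWord only if it is available.
#   check_tool kword && kword
```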