Importing Word Documents

What's covered?

The Importing Word Documents topics have been completely revised to .

Importing a Word document into a RoboHelp HTML project is the subject of many postings on the Adobe RoboHelp forum. The most frequent issues are covered here.

Many years ago, M J Plaster wrote a topic about how to get a clean import using the import wizard in RoboHelp Version 2002. Many of the principles are still valid, so the topic has been reproduced here. With earlier versions, I used a different method that is also described. However, improvements have been made to the import wizards included in RoboHelp X5 and RoboHelp 6, and the methods to use with those versions are covered in separate topics.

The topic was completely revised to reflect findings when using the RoboHelp 6 wizard that are also relevant to earlier versions and has since been updated for RoboHelp 7.

What Goes Wrong?

Many of the problems seen with importing Word documents arise from the fact that, in versions before RoboHelp X5, the styles were created inline, and inline styles override the style sheet in use. (Cascading Styles are explained in more detail in the HTML topic.)
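To get a feel for how much inline styling an imported topic carries, a short script can list every element that has a style attribute. The following is a minimal Python sketch, not anything supplied with RoboHelp. It uses only the standard html.parser module, and the names InlineStyleFinder and find_inline_styles, along with the sample markup, are made up for the illustration.

```python
from html.parser import HTMLParser


class InlineStyleFinder(HTMLParser):
    """Collects (line, tag, style) for every element with a style attribute."""

    def __init__(self):
        super().__init__()
        self.hits = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; names arrive lower-cased.
        for name, value in attrs:
            if name == "style":
                line, _column = self.getpos()
                self.hits.append((line, tag, value))


def find_inline_styles(html):
    """Return a list of (line number, tag name, inline style) tuples."""
    finder = InlineStyleFinder()
    finder.feed(html)
    finder.close()
    return finder.hits


# Illustrative markup only: one paragraph styled by class, two inline styles.
sample = (
    '<p class="Normal">Plain text</p>\n'
    '<b style="font-weight: bold;">'
    '<span style="font-size: 20.0pt; font-family: Verdana;">Big</span></b>'
)

for line, tag, style in find_inline_styles(sample):
    print(line, tag, style)
```

Each entry gives the line number, the tag and the inline style, so you can judge whether that formatting should be left to the style sheet instead.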

Things changed with RoboHelp X5. When importing a Word document into RoboHelp X5 or RoboHelp 6, RoboHelp creates a style sheet which you can then change to your own and this does address many of the appearance problems. Some text does however still seem to end up with inline styles which override the style sheet and there are some other problem areas. Your options are described further on but regardless of version, the things that typically go wrong when importing from Word are:

Mixing Oil and Water

It is quite common for questions on the RoboHelp forum to show the poster's frustration that Word documents and HTML topics do not convert without problems. This applies to both the importing of Word documents and the creation of printed documentation from your topics. As my very good friend Rick Stone (RoboWizard) points out, you are mixing oil and water and that can be tricky!

How to Import

The method varies with the version of RoboHelp that you are using. Whichever version you have, I suggest you create a new RoboHelp project and import the document into that rather than mess up your carefully crafted project. Also, if you intend to create printed documentation later, try generating it early on, as problems often only show themselves when you create the printed output.

Versions Before X5

You can either follow M J Plaster's method for Version 2002 or follow the principles for other versions up to and including X4.

Alternatively you can use my method which I find considerably easier.

RoboHelp X5

Prior to X5, the style sheet was embedded into the imported topic and classes were created referencing the embedded style sheet. Whilst classes are still created, X5 puts the styles into a style sheet making switching to your own much easier if you follow some basics.

See Importing Using the X5 Wizard for what to do.

RoboHelp 6

The prime difference in RoboHelp 6 is that bulleted and numbered lists import correctly.

See Importing Using the RoboHelp 6 Wizard for what to do.

RoboHelp 7

Over the past few months I have had to import a number of Word documents into both RoboHelp 6 and 7. The initial attempts produced topics that were completely unacceptable and, in identifying the causes, I found out more about what is behind people's comments about the import not working. The reality was less about RoboHelp's shortcomings, as already covered, and more about importing poorly formatted documents. The documents I had been supplied with were very poor: text formatted to look like headings, multiple variants of list with different bullets and indents for no good reason, and tabbing used instead of indents, and that's just for starters. Typically these documents took me a whole day to clean up. So please remember, garbage in, garbage out.

By following the procedures described for RoboHelp 6 and cleaning up the documents in Word first, the changes needed in RoboHelp 7 after the import were minimal.

I never import Word documents directly into my projects, just in case anything goes wrong. I create a project just for the purpose of the import and get things how I want them in that project. Then I import the cleaned-up topics into my live project.

Remove the meta tags that reference the source document.

RoboHelp adds these meta tags when importing a Word document.

<meta name=Originator content=ImportDoc>
<meta name=OriginalFile content=Working_Copy_2008_Procedures.doc>

These can cause problems when generating a printed output.

1] I was seeing tables with some borders twice the correct width and other borders correct. Removing the meta tags fixed that issue.

2] One poster found that the topic footers were appearing in his printed output. Many people would like to be able to do that but, ironically, this poster did not. Also the topic footers appeared at the bottom of the body of each page, not in the Word footer.
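If there are many topics to clean, the two meta tags can be removed with a script rather than by hand. The sketch below is one possible approach in Python using a regular expression. The function name strip_import_meta is hypothetical, the pattern assumes the tags look like the two shown above (with or without quotes around the values), and it is best tried on a copy of the topic first.

```python
import re

# Matches <meta name=Originator ...> and <meta name=OriginalFile ...>,
# with or without quotes around the name value, plus trailing whitespace.
IMPORT_META = re.compile(
    r'<meta\s+name\s*=\s*["\']?(?:Originator|OriginalFile)\b[^>]*>\s*',
    re.IGNORECASE,
)


def strip_import_meta(html: str) -> str:
    """Return the topic source with the Word-import meta tags removed."""
    return IMPORT_META.sub("", html)


# Sample head section using the two tags quoted in this topic.
sample = (
    "<head>\n"
    "<meta name=Originator content=ImportDoc>\n"
    "<meta name=OriginalFile content=Working_Copy_2008_Procedures.doc>\n"
    "<title>Procedures</title>\n"
    "</head>"
)

print(strip_import_meta(sample))
```

Other meta tags are left alone because the pattern only matches the two names given.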

Remove any embedded style sheet created.

I found this in the topics created and, of course, it overrode what was in my style sheet.

<style><!--

A:visited {
color: #800080;
}
A:link {
color: #0000ff;
}
-->
</style>
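An embedded block like this can also be stripped with a script. The following Python sketch removes every style element from a topic's source while leaving a linked style sheet untouched. The function name strip_embedded_styles and the file name default.css are assumptions for the example. Note that it removes all embedded style blocks, so check that a topic has none you want to keep before running anything like it.

```python
import re

# Matches a whole <style>...</style> element, including any HTML comment
# wrapper inside it, and the whitespace that follows the closing tag.
EMBEDDED_STYLE = re.compile(
    r"<style\b[^>]*>.*?</style\s*>\s*",
    re.IGNORECASE | re.DOTALL,
)


def strip_embedded_styles(html: str) -> str:
    """Return the topic source with every embedded style block removed."""
    return EMBEDDED_STYLE.sub("", html)


# Sample head section; default.css is a hypothetical linked style sheet.
sample = (
    "<head>\n"
    '<link rel="stylesheet" href="default.css">\n'
    "<style><!--\n"
    "A:visited {\n"
    "color: #800080;\n"
    "}\n"
    "-->\n"
    "</style>\n"
    "</head>"
)

print(strip_embedded_styles(sample))
```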

Check the body tag

I found the language had been defined as below. I'm UK based so I removed the language reference.

<body lang=EN-US>
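Removing the language reference from the body tag can be scripted in the same way. This is a minimal Python sketch under the assumption that the attribute appears inside the body tag as shown above. The function name strip_body_lang is hypothetical, and lang attributes on other elements are deliberately left alone.

```python
import re

# The opening body tag, whatever attributes it carries.
BODY_TAG = re.compile(r"<body\b[^>]*>", re.IGNORECASE)

# A lang attribute with a double-quoted, single-quoted or unquoted value.
LANG_ATTR = re.compile(
    r"""\s+lang\s*=\s*(?:"[^"]*"|'[^']*'|[^\s>]+)""",
    re.IGNORECASE,
)


def strip_body_lang(html: str) -> str:
    """Return the topic source with lang removed from the body tag only."""
    return BODY_TAG.sub(lambda match: LANG_ATTR.sub("", match.group(0)), html)


print(strip_body_lang("<body lang=EN-US>"))
```

If you would rather set the language than remove it, the same pattern could be used to substitute a different value instead of an empty string.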

Check for unwanted code.

I found this at the end of my topics. To be honest, I don't know why it is there, but my hunch is that one day I'll regret leaving it in and find it is the cause of some problem.

<implicit_p><b style="font-weight: bold;"><span style="font-size: 20.0pt; font-family: Verdana;"><br
style="clear: all;" clear=all>
</span></b>
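Before deciding what to do about code like this, it helps to know which topics contain it. The Python sketch below only reports occurrences of the implicit_p tag and changes nothing. The function names are invented for the example, and it assumes the topics are .htm files somewhere under the project folder.

```python
import re
from pathlib import Path

# Looks for the opening <implicit_p> tag seen at the end of imported topics.
IMPLICIT_P = re.compile(r"<implicit_p\b", re.IGNORECASE)


def count_implicit_p(html: str) -> int:
    """Return how many times the implicit_p tag opens in the topic source."""
    return len(IMPLICIT_P.findall(html))


def topics_with_implicit_p(project_dir: str) -> dict:
    """Map each .htm file under project_dir to its implicit_p count (if any)."""
    counts = {}
    for path in sorted(Path(project_dir).rglob("*.htm")):
        text = path.read_text(encoding="utf-8", errors="replace")
        found = count_implicit_p(text)
        if found:
            counts[str(path)] = found
    return counts


# Illustrative topic ending, based on the code quoted above.
sample = '<p>Last paragraph</p>\n<implicit_p><b style="font-weight: bold;"></b>'
print(count_implicit_p(sample))
```

Calling topics_with_implicit_p on the folder of a test project gives a list of topics to inspect by hand.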

Images may not show in Project Manager

Importing the images into a new project, I found they were in the topics but not shown in Project Manager. By looking with Windows Explorer, I found they were in a folder within the project. The folder had the name of the source document, Working_Copy_2008_Procedures, but that folder was not shown in Project Manager.

I added a new folder named Working_Copy_2008_Procedures and suddenly the images appeared!
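A quick way to see which folders on disk hold images, so that they can be compared with what Project Manager shows, is to walk the project folder with a script. This Python sketch is an illustration only. The function name image_folders, the list of image extensions and the file names in the demonstration are all assumptions, and the demonstration runs against a temporary folder rather than a real project.

```python
import tempfile
from collections import defaultdict
from pathlib import Path

# Assumed set of image types; adjust to whatever your topics use.
IMAGE_SUFFIXES = {".gif", ".jpg", ".jpeg", ".png", ".bmp"}


def image_folders(project_dir):
    """Map each folder (relative to project_dir) to the image files in it."""
    root = Path(project_dir)
    folders = defaultdict(list)
    for path in root.rglob("*"):
        if path.is_file() and path.suffix.lower() in IMAGE_SUFFIXES:
            folders[str(path.parent.relative_to(root))].append(path.name)
    return {folder: sorted(names) for folder, names in sorted(folders.items())}


# Demonstration with a throwaway folder laid out like the case described above.
with tempfile.TemporaryDirectory() as demo:
    folder = Path(demo) / "Working_Copy_2008_Procedures"
    folder.mkdir()
    (folder / "image1.gif").write_bytes(b"")  # hypothetical image file
    (Path(demo) / "topic.htm").write_text("<p>topic</p>")
    found = image_folders(demo)

print(found)
```

Any folder in the result that does not appear in Project Manager is a candidate for the fix described above.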

 

Date Changes to this page

04 Apr 2008 Section on RoboHelp 7 added.
13 Apr 2007 Topic completely revised to include RoboHelp 6 and other findings.
05 Jul 2005 "What's covered" revised to include reference to a new topic covering the X5 import wizard. Associated revisions in text. No fundamental changes.
25 Jun 2005 Paragraph added re the class=InLineNormal which is created during import. Also minor typo and clarification changes.

26 Feb 2005 Most common import problems listed and topic amended to cover changes in RoboHelp X5.
14 Jan 2005 Link added to HTML topic.
02 Nov 2004 New topic.