XMLmind Word To XML Manual

1 Introduction

2 Installing w2x

2.1 Contents of the installation directory

3 Alternatives to using the w2x command-line utility

3.1 The w2x-app graphical application

3.2 The “Word To XML” add-on for XMLmind XML Editor

3.2.1 Installing the “Word To XML” add-on

3.3 The “Word To XML” servlet

3.3.1 Contents of the servlet software distribution

3.3.2 Installing the servlet

3.3.3 Configuring the servlet

3.3.4 Using the servlet to convert DOCX files

3.3.5 Non interactive requests

4 Getting started with w2x

4.1 How to generate useful multi-page HTML

5 Going further with w2x

5.1 Stock XED scripts

6 Customizing the output of w2x

6.1 Customizing the XHTML+CSS files generated by w2x

6.1.1 Using a XED script to modify the styles embedded in the XHTML+CSS file

6.1.2 Appending custom styles to the styles embedded in the XHTML+CSS file

6.1.3 Using an external CSS file rather than embedded CSS styles

6.1.4 Combining all the above methods

6.2 Customizing the semantic XML files generated by w2x

6.2.1 Converting custom character styles to semantic tags

6.2.2 Converting custom paragraph styles to semantic tags

6.2.3 The general case

6.3 Generating XML conforming to a custom schema

6.4 Packaging your customization as a w2x plugin

6.4.1 Anatomy of a plugin

6.4.2 Registering a plugin with w2x

7 The w2x command-line utility

7.1 Variables substituted in the parameter values passed to the –p and –pu options

7.2 Default conversion steps

7.3 Automatic conversion step parameters

8 Conversion step reference

8.1 Convert step

8.2 Delete files step

8.3 Edit step

8.4 EPUB step

8.5 Load step

8.6 Save step

8.7 Split step

8.8 Transform step

8.9 Web Help step

9 Embedding w2x in a Java™ application

9.1 Extension points

9.1.1 Custom conversion step

9.1.2 Custom image converters

9.1.2.1 Specifying an external image converter

9.1.2.2 Controlling how image files found in the input DOCX file are converted to standard formats

10 Limitations and implementation specificities

10.1 About tab stops

Index