To use w2x to convert a DOCX input file to an XML output file conforming to your custom schema, you only need to write a custom XSLT 1.0 stylesheet that converts the “semantic” XHTML 1.0 Transitional generated by the Edit step to your custom schema.
Let’s call your custom XSLT 1.0 stylesheet “C:\Users\John\foo\xsl\xhtml_to_foo.xsl”. The command-line tool w2x must then be passed the following options:
-c
Execute a Convert step called “convert”.
-e XED_URL_or_file
Execute an Edit step called “edit”.
Example: -e w2x:xed/main.xed. This passes the stock XED script (which converts the styled XHTML 1.0 Transitional created by the Convert step to “semantic” XHTML) to the conversion step called “edit”.
-t XSLT_URL_or_file
Execute a Transform step called “transform”.
Example: -t "C:\Users\John\foo\xsl\xhtml_to_foo.xsl". This passes your custom XSLT 1.0 stylesheet to the conversion step called “transform”.
The stock XED script w2x:xed/main.xed creates a number of semantic XHTML elements having a class attribute starting with “role-”. Examples: <div class="role-section1">, <div class="role-section2">, <div class="role-figure">, <div class="role-figcaption">, <a class="role-footnote-ref">, <div class="role-footnote">, <a class="role-xref">, <span class="role-index-term">, etc. To learn how to process these elements, the simplest approach is to look at how this is done in a stock XSLT stylesheet such as “w2x_install_dir/xslt/topic.xslt” or “w2x_install_dir/xslt/docbook.xslt”.
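As a rough illustration of what such a custom stylesheet might look like, here is a minimal sketch of xhtml_to_foo.xsl. It rests on several assumptions that are not taken from the w2x documentation: the target elements (<document>, <section>, <figure>, <caption>, <para>) belong to an invented “foo” vocabulary, the semantic XHTML is assumed to be in the standard XHTML namespace, and each class attribute is assumed to contain exactly one role name. The stock stylesheets remain the reference for how these elements are actually processed.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<!-- Hypothetical xhtml_to_foo.xsl. The output vocabulary is invented
     for illustration; replace it with the elements of your own schema. -->
<xsl:stylesheet version="1.0"
                xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
                xmlns:h="http://www.w3.org/1999/xhtml"
                exclude-result-prefixes="h">

  <xsl:output method="xml" encoding="UTF-8"/>

  <!-- Root: wrap the content of the XHTML body in a <document> element. -->
  <xsl:template match="/h:html">
    <document>
      <xsl:apply-templates select="h:body/node()"/>
    </document>
  </xsl:template>

  <!-- Sections. Assumption: @class holds exactly one role name. -->
  <xsl:template match="h:div[@class = 'role-section1']">
    <section level="1">
      <xsl:apply-templates/>
    </section>
  </xsl:template>

  <xsl:template match="h:div[@class = 'role-section2']">
    <section level="2">
      <xsl:apply-templates/>
    </section>
  </xsl:template>

  <!-- Figures and their captions. -->
  <xsl:template match="h:div[@class = 'role-figure']">
    <figure>
      <xsl:apply-templates/>
    </figure>
  </xsl:template>

  <xsl:template match="h:div[@class = 'role-figcaption']">
    <caption>
      <xsl:apply-templates/>
    </caption>
  </xsl:template>

  <!-- Plain paragraphs. -->
  <xsl:template match="h:p">
    <para>
      <xsl:apply-templates/>
    </para>
  </xsl:template>

  <!-- Fallback: drop the tags of any other XHTML element but keep
       processing its content, so that no text is lost. -->
  <xsl:template match="h:*">
    <xsl:apply-templates/>
  </xsl:template>

</xsl:stylesheet>
```

The fallback template has a lower default priority than the more specific ones, so it only applies to elements that no other template matches. If a class attribute can carry several space-separated values, the exact comparison would need to be replaced by a token test such as contains(concat(' ', normalize-space(@class), ' '), ' role-section1 ').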