XMLmind Ebook Compiler (ebookc for short) is a free,
open source tool which can
turn a set of HTML (or Markdown) pages into a self-contained
ebook[1]. Supported output formats are: EPUB, Web
Help, PDF[2], RTF, WML, DOCX
(MS-Word) and ODT
(OpenOffice/LibreOffice)[3].
You can of course use
ebookc to create books having a simple structure like novels,
but this tool also has all the features needed to create large, complex,
reference manuals:
Builds on topic-oriented structuring like DITA or DocBook 5.1. (Each source HTML page is expected to
deal with a single topic.)
Automatic generation of global and local table of contents.
Automatic generation of a “back-of-the-book index”.
Automatic numbering of parts, chapters, appendices, sections,
figures, tables, examples and equations.
Automatic creation of links between some user-specified book
divisions.
Automatic generation of text in cross-references.
Footnote support.
Conditional processing (also called
profiling).
Built-in support of XInclude (allows reuse of content at
different locations in the book).
Being based on HTML, ebookc relies on CSS to create nicely
formatted books and this, even for output formats like PDF and DOCX which
are not directly related to HTML and CSS.
All in all,
ebookc is an authoring and publishing tool nearly as
powerful as DITA or DocBook and their advanced conversion toolkits, but
being based on HTML and on CSS, it is much easier to learn, use and
customize. Moreover you can create with it ebooks which are more interactive
(audio, video, slide shows, multiple-choice questions, etc)
than those created using DITA or DocBook.
The
basic idea is simple. You author a set of HTML pages and then you create an
ebook specification assigning a role —part, chapter, section, appendix, etc—
to each page. Example: primer/book1.ebook:
The HTML pages comprising a book may contain anything
you want including CSS styles and links between the pages (e.g.
<a href="ch2.html#fig1">). However make sure that this
content is valid XHTML[4].
Once the ebook
specification has been created, you can compile it using XMLmind Ebook
Compiler and generate EPUB, Web
Help, PDF[5], RTF, ODT, DOCX[6],
etc. Examples:
If you look at out/book1.pdf,
you'll see that chapter and appendix titles are numbered and that these
titles are copied verbatim from the html/head/title of the
corresponding input HTML page.
It's of course possible to specify how
book components should be numbered (if at all). It's also possible to
replace the plain text titles of chapters and appendices by “rich”
titles[7] by adding ebook:head child
elements to the book divisions. Example: primer/book2.ebook:
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
<bookxmlns="http://www.xmlmind.com/schema/ebook"xmlns:html="http://www.w3.org/1999/xhtml"href="titlepage.html"appendixnumber="A%1."><frontmatter><toc/></frontmatter><chapterhref="ch1.html"/><chapterhref="ch2.html"><head><title>“<html:em>Rich</html:em>” title of
second chapter</title></head></chapter><appendixhref="a1.html"/></book>
The content of a ebook:head element
specified this way is added to the html/head of the
corresponding output HTML page, except for the ebook:title
element which replaces html/head/title.
§ Assembling a book division rather than referencing
an external file
We have already seen that it's possible to add a
ebook:head child to elements like book[8],
chapter, appendix, etc. Likewise, it's also
possible to add a ebook:body child to any book division.
Example: primer/book3.ebook:
<bookxmlns="http://www.xmlmind.com/schema/ebook"xmlns:html="http://www.w3.org/1999/xhtml"appendixnumber="A%1"><head><title>Title of this sample book</title></head><body><contenthref="titlepage.html"/></body><frontmatter><toc/></frontmatter><chapterhref="ch1.html"/><chapterhref="ch2.html"><head><title>“<html:em>Rich</html:em>” title of
second chapter</title></head></chapter><appendixhref="a1.html"/></book>
In the above example, the content of the
html/body element of file titlepage.html is
“pulled” and added to the book. Several ebook:content child
elements are allowed in an ebook:body element.
When you
generate multi-page HTML (e.g. Web Help) out of an ebook specification, it
may be important to specify the names of the generated pages. It may also be
useful to group several consecutive book divisions into the same output
page.
This is specified using the pagename and
samepage attributes of any book division. Example: primer/book4.ebook:
<bookxmlns="http://www.xmlmind.com/schema/ebook"xmlns:html="http://www.w3.org/1999/xhtml"appendixnumber="A%1"><head><title>Title of this sample book</title></head><body><contenthref="titlepage.html"/></body><frontmatter><toc/><sectionhref="intro.html"pagename="the introduction"/></frontmatter><chapterhref="ch1.html"><sectionhref="s1.html"><sectionhref="s2.html"samepage="true"/></section></chapter><chapterhref="ch2.html"><head><title>“<html:em>Rich</html:em>” title of
second chapter</title></head></chapter><appendixhref="a1.html"/></book>
By default, each book division is created in its own
file and the name of this file comes the href attribute of the
book division. Web Help example:
ebookc -f webhelp book4.ebook out/book4
Without attribute pagename="the introduction", the
introduction would have been generated in file
out/book4/intro.html. With this attribute, the introduction
is generated in file
"out/book4/the introduction.html".
Without attribute samepage="true", the second section
would have been generated in its own file
out/book4/s2.html. With this attribute, the second section
is appended to file out/book4/s1.html, also containing
first section.
§ But wait a minute… HTML has not enough
elements to write books
That's right, some semantic elements like
admonitions, footnotes, etc, found in larger XML vocabularies like DITA or DocBook are missing from XHTML5. However, it's easy to
emulate these missing elements by defining semantic values for the
class attribute of standard HTML elements (typically
span and div).
XMLmind Ebook Compiler has
special support for the following semantic class names:
Semantic class
Description
<figure class="role-equation">
A “displayed equation” having a title
(figcaption).
<figure class="role-example">
An example —for example a code snippet— having a title
(figcaption).
<pre class="role-listing-c-1">
A code listing, possibly featuring line numbering and syntax
coloring (class name suffix "-c-1" means: C language,
first line number is 1).
...
<figureclass="role-equation"><figcaption>Figure containing
an equation</figcaption><div><mathdisplay="block"xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>E</mi><mo>=</mo><mrow><mi>m</mi><mo></mo><msup><mi>c</mi><mn>2</mn></msup></mrow></mrow></math></div></figure>
...
<p>Short footnote<spanclass="role-footnote">Content of
short footnote.</span>.
...
<p>Simplest index term<aclass="role-index-term">Cat</a>.
Other index term<aclass="role-index-term">Cat<spanclass="role-term">Siamese</span></a>...</p>
...
<lof/> specifies that a List of Figures is to
be generated as a front matter. <lot/> means: List of
Tables. <lox/> means: List of Examples.
<loe/> means: List of Equations.
This is caused by the fact that all
the source HTML pages referenced by book5.ebook do not specify
any CSS style.
It's a good practice to keep it this way because this
allows separation of presentation and content. However, you'll want to
create nice books, so the simplest and cleanest is to add CSS styles to the
ebook specification (and not to each input HTML page).
If you do it
like this:
1
2
3
4
5
6
7
8
9
<bookxmlns="http://www.xmlmind.com/schema/ebook"xmlns:html="http://www.w3.org/1999/xhtml"appendixnumber="A%1"><head><title>Title of this sample book</title><html:linkhref="css/styles.css"rel="stylesheet"type="text/css"/></head>
...
The above specification would not work because only the
title page would get styled.
You need to use a headcommon
element for that. The child elements of headcommon are
automatically copied the html/head of all output HTML pages.
Excerpts from primer/book6.ebook:
Element ebook:head may contain, not only
ebook:title, but also any of the HTML elements allowed in
html/head, namely style, script,
meta, link. This facility is used here to give
a specific style to the title page.
Unlike <blockquote class="role-note"> for
example, which is found in the source HTML page,
<div class="role-book-title-div"> and
<h1 class="role-book-title"> are elements
generated by XMLmind Ebook Compiler.
Knowing about these
elements is required to be able to give nice looks to the generated
book. These elements and their class names are all listed in template/skeleton.css, with suggested
CSS styles for some of these elements.
As of version 1.4, the easiest way to add CSS styles
to an ebook specification is to set attribute
includebasestylesheet of element book to
"true". This very simple setting guarantees to effortlessly
create a nicely formatted book.
More precisely , attribute
includebasestylesheet="true" instructs ebookc to
include the
ebookc_install_dir/xsl/common/resources/base.css stock
CSS stylesheet in all the output HTML pages.
In the following
example, we not only use base.css, but we also customize most
of its colors by including a custom stylesheet called
custom_colors.css:
The CSS styles specified in the ebook specification and in the
source HTML pages are also used when generating output formats like PDF,
RTF, DOCX, even if these formats are not directly related to HTML and
CSS.
However in this case, CSS 2.1 support is partial. While there are no
restrictions related to the use of CSS selectors, only the most basic CSS properties are
supported. For example, generated content (e.g. :before) and floats are not supported at all.
There are two
ways to work around this limitation:
Use simpler CSS styles when targeting output formats like PDF, RTF,
DOCX. This is done using @media screen and @media print[9] rules. This is done in primer/css/styles.css:
Without XSLT stylesheet
parameter use-note-icon=yes, admonitions in
out/book6.pdf would have no icons.
Such parameter is
not needed when generating Web Help (like EPUB, an HTML+CSS-based output
format) because admonition icons are specified in CSS stylesheet primer/css/styles.css.
An book is specified as an assembly of source HTML pages.
If you want to reuse some of these HTML pages to author other books, it is
recommended to avoid creating links (e.g.
<a href="ch2.html#fig1">) between these
pages.
Fortunately, there is a simple way to create links between book
divisions, which is using the ebook:related element. Excerpts
from primer/book7.ebook:
...
<chapterhref="ch1.html"xml:id="ch1"><relatedids="ch1 ch2 a1"relation="See also"/><sectionhref="s1.html"><sectionhref="s2.html"samepage="true"/></section></chapter><chapterhref="ch2.html"xml:id="ch2"><head><title>“<html:em>Rich</html:em>” title of
second chapter</title></head><relatedids="ch1 ch2 a1"relation="See also"/></chapter><appendixhref="a1.html"xml:id="a1"><relatedids="ch1 ch2 a1"relation="See also"/></appendix>
...
See links automatically generated in first chapter, second
chapter and first appendix by running for example:
ebookc -f webhelp book7.ebook out/book7
§ Conditionally excluding some content from the generated
book
This feature called conditional processing or
profiling has many uses, the most basic one being to include or
exclude some content depending on the chosen output format. For example,
some source HTML pages may contain interactive content (e.g. a feedback
form) and this interactive content simply cannot be rendered in PDF or
DOCX.
In order to conditionally exclude some content from the
generated book, you must first “mark” the conditional sections using
data-* attributes. Excerpts from primer/book8.ebook:
1
2
3
4
5
...
<backmatterdata-output-format="docx odt pdf rtf wml"><index/></backmatter>
...
...
<blockquoteclass="role-tip"data-output-format="epub html webhelp"><p>This document is also available in PDF ... format.</p></blockquote>
...
You may specify one or more conditional processing
data-* attribute on any element. Choose the attribute names you
want. Such conditional processing data-* attribute may contain
one or more values separated by space characters. Choose the attribute
values you want.
If you generate a single HTML page by
running:
ebookc book8.ebook out/book8_no_profile.html
the
marked sections will not be excluded because XMLmind Ebook Compiler
does not associate any special meaning to attribute
data-output-format. However if you run:
ebookc -p profile.output-format html book8.ebook out/book8.html
then
file out/book8.html will not have an index. Option
"-p profile.output-format html" reads as: unless an element has
no data-output-format attribute or has a
data-output-format attribute containing "html",
exclude this element from the generated content.
If you run:
ebookc -p profile.output-format pdf book8.ebook out/book8.pdf
then
the introduction will not contain the tip about the availability of the
document in PDF format.
An ebook
specification is compiled using a command-line tool called
ebookc.
Run ebookc_install_dir/bin/ebookc.bat on Windows and
ebookc_install_dir/bin/ebookc on the Mac and on
Linux.
XMLmind
XSL-FO Converter Evaluation Edition (download page) generates output containing
random duplicate letters. This makes this edition useless for
any purpose other than evaluating XMLmind XSL-FO Converter. Of course,
this does not happen with XMLmind XSL-FO Converter Professional
Edition!
Example, convert this manual
to PDF using a copy of RenderX XEP installed in "C:\xep\":
To avoid
specifying options -xep and -xfc each time you run
ebookc, the simplest if to create once for all an ebookc.options
file in the
ebookc user preferences directory. This directory is:
$HOME/.ebookc/ on Linux.
$HOME/Library/Application
Support/XMLmind/ebookc/ on the Mac.
%APPDATA%\XMLmind\ebookc\ on Windows.
Example:
C:\Users\john\AppData\Roaming\XMLmind\ebookc\.
In addition to HTML, an ebook page may be written in Markdown. However for this to work, the file extension of the page written in Markdown must be md, markdown, mdown, mkdn, mdwn, mkd, rmd, text or txt.
The encoding of a Markdown file is, by default, the system encoding (e.g. window-1252 on a Western PC).
If you want to explicitly specify the encoding of a Markdown file, please save your file with a UTF-8 or UTF-16 BOM (Byte Order Mark) or add an encoding directive inside a comment anywhere at the beginning of your file. Example:
<!-- -*- coding: iso-8859-1 -*- -->
Heading
=======
## Sub-heading
Paragraphs are separated
by a blank line.
The above example should work fine because ebookc understands the GNU Emacs file variable called coding.
Out of the box, the Markdown parser is configured to support the commonmark 0.28 “Markdown dialect” plus all the following extensions:
However, thanks to the flexmark-java software component used by ebookc to implement Markdown support, all this can be configured by passing some load.markdown.XXX options to ebookc.
For example, pass
-p load.markdown.profile GITHUB
-p load.markdown.less-extensions true
-p load.markdown.gfm-tasklist true
to ebookc in order to parse the Github-flavored Markdown dialect and to enable a minimal set of extensions plus the task lists syntax extension.
Supported “Markdown dialects” are COMMONMARK, COMMONMARK_0_26, COMMONMARK_0_27, COMMONMARK_0_28, FIXED_INDENT, KRAMDOWN, MARKDOWN, GITHUB_DOC, GITHUB, MULTI_MARKDOWN, PEGDOWN, PEGDOWN_STRICT. See Markdown Processor Emulation.
Parameter -p load.markdown.less-extensions true is a shorthand parameter instructing ebookc to reset its extensions to the following minimal set of extensions:
Converts plain text abbreviations (e.g. IBM) to <abbr> elements.
This Markdown syntax extension is enabled by default. In order to disable it, pass parameter -p load.markdown.abbreviation false to ebookc.
Example:
The HTML specification is maintained by the W3C.
*[HTML]: Hyper Text Markup Language
*[W3C]: World Wide Web Consortium
is converted to:
<p>The <abbrtitle="Hyper Text Markup Language">HTML</abbr> specification
is maintained by the <abbrtitle="World Wide Web Consortium">W3C</abbr>.</p>
which is rendered as:
The HTML specification is maintained by the W3C.
Admonitions
Syntax for creating admonitions such as notes, tips, warnings, etc.
This Markdown syntax extension is enabled by default. In order to disable it, pass parameter -p load.markdown.admonition false to ebookc.
After the "!!!" tag, the admonition type must be one of "note", "attention","caution", "danger", "fastpath", "important", "notice", "remember", "restriction", "tip","trouble", "warning".
A note example not having a title:
!!! note ""
Support is limited to bug reports.
is converted to:
<blockquoteclass="role-note"><p>Support is limited to bug reports.</p></blockquote>
which is rendered as:
Support is limited to bug reports.
A tip example having a title:
!!! tip "How do you do a hard reboot on an iPad?"
Press and hold both the **Home** and **Power** buttons
until your iPad® reboots.
You can release both buttons when you see Apple® logo.
is converted to:
<blockquoteclass="role-tip"><h4class="role-admonition-title">How do you do a hard reboot on an iPad?</h4><p>Press and hold both the <strong>Home</strong> and <strong>Power</strong>
buttons until your iPad® reboots.</p><p>You can release both buttons when you see Apple® logo.</p></blockquote>
which is rendered as:
How do you do a hard reboot on an iPad?
Press and hold both the Home and Power buttons until your iPad® reboots.
You can release both buttons when you see Apple® logo.
Attributes
Syntax for adding attributes to the generated HTML elements:
If an {...} specification is separated by space characters from some plain text (e.g. some plain text {...}) then the attributes are added to the parent element of the text.
This Markdown syntax extension is enabled by default. In order to disable it, pass parameter -p load.markdown.attributes false to ebookc.
Example:
The *circumference { .first-term }* is the length of one circuit along the
circle, or the distance around the circle. {#circumference title="See
https://en.wikipedia.org/wiki/Circle"}
is converted to:
<pid="circumference"title="See https://en.wikipedia.org/wiki/Circle">The <emclass="first-term">circumference</em> the length of one circuit along the
circle, or the distance around the circle.</p>
which is rendered as:
The circumference is the length of one circuit along the circle, or the distance around the circle.
Pitfall
By default, heading IDs are not “rendered” in HTML (which is somewhat counterintuitive). You must pass -p load.markdown.renderer.RENDER_HEADER_ID true
to ebookc get them “rendered”.
Automatic links
Turns plain text URLs and email addresses into <a href="..."> elements.
This Markdown syntax extension is disabled by default. In order to enable it, pass parameter -p load.markdown.autolink true to ebookc.
Example:
Please send your bug reports to ebookc-support@xmlmind.com, a public,
moderated, mailing list. More information in
http://www.xmlmind.com/ebookc/support.html.
is converted to:
<p>Please send your bug reports to <ahref="mailto:ebookc-support@xmlmind.com">ebookc-support@xmlmind.com</a>, a
public, moderated, mailing list. More information in <ahref="http://www.xmlmind.com/ebookc/support.html"
>http://www.xmlmind.com/ebookc/support.html</a>.</p>
Syntax for creating definition lists, that is <dl>, <dt> and <dd> elements.
This Markdown syntax extension is enabled by default. In order to disable it, pass parameter -p load.markdown.definition false to ebookc.
Example:
Glossary:
LED
: Light emitting diode.
ABS
: Antilock braking system.
ESC
ESP
: Electronic stability control, also known as Electronic Stability Program.
: On motorcycles, ESC/ESP is called *Traction Control*.
> Ducati was one of the first to introduce a true competition-level
> traction control system (**DTC**) on a production motorcycle.
EBA
: Emergency brake assist.
is converted to:
<p>Glossary:</p><dl><dt>LED</dt><dd><p>Light emitting diode.</p></dd><dt>ABS</dt><dd><p>Antilock braking system.</p></dd><dt>ESC</dt><dt>ESP</dt><dd><p>Electronic stability control, also known as Electronic Stability
Program.</p></dd><dd><p>On motorcycles, ESC/ESP is called <em>Traction
Control</em>.</p><blockquote><p>Ducati was one of the first to introduce a true
competition-level traction control system (<strong>DTC</strong>)
on a production motorcycle.</p></blockquote></dd><dt>EBA</dt><dd><p>Emergency brake assist.</p></dd></dl>
which is rendered as:
Glossary:
LED
Light emitting diode.
ABS
Antilock braking system.
ESC
ESP
Electronic stability control, also known as Electronic Stability Program.
On motorcycles, ESC/ESP is called Traction Control.
Ducati was one of the first to introduce a true competition-level traction control system (DTC) on a production motorcycle.
EBA
Emergency brake assist.
Remember that:
The leading ":" character of a definition must be followed by one or more space characters.
Terms must be separated from the previous definition by a blank line.
A blank line is not allowed between two consecutive terms.
A blank line is allowed before a definition.
Footnotes
Syntax for creating footnotes and footnote references.
This Markdown syntax extension is enabled by default. In order to disable it, pass parameter -p load.markdown.footnotes false to ebookc.
Example:
The differences between the programming languages C++[^1] and Java can be
traced to their heritage.
[^1]: The C++ Programming Language by Bjarne Stroustrup.
C++[^1] was designed for systems and applications programming, extending the
procedural programming language C[^2].
[^2]: The C Programming Language by Brian Kernighan and Dennis Ritchie.
Originally published in 1978.
is converted to:
<p>The differences between the programming languages
C++<aclass="role-footnote-ref"href="#__FN1"></a> and Java can be traced to
their heritage.</p><divclass="role-footnote"id="__FN1"><p>The C++ Programming Language by Bjarne Stroustrup.</p></div><p>C++<aclass="role-footnote-ref"href="#__FN1"></a> was designed for systems
and applications programming, extending the procedural programming language
C<aclass="role-footnote-ref"href="#__FN2"></a>.</p><divclass="role-footnote"id="__FN2"><p>The C Programming Language by Brian Kernighan and Dennis
Ritchie.</p><p>Originally published in 1978.</p></div>
which is rendered as:
The differences between the programming languages C++[11] and Java can be traced to their heritage.
C++[11] was designed for systems and applications programming, extending the procedural programming language C[12].
Ins
Converts tagged text "++something new++" to <ins>something new</ins>, which is rendered as: something new
This Markdown syntax extension is enabled by default. In order to disable it, pass parameter -p load.markdown.ins false to ebookc.
Strikethrough and subscript
Converts
tagged text "~~something deleted~~" to <del>something deleted</del>, which is rendered as: something deleted
tagged text "~a subscript~" to <sub>a subscript<sub/>, which is rendered as: a subscript
This Markdown syntax extension is enabled by default. In order to disable it, pass parameter -p load.markdown.gfm-strikethrough false to ebookc.
Superscript
Converts tagged text "^a superscript^" to <sup>a superscript</sup>, which is rendered as: a superscript
This Markdown syntax extension is enabled by default. In order to disable it, pass parameter -p load.markdown.superscript false to ebookc.
Media tags
Converts prefixed links to audio, embed, picture and video HTML5 elements.
!A[Text](links) - audio. Links is one or more links separated by character “|”.
<p>Audio example: <audiocontrols=""title="Falcon calling"><sourcesrc="media/falcon.mp3"type="audio/mpeg"><sourcesrc="media/falcon.wav"type="audio/wav">
Your browser does not support the audio element.
</audio>.</p>
which is rendered as:
Audio example: .
Video example:
Video example: !V[Funny big bunny](media/bbb.mp4).
is converted to:
<p>Video example: <videocontrols=""title="Funny big bunny"><sourcesrc="media/bbb.mp4"type="video/mp4">
Your browser does not support the video element.
</video>.</p>
which is rendered as:
Video example: .
This Markdown syntax extension is enabled by default. In order to disable it, pass parameter -p load.markdown.media-tags false to ebookc.
Tables
Converts pipe "|" delimited text to <table> elements.
This Markdown syntax extension is enabled by default. In order to disable it, pass parameter -p load.markdown.tables false to ebookc.
"'" to apostrophe ’, which is rendered as in word: "don’t"
"..." and ". . ." to ellipsis …, which are both rendered as: …
"--" to en dash –, which is rendered as: –
"---" to em dash —, which is rendered as: —
single quoted 'some text' to ‘some text’, which is rendered as: ‘some text’
double quoted "some text" to “some text”, which is rendered as: “some text”
double angle quoted <<some text>> to «some text», which is rendered as: «some text»
This Markdown syntax extension is enabled by default. In order to disable it, pass parameter -p load.markdown.typographic false to ebookc.
If you don’t want some of the above plain text sequences to be processed, specify:
-p load.markdown.typographic.ENABLE_QUOTES false
Do not process single quotes, double quotes, double angle quotes.
-p load.markdown.typographic.ENABLE_SMARTS false
Do not process "'", "...", ". . .", "--", "---".
YAML front matter
Syntax for adding metadata to the generated HTML document, that is, for adding <head>/<title> and/or <head>/<meta> elements.
These metadata are specified by key/value pairs written using a subset of the YAML (see also http://yaml.org/) syntax.
This Markdown syntax extension is enabled by default. In order to disable it, pass parameter -p load.markdown.yaml-front-matter false to ebookc.
Example:
---
title: The C Programming Language
author:
- Brian W. Kernighan
- Dennis Ritchie
description: |
One of the best-selling programming books published
in the last fifty years, "K&R" has been called everything
from the "bible" to "a landmark in computer science" and
it has influenced generations of programmers.
date: 1988-03-22
---
is converted to:
<head><title>The C Programming Language</title><metacontent="Brian W. Kernighan"name="author" /><metacontent="Dennis Ritchie"name="author" /><metacontent="One of the best-selling programming books published
in the last fifty years, "K&R" has been called
everything from the "bible" to
"a landmark in computer science" and it has
influenced generations of programmers."name="description" /><metacontent="1988-03-22"name="date" /></head>
Other extensions
The following Markdown syntax extensions are also supported:
anchorlink
aside
emoji
enumerated-reference
gfm-issues
gfm-tasklist
gfm-users
macros
toc
wikilink
youtube-embedded
All the above extensions are disabled by default. In order to enable an extension, pass parameter -p load.markdown.EXTENSION_NAME true to ebookc. For example: -p load.markdown.emoji true
Any extension listed in this section may be parameterized by passing parameter -p load.markdown.EXTENSION_NAME.PARAMETER_NAME PARAMETER_VALUE[13] to ebookc. Examples:
With the above emoji parameters, ":smile:" is rendered as:
More generally, the Markdown parser (pseudo EXTENSION_NAME is "parser") and the HTML renderer (pseudo EXTENSION_NAME is "renderer") may also be parameterized this way. For example, automatically generate an ID for all headings not already having an ID and “render” all heading IDs in HTML[14]: -p load.markdown.renderer.GENERATE_HEADER_ID true -p load.markdown.renderer.RENDER_HEADER_ID true.
More information about extensions and their parameters in Extensions (flexmark-java is the software component used by ebookc to parse Markdown and convert it to HTML).
XMLmind Ebook Compiler automatically copies all the resources
referenced by the ebook specification and the input HTML pages to the output
directory in order to create a self-contained deliverable. Creating
self-contained deliverables is generally desirable and for some output
formats, like EPUB, this is really required.
For example, if you run
(single-page HTML output format):
ebookc doc.ebook out/doc.html
all
the resources of doc.ebook are copied to
out/doc_files/.
Other example, if you run:
ebookc -f webhelp doc.ebook out/webhelp/
all
the resources of doc.ebook are copied to
out/webhelp/_res/.
What is a resource?
By
default, XMLmind Ebook Compiler considers that any file [15]
referenced by the ebook specification or an input HTML page using a
relative URI is a resource. This is generally the case of images,
audio and video files, CSS stylesheets, scripts files referenced by the
ebook specification and the input HTML pages.
In this example, image
"cc-by-sa.png" is obviously a resource and file
"creativecommons.html" not being an input HTML page, is also
considered to be a resource:
1
2
3
4
5
<p>All the above tutorials are licensed under the
<a href="creativecommons.html"><img src="cc-by-sa.png"
alt="CC BY-SA"/>Creative Commons License</a>,
which means that everyone is welcome to distribute, modify, translate, etc,
any of them.</p>
How to specify "not a resource; do not copy
it and keep its relative URI as is"?
The automatic resource
management of ebookc may be turned off globally by setting option
proc.ignoreresources to "true".
If you
want to specify that only some of the resources of an ebook are external and
as such, should not be processed by ebookc, please use
value "external-resource"
for standard attribute rel (HTML link elements have this
attribute);
proprietary attribute data-external-resource for
elements like img which do not have attribute
rel.
Example:
1
2
3
4
5
6
<p>All the above tutorials are licensed under the
<ahref="creativecommons.html"rel="external-resource"><imgsrc="cc-by-sa.png"alt="CC BY-SA"data-external-resource=""/>Creative Commons License</a>,
which means that everyone is welcome to distribute, modify, translate, etc,
any of them.</p>
In the above example, files
"cc-by-sa.png" and "creativecommons.html" are not
processed as resources.
Option
externalresourcebase may be used to specify an absolute or
relative URI to be prepended to external resources having a relative URI.
Example:
-p proc.externalresourcebase "../../samples/".
How
to specify "this is a resource too; copy it to the output
directory"?
By default, XMLmind Ebook Compiler considers that any
file referenced by the ebook specification or an input HTML page using an
absolute URI is not a resource. Example:
1
2
3
4
5
6
<p>All the above tutorials are licensed under the
<a href="https://creativecommons.org/creativecommons.html">
<img src="https://creativecommons.org/cc-by-sa.png"
alt="CC BY-SA"/>Creative Commons License</a>,
which means that everyone is welcome to distribute, modify, translate, etc,
any of them.</p>
In the above example, files
"https://creativecommons.org/creativecommons.html" and
"https://creativecommons.org/cc-by-sa.png" are not processed as
resources.
If you want to specify that some files having absolute URIs
are in fact resources and as such, should be processed by
ebookc, please use
value "resource" for standard attribute rel (HTML link elements have this
attribute);
proprietary attribute data-resource for elements like
img which do not have attribute rel.
Example:
1
2
3
4
5
6
7
<p>All the above tutorials are licensed under the
<ahref="https://creativecommons.org/creativecommons.html"rel="resource">
<imgsrc="https://creativecommons.org/cc-by-sa.png"data-resource="" alt="CC BY-SA"/>Creative Commons License</a>,
which means that everyone is welcome to distribute, modify, translate, etc,
any of them.</p>
In the above example, files
"https://creativecommons.org/creativecommons.html" and
"https://creativecommons.org/cc-by-sa.png" are processed as
resources.
Sub-resources
In the following example, files
"styles.css", "creativecommons.html" and
"cc-by-sa.png" are automatically processed as
resources:
1
2
3
4
5
6
7
8
9
10
11
...
<head>
...
<link href="css/styles.css" rel="stylesheet" type="text/css" />
</head>
...
<p>All the above tutorials are licensed under the
<a href="creativecommons.html"><img src="cc-by-sa.png"
alt="CC BY-SA"/>Creative Commons License</a>,
which means that everyone is welcome to distribute, modify, translate, etc,
any of them.</p>
Moreover, if file
"creativecommons.html" contains XHTML —that is, can be parsed
by XMLmind Ebook Compiler— its resources are processed too as if
"creativecommons.html" were an input HTML page.
This is
also the case for resource "styles.css". The resources found in
a CSS stylesheet (e.g. file "texture.png" in
"background-image: url(images/texture.png);" or file
"core_styles.css" in "@import
url(lib/core_styles.css);") are automatically detected and processed
by XMLmind Ebook Compiler.
However, if she/he finds this clearer, the
ebook author may also explicitly specify the sub-resources of CSS
stylesheets using extra link elements in the headcommon of the
ebook specification or in the head of an input HTML page.
Example:
1
2
3
4
5
6
7
8
9
10
11
12
...
<head>
...
<link href="css/images/" rel="resource" type="inode/directory" />
<linkhref="css/styles.css"rel="stylesheet"type="text/css" /></head>
...
<p>All the above tutorials are licensed under the
<ahref="creativecommons.html"><imgsrc="cc-by-sa.png"alt="CC BY-SA"/>Creative Commons License</a>,
which means that everyone is welcome to distribute, modify, translate, etc,
any of them.</p>
Notice attribute rel="resource" which
makes even clearer the purpose of this link. Also notice
type="inode/directory"
which is needed because "css/images/" is a folder and not a
simple file.
XMLmind Ebook Compiler can conditionally exclude some contents found in the ebook specification or in the input HTML
pages. To put this feature into use, the ebook author must:
Specify one or more data-* attributes on the elements
to be conditionally excluded. Examples:
data-edition="complete", data-output-format="docx odt
pdf rtf wml".
These data-* attributes are often called
profiling attributes because they are used to define several
profiles for the same document.
It's up to the ebook author to
choose the names and allowed values for the profiling
attributes.
The ebook author may allow only a single value for a
given profiling attribute. Example: attribute data-edition
may contain only a single value, one of "complete" or
"abridged".
Or, on the contrary, the ebook author
may allow a given profiling attribute to contain several values
separated by space characters. Example: attribute
data-output-format may contain one or more of
"docx", "epub", "frameset",
"html ", "odt", "pdf",
"rtf", "webhelp", "wml".
Pass one or more profile.* parameters to the
ebookc command-line option. These profile.*
parameters must match the chosen profiling attributes. Example:
-p profile.edition abridged-p profile.output-format pdf.
Note that unless you pass a
profile.* parameter, the corresponding data-*
attribute is not given any special meaning by XMLmind Ebook Compiler.
For example, without -p profile.output-format VALUE,
attribute data-output-format is considered to be just an
ordinary attribute.
How some elements are conditionally excluded by XMLmind Ebook
Compiler is best explained by an example:
<p>See YouTube demo:</p><pdata-edition="complete"data-output-format="epub frameset html webhelp"><iframesrc="https://www.youtube.com/embed/6MgZBZ4XHzU"height="360"width="640"></iframe></p><pdata-edition="complete"data-output-format="docx odt pdf rtf wml"><imgsrc="images/YouTube_play_icon.svg"alt="..."/><ahref="https://youtu.be/6MgZBZ4XHzU"target="_blank">https://youtu.be/6MgZBZ4XHzU</a>.</p>
For an
element to be excluded, suffice for a single profiling attribute to
be “excluded”. A profiling attribute data-X is
“excluded” if none of the values it contains matches a value contained in
the profile.X parameter passed to
ebookc.
For example, with
-p profile.edition complete-p profile.output-format pdf, the embedded video
1
2
3
<pdata-edition="complete"data-output-format="epub frameset html webhelp"><iframesrc="https://www.youtube.com/embed/6MgZBZ4XHzU"height="360"width="640"></iframe></p>
is excluded
because despite the fact that data-edition="complete" is
“included”, data-output-format="epub frameset html webhelp" is
“excluded”.
Other examples, if you pass ebookc
no profile.* parameter at all, the above example will
contain both the embedded video and the YouTube link to the video.
-p profile.edition abridged, the above example will
contain neither the embedded video nor the YouTube link to the
video.
-p profile.edition complete, the above example will
contain both the embedded video and the YouTube link to the video.
-p profile.output-format epub, the above example will
contain just the embedded video.
-p profile.output-format pdf, the above example will
contain just the YouTube link to the video.
-p profile.edition abridged-p profile.output-format pdf, the above example will
contain neither the embedded video nor the YouTube link to the
video.
-p profile.edition complete
-p profile.output-format pdf, the above example will contain just
the YouTube link to the video.
-p profile.edition complete-p profile.output-format "epub pdf", the above example will
contain both the embedded video and the YouTube link to the video.
XMLmind Ebook Compiler has good support for
transclusion, that is the ability to include contents found in an
input HTML page into another input HTML page. This feature is implemented
using a standard mechanism called XInclude.
Example,
"page1.html" contains paragraph having
id="notice":
1
2
<pid="notice"class="important">Interest rates are subject
to fluctuation without notice.</p>
Because this paragraph has an
id, it's possible to include it in
"page2.html":
1
2
3
4
5
6
<p>Paragraph found in page2.html.</p><xi:includehref="page1.html"xpointer="notice"xmlns:xi="http://www.w3.org/2001/XInclude" />[16]<p>Other paragraph found in page2.html.</p>
The corresponding output
HTML page will then contain:
1
2
3
4
5
6
<p>Paragraph found in page2.html.</p><pid="notice"class="important">Interest rates are subject
to fluctuation without notice.</p><p>Other paragraph found in page2.html.</p>
Note
that transclusion works fine not only between two input HTML pages, but
also:
within the same input HTML page (see example below),
between two ebook specifications,
within the same ebook specification.
However transclusion does not work between an input HTML page and
an ebook specification.
Example 4-2. Transclusion works fine
within the same input HTML page
1
2
3
4
5
6
7
<pid="notice"class="important">Interest rates are subject
to fluctuation without notice.</p>... ELSEWHERE in page1.html ...<xi:includehref=""xpointer="notice"xmlns:xi="http://www.w3.org/2001/XInclude" />
Notice
href="" to refer to the same file.
Transclusion
is most often used between the input HTML pages and a “utility HTML page”
which is not an input HTML page but which contains useful
“snippets”.
Example, excerpts from
"snippets.html":
1
2
3
4
5
6
7
8
<ul><li><spanid="ebookc">XMLmind Ebook Compiler</span>.</li><li><spanid="xxe">XMLmind XML Editor</span>.</li><li><ahref="http://www.xmlmind.com/"id="xmlmind"target="_blank">XMLmind</a>.</li></ul>
Now, including snippets in an input HTML page:
1
2
3
4
5
6
7
8
9
<p><xi:includehref="snippets.html"xpointer="ebookc"xmlns:xi="http://www.w3.org/2001/XInclude" /> is free, open source software
developed by <xi:includehref="snippets.html"xpointer="xmlmind"xmlns:xi="http://www.w3.org/2001/XInclude" />.</p><p><xi:includehref="snippets.html"xpointer="xxe"xmlns:xi="http://www.w3.org/2001/XInclude" /> is a commercial product
developed by <xi:includehref="snippets.html"xpointer="xmlmind"xmlns:xi="http://www.w3.org/2001/XInclude" />.</p>
Make sure to have a Java™ 1.8+ runtime
installed on your machine. To check this, please open a command window and
type "java -version" followed by Enter. You should
get something looking like this:
C:\> java -version
openjdk version "23.0.1" 2024-10-15
OpenJDK Runtime Environment (build 23.0.1+11-39)
OpenJDK 64-Bit Server VM (build 23.0.1+11-39, mixed mode)
Installation
Simply
unzip ebookc-X_Y_Z.zip in any directory.
After
that, you can run command-line utility ebookc by simply executing
ebookc_install_dir/bin/ebookc.bat
(ebookc_install_dir/bin/ebookc on the Mac and on
Linux).
Contains the ebookc command-line utility. Use shell script
ebookc on Linux and on the Mac. Use ebookc.bat
on Windows.
doc/, doc/index.html
Contains the documentation of XMLmind Ebook Compiler.
docsrc/, docsrc/manual.xml
Contains the documentation of XMLmind Ebook Compiler in
ebookc format. File docsrc/manual.ebook
contains an ebook specification. You may want to use this sample ebook
specification to experiment with the ebookc command-line
utility.
LEGAL/, LEGAL.txt
Contains legal information about XMLmind Ebook Compiler and about
third-party components used in XMLmind Ebook Compiler.
lib/*.jar
All the Java™ class libraries needed to run XMLmind Ebook Compiler.
For example, lib/ebookc.jar contain the code of XMLmind
Ebook Compiler.
plus/
This directory is present only in the case of the
ebookc-X_Y_Z-plus-fop.zip distribution. It contains
most recent Apache FOP (including hyphenation and MathML
support). This
XSL-FO processor is automatically declared and thus, ready to be used to
generate PDF or PostScript.
schema/
Contains a W3C XML schema and a Schematron
which may be used to check an ebook specification for validity. File
schema/catalog.xml contains an XML
catalog which points to
these schemas.
src/, src/build.xml
Contains the Java™ source code of XMLmind Ebook Compiler.
src/build.xml is an ant build file which allows to rebuild
lib/ebookc.jar.
Your source HTML pages must contain valid[17] XHTML
(“plain HTML” cannot be parsed by ebookc) and
preferablyvalid
XHTML5, because ebookc anyway generates
XHTML5 markup.
The html root element must have 1
head and 1 body child elements. The
head child element must have 1 title child
element.
If
you plan to use XMLmind XML Editor as your ebook authoring tool, do not forget to add attribute
class="role-ebook-page" to the
html root element of your source HTML pages. XMLmind XHTML
Editor detects this attribute and this will cause the editor to replace its
stock "XHTML" menus and toolbars by extended "XHTML" menus and
toolbars.
You may use headings (h1, h2,
h3, etc) normally, without worrying about the role as a book
division (chapter, section, etc) that will be given to your input HTML
page.
By default, book attribute adjustuserheadings="true!article"
specifies that the levels of your headings are to be automatically adjusted
(except inside article elements, which are considered to be
independent, self-contained content) to make them consistent with the level
of the title of the book division.
The above input HTML is referenced as a subsection of a
chapter in the book. Therefore the title of the subsection is represented by
an h3 element. The output HTML page containing the subsection
then looks like:
1
2
3
4
5
6
7
8
<sectionclass="role-section2"><h3class="role-section2-title">Troubleshooting</h3><h4>Symptoms</h4>
...
<h5>Intermittent symptoms</h5>
...
<h4>Most common causes</h4>
...
If you want to prevent this from happening, please add attribute
adjustuserheadings="false" to your root book
element or add a class attribute to some or all of your
headings. A heading having a class attribute is understood by
XMLmind Ebook Compiler as being “not an ordinary heading which could be
modified”.
An example is a figure element which has a
class attribute containing "role-example". This kind of
figure is listed in the "List of Examples" (that is,
bookelement
lox) only if it also has a figcaption child element. Example:
1
2
3
4
5
6
7
8
9
10
11
<figureclass="role-example"><figcaption>"Hello World" program in the C language</figcaption><pre>/* Hello World */
#include <stdio.h&ght;
int main()
{
printf("Hello World\n");
return 0;
}</pre></figure>
is rendered as:
Example 6-1. "Hello World" program in the C
language
/* Hello World */
#include <stdio.h>
int main()
{
printf("Hello World\n");
return 0;
}
A program
listing can have its lines automatically numbered and/or can feature syntax
highlighting. This is done by adding
"role-listing-NUMBER-LANGUAGE-tabWIDTH" to the
class attribute of a pre element. Options NUMBER,
LANGUAGE, tabWIDTH, may be specified in any
order. Moreover some or all of these options may be omitted.
NUMBER, a strictly positive integer, specifies
the number of the first line of the program listing. This option may be
omitted if you don't want automatic line numbering.
LANGUAGE, one of (case-insensitive):
"bourne" (or "shell" or "sh"),
"c", "cmake" (or "make" or
"makefile"), "cpp", "csharp",
"css21" (or "css"), "delphi",
"ini", "java", "javascript",
"lua", "m2" (Modula 2),
"perl", "php", "python",
"ruby", "sql1999", "sql2003",
"sql92" (or "sql"), "tcl",
"upc" (Unified Parallel C),
"html", "xml", specifies the programming
language or markup language used in the program listing. This option may
be omitted if you don't want syntax highlighting.
tabWIDTH where WIDTH is a positive
integer, specifies whether tab characters should be expanded to a number
of space characters. WIDTH is the maximum number of space
characters for an expanded tab character, hence this value specifies the
location of “tab stops”. Example:
<pre class="role-listing-1-java-tab4"> means expand
tabs to up to 4 space characters in this line-numbered Java listing.
Other example: <pre class="role-listing-tab0-shell">
means: do not replace tabs in this Bourne shell listing. When
tabWIDTH is omitted, it is equivalent to having an
implicit tab8.
Example 1 (in the following C program, lines are indented using
tab characters):
1
2
3
4
5
6
7
8
<preclass="role-listing-1-c-tab4">/* Hello World */
#include <stdio.h&ght;
int main()
{
printf("Hello World\n");
return 0;
}</pre>
Note that in addition to replacing tab characters by a number of
space characters, the tabWIDTH facility also removes the
space characters which are common to the beginning of all text lines. That
is, it removes the superfluous “indentation” in the program listing, if
any.
Moreover, the tabWIDTH facility also removes
the (useless) space characters found just before a newline
character.
See example 2 below in which the indentation is
automatically removed.
Example 2 (implicit
role-listing-1-tab8; first line " /tmp/"
starts with 4 space characters):
An example is a figure element which has a
class attribute containing "role-equation". This kind of
figure is listed in the "List of Equations" (that is,
bookelement
loe) only if it also has a figcaption child element. Example:
Few
web browsers natively support MathML, so it's
recommended to add a link to the MathJax script to
your input HTML pages containing equations[18]. This typically done as
follows (this loads latest 3.x version of the MathJax mml-chtml
component):
An admonition, that is, a warning, a tip, a notice, etc, is a blockquote element which has a
class attribute containing
"role-ADMONITION", where
role-ADMONITION is one of the following class
names:
Important! Be aware of this before doing anything else.
role-fastpath
This note will speed you on your way.
role-important
This note is important.
role-notice
Indicates a potential situation which, if not avoided, might
result in an undesirable result or state.
role-remember
Don't forget to do what this note says.
role-restriction
You can't do what this note says.
role-tip
This is a fine little tip.
role-trouble
Provides information about how to remedy a trouble
situation.
role-warning
Indicates a potentially hazardous situation.
Example:
1
2
3
4
5
6
7
<blockquoteclass="role-important"><h4>How to check your oil</h4><p>You need to check your car’s oil before any long trip
to avoid major damage.</p><p>The process for how to check your oil is simple and involves
using the dip stick to see levels and test quality.</p></blockquote>
is rendered as:
How to check your oil
You need to check
your car’s oil before any long trip to avoid major damage.
The process
for how to check your oil is simple and involves using the dip stick to see
levels and test quality.
This first and simplest form for a
footnote is a span element which has a class
attribute containing "role-footnote".
Example:
1
2
3
<p>Yoko<spanclass="role-footnote">Written with kanji <i>ko</i>, meaning
"child". The syllable <i>ko</i> is not generally found at the end of
masculine names.</span> is a Japanese feminine given name.</p>
When you need a footnote to contain
paragraphs, lists or tables or when you need to reuse the same footnote at
different locations in your document, you'll have to use the second, more
general, form for a footnote.
This second form is a div element which has a class
attribute containing "role-footnote" and an id attribute.
Moreover, you'll also have to insert an a element at the location where you want
the footnote marker to be displayed. This a element, which
points to the footnote div, must have a class
attribute containing "role-footnote-ref".
Example:
1
2
3
4
5
6
<p>Yoko<aclass="role-footnote-ref"href="#ko"></a>is a Japanese
feminine given name.</p><divclass="role-footnote"id="ko">Written with kanji <i>ko</i>,
meaning "child". The syllable <i>ko</i> is not generally found
at the end of masculine names.</div>
No need to specify the text of a link when this link points to a
book division (chapter, section, etc) or to a table, figure, example, or equation
having a caption.
Example, the
following empty links point respectively to section
"Admonitions" and to table "Admonition classes"
found in this section:
The text which is automatically
generated for these empty links may be configured using attribute
xreflabels of element book.
Links
specified using attribute data-xml-id-ref
It's also
possible to create links using the a element and proprietary attribute
data-xml-id-ref rather than (or in addition to) standard
attribute href.
Attribute data-xml-id-ref
must contain the value of the xml:id
attribute of a book division found in the ebook specification. This
allows the creation of links to locations that do not exist in the input
HTML pages, but which will be created in the output HTML
pages.
Example, <a data-xml-id-ref="ch04"/> points
to the following chapter:
In input HTML page "ch4/s2.html", you
may refer to the first section of the chapter by writing
<a href="s1.html"/>. But how to refer to the chapter
itself? Notice that this chapter has no input HTML page to refer
to.
The solution to this problem is to add proprietary attribute
data-xml-id-ref to an a element. For the above
example, it's <a data-xml-id-ref="ch04"/>.
Note
that writing <a href="s1.html" data-xml-id-ref="ch04"/>
is an even better option because href="s1.html" is used as a
fallback link target in case xml:id="ch04" is not defined in
the ebook specification.
Creating index terms by hand (other
than copying an index term to paste it elsewhere) is tedious and error
prone. It's strongly recommended to use the specialized dialog box of XMLmind XML
Editor to do that.
Figure 6-1. The "Edit index term" dialog box of XMLmind
XML Editor
An index term is represented by
a a element having attribute
class="role-index-term" containing
text —the primary word or phrase in an index term— and possibly nested span elements having the following roles:
"role-term", "role-see", "role-see-also".
"Rich text" means the mix of text and phrase elements
(b, i, em, etc) allowed in
a and span elements.
Though the grammar allows
<span class="role-term"> to be nested to an arbitrary
depth, a <a class="role-index-term"> may contain only
up two nested <span class="role-term">,
corresponding respectively to the secondary word and tertiary word of an
index term. The same limit applies to
<span class="role-see"> and to
<span class="role-see-also">.
Examples:
Simplest index term containing just a phrase:
<aclass="role-index-term">Dog, man's best friend</a>
End of the above "dogs" range. The end of a range must
be found after the corresponding start of range in the same
input HTML page or in a different input HTML page:
Specifies the content of a book division (part, chapter, section, etc).
When the
parent of body is element book then
body specifies the content of the “title page” of the
book.
It's possible for a book division to have both an
href attribute and a body child element. In such
case, the content “pulled” using the href attribute is inserted
before the content specified by the body child
element.
Other attributes:XHTML5 global attributes, including any attribute having
a name starting with "data-".
adjustuserheadings
If set to true, change the level of user-specified
headings (h1, h2, h3, etc) to be
consistent with the level of automatically generated headings. If set to
false, do not change any user-specified headings.
Example:
where input HTML file "s01_01.html" starts with a
user-specified h1.
With
adjustuserheadings="false", output HTML file
"nested_section.html" contains:
1
2
3
4
5
<sectionclass="role-section2>
<h3 class="role-section2-title">Title of the section copied
from "s01_01.html"<h3><h1>User-specified heading found in "s01_01.html"</h1>
...
With adjustuserheadings="true", output HTML file
"nested_section.html" contains:
1
2
3
4
5
<sectionclass="role-section2>
<h3 class="role-section2-title">Title of the section copied
from "s01_01.html"<h3><h4>User-specified heading found in "s01_01.html"</h4>
...
Note that
adjustuserheadings="true" has no effect on headings having
a class attribute. A heading having a user-specified
class attribute is understood by XMLmind Ebook Compiler as
being “not an ordinary heading which could be
modified”.
Value
"true" of attribute adjustuserheadings may be
followed by "!" and a list of exceptions. An exception is
either the local name of an HTML element (e.g. article,
aside) or the local name of an HTML element followed by a
dot and a value[21] of the
class attribute (e.g. blockquote.role-warning,
blockquote.role-*). When this is the case, the level of
user-specified headings is not changed inside specified
elements.
Note that the default value of attribute
adjustuserheadings is "true!article" and not
simply "true" because article elements are
considered to be independent, self-contained content.
appendicestocdepth
If set to an integer larger than 0, instructs ebookc to
automatically generate a Table of Contents (TOC) having
specified depth at the beginning of the appendices division of the
book.
appendixnumber
Specifies the format of the number automatically added to the title
of an appendix. See Number format.
appendixtocdepth
If set to an integer larger than 0, instructs ebookc to
automatically generate a Table of Contents (TOC) having
specified depth at the beginning of each appendix of the book.
booklistlabels
Specifies the kind of numbered book divisions (part,
chapter, appendix, section) and
numbered figure objects (figure, table, equation, example) for which to
add labels. This option applies to book list entries
(toc, lof, lot, loe, lox).
A label is a localized message
containing the type of the book division or figure object. For example,
with chapternumber="%1", labelseparator=") ",
booklistlabels="none", a TOC entry for a
chapter looks like: "1) Introduction". With
booklistlabels="chapter" (or
booklistlabels="all"), this TOC entry looks
like: "Chapter 1) Introduction".
Note that labels are added
only to numbered book divisions or figure objects. For example,
with chapternumber="%1", booklistlabels="", a
TOC entry for a chapter will look like:
"Introduction".
chapternumber
Specifies the format of the number automatically added to the title
of a chapter. See Number format.
chaptertocdepth
If set to an integer larger than 0, instructs ebookc to
automatically generate a Table of Contents (TOC) having
specified depth at the beginning of each chapter of the book.
equationnumber
Specifies the format of the number automatically added to the
caption of an equation. See Number
format.
examplenumber
Specifies the format of the number automatically added to the
caption of an example. See Number
format.
figurenumber
Specifies the format of the number automatically added to the
caption of an figure. See Number
format.
footnotenumber
Specifies the format of the number automatically added to footnotes
(<span class="role-footnote"> or
<div class="role-footnote">) and footnote callouts
(<a class="role-footnote-ref">).
includebasestylesheet
If set to "true", include
ebookc_install_dir/xsl/common/resources/base.css in
all the output HTML pages.
Using the base.css stock CSS stylesheet is the
simplest, easiest, mean to create a nicely formatted book. More
information about this attribute in Leveraging
base.css, the stock CSS stylesheet.
When includebasestylesheet="true",
base.css is included before the other CSS
stylesheets referenced in the headcommon (if
any).
If you want to control where base.css is
included, do not set includebasestylesheet to
"true", instead add a headcommon similar to the
one in the following example:
The "ebookc-home:" prefix works
because stock XML catalogebookc_install_dir/schema/catalog.xml
contains: <rewriteURI uriStartString="ebookc-home:" rewritePrefix="../"/>.
Specifies the string which is appended to the label
automatically generated at the beginning of the title of a book division
(part, chapter, appendix,
section) or figure object (figure, table, equation,
example). Example: with labelseparator=") ", the output
HTML element generated for the following chapter is:
<chapter href="ch01.html">
is:
1
2
3
4
5
<sectionclass="role-chapter>
<h1 class="role-chapter-title"><span class="role-label">Chapter
<span class="role-number">1</span>) </span>Title of the chapter copied from "ch01.html"<h1>
partnumber
Specifies the format of the number automatically added to the title
of a part of the book. See Number
format.
parttocdepth
If set to an integer larger than 0, instructs ebookc to
automatically generate a Table of Contents (TOC) having
specified depth at the beginning of each part of the book.
preventlonelyheading
If set to true, prevent an output HTML page from
containing only a title. Example:
With preventlonelyheading="false", output HTML page
"output_directory/chapter1.html" contains just the
title of the chapter "First chapter", which may be surprising for
the reader of the book.
With
preventlonelyheading="true", output HTML page
"output_directory/chapter1.html" contains the title
of the chapter "First chapter" and also the content of input HTML
page "s01.html"[22].
section1number
Specifies the format of the number automatically added to the title
of a top level section. See Number
format.
section2number
Specifies the format of the number automatically added to the title
of a section having a nesting level equal to 2 (subsection of a top
level section). See Number format.
section3number
Specifies the format of the number automatically added to the title
of a section having a nesting level equal to 3. See Number format.
section4number
Specifies the format of the number automatically added to the title
of a section having a nesting level equal to 4. See Number format.
section5number
Specifies the format of the number automatically added to the title
of a section having a nesting level equal to 5. See Number format.
section6number
Specifies the format of the number automatically added to the title
of a section having a nesting level equal to 6. See Number format.
section7number
Specifies the format of the number automatically added to the title
of a section having a nesting level equal to 7. See Number format.
section8number
Specifies the format of the number automatically added to the title
of a section having a nesting level equal to 8. See Number format.
section9number
Specifies the format of the number automatically added to the title
of a section having a nesting level equal to 9. See Number format.
tablenumber
Specifies the format of the number automatically added to the
caption of an table. See Number
format.
titlelabels
Specifies the kind of numbered book divisions (part,
chapter, appendix, section) and
numbered figure objects (figure, table, equation, example) for which to
add labels. This option
applies to titles or captions.
For example, with
chapternumber="%1", labelseparator=") ",
titlelabels="none", the title of a chapter looks like:
"1) Introduction". With titlelabels="chapter" (or
titlelabels="all"), this title looks like: "Chapter
1) Introduction".
tocdepth
Specifies the depth of the main Table of Contents (TOC)
(see toc element).
xml:lang
Specifies the main language of the book. This language is used to
automatically generate some titles (e.g. "Table of Contents",
"List of Figures") and also to sort index entries.
Unlike lang, which is a
XHTML5 global attribute, xml:lang is not copied to
the output HTML element corresponding to the book
element.
However, explicitly setting attribute
xml:lang on the book element is a convenient
way to ensure that all the output HTML pages have a lang
attribute.
xreflabels
Specifies the kind of numbered book divisions (part,
chapter, appendix, section) and
numbered figure objects (figure, table, equation, example) for which to
add labels. This option
applies to automatically generated link text.
For example, with
chapternumber="%1", labelseparator=") ",
xreflabels="none", the text automatically generated for
empty link to chapter <a href="intro.html"/> looks
like: "1) Introduction". With xreflabels="chapter"
(or xreflabels="all"), this text looks like: "Chapter
1) Introduction".
With
xreflabels="chapter-number", this text looks like:
"Chapter 1", that is, no chapter title, just the label without
any label separator. Note that this "-number" suffix is
supported only by xreflabels.
Instructs
XMLmind Ebook Compiler to copy to the output HTML page all the elements
found in the html:body of the input HTML page pointed to
by the href attribute.
Specifies the content of the html:head element of an output HTML
page.
By default, this html:head element is simply a copy
of the html:head element found in the content “pulled” using
the href attribute of a book division. But when a
head child element of a book division is specified,
Its title child element is used to specify the html:title of the output HTML
page.
All its other child elements and also all its XHTML5 global attributes are copied to the
html:head of the output HTML page.
Other attributes:XHTML5 global attributes, including any attribute having
a name starting with "data-".
override
When set to true, the child elements and XHTML5 global
attributes found in the head element completely
replace the child elements and XHTML5 global attributes found in
the html:head element of an input HTML page.
When set to
false, the child elements and XHTML5 global attributes
found in the head element are merged with the child
elements and XHTML5 global attributes found in the
html:head element of an input HTML page.
Instructs XMLmind Ebook Compiler to automatically generate a List
of Figures (LOF).
A plain figure listed
in the LOF is a html:figure having a html:figcaption and no class attribute or a class
attribute not containing "role-equation" or
"role-example".
Instructs XMLmind Ebook Compiler to generate a list of links.
The targets of
these links are the book divisions (part, chapter, section, etc) having an
xml:id attribute referenced in the ids attribute
of the related element.
The default title of this list of
links is "Related information". A different title (e.g. "See
also") may be specified in attribute
relation.
Specifies the IDs of the related book divisions (part, chapter,
section, etc). Redundant IDs found in this list are ignored.
relation
Specifies the title of the automatically generated list of links. By
default, it's "Related information" translated to the language of
the parent element of the automatically generated list of links.
Specifies the “rich” title of a book division (part, chapter, section, etc).
Content
model
Element title can contain text and the same XHTML5
child elements as an html:p element (that is,phrasing
content: html:b, html:img, etc) in any order and in any
number.
Specifies the location of an input HTML file. This file must contain
valid XHTML5
(more information in Section 6.1.
Valid XHTML5). The specified
URL may not have a fragment identifier (e.g. something like
href="ch09.html#conclusion" is not supported).
pagename
Specifies the base name without any extension of an output HTML
file. By default, this name is the same as the name of the corresponding
input HTML file. Example:
By
default, without attribute pagename, the page generated for
the above chapter would be
output_directory/intro.html.
After setting
pagename to "introduction", the page generated
for the above chapter is
output_directory/introduction.html.
samepage
Specifies that the book division (e.g. a section) is to be generated
in the same output HTML file as its parent book division (e.g. a
chapter). By default, all book divisions are generated by
ebookc in their own HTML files. Example:
Attribute samepage="true" instructs
ebookc to generate the content of the chapter and the
content of the first section in the same HTML file. The second section
having an implied samepage="false" is created in its own
HTML file.
It allows the inclusion of ebook elements using XInclude. In the preceding example,
related element "rel1" is defined in first
chapter. In the following example, a copy of related
element "rel1" is included in the second chapter:
The html element of the output page containing
the chapter will have id="going_further". All the
elements “pulled” from "ch3.html" will have their
IDs prefixed with "going_further__".
The section element containing the section will
have id="requirements". All the elements “pulled”
from "ch3/s1.html" will have their IDs prefixed
with "requirements__".
Referencing the value of an xml:id attribute in
proprietary attribute data-xml-id-ref may be used to create links to locations that do not exist
in the input HTML pages, but which will be created in the output
HTML pages. Example:
In input HTML page "ch4/s2.html", you
may refer to the first section of the chapter by writing
<a href="s1.html"/>. But how to refer to the
chapter itself? Notice that this chapter has no input HTML page to
refer to.
The solution to this problem is to add proprietary
attribute data-xml-id-ref to an a element. For the above example,
it's <a data-xml-id-ref="ch04"/>.
Note that
writing
<a href="s1.html" data-xml-id-ref="ch04"/> is an
even better option because href="s1.html" is used as a
fallback link target in case xml:id="ch04" is not
defined in the ebook specification.
These attributes (e.g. class, dir,
lang, onclick, style) are copied
to the output HTML element corresponding to the book division. Example:
the output HTML element corresponding to the following appendix:
The Processor is the main component of XMLmind Ebook Compiler. It
processes an ebook specification referencing a number of valid XHTML5
pages. It generates processed valid XHTML5 pages and generally also, a
subdirectory (called "_res/" by default) containing all the
resources referenced by the processed pages.
Whatever the file layout
of the input HTML pages and their resources, all the files and
directories are always created in a single output directory, which makes
this output directory self-contained.
In addition to the processed
pages, the Processor automatically creates an HTML page (called
"_toc_frame.html" by default) containing a table of
contents and the manifest of all the resources found in the resource
directory (in the form of
<link href="XXX" rel="resource" type="YYY"/>
elements).
The Processor also automatically creates an HTML page
(called "_frameset.html" by default) containing a frameset. The only purpose of this
frameset is to be able to quickly navigate the output of
the Processor when testing and debugging.
Generating a single HTML page out of an ebook
specification does not involve any further processing steps. The
Processor is simply instructed to generate a single page and files
"_toc_frame.html" and "_frameset.html" are
discarded.
Generating an EPUB file requires transforming
"_toc_frame.html" by the means of the
xsl/epub/epub.xsl stylesheet and then archiving[23] the contents of
the output directory.
Generating a Web Help requires transforming
"_toc_frame.html" by the means of the
xsl/webhelp/webhelp.xsl stylesheet and then processing the
contents of the output directory using XMLmind
Web Help Compiler.
Generating PDF, DOCX, ODT, etc, requires first
generating an intermediate format called XSL-FO. This is
done by the means of the xsl/fo/fo.xsl stylesheet. After
that, it's up to an XSL-FO processor — Apache
FOP, RenderX
XEP or Antenna House Formatter for the PostScript and
PDF formats, XMLmind XSL-FO Converter for the RTF, WML, DOCX
and ODT formats— to create the output file.
The CSS styles specified in the ebook specification
and in the source HTML pages are also used when generating output
formats based on XSL-FO. However for this to work, these CSS styles need
to be translated to directly usable XSL-FO properties (see apply-css-styles) and stored in
processing-instructions (<?css-styles?>) prior to be
transformed by the xsl/fo/fo.xsl stylesheet. This
preparatory step is implemented by the "CSS to XSL-FO properties"
component depicted in the above figure.
Converts
specified ebook input file and saves the result of the conversion to
specified output file or directory.
An ebook input file may be specified using its
URL or its filename.
Output formats webhelp,
html and frameset require
output_file_or_directory to be a directory. Other output formats
require output_file_or_directory to be a file.
The output
directory is created if it does not already exist.
Some options
have both a short name and a long name. Example: -p is
equivalent to -param.
-pparam_nameparam_value
-paramparam_nameparam_value
Specifies a conversion parameter, generally an XSLT stylesheet
parameter.
"profile." parameters
A
param_name starting with "profile." specifies a
profiling attribute. Example:
-p profile.data-output-format html or more simply
-p profile.output-format html (the "data-"
attribute name prefix being implied). See Section 4.3. Conditional processing.
"load.page_loader_name."
parameters
A param_name starting with
"load.page_loader_name." specifies an option which
is passed to the alternate page loader called page_loader_name.
For example, -p load.markdown.autolink true turns on the
autolink extension in the Markdown loader. See Supported Markdown extensions.
"proc."
parameters
A param_name starting with
"proc." specifies a low-level option
which is passed to the first pass of ebookc. This first
pass, called the Processor, compiles the input ebook specification to multi-page
XHTML5 with a frameset and a “TOC frame”[24], see Chapter 8. How it works. Example:
-p proc.resourcedirname resources.
Setting these
low-level options “by hand” is almost never needed, it's best not to
fiddle with these.
List of values separated by whitespace. Allowed values
are: 'part', 'chapter',
'section', 'figure',
'table', 'heading', 'all'
(equivalent to 'chapter section figure
table'; 'part' and 'heading'not included in this list).
Default
value: '' (do not add copiable links).
Adds a copiable
link to the heading or caption child elements of specified
“formal elements”.
If the “formal element” is numbered (e.g. has a
"Chapter 1." automatically generated label), then the
automatically generated label is converted to a link. This
link points to the formal element (e.g. a link to the
section having a chapter role).
Otherwise (e.g. a table which is not
numbered), a link containing the section symbol, "§",
is added to the heading or caption. This link points to the
formal element (e.g. a link to the table).
This automatically generated link to a formal element
is intended to be copied using the "Copy Link" entry
found in the contextual menu of all web browsers in order to be
shared with others. For example, send this link by
email.
For this facility to work, the formal element must have
an id attribute, whether specified by the
author or automatically generated by
ebookc.
It does not make sense to use this parameter when
generating EPUB or any XSL-FO based output format (PDF, RTF,
etc). Use it only when generating HTML or Web Help.
debug
true |
false
Default:
false.
Print low-level debugging info.
externalresourcebase
Absolute or relative URI ending with
'/'.
Default: '' (no
base).
Specifies an absolute or relative URI to be prepended to
external resources having a relative URI.
framesetfilename
File basename without any extension.
Default:
"_frameset".
Specifies the name of the frameset file
generated by first pass.
htmlcharset
A valid charset.
Default:
"UTF-8".
Specifies which charset to use for the generated HTML
files.
htmlextension
File extension (without a leading period).
Default:
"html".
Specifies which file extension to use for the generated
HTML files.
ignoreresources
true |
false
Default:
false.
If set to true, do not process resources.
That is, treat all resources as if they were external
resources.
indexfilename
File basename without any extension.
Default:
none.
Specifies that the index is to be generated in a separate
HTML file. This option specifies the name of this separate
file.
Setting this option generally also requires setting
suppressindex to true.
Ignored
unless the ebook as specified by the user actually contains an
<index/> descendant.
pagenavigation
none | header |
footer | both
Default:
none.
Specifies whether page navigation headers and/or footers
are to be added to the output HTML pages.
The page
navigation headers and footers are styled using CSS stylesheet
pageNavigation.css found in
ebookc_install_dir/xsl/common/resources/.
reservedfilenames
One or more file basenames (without any extension)
separated by newline characters.
Default: none.
Do not generate HTML files having any of the specified
names.
resourcedirname
File basename without any extension.
Default:
"_res".
Specifies the name of the directory where all the
resources (e.g. image files, CSS files) referenced in the output
HTML pages are stored.
resourcedirnamefor
URL or file path.
Same as resourcedirname except that the name
of the resource directory is computed out of the option value.
For example, sets the name of the resource directory to
"my doc_files" when passed
"file:/tmp/my%20doc.epub" or
"C:\temp\my doc.epub".
singlepage
true |
false
Default:
false.
Generate a single HTML page.
suppressindex
true |
false
Default:
false.
Suppress <index/> from the ebook
specification before generating the output HTML
pages.
Setting suppressindex to
true is generally needed when
indexfilename is also specified.
suppresstoc
true | false
Default:
false.
Suppress <toc/> from the ebook specification before
generating the output HTML pages.
tocframefilename
File basename without any extension.
Default:
"_toc_frame".
Specifies the name of the “TOC frame” file generated by
first pass.
validate
true |
false
Default: true
when invoked by the ebookc command-line utility,
false otherwise.
Validate the ebook specification against the W3C
XML schema found in
ebookc_install_dir/schema/ebook.xsd.
-tXSLT_stylesheet_URL_or_file
-xsltXSLT_stylesheet_URL_or_file
Use the specified custom XSLT stylesheet rather than the stock
one.
No matter the
order of command-line options, option -foconverter has
priority over options -fop, -xep,
-ahf, -xfc, which is turn have priority over
bundled XSL-FO converters (such as the Apache
FOP contained in ebookc-N_N_N-plus-fop.zip
distributions).
-fopconfconfiguration_file
Specifies the location of an Apache FOP configuration file. A relative file path is relative to the current working
directory. Ignored unless option
-fop is also specified.
-xepexecutable_file
Specifies the location of the xep shell script
(xep.bat on Windows).
XMLmind
XSL-FO Converter Evaluation Edition (download page) generates output containing
random duplicate letters. This makes this edition useless for
any purpose other than evaluating XMLmind XSL-FO Converter. Of course,
this does not happen with XMLmind XSL-FO Converter Professional
Edition!
-foconverterprocessor_nametarget_formatcommand
Register specified XSL-FO
converter with ebookc, a lower-level alternative to using
-xep, -fop, -ahf or
-xfc. Example:
No matter the
order of command-line options, option -foconverter has
priority over options -fop, -xep,
-ahf, -xfc, which is turn have priority over
bundled XSL-FO converters (such as the Apache
FOP contained in ebookc-N_N_N-plus-fop.zip
distributions).
It is also possible to specify command-line options in the
ebookc.options options file. The content of
this plain text file, encoded in the native encoding of the platform (e.g.
Windows-1252 on a Western Windows PC), is automatically loaded
by ebookc each time this command is executed. The content of
this file, command-line options separated by whitespace, is prepended
to the options specified in the command-line.
Relative filenames found in this file are relative to the current
working directory, and not to the ebookc.options options
file. Therefore it is recommended to always specify absolute
filenames.
No comments (e.g. lines starting with '#') are allowed
in ebookc.options. Options must be separated by
whitespace.
In the above example, FOP is declared after XEP. This implies
that it is FOP and not XEP, which will be used by ebookc to generate PDF
and PostScript®.
An XSL-FO processor tend to consume a lot
of memory. If the ebook compilation fails with an out-of-memory error,
you need to edit the xep (xep.bat), fop
(fop.bat), fo2xxx
(fo2xxx.bat) scripts in order to increase the
maximum amount of memory that the Java™ runtime may
allocate. This is done by using the -Xmx option of the
Java™ command-line. Example:
"java ... -Xmx512m ...".
Starting from Java™ 1.6.0_23,
converting XML documents to PDF using RenderX XEP randomly fails with
false XSL-FO errors (e.g. attribute "space-before" may not be
empty). This problem seems specific to the 64-bit runtime.
The
workarounds for the above bug ("renderx #22766") are:
Use a 32-bit Java™ runtime.
OR Use a 64-bit Java™ runtime older than
1.6.0_23.
OR Specify option -valid in the xep
command-line. Note that this workaround is automatically used when
you specify which RenderX XEP executable to use by the means of the
-xep command-line
option.
10.1. Parameters of the XSLT stylesheets used to convert an ebook
specification to EPUB
Parameter
Value
Default Value
Description
cover-image
URI. If the URI is relative, it is relative to the current
working directory of the user.
None.
Specifies an image file which is to be used as the cover page of
the EPUB file. This image must be a PNG or JPEG image. Its size must
not exceed 1000x1000 pixels.
In theory, EPUB 3 also accepts SVG
1.1 cover images.
epub-identifier
String
Dynamically generated UUID URN.
A globally unique identifier for the generated EPUB document
(typically the permanent URL of the EPUB document).
epub2-compatible
'no' | 'yes'
'yes'
By default, the EPUB 3 files generated by ebookc
are made compatible with EPUB 2 readers. Specify 'no'
if you don't need this compatibility.
omit-toc-root
'no' | 'yes'
'yes'
Specify 'yes' if you want the title of the book to
be the root of the EPUB TOC.
10.2. Parameters of the XSLT stylesheets used to convert an ebook
specification to Web Help
Parameters starting with
"wh-" are pseudo-parameters. They may or may not be
passed to the XSLT stylesheets, but the important thing to remember is that
they are also interpreted by ebookc itself. Consequently you
cannot specify them in an XSLT stylesheet which customizes the stock
ones.
Parameter
Value
Default Value
Description
omit-toc-root
'no' | 'yes'
'yes'
Specify 'yes' if you want the title of the book to
be the root of the Web Help TOC.
wh---CSS_VAR_NAME
String. A valid CSS property value.
No default.
This kind of parameter may be used to override any of the
default values of the CSS variables specified in any of the
NNtheme.css template files (all
found in
ebookc_install_dir/whc_template/_wh/).
For example, the main NNtheme.css template file:
body {
...
--navigation-width: 33%;
...
}
The wh---navigation-width CSS variable is used as
follows in NNcommon.css, another CSS template
file:
Therefore parameter wh---navigation-width may be used
to give the navigation side of the generated Web Help a different
initial width. Example:
-p wh---navigation-width "25%".
Specifies whether the Web Help TOC should be initially
collapsed.
wh-favicon
Filename or absolute URI of an image file. A relative filename
is relative to the current working directory.
None.
Specifies the favicon
which is to be added to each page of the Web Help.
This file is
copied to output_directory/_wh/user/.
wh-index-numbers
'no' | 'yes'
'no'
Specifies whether words looking like numbers are to be
indexed.
Examples of such number-like words: 3.14,
3,14, 3times4equals12, +1,
-1.0, 3px, 1,2cm,
100%, 1.0E+6,
1,000.00$.
wh-inherit-font-and-colors
'no' | 'yes'
'yes'
When wh-inherit-font-and-colors is set to
'no', the navigation pane of the generated Web Help
uses fonts and colors of its own, which will generally differ from
those used for the content of the Web Help.
Setting
wh-inherit-font-and-colors to 'yes' lets
you use for the navigation pane the same fonts and colors as those
used for the content of the Web Help.
Specifying
an "https:" URL is recommended when the generated Web
Help is stored on an HTTPS server.
wh-local-jquery
'no' | 'yes'
'no'
Specifies whether all jQuery files should be copied to
_wh/jquery/, where _wh/ is the directory
containing the other Web Help files.
By default, the jQuery files
are accessed from the Web (typically from a
CDN).
Note that this parameter is applied after
jQuery has been possibly customized using parameter
wh-jquery. For example,
"-p wh-jquery https://code.jquery.com/jquery-3.7.1.min.js"
copies a file downloaded from https://code.jquery.com/ to
_wh/jquery/.
wh-layout
The name of a layout.
'classic'
Selects a layout for the generated Web Help.
For now, only 3
layouts are supported: 'classic', 'simple'
and 'corporate'.
wh-responsive-ui
'no' | 'yes'
'yes'
Specifies whether the generated Web Help should be
“responsive”, that is, whether it should adapt its layout to the
size of the screen.
wh-ui-language
"browser" or "document" or a language
code conforming RFC 3066. Examples: de,
fr-CA.
'browser'
Specifies which language should be used for the messages (tab
labels, button tool tips, etc) of the generated Web
Help.
Default value "browser" means that this
language is the one used by the Web browser for its own messages.
This language may often be specified in the user preferences of the
Web browser.
Value "document" means that the
language of the document should be used.
A language code such
as en, en-US, es, es-AR, etc,
may be used to explicitly specify which language should be
used.
wh-use-stemming
'no' | 'yes'
'yes'
Specifies whether stemming
should be used to implement the search facility.
By default,
stemming is used whenever possible, that is,
when the main language of the XHTML pages to be compiled can
be determined;
when this main language is one of: Danish, Dutch, English,
Finnish, French, German, Hungarian, Italian, Norwegian,
Portuguese, Russian, Spanish, Swedish, Romanian, Turkish.
The main language of the document is specified by the
@xml:lang attribute found on the root element of the
ebook specification being compiled; otherwise, it is assumed to be
"en".
wh-user-css
Filename or absolute URI of a CSS file. A relative filename is
relative to the current working directory.
None.
Specifies the user's CSS stylesheet which is to be added to an
XHTML page decorated by the compiler.
This file is copied to
output_directory/_wh/user/.
Filename or absolute URI of an XHTML file. A relative filename
is relative to the current working directory.
None.
Specifies the user's header which is to be added to each page of
the Web Help.
The content of the body element of
user-header is inserted as is in the
<div id="wh-header"> found in each page of the
Web Help.
If a user's header references resources (e.g. image
files), then these resources must either be referenced using
absolute URLs or these resources must be found in a user's resource
directory and parameter user-resources must be
specified.
Example:
The user's resource directory is called
header_footer_files/ and contains
header_footer_files/logo200x100.png.
ebookc is passed parameters
-p user-resources PATH_TO/header_footer_files
and -p user-header PATH_TO/header.html.
Filename or absolute "file:" URI of a
directory. URI schemes other than "file" (e.g.
"http") are not supported for this parameter. A
relative filename is relative to the current working directory.
None.
Specifies a user's resource directory which is to be recursively
copied to output_directory/_wh/user/.
This
directory typically contains image files referenced by the user's
header, footer or CSS stylesheet.
Such system parameters are not
intended to be specified by the end-user. Such system parameters are
documented here only because the end-user may see them referenced in some
dialog boxes, in some configuration files or in the source code of the XSLT
stylesheets.
Parameter
Value
Default Value
Description
whc-index-basename
URL basename
'__tmp__index.whc_ndx'
Basename of the Index XML input file of XMLmind Web Help
Compiler.
whc-toc-basename
URL basename
'__tmp__toc.whc_toc'
Basename of the TOC XML input file of XMLmind Web Help
Compiler.
10.3. Parameters of the XSLT stylesheets used to convert an ebook
specification to XSL-FO
Parameter
Value
Default Value
Description
apply-css-styles
'no' | 'yes'
'yes'
Specifies whether CSS styles specified in XHTML
style attributes, style and
link elements also apply to the XSL-FO file.
Depending on the context, the following CSS properties are
converted to their equivalent XSL-FO attributes. The corresponding
shorthand CSS properties are supported too. Any other CSS
property is ignored. Generated content (:before,
:after) is ignored too.
Note that styles specified this way supersede all the
other ways to specify the presentation in the output file, that
is, parameters like base-font-size or the
presentation attributes (xsl:attribute-set) specified
in the XSLT stylesheets that generate the XSL-FO file.
base-font-size
Length in pt
'10pt'
The size of the font used for most body elements (paragraphs,
tables, lists, etc). All the other font sizes are computed
relatively to this font size.
base-line-height
A valid line height
'10'
The line height used for most body elements (paragraphs, tables,
lists, etc). All the line heights are computed relatively to this
line height.
external-href-after
String
']'
Appended after the external URL referenced by an a
element. Ignored unless show-external-links='yes'.
external-href-before
String
' ['
Separates the text of an a element from the
external URL it points to. Ignored unless show-external-links='yes'.
Specifies whether text (e.g. in paragraphs) should be justified
(that is, flush left and right) or just left aligned (that is, flush
left and ragged right).
index-column-count
Positive integer.
'2'
The number of columns of index pages.
index-column-gap
Length.
'2em'
The distance which separates columns in index pages.
A convenient way to specify the size of the printed page.I
t
is also possible to specify a custom paper type by ignoring the
paper-type parameter and directly specifying the page-width and page-height parameters.
pdf-outline
'no' | 'yes'
'no'
Specifies whether PDF bookmarks should be generated.
Supported by the 'XEP', 'FOP' and
'AHF' XSL-FO processors. Not relevant, and thus ignored
by 'XFC'.
show-external-links
'no' | 'yes'
'no'
Specifies whether the external URL referenced by an
a element should be displayed right after the text
contained by this element.
Example:
show-external-links='yes' causes <a
href="http://www.oasis-open.org/">Oasis</a> to be
rendered as follows:
Oasis [http://www.oasis-open.org/].
show-map-links
'no' | 'yes'
'yes'
Specifies whether a numbered list should be generated for a
XHTML map element, with one list item per
area element.
A list item contains the link
specified by the area element. No list items are
generated for “dead areas” (area elements specifying no
link at all).
show-xref-page
'no' | 'yes'
'no'
Specifies whether the page number corresponding to the internal
link target referenced by an a element should be
displayed right after the text contained by this element.
Example: show-xref-page='yes' causes <a
href="#introduction">Introduction</a> to be rendered
as follows: Introduction [3].
two-sided
'no' | 'yes'
'no'
Specifies whether the document should be printed double
sided.
ul-li-bullets
One or more bullet characters separated by spaces
'• –'
Specify which bullet character to use for an
ul/li element. Additional characters are
used for nested li elements.
For example, if
ul-li-bullets="* - +", "*" will be used
for ul/li elements, "-" will
be used for ul/li elements contained in a
ul/li element and "+" will be
used for ul/li elements nested in two
ul/li elements.
use-note-icon
'no' | 'yes'
'no'
Specifies whether an icon should be added to
blockquote elements having a class
attribute containing role-note,
role-attention, role-caution,
role-danger, role-fastpath,
role-important, role-notice,
role-remember, role-restriction,
role-tip, role-trouble,
role-warning.
use-note-label
'no' | 'yes'
'no'
Specifies whether a title should be added to
blockquote elements having a class
attribute containing role-note,
role-attention, role-caution,
role-danger, role-fastpath,
role-important, role-notice,
role-remember, role-restriction,
role-tip, role-trouble,
role-warning.
watermark
Allowed values are one or more of 'blank',
'title', 'toc', 'booklist',
'frontmatter', 'body',
'backmatter', 'index', 'all'
separated by whitespace.
'all'
Specifies which pages in the output document are to be given a
watermark.
By default, all pages are given a watermark. If for
example, parameter watermark is set to
'frontmatter body backmatter', then only the pages
which are part of the front matter, body and back matter of the
output document are given a watermark. The title page, TOC pages,
etc, are not given a watermark.
URI. If the URI is relative, it is relative to the current
working directory of the user.
No default.
Specifies an image file which is to be used as a watermark in
all the pages comprising the output document. See also parameter watermark.
xfc-render-as-table
A string containing zero or more roles or element names
separated by whitespace.
Supported roles and element names are:
admonition, aside,
blockquote, footer, header,
nav.
'admonition aside blockquote'
Specifies whether XMLmind XSL-FO Converter should render the
fo:blocks representing specified elements as
fo:tables.
This parameter enables a workaround for a
limitation of XMLmind XSL-FO Converter: a fo:block
having a border and/or background color and containing several other
blocks, lists or tables is very poorly rendered in RTF, WML, DOCX
and ODT.
Inserting a
<?pagebreak?> processing-instruction in the XHTML5 source
between paragraphs, notes, tables, lists, etc, may be used to force a page
break when generating any of the output formats which uses XSL-FO as an
intermediate format (PDF, RTF, DOCX, etc).
Such system parameters are not
intended to be specified by the end-user. Such system parameters are
documented here only because the end-user may see them referenced in some
dialog boxes, in some configuration files or in the source code of the XSLT
stylesheets.
Parameter
Value
Default Value
Description
foProcessor
'FOP' | 'XEP' | 'XFC'
|'AHF'
Automatically set by the application hosting XMLmind Ebook
Compiler
The name of the XSL-FO processor used to convert the XSL-FO file
generated by the XSLT stylesheets to the target output format.
img-src-path
URI ending with '/'
''
If this parameter is not empty and if the value of the
src attribute is a relative URI, then this parameter is
prepended to the value of the src attribute.
This
low-level alternative to resolve-img-src='yes' also
allows the generation of an XSL-FO file where all the references to
graphic files are absolute URIs.
Automatically set by the application hosting XMLmind Ebook
Compiler
The file extension of the target output file.
resolve-a-href
'no' | 'yes'
'no'
In the XSL-FO file, convert relative URIs contained in the
href attribute of a elements to absolute
URIs. This is done by resolving the relative URI against the base of
the a element.
resolve-img-src
'no' | 'yes'
'no'
In the XSL-FO file, convert relative URIs contained in the
src attribute of img elements to absolute
URIs. This is done by resolving the relative URI against the base of
the img element.
screen-resolution
Number
96.0
Screen resolution in DPI. Used to convert px to
pt.
xsl-resources-directory
URL
'resources/' resolved against the directory which
contains the XSLT stylesheets.
These XSLT stylesheets generate files which reference resources
such as note icons. This parameter specifies the directory
containing such resources.
The width of a column is specified as an integer which is
larger than or equal to 1. This value is the proportional width of the
column. For example, if the left column has a width equal to 2 and the right
column has a width equal to 4, this simply means that the right column is
twice (4/2 = 2) as wide as the left column.
When header-left, header-center,
header-right are all specified as the empty string, no header
is generated. When footer-left, footer-center,
footer-right are all specified as the empty string, no footer
is generated.
The content of a column is basically a mix of text and
variables. Example: "Page {{page-number}} of
{{page-count}}".
Supported variables are:
{{document-title}}
The title of the document.
{{document-date}}
The publication date of the document.
The value of this variable
comes from the meta element having any of the following
name attributes: "dc.date",
"dcterms.issued", "dcterms.modified",
"dcterms.created", if found in the head
element of the ebook specification. If the ebook specification does not
contain such meta elements then the current date is
used.
The value of the content attribute of the
meta element is expected be something like
YYYY-MM-DD, because it is parsed and then formatted according to
the xml:lang of the ebook specification. For example, if
content="2017-02-23", with xml:lang="en", it
gives: "February 02, 2017" and with xml:lang="fr", it
gives: "02 février 2017".
{{chapter-title}}
The title of the current part, chapter, appendices or appendix.
{{section1-title}}
The title of the current part, chapter, appendices or appendix
or section 1.
{{division-title}}
The title of the current document divisions. All the document
divisions are guaranteed to have a corresponding
{{division-title}}. Even automatically generated divisions
such as <toc/> or <index/> have a
corresponding {{division-title}}.
{{page-number}}
Current page number within the current document part (front matter,
body matter or back matter) .
{{page-count}}
Total number of pages of the current document part (front matter,
body matter or back matter).
{{break}}
A line break.
{{image(URI)}}
An image having specified URI. A relative URI is resolved against
the current working directory. Example:
"{{image(artwork/logo.svg)}}".
{{page-sequence}}
Not for production use. Inserts in the header/footer the name of the
current page sequence . Lets the author learn which name to use in
a conditional header or footer. See below.
The default value of header-center is
'{{document-title}}'. This means that each page of the
generated PDF, RTF, etc, file will have the document title centered on its
top. But what if you want the pages containing the Table of Contents have a
"Contents" header? Is there a way to specify: use "Contents" for the pages
containing the Table of Contents and use the title of the document for any
other page?
This is done by specifying the following conditional value
for parameter
header-center: 'toc:: Contents;;
{{document-title}}'.
A conditional value may contain one or
more cases separated by ";;". Each case is tested against the
page being generated. The first case which matches the page being generated
is the one which is selected.
conditional_value --> case [ ";;" case ]*
case --> [ condition "::" ]* value
condition --> [ test_page_sequence ]?
& [ S test_page_layout ]?
& [ S test_page_side ]?
Let's suppose you also want
the the pages containing the Index have a "Index" header. Specifying
'toc:: Contents;; {{document-title}};; index:: Index' won't
work as expected because the second case (having no condition at all)
matches any page, including the Index pages. You need to specify:
'toc:: Contents;; index:: Index;;
{{document-title}}'.
Let's remember that variable {{division-title}} is
substituted with the title of the current document division, including
automatically generated document divisions such as toc and
index.
Therefore our conditional value is better
expressed as: 'toc:: index:: {{division-title}};;
{{document-title}}'. Notice how a case may have several conditions.
Suffice for any of these conditions to match the page being generated for
the case to be selected.
Even better, specify 'toc||index::
{{division-title}};; {{document-title}}'. String "||"
may be used to separate alternative values to be tested against the page
being generated.
It's not
difficult to guess that the name of the page sequence corresponding to the
Table of Contents is toc and that the name of the page sequence
corresponding to the Index is index. However the simplest way
to learn what is the name of the page sequence being generated is to
reference variable {{page-sequence}} in the
specification of a header or a footer.
Now let's suppose
that we want to suppress the document title on the first page of a part,
chapter or appendix. This is specified as follows: 'first body:: ;;
toc||index:: {{division-title}};; {{document-title}}'.
For now,
we have only described a condition about the page sequence being generated:
TOC, Index, etc. In fact, a condition may test up to 3 facets of the page
being generated:
The page sequence to which belongs the page being generated.
Whether the page being generated is part of a one-sided or a
two-sided document.
Whether the page being generated is the first page of its sequence,
has an odd page number or has an even page number.
When the document has one side, the only possible
page side is odd. The other values, first and
even, are not supported. For example, something like
'one-side body even:: {{chapter-title}};;' cannot generate any
text.
The order of the tests is not significant. For
example, 'first part||chapter||appendix' is equivalent to
'part||chapter||appendix first'.
Therefore 'first
body:: ;; toc||index:: {{division-title}};; {{document-title}}' reads
as follows:
Use the empty string for the first page of a part, appendices,
chapter or appendix.
Use the document division title for the pages containing the Table
of Contents. This title is "Table of Contents", but localized according
to the main language of the book.
Use the document division title title for the pages containing the
Index. This title is "Index", but localized according to the main
language of the book.
For any other page, use the title of the book.
Everything explained in this section
applies not only to the contents of a column of a header or footer, but also
to the proportional width of a column of a header or footer. Example:
-pfooter-right-width"first||odd:: 4;;
even:: 1".
Appendices
Appendix A. Embedding
com.xmlmind.ebook.convert.Converter
Quick and easy embedding: embed
com.xmlmind.ebookc.convert.Converter, the Java™ class which is
used to implement the ebookc command-line utility.
Converter is the
object which is at the core of the ebookc command-line utility.
Its run method accepts the
same string arguments as the ebookc command-line
utility.
The full source code of the Embed1 sample is
found in Embed1.java.
Create the Converter.
1
2
3
4
5
6
7
8
9
StyleSheetCache cache = new StyleSheetCache();
Console console = new Console() {
publicvoid showMessage(String message, MessageType messageType) {
System.err.println(message);
}
};
Converter converter = new Converter(cache, console);
StyleSheetCache is a simple cache
for the ebookc XSLT 2.0 stylesheets. It is a
thread-safe object which is intended to be shared by several
Converters.
Unlike StyleSheetCache,
Converter is not thread-safe. Each thread must own its
Converter. However, the run method of a
Converter may be invoked several times.
Console is
a very simple interface. Implementing this interface allows to do
whatever you want with the messages reported by a
Converter.
Configure the Converter.
1
2
3
if (!converter.registerFOP(path("/opt/fop/fop"))) {
return1;
}
The run method returns 0 if
the conversion is successful and an integer greater than 0 otherwise.
When the conversion fails, errors messages are displayed on the
Console.
§ Environment required for running this kind of
embedding
Aside ".jar" files like
ebookc.jar, xmlresolver.jar, saxon12.jar,
etc, which are all listed in
ebookc_install_dir/doc/manual/embed/build.xml (see
below), this kind of embedding also needs to access:
The W3C XML schemas, schematrons and XML catalogs normally found in
ebookc_install_dir/schema/.
The XSL stylesheets normally found in
ebookc_install_dir/xsl/.
Therefore the requirements for running this kind of embedding
are:
This
rewriteURI entry is needed to find the
ebook.xsd schema and the location of the directory
containing the XSL stylesheets. Make sure that this entry exists in your
XML catalogs and that it points to the actual location of the directory
containing both the schema/ and xsl/
subdirectories.
Compiling and executing the Embed1
sample
Compile the Embed1 sample by running ant in
ebookc_install_dir/doc/manual/html/embed/.
Execute
the Embed1 sample by running "ant embed1" in
ebookc_install_dir/doc/manual/html/embed/. This will
convert ebookc_install_dir/docsrc/manual/manual.ebook to
ebookc_install_dir/doc/manual/html/embed/manual.pdf,
using Apache
FOP.
Note that Embed1.java contains “hardwired
filenames”, like "/opt/fop/fop". This means that, without
modifications, this sample cannot be run from elsewhere than
ebookc_install_dir/doc/manual/html/embed/ and that
you'll almost certainly need to modify the source code in order to specify
the actual location of the fop (fop.bat)
script.
Index
A
adjustuserheadings, attribute of element book, 1, 2
[1] Here “ebook” shall be understood
in the widest possible sense.
[2] Requires an
XSL-FO processor like Apache FOP, RenderX
XEP, Antenna House Formatter to be installed and registered
with XMLmind Ebook Compiler (for example, using option
-foconverter). We'll assume in this manual that you have
downloaded and installed the distribution of XMLmind Ebook Compiler which
includes Apache FOP.
[4] Preferably valid XHTML5,
because ebookc anyway generates XHTML5 markup. “Plain HTML”
cannot be parsed by ebookc.
[5] Requires an
XSL-FO processor like Apache FOP, RenderX
XEP, Antenna House Formatter to be installed and registered
with XMLmind Ebook Compiler (for example, using option
-foconverter). We'll assume in this manual that you have
downloaded and installed the distribution of XMLmind Ebook Compiler which
includes Apache FOP.
[10] A standard, intermediate page-layout format which
is then used by XSL-FO processors like Apache
FOP or XMLmind XSL-FO Converter to generate PDF, RTF, DOCX,
etc.
[11] The C++ Programming Language by Bjarne Stroustrup.
[12] The C Programming Language by Brian Kernighan and Dennis Ritchie.
Originally published in 1978.
[13] The only types supported for PARAMETER_VALUE are: string, boolean (true or false), integer and any enumerated type.
[14] By default, heading IDs are not “rendered” in HTML, which is somewhat counterintuitive.
[16] Creating xi:include elements
by hand is tedious and error prone. It's strongly recommended to use an
XInclude-enabled editor like XMLmind XML Editor to do that. With XMLmind XML Editor, creating an
xi:include element is as easy as copying a reference
to an element (Ctrl+Shift-C) from one page and then pasting it
(Ctrl-V) into another page.
[17] Note that the validity of the source HTML pages is
currently not checked by ebookc. It's only the validity of the
ebook specification which is checked against W3C XML Schema
ebookc_install_dir/schema/ebook.xsd.
[19] Written with kanji
ko, meaning "child". The syllable ko is not generally found at
the end of masculine names.
[20] Written with
kanji ko, meaning "child". The syllable ko is not generally
found at the end of masculine names.
[21] The prefix of a value of the
class attribute may be specified by ending the prefix with
a '*'. Example: "role-*" matches any class
value starting with "role-".