DITA XML and XSL-FO tutorials

Check my latest blog post!

The following tutorials will help the technical writer set up and use a free DITA XML authoring and publishing chain.

DITA XML is a structured authoring language that allows you to create documents without worrying about their final appearance on different media. XSL-FO is a language for reorganizing and filtering XML content and applying a style sheet layout to it.

A set of DITA XML files contains all the content related, for example, to a product. Various XSL-FO style sheets are used to publish this content in PDF, HTML, or other formats, applying complex transformations. The summary of each section of the final document may, for example, appear in the HTML version but not in the PDF version.

Similarly, if a product is to be supplied as a white label to different customers, a completely different layout can be applied to its documentation simply by specifying a different set of style sheets when generating the deliverable. In practice, this is not possible with traditional FrameMaker solutions.

XSL-FO: Filter content according to “except” and “or” conditions

Suppose you want to filter the child nodes of the DITA XML tag <example> and display all its content except the title (located between the <title> tags).

You can use the following syntax:

<xsl:template match="*[contains(@class,' topic/example ')]">
  <fo:block>
    <xsl:apply-templates select="*[not(name()='title')]" />
  </fo:block>
</xsl:template>

This command selects all child nodes of the <example> node, excluding the <title> node. However, the <example> node accepts text entered directly, without being encapsulated in tags. Therefore, this command will not display this content.

Suppose the source code of one of your DITA XML files is as follows:

<example>
  <title>
    XSL-FO
  </title>
  Here is my XPATH path example:
  <codeblock>
    ancestor-or-self
  </codeblock>
  Unencapsulated text located after a child node.
</example>

The PDF file will display the example structured as follows:

ancestor-or-self

The title of the example is not displayed, which is the desired result, but content not encapsulated in tags is not displayed, which is an undesirable side effect. To select this content, text nodes must be selected using the text() syntax. You may then be tempted to use the following syntax:

<xsl:template match="*[contains(@class,' topic/example ')]">
  <fo:block>
    <xsl:apply-templates select="text()" />
    <xsl:apply-templates select="*[not(name()='title')]" />
  </fo:block>
</xsl:template>

However, all text elements not encapsulated in child tags of the <example> tag will be placed at the beginning of the example, before the encapsulated elements, even if they are placed afterward in the DITA XML source file.

The PDF file will display the example structured as follows:

Here’s my XPATH path example: Unencapsulated text after a child node.
ancestor-or-self

You must then use the pipe syntax (Boolean or condition) to modify the XPATH path as follows:

<xsl:apply-templates select="text()|*[not(name()='title')]" />

The final result will be:

<xsl:template match="*[contains(@class,' topic/example ')]">
  <fo:block>
    <xsl:apply-templates select="text()|*[not(name()='title')]" />
  </fo:block>
</xsl:template>

The PDF file will display the example structured as follows:

Here is my XPATH path example:
ancestor-or-self
Unencapsulated text after a child node.

XSL-FO: Automatically insert a title for examples

By default, DITA-OT does not automatically insert the text Example: before the title of an example contained between DITA XML tags <example>. However, XSL-FO syntax offers this possibility.

Suppose the source code of one of your DITA XML files is as follows:

<example>
  <title>
    XSL-FO
  </title>
  Here is my XPATH path example:
  <codeblock>
    ancestor-or-self
  </codeblock>
</example>

You want the generated PDF file to display the example structured as follows:

Example: XSL-FO

Here’s my XPATH path example:
ancestor-or-self

and if the example does not contain a title, it should be structured as follows:

Example:

Here’s my XPATH path example:
ancestor-or-self

By default, however, this content will be structured in the PDF by DITA-OT as:

XSL-FO

Here is my XPATH path example:
ancestor-or-self

It’s possible to enter text between <example> tags, but XSL-FO offers a more elegant and structured method.

Automatically insert a text variable before the title of examples

Replace in the plugins/org.dita.pdf2/xsl/fo/commons.xsl stylesheet (under DITA-OT 1.7.) the following template:

<xsl:template match="*[contains(@class,' topic/example')]/*
[contains(@class,' topic/title ')]>
  <fo:block xsl:use-attribute-sets="example.title>
    <xsl:call-template name="commonattributes"/>
    <xsl:apply-templates/>
  </fo:block>
</xsl:template>

with the following code:

<xsl:template match="*[contains(@class,' topic/example ')]>
  <fo:block xsl:use-attribute-sets="example.title>
    <xsl:call-template name="insertVariable>
    <xsl:with-param name="theVariableID"
    select="'my-example-text'"/>
    </xsl:call-template>
    <xsl:apply-templates select="title"/>
  </fo:block>
  <fo:block>
  <xsl:apply-templates
  select="*[not(contains(@class, ' topic/title'))]
    |text()|processing-instruction()"/>
  </fo:block>
</xsl:template>

Define in the files containing the language variables, such as plugins/org.dita.pdf2/cfg/common/vars/en.xml, the text variables to be inserted automatically, for example:
```
<variable id="my-example-text>Example:</variable>
```

To achieve consistent behavior, you should disable this treatment for examples of specific topics types (task, in particular).

Generate a PDF with DITA Open Toolkit under GNU/Linux

This DITA XML tutorial is designed to guide you through setting up and using the DITA-OT publishing chain in a GNU/Linux environment (Ubuntu or Debian).

Prerequisites

Ubuntu or Debian on a physical or virtual machine with administrator access
Internet connection

Download and unzip the archive DITA-OT:

$ export REPO="https://github.com/dita-ot/dita-ot"
$ wget $REPO/releases/download/2.1/dita-ot-2.1.0.tar.gz
$ tar -xzvf dita-ot-2.1.0.tar.gz

Generate your first PDF:

cd dita-ot-2.1.0
$ dita -f pdf -i samples/taskbook.ditamap

Congratulations, you’ve compiled your first DITA XML project! The PDF file generated is out/taskbook.pdf. You can now compile other projects by skipping steps 1 and 2.

Generate a PDF with DITA Open Toolkit (Windows)

This DITA XML tutorial is designed to guide you through setting up and using the DITA-OT publishing chain in a Windows environment (tested on Windows XP).

Prerequisites

Internet connection

Download Java, then run the installation program.
Download DITA Open Toolkit 1.5.4 to your desktop, then unzip DITA-OT1.5.4_full_easy_install_bin.zip.
Select Run from the Start menu, paste the following command, then press Enter:
Terminal window
```
cmd
```
A terminal appears.

Paste the following command into the terminal:

set full=DITA-OT1.5.4_full_easy_install_bin
cd Desktop\%full%\DITA-OT1.5.4

Paste the following command:
Terminal window
```
startcmd.bat
```
A new terminal appears.
Paste the following command into the new terminal:
Terminal window
```
$ java -jar lib/dost.jar /i:samples/taskbook.ditamap\
/outdir:. /transtype:pdf2
```
This command generates a PDF file from a sample DITA XML project.

Congratulations, you’ve compiled your first DITA XML project! You’ll find the target file taskbook.pdf in the Desktop\%full%\DITA-OT1.5.4 directory. You can now compile other projects by skipping steps 1 and 2.

Manage multilingual DITA XML documentation projects

DITA XML is a great format for managing documentation projects. For multilingual projects, however, the technical writer must create a ditamap file, which contains the table-of-contents structure of the documents, by version. This creates the risk of errors and inconsistencies. Fortunately, an appropriate methodology and an automation script for the DITA-OT publishing chain remedy this problem.

Methodology for multilingual DITA XML documentation projects

The ditamap file should not include a navtitle section, which contains a spelled-out title instead of extracting the title from the corresponding DITA XML section, and is therefore language-specific.
From the outset of your DITA XML project, place DITA XML content files in a subdirectory specific to the language in which they are initially written.

For example:
- product
  - en_US
    - images
    - tasks
    - topics
  and not:
- product
  - images
  - tasks
  - topics
Replace all occurrences of the language-specific directory name in the ditamap file with a single temporary string.

For example, use the string @language-code@:
```
<topicref href="@language-code@/topics/managing-rights.dita"/>
```
and not:
```
<topicref href="en_US/topics/managing-rights.dita"/>
```
To generate the target files, you can now:

a. Modify the default.locale parameter in the demo/fo/build.xml file. b. Replace the language variable in the ditamap file with the name of the language directory. c. Change the language parameter xml:lang in the ditamap file and in DITA XML content files. d. For PDF target files, modify page dimensions (A4 or US letter, for example) according to language. e. Generate target files. f. Restore initial values in source files.

Fortunately, a simple Bash script (GNU/Linux) can automate all this.

Prerequisites

You have installed DITA-OT.
Your DITA XML project includes only one ditamap file.
Your DITA XML content files have the extension .dita.
The directory names of the language versions correspond to the language codes supported by Dita Open Toolkit (fr_FR or en_US, for example).
Your DITA XML content files are located in subdirectories of the language version directories (for example, in fr_FR/tasks/ and fr_FR/topics/).

Supported values for PDF page size are fr_FR (A4) and en_US (US letter). This script can of course be easily adapted or inspire a new script.

To use this script:

Download the multilingual DITA XML generation script into the directory containing the project ditamap file.
In a terminal, navigate to this directory and enter:
Terminal window
```
$ chmod +x dita2target.sh
```
In the terminal, enter:
Terminal window
```
$ mkdir out
```
to create the directory containing the target files.
Enter:
Terminal window
```
$ ./dita2target.sh <ditamap file> \
<language directory name> <target format>
```
to generate target files.

The target format argument accepts values managed by DITA-OT.

Example
Terminal window
```
./dita2target.sh firewall.ditamap en_US pdf2
```
The PDF file firewall.pdf is then generated in the out directory (hard-coded in the script).

Create different documents from the same DITA sources

DITA XML offers a conditional text mechanism. This mechanism promotes the reuse of source content and avoids redundant information. This tutorial will help the technical writer use this mechanism in just a few minutes.

Prerequisites

You have installed DITA-OT in the DITA-OT1.5.4 directory under GNU/Linux or Windows.

Paste the following code into a file and save it as conditional-text.dita in the DITA-OT1.5.4 directory:

<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE topic PUBLIC "-//OASIS//DTD DITA 1.2 Topic//EN"
"/usr/share/dita-ot/dtd/technicalContent/dtd/topic.dtd">
<topic id="example-topic" xml:lang="fr-fr">
  <title>Using Conditional Text</title>
  <body>
    <hazardstatement>
      <messagepanel audience="electricians">
        <typeofhazard>
          Danger for electricians
        </typeofhazard>
        <consequence>
          Risk of electrocution
        </consequence>
        <howtoavoid>
          Do not touch electrical wires.
        </howtoavoid>
      </messagepanel>
      <messagepanel audience="plumbers">
        <typeofhazard>
          Danger for plumbers
        </typeofhazard>
        <consequence>
          Risk of drowning
        </consequence>
        <howtoavoid>
          Do not dive into the pool.
        </howtoavoid>
      </messagepanel>
    </hazardstatement>
    <p>
     Any content placed between tags that does not include an
     <i>audience</i> value excluded in a file
     <i>.ditaval</i> is published in documents
     intended for plumbers and electricians.
  </p>
  </body>
</topic>

This code contains DITA XML tags containing different audience values: we’ll exclude the contents of one of these two tags when generating the target file using the audience key.

Paste the following code into a file and save it as conditional-text.ditamap in the DITA-OT1.5.4 directory:

<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE bookmap PUBLIC "-//OASIS//DTD DITA BookMap//EN"
"/usr/share/dita-ot/dtd/bookmap/dtd/bookmap.dtd">
<bookmap id="conditional-text">
  <booktitle>
    <mainbooktitle>
      Example of conditional text
    </mainbooktitle>
  </booktitle>
  <chapter href="conditional-text.dita"/>
</bookmap>

Paste the following code into a file and save it as electricians.ditaval in the DITA-OT1.5.4 directory:

<?xml version="1.0" encoding="UTF-8"?>
<val>
  <prop att="audience" val="electricians" action="include"/>
  <prop att="audience" val="plumbers" action="exclude"/>
</val>

Paste the following code into a file and save it as plumbers.ditaval in the DITA-OT1.5.4 directory:

<?xml version="1.0" encoding="UTF-8"?>
<val>
  <prop att="audience" val="electricians" action="exclude"/>
  <prop att="audience" val="plumbers" action="include"/>
</val>

Open a terminal and enter the following command in the DITA-OT1.5.4 directory:
Terminal window
```
$ java -jar lib/dost.jar /i:texte-conditionnel.ditamap /filter:electricians.ditaval /outdir:.
/filter:electricians.ditaval /outdir:. /transtype:pdf2
```
Open the file conditional-text.pdf; it contains information for:
- plumbers and electricians,
- electricians only.
Open a terminal and enter the following command in the DITA-OT1.5.4 directory:
Terminal window
```
java -jar lib/dost.jar /i:texte-conditionnel.ditamap /filter:plumbers.ditaval /outdir:.
/filter:plumbers.ditaval /outdir:. /transtype:pdf2
```
Open the file conditional-text.pdf; it contains information for:

plumbers and electricians, plumbers only.

DITA Open Toolkit: Display cross-references in PDFs

You have placed correctly formatted related-links tags in your DITA XML content files, or better still, a reltable in your ditamap table of contents structure (the reltable allows you to decontextualize your content and therefore reuse it better). You launch your PDF generation command and, unpleasantly, no Additional resources section appears in the target file! You then try to generate an HTML version of your content and there, your See also section is indeed present. Wouldn’t DITA-OT support cross-referencing in PDFs?

Fortunately, no. By default, cross-references are not generated in PDFs by DITA-OT. To display them, assign the value no to the disableRelatedLinks variable in the demo/fo/build_template.xml file. If you use ant, you’ll also need to pass the args.fo.include.rellinks=all parameter as follows:

ant -Dargs.input=samples/sequence.ditamap -Doutput.dir=out/ \
-Dtranstype=pdf2 -Dargs.fo.include.rellinks=all

Display an Index in PDF

Various challenges might arise when using DITA-OT, the DITA XML free publishing engine. You’ve meticulously inserted your index entries into your DITA XML content files. You generate a PDF output and the index does not appear. A compiler error message tells you that, unfortunately, FOP does not currently support index generation.

Faced with this situation, you have several solutions:

Wait until FOP supports indexes; without an availability date, this choice will be difficult to get approved.
Abandon DITA XML; it would be a shame to give up the remarkable productivity gains offered by this format.
Forgo displaying the index in the PDF; indexes are difficult to maintain and offer a questionable surplus of usability in a document that will only be consulted marginally in printed form.
Abandon DITA-OT and turn to a proprietary solution; non-open-source software, such as XMetal, often uses RenderX’s XEP publishing engine, which supports indexes perfectly.

The index problem is not an obstacle to the adoption of DITA XML. If your final medium is a printed document, solutions exist. If it’s an electronic format, the absence of an index is more than compensated for by the full-text search function.

Use the nXML IDE for DITA XML

The nXML mode offers real-time validation of XML DocBook, XHTML, or other documents. No need to know the XML schema by heart: your text editor will autocomplete XML tags according to context. It does not, however, support DITA XML by default. This tutorial will show you how to use this Emacs mode for DITA XML.

Prerequisites

Emacs
The directory structure of your DITA XML documentation project must be as follows:
- language directory
  - concepts
  - faq
  - reference
  - tasks
  - topics
where <language directory> has the value en_US, or fr_FR, etc.
Command-line instructions are designed for GNU/Linux; they must be adapted for use in other environments.

Make a backup of your entire DITA XML documentation project.

Open a terminal and paste the following command sequence:

$ export THAI="http://www.thaiopensource.com/download"
$ export RED="http://www.redaction-technique.org/media"
$ cd && \
wget $THAI/nxml-mode-20041004.tar.gz && \
tar xzvf nxml-mode-20041004.tar.gz && \
wget $RED/nxml-mode-environmment.txt && \
cp .emacs .emacs.bak && \
cat .emacs | sed '$a\' > .emacs.tmp && \
mv .emacs.tmp .emacs && \
cat nxml-mode-environmment.txt >> .emacs && \
rm nxml-mode-environmment.txt

This sequence of commands:

Downloads and unzips the nXML mode.
Creates a backup copy of the .emacs file (.emacs.bak).
Writes nXML mode environment variables to the .emacs file.

Download the RelaxNG schema archive for DITA XML into the root directory of your DITA XML documentation project.
Go to the root directory of your DITA XML documentation project, then paste the following command:
Terminal window
```
$ tar xzvf rnc.tar.gz
```
This command creates an rnc directory at the same level as the <language directory>.
Download the archive of schemas.xml files into the root directory of your DITA XML documentation project, then paste the command sequence below, replacing <language directory> with the appropriate value, such as en_US, or fr_FR. Repeat this step for all your language directories.
Terminal window
```
$ export DIR="schemas.redaction-technique.org"
$ tar xzvf $DIR.tar.gz && \
cd <language directory> && \
cp ../$DIR/concepts/schemas.xml concepts/ && \
cp ../$DIR/faq/schemas.xml faq/ && \
cp ../$DIR/reference/schemas.xml reference/ && \
cp ../$DIR/tasks/schemas.xml tasks/ && \
cp ../$DIR/tasks/schemas.xml tasks/ && \
cp ../$DIR/topics/schemas.xml topics/ && \
rm -rf ../$DIR/
```
Your language directories should now contain the appropriate schemas.xml files:
- en_FR
  - concepts
    - schemas.xml
  - faq
    - schemas.xml
  - reference
    - schemas.xml
  - tasks
    - schemas.xml
  - topics
    - schemas.xml
Open a DITA XML content file (.dita) with Emacs. The DITA XML syntax is shown in color. Places where the schema is not respected are underlined in red.
To insert a new tag, enter <, then press Ctrl+Enter. The list of possible tags appears.
Select a tag, then press Enter. Press Ctrl+Enter to display the list of permitted attributes.
To insert a closing tag after text, enter </, then press Ctrl+Enter.

Speed up your typing with Predictive mode for Emacs

This Predictive mode tutorial for Emacs is designed to help you set up and use Emacs’ Predictive word autocompletion and editing mode in a GNU/Linux environment (in this case, Debian).

Install make and texinfo:
Terminal window
```
$ sudo aptitude install make texinfo
```
Download Predictive.
Unzip the Predictive archive:
Terminal window
```
$ tar xzvf predictive-0.23.13.tar.gz
```
Go to the predictive directory:
Terminal window
```
$ cd predictive
```
Compile predictive:
Terminal window
```
$ make
```
Install predictive:
Terminal window
```
$ sudo make install
```

Insert the following code in the .emacs file:

;; predictive install location
     (add-to-list 'load-path "~/.emacs.d/predictive/")
     ;; dictionary locations
     (add-to-list 'load-path "~/.emacs.d/predictive/latex/")
     (add-to-list 'load-path "~/.emacs.d/predictive/texinfo/")
     (add-to-list 'load-path "~/.emacs.d/predictive/html/")
     ;; load predictive package
     (require 'predictive)

Start Emacs, then press Alt+X and enter:
```
predictive-mode
```