Posted to commits@pdfbox.apache.org by ms...@apache.org on 2016/05/16 17:16:27 UTC

[4/4] pdfbox-docs git commit: Site checkin for project Apache PDFBox Website

Site checkin for project Apache PDFBox Website


Project: http://git-wip-us.apache.org/repos/asf/pdfbox-docs/repo
Commit: http://git-wip-us.apache.org/repos/asf/pdfbox-docs/commit/799b1d78
Tree: http://git-wip-us.apache.org/repos/asf/pdfbox-docs/tree/799b1d78
Diff: http://git-wip-us.apache.org/repos/asf/pdfbox-docs/diff/799b1d78

Branch: refs/heads/asf-site
Commit: 799b1d78d8ba26568c2efc5d762327d647a0369d
Parents: fa128c8
Author: Maruan Sahyoun <sa...@fileaffairs.de>
Authored: Mon May 16 19:16:19 2016 +0200
Committer: Maruan Sahyoun <sa...@fileaffairs.de>
Committed: Mon May 16 19:16:19 2016 +0200

----------------------------------------------------------------------
 content/1.8/architecture.html                   |  170 +--
 content/1.8/commandline.html                    | 1007 +++++++++---------
 content/1.8/cookbook/documentcreation.html      |   16 +-
 content/1.8/cookbook/encryption.html            |    9 +-
 content/1.8/cookbook/fill-form-field.html       |   38 +-
 content/1.8/cookbook/pdfacreation.html          |   32 +-
 content/1.8/cookbook/pdfavalidation.html        |   92 +-
 content/1.8/cookbook/rendering.html             |   11 +-
 content/1.8/cookbook/textextraction.html        |  121 ++-
 .../1.8/cookbook/workingwithattachments.html    |   21 +-
 content/1.8/cookbook/workingwithfonts.html      |  158 +--
 content/1.8/cookbook/workingwithmetadata.html   |   29 +-
 content/1.8/dependencies.html                   |   47 +-
 content/1.8/faq.html                            |  108 +-
 content/2.0/commandline.html                    |  904 ++++++++--------
 content/2.0/cookbook/encryption.html            |   17 +-
 content/2.0/dependencies.html                   |   30 +-
 content/2.0/getting-started.html                |   22 +-
 content/2.0/migration.html                      |  184 ++--
 content/building.html                           |   73 +-
 content/codingconventions.html                  |  157 ++-
 content/css/site.css                            |    2 +-
 content/errors/403.html                         |   10 +-
 content/errors/404.html                         |   10 +-
 content/ideas.html                              |   40 +-
 content/index.html                              |   47 +-
 content/mailinglists.html                       |   89 +-
 .../2015/11/22/pdfbox-2.0.0-RC2-released.html   |    4 +-
 .../2016/01/15/pdfbox-2.0.0-RC3-released.html   |    4 +-
 .../news/2016/01/18/pdfbox-1.8.11-released.html |    4 +-
 .../news/2016/03/18/pdfbox-2.0.0-released.html  |    2 +-
 .../news/2016/04/26/new-pdfbox-releases.html    |    2 +-
 content/references.html                         |  260 ++---
 content/siteupdate.html                         |   81 +-
 content/support.html                            |   42 +-
 content/team.html                               |  218 ++--
 36 files changed, 2180 insertions(+), 1881 deletions(-)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/pdfbox-docs/blob/799b1d78/content/1.8/architecture.html
----------------------------------------------------------------------
diff --git a/content/1.8/architecture.html b/content/1.8/architecture.html
index a1d6be5..f4aad24 100644
--- a/content/1.8/architecture.html
+++ b/content/1.8/architecture.html
@@ -137,8 +137,8 @@ is organized as PDFBox was architected around the concepts layed out in the
 ISO-32000 (PDF) Specification</p>
 
 <ul>
-<li><a href="http://www.iso.org/iso/catalogue_detail.htm?csnumber=51502">ISO Site</a></li>
-<li><a href="http://wwwimages.adobe.com/www.adobe.com/content/dam/Adobe/en/devnet/pdf/pdfs/PDF32000_2008.pdf">Adobe Version</a></li>
+  <li><a href="http://www.iso.org/iso/catalogue_detail.htm?csnumber=51502">ISO Site</a></li>
+  <li><a href="http://wwwimages.adobe.com/www.adobe.com/content/dam/Adobe/en/devnet/pdf/pdfs/PDF32000_2008.pdf">Adobe Version</a></li>
 </ul>
 
 <h2 id="quick-introduction-to-the-pdf-format">Quick Introduction to the PDF format</h2>
@@ -151,7 +151,7 @@ make up the basic objects upon which higher level objects and structures are bui
 </p>
 
 <p>The organization of these objects, how to they are read and how to write them is defined in the file structure of the 
-PDF [see ISO-32000 7.5]. In addition a file can be encrpyted to protect the document&#39;s content [see ISO-32000 7.5].</p>
+PDF [see ISO-32000 7.5]. In addition a file can be encrpyted to protect the document’s content [see ISO-32000 7.5].</p>
 
 <p class="alert alert-info">PDFBox handles the reading in the *org.apache.pdfbox.pdfparser* package. 
 Writing of PDF files is handled in the *org.apache.pdfbox.pdfwriter* package.
@@ -162,11 +162,11 @@ as pages, bookmarks, annotations [see ISO-32000 7.7].</p>
 
 <p class="alert alert-info">PDFBox makes these higher level objects available through the 
 *org.apache.pdfbox.pdfmodel* package (The PD Model).
-</p> 
+</p>
 
 <p>In addition there is a COS representation available for the PD model if there is a need to 
 inspect the underlying structure or to handle special cases where the higher level PD model
-doesn&#39;t provide the functionality needed.</p>
+doesn’t provide the functionality needed.</p>
 
 <p class="alert alert-info">It's always the COS model which is represented in the PDF file.</p>
 
@@ -174,85 +174,94 @@ doesn&#39;t provide the functionality needed.</p>
 
 <p>As outlined above the basic PDF objects are represented in PDFBox in the org.apache.pdfbox.cos package.</p>
 
-<table><thead>
-<tr>
-<th>PDF Type</th>
-<th>Description</th>
-<th>Example</th>
-<th>PDFBox class</th>
-<th>ISO 32000</th>
-</tr>
-</thead><tbody>
-<tr>
-<td>Boolean</td>
-<td>Standard True/False values</td>
-<td>true</td>
-<td>org.apache.pdfbox.cos.COSBoolean</td>
-<td>7.3.2</td>
-</tr>
-<tr>
-<td>Number</td>
-<td>Integer and floating point numbers</td>
-<td>1 2.3</td>
-<td>org.apache.pdfbox.cos.COSInteger<br/>org.apache.pdfbox.cos.COSFloat</td>
-<td>7.3.3</td>
-</tr>
-<tr>
-<td>String</td>
-<td>A sequence of characters</td>
-<td>(This is a string)</td>
-<td>org.apache.pdfbox.cos.COSString</td>
-<td>7.3.4</td>
-</tr>
-<tr>
-<td>Name</td>
-<td>A predefined value in a PDF document, typically used as a key in a dictionary</td>
-<td>/Type</td>
-<td>org.apache.pdfbox.cos.COSName</td>
-<td>7.3.5</td>
-</tr>
-<tr>
-<td>Array</td>
-<td>Arrays are one-dimensional lists of objects accessed by a numeric index. Within an array each basic object is permitted as an entry.</td>
-<td>[549 3.14 false (Ralph) /SomeName]</td>
-<td>org.apache.pdfbox.cos.COSArray</td>
-<td>7.3.6</td>
-</tr>
-<tr>
-<td>Dictionary</td>
-<td>A map of name value pairs</td>
-<td>&lt;&lt;<br/>/Type /XObject<br/>/Name (Name)</br>/Size 1</br>&gt;&gt;</td>
-<td>org.apache.pdfbox.cos.COSDictionary</td>
-<td>7.3.7</td>
-</tr>
-<tr>
-<td>Stream</td>
-<td>A stream of data, typically compressed. This is used for page contents, images and embedded font streams.</td>
-<td>12 0 obj &lt;&lt; /Type /XObject &gt;&gt; stream 030004040404040404 endstream</td>
-<td>org.apache.pdfbox.cos.COSStream</td>
-<td>7.3.8</td>
-</tr>
-<tr>
-<td>Object</td>
-<td>A wrapper to any of the other objects, this can be used to reference an object multiple times. An object is referenced by using two numbers, an object number and a generation number. Initially the generation number will be zero unless the object got replaced later in the stream.</td>
-<td>12 0 obj &lt;&lt; /Type /XObject &gt;&gt; endobj</td>
-<td>org.apache.pdfbox.cos.COSObject</td>
-<td></td>
-</tr>
-</tbody></table>
+<table>
+  <thead>
+    <tr>
+      <th>PDF Type</th>
+      <th>Description</th>
+      <th>Example</th>
+      <th>PDFBox class</th>
+      <th>ISO 32000</th>
+    </tr>
+  </thead>
+  <tbody>
+    <tr>
+      <td>Boolean</td>
+      <td>Standard True/False values</td>
+      <td>true</td>
+      <td>org.apache.pdfbox.cos.COSBoolean</td>
+      <td>7.3.2</td>
+    </tr>
+    <tr>
+      <td>Number</td>
+      <td>Integer and floating point numbers</td>
+      <td>1 2.3</td>
+      <td>org.apache.pdfbox.cos.COSInteger<br />org.apache.pdfbox.cos.COSFloat</td>
+      <td>7.3.3</td>
+    </tr>
+    <tr>
+      <td>String</td>
+      <td>A sequence of characters</td>
+      <td>(This is a string)</td>
+      <td>org.apache.pdfbox.cos.COSString</td>
+      <td>7.3.4</td>
+    </tr>
+    <tr>
+      <td>Name</td>
+      <td>A predefined value in a PDF document, typically used as a key in a dictionary</td>
+      <td>/Type</td>
+      <td>org.apache.pdfbox.cos.COSName</td>
+      <td>7.3.5</td>
+    </tr>
+    <tr>
+      <td>Array</td>
+      <td>Arrays are one-dimensional lists of objects accessed by a numeric index. Within an array each basic object is permitted as an entry.</td>
+      <td>[549 3.14 false (Ralph) /SomeName]</td>
+      <td>org.apache.pdfbox.cos.COSArray</td>
+      <td>7.3.6</td>
+    </tr>
+    <tr>
+      <td>Dictionary</td>
+      <td>A map of name value pairs</td>
+      <td>«<br />/Type /XObject<br />/Name (Name)&lt;/br&gt;/Size 1&lt;/br&gt;»</td>
+      <td>org.apache.pdfbox.cos.COSDictionary</td>
+      <td>7.3.7</td>
+    </tr>
+    <tr>
+      <td>Stream</td>
+      <td>A stream of data, typically compressed. This is used for page contents, images and embedded font streams.</td>
+      <td>12 0 obj «&nbsp;/Type /XObject&nbsp;» stream 030004040404040404 endstream</td>
+      <td>org.apache.pdfbox.cos.COSStream</td>
+      <td>7.3.8</td>
+    </tr>
+    <tr>
+      <td>Object</td>
+      <td>A wrapper to any of the other objects, this can be used to reference an object multiple times. An object is referenced by using two numbers, an object number and a generation number. Initially the generation number will be zero unless the object got replaced later in the stream.</td>
+      <td>12 0 obj «&nbsp;/Type /XObject&nbsp;» endobj</td>
+      <td>org.apache.pdfbox.cos.COSObject</td>
+      <td>&nbsp;</td>
+    </tr>
+  </tbody>
+</table>
 
 <p>A page in a pdf document is represented with a COSDictionary. The entries that are available for a page can be seen in the PDF Reference and an example of a page looks like this:</p>
-<div class="highlight"><pre><code class="language-text" data-lang="text">&lt;&lt;
+
+<div class="highlighter-rouge"><pre class="highlight"><code>&lt;&lt;
     /Type /Page
     /MediaBox [0 0 612 915]
     /Contents 56 0 R
 &gt;&gt;
-</code></pre></div>
+</code></pre>
+</div>
+
 <p>The information within the dictionary can be accessed using the COS model</p>
-<div class="highlight"><pre><code class="language-java" data-lang="java"><span class="n">COSDictionary</span> <span class="n">page</span> <span class="o">=</span> <span class="o">...;</span>
+
+<div class="highlighter-rouge"><pre class="highlight"><code><span class="n">COSDictionary</span> <span class="n">page</span> <span class="o">=</span> <span class="o">...;</span>
 <span class="n">COSArray</span> <span class="n">mediaBox</span> <span class="o">=</span> <span class="o">(</span><span class="n">COSArray</span><span class="o">)</span><span class="n">page</span><span class="o">.</span><span class="na">getDictionaryObject</span><span class="o">(</span> <span class="s">"MediaBox"</span> <span class="o">);</span>
 <span class="n">System</span><span class="o">.</span><span class="na">out</span><span class="o">.</span><span class="na">println</span><span class="o">(</span> <span class="s">"Width:"</span> <span class="o">+</span> <span class="n">mediaBox</span><span class="o">.</span><span class="na">get</span><span class="o">(</span> <span class="mi">3</span> <span class="o">)</span> <span class="o">);</span>
-</code></pre></div>
+</code></pre>
+</div>
+
 <p>As can be seen from that little example the COS model provides a low level API to access 
 information within the PDF. In order to use the COS model successfully a good knowledge of
 the PDF specification is needed.</p>
@@ -265,17 +274,20 @@ parameters and no helper methods are available. The PD Model was created to help
 alleviate this problem. Each type of object(page, font, image) has a set of defined
 attributes that can be available in the dictionary. 
 A PD Model class is available for each of these so that strongly typed methods are
-available to access the attributes. </p>
+available to access the attributes.</p>
 
 <p>The same code from above to get the page width can be rewritten to use PD Model classes.</p>
-<div class="highlight"><pre><code class="language-java" data-lang="java"><span class="n">PDPage</span> <span class="n">page</span> <span class="o">=</span> <span class="o">...;</span>
+
+<div class="highlighter-rouge"><pre class="highlight"><code><span class="n">PDPage</span> <span class="n">page</span> <span class="o">=</span> <span class="o">...;</span>
 <span class="n">PDRectangle</span> <span class="n">mediaBox</span> <span class="o">=</span> <span class="n">page</span><span class="o">.</span><span class="na">getMediaBox</span><span class="o">();</span>
 <span class="n">System</span><span class="o">.</span><span class="na">out</span><span class="o">.</span><span class="na">println</span><span class="o">(</span> <span class="s">"Width:"</span> <span class="o">+</span> <span class="n">mediaBox</span><span class="o">.</span><span class="na">getWidth</span><span class="o">()</span> <span class="o">);</span>
-</code></pre></div>
+</code></pre>
+</div>
+
 <p>PD Model objects sit on top of COS model. Typically, the classes in the PD Model will only
 store a COS object and all setter/getter methods will modify data that is stored in the
 COS object. For example, when you call PDPage.getLastModified() the method will do a
-lookup in the COSDictionary with the key &quot;LastModified&quot;, if it is found the value is then
+lookup in the COSDictionary with the key “LastModified”, if it is found the value is then
 converter to a java.util.Calendar. When PDPage.setLastModified( Calendar ) is called then
 the Calendar is converted to a string in the COSDictionary.</p>
 
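The layering described in the architecture page (a PD Model object that stores a COS dictionary and exposes typed getters over it) can be sketched without PDFBox on the classpath. The sketch below is only an illustration under stated assumptions: `PageModelSketch`, `PageView`, `Rect` and `samplePage` are hypothetical names, a plain `Map` stands in for a COS dictionary, and none of this is PDFBox's actual implementation. It relies on one fact from the PDF specification: a rectangle array such as `/MediaBox` is ordered `[llx lly urx ury]`, so the width is the third entry minus the first.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/**
 * Illustrative only: these are NOT PDFBox classes. A plain Map plays the role
 * of a COS dictionary and PageView is a hypothetical typed wrapper in the
 * spirit of the PD Model.
 */
class PageModelSketch {

    /** Typed view over a PDF rectangle array [llx lly urx ury]. */
    static final class Rect {
        private final double[] coords;

        Rect(double[] coords) {
            if (coords.length != 4) {
                throw new IllegalArgumentException("a rectangle needs exactly 4 numbers");
            }
            this.coords = coords.clone();
        }

        double getWidth() {
            return coords[2] - coords[0]; // urx - llx
        }

        double getHeight() {
            return coords[3] - coords[1]; // ury - lly
        }
    }

    /** Hypothetical wrapper: stores only the dictionary and reads through to it. */
    static final class PageView {
        private final Map<String, Object> dictionary;

        PageView(Map<String, Object> dictionary) {
            this.dictionary = dictionary;
        }

        Rect getMediaBox() {
            Object value = dictionary.get("MediaBox");
            if (!(value instanceof List)) {
                throw new IllegalStateException("MediaBox is missing or not an array");
            }
            List<?> items = (List<?>) value;
            double[] coords = new double[items.size()];
            for (int i = 0; i < coords.length; i++) {
                coords[i] = ((Number) items.get(i)).doubleValue();
            }
            return new Rect(coords);
        }
    }

    /** Builds the stand-in for the page dictionary shown in the page example. */
    static Map<String, Object> samplePage() {
        Map<String, Object> page = new HashMap<>();
        page.put("Type", "Page");
        page.put("MediaBox", Arrays.asList(0.0, 0.0, 612.0, 915.0));
        return page;
    }

    public static void main(String[] args) {
        Map<String, Object> page = samplePage();

        // Low-level access: the caller must know the key name, cast the value
        // and remember the order of the array entries.
        List<?> raw = (List<?>) page.get("MediaBox");
        double rawWidth = ((Number) raw.get(2)).doubleValue()
                - ((Number) raw.get(0)).doubleValue();
        System.out.println("Width (raw): " + rawWidth);

        // Typed access: the wrapper hides the key, the cast and the ordering.
        PageView view = new PageView(page);
        System.out.println("Width (typed): " + view.getMediaBox().getWidth());
    }
}
```

Both calls read the same underlying entry. The wrapper adds no state of its own, which mirrors the statement that PD Model classes typically only store a COS object.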

http://git-wip-us.apache.org/repos/asf/pdfbox-docs/blob/799b1d78/content/1.8/commandline.html
----------------------------------------------------------------------
diff --git a/content/1.8/commandline.html b/content/1.8/commandline.html
index 474b699..c19f7db 100644
--- a/content/1.8/commandline.html
+++ b/content/1.8/commandline.html
@@ -137,7 +137,7 @@
 <p>See the <a href="/1.8/dependencies.html">Dependencies</a> page for instructions on how to set your classpath in order to run 
 PDFBox tools as Java applications.</p>
 
-<p><strong>Table of Contents</strong><br>
+<p><strong>Table of Contents</strong><br />
 <a href="#decrypt">Decrypt</a>
 <a href="#encrypt">Encrypt</a>
 <a href="#extractimages">ExtractImages</a>
@@ -158,292 +158,307 @@ PDFBox tools as Java applications.</p>
 
 <p>NOTE: You must have the owner password to decrypt the document!</p>
 
-<p>usage: <code>java -jar pdfbox-app-x.y.z.jar Decrypt [OPTIONS] &lt;inputfile&gt; [outputfile]</code></p>
-
-<table><thead>
-<tr>
-<th>Command Line Parameter</th>
-<th>Description</th>
-</tr>
-</thead><tbody>
-<tr>
-<td>-password</td>
-<td>Password to the PDF or certificate in keystore.</td>
-</tr>
-<tr>
-<td>-keyStore</td>
-<td>Path to keystore that holds certificate to decrypt the document. This is only required if the document is encrypted with a certificate, otherwise only the password is required.</td>
-</tr>
-<tr>
-<td>-alias</td>
-<td>The alias to the certificate in the keystore.</td>
-</tr>
-<tr>
-<td>inputfile</td>
-<td>The PDF file to decrypt.</td>
-</tr>
-<tr>
-<td>outputfile</td>
-<td>The file to save the decrypted document to. If left blank then it will be the same as the input file.</td>
-</tr>
-</tbody></table>
+<p>usage: <code class="highlighter-rouge">java -jar pdfbox-app-x.y.z.jar Decrypt [OPTIONS] &lt;inputfile&gt; [outputfile]</code></p>
+
+<table>
+  <thead>
+    <tr>
+      <th>Command Line Parameter</th>
+      <th>Description</th>
+    </tr>
+  </thead>
+  <tbody>
+    <tr>
+      <td>-password</td>
+      <td>Password to the PDF or certificate in keystore.</td>
+    </tr>
+    <tr>
+      <td>-keyStore</td>
+      <td>Path to keystore that holds certificate to decrypt the document. This is only required if the document is encrypted with a certificate, otherwise only the password is required.</td>
+    </tr>
+    <tr>
+      <td>-alias</td>
+      <td>The alias to the certificate in the keystore.</td>
+    </tr>
+    <tr>
+      <td>inputfile</td>
+      <td>The PDF file to decrypt.</td>
+    </tr>
+    <tr>
+      <td>outputfile</td>
+      <td>The file to save the decrypted document to. If left blank then it will be the same as the input file.</td>
+    </tr>
+  </tbody>
+</table>
 
 <h2 id="encrypt">Encrypt</h2>
 
 <p>This application will encrypt a PDF document.</p>
 
-<p>usage: <code>java -jar pdfbox-app-x.y.z.jar Encrypt [OPTIONS] &lt;password&gt; &lt;inputfile&gt;</code></p>
-
-<table><thead>
-<tr>
-<th>Command Line Parameter</th>
-<th>Default</th>
-<th>Description</th>
-</tr>
-</thead><tbody>
-<tr>
-<td>-O</td>
-<td></td>
-<td>The owner password to the PDF, ignored if -certFile is specified.</td>
-</tr>
-<tr>
-<td>-U</td>
-<td></td>
-<td>The user password to the PDF, ignored if -certFile is specified.</td>
-</tr>
-<tr>
-<td>-certFile</td>
-<td></td>
-<td>Path to X.509 cert file.</td>
-</tr>
-<tr>
-<td>-canAssemble</td>
-<td>true</td>
-<td>Set the assemble permission.</td>
-</tr>
-<tr>
-<td>-canExtractContent</td>
-<td>true</td>
-<td>Set the extraction permission.</td>
-</tr>
-<tr>
-<td>-canExtractForAccessibility</td>
-<td>true</td>
-<td>Set the extraction permission.</td>
-</tr>
-<tr>
-<td>-canFillInForm</td>
-<td>true</td>
-<td>Set the fill in form permission.</td>
-</tr>
-<tr>
-<td>-canModify</td>
-<td>true</td>
-<td>Set the modify permission.</td>
-</tr>
-<tr>
-<td>-canModifyAnnotations</td>
-<td>true</td>
-<td>Set the modify annots permission.</td>
-</tr>
-<tr>
-<td>-canPrint</td>
-<td>true</td>
-<td>Set the print permission.</td>
-</tr>
-<tr>
-<td>-canPrintDegraded</td>
-<td>true</td>
-<td>Set the print degraded permission.</td>
-</tr>
-<tr>
-<td>-keyLength</td>
-<td>40 or 128</td>
-<td>The number of bits for the encryption key. For 128 bits <a href="http://www.oracle.com/technetwork/java/javase/downloads/jce-7-download-432124.html">Java Cryptography Extension (JCE) Unlimited Strength Jurisdiction Policy Files</a> must be installed.</td>
-</tr>
-<tr>
-<td>inputfile</td>
-<td></td>
-<td>The PDF file to encrypt.</td>
-</tr>
-<tr>
-<td>outputfile</td>
-<td></td>
-<td>The file to save the encrypted document to. If left blank then it will be the same as the input file.</td>
-</tr>
-</tbody></table>
+<p>usage: <code class="highlighter-rouge">java -jar pdfbox-app-x.y.z.jar Encrypt [OPTIONS] &lt;password&gt; &lt;inputfile&gt;</code></p>
+
+<table>
+  <thead>
+    <tr>
+      <th>Command Line Parameter</th>
+      <th>Default</th>
+      <th>Description</th>
+    </tr>
+  </thead>
+  <tbody>
+    <tr>
+      <td>-O</td>
+      <td>&nbsp;</td>
+      <td>The owner password to the PDF, ignored if -certFile is specified.</td>
+    </tr>
+    <tr>
+      <td>-U</td>
+      <td>&nbsp;</td>
+      <td>The user password to the PDF, ignored if -certFile is specified.</td>
+    </tr>
+    <tr>
+      <td>-certFile</td>
+      <td>&nbsp;</td>
+      <td>Path to X.509 cert file.</td>
+    </tr>
+    <tr>
+      <td>-canAssemble</td>
+      <td>true</td>
+      <td>Set the assemble permission.</td>
+    </tr>
+    <tr>
+      <td>-canExtractContent</td>
+      <td>true</td>
+      <td>Set the extraction permission.</td>
+    </tr>
+    <tr>
+      <td>-canExtractForAccessibility</td>
+      <td>true</td>
+      <td>Set the extraction permission.</td>
+    </tr>
+    <tr>
+      <td>-canFillInForm</td>
+      <td>true</td>
+      <td>Set the fill in form permission.</td>
+    </tr>
+    <tr>
+      <td>-canModify</td>
+      <td>true</td>
+      <td>Set the modify permission.</td>
+    </tr>
+    <tr>
+      <td>-canModifyAnnotations</td>
+      <td>true</td>
+      <td>Set the modify annots permission.</td>
+    </tr>
+    <tr>
+      <td>-canPrint</td>
+      <td>true</td>
+      <td>Set the print permission.</td>
+    </tr>
+    <tr>
+      <td>-canPrintDegraded</td>
+      <td>true</td>
+      <td>Set the print degraded permission.</td>
+    </tr>
+    <tr>
+      <td>-keyLength</td>
+      <td>40 or 128</td>
+      <td>The number of bits for the encryption key. For 128 bits <a href="http://www.oracle.com/technetwork/java/javase/downloads/jce-7-download-432124.html">Java Cryptography Extension (JCE) Unlimited Strength Jurisdiction Policy Files</a> must be installed.</td>
+    </tr>
+    <tr>
+      <td>inputfile</td>
+      <td>&nbsp;</td>
+      <td>The PDF file to encrypt.</td>
+    </tr>
+    <tr>
+      <td>outputfile</td>
+      <td>&nbsp;</td>
+      <td>The file to save the encrypted document to. If left blank then it will be the same as the input file.</td>
+    </tr>
+  </tbody>
+</table>
 
 <h2 id="extractimages">ExtractImages</h2>
 
 <p>This application will extract all images from the given PDF document.</p>
 
-<p>usage: <code>java -jar pdfbox-app-x.y.z.jar ExtractImages [OPTIONS] &lt;inputfile&gt;</code></p>
-
-<table><thead>
-<tr>
-<th>Command Line Parameter</th>
-<th>Default</th>
-<th>Description</th>
-</tr>
-</thead><tbody>
-<tr>
-<td>-password</td>
-<td></td>
-<td>The password to the PDF document.</td>
-</tr>
-<tr>
-<td>-prefix</td>
-<td>PDF name</td>
-<td>Image prefix to use.</td>
-</tr>
-<tr>
-<td>-directJPEG</td>
-<td>false</td>
-<td>Forces the direct extraction of JPEG images regardless of colorspace.</td>
-</tr>
-</tbody></table>
+<p>usage: <code class="highlighter-rouge">java -jar pdfbox-app-x.y.z.jar ExtractImages [OPTIONS] &lt;inputfile&gt;</code></p>
+
+<table>
+  <thead>
+    <tr>
+      <th>Command Line Parameter</th>
+      <th>Default</th>
+      <th>Description</th>
+    </tr>
+  </thead>
+  <tbody>
+    <tr>
+      <td>-password</td>
+      <td>&nbsp;</td>
+      <td>The password to the PDF document.</td>
+    </tr>
+    <tr>
+      <td>-prefix</td>
+      <td>PDF name</td>
+      <td>Image prefix to use.</td>
+    </tr>
+    <tr>
+      <td>-directJPEG</td>
+      <td>false</td>
+      <td>Forces the direct extraction of JPEG images regardless of colorspace.</td>
+    </tr>
+  </tbody>
+</table>
 
 <h2 id="extracttext">ExtractText</h2>
 
 <p>This application will extract all text from the given PDF document.</p>
 
-<p>usage: <code>java -jar pdfbox-app-x.y.z.jar ExtractText [OPTIONS] &lt;inputfile&gt; [Text file]</code></p>
-
-<table><thead>
-<tr>
-<th>Command Line Parameter</th>
-<th>Default</th>
-<th>Description</th>
-</tr>
-</thead><tbody>
-<tr>
-<td>-password</td>
-<td></td>
-<td>The password to the PDF document.</td>
-</tr>
-<tr>
-<td>-encoding</td>
-<td>default encoding</td>
-<td>The encoding type of the text file, e.g. ISO-8859-1, UTF-8, UTF-16BE.</td>
-</tr>
-<tr>
-<td>-console</td>
-<td>false</td>
-<td>Send text to console instead of file.</td>
-</tr>
-<tr>
-<td>-html</td>
-<td>false</td>
-<td>Output in HTML format instead of raw text.</td>
-</tr>
-<tr>
-<td>-sort</td>
-<td>false</td>
-<td>Sort the text before writing.</td>
-</tr>
-<tr>
-<td>-ignoreBeads</td>
-<td>false</td>
-<td>Disables the separation by beads.</td>
-</tr>
-<tr>
-<td>-force</td>
-<td>false</td>
-<td>Enables pdfbox to ignore corrupt objects.</td>
-</tr>
-<tr>
-<td>-debug</td>
-<td>false</td>
-<td>Enables debug output about the time consumption of every stage.</td>
-</tr>
-<tr>
-<td>-startPage</td>
-<td>1</td>
-<td>The first page to extract, one based.</td>
-</tr>
-<tr>
-<td>-endPage</td>
-<td>Integer.MAX_INT</td>
-<td>The last page to extract, one based.</td>
-</tr>
-<tr>
-<td>-nonSeq</td>
-<td>false</td>
-<td>Use the new non sequential parser.</td>
-</tr>
-</tbody></table>
+<p>usage: <code class="highlighter-rouge">java -jar pdfbox-app-x.y.z.jar ExtractText [OPTIONS] &lt;inputfile&gt; [Text file]</code></p>
+
+<table>
+  <thead>
+    <tr>
+      <th>Command Line Parameter</th>
+      <th>Default</th>
+      <th>Description</th>
+    </tr>
+  </thead>
+  <tbody>
+    <tr>
+      <td>-password</td>
+      <td>&nbsp;</td>
+      <td>The password to the PDF document.</td>
+    </tr>
+    <tr>
+      <td>-encoding</td>
+      <td>default encoding</td>
+      <td>The encoding type of the text file, e.g. ISO-8859-1, UTF-8, UTF-16BE.</td>
+    </tr>
+    <tr>
+      <td>-console</td>
+      <td>false</td>
+      <td>Send text to console instead of file.</td>
+    </tr>
+    <tr>
+      <td>-html</td>
+      <td>false</td>
+      <td>Output in HTML format instead of raw text.</td>
+    </tr>
+    <tr>
+      <td>-sort</td>
+      <td>false</td>
+      <td>Sort the text before writing.</td>
+    </tr>
+    <tr>
+      <td>-ignoreBeads</td>
+      <td>false</td>
+      <td>Disables the separation by beads.</td>
+    </tr>
+    <tr>
+      <td>-force</td>
+      <td>false</td>
+      <td>Enables pdfbox to ignore corrupt objects.</td>
+    </tr>
+    <tr>
+      <td>-debug</td>
+      <td>false</td>
+      <td>Enables debug output about the time consumption of every stage.</td>
+    </tr>
+    <tr>
+      <td>-startPage</td>
+      <td>1</td>
+      <td>The first page to extract, one based.</td>
+    </tr>
+    <tr>
+      <td>-endPage</td>
+      <td>Integer.MAX_INT</td>
+      <td>The last page to extract, one based.</td>
+    </tr>
+    <tr>
+      <td>-nonSeq</td>
+      <td>false</td>
+      <td>Use the new non sequential parser.</td>
+    </tr>
+  </tbody>
+</table>
 
 <h2 id="overlaypdf">OverlayPDF</h2>
 
 <p>This application will overlay one document with the content of another document</p>
 
-<p>usage: <code>java -jar pdfbox-app-x.y.z.jar OverlayPDF &lt;input.pdf&gt; [OPTIONS] &lt;output.pdf&gt;</code></p>
-
-<table><thead>
-<tr>
-<th>Command Line Parameter</th>
-<th>Default</th>
-<th>Description</th>
-</tr>
-</thead><tbody>
-<tr>
-<td>inputfile</td>
-<td></td>
-<td>The PDF file to be overlayed.</td>
-</tr>
-<tr>
-<td>defaultOverlay.pdf</td>
-<td></td>
-<td>Default overlay file.</td>
-</tr>
-<tr>
-<td>-odd oddPageOverlay.pdf</td>
-<td></td>
-<td>Overlay file used for odd pages.</td>
-</tr>
-<tr>
-<td>-even evenPageOverlay.pdf</td>
-<td></td>
-<td>Overlay file used for even pages.</td>
-</tr>
-<tr>
-<td>-first firstPageOverlay.pdf</td>
-<td></td>
-<td>Overlay file used for the first page.</td>
-</tr>
-<tr>
-<td>-last lastPageOverlay.pdf</td>
-<td></td>
-<td>Overlay file used for the last pages.</td>
-</tr>
-<tr>
-<td>-page pageNumber specificPageOverlay.pdf</td>
-<td></td>
-<td>overlay file used for the given page number, may occur more than once.</td>
-</tr>
-<tr>
-<td>-position</td>
-<td>background</td>
-<td>Where to put the overlay, foreground or background.</td>
-</tr>
-<tr>
-<td>-nonSeq</td>
-<td>false</td>
-<td>Use the new non sequential parser.</td>
-</tr>
-<tr>
-<td>outputfile</td>
-<td></td>
-<td>The resulting pdf file.</td>
-</tr>
-</tbody></table>
+<p>usage: <code class="highlighter-rouge">java -jar pdfbox-app-x.y.z.jar OverlayPDF &lt;input.pdf&gt; [OPTIONS] &lt;output.pdf&gt;</code></p>
+
+<table>
+  <thead>
+    <tr>
+      <th>Command Line Parameter</th>
+      <th>Default</th>
+      <th>Description</th>
+    </tr>
+  </thead>
+  <tbody>
+    <tr>
+      <td>inputfile</td>
+      <td>&nbsp;</td>
+      <td>The PDF file to be overlayed.</td>
+    </tr>
+    <tr>
+      <td>defaultOverlay.pdf</td>
+      <td>&nbsp;</td>
+      <td>Default overlay file.</td>
+    </tr>
+    <tr>
+      <td>-odd oddPageOverlay.pdf</td>
+      <td>&nbsp;</td>
+      <td>Overlay file used for odd pages.</td>
+    </tr>
+    <tr>
+      <td>-even evenPageOverlay.pdf</td>
+      <td>&nbsp;</td>
+      <td>Overlay file used for even pages.</td>
+    </tr>
+    <tr>
+      <td>-first firstPageOverlay.pdf</td>
+      <td>&nbsp;</td>
+      <td>Overlay file used for the first page.</td>
+    </tr>
+    <tr>
+      <td>-last lastPageOverlay.pdf</td>
+      <td>&nbsp;</td>
+      <td>Overlay file used for the last pages.</td>
+    </tr>
+    <tr>
+      <td>-page pageNumber specificPageOverlay.pdf</td>
+      <td>&nbsp;</td>
+      <td>overlay file used for the given page number, may occur more than once.</td>
+    </tr>
+    <tr>
+      <td>-position</td>
+      <td>background</td>
+      <td>Where to put the overlay, foreground or background.</td>
+    </tr>
+    <tr>
+      <td>-nonSeq</td>
+      <td>false</td>
+      <td>Use the new non sequential parser.</td>
+    </tr>
+    <tr>
+      <td>outputfile</td>
+      <td>&nbsp;</td>
+      <td>The resulting pdf file.</td>
+    </tr>
+  </tbody>
+</table>
 
 <p>Examples:</p>
 
 <ul>
-<li>OverlayPDF input.pdf overlay.pdf -nonSeq output.pdf</li>
-<li>OverlayPDF input.pdf defaultOverlay.pdf -page 10 overlayForPage10.pdf -position foreground -nonSeq output.pdf</li>
-<li>OverlayPDF input.pdf -odd oddOverlay.pdf -even evenOverlay.pdf -nonSeq output.pdf</li>
+  <li>OverlayPDF input.pdf overlay.pdf -nonSeq output.pdf</li>
+  <li>OverlayPDF input.pdf defaultOverlay.pdf -page 10 overlayForPage10.pdf -position foreground -nonSeq output.pdf</li>
+  <li>OverlayPDF input.pdf -odd oddOverlay.pdf -even evenOverlay.pdf -nonSeq output.pdf</li>
 </ul>
 
 <h2 id="printpdf">PrintPDF</h2>
@@ -452,271 +467,293 @@ PDFBox tools as Java applications.</p>
 
 <p class="alert alert-info">You must have the correct permissions to print the document!</p>
 
-<p>usage: <code>java -jar pdfbox-app-x.y.z.jar PrintPDF [OPTIONS] &lt;inputfile&gt;</code></p>
-
-<table><thead>
-<tr>
-<th>Command Line Parameter</th>
-<th>Description</th>
-</tr>
-</thead><tbody>
-<tr>
-<td>-password</td>
-<td>The password to decrypt the PDF.</td>
-</tr>
-<tr>
-<td>-silentPrint</td>
-<td>Print the PDF without prompting for a printer.</td>
-</tr>
-<tr>
-<td>inputfile</td>
-<td>The PDF file to print.</td>
-</tr>
-</tbody></table>
+<p>usage: <code class="highlighter-rouge">java -jar pdfbox-app-x.y.z.jar PrintPDF [OPTIONS] &lt;inputfile&gt;</code></p>
+
+<table>
+  <thead>
+    <tr>
+      <th>Command Line Parameter</th>
+      <th>Description</th>
+    </tr>
+  </thead>
+  <tbody>
+    <tr>
+      <td>-password</td>
+      <td>The password to decrypt the PDF.</td>
+    </tr>
+    <tr>
+      <td>-silentPrint</td>
+      <td>Print the PDF without prompting for a printer.</td>
+    </tr>
+    <tr>
+      <td>inputfile</td>
+      <td>The PDF file to print.</td>
+    </tr>
+  </tbody>
+</table>
 
 <h2 id="pdfdebugger">PDFDebugger</h2>
 
 <p>This application will take an existing PDF document and allows to analyze and inspect the internal structure</p>
 
-<p>usage: <code>java -jar pdfbox-app-x.y.z.jar PDFDebugger [inputfile]</code></p>
-
-<table><thead>
-<tr>
-<th>Command Line Parameter</th>
-<th>Default</th>
-<th>Description</th>
-</tr>
-</thead><tbody>
-<tr>
-<td>-password</td>
-<td></td>
-<td>The password to the PDF document.</td>
-</tr>
-<tr>
-<td>-nonSeq</td>
-<td>false</td>
-<td>Use the new non sequential parser.</td>
-</tr>
-<tr>
-<td>inputfile</td>
-<td></td>
-<td>the name of an optional PDF file to open.</td>
-</tr>
-</tbody></table>
+<p>usage: <code class="highlighter-rouge">java -jar pdfbox-app-x.y.z.jar PDFDebugger [inputfile]</code></p>
+
+<table>
+  <thead>
+    <tr>
+      <th>Command Line Parameter</th>
+      <th>Default</th>
+      <th>Description</th>
+    </tr>
+  </thead>
+  <tbody>
+    <tr>
+      <td>-password</td>
+      <td>&nbsp;</td>
+      <td>The password to the PDF document.</td>
+    </tr>
+    <tr>
+      <td>-nonSeq</td>
+      <td>false</td>
+      <td>Use the new non sequential parser.</td>
+    </tr>
+    <tr>
+      <td>inputfile</td>
+      <td>&nbsp;</td>
+      <td>the name of an optional PDF file to open.</td>
+    </tr>
+  </tbody>
+</table>
 
 <h2 id="pdfreader">PDFReader</h2>
 
 <p>An application to read PDF documents. This will provide Acrobat Reader like functionality.</p>
 
-<p>usage: <code>java -jar pdfbox-app-x.y.z.jar PDFReader [PDF file]</code></p>
-
-<table><thead>
-<tr>
-<th>Command Line Parameter</th>
-<th>Default</th>
-<th>Description</th>
-</tr>
-</thead><tbody>
-<tr>
-<td>-password</td>
-<td></td>
-<td>The password to the PDF document.</td>
-</tr>
-<tr>
-<td>-nonSeq</td>
-<td>false</td>
-<td>Use the new non sequential parser.</td>
-</tr>
-<tr>
-<td>PDF file</td>
-<td></td>
-<td>the name of an optional PDF file to open</td>
-</tr>
-</tbody></table>
+<p>usage: <code class="highlighter-rouge">java -jar pdfbox-app-x.y.z.jar PDFReader [PDF file]</code></p>
+
+<table>
+  <thead>
+    <tr>
+      <th>Command Line Parameter</th>
+      <th>Default</th>
+      <th>Description</th>
+    </tr>
+  </thead>
+  <tbody>
+    <tr>
+      <td>-password</td>
+      <td>&nbsp;</td>
+      <td>The password to the PDF document.</td>
+    </tr>
+    <tr>
+      <td>-nonSeq</td>
+      <td>false</td>
+      <td>Use the new non-sequential parser.</td>
+    </tr>
+    <tr>
+      <td>PDF file</td>
+      <td>&nbsp;</td>
+      <td>The name of an optional PDF file to open.</td>
+    </tr>
+  </tbody>
+</table>
 
 <h2 id="pdfmerger">PDFMerger</h2>
 
 <p>This application will take a list of PDF documents and merge them, saving the result in a new document.</p>
 
-<p>usage: <code>java -jar pdfbox-app-x.y.z.jar PDFMerger &lt;Source PDF files (2 ..n)&gt; &lt;Target PDF file&gt;</code></p>
+<p>usage: <code class="highlighter-rouge">java -jar pdfbox-app-x.y.z.jar PDFMerger &lt;Source PDF files (2 ..n)&gt; &lt;Target PDF file&gt;</code></p>
 
 <h2 id="pdfsplit">PDFSplit</h2>
 
 <p>This application will take an existing PDF document and split it into a number of other documents.</p>
 
-<p>usage: <code>java -jar pdfbox-app-x.y.z.jar PDFSplit [OPTIONS] &lt;PDF file&gt;</code></p>
-
-<table><thead>
-<tr>
-<th>Command Line Parameter</th>
-<th>Default</th>
-<th>Description</th>
-</tr>
-</thead><tbody>
-<tr>
-<td>-password</td>
-<td></td>
-<td>The password to the PDF document.</td>
-</tr>
-<tr>
-<td>-split</td>
-<td></td>
-<td>Number of pages of every splitted part of the pdf.</td>
-</tr>
-<tr>
-<td>-startPage</td>
-<td></td>
-<td>The page to start at.</td>
-</tr>
-<tr>
-<td>-endPage</td>
-<td></td>
-<td>The page to stop at.</td>
-</tr>
-<tr>
-<td>-nonSeq</td>
-<td>false</td>
-<td>Use the new non sequential parser.</td>
-</tr>
-</tbody></table>
+<p>usage: <code class="highlighter-rouge">java -jar pdfbox-app-x.y.z.jar PDFSplit [OPTIONS] &lt;PDF file&gt;</code></p>
+
+<table>
+  <thead>
+    <tr>
+      <th>Command Line Parameter</th>
+      <th>Default</th>
+      <th>Description</th>
+    </tr>
+  </thead>
+  <tbody>
+    <tr>
+      <td>-password</td>
+      <td>&nbsp;</td>
+      <td>The password to the PDF document.</td>
+    </tr>
+    <tr>
+      <td>-split</td>
+      <td>&nbsp;</td>
+      <td>Number of pages in each split part of the PDF.</td>
+    </tr>
+    <tr>
+      <td>-startPage</td>
+      <td>&nbsp;</td>
+      <td>The page to start at.</td>
+    </tr>
+    <tr>
+      <td>-endPage</td>
+      <td>&nbsp;</td>
+      <td>The page to stop at.</td>
+    </tr>
+    <tr>
+      <td>-nonSeq</td>
+      <td>false</td>
+      <td>Use the new non-sequential parser.</td>
+    </tr>
+  </tbody>
+</table>
 
 <p>Examples:</p>
 
 <ul>
-<li>PDFSplit -split 2 sample_with_13_pages.pdf will split the pdf in pieces of 2 pages each except the last which will contain 1 page only.</li>
-<li>PDFSplit -startPage 5 sample_with_13_pages.pdf will provide a pdf containing all pages of the source pdf starting at page 5</li>
-<li>PDFSplit -startPage 5 -endPage 10 sample_with_13_pages.pdf will provide a pdf containing all pages from 5 to 10 of the source pdf</li>
-<li>PDFSplit -split 2 -startPage 5 -endPage 10 sample_with_13_pages.pdf will provide 3 pdfs containing all pages from 5 to 10 of the source pdf 2 pages each</li>
+  <li><code class="highlighter-rouge">PDFSplit -split 2 sample_with_13_pages.pdf</code> splits the PDF into pieces of 2 pages each, except the last, which contains only 1 page.</li>
+  <li><code class="highlighter-rouge">PDFSplit -startPage 5 sample_with_13_pages.pdf</code> produces a PDF containing all pages of the source PDF from page 5 onward.</li>
+  <li><code class="highlighter-rouge">PDFSplit -startPage 5 -endPage 10 sample_with_13_pages.pdf</code> produces a PDF containing pages 5 to 10 of the source PDF.</li>
+  <li><code class="highlighter-rouge">PDFSplit -split 2 -startPage 5 -endPage 10 sample_with_13_pages.pdf</code> produces 3 PDFs of 2 pages each, together covering pages 5 to 10 of the source PDF.</li>
 </ul>
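<p>The page arithmetic behind the examples above can be sketched in plain Java. This is only an illustration of how the documented options combine, not the PDFSplit implementation; the class name SplitRanges and the method ranges are hypothetical and are not part of PDFBox.</p>

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of PDFBox: it only models the page ranges
// described in the PDFSplit examples.
class SplitRanges {

    /**
     * Returns inclusive, one-based {first, last} page ranges, one per output part.
     * pagesPerPart plays the role of -split; startPage and endPage play the
     * roles of -startPage and -endPage.
     */
    static List<int[]> ranges(int startPage, int endPage, int pagesPerPart) {
        if (pagesPerPart < 1) {
            throw new IllegalArgumentException("pagesPerPart must be at least 1");
        }
        List<int[]> parts = new ArrayList<>();
        for (int first = startPage; first <= endPage; first += pagesPerPart) {
            // The last part may be shorter than pagesPerPart.
            int last = Math.min(first + pagesPerPart - 1, endPage);
            parts.add(new int[] { first, last });
        }
        return parts;
    }

    public static void main(String[] args) {
        // Models "-split 2" on a 13-page document.
        for (int[] range : ranges(1, 13, 2)) {
            System.out.println(range[0] + "-" + range[1]);
        }
    }
}
```

<p>Under these assumptions, a 13-page document split with a part size of 2 yields 7 parts, the last holding only page 13, and restricting the range to pages 5 to 10 yields 3 parts, which matches the examples listed above.</p>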
 
 <h2 id="pdftoimage">PDFToImage</h2>
 
 <p>This application will create an image for every page in the PDF document.</p>
 
-<p>usage: <code>java -jar pdfbox-app-x.y.z.jar PDFToImage [OPTIONS] &lt;PDF file&gt;</code></p>
-
-<table><thead>
-<tr>
-<th>Command Line Parameter</th>
-<th>Default</th>
-<th>Description</th>
-</tr>
-</thead><tbody>
-<tr>
-<td>-password</td>
-<td></td>
-<td>The password to the PDF document.</td>
-</tr>
-<tr>
-<td>-imageType</td>
-<td>jpg</td>
-<td>The image type to write to. Currently only jpg or png.</td>
-</tr>
-<tr>
-<td>-outputPrefix</td>
-<td>Name of PDF document</td>
-<td>The prefix to the image file.</td>
-</tr>
-<tr>
-<td>-startPage</td>
-<td>1</td>
-<td>The first page to convert, one based.</td>
-</tr>
-<tr>
-<td>-endPage</td>
-<td>Integer.MAX_INT</td>
-<td>The last page to convert, one based.</td>
-</tr>
-<tr>
-<td>-nonSeq</td>
-<td>false</td>
-<td>Use the new non sequential parser.</td>
-</tr>
-</tbody></table>
+<p>usage: <code class="highlighter-rouge">java -jar pdfbox-app-x.y.z.jar PDFToImage [OPTIONS] &lt;PDF file&gt;</code></p>
+
+<table>
+  <thead>
+    <tr>
+      <th>Command Line Parameter</th>
+      <th>Default</th>
+      <th>Description</th>
+    </tr>
+  </thead>
+  <tbody>
+    <tr>
+      <td>-password</td>
+      <td>&nbsp;</td>
+      <td>The password to the PDF document.</td>
+    </tr>
+    <tr>
+      <td>-imageType</td>
+      <td>jpg</td>
+      <td>The image type to write to. Currently only jpg or png.</td>
+    </tr>
+    <tr>
+      <td>-outputPrefix</td>
+      <td>Name of PDF document</td>
+      <td>The prefix to the image file.</td>
+    </tr>
+    <tr>
+      <td>-startPage</td>
+      <td>1</td>
+      <td>The first page to convert, one-based.</td>
+    </tr>
+    <tr>
+      <td>-endPage</td>
+      <td>Integer.MAX_VALUE</td>
+      <td>The last page to convert, one-based.</td>
+    </tr>
+    <tr>
+      <td>-nonSeq</td>
+      <td>false</td>
+      <td>Use the new non-sequential parser.</td>
+    </tr>
+  </tbody>
+</table>
 
 <h2 id="texttopdf">TextToPDF</h2>
 
 <p>This application will create a PDF document from a text file.</p>
 
-<p>usage: <code>java -jar pdfbox-app-x.y.z.jar TextToPDF [OPTIONS] &lt;outputfile&gt; &lt;textfile&gt;</code></p>
-
-<table><thead>
-<tr>
-<th>Command Line Parameter</th>
-<th>Default</th>
-<th>Description</th>
-</tr>
-</thead><tbody>
-<tr>
-<td>-standardFont</td>
-<td>Helvetica</td>
-<td>The font to use for the text. Either this or -ttf should be specified but not both.</td>
-</tr>
-<tr>
-<td>-ttf</td>
-<td></td>
-<td>The TTF font to use for the text. Either this or -standardFont should be specified but not both.</td>
-</tr>
-<tr>
-<td>-fontSize</td>
-<td>10</td>
-<td>The size of the font to use.</td>
-</tr>
-</tbody></table>
-
-<p>The following font names can be used for the parameter <code>standardFont</code>:</p>
+<p>usage: <code class="highlighter-rouge">java -jar pdfbox-app-x.y.z.jar TextToPDF [OPTIONS] &lt;outputfile&gt; &lt;textfile&gt;</code></p>
+
+<table>
+  <thead>
+    <tr>
+      <th>Command Line Parameter</th>
+      <th>Default</th>
+      <th>Description</th>
+    </tr>
+  </thead>
+  <tbody>
+    <tr>
+      <td>-standardFont</td>
+      <td>Helvetica</td>
+      <td>The font to use for the text. Either this or -ttf should be specified but not both.</td>
+    </tr>
+    <tr>
+      <td>-ttf</td>
+      <td>&nbsp;</td>
+      <td>The TTF font to use for the text. Either this or -standardFont should be specified but not both.</td>
+    </tr>
+    <tr>
+      <td>-fontSize</td>
+      <td>10</td>
+      <td>The size of the font to use.</td>
+    </tr>
+  </tbody>
+</table>
+
+<p>The following font names can be used for the parameter <code class="highlighter-rouge">standardFont</code>:</p>
 
 <ul>
-<li>Courier</li>
-<li>Courier-Bold</li>
-<li>Courier-Oblique</li>
-<li>Courier-BoldOblique</li>
-<li>Helvetica</li>
-<li>Helvetica-Bold</li>
-<li>Helvetica-Oblique</li>
-<li>Helvetica-BoldOblique</li>
-<li>Symbol</li>
-<li>Times-Bold</li>
-<li>Times-Roman</li>
-<li>Times-Italic</li>
-<li>Times-BoldItalic</li>
-<li>ZapfDingbats</li>
+  <li>Courier</li>
+  <li>Courier-Bold</li>
+  <li>Courier-Oblique</li>
+  <li>Courier-BoldOblique</li>
+  <li>Helvetica</li>
+  <li>Helvetica-Bold</li>
+  <li>Helvetica-Oblique</li>
+  <li>Helvetica-BoldOblique</li>
+  <li>Symbol</li>
+  <li>Times-Bold</li>
+  <li>Times-Roman</li>
+  <li>Times-Italic</li>
+  <li>Times-BoldItalic</li>
+  <li>ZapfDingbats</li>
 </ul>
 
 <h2 id="writedecodeddoc">WriteDecodedDoc</h2>
 
 <p>An application to decompress PDF documents.</p>
 
-<p>usage: <code>java -jar pdfbox-app-x.y.z.jar WriteDecodedDoc &lt;input-file&gt; &lt;output-file&gt;</code></p>
-
-<table><thead>
-<tr>
-<th>Command Line Parameter</th>
-<th>Default</th>
-<th>Description</th>
-</tr>
-</thead><tbody>
-<tr>
-<td>-password</td>
-<td></td>
-<td>The password to the PDF document.</td>
-</tr>
-<tr>
-<td>-nonSeq</td>
-<td>false</td>
-<td>Use the new non sequential parser.</td>
-</tr>
-<tr>
-<td><input-file></td>
-<td></td>
-<td>The PDF file to decompress</td>
-</tr>
-<tr>
-<td><output-file></td>
-<td></td>
-<td>The destination PDF file</td>
-</tr>
-</tbody></table>
+<p>usage: <code class="highlighter-rouge">java -jar pdfbox-app-x.y.z.jar WriteDecodedDoc &lt;input-file&gt; &lt;output-file&gt;</code></p>
+
+<table>
+  <thead>
+    <tr>
+      <th>Command Line Parameter</th>
+      <th>Default</th>
+      <th>Description</th>
+    </tr>
+  </thead>
+  <tbody>
+    <tr>
+      <td>-password</td>
+      <td>&nbsp;</td>
+      <td>The password to the PDF document.</td>
+    </tr>
+    <tr>
+      <td>-nonSeq</td>
+      <td>false</td>
+      <td>Use the new non-sequential parser.</td>
+    </tr>
+    <tr>
+      <td>&lt;input-file&gt;</td>
+      <td>&nbsp;</td>
+      <td>The PDF file to decompress.</td>
+    </tr>
+    <tr>
+      <td>&lt;output-file&gt;</td>
+      <td>&nbsp;</td>
+      <td>The destination PDF file.</td>
+    </tr>
+  </tbody>
+</table>
+
 
             </div>
         </div>

http://git-wip-us.apache.org/repos/asf/pdfbox-docs/blob/799b1d78/content/1.8/cookbook/documentcreation.html
----------------------------------------------------------------------
diff --git a/content/1.8/cookbook/documentcreation.html b/content/1.8/cookbook/documentcreation.html
index 793efbf..0e2a902 100644
--- a/content/1.8/cookbook/documentcreation.html
+++ b/content/1.8/cookbook/documentcreation.html
@@ -135,7 +135,8 @@
 <h2 id="create-a-blank-pdf">Create a blank PDF</h2>
 
 <p>This small sample shows how to create a new PDF document using PDFBox.</p>
-<div class="highlight"><pre><code class="language-java" data-lang="java"><span class="c1">// Create a new empty document</span>
+
+<div class="highlighter-rouge"><pre class="highlight"><code><span class="c1">// Create a new empty document</span>
 <span class="n">PDDocument</span> <span class="n">document</span> <span class="o">=</span> <span class="k">new</span> <span class="n">PDDocument</span><span class="o">();</span>
 
 <span class="c1">// Create a new blank page and add it to the document</span>
@@ -148,11 +149,14 @@
 <span class="c1">// finally make sure that the document is properly</span>
 <span class="c1">// closed.</span>
 <span class="n">document</span><span class="o">.</span><span class="na">close</span><span class="o">();</span>
-</code></pre></div>
+</code></pre>
+</div>
+
 <h2 id="hello-world-using-a-pdf-base-font">Hello World using a PDF base font</h2>
 
-<p>This small sample shows how to create a new document and print the text &quot;Hello World&quot; using one of the PDF base fonts.</p>
-<div class="highlight"><pre><code class="language-java" data-lang="java"><span class="c1">// Create a document and add a page to it</span>
+<p>This small sample shows how to create a new document and print the text “Hello World” using one of the PDF base fonts.</p>
+
+<div class="highlighter-rouge"><pre class="highlight"><code><span class="c1">// Create a document and add a page to it</span>
 <span class="n">PDDocument</span> <span class="n">document</span> <span class="o">=</span> <span class="k">new</span> <span class="n">PDDocument</span><span class="o">();</span>
 <span class="n">PDPage</span> <span class="n">page</span> <span class="o">=</span> <span class="k">new</span> <span class="n">PDPage</span><span class="o">();</span>
 <span class="n">document</span><span class="o">.</span><span class="na">addPage</span><span class="o">(</span> <span class="n">page</span> <span class="o">);</span>
@@ -176,7 +180,9 @@
 <span class="c1">// Save the results and ensure that the document is properly closed:</span>
 <span class="n">document</span><span class="o">.</span><span class="na">save</span><span class="o">(</span> <span class="s">"Hello World.pdf"</span><span class="o">);</span>
 <span class="n">document</span><span class="o">.</span><span class="na">close</span><span class="o">();</span>
-</code></pre></div>
+</code></pre>
+</div>
+
             </div>
         </div>
     </div>

http://git-wip-us.apache.org/repos/asf/pdfbox-docs/blob/799b1d78/content/1.8/cookbook/encryption.html
----------------------------------------------------------------------
diff --git a/content/1.8/cookbook/encryption.html b/content/1.8/cookbook/encryption.html
index 1965231..e81b680 100644
--- a/content/1.8/cookbook/encryption.html
+++ b/content/1.8/cookbook/encryption.html
@@ -132,12 +132,13 @@
             <div class="col-xs-12 col-sm-9">
                 <h1 id="encrypting-a-file">Encrypting a file</h1>
 
-<p>PDF encryption requires two passwords: the &quot;user password&quot; to open and view the file with restricted permissions, the &quot;owner password&quot; to access the file with all permission.</p>
+<p>PDF encryption uses two passwords: the “user password” opens and views the file with restricted permissions, while the “owner password” gives access to the file with all permissions.</p>
 
 <h2 id="load-and-save-encrypted">Load and save encrypted</h2>
 
 <p>This small sample shows how to encrypt a file so that it can be viewed, but not printed.</p>
-<div class="highlight"><pre><code class="language-java" data-lang="java"><span class="n">PDDocument</span> <span class="n">doc</span> <span class="o">=</span> <span class="n">PDDocument</span><span class="o">.</span><span class="na">load</span><span class="o">(</span><span class="s">"filename.pdf"</span><span class="o">);</span>
+
+<div class="highlighter-rouge"><pre class="highlight"><code><span class="n">PDDocument</span> <span class="n">doc</span> <span class="o">=</span> <span class="n">PDDocument</span><span class="o">.</span><span class="na">load</span><span class="o">(</span><span class="s">"filename.pdf"</span><span class="o">);</span>
 
 <span class="c1">// Define the length of the encryption key.</span>
 <span class="c1">// Possible values are 40 or 128 (256 will be available in PDFBox 2.0).</span>
@@ -157,7 +158,9 @@
 
 <span class="n">doc</span><span class="o">.</span><span class="na">save</span><span class="o">(</span><span class="s">"filename-encrypted.pdf"</span><span class="o">);</span>
 <span class="n">doc</span><span class="o">.</span><span class="na">close</span><span class="o">();</span>
-</code></pre></div>
+</code></pre>
+</div>
+
             </div>
         </div>
     </div>

http://git-wip-us.apache.org/repos/asf/pdfbox-docs/blob/799b1d78/content/1.8/cookbook/fill-form-field.html
----------------------------------------------------------------------
diff --git a/content/1.8/cookbook/fill-form-field.html b/content/1.8/cookbook/fill-form-field.html
index 807d389..aade0df 100644
--- a/content/1.8/cookbook/fill-form-field.html
+++ b/content/1.8/cookbook/fill-form-field.html
@@ -132,45 +132,61 @@
             <div class="col-xs-12 col-sm-9">
                 <h1 id="fill-a-form-field">Fill a Form Field</h1>
 
-<p>Form fields within a PDF are defined as part of the AcroForm entry within the PDF&#39;s document catalog.
+<p>Form fields within a PDF are defined as part of the AcroForm entry within the PDF’s document catalog.
 From there, individual fields can be accessed. Fields might be organized in a tree structure, so it might
 be necessary to walk through the tree to get an individual field.</p>
 
 <p>Load the PDF document.</p>
-<div class="highlight"><pre><code class="language-" data-lang="">:::java
+
+<div class="highlighter-rouge"><pre class="highlight"><code>// load the document
 PDDocument pdfDocument = PDDocument.loadNonSeq(new File(... ), null);
-</code></pre></div>
+</code></pre>
+</div>
+
+<p>Get the document catalog and the AcroForm that it may contain.</p>
-<div class="highlight"><pre><code class="language-" data-lang="">:::java
+
+<div class="highlighter-rouge"><pre class="highlight"><code>// get the document catalog
 PDDocumentCatalog docCatalog = pdfDocument.getDocumentCatalog();
 PDAcroForm acroForm = docCatalog.getAcroForm();
-</code></pre></div>
+</code></pre>
+</div>
+
 <p>Retrieve an individual field and set its value.</p>
-<div class="highlight"><pre><code class="language-" data-lang="">:::java
+
+<div class="highlighter-rouge"><pre class="highlight"><code>// as there might not be an AcroForm entry, a null check is necessary
 if (acroForm != null)
 {
     PDField field = (PDField) acroForm.getField( "fieldName" );
     field.setValue("new field value");
 }
-</code></pre></div>
+</code></pre>
+</div>
+
 <p>If a field is nested within the form tree a fully qualified name might be provided
 to access the field.</p>
-<div class="highlight"><pre><code class="language-" data-lang="">:::java
+
+<div class="highlighter-rouge"><pre class="highlight"><code>// as there might not be an AcroForm entry, a null check is necessary
 if (acroForm != null)
 {
     PDField field = (PDField) acroForm.getField( "fieldsParentName.fieldName" );
     field.setValue("new field value");
 }
-</code></pre></div>
+</code></pre>
+</div>
+
 <p>Save and close the filled out form.</p>
-<div class="highlight"><pre><code class="language-" data-lang="">:::java
+
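+<div class="highlighter-rouge"><pre class="highlight"><code>// save and close the document loaded above
 // (filledForm is the output file; it is not defined in this excerpt)
 pdfDocument.save(filledForm);
 pdfDocument.close();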
-</code></pre></div>
+</code></pre>
+</div>
+
+
             </div>
         </div>
     </div>

http://git-wip-us.apache.org/repos/asf/pdfbox-docs/blob/799b1d78/content/1.8/cookbook/pdfacreation.html
----------------------------------------------------------------------
diff --git a/content/1.8/cookbook/pdfacreation.html b/content/1.8/cookbook/pdfacreation.html
index db9a06b..a5aecf0 100644
--- a/content/1.8/cookbook/pdfacreation.html
+++ b/content/1.8/cookbook/pdfacreation.html
@@ -130,7 +130,7 @@
                 </ul>
             </div>
             <div class="col-xs-12 col-sm-9">
-                <h1 id="pdf-a-creation">PDF/A Creation</h1>
+                <h1 id="pdfa-creation">PDF/A Creation</h1>
 
 <p>The Apache PDFBox API can be used to create a PDF/A file. PDF/A is a PDF file with some constraints to ensure its
 long-term preservation. These constraints are described in ISO 19005.</p>
@@ -142,27 +142,33 @@ document. The current example creates a valid PDF/A-1b document.</p>
 
 <p>The PDF/A specification requires that the fonts used in the document are embedded in the PDF file, so you
 have to load them. As an example:</p>
-<div class="highlight"><pre><code class="language-java" data-lang="java"><span class="n">InputStream</span> <span class="n">fontStream</span> <span class="o">=</span> <span class="n">CreatePDFA</span><span class="o">.</span><span class="na">class</span><span class="o">.</span><span class="na">getResourceAsStream</span><span class="o">(</span><span class="s">"/org/apache/pdfbox/resources/ttf/ArialMT.ttf"</span><span class="o">);</span>
+
+<div class="highlighter-rouge"><pre class="highlight"><code><span class="n">InputStream</span> <span class="n">fontStream</span> <span class="o">=</span> <span class="n">CreatePDFA</span><span class="o">.</span><span class="na">class</span><span class="o">.</span><span class="na">getResourceAsStream</span><span class="o">(</span><span class="s">"/org/apache/pdfbox/resources/ttf/ArialMT.ttf"</span><span class="o">);</span>
 <span class="n">PDFont</span> <span class="n">font</span> <span class="o">=</span> <span class="n">PDTrueTypeFont</span><span class="o">.</span><span class="na">loadTTF</span><span class="o">(</span><span class="n">doc</span><span class="o">,</span> <span class="n">fontStream</span><span class="o">);</span>
-</code></pre></div>
-<h2 id="including-xmp-metadata-block">Including XMP metadata block</h2>
+</code></pre>
+</div>
+<h2 id="including-xmp-metadata-block">Including XMP metadata block</h2>
 
 <p>XMP metadata must be defined in the PDF. At a minimum, the PDFA schema (which gives details on the version
 of the PDF/A specification the document conforms to) must be present. These lines create the XMP metadata for a
 PDF/A-1b document:</p>
-<div class="highlight"><pre><code class="language-java" data-lang="java"><span class="n">XMPMetadata</span> <span class="n">xmp</span> <span class="o">=</span> <span class="k">new</span> <span class="n">XMPMetadata</span><span class="o">();</span>
+
+<div class="highlighter-rouge"><pre class="highlight"><code><span class="n">XMPMetadata</span> <span class="n">xmp</span> <span class="o">=</span> <span class="k">new</span> <span class="n">XMPMetadata</span><span class="o">();</span>
 <span class="n">XMPSchemaPDFAId</span> <span class="n">pdfaid</span> <span class="o">=</span> <span class="k">new</span> <span class="n">XMPSchemaPDFAId</span><span class="o">(</span><span class="n">xmp</span><span class="o">);</span>
 <span class="n">xmp</span><span class="o">.</span><span class="na">addSchema</span><span class="o">(</span><span class="n">pdfaid</span><span class="o">);</span>
 <span class="n">pdfaid</span><span class="o">.</span><span class="na">setConformance</span><span class="o">(</span><span class="s">"B"</span><span class="o">);</span>
 <span class="n">pdfaid</span><span class="o">.</span><span class="na">setPart</span><span class="o">(</span><span class="mi">1</span><span class="o">);</span>
 <span class="n">pdfaid</span><span class="o">.</span><span class="na">setAbout</span><span class="o">(</span><span class="s">""</span><span class="o">);</span>
 <span class="n">metadata</span><span class="o">.</span><span class="na">importXMPMetadata</span><span class="o">(</span><span class="n">xmp</span><span class="o">);</span>
-</code></pre></div>
+</code></pre>
+</div>
+
 <h2 id="including-color-profile">Including color profile</h2>
 
 <p>It is mandatory to include the color profile used by the document. Different profiles can be used. This
 example uses one that ships with PDFBox:</p>
-<div class="highlight"><pre><code class="language-java" data-lang="java"><span class="c1">// create output intent</span>
+
+<div class="highlighter-rouge"><pre class="highlight"><code><span class="c1">// create output intent</span>
 <span class="n">InputStream</span> <span class="n">colorProfile</span> <span class="o">=</span> <span class="n">CreatePDFA</span><span class="o">.</span><span class="na">class</span><span class="o">.</span><span class="na">getResourceAsStream</span><span class="o">(</span><span class="s">"/org/apache/pdfbox/resources/pdfa/sRGB Color Space Profile.icm"</span><span class="o">);</span>
 <span class="n">PDOutputIntent</span> <span class="n">oi</span> <span class="o">=</span> <span class="k">new</span> <span class="n">PDOutputIntent</span><span class="o">(</span><span class="n">doc</span><span class="o">,</span> <span class="n">colorProfile</span><span class="o">);</span> 
 <span class="n">oi</span><span class="o">.</span><span class="na">setInfo</span><span class="o">(</span><span class="s">"sRGB IEC61966-2.1"</span><span class="o">);</span> 
@@ -170,12 +176,18 @@ example takes one present in pdfbox:</p>
 <span class="n">oi</span><span class="o">.</span><span class="na">setOutputConditionIdentifier</span><span class="o">(</span><span class="s">"sRGB IEC61966-2.1"</span><span class="o">);</span> 
 <span class="n">oi</span><span class="o">.</span><span class="na">setRegistryName</span><span class="o">(</span><span class="s">"http://www.color.org"</span><span class="o">);</span> 
 <span class="n">cat</span><span class="o">.</span><span class="na">addOutputIntent</span><span class="o">(</span><span class="n">oi</span><span class="o">);</span>
-</code></pre></div>
+</code></pre>
+</div>
+
 <h2 id="complete-example">Complete example</h2>
 
 <p>The complete example can be found in pdfbox-example. The source file is</p>
-<div class="highlight"><pre><code class="language-" data-lang="">src/main/java/org/apache/pdfbox/examples/pdfa/CreatePDFA.java
-</code></pre></div>
+
+<div class="highlighter-rouge"><pre class="highlight"><code>src/main/java/org/apache/pdfbox/examples/pdfa/CreatePDFA.java
+</code></pre>
+</div>
+
+
             </div>
         </div>
     </div>

http://git-wip-us.apache.org/repos/asf/pdfbox-docs/blob/799b1d78/content/1.8/cookbook/pdfavalidation.html
----------------------------------------------------------------------
diff --git a/content/1.8/cookbook/pdfavalidation.html b/content/1.8/cookbook/pdfavalidation.html
index 43026e6..d27adbe 100644
--- a/content/1.8/cookbook/pdfavalidation.html
+++ b/content/1.8/cookbook/pdfavalidation.html
@@ -130,13 +130,14 @@
                 </ul>
             </div>
             <div class="col-xs-12 col-sm-9">
-                <h1 id="pdf-a-validation">PDF/A Validation</h1>
+                <h1 id="pdfa-validation">PDF/A Validation</h1>
 
 <p>The Apache Preflight library is a Java tool that implements a parser compliant with the ISO-19005 specification (aka PDF/A-1).
 Check Compliance with PDF/A-1b</p>
 
 <p>This small sample shows how to check the compliance of a file with the PDF/A-1b specification.</p>
-<div class="highlight"><pre><code class="language-java" data-lang="java"><span class="n">ValidationResult</span> <span class="n">result</span> <span class="o">=</span> <span class="kc">null</span><span class="o">;</span>
+
+<div class="highlighter-rouge"><pre class="highlight"><code><span class="n">ValidationResult</span> <span class="n">result</span> <span class="o">=</span> <span class="kc">null</span><span class="o">;</span>
 
 <span class="n">PreflightParser</span> <span class="n">parser</span> <span class="o">=</span> <span class="k">new</span> <span class="n">PreflightParser</span><span class="o">(</span><span class="n">args</span><span class="o">[</span><span class="mi">0</span><span class="o">]);</span>
 <span class="k">try</span>
@@ -183,57 +184,62 @@ Check Compliance with PDF/A-1b</p>
         <span class="n">System</span><span class="o">.</span><span class="na">out</span><span class="o">.</span><span class="na">println</span><span class="o">(</span><span class="n">error</span><span class="o">.</span><span class="na">getErrorCode</span><span class="o">()</span> <span class="o">+</span> <span class="s">" : "</span> <span class="o">+</span> <span class="n">error</span><span class="o">.</span><span class="na">getDetails</span><span class="o">());</span>
     <span class="o">}</span>
 <span class="o">}</span>
-</code></pre></div>
+</code></pre>
+</div>
+
 <h2 id="categories-of-validation-error">Categories of Validation Error</h2>
 
 <p>If a validation fails, the ValidationResult object contains all causes of the failure.
 To help diagnose the failure, all error codes have the form X[.Y[.Z]], where:</p>
 
 <ul>
-<li>&#39;X&#39; is the category (ex : Font validation error...)</li>
-<li>&#39;Y&#39; represent a subsection of the category (ex : &quot;Font with Glyph error&quot;)</li>
-<li>&#39;Z&#39; represent the cause of the error (ex : &quot;Font with a missing Glyph&quot;)</li>
+  <li>‘X’ is the category (e.g. Font validation error)</li>
+  <li>‘Y’ represents a subsection of the category (e.g. “Font with Glyph error”)</li>
+  <li>‘Z’ represents the cause of the error (e.g. “Font with a missing Glyph”)</li>
 </ul>
 
-<p>Category (&#39;Y&#39;) and cause (&#39;Z&#39;) may be missing according to the difficulty to identify the error detail.</p>
+<p>The subsection (‘Y’) and the cause (‘Z’) may be omitted when the error detail is difficult to identify.</p>
 
-<p>Here after, you can find all Categories (for detailed cause, see constants in the <code>PreflightConstants</code> interface) :</p>
+<p>The categories are listed below (for detailed causes, see the constants in the <code class="highlighter-rouge">PreflightConstants</code> interface):</p>
 
-<table><thead>
-<tr>
-<th>Category</th>
-<th>Description</th>
-</tr>
-</thead><tbody>
-<tr>
-<td>1[.y[.z]]</td>
-<td>Syntax Error</td>
-</tr>
-<tr>
-<td>2[.y[.z]]</td>
-<td>Graphic Error</td>
-</tr>
-<tr>
-<td>3[.y[.z]]</td>
-<td>Font Error</td>
-</tr>
-<tr>
-<td>4[.y[.z]]</td>
-<td>Transparency Error</td>
-</tr>
-<tr>
-<td>5[.y[.z]]</td>
-<td>Annotation Error</td>
-</tr>
-<tr>
-<td>6[.y[.z]]</td>
-<td>Action Error</td>
-</tr>
-<tr>
-<td>7[.y[.z]]</td>
-<td>Metadata Error</td>
-</tr>
-</tbody></table>
+<table>
+  <thead>
+    <tr>
+      <th>Category</th>
+      <th>Description</th>
+    </tr>
+  </thead>
+  <tbody>
+    <tr>
+      <td>1[.y[.z]]</td>
+      <td>Syntax Error</td>
+    </tr>
+    <tr>
+      <td>2[.y[.z]]</td>
+      <td>Graphic Error</td>
+    </tr>
+    <tr>
+      <td>3[.y[.z]]</td>
+      <td>Font Error</td>
+    </tr>
+    <tr>
+      <td>4[.y[.z]]</td>
+      <td>Transparency Error</td>
+    </tr>
+    <tr>
+      <td>5[.y[.z]]</td>
+      <td>Annotation Error</td>
+    </tr>
+    <tr>
+      <td>6[.y[.z]]</td>
+      <td>Action Error</td>
+    </tr>
+    <tr>
+      <td>7[.y[.z]]</td>
+      <td>Metadata Error</td>
+    </tr>
+  </tbody>
+</table>
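<p>As a rough illustration of the X[.Y[.Z]] form, the sketch below maps the leading category digit of an error code to the descriptions in the table above. The class PreflightCategory and its describe method are hypothetical and are not part of the Preflight API; the code "3.1.2" is used only to show the shape of a code, not as a reference to a specific Preflight constant.</p>

```java
// Hypothetical helper, not part of Apache Preflight: it only mirrors the
// category table above.
class PreflightCategory {

    /** Returns the category description for an error code of the form X[.Y[.Z]]. */
    static String describe(String errorCode) {
        // Everything before the first dot is the category 'X'.
        String category = errorCode.split("\\.", 2)[0];
        switch (category) {
            case "1": return "Syntax Error";
            case "2": return "Graphic Error";
            case "3": return "Font Error";
            case "4": return "Transparency Error";
            case "5": return "Annotation Error";
            case "6": return "Action Error";
            case "7": return "Metadata Error";
            default:  return "Unknown category";
        }
    }

    public static void main(String[] args) {
        System.out.println(describe("3.1.2"));
        System.out.println(describe("7"));
    }
}
```

<p>A helper along these lines could be used to group the errors reported by a ValidationResult by category before printing them.</p>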
 
             </div>
         </div>

http://git-wip-us.apache.org/repos/asf/pdfbox-docs/blob/799b1d78/content/1.8/cookbook/rendering.html
----------------------------------------------------------------------
diff --git a/content/1.8/cookbook/rendering.html b/content/1.8/cookbook/rendering.html
index e48b261..6c7cbae 100644
--- a/content/1.8/cookbook/rendering.html
+++ b/content/1.8/cookbook/rendering.html
@@ -135,7 +135,8 @@
 <h2 id="convert-a-document-to-images">Convert a document to images</h2>
 
 <p>This small sample shows how to render (convert to images) a PDF document using PDFBox.</p>
-<div class="highlight"><pre><code class="language-" data-lang="">:::java
+
+<div class="highlighter-rouge"><pre class="highlight"><code>    String filename = "YOURFILENAMEHERE.pdf";
 
     // open the document
@@ -147,7 +148,7 @@
     {
         // RGB image with 300 dpi
         BufferedImage bim = pages.get(p).convertToImage(BufferedImage.TYPE_INT_RGB, 300);
-
+        
         // save as PNG with default metadata
         b = ImageIO.write(bim, "png", new File("rgbpage" + (p+1) + ".png"));
         if (!b)
@@ -157,7 +158,7 @@
 
         // B/W image with 300 dpi
         bim = pages.get(p).convertToImage(BufferedImage.TYPE_BYTE_BINARY, 300);
-
+        
         // save as TIF with dpi in the metadata
         // PDFBox will choose the best compression for you - here: CCITT G4
         // you need to add jai_imageio.jar to your classpath for this to work
@@ -169,7 +170,9 @@
     }
 
     doc.close();
-</code></pre></div>
+</code></pre>
+</div>
+
             </div>
         </div>
     </div>

http://git-wip-us.apache.org/repos/asf/pdfbox-docs/blob/799b1d78/content/1.8/cookbook/textextraction.html
----------------------------------------------------------------------
diff --git a/content/1.8/cookbook/textextraction.html b/content/1.8/cookbook/textextraction.html
index 6446167..c2ab7e5 100644
--- a/content/1.8/cookbook/textextraction.html
+++ b/content/1.8/cookbook/textextraction.html
@@ -134,9 +134,9 @@
 
 <h2 id="extracting-text">Extracting Text</h2>
 
-<p>See class:org.apache.pdfbox.util.PDFTextStripper<br>
-See class:org.apache.pdfbox.searchengine.lucene.LucenePDFDocument<br>
-See command line app:ExtractText  </p>
+<p>See class:org.apache.pdfbox.util.PDFTextStripper<br />
+See class:org.apache.pdfbox.searchengine.lucene.LucenePDFDocument<br />
+See command line app:ExtractText</p>
 
 <p>One of the main features of PDFBox is its ability to quickly and accurately extract text 
 from a variety of PDF documents. This functionality is encapsulated in the 
@@ -148,8 +148,11 @@ org.apache.pdfbox.ExtractText.</p>
 <p>Lucene is an open source text search library from the Apache Jakarta Project. In order for
 Lucene to be able to index a PDF document it must first be converted to text. PDFBox provides 
 a simple approach for adding PDF documents into a Lucene index.</p>
-<div class="highlight"><pre><code class="language-java" data-lang="java"><span class="n">Document</span> <span class="n">luceneDocument</span> <span class="o">=</span> <span class="n">LucenePDFDocument</span><span class="o">.</span><span class="na">getDocument</span><span class="o">(</span> <span class="o">...</span> <span class="o">);</span>
-</code></pre></div>
+
+<div class="highlighter-rouge"><pre class="highlight"><code><span class="n">Document</span> <span class="n">luceneDocument</span> <span class="o">=</span> <span class="n">LucenePDFDocument</span><span class="o">.</span><span class="na">getDocument</span><span class="o">(</span> <span class="o">...</span> <span class="o">);</span>
+</code></pre>
+</div>
+
 <p>Now that you have a Lucene Document object, you can add it to the Lucene index just like 
 you would if it had been created from a text or HTML file. The LucenePDFDocument automatically 
 extracts a variety of metadata fields from the PDF to be added to the index, the javadoc 
@@ -170,11 +173,14 @@ these requirements.</p>
 process. The simplest is to specify the range of pages that you want to be extracted. 
 For example, to only extract text from the second and third pages of the PDF document 
 you could do this:</p>
-<div class="highlight"><pre><code class="language-java" data-lang="java"><span class="n">PDFTextStripper</span> <span class="n">stripper</span> <span class="o">=</span> <span class="k">new</span> <span class="n">PDFTextStripper</span><span class="o">();</span>
+
+<div class="highlighter-rouge"><pre class="highlight"><code><span class="n">PDFTextStripper</span> <span class="n">stripper</span> <span class="o">=</span> <span class="k">new</span> <span class="n">PDFTextStripper</span><span class="o">();</span>
 <span class="n">stripper</span><span class="o">.</span><span class="na">setStartPage</span><span class="o">(</span> <span class="mi">2</span> <span class="o">);</span>
 <span class="n">stripper</span><span class="o">.</span><span class="na">setEndPage</span><span class="o">(</span> <span class="mi">3</span> <span class="o">);</span>
 <span class="n">stripper</span><span class="o">.</span><span class="na">writeText</span><span class="o">(</span> <span class="o">...</span> <span class="o">);</span>
-</code></pre></div>
+</code></pre>
+</div>
+
 <p>NOTE: The startPage and endPage properties of PDFTextStripper are 1 based and inclusive.</p>
 
 <p>If you wanted to start on page 2 and extract to the end of the document then you would just
@@ -184,62 +190,65 @@ set the startPage property. By default all pages in the pdf document are extract
 If you are not familiar with how to use bookmarks in PDFBox then you should review the 
 Bookmarks page. Similar to the startPage/endPage properties, PDFTextStripper also has 
 startBookmark/endBookmark properties. There are some caveats to be aware of when using this
-feature of the PDFTextStripper. Not all bookmarks point to a page in the current PDF document. </p>
+feature of the PDFTextStripper. Not all bookmarks point to a page in the current PDF document.</p>
 
 <p>The possible states of a bookmark are:</p>
 
 <ul>
-<li>null - The property was not set, this is the default.</li>
-<li>Points to page in the PDF - The property was set and points to a valid page in the PDF</li>
-<li>Bookmark does not point to anything - The property was set but the bookmark does not point to any page</li>
-<li>Bookmark points to external action - The property was set, but it points to a page in a different PDF or performs an action when activated</li>
+  <li>null - The property was not set, this is the default.</li>
+  <li>Points to page in the PDF - The property was set and points to a valid page in the PDF</li>
+  <li>Bookmark does not point to anything - The property was set but the bookmark does not point to any page</li>
+  <li>Bookmark points to external action - The property was set, but it points to a page in a different PDF or performs an action when activated</li>
 </ul>
 
 <p>The table below will describe how PDFBox behaves in the various scenarios:</p>
 
-<table><thead>
-<tr>
-<th>Start Bookmark</th>
-<th>End Bookmark</th>
-<th>Result</th>
-</tr>
-</thead><tbody>
-<tr>
-<td>null</td>
-<td>null</td>
-<td>This is the default, the properties have no effect on the text extraction.</td>
-</tr>
-<tr>
-<td>Points to a page in the PDF</td>
-<td>null</td>
-<td>Text extraction will begin on the page that this bookmark points to and go until the end of the document.</td>
-</tr>
-<tr>
-<td>null</td>
-<td>Points to a page in the PDF</td>
-<td>Text extraction will begin on the first page and stop at the end of the page that this bookmark points to.</td>
-</tr>
-<tr>
-<td>Bookmark does not point to anything</td>
-<td>null</td>
-<td>Because the PDFTextStripper cannot determine a start page based on the bookmark, it will start on the first page and go until the end of the document.</td>
-</tr>
-<tr>
-<td>null</td>
-<td>Bookmark does not point to anything</td>
-<td>Because the PDFTextStripper cannot determine a end page based on the bookmark, it will start on the first page and go until the end of the document.</td>
-</tr>
-<tr>
-<td>Bookmark does not point to anything</td>
-<td>Bookmark does not point to anything</td>
-<td>This is a special case! If the startBookmark and endBookmark are exactly the same then no text will be extracted. If they are different then it is not possible for the PDFTextStripper to determine that pages so it will include the entire document.</td>
-</tr>
-<tr>
-<td>Bookmark points to external action</td>
-<td>Bookmark points to external action</td>
-<td>If either the startBookmark or the endBookmark refer to an external page or execute an action then an OutlineNotLocalException will be thrown to indicate to the user that the bookmark is not valid.</td>
-</tr>
-</tbody></table>
+<table>
+  <thead>
+    <tr>
+      <th>Start Bookmark</th>
+      <th>End Bookmark</th>
+      <th>Result</th>
+    </tr>
+  </thead>
+  <tbody>
+    <tr>
+      <td>null</td>
+      <td>null</td>
+      <td>This is the default, the properties have no effect on the text extraction.</td>
+    </tr>
+    <tr>
+      <td>Points to a page in the PDF</td>
+      <td>null</td>
+      <td>Text extraction will begin on the page that this bookmark points to and go until the end of the document.</td>
+    </tr>
+    <tr>
+      <td>null</td>
+      <td>Points to a page in the PDF</td>
+      <td>Text extraction will begin on the first page and stop at the end of the page that this bookmark points to.</td>
+    </tr>
+    <tr>
+      <td>Bookmark does not point to anything</td>
+      <td>null</td>
+      <td>Because the PDFTextStripper cannot determine a start page based on the bookmark, it will start on the first page and go until the end of the document.</td>
+    </tr>
+    <tr>
+      <td>null</td>
+      <td>Bookmark does not point to anything</td>
+      <td>Because the PDFTextStripper cannot determine a end page based on the bookmark, it will start on the first page and go until the end of the document.</td>
+    </tr>
+    <tr>
+      <td>Bookmark does not point to anything</td>
+      <td>Bookmark does not point to anything</td>
+      <td>This is a special case! If the startBookmark and endBookmark are exactly the same then no text will be extracted. If they are different then it is not possible for the PDFTextStripper to determine that pages so it will include the entire document.</td>
+    </tr>
+    <tr>
+      <td>Bookmark points to external action</td>
+      <td>Bookmark points to external action</td>
+      <td>If either the startBookmark or the endBookmark refer to an external page or execute an action then an OutlineNotLocalException will be thrown to indicate to the user that the bookmark is not valid.</td>
+    </tr>
+  </tbody>
+</table>
 
 <p>NOTE: PDFTextStripper will check both the startPage/endPage and the startBookmark/endBookmark to determine if text should be extracted from the current page.</p>
 
@@ -247,7 +256,7 @@ feature of the PDFTextStripper. Not all bookmarks point to a page in the current
 
 <p>Some PDF files need to map between glyph names and Unicode values during text extraction. 
 PDFBox comes with an Adobe Glyph List, but you may encounter files with glyph names that 
-are not in that map. To use your own glyphlist file, supply the file name to the <code>glyphlist_ext</code> JVM property.</p>
+are not in that map. To use your own glyphlist file, supply the file name to the <code class="highlighter-rouge">glyphlist_ext</code> JVM property.</p>
 
 <h3 id="right-to-left-text">Right to Left Text</h3>
 

http://git-wip-us.apache.org/repos/asf/pdfbox-docs/blob/799b1d78/content/1.8/cookbook/workingwithattachments.html
----------------------------------------------------------------------
diff --git a/content/1.8/cookbook/workingwithattachments.html b/content/1.8/cookbook/workingwithattachments.html
index 6d6f3fd..82982d4 100644
--- a/content/1.8/cookbook/workingwithattachments.html
+++ b/content/1.8/cookbook/workingwithattachments.html
@@ -134,28 +134,29 @@
 
 <h2 id="the-pdf-file-specification">The PDF File Specification</h2>
 
-<p>See package:org.apache.pdfbox.pdmodel.common.filespecification<br>
-See example:EmbeddedFiles  </p>
+<p>See package:org.apache.pdfbox.pdmodel.common.filespecification<br />
+See example:EmbeddedFiles</p>
 
 <p>A PDF can contain references to external files via the file system or a URL to a remote 
 location. It is also possible to embed a binary file into a PDF document.</p>
 
-<p>There are two classes that can be used when referencing a file. <code>PDSimpleFileSpecification</code>
-is a simple string reference to a file(e.g. &quot;./movies/BigMovie.avi&quot;). The simple file 
-specification does not allow for any parameters to be set. </p>
+<p>There are two classes that can be used when referencing a file. <code class="highlighter-rouge">PDSimpleFileSpecification</code>
+is a simple string reference to a file(e.g. “./movies/BigMovie.avi”). The simple file 
+specification does not allow for any parameters to be set.</p>
 
-<p>The <code>PDComplexFileSpecification</code> is more feature rich and allows for advanced settings on 
+<p>The <code class="highlighter-rouge">PDComplexFileSpecification</code> is more feature rich and allows for advanced settings on 
 the file reference.</p>
 
 <p>It is also possible to embed a file directly into a PDF. Instead of setting the file 
-attribute of the <code>PDComplexFileSpecification</code>, the <code>EmbeddedFile</code> attribute can be used instead.</p>
+attribute of the <code class="highlighter-rouge">PDComplexFileSpecification</code>, the <code class="highlighter-rouge">EmbeddedFile</code> attribute can be used instead.</p>
 
 <h2 id="adding-a-file-attachment">Adding a File Attachment</h2>
 
 <p>PDF documents can contain file attachments that are accessed from the Document-&gt;File Attachments 
 menu. PDFBox allows attachments to be added to and extracted from PDF documents. 
 Attachments are part of the named tree that is attached to the document catalog.</p>
-<div class="highlight"><pre><code class="language-java" data-lang="java"><span class="n">PDEmbeddedFilesNameTreeNode</span> <span class="n">efTree</span> <span class="o">=</span> <span class="k">new</span> <span class="n">PDEmbeddedFilesNameTreeNode</span><span class="o">();</span>
+
+<div class="highlighter-rouge"><pre class="highlight"><code><span class="n">PDEmbeddedFilesNameTreeNode</span> <span class="n">efTree</span> <span class="o">=</span> <span class="k">new</span> <span class="n">PDEmbeddedFilesNameTreeNode</span><span class="o">();</span>
 
 <span class="c1">//first create the file specification, which holds the embedded file</span>
 <span class="n">PDComplexFileSpecification</span> <span class="n">fs</span> <span class="o">=</span> <span class="k">new</span> <span class="n">PDComplexFileSpecification</span><span class="o">();</span>
@@ -176,7 +177,9 @@ Attachments are part of the named tree that is attached to the document catalog.
 <span class="n">PDDocumentNameDictionary</span> <span class="n">names</span> <span class="o">=</span> <span class="k">new</span> <span class="n">PDDocumentNameDictionary</span><span class="o">(</span> <span class="n">doc</span><span class="o">.</span><span class="na">getDocumentCatalog</span><span class="o">()</span> <span class="o">);</span>
 <span class="n">names</span><span class="o">.</span><span class="na">setEmbeddedFiles</span><span class="o">(</span> <span class="n">efTree</span> <span class="o">);</span>
 <span class="n">doc</span><span class="o">.</span><span class="na">getDocumentCatalog</span><span class="o">().</span><span class="na">setNames</span><span class="o">(</span> <span class="n">names</span> <span class="o">);</span>
-</code></pre></div>
+</code></pre>
+</div>
+
             </div>
         </div>
     </div>

http://git-wip-us.apache.org/repos/asf/pdfbox-docs/blob/799b1d78/content/1.8/cookbook/workingwithfonts.html
----------------------------------------------------------------------
diff --git a/content/1.8/cookbook/workingwithfonts.html b/content/1.8/cookbook/workingwithfonts.html
index 5508494..eb55790 100644
--- a/content/1.8/cookbook/workingwithfonts.html
+++ b/content/1.8/cookbook/workingwithfonts.html
@@ -136,74 +136,78 @@
 
 <p>The PDF specification states that a standard set of 14 fonts will always be available when consuming PDF documents. In PDFBox these are defined as constants in the PDType1Font class.</p>
 
-<table><thead>
-<tr>
-<th>Standard Font</th>
-<th>Description</th>
-</tr>
-</thead><tbody>
-<tr>
-<td>PDType1Font.TIMES_ROMAN</td>
-<td>Times regular</td>
-</tr>
-<tr>
-<td>PDType1Font.TIMES_BOLD</td>
-<td>Times bold</td>
-</tr>
-<tr>
-<td>PDType1Font.TIMES_ITALIC</td>
-<td>Times italic</td>
-</tr>
-<tr>
-<td>PDType1Font.TIMES_BOLD_ITALIC</td>
-<td>Times bold italic</td>
-</tr>
-<tr>
-<td>PDType1Font.HELVETICA</td>
-<td>Helvetica regular</td>
-</tr>
-<tr>
-<td>PDType1Font.HELVETICA_BOLD</td>
-<td>Helvetica bold</td>
-</tr>
-<tr>
-<td>PDType1Font.HELVETICA_OBLIQUE</td>
-<td>Helvetica italic</td>
-</tr>
-<tr>
-<td>PDType1Font.HELVETICA_BOLD_OBLIQUE</td>
-<td>Helvetica bold italic</td>
-</tr>
-<tr>
-<td>PDType1Font.COURIER</td>
-<td>Courier</td>
-</tr>
-<tr>
-<td>PDType1Font.COURIER_BOLD</td>
-<td>Courier bold</td>
-</tr>
-<tr>
-<td>PDType1Font.COURIER_OBLIQUE</td>
-<td>Courier italic</td>
-</tr>
-<tr>
-<td>PDType1Font.COURIER_BOLD_OBLIQUE</td>
-<td>Courier bold italic</td>
-</tr>
-<tr>
-<td>PDType1Font.SYMBOL</td>
-<td>Symbol Set</td>
-</tr>
-<tr>
-<td>PDType1Font.ZAPF_DINGBATS</td>
-<td>Dingbat Typeface</td>
-</tr>
-</tbody></table>
+<table>
+  <thead>
+    <tr>
+      <th>Standard Font</th>
+      <th>Description</th>
+    </tr>
+  </thead>
+  <tbody>
+    <tr>
+      <td>PDType1Font.TIMES_ROMAN</td>
+      <td>Times regular</td>
+    </tr>
+    <tr>
+      <td>PDType1Font.TIMES_BOLD</td>
+      <td>Times bold</td>
+    </tr>
+    <tr>
+      <td>PDType1Font.TIMES_ITALIC</td>
+      <td>Times italic</td>
+    </tr>
+    <tr>
+      <td>PDType1Font.TIMES_BOLD_ITALIC</td>
+      <td>Times bold italic</td>
+    </tr>
+    <tr>
+      <td>PDType1Font.HELVETICA</td>
+      <td>Helvetica regular</td>
+    </tr>
+    <tr>
+      <td>PDType1Font.HELVETICA_BOLD</td>
+      <td>Helvetica bold</td>
+    </tr>
+    <tr>
+      <td>PDType1Font.HELVETICA_OBLIQUE</td>
+      <td>Helvetica italic</td>
+    </tr>
+    <tr>
+      <td>PDType1Font.HELVETICA_BOLD_OBLIQUE</td>
+      <td>Helvetica bold italic</td>
+    </tr>
+    <tr>
+      <td>PDType1Font.COURIER</td>
+      <td>Courier</td>
+    </tr>
+    <tr>
+      <td>PDType1Font.COURIER_BOLD</td>
+      <td>Courier bold</td>
+    </tr>
+    <tr>
+      <td>PDType1Font.COURIER_OBLIQUE</td>
+      <td>Courier italic</td>
+    </tr>
+    <tr>
+      <td>PDType1Font.COURIER_BOLD_OBLIQUE</td>
+      <td>Courier bold italic</td>
+    </tr>
+    <tr>
+      <td>PDType1Font.SYMBOL</td>
+      <td>Symbol Set</td>
+    </tr>
+    <tr>
+      <td>PDType1Font.ZAPF_DINGBATS</td>
+      <td>Dingbat Typeface</td>
+    </tr>
+  </tbody>
+</table>
 
 <h2 id="hello-world-using-a-pdf-base-font">Hello World using a PDF base font</h2>
 
-<p>This small sample shows how to create a new document and print the text &quot;Hello World&quot; using one of the PDF base fonts.</p>
-<div class="highlight"><pre><code class="language-java" data-lang="java"><span class="c1">// Create a document and add a page to it</span>
+<p>This small sample shows how to create a new document and print the text “Hello World” using one of the PDF base fonts.</p>
+
+<div class="highlighter-rouge"><pre class="highlight"><code><span class="c1">// Create a document and add a page to it</span>
 <span class="n">PDDocument</span> <span class="n">document</span> <span class="o">=</span> <span class="k">new</span> <span class="n">PDDocument</span><span class="o">();</span>
 <span class="n">PDPage</span> <span class="n">page</span> <span class="o">=</span> <span class="k">new</span> <span class="n">PDPage</span><span class="o">();</span>
 <span class="n">document</span><span class="o">.</span><span class="na">addPage</span><span class="o">(</span> <span class="n">page</span> <span class="o">);</span>
@@ -227,11 +231,14 @@
 <span class="c1">// Save the results and ensure that the document is properly closed:</span>
 <span class="n">document</span><span class="o">.</span><span class="na">save</span><span class="o">(</span> <span class="s">"Hello World.pdf"</span><span class="o">);</span>
 <span class="n">document</span><span class="o">.</span><span class="na">close</span><span class="o">();</span>
-</code></pre></div>
+</code></pre>
+</div>
+
 <h2 id="hello-world-using-a-truetype-font">Hello World using a TrueType font</h2>
 
-<p>This small sample shows how to create a new document and print the text &quot;Hello World&quot; using a TrueType font.</p>
-<div class="highlight"><pre><code class="language-java" data-lang="java"><span class="c1">// Create a document and add a page to it</span>
+<p>This small sample shows how to create a new document and print the text “Hello World” using a TrueType font.</p>
+
+<div class="highlighter-rouge"><pre class="highlight"><code><span class="c1">// Create a document and add a page to it</span>
 <span class="n">PDDocument</span> <span class="n">document</span> <span class="o">=</span> <span class="k">new</span> <span class="n">PDDocument</span><span class="o">();</span>
 <span class="n">PDPage</span> <span class="n">page</span> <span class="o">=</span> <span class="k">new</span> <span class="n">PDPage</span><span class="o">();</span>
 <span class="n">document</span><span class="o">.</span><span class="na">addPage</span><span class="o">(</span> <span class="n">page</span> <span class="o">);</span>
@@ -255,19 +262,22 @@
 <span class="c1">// Save the results and ensure that the document is properly closed:</span>
 <span class="n">document</span><span class="o">.</span><span class="na">save</span><span class="o">(</span> <span class="s">"Hello World.pdf"</span><span class="o">);</span>
 <span class="n">document</span><span class="o">.</span><span class="na">close</span><span class="o">();</span>
-</code></pre></div>
+</code></pre>
+</div>
+
 <p>While it is recommended to embed all fonts for greatest portability not all PDF producer 
 applications will do this. When displaying a PDF it is necessary to find an external font to use. 
 PDFBox will look for a mapping file to use when substituting fonts.</p>
 
 <p>PDFBox will load Resources/PDFBox_External_Fonts.properties off of the classpath to map font
 names to TTF font files. The UNKNOWN_FONT property in that file will tell PDFBox which font to 
-use when no mapping exists. </p>
+use when no mapping exists.</p>
 
 <h2 id="hello-world-using-a-postscript-type1-font">Hello World using a Postscript Type1 font</h2>
 
-<p>This small sample shows how to create a new document and print the text &quot;Hello World&quot; using a Postscript Type1 font.</p>
-<div class="highlight"><pre><code class="language-java" data-lang="java"><span class="c1">// Create a document and add a page to it</span>
+<p>This small sample shows how to create a new document and print the text “Hello World” using a Postscript Type1 font.</p>
+
+<div class="highlighter-rouge"><pre class="highlight"><code><span class="c1">// Create a document and add a page to it</span>
 <span class="n">PDDocument</span> <span class="n">document</span> <span class="o">=</span> <span class="k">new</span> <span class="n">PDDocument</span><span class="o">();</span>
 <span class="n">PDPage</span> <span class="n">page</span> <span class="o">=</span> <span class="k">new</span> <span class="n">PDPage</span><span class="o">();</span>
 <span class="n">document</span><span class="o">.</span><span class="na">addPage</span><span class="o">(</span> <span class="n">page</span> <span class="o">);</span>
@@ -291,7 +301,9 @@ use when no mapping exists. </p>
 <span class="c1">// Save the results and ensure that the document is properly closed:</span>
 <span class="n">document</span><span class="o">.</span><span class="na">save</span><span class="o">(</span> <span class="s">"Hello World.pdf"</span><span class="o">);</span>
 <span class="n">document</span><span class="o">.</span><span class="na">close</span><span class="o">();</span>
-</code></pre></div>
+</code></pre>
+</div>
+
             </div>
         </div>
     </div>