You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@tika.apache.org by ta...@apache.org on 2021/12/23 20:02:42 UTC

svn commit: r1896328 [28/28] - in /tika/site: ./ publish/ publish/0.10/ publish/0.5/ publish/0.6/ publish/0.7/ publish/0.8/ publish/0.9/ publish/1.0/ publish/1.1/ publish/1.10/ publish/1.11/ publish/1.12/ publish/1.13/ publish/1.14/ publish/1.15/ publi...

Modified: tika/site/publish/2.2.0/examples.html
URL: http://svn.apache.org/viewvc/tika/site/publish/2.2.0/examples.html?rev=1896328&r1=1896327&r2=1896328&view=diff
==============================================================================
--- tika/site/publish/2.2.0/examples.html (original)
+++ tika/site/publish/2.2.0/examples.html Thu Dec 23 20:02:39 2021
@@ -116,23 +116,23 @@
 <p>The <a href="./api/org/apache/tika/Tika.html">Tika facade</a>, provides a number of very quick and easy ways to have your content parsed by Tika, and return the resulting plain text</p><style type="text/css">
    @import url('attached-includes/css/shCoreDefault.css');
 </style>
-<div id="highlighter_582480" class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div class="container"><div class="line number54 index0 alt1"><code class="java keyword">public</code> <code class="java plain">String parseToStringExample() </code><code class="java keyword">throws</code> <code class="java plain">IOException, SAXException, TikaException {</code></div><div class="line number55 index1 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Tika tika = </code><code class="java keyword">new</code> <code class="java plain">Tika();</code></div><div class="line number56 index2 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> <code class="java plain">(InputStream stream = ParsingExample.</code><code class="java keyword">class</code><code class="java plain">.getResourceAsStream(</code><code class="java string">"test.doc"</code><c
 ode class="java plain">)) {</code></div><div class="line number57 index3 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">return</code> <code class="java plain">tika.parseToString(stream);</code></div><div class="line number58 index4 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">}</code></div><div class="line number59 index5 alt2"><code class="java plain">}</code></div></div></td></tr></tbody></table></div></div>
+<div id="highlighter_920462" class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div class="container"><div class="line number54 index0 alt1"><code class="java keyword">public</code> <code class="java plain">String parseToStringExample() </code><code class="java keyword">throws</code> <code class="java plain">IOException, SAXException, TikaException {</code></div><div class="line number55 index1 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Tika tika = </code><code class="java keyword">new</code> <code class="java plain">Tika();</code></div><div class="line number56 index2 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> <code class="java plain">(InputStream stream = ParsingExample.</code><code class="java keyword">class</code><code class="java plain">.getResourceAsStream(</code><code class="java string">"test.doc"</code><c
 ode class="java plain">)) {</code></div><div class="line number57 index3 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">return</code> <code class="java plain">tika.parseToString(stream);</code></div><div class="line number58 index4 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">}</code></div><div class="line number59 index5 alt2"><code class="java plain">}</code></div></div></td></tr></tbody></table></div></div>
 <div class="section">
 <h4><a name="Parsing_using_the_Auto-Detect_Parser">Parsing using the Auto-Detect Parser</a></h4>
-<p>For more control, you can call the <a href="./api/org/apache/tika/parser/Parser.html">Tika Parsers</a> directly. Most likely, you'll want to start out using the <a href="./api/org/apache/tika/parser/AutoDetectParser.html">Auto-Detect Parser</a>, which automatically figures out what kind of content you have, then calls the appropriate parser for you.</p><div id="highlighter_230185" class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div class="container"><div class="line number85 index0 alt2"><code class="java keyword">public</code> <code class="java plain">String parseExample() </code><code class="java keyword">throws</code> <code class="java plain">IOException, SAXException, TikaException {</code></div><div class="line number86 index1 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">AutoDetectParser parser = </code><code class="java keyword">new</code> <code class="java pla
 in">AutoDetectParser();</code></div><div class="line number87 index2 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">BodyContentHandler handler = </code><code class="java keyword">new</code> <code class="java plain">BodyContentHandler();</code></div><div class="line number88 index3 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Metadata metadata = </code><code class="java keyword">new</code> <code class="java plain">Metadata();</code></div><div class="line number89 index4 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> <code class="java plain">(InputStream stream = ParsingExample.</code><code class="java keyword">class</code><code class="java plain">.getResourceAsStream(</code><code class="java string">"test.doc"</code><code class="java plain">)) {</code></div><div class="line number90 index5 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nb
 sp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">parser.parse(stream, handler, metadata);</code></div><div class="line number91 index6 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">return</code> <code class="java plain">handler.toString();</code></div><div class="line number92 index7 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">}</code></div><div class="line number93 index8 alt2"><code class="java plain">}</code></div></div></td></tr></tbody></table></div></div></div>
+<p>For more control, you can call the <a href="./api/org/apache/tika/parser/Parser.html">Tika Parsers</a> directly. Most likely, you'll want to start out using the <a href="./api/org/apache/tika/parser/AutoDetectParser.html">Auto-Detect Parser</a>, which automatically figures out what kind of content you have, then calls the appropriate parser for you.</p><div id="highlighter_265622" class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div class="container"><div class="line number85 index0 alt2"><code class="java keyword">public</code> <code class="java plain">String parseExample() </code><code class="java keyword">throws</code> <code class="java plain">IOException, SAXException, TikaException {</code></div><div class="line number86 index1 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">AutoDetectParser parser = </code><code class="java keyword">new</code> <code class="java pla
 in">AutoDetectParser();</code></div><div class="line number87 index2 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">BodyContentHandler handler = </code><code class="java keyword">new</code> <code class="java plain">BodyContentHandler();</code></div><div class="line number88 index3 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Metadata metadata = </code><code class="java keyword">new</code> <code class="java plain">Metadata();</code></div><div class="line number89 index4 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> <code class="java plain">(InputStream stream = ParsingExample.</code><code class="java keyword">class</code><code class="java plain">.getResourceAsStream(</code><code class="java string">"test.doc"</code><code class="java plain">)) {</code></div><div class="line number90 index5 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nb
 sp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">parser.parse(stream, handler, metadata);</code></div><div class="line number91 index6 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">return</code> <code class="java plain">handler.toString();</code></div><div class="line number92 index7 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">}</code></div><div class="line number93 index8 alt2"><code class="java plain">}</code></div></div></td></tr></tbody></table></div></div></div>
 <div class="section">
 <h3><a name="Picking_different_output_formats">Picking different output formats</a></h3>
 <p>With Tika, you can get the textual content of your files returned in a number of different formats. These can be plain text, html, xhtml, xhtml of one part of the file etc. This is controlled based on the <a class="externalLink" href="http://docs.oracle.com/javase/7/docs/api/org/xml/sax/ContentHandler.html">ContentHandler</a> you supply to the Parser.</p>
 <div class="section">
 <h4><a name="Parsing_to_Plain_Text">Parsing to Plain Text</a></h4>
-<p>By using the <a href="./api/org/apache/tika/sax/BodyContentHandler.html">BodyContentHandler</a>, you can request that Tika return only the content of the document's body as a plain-text string.</p><div id="highlighter_376158" class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div class="container"><div class="line number47 index0 alt2"><code class="java keyword">public</code> <code class="java plain">String parseToPlainText() </code><code class="java keyword">throws</code> <code class="java plain">IOException, SAXException, TikaException {</code></div><div class="line number48 index1 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">BodyContentHandler handler = </code><code class="java keyword">new</code> <code class="java plain">BodyContentHandler();</code></div><div class="line number49 index2 alt2">&nbsp;</div><div class="line number50 index3 alt1"><code class="java space
 s">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">AutoDetectParser parser = </code><code class="java keyword">new</code> <code class="java plain">AutoDetectParser();</code></div><div class="line number51 index4 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Metadata metadata = </code><code class="java keyword">new</code> <code class="java plain">Metadata();</code></div><div class="line number52 index5 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> <code class="java plain">(InputStream stream = ContentHandlerExample.</code><code class="java keyword">class</code><code class="java plain">.getResourceAsStream(</code><code class="java string">"test.doc"</code><code class="java plain">)) {</code></div><div class="line number53 index6 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">parser.parse(stream, handler, metadata);</c
 ode></div><div class="line number54 index7 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">return</code> <code class="java plain">handler.toString();</code></div><div class="line number55 index8 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">}</code></div><div class="line number56 index9 alt1"><code class="java plain">}</code></div></div></td></tr></tbody></table></div></div>
+<p>By using the <a href="./api/org/apache/tika/sax/BodyContentHandler.html">BodyContentHandler</a>, you can request that Tika return only the content of the document's body as a plain-text string.</p><div id="highlighter_228932" class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div class="container"><div class="line number47 index0 alt2"><code class="java keyword">public</code> <code class="java plain">String parseToPlainText() </code><code class="java keyword">throws</code> <code class="java plain">IOException, SAXException, TikaException {</code></div><div class="line number48 index1 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">BodyContentHandler handler = </code><code class="java keyword">new</code> <code class="java plain">BodyContentHandler();</code></div><div class="line number49 index2 alt2">&nbsp;</div><div class="line number50 index3 alt1"><code class="java space
 s">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">AutoDetectParser parser = </code><code class="java keyword">new</code> <code class="java plain">AutoDetectParser();</code></div><div class="line number51 index4 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Metadata metadata = </code><code class="java keyword">new</code> <code class="java plain">Metadata();</code></div><div class="line number52 index5 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> <code class="java plain">(InputStream stream = ContentHandlerExample.</code><code class="java keyword">class</code><code class="java plain">.getResourceAsStream(</code><code class="java string">"test.doc"</code><code class="java plain">)) {</code></div><div class="line number53 index6 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">parser.parse(stream, handler, metadata);</c
 ode></div><div class="line number54 index7 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">return</code> <code class="java plain">handler.toString();</code></div><div class="line number55 index8 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">}</code></div><div class="line number56 index9 alt1"><code class="java plain">}</code></div></div></td></tr></tbody></table></div></div>
 <div class="section">
 <h4><a name="Parsing_to_XHTML">Parsing to XHTML</a></h4>
-<p>By using the <a href="./api/org/apache/tika/sax/ToXMLContentHandler.html">ToXMLContentHandler</a>, you can get the XHTML content of the whole document as a string.</p><div id="highlighter_842198" class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div class="container"><div class="line number61 index0 alt2"><code class="java keyword">public</code> <code class="java plain">String parseToHTML() </code><code class="java keyword">throws</code> <code class="java plain">IOException, SAXException, TikaException {</code></div><div class="line number62 index1 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">ContentHandler handler = </code><code class="java keyword">new</code> <code class="java plain">ToXMLContentHandler();</code></div><div class="line number63 index2 alt2">&nbsp;</div><div class="line number64 index3 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><cod
 e class="java plain">AutoDetectParser parser = </code><code class="java keyword">new</code> <code class="java plain">AutoDetectParser();</code></div><div class="line number65 index4 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Metadata metadata = </code><code class="java keyword">new</code> <code class="java plain">Metadata();</code></div><div class="line number66 index5 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> <code class="java plain">(InputStream stream = ContentHandlerExample.</code><code class="java keyword">class</code><code class="java plain">.getResourceAsStream(</code><code class="java string">"test.doc"</code><code class="java plain">)) {</code></div><div class="line number67 index6 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">parser.parse(stream, handler, metadata);</code></div><div class="line number68 in
 dex7 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">return</code> <code class="java plain">handler.toString();</code></div><div class="line number69 index8 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">}</code></div><div class="line number70 index9 alt1"><code class="java plain">}</code></div></div></td></tr></tbody></table></div>
-<p>If you just want the body of the xhtml document, without the header, you can chain together a <a href="./api/org/apache/tika/sax/BodyContentHandler.html">BodyContentHandler</a> and a <a href="./api/org/apache/tika/sax/ToXMLContentHandler.html">ToXMLContentHandler</a> as shown:</p><div id="highlighter_108565" class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div class="container"><div class="line number76 index0 alt1"><code class="java keyword">public</code> <code class="java plain">String parseBodyToHTML() </code><code class="java keyword">throws</code> <code class="java plain">IOException, SAXException, TikaException {</code></div><div class="line number77 index1 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">ContentHandler handler = </code><code class="java keyword">new</code> <code class="java plain">BodyContentHandler(</code></div><div class="line number78 index2 alt
 1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">new</code> <code class="java plain">ToXMLContentHandler());</code></div><div class="line number79 index3 alt2">&nbsp;</div><div class="line number80 index4 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">AutoDetectParser parser = </code><code class="java keyword">new</code> <code class="java plain">AutoDetectParser();</code></div><div class="line number81 index5 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Metadata metadata = </code><code class="java keyword">new</code> <code class="java plain">Metadata();</code></div><div class="line number82 index6 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> <code class="java plain">(InputStream stream = ContentHandlerExample.</code><code class="java keyword">class</code><code 
 class="java plain">.getResourceAsStream(</code><code class="java string">"test.doc"</code><code class="java plain">)) {</code></div><div class="line number83 index7 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">parser.parse(stream, handler, metadata);</code></div><div class="line number84 index8 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">return</code> <code class="java plain">handler.toString();</code></div><div class="line number85 index9 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">}</code></div><div class="line number86 index10 alt1"><code class="java plain">}</code></div></div></td></tr></tbody></table></div></div>
+<p>By using the <a href="./api/org/apache/tika/sax/ToXMLContentHandler.html">ToXMLContentHandler</a>, you can get the XHTML content of the whole document as a string.</p><div id="highlighter_593567" class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div class="container"><div class="line number61 index0 alt2"><code class="java keyword">public</code> <code class="java plain">String parseToHTML() </code><code class="java keyword">throws</code> <code class="java plain">IOException, SAXException, TikaException {</code></div><div class="line number62 index1 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">ContentHandler handler = </code><code class="java keyword">new</code> <code class="java plain">ToXMLContentHandler();</code></div><div class="line number63 index2 alt2">&nbsp;</div><div class="line number64 index3 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><cod
 e class="java plain">AutoDetectParser parser = </code><code class="java keyword">new</code> <code class="java plain">AutoDetectParser();</code></div><div class="line number65 index4 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Metadata metadata = </code><code class="java keyword">new</code> <code class="java plain">Metadata();</code></div><div class="line number66 index5 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> <code class="java plain">(InputStream stream = ContentHandlerExample.</code><code class="java keyword">class</code><code class="java plain">.getResourceAsStream(</code><code class="java string">"test.doc"</code><code class="java plain">)) {</code></div><div class="line number67 index6 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">parser.parse(stream, handler, metadata);</code></div><div class="line number68 in
 dex7 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">return</code> <code class="java plain">handler.toString();</code></div><div class="line number69 index8 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">}</code></div><div class="line number70 index9 alt1"><code class="java plain">}</code></div></div></td></tr></tbody></table></div>
+<p>If you just want the body of the xhtml document, without the header, you can chain together a <a href="./api/org/apache/tika/sax/BodyContentHandler.html">BodyContentHandler</a> and a <a href="./api/org/apache/tika/sax/ToXMLContentHandler.html">ToXMLContentHandler</a> as shown:</p><div id="highlighter_450488" class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div class="container"><div class="line number76 index0 alt1"><code class="java keyword">public</code> <code class="java plain">String parseBodyToHTML() </code><code class="java keyword">throws</code> <code class="java plain">IOException, SAXException, TikaException {</code></div><div class="line number77 index1 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">ContentHandler handler = </code><code class="java keyword">new</code> <code class="java plain">BodyContentHandler(</code></div><div class="line number78 index2 alt
 1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">new</code> <code class="java plain">ToXMLContentHandler());</code></div><div class="line number79 index3 alt2">&nbsp;</div><div class="line number80 index4 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">AutoDetectParser parser = </code><code class="java keyword">new</code> <code class="java plain">AutoDetectParser();</code></div><div class="line number81 index5 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Metadata metadata = </code><code class="java keyword">new</code> <code class="java plain">Metadata();</code></div><div class="line number82 index6 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> <code class="java plain">(InputStream stream = ContentHandlerExample.</code><code class="java keyword">class</code><code 
 class="java plain">.getResourceAsStream(</code><code class="java string">"test.doc"</code><code class="java plain">)) {</code></div><div class="line number83 index7 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">parser.parse(stream, handler, metadata);</code></div><div class="line number84 index8 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">return</code> <code class="java plain">handler.toString();</code></div><div class="line number85 index9 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">}</code></div><div class="line number86 index10 alt1"><code class="java plain">}</code></div></div></td></tr></tbody></table></div></div>
 <div class="section">
 <h4><a name="Fetching_just_certain_bits_of_the_XHTML">Fetching just certain bits of the XHTML</a></h4>
-<p>It possible to execute XPath queries on the parse results, to fetch only certain bits of the XHTML. </p><div id="highlighter_703973" class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div class="container"><div class="line number92 index0 alt1"><code class="java keyword">public</code> <code class="java plain">String parseOnePartToHTML() </code><code class="java keyword">throws</code> <code class="java plain">IOException, SAXException, TikaException {</code></div><div class="line number93 index1 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java comments">// Only get things under html -> body -> div (class=header)</code></div><div class="line number94 index2 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">XPathParser xhtmlParser = </code><code class="java keyword">new</code> <code class="java plain">XPathParser(</code><code class="java string">
 "xhtml"</code><code class="java plain">, XHTMLContentHandler.XHTML);</code></div><div class="line number95 index3 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Matcher divContentMatcher = xhtmlParser.parse(</code><code class="java string">"/xhtml:html/xhtml:body/xhtml:div/descendant::node()"</code><code class="java plain">);</code></div><div class="line number96 index4 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">ContentHandler handler = </code><code class="java keyword">new</code> <code class="java plain">MatchingContentHandler(</code></div><div class="line number97 index5 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">new</code> <code class="java plain">ToXMLContentHandler(), divContentMatcher);</code></div><div class="line number98 index6 alt1">&nbsp;</div><div class="line number99 index7 alt2"><code class=
 "java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">AutoDetectParser parser = </code><code class="java keyword">new</code> <code class="java plain">AutoDetectParser();</code></div><div class="line number100 index8 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Metadata metadata = </code><code class="java keyword">new</code> <code class="java plain">Metadata();</code></div><div class="line number101 index9 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> <code class="java plain">(InputStream stream = ContentHandlerExample.</code><code class="java keyword">class</code><code class="java plain">.getResourceAsStream(</code><code class="java string">"test2.doc"</code><code class="java plain">)) {</code></div><div class="line number102 index10 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">parser.parse(stream, handle
 r, metadata);</code></div><div class="line number103 index11 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">return</code> <code class="java plain">handler.toString();</code></div><div class="line number104 index12 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">}</code></div><div class="line number105 index13 alt2"><code class="java plain">}</code></div></div></td></tr></tbody></table></div></div></div>
+<p>It possible to execute XPath queries on the parse results, to fetch only certain bits of the XHTML. </p><div id="highlighter_881380" class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div class="container"><div class="line number92 index0 alt1"><code class="java keyword">public</code> <code class="java plain">String parseOnePartToHTML() </code><code class="java keyword">throws</code> <code class="java plain">IOException, SAXException, TikaException {</code></div><div class="line number93 index1 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java comments">// Only get things under html -> body -> div (class=header)</code></div><div class="line number94 index2 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">XPathParser xhtmlParser = </code><code class="java keyword">new</code> <code class="java plain">XPathParser(</code><code class="java string">
 "xhtml"</code><code class="java plain">, XHTMLContentHandler.XHTML);</code></div><div class="line number95 index3 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Matcher divContentMatcher = xhtmlParser.parse(</code><code class="java string">"/xhtml:html/xhtml:body/xhtml:div/descendant::node()"</code><code class="java plain">);</code></div><div class="line number96 index4 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">ContentHandler handler = </code><code class="java keyword">new</code> <code class="java plain">MatchingContentHandler(</code></div><div class="line number97 index5 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">new</code> <code class="java plain">ToXMLContentHandler(), divContentMatcher);</code></div><div class="line number98 index6 alt1">&nbsp;</div><div class="line number99 index7 alt2"><code class=
 "java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">AutoDetectParser parser = </code><code class="java keyword">new</code> <code class="java plain">AutoDetectParser();</code></div><div class="line number100 index8 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Metadata metadata = </code><code class="java keyword">new</code> <code class="java plain">Metadata();</code></div><div class="line number101 index9 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> <code class="java plain">(InputStream stream = ContentHandlerExample.</code><code class="java keyword">class</code><code class="java plain">.getResourceAsStream(</code><code class="java string">"test2.doc"</code><code class="java plain">)) {</code></div><div class="line number102 index10 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">parser.parse(stream, handle
 r, metadata);</code></div><div class="line number103 index11 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">return</code> <code class="java plain">handler.toString();</code></div><div class="line number104 index12 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">}</code></div><div class="line number105 index13 alt2"><code class="java plain">}</code></div></div></td></tr></tbody></table></div></div></div>
 <div class="section">
 <h3><a name="Custom_Content_Handlers">Custom Content Handlers</a></h3>
 <p>The textual output of parsing a file with Tika is returned via the SAX <a class="externalLink" href="http://docs.oracle.com/javase/7/docs/api/org/xml/sax/ContentHandler.html">ContentHandler</a> you pass to the parse method. It is possible to customise your parsing by supplying your own ContentHandler which does special things.</p>
@@ -141,16 +141,16 @@
 <p>By using the <a href="./api/org/apache/tika/sax/PhoneExtractingContentHandler.html">PhoneExtractingContentHandler</a>, you can have any phone numbers found in the textual content of the document extracted and placed into the Metadata object for you.</p></div>
 <div class="section">
 <h4><a name="Streaming_the_plain_text_in_chunks">Streaming the plain text in chunks</a></h4>
-<p>Sometimes, you want to chunk the resulting text up, perhaps to output as you go minimising memory use, perhaps to output to HDFS files, or any other reason! With a small custom content handler, you can do that.</p><div id="highlighter_248633" class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div class="container"><div class="line number113 index0 alt2"><code class="java keyword">public</code> <code class="java plain">List&lt;String> parseToPlainTextChunks() </code><code class="java keyword">throws</code> <code class="java plain">IOException, SAXException, TikaException {</code></div><div class="line number114 index1 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">final</code> <code class="java plain">List&lt;String> chunks = </code><code class="java keyword">new</code> <code class="java plain">ArrayList&lt;>();</code></div><div class="line number115 index2 alt2"><code c
 lass="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">chunks.add(</code><code class="java string">""</code><code class="java plain">);</code></div><div class="line number116 index3 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">ContentHandlerDecorator handler = </code><code class="java keyword">new</code> <code class="java plain">ContentHandlerDecorator() {</code></div><div class="line number117 index4 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java color1">@Override</code></div><div class="line number118 index5 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">public</code> <code class="java keyword">void</code> <code class="java plain">characters(</code><code class="java keyword">char</code><code class="java plain">[] ch, </code><code class="java keyword">int</code> <code class="java plain">start, </c
 ode><code class="java keyword">int</code> <code class="java plain">length) {</code></div><div class="line number119 index6 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">String lastChunk = chunks.get(chunks.size() - </code><code class="java value">1</code><code class="java plain">);</code></div><div class="line number120 index7 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">String thisStr = </code><code class="java keyword">new</code> <code class="java plain">String(ch, start, length);</code></div><div class="line number121 index8 alt2">&nbsp;</div><div class="line number122 index9 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">if</code> <code class="java plain">(lastChunk.length() + length > MAXIMUM_TEXT_CHUNK_SIZE) {
 </code></div><div class="line number123 index10 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">chunks.add(thisStr);</code></div><div class="line number124 index11 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">} </code><code class="java keyword">else</code> <code class="java plain">{</code></div><div class="line number125 index12 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">chunks.set(chunks.size() - </code><code class="java value">1</code><code class="java plain">, lastChunk + thisStr);</code></div><div class="line number126 index13 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">}</c
 ode></div><div class="line number127 index14 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">}</code></div><div class="line number128 index15 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">};</code></div><div class="line number129 index16 alt2">&nbsp;</div><div class="line number130 index17 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">AutoDetectParser parser = </code><code class="java keyword">new</code> <code class="java plain">AutoDetectParser();</code></div><div class="line number131 index18 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Metadata metadata = </code><code class="java keyword">new</code> <code class="java plain">Metadata();</code></div><div class="line number132 index19 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> <code class
 ="java plain">(InputStream stream = ContentHandlerExample.</code><code class="java keyword">class</code><code class="java plain">.getResourceAsStream(</code><code class="java string">"test2.doc"</code><code class="java plain">)) {</code></div><div class="line number133 index20 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">parser.parse(stream, handler, metadata);</code></div><div class="line number134 index21 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">return</code> <code class="java plain">chunks;</code></div><div class="line number135 index22 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">}</code></div><div class="line number136 index23 alt1"><code class="java plain">}</code></div></div></td></tr></tbody></table></div></div></div>
+<p>Sometimes, you want to chunk the resulting text up, perhaps to output as you go minimising memory use, perhaps to output to HDFS files, or any other reason! With a small custom content handler, you can do that.</p><div id="highlighter_582932" class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div class="container"><div class="line number113 index0 alt2"><code class="java keyword">public</code> <code class="java plain">List&lt;String> parseToPlainTextChunks() </code><code class="java keyword">throws</code> <code class="java plain">IOException, SAXException, TikaException {</code></div><div class="line number114 index1 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">final</code> <code class="java plain">List&lt;String> chunks = </code><code class="java keyword">new</code> <code class="java plain">ArrayList&lt;>();</code></div><div class="line number115 index2 alt2"><code c
 lass="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">chunks.add(</code><code class="java string">""</code><code class="java plain">);</code></div><div class="line number116 index3 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">ContentHandlerDecorator handler = </code><code class="java keyword">new</code> <code class="java plain">ContentHandlerDecorator() {</code></div><div class="line number117 index4 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java color1">@Override</code></div><div class="line number118 index5 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">public</code> <code class="java keyword">void</code> <code class="java plain">characters(</code><code class="java keyword">char</code><code class="java plain">[] ch, </code><code class="java keyword">int</code> <code class="java plain">start, </c
 ode><code class="java keyword">int</code> <code class="java plain">length) {</code></div><div class="line number119 index6 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">String lastChunk = chunks.get(chunks.size() - </code><code class="java value">1</code><code class="java plain">);</code></div><div class="line number120 index7 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">String thisStr = </code><code class="java keyword">new</code> <code class="java plain">String(ch, start, length);</code></div><div class="line number121 index8 alt2">&nbsp;</div><div class="line number122 index9 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">if</code> <code class="java plain">(lastChunk.length() + length > MAXIMUM_TEXT_CHUNK_SIZE) {
 </code></div><div class="line number123 index10 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">chunks.add(thisStr);</code></div><div class="line number124 index11 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">} </code><code class="java keyword">else</code> <code class="java plain">{</code></div><div class="line number125 index12 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">chunks.set(chunks.size() - </code><code class="java value">1</code><code class="java plain">, lastChunk + thisStr);</code></div><div class="line number126 index13 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">}</c
 ode></div><div class="line number127 index14 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">}</code></div><div class="line number128 index15 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">};</code></div><div class="line number129 index16 alt2">&nbsp;</div><div class="line number130 index17 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">AutoDetectParser parser = </code><code class="java keyword">new</code> <code class="java plain">AutoDetectParser();</code></div><div class="line number131 index18 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Metadata metadata = </code><code class="java keyword">new</code> <code class="java plain">Metadata();</code></div><div class="line number132 index19 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> <code class
 ="java plain">(InputStream stream = ContentHandlerExample.</code><code class="java keyword">class</code><code class="java plain">.getResourceAsStream(</code><code class="java string">"test2.doc"</code><code class="java plain">)) {</code></div><div class="line number133 index20 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">parser.parse(stream, handler, metadata);</code></div><div class="line number134 index21 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">return</code> <code class="java plain">chunks;</code></div><div class="line number135 index22 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">}</code></div><div class="line number136 index23 alt1"><code class="java plain">}</code></div></div></td></tr></tbody></table></div></div></div>
 <div class="section">
 <h3><a name="Translation">Translation</a></h3>
 <p>Tika provides a pluggable Translation system, which allow you to send the results of parsing off to an external system or program to have the text translated into another language.</p>
 <div class="section">
 <h4><a name="Translation_using_the_Microsoft_Translation_API">Translation using the Microsoft Translation API</a></h4>
-<p>In order to use the Microsoft Translation API, you need to sign up for a Microsoft account, get an API key, then pass the key to Tika before translating.</p><div id="highlighter_814709" class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div class="container"><div class="line number23 index0 alt2"><code class="java keyword">public</code> <code class="java plain">String microsoftTranslateToFrench(String text) {</code></div><div class="line number24 index1 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">MicrosoftTranslator translator = </code><code class="java keyword">new</code> <code class="java plain">MicrosoftTranslator();</code></div><div class="line number25 index2 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java comments">// Change the id and secret! See <a href="http://msdn.microsoft.com/en-us/library/hh454950.aspx.">http://msdn.microso
 ft.com/en-us/library/hh454950.aspx.</a></code></div><div class="line number26 index3 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">translator.setId(</code><code class="java string">"dummy-id"</code><code class="java plain">);</code></div><div class="line number27 index4 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">translator.setSecret(</code><code class="java string">"dummy-secret"</code><code class="java plain">);</code></div><div class="line number28 index5 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> <code class="java plain">{</code></div><div class="line number29 index6 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">return</code> <code class="java plain">translator.translate(text, </code><code class="java string">"fr"</code><code class="java plain">);</code></div><div class=
 "line number30 index7 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">} </code><code class="java keyword">catch</code> <code class="java plain">(Exception e) {</code></div><div class="line number31 index8 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">return</code> <code class="java string">"Error while translating."</code><code class="java plain">;</code></div><div class="line number32 index9 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">}</code></div><div class="line number33 index10 alt2"><code class="java plain">}</code></div></div></td></tr></tbody></table></div></div></div>
+<p>In order to use the Microsoft Translation API, you need to sign up for a Microsoft account, get an API key, then pass the key to Tika before translating.</p><div id="highlighter_481351" class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div class="container"><div class="line number23 index0 alt2"><code class="java keyword">public</code> <code class="java plain">String microsoftTranslateToFrench(String text) {</code></div><div class="line number24 index1 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">MicrosoftTranslator translator = </code><code class="java keyword">new</code> <code class="java plain">MicrosoftTranslator();</code></div><div class="line number25 index2 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java comments">// Change the id and secret! See <a href="http://msdn.microsoft.com/en-us/library/hh454950.aspx.">http://msdn.microso
 ft.com/en-us/library/hh454950.aspx.</a></code></div><div class="line number26 index3 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">translator.setId(</code><code class="java string">"dummy-id"</code><code class="java plain">);</code></div><div class="line number27 index4 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">translator.setSecret(</code><code class="java string">"dummy-secret"</code><code class="java plain">);</code></div><div class="line number28 index5 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> <code class="java plain">{</code></div><div class="line number29 index6 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">return</code> <code class="java plain">translator.translate(text, </code><code class="java string">"fr"</code><code class="java plain">);</code></div><div class=
 "line number30 index7 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">} </code><code class="java keyword">catch</code> <code class="java plain">(Exception e) {</code></div><div class="line number31 index8 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">return</code> <code class="java string">"Error while translating."</code><code class="java plain">;</code></div><div class="line number32 index9 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">}</code></div><div class="line number33 index10 alt2"><code class="java plain">}</code></div></div></td></tr></tbody></table></div></div></div>
 <div class="section">
 <h3><a name="Language_Identification">Language Identification</a></h3>
-<p>Tika provides support for identifying the language of text, through the <a href="./api/org/apache/tika/language/LanguageIdentifier.html">LanguageIdentifier</a> class.</p><div id="highlighter_29966" class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div class="container"><div class="line number23 index0 alt2"><code class="java keyword">public</code> <code class="java plain">String identifyLanguage(String text) {</code></div><div class="line number24 index1 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">LanguageIdentifier identifier = </code><code class="java keyword">new</code> <code class="java plain">LanguageIdentifier(text);</code></div><div class="line number25 index2 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">return</code> <code class="java plain">identifier.getLanguage();</code></div><div class="line number26 index3 alt1
 "><code class="java plain">}</code></div></div></td></tr></tbody></table></div></div>
+<p>Tika provides support for identifying the language of text, through the <a href="./api/org/apache/tika/language/LanguageIdentifier.html">LanguageIdentifier</a> class.</p><div id="highlighter_944512" class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div class="container"><div class="line number23 index0 alt2"><code class="java keyword">public</code> <code class="java plain">String identifyLanguage(String text) {</code></div><div class="line number24 index1 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">LanguageIdentifier identifier = </code><code class="java keyword">new</code> <code class="java plain">LanguageIdentifier(text);</code></div><div class="line number25 index2 alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">return</code> <code class="java plain">identifier.getLanguage();</code></div><div class="line number26 index3 alt
 1"><code class="java plain">}</code></div></div></td></tr></tbody></table></div></div>
 <div class="section">
 <h3><a name="Additional_Examples">Additional Examples</a></h3>
 <p>A number of other examples are also available, including all of the examples from the <a class="externalLink" href="http://manning.com/mattmann/">Tika In Action book</a>. These can all be found in the <a class="externalLink" href="https://svn.apache.org/repos/asf/tika/trunk/tika-example">Tika Example module</a> in SVN.</p></div></div>
@@ -200,7 +200,6 @@
                   
                   
                   
-                  
               
         <li class="expanded">
                     <a href="../1.28/index.html">Apache Tika 1.28</a>
@@ -237,10 +236,6 @@
     <li class="none">
                     <a href="../1.28/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="../1.28/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               
@@ -253,7 +248,6 @@
                   
                         
                   
-                  
               
             <li class="expanded">
                     <a href="../2.2.0/index.html">Apache Tika 2.2.0</a>
@@ -290,10 +284,6 @@
     <li class="none">
                     <a href="../2.2.0/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="../2.2.0/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               

Modified: tika/site/publish/2.2.0/formats.html
URL: http://svn.apache.org/viewvc/tika/site/publish/2.2.0/formats.html?rev=1896328&r1=1896327&r2=1896328&view=diff
==============================================================================
--- tika/site/publish/2.2.0/formats.html (original)
+++ tika/site/publish/2.2.0/formats.html Thu Dec 23 20:02:39 2021
@@ -666,7 +666,6 @@
                   
                   
                   
-                  
               
         <li class="expanded">
                     <a href="../1.28/index.html">Apache Tika 1.28</a>
@@ -703,10 +702,6 @@
     <li class="none">
                     <a href="../1.28/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="../1.28/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               
@@ -719,7 +714,6 @@
                   
                   
                   
-                  
               
             <li class="expanded">
                     <a href="../2.2.0/index.html">Apache Tika 2.2.0</a>
@@ -756,10 +750,6 @@
     <li class="none">
                     <a href="../2.2.0/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="../2.2.0/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               

Modified: tika/site/publish/2.2.0/gettingstarted.html
URL: http://svn.apache.org/viewvc/tika/site/publish/2.2.0/gettingstarted.html?rev=1896328&r1=1896327&r2=1896328&view=diff
==============================================================================
--- tika/site/publish/2.2.0/gettingstarted.html (original)
+++ tika/site/publish/2.2.0/gettingstarted.html Thu Dec 23 20:02:39 2021
@@ -341,7 +341,6 @@ curl http://.../document.doc \
                   
                   
                   
-                  
               
         <li class="expanded">
                     <a href="../1.28/index.html">Apache Tika 1.28</a>
@@ -378,10 +377,6 @@ curl http://.../document.doc \
     <li class="none">
                     <a href="../1.28/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="../1.28/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               
@@ -394,7 +389,6 @@ curl http://.../document.doc \
                   
                   
                   
-                  
               
             <li class="expanded">
                     <a href="../2.2.0/index.html">Apache Tika 2.2.0</a>
@@ -431,10 +425,6 @@ curl http://.../document.doc \
     <li class="none">
                     <a href="../2.2.0/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="../2.2.0/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               

Modified: tika/site/publish/2.2.0/index.html
URL: http://svn.apache.org/viewvc/tika/site/publish/2.2.0/index.html?rev=1896328&r1=1896327&r2=1896328&view=diff
==============================================================================
--- tika/site/publish/2.2.0/index.html (original)
+++ tika/site/publish/2.2.0/index.html Thu Dec 23 20:02:39 2021
@@ -166,7 +166,6 @@
                   
                   
                   
-                  
               
         <li class="expanded">
                     <a href="../1.28/index.html">Apache Tika 1.28</a>
@@ -203,10 +202,6 @@
     <li class="none">
                     <a href="../1.28/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="../1.28/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               
@@ -219,7 +214,6 @@
                   
                   
                   
-                  
               
             <li class="expanded">
               <strong>Apache Tika 2.2.0</strong>
@@ -256,10 +250,6 @@
     <li class="none">
                     <a href="../2.2.0/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="../2.2.0/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               

Modified: tika/site/publish/2.2.0/parser.html
URL: http://svn.apache.org/viewvc/tika/site/publish/2.2.0/parser.html?rev=1896328&r1=1896327&r2=1896328&view=diff
==============================================================================
--- tika/site/publish/2.2.0/parser.html (original)
+++ tika/site/publish/2.2.0/parser.html Thu Dec 23 20:02:39 2021
@@ -223,7 +223,6 @@ try {
                   
                   
                   
-                  
               
         <li class="expanded">
                     <a href="../1.28/index.html">Apache Tika 1.28</a>
@@ -260,10 +259,6 @@ try {
     <li class="none">
                     <a href="../1.28/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="../1.28/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               
@@ -276,7 +271,6 @@ try {
                   
                   
                   
-                  
               
             <li class="expanded">
                     <a href="../2.2.0/index.html">Apache Tika 2.2.0</a>
@@ -313,10 +307,6 @@ try {
     <li class="none">
                     <a href="../2.2.0/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="../2.2.0/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               

Modified: tika/site/publish/2.2.0/parser_guide.html
URL: http://svn.apache.org/viewvc/tika/site/publish/2.2.0/parser_guide.html?rev=1896328&r1=1896327&r2=1896328&view=diff
==============================================================================
--- tika/site/publish/2.2.0/parser_guide.html (original)
+++ tika/site/publish/2.2.0/parser_guide.html Thu Dec 23 20:02:39 2021
@@ -221,7 +221,6 @@ public class HelloParser extends Abstrac
                   
                   
                   
-                  
               
         <li class="expanded">
                     <a href="../1.28/index.html">Apache Tika 1.28</a>
@@ -258,10 +257,6 @@ public class HelloParser extends Abstrac
     <li class="none">
                     <a href="../1.28/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="../1.28/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               
@@ -274,7 +269,6 @@ public class HelloParser extends Abstrac
                   
                   
                   
-                  
               
             <li class="expanded">
                     <a href="../2.2.0/index.html">Apache Tika 2.2.0</a>
@@ -311,10 +305,6 @@ public class HelloParser extends Abstrac
     <li class="none">
                     <a href="../2.2.0/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="../2.2.0/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               

Modified: tika/site/publish/contribute.html
URL: http://svn.apache.org/viewvc/tika/site/publish/contribute.html?rev=1896328&r1=1896327&r2=1896328&view=diff
==============================================================================
--- tika/site/publish/contribute.html (original)
+++ tika/site/publish/contribute.html Thu Dec 23 20:02:39 2021
@@ -99,7 +99,7 @@
 <p>If you're new to reporting problems, you might find the <a class="externalLink" href="http://www.chiark.greenend.org.uk/~sgtatham/bugs.html">How to Report Bugs Effectively</a> essay (amongst many others) useful for learning more about what makes an effective and helpful bug report.</p></div>
 <div class="section">
 <h2><a name="New_Parsers_Detectors_and_Mime_Types"></a>New Parsers, Detectors and Mime Types</h2>
-<p>The <a href="./2.2.0/parser_guide.html">Parser Quick Start Guide</a> provides instructions on adding new mime types and new parsers to Tika.</p>
+<p>The <a href="./1.28/parser_guide.html">Parser Quick Start Guide</a> provides instructions on adding new mime types and new parsers to Tika.</p>
 <p>If your new Parser or Detector depends on libraries which we cannot include in Tika for license reasons, you are encouraged to list it on the <a class="externalLink" href="https://cwiki.apache.org/confluence/display/TIKA/3rd+party+parser+plugins">3rd Party Parser Plugins</a> page on the Tika wiki.</p></div>
 <div class="section">
 <h2><a name="Submitting_Enhancements_and_Fixes"></a>Submitting Enhancements and Fixes</h2>
@@ -169,7 +169,6 @@
                   
                   
                   
-                  
               
         <li class="expanded">
                     <a href="1.28/index.html">Apache Tika 1.28</a>
@@ -206,10 +205,6 @@
     <li class="none">
                     <a href="1.28/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="1.28/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               
@@ -218,7 +213,6 @@
                   
                   
                   
-                  
                   
                   
                   

Modified: tika/site/publish/download.html
URL: http://svn.apache.org/viewvc/tika/site/publish/download.html?rev=1896328&r1=1896327&r2=1896328&view=diff
==============================================================================
--- tika/site/publish/download.html (original)
+++ tika/site/publish/download.html Thu Dec 23 20:02:39 2021
@@ -86,15 +86,15 @@
       <div id="content">
         <!-- Licensed to the Apache Software Foundation (ASF) under one or more --><!-- contributor license agreements.  See the NOTICE file distributed with --><!-- this work for additional information regarding copyright ownership. --><!-- The ASF licenses this file to You under the Apache License, Version 2.0 --><!-- (the "License"); you may not use this file except in compliance with --><!-- the License.  You may obtain a copy of the License at --><!--  --><!-- https://www.apache.org/licenses/LICENSE-2.0 --><!--  --><!-- Unless required by applicable law or agreed to in writing, software --><!-- distributed under the License is distributed on an "AS IS" BASIS, --><!-- WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. --><!-- See the License for the specific language governing permissions and --><!-- limitations under the License. --><div class="section">
 <h2><a name="Download_Apache_Tika"></a>Download Apache Tika</h2>
-<p>Apache Tika 2.2.0 is now available. See the <a class="externalLink" href="https://www.apache.org/dist/tika/2.2.0/CHANGES-2.2.0.txt">CHANGES-2.2.0.txt</a> file for more information on the list of breaking changes and updates.</p>
+<p>Apache Tika 1.28 is now available. See the <a class="externalLink" href="https://www.apache.org/dist/tika/2.2.0/CHANGES-2.2.0.txt">CHANGES-2.2.0.txt</a> file for more information on the list of breaking changes and updates.</p>
 <ul>
-<li><a class="externalLink" href="https://www.apache.org/dyn/closer.lua/tika/2.2.0/tika-2.2.0-src.zip"> Mirrors for tika-2.2.0-src.zip</a> (source archive, <a class="externalLink" href="https://downloads.apache.org/tika/2.2.0/tika-2.2.0-src.zip.asc">PGP signature</a>, <a class="externalLink" href="https://downloads.apache.org/tika/2.2.0/tika-2.2.0-src.zip.sha512">SHA512</a>)</li>
-<li><a class="externalLink" href="https://www.apache.org/dyn/closer.lua/tika/2.2.0/tika-app-2.2.0.jar">Mirrors for tika-app-2.2.0.jar</a> (runnable jar, <a class="externalLink" href="https://downloads.apache.org/tika/2.2.0/tika-app-2.2.0.jar.asc">PGP signature</a>, <a class="externalLink" href="https://downloads.apache.org/tika/2.2.0/tika-app-2.2.0.jar.sha512">SHA512</a>)</li>
-<li><a class="externalLink" href="https://www.apache.org/dyn/closer.lua/tika/2.2.0/tika-server-standard-2.2.0.jar">Mirrors for tika-server-standard-2.2.0.jar</a> (runnable jar, <a class="externalLink" href="https://downloads.apache.org/tika/2.2.0/tika-server-standard-2.2.0.jar.asc">PGP signature</a>, <a class="externalLink" href="https://downloads.apache.org/tika/2.2.0/tika-server-standard-2.2.0.jar.sha512">SHA512</a>)</li>
-<li><a class="externalLink" href="https://www.apache.org/dyn/closer.lua/tika/2.2.0/tika-server-standard-2.2.0-bin.zip">Mirrors for tika-server-standard-2.2.0.bin.zip</a> (tika-server as a service), <a class="externalLink" href="https://downloads.apache.org/tika/2.2.0/tika-server-standard-2.2.0-bin.zip.asc">PGP signature</a>, <a class="externalLink" href="https://downloads.apache.org/tika/2.2.0/tika-server-standard-2.2.0-bin.zip.sha512">SHA512</a>)</li>
-<li><a class="externalLink" href="https://www.apache.org/dyn/closer.lua/tika/2.2.0/tika-eval-app-2.2.0.jar">Mirrors for tika-eval-app-2.2.0.jar</a> (runnable jar, <a class="externalLink" href="https://downloads.apache.org/tika/2.2.0/tika-eval-app-2.2.0.jar.asc">PGP signature</a>, <a class="externalLink" href="https://downloads.apache.org/tika/2.2.0/tika-eval-app-2.2.0.jar.sha512">SHA512</a>)</li>
-<li><a class="externalLink" href="https://www.apache.org/dyn/closer.lua/tika/2.2.0/tika-parser-scientific-package-2.2.0.jar">Mirrors for tika-parser-scientific-package-2.2.0.jar</a> (scientific parsers, <a class="externalLink" href="https://downloads.apache.org/tika/2.2.0/tika-parser-scientific-package-2.2.0.jar.asc">PGP signature</a>, <a class="externalLink" href="https://downloads.apache.org/tika/2.2.0/tika-parser-scientific-package-2.2.0.jar.sha512">SHA512</a>)</li>
-<li><a class="externalLink" href="https://www.apache.org/dyn/closer.lua/tika/2.2.0/tika-parser-sqlite3-package-2.2.0.jar">Mirrors for tika-parser-sqlite3-package-2.2.0.jar</a> (sqlite3 parser, <a class="externalLink" href="https://downloads.apache.org/tika/2.2.0/tika-parser-sqlite3-package-2.2.0.jar.asc">PGP signature</a>, <a class="externalLink" href="https://downloads.apache.org/tika/2.2.0/tika-parser-sqlite3-package-2.2.0.jar.sha512">SHA512</a>)
+<li><a class="externalLink" href="https://www.apache.org/dyn/closer.lua/tika/1.28/tika-1.28-src.zip"> Mirrors for tika-1.28-src.zip</a> (source archive, <a class="externalLink" href="https://downloads.apache.org/tika/1.28/tika-1.28-src.zip.asc">PGP signature</a>, <a class="externalLink" href="https://downloads.apache.org/tika/1.28/tika-1.28-src.zip.sha512">SHA512</a>)</li>
+<li><a class="externalLink" href="https://www.apache.org/dyn/closer.lua/tika/1.28/tika-app-1.28.jar">Mirrors for tika-app-1.28.jar</a> (runnable jar, <a class="externalLink" href="https://downloads.apache.org/tika/1.28/tika-app-1.28.jar.asc">PGP signature</a>, <a class="externalLink" href="https://downloads.apache.org/tika/1.28/tika-app-1.28.jar.sha512">SHA512</a>)</li>
+<li><a class="externalLink" href="https://www.apache.org/dyn/closer.lua/tika/1.28/tika-server-standard-1.28.jar">Mirrors for tika-server-standard-1.28.jar</a> (runnable jar, <a class="externalLink" href="https://downloads.apache.org/tika/1.28/tika-server-standard-1.28.jar.asc">PGP signature</a>, <a class="externalLink" href="https://downloads.apache.org/tika/1.28/tika-server-standard-1.28.jar.sha512">SHA512</a>)</li>
+<li><a class="externalLink" href="https://www.apache.org/dyn/closer.lua/tika/1.28/tika-server-standard-1.28-bin.zip">Mirrors for tika-server-standard-1.28.bin.zip</a> (tika-server as a service), <a class="externalLink" href="https://downloads.apache.org/tika/1.28/tika-server-standard-1.28-bin.zip.asc">PGP signature</a>, <a class="externalLink" href="https://downloads.apache.org/tika/1.28/tika-server-standard-1.28-bin.zip.sha512">SHA512</a>)</li>
+<li><a class="externalLink" href="https://www.apache.org/dyn/closer.lua/tika/1.28/tika-eval-app-1.28.jar">Mirrors for tika-eval-app-1.28.jar</a> (runnable jar, <a class="externalLink" href="https://downloads.apache.org/tika/1.28/tika-eval-app-1.28.jar.asc">PGP signature</a>, <a class="externalLink" href="https://downloads.apache.org/tika/1.28/tika-eval-app-1.28.jar.sha512">SHA512</a>)</li>
+<li><a class="externalLink" href="https://www.apache.org/dyn/closer.lua/tika/1.28/tika-parser-scientific-package-1.28.jar">Mirrors for tika-parser-scientific-package-1.28.jar</a> (scientific parsers, <a class="externalLink" href="https://downloads.apache.org/tika/1.28/tika-parser-scientific-package-1.28.jar.asc">PGP signature</a>, <a class="externalLink" href="https://downloads.apache.org/tika/1.28/tika-parser-scientific-package-1.28.jar.sha512">SHA512</a>)</li>
+<li><a class="externalLink" href="https://www.apache.org/dyn/closer.lua/tika/1.28/tika-parser-sqlite3-package-1.28.jar">Mirrors for tika-parser-sqlite3-package-1.28.jar</a> (sqlite3 parser, <a class="externalLink" href="https://downloads.apache.org/tika/1.28/tika-parser-sqlite3-package-1.28.jar.asc">PGP signature</a>, <a class="externalLink" href="https://downloads.apache.org/tika/1.28/tika-parser-sqlite3-package-1.28.jar.sha512">SHA512</a>)
 <p>The latest 1.x release (1.28) is available via the links below. See the <a class="externalLink" href="https://www.apache.org/dist/tika/1.28/CHANGES-1.28.txt">CHANGES.txt</a> file for more information on the list of updates.</p></li>
 <li><a class="externalLink" href="https://www.apache.org/dyn/closer.lua/tika/1.28/tika-1.28-src.zip">Mirrors for tika-1.28-src.zip</a> (source archive, <a class="externalLink" href="https://downloads.apache.org/tika/1.28/tika-1.28-src.zip.asc">PGP signature</a>, <a class="externalLink" href="https://downloads.apache.org/tika/1.28/tika-1.28-src.zip.sha512">SHA512</a>)</li>
 <li><a class="externalLink" href="https://www.apache.org/dyn/closer.lua/tika/1.28/tika-app-1.28.jar">Mirrors for tika-app-1.28.jar</a> (runnable jar, <a class="externalLink" href="https://downloads.apache.org/tika/1.28/tika-app-1.28.jar.asc">PGP signature</a>, <a class="externalLink" href="https://downloads.apache.org/tika/1.28/tika-app-1.28.jar.sha512">SHA512</a>)</li>
@@ -172,7 +172,6 @@
                   
                   
                   
-                  
               
         <li class="expanded">
                     <a href="1.28/index.html">Apache Tika 1.28</a>
@@ -209,10 +208,6 @@
     <li class="none">
                     <a href="1.28/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="1.28/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               
@@ -221,7 +216,6 @@
                   
                   
                   
-                  
                   
                   
                   

Modified: tika/site/publish/index.html
URL: http://svn.apache.org/viewvc/tika/site/publish/index.html?rev=1896328&r1=1896327&r2=1896328&view=diff
==============================================================================
--- tika/site/publish/index.html (original)
+++ tika/site/publish/index.html Thu Dec 23 20:02:39 2021
@@ -86,8 +86,8 @@
       <div id="content">
         <!-- Licensed to the Apache Software Foundation (ASF) under one or more --><!-- contributor license agreements.  See the NOTICE file distributed with --><!-- this work for additional information regarding copyright ownership. --><!-- The ASF licenses this file to You under the Apache License, Version 2.0 --><!-- (the "License"); you may not use this file except in compliance with --><!-- the License.  You may obtain a copy of the License at --><!--  --><!-- https://www.apache.org/licenses/LICENSE-2.0 --><!--  --><!-- Unless required by applicable law or agreed to in writing, software --><!-- distributed under the License is distributed on an "AS IS" BASIS, --><!-- WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. --><!-- See the License for the specific language governing permissions and --><!-- limitations under the License. --><div class="section">
 <h2><a name="Apache_Tika_-_a_content_analysis_toolkit"></a>Apache Tika - a content analysis toolkit</h2>
-<p>The Apache Tika&#x2122; toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF). All of these file types can be parsed through a single interface, making Tika useful for search engine indexing, content analysis, translation, and much more. You can find the latest release on the <a href="./download.html">download page</a>. Please see the <a href="./2.2.0/gettingstarted.html">Getting Started</a> page for more information on how to start using Tika.</p>
-<p>The <a href="./2.2.0/parser.html">Parser</a> and <a href="./2.2.0/detection.html">Detector</a> pages describe the main interfaces of Tika and how they work.</p>
+<p>The Apache Tika&#x2122; toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF). All of these file types can be parsed through a single interface, making Tika useful for search engine indexing, content analysis, translation, and much more. You can find the latest release on the <a href="./download.html">download page</a>. Please see the <a href="./1.28/gettingstarted.html">Getting Started</a> page for more information on how to start using Tika.</p>
+<p>The <a href="./1.28/parser.html">Parser</a> and <a href="./1.28/detection.html">Detector</a> pages describe the main interfaces of Tika and how they work.</p>
 <p>If you're interested in contributing to Tika, please see the <a href="./contribute.html">Contributing</a> page or send an email to the <a href="./mail-lists.html">Tika development list</a>.</p>
 <p>Tika is a project of the <a class="externalLink" href="https://www.apache.org/">Apache Software Foundation</a>, and was formerly a subproject of <a class="externalLink" href="https://lucene.apache.org/">Apache Lucene</a>. </p></div>
 <div class="section">
@@ -306,7 +306,6 @@
                   
                   
                   
-                  
               
         <li class="expanded">
                     <a href="1.28/index.html">Apache Tika 1.28</a>
@@ -343,10 +342,6 @@
     <li class="none">
                     <a href="1.28/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="1.28/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               
@@ -355,7 +350,6 @@
                   
                   
                   
-                  
                   
                   
                   

Modified: tika/site/publish/issue-tracking.html
URL: http://svn.apache.org/viewvc/tika/site/publish/issue-tracking.html?rev=1896328&r1=1896327&r2=1896328&view=diff
==============================================================================
--- tika/site/publish/issue-tracking.html (original)
+++ tika/site/publish/issue-tracking.html Thu Dec 23 20:02:39 2021
@@ -138,7 +138,6 @@
                   
                   
                   
-                  
               
         <li class="expanded">
                     <a href="1.28/index.html">Apache Tika 1.28</a>
@@ -175,10 +174,6 @@
     <li class="none">
                     <a href="1.28/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="1.28/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               
@@ -187,7 +182,6 @@
                   
                   
                   
-                  
                   
                   
                   

Modified: tika/site/publish/license.html
URL: http://svn.apache.org/viewvc/tika/site/publish/license.html?rev=1896328&r1=1896327&r2=1896328&view=diff
==============================================================================
--- tika/site/publish/license.html (original)
+++ tika/site/publish/license.html Thu Dec 23 20:02:39 2021
@@ -341,7 +341,6 @@
                   
                   
                   
-                  
               
         <li class="expanded">
                     <a href="1.28/index.html">Apache Tika 1.28</a>
@@ -378,10 +377,6 @@
     <li class="none">
                     <a href="1.28/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="1.28/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               
@@ -390,7 +385,6 @@
                   
                   
                   
-                  
                   
                   
                   

Modified: tika/site/publish/mail-lists.html
URL: http://svn.apache.org/viewvc/tika/site/publish/mail-lists.html?rev=1896328&r1=1896327&r2=1896328&view=diff
==============================================================================
--- tika/site/publish/mail-lists.html (original)
+++ tika/site/publish/mail-lists.html Thu Dec 23 20:02:39 2021
@@ -145,7 +145,6 @@
                   
                   
                   
-                  
               
         <li class="expanded">
                     <a href="1.28/index.html">Apache Tika 1.28</a>
@@ -182,10 +181,6 @@
     <li class="none">
                     <a href="1.28/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="1.28/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               
@@ -194,7 +189,6 @@
                   
                   
                   
-                  
                   
                   
                   

Modified: tika/site/publish/project-info.html
URL: http://svn.apache.org/viewvc/tika/site/publish/project-info.html?rev=1896328&r1=1896327&r2=1896328&view=diff
==============================================================================
--- tika/site/publish/project-info.html (original)
+++ tika/site/publish/project-info.html Thu Dec 23 20:02:39 2021
@@ -156,7 +156,6 @@
                   
                   
                   
-                  
               
         <li class="expanded">
                     <a href="1.28/index.html">Apache Tika 1.28</a>
@@ -193,10 +192,6 @@
     <li class="none">
                     <a href="1.28/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="1.28/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               
@@ -205,7 +200,6 @@
                   
                   
                   
-                  
                   
                   
                   

Modified: tika/site/publish/security.html
URL: http://svn.apache.org/viewvc/tika/site/publish/security.html?rev=1896328&r1=1896327&r2=1896328&view=diff
==============================================================================
--- tika/site/publish/security.html (original)
+++ tika/site/publish/security.html Thu Dec 23 20:02:39 2021
@@ -413,7 +413,6 @@
                   
                   
                   
-                  
               
         <li class="expanded">
                     <a href="1.28/index.html">Apache Tika 1.28</a>
@@ -450,10 +449,6 @@
     <li class="none">
                     <a href="1.28/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="1.28/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               
@@ -462,7 +457,6 @@
                   
                   
                   
-                  
                   
                   
                   

Modified: tika/site/publish/source-repository.html
URL: http://svn.apache.org/viewvc/tika/site/publish/source-repository.html?rev=1896328&r1=1896327&r2=1896328&view=diff
==============================================================================
--- tika/site/publish/source-repository.html (original)
+++ tika/site/publish/source-repository.html Thu Dec 23 20:02:39 2021
@@ -151,7 +151,6 @@
                   
                   
                   
-                  
               
         <li class="expanded">
                     <a href="1.28/index.html">Apache Tika 1.28</a>
@@ -188,10 +187,6 @@
     <li class="none">
                     <a href="1.28/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="1.28/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               
@@ -200,7 +195,6 @@
                   
                   
                   
-                  
                   
                   
                   

Modified: tika/site/publish/team-list.html
URL: http://svn.apache.org/viewvc/tika/site/publish/team-list.html?rev=1896328&r1=1896327&r2=1896328&view=diff
==============================================================================
--- tika/site/publish/team-list.html (original)
+++ tika/site/publish/team-list.html Thu Dec 23 20:02:39 2021
@@ -363,7 +363,6 @@ window.onLoad = init();
                   
                   
                   
-                  
               
         <li class="expanded">
                     <a href="1.28/index.html">Apache Tika 1.28</a>
@@ -400,10 +399,6 @@ window.onLoad = init();
     <li class="none">
                     <a href="1.28/api/">API Documentation</a>
           </li>
-                  
-    <li class="none">
-                    <a href="1.28/miredot/">REST API Documentation (Miredot)</a>
-          </li>
               </ul>
         </li>
               
@@ -412,7 +407,6 @@ window.onLoad = init();
                   
                   
                   
-                  
                   
                   
                   

Modified: tika/site/src/site/apt/1.28/formats.apt
URL: http://svn.apache.org/viewvc/tika/site/src/site/apt/1.28/formats.apt?rev=1896328&r1=1896327&r2=1896328&view=diff
==============================================================================
--- tika/site/src/site/apt/1.28/formats.apt (original)
+++ tika/site/src/site/apt/1.28/formats.apt Thu Dec 23 20:02:39 2021
@@ -20,7 +20,7 @@
 Supported Document Formats
 
    This page lists all the document formats supported by the parsers in
-   Apache Tika 1.20. Follow the links to the various parser class javadocs
+   Apache Tika 1.28. Follow the links to the various parser class javadocs
    for more detailed information about each document format and how it is 
    parsed by Tika.
 

Modified: tika/site/src/site/site.xml
URL: http://svn.apache.org/viewvc/tika/site/src/site/site.xml?rev=1896328&r1=1896327&r2=1896328&view=diff
==============================================================================
--- tika/site/src/site/site.xml (original)
+++ tika/site/src/site/site.xml Thu Dec 23 20:02:39 2021
@@ -50,7 +50,6 @@
         <item name="Configuring Tika" href="1.28/configuring.html"/>
         <item name="Usage Examples" href="1.28/examples.html"/>
         <item name="API Documentation" href="1.28/api/"/>
-        <item name="REST API Documentation (Miredot)" href="1.28/miredot/"/>
       </item>
       <item name="Apache Tika 2.2.0" href="2.2.0/index.html" collapse="true">
         <item name="Getting Started" href="2.2.0/gettingstarted.html"/>
@@ -61,7 +60,6 @@
         <item name="Configuring Tika" href="2.2.0/configuring.html"/>
         <item name="Usage Examples" href="2.2.0/examples.html"/>
         <item name="API Documentation" href="2.2.0/api/"/>
-        <item name="REST API Documentation (Miredot)" href="2.2.0/miredot/"/>
       </item>
       <item name="Apache Tika 2.1.0" href="2.1.0/index.html" collapse="true">
         <item name="Getting Started" href="2.1.0/gettingstarted.html"/>