You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@tika.apache.org by ju...@apache.org on 2008/06/06 20:34:43 UTC

svn commit: r664072 [2/3] - /incubator/tika/site/

Modified: incubator/tika/site/findbugs.html
URL: http://svn.apache.org/viewvc/incubator/tika/site/findbugs.html?rev=664072&r1=664071&r2=664072&view=diff
==============================================================================
--- incubator/tika/site/findbugs.html (original)
+++ incubator/tika/site/findbugs.html Fri Jun  6 11:34:43 2008
@@ -78,7 +78,7 @@
         </li>
               
     <li class="none">
-              <a href="http://www.apache.org/dyn/closer.cgi/incubator/tika">Download</a>
+              <a href="download.html">Download</a>
         </li>
           </ul>
           <h5>Project Documentation</h5>
@@ -170,7 +170,7 @@
     </div>
     <div id="bodyColumn">
       <div id="contentBox">
-        <div class="section"><h2>FindBugs Bug Detector Report</h2><p>The following document contains the results of <a href="http://findbugs.sourceforge.net">FindBugs Report</a></p><p>FindBugs Version is <i>1.1.1</i></p><p>Threshold is <i>Normal</i></p><p>Effort is <i>Default</i></p></div><h2>Summary</h2><table class="bodyTable"><tr class="a"><th>Classes</th><th>Bugs</th><th>Errors</th><th>Missing Classes</th></tr><tr class="b"><td>456</td><td>18</td><td>17</td><td>30</td></tr></table></p></div><h2>Files</h2><table class="bodyTable"><tr class="a"><th>Class</th><th>Bugs</th></tr><tr class="b"><td><a href="#org.apache.tika.config.TikaConfig">org.apache.tika.config.TikaConfig</a></td><td>1</td></tr><tr class="a"><td><a href="#org.apache.tika.metadata.Metadata">org.apache.tika.metadata.Metadata</a></td><td>1</td></tr><tr class="b"><td><a href="#org.apache.tika.mime.MimeType$RootXML">org.apache.tika.mime.MimeType$RootXML</a></td><td>1</td></tr><tr class="a"><td><a href="#org.apac
 he.tika.parser.microsoft.ExcelEventParser$TikaHSSFListener">org.apache.tika.parser.microsoft.ExcelEventParser$TikaHSSFListener</a></td><td>1</td></tr><tr class="b"><td><a href="#org.apache.tika.parser.microsoft.OfficeParser">org.apache.tika.parser.microsoft.OfficeParser</a></td><td>1</td></tr><tr class="a"><td><a href="#org.apache.tika.parser.microsoft.PowerPointExtractor">org.apache.tika.parser.microsoft.PowerPointExtractor</a></td><td>6</td></tr><tr class="b"><td><a href="#org.apache.tika.parser.microsoft.Slide">org.apache.tika.parser.microsoft.Slide</a></td><td>2</td></tr><tr class="a"><td><a href="#org.apache.tika.parser.microsoft.TextBox">org.apache.tika.parser.microsoft.TextBox</a></td><td>1</td></tr><tr class="b"><td><a href="#org.apache.tika.parser.microsoft.WordParser">org.apache.tika.parser.microsoft.WordParser</a></td><td>2</td></tr><tr class="a"><td><a href="#org.apache.tika.parser.opendocument.OpenOfficeParser">org.apache.tika.parser.opendocument.OpenOfficeParse
 r</a></td><td>1</td></tr><tr class="b"><td><a href="#org.apache.tika.utils.StringUtil">org.apache.tika.utils.StringUtil</a></td><td>1</td></tr></table><a name="org.apache.tika.config.TikaConfig"></a><div class="section"><h3>org.apache.tika.config.TikaConfig</h3><table class="bodyTable"><tr class="a"><th>Bug</th><th>Category</th><th>Details</th><th>Line</th></tr><tr class="b"><td>Write to static field org.apache.tika.config.TikaConfig.mimeTypes from instance method org.apache.tika.config.TikaConfig.TikaConfig(org.jdom.Element)</td><td>STYLE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#ST_WRITE_TO_STATIC_FROM_INSTANCE_METHOD">ST_WRITE_TO_STATIC_FROM_INSTANCE_METHOD</a></td><td><a href=xref/org/apache/tika/config/TikaConfig.html#73>73</a></td></tr></table></div><a name="org.apache.tika.metadata.Metadata"></a><div class="section"><h3>org.apache.tika.metadata.Metadata</h3><table class="bodyTable"><tr class="a"><th>Bug</th><th>Category</th><th>Details</th>
 <th>Line</th></tr><tr class="b"><td>org.apache.tika.metadata.Metadata defines equals and uses Object.hashCode()</td><td>BAD_PRACTICE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#HE_EQUALS_USE_HASHCODE">HE_EQUALS_USE_HASHCODE</a></td><td><a href=xref/org/apache/tika/metadata/Metadata.html#173>173-201</a></td></tr></table></div><a name="org.apache.tika.mime.MimeType$RootXML"></a><div class="section"><h3>org.apache.tika.mime.MimeType$RootXML</h3><table class="bodyTable"><tr class="a"><th>Bug</th><th>Category</th><th>Details</th><th>Line</th></tr><tr class="b"><td>Should org.apache.tika.mime.MimeType$RootXML be a _static_ inner class?</td><td>PERFORMANCE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#SIC_INNER_SHOULD_BE_STATIC">SIC_INNER_SHOULD_BE_STATIC</a></td><td><a href=xref/org/apache/tika/mime/MimeType$RootXML.html#-1>Not available</a></td></tr></table></div><a name="org.apache.tika.parser.microsoft.ExcelEventParser$TikaHSSFL
 istener"></a><div class="section"><h3>org.apache.tika.parser.microsoft.ExcelEventParser$TikaHSSFListener</h3><table class="bodyTable"><tr class="a"><th>Bug</th><th>Category</th><th>Details</th><th>Line</th></tr><tr class="b"><td>Class org.apache.tika.parser.microsoft.ExcelEventParser$TikaHSSFListener defines non-transient non-serializable instance field appendable</td><td>BAD_PRACTICE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#SE_BAD_FIELD">SE_BAD_FIELD</a></td><td><a href=xref/org/apache/tika/parser/microsoft/ExcelEventParser$TikaHSSFListener.html#-1>Not available</a></td></tr></table></div><a name="org.apache.tika.parser.microsoft.OfficeParser"></a><div class="section"><h3>org.apache.tika.parser.microsoft.OfficeParser</h3><table class="bodyTable"><tr class="a"><th>Bug</th><th>Category</th><th>Details</th><th>Line</th></tr><tr class="b"><td>Method org.apache.tika.parser.microsoft.OfficeParser.getMetadata(org.apache.poi.poifs.filesystem.POIFSFileSy
 stem,String,org.apache.tika.metadata.Metadata) catches Exception, but Exception is not thrown in the try block and RuntimeException is not explicitly caught</td><td>STYLE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#REC_CATCH_EXCEPTION">REC_CATCH_EXCEPTION</a></td><td><a href=xref/org/apache/tika/parser/microsoft/OfficeParser.html#88>88</a></td></tr></table></div><a name="org.apache.tika.parser.microsoft.PowerPointExtractor"></a><div class="section"><h3>org.apache.tika.parser.microsoft.PowerPointExtractor</h3><table class="bodyTable"><tr class="a"><th>Bug</th><th>Category</th><th>Details</th><th>Line</th></tr><tr class="b"><td>Dead store to outStream in method org.apache.tika.parser.microsoft.PowerPointExtractor.extractSlides(long,byte[],long)</td><td>STYLE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#DLS_DEAD_LOCAL_STORE">DLS_DEAD_LOCAL_STORE</a></td><td><a href=xref/org/apache/tika/parser/microsoft/PowerPointExtractor.html#
 410>410</a></td></tr><tr class="a"><td>Dead store to outStream in method org.apache.tika.parser.microsoft.PowerPointExtractor.extractTextBoxes(java.util.Hashtable,int,byte[],long)</td><td>STYLE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#DLS_DEAD_LOCAL_STORE">DLS_DEAD_LOCAL_STORE</a></td><td><a href=xref/org/apache/tika/parser/microsoft/PowerPointExtractor.html#169>169</a></td></tr><tr class="b"><td>Method org.apache.tika.parser.microsoft.PowerPointExtractor.extractTextBoxes(java.util.Hashtable,int,byte[],long) invokes inefficient Long(long) constructor; use Long.valueOf(long) instead</td><td>PERFORMANCE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#DM_NUMBER_CTOR">DM_NUMBER_CTOR</a></td><td><a href=xref/org/apache/tika/parser/microsoft/PowerPointExtractor.html#206>206</a></td></tr><tr class="a"><td>Method org.apache.tika.parser.microsoft.PowerPointExtractor.extractTextBoxes(java.util.Hashtable,int,byte[],long) invokes ineffi
 cient Long(long) constructor; use Long.valueOf(long) instead</td><td>PERFORMANCE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#DM_NUMBER_CTOR">DM_NUMBER_CTOR</a></td><td><a href=xref/org/apache/tika/parser/microsoft/PowerPointExtractor.html#208>208</a></td></tr><tr class="b"><td>Method org.apache.tika.parser.microsoft.PowerPointExtractor.extractTextBoxes(java.util.Hashtable,int,byte[],long) invokes inefficient Long(long) constructor; use Long.valueOf(long) instead</td><td>PERFORMANCE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#DM_NUMBER_CTOR">DM_NUMBER_CTOR</a></td><td><a href=xref/org/apache/tika/parser/microsoft/PowerPointExtractor.html#214>214</a></td></tr><tr class="a"><td>Useless control flow in org.apache.tika.parser.microsoft.PowerPointExtractor.extract(java.io.InputStream)</td><td>STYLE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#UCF_USELESS_CONTROL_FLOW">UCF_USELESS_CONTROL_FLOW</a></td><td>
 <a href=xref/org/apache/tika/parser/microsoft/PowerPointExtractor.html#94>94</a></td></tr></table></div><a name="org.apache.tika.parser.microsoft.Slide"></a><div class="section"><h3>org.apache.tika.parser.microsoft.Slide</h3><table class="bodyTable"><tr class="b"><th>Bug</th><th>Category</th><th>Details</th><th>Line</th></tr><tr class="a"><td>org.apache.tika.parser.microsoft.Slide.contents is transient but org.apache.tika.parser.microsoft.Slide isn't Serializable</td><td>STYLE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#SE_TRANSIENT_FIELD_OF_NONSERIALIZABLE_CLASS">SE_TRANSIENT_FIELD_OF_NONSERIALIZABLE_CLASS</a></td><td><a href=xref/org/apache/tika/parser/microsoft/Slide.html#-1>Not available</a></td></tr><tr class="b"><td>org.apache.tika.parser.microsoft.Slide.slideNumber is transient but org.apache.tika.parser.microsoft.Slide isn't Serializable</td><td>STYLE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#SE_TRANSIENT_FIELD_OF
 _NONSERIALIZABLE_CLASS">SE_TRANSIENT_FIELD_OF_NONSERIALIZABLE_CLASS</a></td><td><a href=xref/org/apache/tika/parser/microsoft/Slide.html#-1>Not available</a></td></tr></table></div><a name="org.apache.tika.parser.microsoft.TextBox"></a><div class="section"><h3>org.apache.tika.parser.microsoft.TextBox</h3><table class="bodyTable"><tr class="a"><th>Bug</th><th>Category</th><th>Details</th><th>Line</th></tr><tr class="b"><td>org.apache.tika.parser.microsoft.TextBox.currentID is transient but org.apache.tika.parser.microsoft.TextBox isn't Serializable</td><td>STYLE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#SE_TRANSIENT_FIELD_OF_NONSERIALIZABLE_CLASS">SE_TRANSIENT_FIELD_OF_NONSERIALIZABLE_CLASS</a></td><td><a href=xref/org/apache/tika/parser/microsoft/TextBox.html#-1>Not available</a></td></tr></table></div><a name="org.apache.tika.parser.microsoft.WordParser"></a><div class="section"><h3>org.apache.tika.parser.microsoft.WordParser</h3><table class="bo
 dyTable"><tr class="a"><th>Bug</th><th>Category</th><th>Details</th><th>Line</th></tr><tr class="b"><td>org.apache.tika.parser.microsoft.WordParser.extractText(org.apache.poi.poifs.filesystem.POIFSFileSystem,Appendable) ignores result of org.apache.poi.poifs.filesystem.DocumentInputStream.read(byte[])</td><td>BAD_PRACTICE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#RR_NOT_CHECKED">RR_NOT_CHECKED</a></td><td><a href=xref/org/apache/tika/parser/microsoft/WordParser.html#58>58</a></td></tr><tr class="a"><td>org.apache.tika.parser.microsoft.WordParser.extractText(org.apache.poi.poifs.filesystem.POIFSFileSystem,Appendable) ignores result of org.apache.poi.poifs.filesystem.DocumentInputStream.read(byte[])</td><td>BAD_PRACTICE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#RR_NOT_CHECKED">RR_NOT_CHECKED</a></td><td><a href=xref/org/apache/tika/parser/microsoft/WordParser.html#99>99</a></td></tr></table></div><a name="org.apache.tika.
 parser.opendocument.OpenOfficeParser"></a><div class="section"><h3>org.apache.tika.parser.opendocument.OpenOfficeParser</h3><table class="bodyTable"><tr class="b"><th>Bug</th><th>Category</th><th>Details</th><th>Line</th></tr><tr class="a"><td>Dead store to xmlMeta in method org.apache.tika.parser.opendocument.OpenOfficeParser.parse(java.io.InputStream)</td><td>STYLE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#DLS_DEAD_LOCAL_STORE">DLS_DEAD_LOCAL_STORE</a></td><td><a href=xref/org/apache/tika/parser/opendocument/OpenOfficeParser.html#57>57</a></td></tr></table></div><a name="org.apache.tika.utils.StringUtil"></a><div class="section"><h3>org.apache.tika.utils.StringUtil</h3><table class="bodyTable"><tr class="b"><th>Bug</th><th>Category</th><th>Details</th><th>Line</th></tr><tr class="a"><td>org.apache.tika.utils.StringUtil.resolveEncodingAlias(String) invokes inefficient new String(String) constructor; just use the argument</td><td>PERFORMANCE</td><
 td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#DM_STRING_CTOR">DM_STRING_CTOR</a></td><td><a href=xref/org/apache/tika/utils/StringUtil.html#199>199</a></td></tr></table></div></div>
+        <div class="section"><h2>FindBugs Bug Detector Report</h2><p>The following document contains the results of <a href="http://findbugs.sourceforge.net">FindBugs Report</a></p><p>FindBugs Version is <i>1.1.1</i></p><p>Threshold is <i>Normal</i></p><p>Effort is <i>Default</i></p></div><h2>Summary</h2><table class="bodyTable"><tr class="a"><th>Classes</th><th>Bugs</th><th>Errors</th><th>Missing Classes</th></tr><tr class="b"><td>554</td><td>8</td><td>22</td><td>38</td></tr></table></p></div><h2>Files</h2><table class="bodyTable"><tr class="a"><th>Class</th><th>Bugs</th></tr><tr class="b"><td><a href="#org.apache.tika.config.TikaConfig">org.apache.tika.config.TikaConfig</a></td><td>1</td></tr><tr class="a"><td><a href="#org.apache.tika.gui.TikaGUI">org.apache.tika.gui.TikaGUI</a></td><td>1</td></tr><tr class="b"><td><a href="#org.apache.tika.metadata.Metadata">org.apache.tika.metadata.Metadata</a></td><td>1</td></tr><tr class="a"><td><a href="#org.apache.tika.mime.MimeType
 $RootXML">org.apache.tika.mime.MimeType$RootXML</a></td><td>1</td></tr><tr class="b"><td><a href="#org.apache.tika.parser.ParsingReader">org.apache.tika.parser.ParsingReader</a></td><td>1</td></tr><tr class="a"><td><a href="#org.apache.tika.parser.microsoft.ExcelExtractor$PointComparator">org.apache.tika.parser.microsoft.ExcelExtractor$PointComparator</a></td><td>1</td></tr><tr class="b"><td><a href="#org.apache.tika.sax.TeeContentHandler">org.apache.tika.sax.TeeContentHandler</a></td><td>1</td></tr><tr class="a"><td><a href="#org.apache.tika.utils.StringUtil">org.apache.tika.utils.StringUtil</a></td><td>1</td></tr></table><a name="org.apache.tika.config.TikaConfig"></a><div class="section"><h3>org.apache.tika.config.TikaConfig</h3><table class="bodyTable"><tr class="b"><th>Bug</th><th>Category</th><th>Details</th><th>Line</th></tr><tr class="a"><td>Write to static field org.apache.tika.config.TikaConfig.mimeTypes from instance method org.apache.tika.config.TikaConfig.TikaCo
 nfig(org.w3c.dom.Element)</td><td>STYLE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#ST_WRITE_TO_STATIC_FROM_INSTANCE_METHOD">ST_WRITE_TO_STATIC_FROM_INSTANCE_METHOD</a></td><td><a href=xref/org/apache/tika/config/TikaConfig.html#79>79</a></td></tr></table></div><a name="org.apache.tika.gui.TikaGUI"></a><div class="section"><h3>org.apache.tika.gui.TikaGUI</h3><table class="bodyTable"><tr class="b"><th>Bug</th><th>Category</th><th>Details</th><th>Line</th></tr><tr class="a"><td>Class org.apache.tika.gui.TikaGUI defines non-transient non-serializable instance field parser</td><td>BAD_PRACTICE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#SE_BAD_FIELD">SE_BAD_FIELD</a></td><td><a href=xref/org/apache/tika/gui/TikaGUI.html#-1>Not available</a></td></tr></table></div><a name="org.apache.tika.metadata.Metadata"></a><div class="section"><h3>org.apache.tika.metadata.Metadata</h3><table class="bodyTable"><tr class="b"><th>Bug</th><th>C
 ategory</th><th>Details</th><th>Line</th></tr><tr class="a"><td>org.apache.tika.metadata.Metadata defines equals and uses Object.hashCode()</td><td>BAD_PRACTICE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#HE_EQUALS_USE_HASHCODE">HE_EQUALS_USE_HASHCODE</a></td><td><a href=xref/org/apache/tika/metadata/Metadata.html#173>173-201</a></td></tr></table></div><a name="org.apache.tika.mime.MimeType$RootXML"></a><div class="section"><h3>org.apache.tika.mime.MimeType$RootXML</h3><table class="bodyTable"><tr class="b"><th>Bug</th><th>Category</th><th>Details</th><th>Line</th></tr><tr class="a"><td>Should org.apache.tika.mime.MimeType$RootXML be a _static_ inner class?</td><td>PERFORMANCE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#SIC_INNER_SHOULD_BE_STATIC">SIC_INNER_SHOULD_BE_STATIC</a></td><td><a href=xref/org/apache/tika/mime/MimeType$RootXML.html#-1>Not available</a></td></tr></table></div><a name="org.apache.tika.parser.ParsingR
 eader"></a><div class="section"><h3>org.apache.tika.parser.ParsingReader</h3><table class="bodyTable"><tr class="b"><th>Bug</th><th>Category</th><th>Details</th><th>Line</th></tr><tr class="a"><td>org.apache.tika.parser.ParsingReader.ParsingReader(Parser,java.io.InputStream,org.apache.tika.metadata.Metadata) invokes java.lang.Thread.start()</td><td>MT_CORRECTNESS</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#SC_START_IN_CTOR">SC_START_IN_CTOR</a></td><td><a href=xref/org/apache/tika/parser/ParsingReader.html#144>144</a></td></tr></table></div><a name="org.apache.tika.parser.microsoft.ExcelExtractor$PointComparator"></a><div class="section"><h3>org.apache.tika.parser.microsoft.ExcelExtractor$PointComparator</h3><table class="bodyTable"><tr class="b"><th>Bug</th><th>Category</th><th>Details</th><th>Line</th></tr><tr class="a"><td>org.apache.tika.parser.microsoft.ExcelExtractor$PointComparator implements Comparator but not Serializable</td><td>BAD_PRACTI
 CE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#SE_COMPARATOR_SHOULD_BE_SERIALIZABLE">SE_COMPARATOR_SHOULD_BE_SERIALIZABLE</a></td><td><a href=xref/org/apache/tika/parser/microsoft/ExcelExtractor$PointComparator.html#-1>Not available</a></td></tr></table></div><a name="org.apache.tika.sax.TeeContentHandler"></a><div class="section"><h3>org.apache.tika.sax.TeeContentHandler</h3><table class="bodyTable"><tr class="b"><th>Bug</th><th>Category</th><th>Details</th><th>Line</th></tr><tr class="a"><td>org.apache.tika.sax.TeeContentHandler.TeeContentHandler(org.xml.sax.ContentHandler[]) may expose internal representation by storing an externally mutable object into org.apache.tika.sax.TeeContentHandler.handlers</td><td>MALICIOUS_CODE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#EI_EXPOSE_REP2">EI_EXPOSE_REP2</a></td><td><a href=xref/org/apache/tika/sax/TeeContentHandler.html#34>34</a></td></tr></table></div><a name="org.apache.tika.u
 tils.StringUtil"></a><div class="section"><h3>org.apache.tika.utils.StringUtil</h3><table class="bodyTable"><tr class="b"><th>Bug</th><th>Category</th><th>Details</th><th>Line</th></tr><tr class="a"><td>org.apache.tika.utils.StringUtil.resolveEncodingAlias(String) invokes inefficient new String(String) constructor; just use the argument</td><td>PERFORMANCE</td><td><a href="http://findbugs.sourceforge.net/bugDescriptions.html#DM_STRING_CTOR">DM_STRING_CTOR</a></td><td><a href=xref/org/apache/tika/utils/StringUtil.html#199>199</a></td></tr></table></div></div>
       </div>
     </div>
     <div class="clear">

Modified: incubator/tika/site/index.html
URL: http://svn.apache.org/viewvc/incubator/tika/site/index.html?rev=664072&r1=664071&r2=664072&view=diff
==============================================================================
--- incubator/tika/site/index.html (original)
+++ incubator/tika/site/index.html Fri Jun  6 11:34:43 2008
@@ -78,7 +78,7 @@
         </li>
               
     <li class="none">
-              <a href="http://www.apache.org/dyn/closer.cgi/incubator/tika">Download</a>
+              <a href="download.html">Download</a>
         </li>
           </ul>
           <h5>Project Documentation</h5>

Modified: incubator/tika/site/integration.html
URL: http://svn.apache.org/viewvc/incubator/tika/site/integration.html?rev=664072&r1=664071&r2=664072&view=diff
==============================================================================
--- incubator/tika/site/integration.html (original)
+++ incubator/tika/site/integration.html Fri Jun  6 11:34:43 2008
@@ -78,7 +78,7 @@
         </li>
               
     <li class="none">
-              <a href="http://www.apache.org/dyn/closer.cgi/incubator/tika">Download</a>
+              <a href="download.html">Download</a>
         </li>
           </ul>
           <h5>Project Documentation</h5>

Modified: incubator/tika/site/issue-tracking.html
URL: http://svn.apache.org/viewvc/incubator/tika/site/issue-tracking.html?rev=664072&r1=664071&r2=664072&view=diff
==============================================================================
--- incubator/tika/site/issue-tracking.html (original)
+++ incubator/tika/site/issue-tracking.html Fri Jun  6 11:34:43 2008
@@ -78,7 +78,7 @@
         </li>
               
     <li class="none">
-              <a href="http://www.apache.org/dyn/closer.cgi/incubator/tika">Download</a>
+              <a href="download.html">Download</a>
         </li>
           </ul>
           <h5>Project Documentation</h5>

Modified: incubator/tika/site/license.html
URL: http://svn.apache.org/viewvc/incubator/tika/site/license.html?rev=664072&r1=664071&r2=664072&view=diff
==============================================================================
--- incubator/tika/site/license.html (original)
+++ incubator/tika/site/license.html Fri Jun  6 11:34:43 2008
@@ -78,7 +78,7 @@
         </li>
               
     <li class="none">
-              <a href="http://www.apache.org/dyn/closer.cgi/incubator/tika">Download</a>
+              <a href="download.html">Download</a>
         </li>
           </ul>
           <h5>Project Documentation</h5>

Modified: incubator/tika/site/mail-lists.html
URL: http://svn.apache.org/viewvc/incubator/tika/site/mail-lists.html?rev=664072&r1=664071&r2=664072&view=diff
==============================================================================
--- incubator/tika/site/mail-lists.html (original)
+++ incubator/tika/site/mail-lists.html Fri Jun  6 11:34:43 2008
@@ -78,7 +78,7 @@
         </li>
               
     <li class="none">
-              <a href="http://www.apache.org/dyn/closer.cgi/incubator/tika">Download</a>
+              <a href="download.html">Download</a>
         </li>
           </ul>
           <h5>Project Documentation</h5>

Modified: incubator/tika/site/project-info.html
URL: http://svn.apache.org/viewvc/incubator/tika/site/project-info.html?rev=664072&r1=664071&r2=664072&view=diff
==============================================================================
--- incubator/tika/site/project-info.html (original)
+++ incubator/tika/site/project-info.html Fri Jun  6 11:34:43 2008
@@ -78,7 +78,7 @@
         </li>
               
     <li class="none">
-              <a href="http://www.apache.org/dyn/closer.cgi/incubator/tika">Download</a>
+              <a href="download.html">Download</a>
         </li>
           </ul>
           <h5>Project Documentation</h5>

Modified: incubator/tika/site/project-reports.html
URL: http://svn.apache.org/viewvc/incubator/tika/site/project-reports.html?rev=664072&r1=664071&r2=664072&view=diff
==============================================================================
--- incubator/tika/site/project-reports.html (original)
+++ incubator/tika/site/project-reports.html Fri Jun  6 11:34:43 2008
@@ -78,7 +78,7 @@
         </li>
               
     <li class="none">
-              <a href="http://www.apache.org/dyn/closer.cgi/incubator/tika">Download</a>
+              <a href="download.html">Download</a>
         </li>
           </ul>
           <h5>Project Documentation</h5>

Modified: incubator/tika/site/project-summary.html
URL: http://svn.apache.org/viewvc/incubator/tika/site/project-summary.html?rev=664072&r1=664071&r2=664072&view=diff
==============================================================================
--- incubator/tika/site/project-summary.html (original)
+++ incubator/tika/site/project-summary.html Fri Jun  6 11:34:43 2008
@@ -78,7 +78,7 @@
         </li>
               
     <li class="none">
-              <a href="http://www.apache.org/dyn/closer.cgi/incubator/tika">Download</a>
+              <a href="download.html">Download</a>
         </li>
           </ul>
           <h5>Project Documentation</h5>

Modified: incubator/tika/site/rat-report.html
URL: http://svn.apache.org/viewvc/incubator/tika/site/rat-report.html?rev=664072&r1=664071&r2=664072&view=diff
==============================================================================
--- incubator/tika/site/rat-report.html (original)
+++ incubator/tika/site/rat-report.html Fri Jun  6 11:34:43 2008
@@ -78,7 +78,7 @@
         </li>
               
     <li class="none">
-              <a href="http://www.apache.org/dyn/closer.cgi/incubator/tika">Download</a>
+              <a href="download.html">Download</a>
         </li>
           </ul>
           <h5>Project Documentation</h5>
@@ -170,336 +170,420 @@
     </div>
     <div id="bodyColumn">
       <div id="contentBox">
-        <div class="section"><h2>RAT (Release Audit Tool) results</h2><p>The following document contains the results of <a href="http://code.google.com/p/arat/">RAT (Release Audit Tool)</a>.</p><p><div class="source"><pre>
-*****************************************************
-Summary
--------
-Notes: 4
-Binaries: 7
-Archives: 1
-Standards: 96
-
-Apache Licensed: 88
-Generated Documents: 0
-
-JavaDocs are generated and so license header is optional
-Generated files do not required license headers
-
-8 Unknown Licenses
-
-*******************************
-
-Archives (+ indicates readable, $ unreadable): 
-
- + src/test/resources/test-documents/test-documents.zip
- 
-*****************************************************
-  Files with AL headers will be marked L
-  Binary files (which do not require AL headers) will be marked B
-  Compressed archives will be marked A
-  Notices, licenses etc will be marked N
- !????? CHANGES.txt
-  AL    HEADER.txt
-  N     KEYS
-  N     LICENSE.txt
-  N     NOTICE.txt
-  AL    pom.xml
-  N     README.txt
-  AL    src/main/assembly/bin.xml
-  AL    src/main/assembly/src.xml
-  AL    src/main/java/org/apache/tika/config/TikaConfig.java
-  AL    src/main/java/org/apache/tika/exception/CauseIOException.java
-  AL    src/main/java/org/apache/tika/exception/TikaException.java
-  AL    src/main/java/org/apache/tika/metadata/CreativeCommons.java
-  AL    src/main/java/org/apache/tika/metadata/DublinCore.java
-  AL    src/main/java/org/apache/tika/metadata/HttpHeaders.java
-  AL    src/main/java/org/apache/tika/metadata/Metadata.java
-  AL    src/main/java/org/apache/tika/metadata/MSOffice.java
-  AL    src/main/java/org/apache/tika/metadata/package.html
-  AL    src/main/java/org/apache/tika/metadata/SpellCheckedMetadata.java
-  AL    src/main/java/org/apache/tika/metadata/TikaMetadataKeys.java
-  AL    src/main/java/org/apache/tika/metadata/TikaMimeKeys.java
-  AL    src/main/java/org/apache/tika/mime/Clause.java
-  AL    src/main/java/org/apache/tika/mime/HexCoDec.java
-  AL    src/main/java/org/apache/tika/mime/Magic.java
-  AL    src/main/java/org/apache/tika/mime/MagicClause.java
-  AL    src/main/java/org/apache/tika/mime/MagicMatch.java
-  AL    src/main/java/org/apache/tika/mime/MimeType.java
-  AL    src/main/java/org/apache/tika/mime/MimeTypeException.java
-  AL    src/main/java/org/apache/tika/mime/MimeTypes.java
-  AL    src/main/java/org/apache/tika/mime/MimeTypesFactory.java
-  AL    src/main/java/org/apache/tika/mime/MimeTypesReader.java
-  AL    src/main/java/org/apache/tika/mime/Operator.java
-  AL    src/main/java/org/apache/tika/mime/Patterns.java
-  AL    src/main/java/org/apache/tika/parser/AutoDetectParser.java
-  AL    src/main/java/org/apache/tika/parser/EmptyParser.java
-  AL    src/main/java/org/apache/tika/parser/ErrorParser.java
-  AL    src/main/java/org/apache/tika/parser/html/HtmlParser.java
-  AL    src/main/java/org/apache/tika/parser/microsoft/ExcelEventParser.java
-  AL    src/main/java/org/apache/tika/parser/microsoft/ExcelParser.java
-  AL    src/main/java/org/apache/tika/parser/microsoft/FilteredStringWriter.java
-  AL    src/main/java/org/apache/tika/parser/microsoft/OfficeParser.java
-  AL    src/main/java/org/apache/tika/parser/microsoft/PowerPointExtractor.java
-  AL    src/main/java/org/apache/tika/parser/microsoft/PowerPointParser.java
-  AL    src/main/java/org/apache/tika/parser/microsoft/PPTConstants.java
-  AL    src/main/java/org/apache/tika/parser/microsoft/Slide.java
-  AL    src/main/java/org/apache/tika/parser/microsoft/TextBox.java
-  AL    src/main/java/org/apache/tika/parser/microsoft/Word6CHPBinTable.java
-  AL    src/main/java/org/apache/tika/parser/microsoft/Word6Extractor.java
-  AL    src/main/java/org/apache/tika/parser/microsoft/WordParser.java
-  AL    src/main/java/org/apache/tika/parser/microsoft/WordTextBuffer.java
-  AL    src/main/java/org/apache/tika/parser/microsoft/WordTextPiece.java
-  AL    src/main/java/org/apache/tika/parser/opendocument/OpenOfficeEntityResolver.java
-  AL    src/main/java/org/apache/tika/parser/opendocument/OpenOfficeParser.java
-  AL    src/main/java/org/apache/tika/parser/Parser.java
-  AL    src/main/java/org/apache/tika/parser/ParserDecorator.java
-  AL    src/main/java/org/apache/tika/parser/ParserPostProcessor.java
-  AL    src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java
-  AL    src/main/java/org/apache/tika/parser/pdf/PDFParser.java
-  AL    src/main/java/org/apache/tika/parser/rtf/RTFParser.java
-  AL    src/main/java/org/apache/tika/parser/txt/TXTParser.java
-  AL    src/main/java/org/apache/tika/parser/xml/XMLParser.java
-  AL    src/main/java/org/apache/tika/sax/AppendableAdaptor.java
-  AL    src/main/java/org/apache/tika/sax/ContentHandlerDecorator.java
-  AL    src/main/java/org/apache/tika/sax/TeeContentHandler.java
-  AL    src/main/java/org/apache/tika/sax/WriteOutContentHandler.java
-  AL    src/main/java/org/apache/tika/sax/XHTMLContentHandler.java
-  AL    src/main/java/org/apache/tika/utils/ParseUtils.java
-  AL    src/main/java/org/apache/tika/utils/RegexUtils.java
-  AL    src/main/java/org/apache/tika/utils/RereadableInputStream.java
-  AL    src/main/java/org/apache/tika/utils/StringUtil.java
-  AL    src/main/java/org/apache/tika/utils/Utils.java
-  AL    src/main/resources/mime/tika-mimetypes.xml
-  AL    src/main/resources/tika-config.xml
-  AL    src/site/apt/index.apt
-  B     src/site/resources/tika.png
-  B     src/site/resources/tika.xcf
-  AL    src/site/site.xml
-  AL    src/test/java/org/apache/tika/exception/CauseIOExceptionTest.java
-  AL    src/test/java/org/apache/tika/metadata/TestMetadata.java
-  AL    src/test/java/org/apache/tika/metadata/TestSpellCheckedMetadata.java
-  AL    src/test/java/org/apache/tika/mime/MimeTypesTest.java
-  AL    src/test/java/org/apache/tika/mime/MimeTypeTest.java
-  AL    src/test/java/org/apache/tika/mime/PatternsTest.java
-  AL    src/test/java/org/apache/tika/mime/TestMimeTypes.java
-  AL    src/test/java/org/apache/tika/parser/AutoDetectParserTest.java
-  AL    src/test/java/org/apache/tika/parser/html/HtmlParserTest.java
-  AL    src/test/java/org/apache/tika/parser/microsoft/ExcelParserTest.java
-  AL    src/test/java/org/apache/tika/parser/microsoft/PowerPointParserTest.java
-  AL    src/test/java/org/apache/tika/parser/microsoft/WordParserTest.java
-  AL    src/test/java/org/apache/tika/parser/txt/TXTParserTest.java
-  AL    src/test/java/org/apache/tika/sax/AppendableAdaptorTest.java
-  AL    src/test/java/org/apache/tika/TestParsers.java
-  AL    src/test/java/org/apache/tika/TestRereadableInputStream.java
-  AL    src/test/java/org/apache/tika/utils/RegexUtilsTest.java
-  AL    src/test/resources/log4j/log4j.properties
-  A     src/test/resources/test-documents/test-documents.zip
-  B     src/test/resources/test-documents/testEXCEL.xls
- !????? src/test/resources/test-documents/testHTML.html
- !????? src/test/resources/test-documents/testHTML_utf8.html
-  B     src/test/resources/test-documents/testOpenOffice2.odt
-  B     src/test/resources/test-documents/testPDF.pdf
-  B     src/test/resources/test-documents/testPPT.ppt
- !????? src/test/resources/test-documents/testRTF.rtf
- !????? src/test/resources/test-documents/testTXT.txt
-  B     src/test/resources/test-documents/testWORD.doc
- !????? src/test/resources/test-documents/testXML.xml
- !????? tika.log
- !????? velocity.log
- 
- *****************************************************
- Printing headers for files without AL header...
- 
- 
- =======================================================================
- ==CHANGES.txt
- =======================================================================
- Tika Change Log
-
-Release 0.1-incubating - 12/27/2007
-
-1. TIKA-5 - Port Metadata Framework from Nutch (mattmann)
-
-2. TIKA-11 - Consolidate test classes into a src/test/java directory tree (mattmann)
-
-3. TIKA-15 - Utils.print does not print a Content having no value (jukka)
-
-4. TIKA-19 - org.apache.tika.TestParsers fails (bdelacretaz)
-
-5. TIKA-16 - Issues with data files used for testing by TestParsers (bdelacretaz)
-
-6. TIKA-14 - MimeTypeUtils.getMimeType() returns the default mime type for 
-             .odt (Open Office) file (bdelacretaz)
-
-7. TIKA-12 - Add URL capability to MimeTypesUtils (jukka)
-
-8. TIKA-13 - Fix obsolete package names in config.xml (siren)
-
-9. TIKA-10 - Remove MimeInfoException catch clauses and import from TestParsers (siren)
-
-10. TIKA-8 - Replaced the jmimeinfo dependency with a trivial mime type detector (jukka)
-
-11. TIKA-7 - Added the Lius Lite code. Added missing dependencies to POM (jukka)
-
-12. TIKA-18 - &quot;Office&quot; interface should be renamed &quot;MSOffice&quot; (mattmann)
-
-13. TIKA-23 - Decouple Parser from ParserConfig (jukka)
-
-14. TIKA-6 - Port Nutch (or better) MimeType detection system into Tika (J. Charron &amp; mattmann)
-
-15. TIKA-25 - Removed hardcoded reference to C:\oo.xml in OpenOfficeParser (K. Bennett &amp; jukka)
-
-16. TIKA-17 - Need to support URL's for input resources. (K. Bennett &amp; mattmann)
-
-17. TIKA-22 - Remove @author tags from the java source (mattmann)
-
-18. TIKA-21 - Simplified configuration code (jukka)
-
-19. TIKA-17 - Rename all &quot;Lius&quot; classes to be &quot;Tika&quot; classes (jukka)
-
-20. TIKA-30 - Added utility constructors to TikaConfig (K. Bennett &amp; jukka)
-
-21. TIKA-28 - Rename config.xml to tika-config.xml or similar (mattmann)
-
-22. TIKA-26 - Use Map&lt;String, Content&gt; instead of List&lt;Content&gt; (jukka)
-
-23. TIKA-31 - protected Parser.parse(InputStream stream,
-
- =======================================================================
- ==src/test/resources/test-documents/testHTML.html
- =======================================================================
- &lt;html&gt;
-	&lt;head&gt;
-		&lt;title&gt;Title : Test Indexation Html&lt;/title&gt;	
-	&lt;/head&gt;
-	&lt;body&gt;
-		&lt;h1&gt;Test Indexation Html&lt;/h1&gt;
-		&lt;p&gt;Indexation du fichier&lt;/p&gt;
-	&lt;/body&gt;	
-&lt;/html&gt;
-
- =======================================================================
- ==src/test/resources/test-documents/testHTML_utf8.html
- =======================================================================
- &lt;html&gt;
-	&lt;head&gt;
-		&lt;title&gt;Title : Tilte with UTF-8 chars ???§??&lt;/title&gt;	
-	&lt;/head&gt;
-	&lt;body&gt;
-		&lt;h1&gt;Content with UTF-8 chars&lt;/h1&gt;
-		&lt;p&gt;???§??&lt;/p&gt;
-	&lt;/body&gt;	
-&lt;/html&gt;
-
- =======================================================================
- ==src/test/resources/test-documents/testRTF.rtf
- =======================================================================
- {\rtf1\ansi\ansicpg1252\uc1\deff0\stshfdbch0\stshfloch0\stshfhich0\stshfbi0\deflang1036\deflangfe1036{\fonttbl{\f0\froman\fcharset0\fprq2{\*\panose 02020603050405020304}Times New Roman;}{\f37\froman\fcharset238\fprq2 Times New Roman CE;}
-{\f38\froman\fcharset204\fprq2 Times New Roman Cyr;}{\f40\froman\fcharset161\fprq2 Times New Roman Greek;}{\f41\froman\fcharset162\fprq2 Times New Roman Tur;}{\f42\froman\fcharset177\fprq2 Times New Roman (Hebrew);}
-{\f43\froman\fcharset178\fprq2 Times New Roman (Arabic);}{\f44\froman\fcharset186\fprq2 Times New Roman Baltic;}{\f45\froman\fcharset163\fprq2 Times New Roman (Vietnamese);}}{\colortbl;\red0\green0\blue0;\red0\green0\blue255;\red0\green255\blue255;
-\red0\green255\blue0;\red255\green0\blue255;\red255\green0\blue0;\red255\green255\blue0;\red255\green255\blue255;\red0\green0\blue128;\red0\green128\blue128;\red0\green128\blue0;\red128\green0\blue128;\red128\green0\blue0;\red128\green128\blue0;
-\red128\green128\blue128;\red192\green192\blue192;}{\stylesheet{\ql \li0\ri0\widctlpar\aspalpha\aspnum\faauto\adjustright\rin0\lin0\itap0 \fs24\lang1036\langfe1036\cgrid\langnp1036\langfenp1036 \snext0 Normal;}{\*\cs10 \additive \ssemihidden 
-Default Paragraph Font;}{\*\ts11\tsrowd\trftsWidthB3\trpaddl108\trpaddr108\trpaddfl3\trpaddft3\trpaddfb3\trpaddfr3\trcbpat1\trcfpat1\tscellwidthfts0\tsvertalt\tsbrdrt\tsbrdrl\tsbrdrb\tsbrdrr\tsbrdrdgl\tsbrdrdgr\tsbrdrh\tsbrdrv 
-\ql \li0\ri0\widctlpar\aspalpha\aspnum\faauto\adjustright\rin0\lin0\itap0 \fs20\lang1024\langfe1024\cgrid\langnp1024\langfenp1024 \snext11 \ssemihidden Normal Table;}}{\*\latentstyles\lsdstimax156\lsdlockeddef0}{\*\rsidtbl \rsid2954171\rsid10375891}
-{\*\generator Microsoft Word 11.0.6568;}{\info{\title Test d\'92indexation Word}{\author Bibliotheque}{\operator Bibliotheque}{\creatim\yr2006\mo5\dy18\hr12\min19}{\revtim\yr2006\mo5\dy18\hr12\min19}{\version2}{\edmins0}{\nofpages1}{\nofwords3}
-{\nofchars21}{\*\company Universite Laval}{\nofcharsws23}{\vern24579}}\paperw11906\paperh16838\margl1417\margr1417\margt1417\margb1417 
-\deftab708\widowctrl\ftnbj\aenddoc\hyphhotz425\noxlattoyen\expshrtn\noultrlspc\dntblnsbdb\nospaceforul\formshade\horzdoc\dgmargin\dghspace180\dgvspace180\dghorigin1417\dgvorigin1417\dghshow1\dgvshow1
-\jexpand\viewkind1\viewscale100\pgbrdrhead\pgbrdrfoot\splytwnine\ftnlytwnine\htmautsp\nolnhtadjtbl\useltbaln\alntblind\lytcalctblwd\lyttblrtgr\lnbrkrule\nobrkwrptbl\snaptogridincell\allowfieldendsel\wrppunct\asianbrkrule\nojkernpunct\rsidroot2954171 \fet0
-\sectd \linex0\headery708\footery708\colsx708\endnhere\sectlinegrid360\sectdefaultcl\sftnbj {\*\pnseclvl1\pnucrm\pnstart1\pnindent720\pnhang {\pntxta .}}{\*\pnseclvl2\pnucltr\pnstart1\pnindent720\pnhang {\pntxta .}}{\*\pnseclvl3
-\pndec\pnstart1\pnindent720\pnhang {\pntxta .}}{\*\pnseclvl4\pnlcltr\pnstart1\pnindent720\pnhang {\pntxta )}}{\*\pnseclvl5\pndec\pnstart1\pnindent720\pnhang {\pntxtb (}{\pntxta )}}{\*\pnseclvl6\pnlcltr\pnstart1\pnindent720\pnhang {\pntxtb (}{\pntxta )}}
-{\*\pnseclvl7\pnlcrm\pnstart1\pnindent720\pnhang {\pntxtb (}{\pntxta )}}{\*\pnseclvl8\pnlcltr\pnstart1\pnindent720\pnhang {\pntxtb (}{\pntxta )}}{\*\pnseclvl9\pnlcrm\pnstart1\pnindent720\pnhang {\pntxtb (}{\pntxta )}}\pard\plain 
-\ql \li0\ri0\widctlpar\aspalpha\aspnum\faauto\adjustright\rin0\lin0\itap0 \fs24\lang1036\langfe1036\cgrid\langnp1036\langfenp1036 {\insrsid2954171 Test d\rquote indexation Word
-\par 
-\par }}
-
- =======================================================================
- ==src/test/resources/test-documents/testTXT.txt
- =======================================================================
- Test d'indexation de Txt
-http://www.apache.org
-
- =======================================================================
- ==src/test/resources/test-documents/testXML.xml
- =======================================================================
- &lt;?xml version=&quot;1.0&quot; encoding=&quot;UTF-8&quot;?&gt;
-&lt;oaidc:dc xmlns:dc=&quot;http://purl.org/dc/elements/1.1/&quot; xmlns:oaidc=&quot;http://www.openarchives.org/OAI/2.0/oai_dc/&quot;&gt;
-
-	&lt;dc:title&gt;Archim?®de et Lius&lt;/dc:title&gt;
-
-	&lt;dc:creator&gt;Rida Benjelloun&lt;/dc:creator&gt;
-
-	&lt;dc:subject&gt;Java&lt;/dc:subject&gt;
-
-	&lt;dc:subject&gt;XML&lt;/dc:subject&gt;
-
-	&lt;dc:subject&gt;XSLT&lt;/dc:subject&gt;
-
-	&lt;dc:subject&gt;JDOM&lt;/dc:subject&gt;
- 
-	&lt;dc:subject&gt;Indexation&lt;/dc:subject&gt;
-
-	&lt;dc:description&gt;Framework d'indexation des documents XML, HTML, PDF etc.. &lt;/dc:description&gt;
-
-	&lt;dc:identifier&gt;http://www.apache.org&lt;/dc:identifier&gt;
-
-	&lt;dc:date&gt;2000-12&lt;/dc:date&gt;
-
-	&lt;dc:type&gt;test&lt;/dc:type&gt;
-
-	&lt;dc:format&gt;application/msword&lt;/dc:format&gt;
-
-	&lt;dc:language&gt;Fr&lt;/dc:language&gt;
-
-	&lt;dc:rights&gt;Non restreint&lt;/dc:rights&gt;	
-
-&lt;/oaidc:dc&gt;
-
- =======================================================================
- ==tika.log
- =======================================================================
- 
- =======================================================================
- ==velocity.log
- =======================================================================
- Sun Jan 06 19:01:24 PST 2008  [debug] AvalonLogSystem initialized using logfile 'velocity.log'
-Sun Jan 06 19:01:24 PST 2008   [info] ************************************************************** 
-Sun Jan 06 19:01:24 PST 2008   [info] Starting Jakarta Velocity v1.4
-Sun Jan 06 19:01:24 PST 2008   [info] RuntimeInstance initializing.
-Sun Jan 06 19:01:24 PST 2008   [info] Default Properties File: org/apache/velocity/runtime/defaults/velocity.properties
-Sun Jan 06 19:01:24 PST 2008   [info] Trying to use logger class org.apache.velocity.runtime.log.AvalonLogSystem
-Sun Jan 06 19:01:24 PST 2008   [info] Using logger class org.apache.velocity.runtime.log.AvalonLogSystem
-Sun Jan 06 19:01:24 PST 2008   [info] Default ResourceManager initializing. (class org.apache.velocity.runtime.resource.ResourceManagerImpl)
-Sun Jan 06 19:01:24 PST 2008   [info] Resource Loader Instantiated: org.apache.velocity.runtime.resource.loader.FileResourceLoader
-Sun Jan 06 19:01:24 PST 2008   [info] FileResourceLoader : initialization starting.
-Sun Jan 06 19:01:24 PST 2008   [info] FileResourceLoader : adding path '/Users/mattmann/.maven/cache/maven-xdoc-plugin-1.8/plugin-resources/templates'
-Sun Jan 06 19:01:24 PST 2008   [info] FileResourceLoader : initialization complete.
-Sun Jan 06 19:01:24 PST 2008   [info] ResourceCache : initialized. (class org.apache.velocity.runtime.resource.ResourceCacheImpl)
-Sun Jan 06 19:01:24 PST 2008   [info] Default ResourceManager initialization complete.
-Sun Jan 06 19:01:24 PST 2008   [info] Loaded System Directive: org.apache.velocity.runtime.directive.Literal
-Sun Jan 06 19:01:24 PST 2008   [info] Loaded System Directive: org.apache.velocity.runtime.directive.Macro
-Sun Jan 06 19:01:24 PST 2008   [info] Loaded System Directive: org.apache.velocity.runtime.directive.Parse
-Sun Jan 06 19:01:24 PST 2008   [info] Loaded System Directive: org.apache.velocity.runtime.directive.Include
-Sun Jan 06 19:01:24 PST 2008   [info] Loaded System Directive: org.apache.velocity.runtime.directive.Foreach
-Sun Jan 06 19:01:24 PST 2008   [info] Created: 20 parsers.
-Sun Jan 06 19:01:24 PST 2008   [info] Velocimacro : initialization starting.
-Sun Jan 06 19:01:24 PST 2008   [info] Velocimacro : adding VMs from VM library template : VM_global_library.vm
-Sun Jan 06 19:01:24 PST 2008  [error] ResourceManager : unable to find resource 'VM_global_library.vm' in any resource loader.
-Sun Jan 06 19:01:24 PST 2008   [info] Velocimacro : error using  VM library template VM_global_library.vm : org.apache.velocity.exception.ResourceNotFoundException: Unable to find resource 'VM_global_library.vm'
-Sun Jan 06 19:01:24 PST 2008   [info] Velocimacro :  VM library template macro registration complete.
-Sun Jan 06 19:01:24 PST 2008   [info] Velocimacro : allowInline = true : VMs can be defined inline in templates
-Sun Jan 06 19:01:24 PST 2008   [info] Velocimacro : allowInlineToOverride = false : VMs defined inline may NOT replace previous VM definitions
-Sun Jan 06 19:01:24 PST 2008   [info] Velocimacro : allowInlineLocal = false : VMs defined inline will be  global in scope if allowed.
-Sun Jan 06 19:01:24 PST 2008   [info] Velocimacro : messages on  : VM system will output logging messages
-Sun Jan 06 19:01:24 PST 2008   [info] Velocimacro : autoload off  : VM system will not automatically reload global library macros
-Sun Jan 06 19:01:24 PST 2008   [info] Velocimacro : initialization complete.
-Sun Jan 06 19:01:24 PST 2008   [info] Velocity successfully started.
-Sun Jan 06 19:01:24 PST 2008   [info] ResourceManager : found cvs-usage.xml with loader org.apache.velocity.runtime.resource.loader.FileResourceLoader
-Sun Jan 06 19:01:24 PST 2008  [error] RHS of #set statement is null. Context will not be modified. cvs-usage.xml [line 28, column 5]
-Sun Jan 06 19:01:24 PST 2008   [info] ResourceManager : found index.xml with loader org.apache.velocity.runtime.resource.loader.FileResourceLoader
-Sun Jan 06 19:01:24 PST 2008   [info] ResourceManager : found maven-reports.xml with loader org.apache.velocity.runtime.resource.loader.FileResourceLoader
-Sun Jan 06 19:01:24 PST 2008   [info] ResourceManager : found dependencies.xml with loader org.apache.velocity.runtime.resource.loader.FileResourceLoader
-Sun Jan 06 19:01:24 PST 2008   [info] ResourceManager : found issue-tracking.xml with loader org.apache.velocity.runtime.resource.loader.FileResourceLoader
-Sun Jan 06 19:01:24 PST 2008  [error] Method getText threw exception for reference $escape in template issue-tracking.xml at  [29,22]
+        <div class="section"><h2>RAT (Release Audit Tool) results</h2><p>The following document contains the results of <a href="http://code.google.com/p/arat/">RAT (Release Audit Tool)</a>.</p><p><div class="source"><pre>
+*****************************************************
+Summary
+-------
+Notes: 4
+Binaries: 13
+Archives: 1
+Standards: 118
+
+Apache Licensed: 108
+Generated Documents: 0
+
+JavaDocs are generated and so license header is optional
+Generated files do not required license headers
+
+10 Unknown Licenses
+
+*******************************
+
+Archives (+ indicates readable, $ unreadable): 
+
+ + src/test/resources/test-documents/test-documents.zip
+ 
+*****************************************************
+  Files with AL headers will be marked L
+  Binary files (which do not require AL headers) will be marked B
+  Compressed archives will be marked A
+  Notices, licenses etc will be marked N
+ !????? .checkstyle
+ !????? .externalToolBuilders/Maven_Ant_Builder.launch
+ !????? CHANGES.txt
+  AL    HEADER.txt
+  N     KEYS
+  N     LICENSE.txt
+ !????? maven-eclipse.xml
+  N     NOTICE.txt
+  AL    pom.xml
+  N     README.txt
+  AL    src/main/assembly/standalone.xml
+  AL    src/main/java/org/apache/tika/cli/TikaCLI.java
+  AL    src/main/java/org/apache/tika/config/TikaConfig.java
+  AL    src/main/java/org/apache/tika/exception/TikaException.java
+  AL    src/main/java/org/apache/tika/gui/ParsingTransferHandler.java
+  AL    src/main/java/org/apache/tika/gui/TikaGUI.java
+  AL    src/main/java/org/apache/tika/metadata/CreativeCommons.java
+  AL    src/main/java/org/apache/tika/metadata/DublinCore.java
+  AL    src/main/java/org/apache/tika/metadata/HttpHeaders.java
+  AL    src/main/java/org/apache/tika/metadata/Metadata.java
+  AL    src/main/java/org/apache/tika/metadata/MSOffice.java
+  AL    src/main/java/org/apache/tika/metadata/package.html
+  AL    src/main/java/org/apache/tika/metadata/SpellCheckedMetadata.java
+  AL    src/main/java/org/apache/tika/metadata/TikaMetadataKeys.java
+  AL    src/main/java/org/apache/tika/metadata/TikaMimeKeys.java
+  AL    src/main/java/org/apache/tika/mime/Clause.java
+  AL    src/main/java/org/apache/tika/mime/HexCoDec.java
+  AL    src/main/java/org/apache/tika/mime/Magic.java
+  AL    src/main/java/org/apache/tika/mime/MagicClause.java
+  AL    src/main/java/org/apache/tika/mime/MagicMatch.java
+  AL    src/main/java/org/apache/tika/mime/MediaType.java
+  AL    src/main/java/org/apache/tika/mime/MediaTypeRegistry.java
+  AL    src/main/java/org/apache/tika/mime/MimeType.java
+  AL    src/main/java/org/apache/tika/mime/MimeTypeException.java
+  AL    src/main/java/org/apache/tika/mime/MimeTypes.java
+  AL    src/main/java/org/apache/tika/mime/MimeTypesFactory.java
+  AL    src/main/java/org/apache/tika/mime/MimeTypesReader.java
+  AL    src/main/java/org/apache/tika/mime/Operator.java
+  AL    src/main/java/org/apache/tika/mime/Patterns.java
+  AL    src/main/java/org/apache/tika/parser/AbstractParser.java
+  AL    src/main/java/org/apache/tika/parser/AutoDetectParser.java
+  AL    src/main/java/org/apache/tika/parser/CompositeParser.java
+  AL    src/main/java/org/apache/tika/parser/EmptyParser.java
+  AL    src/main/java/org/apache/tika/parser/ErrorParser.java
+  AL    src/main/java/org/apache/tika/parser/html/HtmlParser.java
+ !????? src/main/java/org/apache/tika/parser/image/ImageParser.java
+  AL    src/main/java/org/apache/tika/parser/microsoft/Cell.java
+  AL    src/main/java/org/apache/tika/parser/microsoft/CellDecorator.java
+  AL    src/main/java/org/apache/tika/parser/microsoft/ExcelExtractor.java
+  AL    src/main/java/org/apache/tika/parser/microsoft/LinkedCell.java
+  AL    src/main/java/org/apache/tika/parser/microsoft/NumberCell.java
+  AL    src/main/java/org/apache/tika/parser/microsoft/OfficeParser.java
+  AL    src/main/java/org/apache/tika/parser/microsoft/TextCell.java
+  AL    src/main/java/org/apache/tika/parser/opendocument/OpenOfficeContentParser.java
+  AL    src/main/java/org/apache/tika/parser/opendocument/OpenOfficeMetaParser.java
+  AL    src/main/java/org/apache/tika/parser/opendocument/OpenOfficeParser.java
+  AL    src/main/java/org/apache/tika/parser/Parser.java
+  AL    src/main/java/org/apache/tika/parser/ParserDecorator.java
+  AL    src/main/java/org/apache/tika/parser/ParserPostProcessor.java
+  AL    src/main/java/org/apache/tika/parser/ParsingReader.java
+  AL    src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java
+  AL    src/main/java/org/apache/tika/parser/pdf/PDFParser.java
+  AL    src/main/java/org/apache/tika/parser/rtf/RTFParser.java
+  AL    src/main/java/org/apache/tika/parser/txt/TXTParser.java
+  AL    src/main/java/org/apache/tika/parser/xml/DcXMLParser.java
+  AL    src/main/java/org/apache/tika/parser/xml/MetadataHandler.java
+  AL    src/main/java/org/apache/tika/parser/xml/XMLParser.java
+  AL    src/main/java/org/apache/tika/sax/BodyContentHandler.java
+  AL    src/main/java/org/apache/tika/sax/ContentHandlerDecorator.java
+  AL    src/main/java/org/apache/tika/sax/TeeContentHandler.java
+  AL    src/main/java/org/apache/tika/sax/TextContentHandler.java
+  AL    src/main/java/org/apache/tika/sax/WriteOutContentHandler.java
+  AL    src/main/java/org/apache/tika/sax/XHTMLContentHandler.java
+  AL    src/main/java/org/apache/tika/sax/xpath/AttributeMatcher.java
+  AL    src/main/java/org/apache/tika/sax/xpath/ChildMatcher.java
+  AL    src/main/java/org/apache/tika/sax/xpath/CompositeMatcher.java
+  AL    src/main/java/org/apache/tika/sax/xpath/ElementMatcher.java
+  AL    src/main/java/org/apache/tika/sax/xpath/Matcher.java
+  AL    src/main/java/org/apache/tika/sax/xpath/MatchingContentHandler.java
+  AL    src/main/java/org/apache/tika/sax/xpath/NamedAttributeMatcher.java
+  AL    src/main/java/org/apache/tika/sax/xpath/NamedElementMatcher.java
+  AL    src/main/java/org/apache/tika/sax/xpath/NodeMatcher.java
+  AL    src/main/java/org/apache/tika/sax/xpath/SubtreeMatcher.java
+  AL    src/main/java/org/apache/tika/sax/xpath/TextMatcher.java
+  AL    src/main/java/org/apache/tika/sax/xpath/XPathParser.java
+  AL    src/main/java/org/apache/tika/utils/ParseUtils.java
+  AL    src/main/java/org/apache/tika/utils/RegexUtils.java
+  AL    src/main/java/org/apache/tika/utils/RereadableInputStream.java
+  AL    src/main/java/org/apache/tika/utils/StringUtil.java
+  AL    src/main/java/org/apache/tika/utils/Utils.java
+  AL    src/main/resources/mime/tika-mimetypes.xml
+  AL    src/main/resources/tika-config.xml
+  AL    src/site/apt/download.apt
+  AL    src/site/apt/index.apt
+  B     src/site/resources/tika.png
+  B     src/site/resources/tika.xcf
+  AL    src/site/site.xml
+  AL    src/test/java/org/apache/tika/metadata/TestMetadata.java
+  AL    src/test/java/org/apache/tika/metadata/TestSpellCheckedMetadata.java
+  AL    src/test/java/org/apache/tika/mime/MediaTypeTest.java
+  AL    src/test/java/org/apache/tika/mime/MimeTypesTest.java
+  AL    src/test/java/org/apache/tika/mime/MimeTypeTest.java
+  AL    src/test/java/org/apache/tika/mime/PatternsTest.java
+  AL    src/test/java/org/apache/tika/mime/TestMimeTypes.java
+  AL    src/test/java/org/apache/tika/parser/AutoDetectParserTest.java
+  AL    src/test/java/org/apache/tika/parser/html/HtmlParserTest.java
+  AL    src/test/java/org/apache/tika/parser/image/ImageParserTest.java
+  AL    src/test/java/org/apache/tika/parser/microsoft/ExcelParserTest.java
+  AL    src/test/java/org/apache/tika/parser/microsoft/PowerPointParserTest.java
+  AL    src/test/java/org/apache/tika/parser/microsoft/WordParserTest.java
+  AL    src/test/java/org/apache/tika/parser/opendocument/OpenOfficeParserTest.java
+  AL    src/test/java/org/apache/tika/parser/ParsingReaderTest.java
+  AL    src/test/java/org/apache/tika/parser/txt/TXTParserTest.java
+  AL    src/test/java/org/apache/tika/parser/xml/DcXMLParserTest.java
+  AL    src/test/java/org/apache/tika/sax/xpath/XPathParserTest.java
+  AL    src/test/java/org/apache/tika/TestParsers.java
+  AL    src/test/java/org/apache/tika/TestRereadableInputStream.java
+  AL    src/test/java/org/apache/tika/utils/RegexUtilsTest.java
+  AL    src/test/resources/log4j.properties
+  A     src/test/resources/test-documents/test-documents.zip
+  B     src/test/resources/test-documents/testBMP.bmp
+  B     src/test/resources/test-documents/testEXCEL-formats.xls
+  B     src/test/resources/test-documents/testEXCEL.xls
+  B     src/test/resources/test-documents/testGIF.gif
+ !????? src/test/resources/test-documents/testHTML.html
+ !????? src/test/resources/test-documents/testHTML_utf8.html
+  B     src/test/resources/test-documents/testJPEG.jpg
+  B     src/test/resources/test-documents/testOpenOffice2.odt
+  B     src/test/resources/test-documents/testPDF.pdf
+  B     src/test/resources/test-documents/testPNG.png
+  B     src/test/resources/test-documents/testPPT.ppt
+ !????? src/test/resources/test-documents/testRTF.rtf
+  B     src/test/resources/test-documents/testTIFF.tif
+ !????? src/test/resources/test-documents/testTXT.txt
+  B     src/test/resources/test-documents/testWORD.doc
+ !????? src/test/resources/test-documents/testXML.xml
+ 
+ *****************************************************
+ Printing headers for files without AL header...
+ 
+ 
+ =======================================================================
+ ==.checkstyle
+ =======================================================================
+ &lt;?xml version=&quot;1.0&quot; encoding=&quot;UTF-8&quot;?&gt;
+&lt;fileset-config file-format-version=&quot;1.2.0&quot; simple-config=&quot;true&quot;&gt;
+    &lt;fileset name=&quot;all&quot; enabled=&quot;true&quot; check-config-name=&quot;Sun Checks&quot; local=&quot;false&quot;&gt;
+        &lt;file-match-pattern match-pattern=&quot;.&quot; include-pattern=&quot;true&quot;/&gt;
+    &lt;/fileset&gt;
+&lt;/fileset-config&gt;
+
+ =======================================================================
+ ==.externalToolBuilders/Maven_Ant_Builder.launch
+ =======================================================================
+ &lt;launchConfiguration type=&quot;org.eclipse.ant.AntBuilderLaunchConfigurationType&quot;&gt;
+  &lt;booleanAttribute key=&quot;org.eclipse.debug.ui.ATTR_LAUNCH_IN_BACKGROUND&quot; value=&quot;false&quot;/&gt;
+  &lt;stringAttribute key=&quot;org.eclipse.ui.externaltools.ATTR_RUN_BUILD_KINDS&quot; value=&quot;full,incremental,auto,clean&quot;/&gt;
+  &lt;booleanAttribute key=&quot;org.eclipse.ui.externaltools.ATTR_TRIGGERS_CONFIGURED&quot; value=&quot;true&quot;/&gt;
+  &lt;booleanAttribute key=&quot;org.eclipse.debug.core.appendEnvironmentVariables&quot; value=&quot;true&quot;/&gt;
+  &lt;stringAttribute key=&quot;org.eclipse.jdt.launching.PROJECT_ATTR&quot; value=&quot;tika&quot;/&gt;
+  &lt;booleanAttribute key=&quot;org.eclipse.jdt.launching.DEFAULT_CLASSPATH&quot; value=&quot;true&quot;/&gt;
+  &lt;stringAttribute key=&quot;org.eclipse.ui.externaltools.ATTR_LOCATION&quot; value=&quot;${build_project}/maven-eclipse.xml&quot;/&gt;
+  &lt;stringAttribute key=&quot;org.eclipse.ui.externaltools.ATTR_WORKING_DIRECTORY&quot; value=&quot;${build_project}&quot;/&gt;
+  &lt;stringAttribute key=&quot;org.eclipse.debug.core.ATTR_REFRESH_SCOPE&quot; value=&quot;${project}&quot;/&gt;
+  &lt;booleanAttribute key=&quot;org.eclipse.debug.core.capture_output&quot; value=&quot;false&quot;/&gt;
+  &lt;stringAttribute key=&quot;org.eclipse.ui.externaltools.ATTR_BUILD_SCOPE&quot; value=&quot;${working_set:&amp;lt;?xml version=&amp;apos;1.0&amp;apos;?&amp;gt;&amp;lt;launchConfigurationWorkingSet editPageId=&amp;apos;org.eclipse.ui.resourceWorkingSetPage&amp;apos; factoryID=&amp;apos;org.eclipse.ui.internal.WorkingSetFactory&amp;apos; label=&amp;apos;workingSet&amp;apos; name=&amp;apos;workingSet&amp;apos;&amp;gt;&amp;lt;item factoryID=&amp;apos;org.eclipse.ui.internal.model.ResourceFactory&amp;apos; path=&amp;apos;tika&amp;apos; type=&amp;apos;4&amp;apos;/&amp;gt;&amp;lt;/launchConfigurationWorkingSet&amp;gt;}&quot;/&gt;
+  &lt;stringAttribute key=&quot;process_factory_id&quot; value=&quot;org.eclipse.ant.ui.remoteAntProcessFactory&quot;/&gt;
+  &lt;booleanAttribute key=&quot;org.eclipse.ant.ui.DEFAULT_VM_INSTALL&quot; value=&quot;false&quot;/&gt;
+  &lt;booleanAttribute key=&quot;org.eclipse.debug.ui.ATTR_CONSOLE_OUTPUT_ON&quot; value=&quot;false&quot;/&gt;
+  &lt;booleanAttribute key=&quot;org.eclipse.ant.ui.ATTR_TARGETS_UPDATED&quot; value=&quot;true&quot;/&gt;
+  &lt;stringAttribute key=&quot;org.eclipse.jdt.launching.CLASSPATH_PROVIDER&quot; value=&quot;org.eclipse.ant.ui.AntClasspathProvider&quot;/&gt;
+  &lt;listAttribute key=&quot;org.eclipse.debug.core.MAPPED_RESOURCE_TYPES&quot;&gt;
+    &lt;listEntry value=&quot;1&quot;/&gt;
+  &lt;/listAttribute&gt;
+  &lt;listAttribute key=&quot;org.eclipse.debug.core.MAPPED_RESOURCE_PATHS&quot;&gt;
+    &lt;listEntry value=&quot;/tika/maven-eclipse.xml&quot;/&gt;
+  &lt;/listAttribute&gt;
+&lt;/launchConfiguration&gt;
+
+ =======================================================================
+ ==CHANGES.txt
+ =======================================================================
+ Tika Change Log
+
+Unreleased changes (0.2-incubating)
+
+1.  TIKA-109 - WordParser fails on some Word files (Dave Meikle)
+
+2.  TIKA-105 - Excel parser implementation based on POI's Event API
+               (Niall Pemberton)
+
+3.  TIKA-116 - Streaming parser for OpenDocument files (Jukka Zitting)
+
+4.  TIKA-117 - Drop JDOM and Jaxen dependencies (Jukka Zitting)
+
+5.  TIKA-115 - Tika package with all the dependencies (Jukka Zitting)
+
+6.  TIKA-97  - Tika GUI (Jukka Zitting)
+
+7.  TIKA-96  - Tika CLI (Jukka Zitting)
+
+8.  TIKA-112 - Use Commons IO 1.4 (Jukka Zitting)
+
+9.  TIKA-126 - Add Parser.parse(InputStream, Metadata) for metadata extraction
+              (Jukka Zitting)
+
+10. TIKA-127 - Add support for Visio files (Jukka Zitting)
+
+11. TIKA-129 - node() support for the streaming XPath utility (Jukka Zitting)
+
+12. TIKA-130 - self-or-descendant axis does not match self in streaming XPath
+               (Jukka Zitting)
+
+13. TIKA-131 - Lazy XHTML prefix generation (Jukka Zitting)
+
+14. TIKA-128 - HTML parser should produce XHTML SAX events (Jukka Zitting)
+
+15. TIKA-133 - TeeContentHandler constructor should use varargs (Jukka Zitting)
+
+16. TIKA-132 - Refactor Excel extractor to parse per sheet and add
+               hyperlink support (Niall Pemberton)
+
+17. TIKA-134 - mvn package does not produce packages for bin/src
+               (Karl Heinz Marbaise)
+
+18. TIKA-138 - Ignore HTML style and script content (Jukka Zitting)
+
+19. TIKA-113 - Metadata (such as title) should not be part of content
+               (Jukka Zitting)
+
+20. TIKA-139 - Add a composite parser (Jukka Zitting)
+
+
+ =======================================================================
+ ==maven-eclipse.xml
+ =======================================================================
+ &lt;project default=&quot;copy-resources&quot;&gt;
+  &lt;target name=&quot;init&quot;/&gt;
+  &lt;target name=&quot;copy-resources&quot; depends=&quot;init&quot;&gt;
+    &lt;copy todir=&quot;target/classes/META-INF&quot; filtering=&quot;false&quot;&gt;
+      &lt;fileset dir=&quot;.&quot; includes=&quot;README.txt|NOTICE.txt|LICENSE.txt&quot;/&gt;
+    &lt;/copy&gt;
+    &lt;copy todir=&quot;target/classes/org/apache/tika&quot; filtering=&quot;false&quot;&gt;
+      &lt;fileset dir=&quot;src/main/resources&quot;/&gt;
+    &lt;/copy&gt;
+  &lt;/target&gt;
+&lt;/project&gt;
+
+ =======================================================================
+ ==src/main/java/org/apache/tika/parser/image/ImageParser.java
+ =======================================================================
+ package org.apache.tika.parser.image;
+
+import java.io.IOException;
+import java.io.InputStream;
+import java.util.Iterator;
+
+import javax.imageio.ImageIO;
+import javax.imageio.ImageReader;
+
+import org.apache.commons.io.input.CloseShieldInputStream;
+import org.apache.tika.exception.TikaException;
+import org.apache.tika.metadata.Metadata;
+import org.apache.tika.parser.Parser;
+import org.apache.tika.sax.XHTMLContentHandler;
+import org.xml.sax.ContentHandler;
+import org.xml.sax.SAXException;
+
+public class ImageParser implements Parser {
+
+    public void parse(InputStream stream, Metadata metadata)
+            throws IOException, TikaException {
+        String type = metadata.get(Metadata.CONTENT_TYPE);
+        if (type != null) {
+            Iterator&lt;ImageReader&gt; iterator =
+                ImageIO.getImageReadersByMIMEType(type);
+            if (iterator.hasNext()) {
+                ImageReader reader = iterator.next();
+                reader.setInput(ImageIO.createImageInputStream(
+                        new CloseShieldInputStream(stream)));
+                metadata.set(&quot;height&quot;, Integer.toString(reader.getHeight(0)));
+                metadata.set(&quot;width&quot;, Integer.toString(reader.getWidth(0)));
+                reader.dispose();
+            }
+        }
+    }
+
+    public void parse(
+            InputStream stream, ContentHandler handler, Metadata metadata)
+            throws IOException, SAXException, TikaException {
+        parse(stream, metadata);
+        XHTMLContentHandler xhtml = new XHTMLContentHandler(handler, metadata);
+        xhtml.startDocument();
+        xhtml.endDocument();
+    }
+
+}
+
+ =======================================================================
+ ==src/test/resources/test-documents/testHTML.html
+ =======================================================================
+ &lt;html&gt;
+	&lt;head&gt;
+		&lt;title&gt;Title : Test Indexation Html&lt;/title&gt;	
+	&lt;/head&gt;
+	&lt;body&gt;
+		&lt;h1&gt;Test Indexation Html&lt;/h1&gt;
+		&lt;p&gt;&lt;a href=&quot;http://www.apache.org/&quot;&gt;Indexation&lt;/a&gt; du fichier&lt;/p&gt;
+	&lt;/body&gt;	
+&lt;/html&gt;
+
+ =======================================================================
+ ==src/test/resources/test-documents/testHTML_utf8.html
+ =======================================================================
+ &lt;html&gt;
+	&lt;head&gt;
+		&lt;title&gt;Title : Tilte with UTF-8 chars öäå&lt;/title&gt;	
+	&lt;/head&gt;
+	&lt;body&gt;
+		&lt;h1&gt;Content with UTF-8 chars&lt;/h1&gt;
+		&lt;p&gt;åäö&lt;/p&gt;
+	&lt;/body&gt;	
+&lt;/html&gt;
+
+ =======================================================================
+ ==src/test/resources/test-documents/testRTF.rtf
+ =======================================================================
+ {\rtf1\ansi\ansicpg1252\uc1\deff0\stshfdbch0\stshfloch0\stshfhich0\stshfbi0\deflang1036\deflangfe1036{\fonttbl{\f0\froman\fcharset0\fprq2{\*\panose 02020603050405020304}Times New Roman;}{\f37\froman\fcharset238\fprq2 Times New Roman CE;}
+{\f38\froman\fcharset204\fprq2 Times New Roman Cyr;}{\f40\froman\fcharset161\fprq2 Times New Roman Greek;}{\f41\froman\fcharset162\fprq2 Times New Roman Tur;}{\f42\froman\fcharset177\fprq2 Times New Roman (Hebrew);}
+{\f43\froman\fcharset178\fprq2 Times New Roman (Arabic);}{\f44\froman\fcharset186\fprq2 Times New Roman Baltic;}{\f45\froman\fcharset163\fprq2 Times New Roman (Vietnamese);}}{\colortbl;\red0\green0\blue0;\red0\green0\blue255;\red0\green255\blue255;
+\red0\green255\blue0;\red255\green0\blue255;\red255\green0\blue0;\red255\green255\blue0;\red255\green255\blue255;\red0\green0\blue128;\red0\green128\blue128;\red0\green128\blue0;\red128\green0\blue128;\red128\green0\blue0;\red128\green128\blue0;
+\red128\green128\blue128;\red192\green192\blue192;}{\stylesheet{\ql \li0\ri0\widctlpar\aspalpha\aspnum\faauto\adjustright\rin0\lin0\itap0 \fs24\lang1036\langfe1036\cgrid\langnp1036\langfenp1036 \snext0 Normal;}{\*\cs10 \additive \ssemihidden 
+Default Paragraph Font;}{\*\ts11\tsrowd\trftsWidthB3\trpaddl108\trpaddr108\trpaddfl3\trpaddft3\trpaddfb3\trpaddfr3\trcbpat1\trcfpat1\tscellwidthfts0\tsvertalt\tsbrdrt\tsbrdrl\tsbrdrb\tsbrdrr\tsbrdrdgl\tsbrdrdgr\tsbrdrh\tsbrdrv 
+\ql \li0\ri0\widctlpar\aspalpha\aspnum\faauto\adjustright\rin0\lin0\itap0 \fs20\lang1024\langfe1024\cgrid\langnp1024\langfenp1024 \snext11 \ssemihidden Normal Table;}}{\*\latentstyles\lsdstimax156\lsdlockeddef0}{\*\rsidtbl \rsid2954171\rsid10375891}
+{\*\generator Microsoft Word 11.0.6568;}{\info{\title Test d\'92indexation Word}{\author Bibliotheque}{\operator Bibliotheque}{\creatim\yr2006\mo5\dy18\hr12\min19}{\revtim\yr2006\mo5\dy18\hr12\min19}{\version2}{\edmins0}{\nofpages1}{\nofwords3}
+{\nofchars21}{\*\company Universite Laval}{\nofcharsws23}{\vern24579}}\paperw11906\paperh16838\margl1417\margr1417\margt1417\margb1417 
+\deftab708\widowctrl\ftnbj\aenddoc\hyphhotz425\noxlattoyen\expshrtn\noultrlspc\dntblnsbdb\nospaceforul\formshade\horzdoc\dgmargin\dghspace180\dgvspace180\dghorigin1417\dgvorigin1417\dghshow1\dgvshow1
+\jexpand\viewkind1\viewscale100\pgbrdrhead\pgbrdrfoot\splytwnine\ftnlytwnine\htmautsp\nolnhtadjtbl\useltbaln\alntblind\lytcalctblwd\lyttblrtgr\lnbrkrule\nobrkwrptbl\snaptogridincell\allowfieldendsel\wrppunct\asianbrkrule\nojkernpunct\rsidroot2954171 \fet0
+\sectd \linex0\headery708\footery708\colsx708\endnhere\sectlinegrid360\sectdefaultcl\sftnbj {\*\pnseclvl1\pnucrm\pnstart1\pnindent720\pnhang {\pntxta .}}{\*\pnseclvl2\pnucltr\pnstart1\pnindent720\pnhang {\pntxta .}}{\*\pnseclvl3
+\pndec\pnstart1\pnindent720\pnhang {\pntxta .}}{\*\pnseclvl4\pnlcltr\pnstart1\pnindent720\pnhang {\pntxta )}}{\*\pnseclvl5\pndec\pnstart1\pnindent720\pnhang {\pntxtb (}{\pntxta )}}{\*\pnseclvl6\pnlcltr\pnstart1\pnindent720\pnhang {\pntxtb (}{\pntxta )}}
+{\*\pnseclvl7\pnlcrm\pnstart1\pnindent720\pnhang {\pntxtb (}{\pntxta )}}{\*\pnseclvl8\pnlcltr\pnstart1\pnindent720\pnhang {\pntxtb (}{\pntxta )}}{\*\pnseclvl9\pnlcrm\pnstart1\pnindent720\pnhang {\pntxtb (}{\pntxta )}}\pard\plain 
+\ql \li0\ri0\widctlpar\aspalpha\aspnum\faauto\adjustright\rin0\lin0\itap0 \fs24\lang1036\langfe1036\cgrid\langnp1036\langfenp1036 {\insrsid2954171 Test d\rquote indexation Word
+\par 
+\par }}
+
+ =======================================================================
+ ==src/test/resources/test-documents/testTXT.txt
+ =======================================================================
+ Test d'indexation de Txt
+http://www.apache.org
+
+ =======================================================================
+ ==src/test/resources/test-documents/testXML.xml
+ =======================================================================
+ &lt;?xml version=&quot;1.0&quot; encoding=&quot;UTF-8&quot;?&gt;
+&lt;oaidc:dc xmlns:dc=&quot;http://purl.org/dc/elements/1.1/&quot; xmlns:oaidc=&quot;http://www.openarchives.org/OAI/2.0/oai_dc/&quot;&gt;
+
+	&lt;dc:title&gt;Tika test document&lt;/dc:title&gt;
+
+	&lt;dc:creator&gt;Rida Benjelloun&lt;/dc:creator&gt;
+
+	&lt;dc:subject&gt;Java&lt;/dc:subject&gt;
+
+	&lt;dc:subject&gt;XML&lt;/dc:subject&gt;
+
+	&lt;dc:subject&gt;XSLT&lt;/dc:subject&gt;
+
+	&lt;dc:subject&gt;JDOM&lt;/dc:subject&gt;
+ 
+	&lt;dc:subject&gt;Indexation&lt;/dc:subject&gt;
+
+	&lt;dc:description&gt;Framework d'indexation des documents XML, HTML, PDF etc.. &lt;/dc:description&gt;
+
+	&lt;dc:identifier&gt;http://www.apache.org&lt;/dc:identifier&gt;
+
+	&lt;dc:date&gt;2000-12&lt;/dc:date&gt;
+
+	&lt;dc:type&gt;test&lt;/dc:type&gt;
+
+	&lt;dc:format&gt;application/msword&lt;/dc:format&gt;
+
+	&lt;dc:language&gt;Fr&lt;/dc:language&gt;
+
+	&lt;dc:rights&gt;Archimède et Lius à Châteauneuf testing chars en été&lt;/dc:rights&gt;	
+
+&lt;/oaidc:dc&gt;
 </pre></div></p>
       </div>
     </div>

Modified: incubator/tika/site/source-repository.html
URL: http://svn.apache.org/viewvc/incubator/tika/site/source-repository.html?rev=664072&r1=664071&r2=664072&view=diff
==============================================================================
--- incubator/tika/site/source-repository.html (original)
+++ incubator/tika/site/source-repository.html Fri Jun  6 11:34:43 2008
@@ -78,7 +78,7 @@
         </li>
               
     <li class="none">
-              <a href="http://www.apache.org/dyn/closer.cgi/incubator/tika">Download</a>
+              <a href="download.html">Download</a>
         </li>
           </ul>
           <h5>Project Documentation</h5>