You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@tika.apache.org by ju...@apache.org on 2010/01/30 22:24:59 UTC

svn commit: r904893 - in /lucene/tika/site/src/site: apt/0.5/index.apt site.xml

Author: jukka
Date: Sat Jan 30 21:24:59 2010
New Revision: 904893

URL: http://svn.apache.org/viewvc?rev=904893&view=rev
Log:
site: Add an index page for the 0.5 documentation

Added:
    lucene/tika/site/src/site/apt/0.5/index.apt
Modified:
    lucene/tika/site/src/site/site.xml

Added: lucene/tika/site/src/site/apt/0.5/index.apt
URL: http://svn.apache.org/viewvc/lucene/tika/site/src/site/apt/0.5/index.apt?rev=904893&view=auto
==============================================================================
--- lucene/tika/site/src/site/apt/0.5/index.apt (added)
+++ lucene/tika/site/src/site/apt/0.5/index.apt Sat Jan 30 21:24:59 2010
@@ -0,0 +1,100 @@
+                       ---------------
+                       Apache Tika 0.5
+                       ---------------
+
+~~ Licensed to the Apache Software Foundation (ASF) under one or more
+~~ contributor license agreements.  See the NOTICE file distributed with
+~~ this work for additional information regarding copyright ownership.
+~~ The ASF licenses this file to You under the Apache License, Version 2.0
+~~ (the "License"); you may not use this file except in compliance with
+~~ the License.  You may obtain a copy of the License at
+~~
+~~     http://www.apache.org/licenses/LICENSE-2.0
+~~
+~~ Unless required by applicable law or agreed to in writing, software
+~~ distributed under the License is distributed on an "AS IS" BASIS,
+~~ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+~~ See the License for the specific language governing permissions and
+~~ limitations under the License.
+
+Apache Tika 0.5
+
+   The most notable changes in Tika 0.5 over the previous release are:
+
+      * Improved RDF/OWL mime detection using both MIME magic as well as
+        pattern matching.
+        ({{{https://issues.apache.org/jira/browse/TIKA-309}TIKA-309}})
+
+      * An org.apache.tika.Tika facade class has been added to simplify
+        common text extraction and type detection use cases.
+        ({{{https://issues.apache.org/jira/browse/TIKA-269}TIKA-269}})
+
+      * A new parse context argument was added to the Parser.parse() method.
+        This context map can be used to pass things like a delegate parser
+        or other settings to the parsing process. The previous parse() method
+        signature has been deprecated and will be removed in Tika 1.0.
+        ({{{https://issues.apache.org/jira/browse/TIKA-275}TIKA-275}})
+
+      * A simple ngram-based language detection mechanism has been added
+        along with predefined language profiles for 18 languages.
+        ({{{https://issues.apache.org/jira/browse/TIKA-209}TIKA-209}})
+
+      * The media type registry in Tika was synchronized with the MIME type
+        configuration in the Apache HTTP Server. Tika now knows about 1274
+        different media types and can detect 672 of those using 927 file
+        extension and 280 magic byte patterns.
+        ({{{https://issues.apache.org/jira/browse/TIKA-285}TIKA-285}})
+
+      * Tika now uses the Apache PDFBox version 0.8.0-incubating for parsing
+        PDF documents. This version is notably better than the 0.7.3 release
+        used earlier.
+        ({{{https://issues.apache.org/jira/browse/TIKA-158}TIKA-158}})
+
+   The following people have contributed to Tika 0.5 by submitting or
+   commenting on the issues resolved in this release:
+
+      * Alex Baranov
+
+      * Bart Hanssens
+
+      * Benson Margulies
+
+      * Chris A. Mattmann
+
+      * Daan de Wit
+
+      * Erik Hetzner
+
+      * Frank Hellwig
+
+      * Jeff Cadow
+
+      * Joachim Zittmayr
+
+      * Jukka Zitting 
+
+      * Julien Nioche
+
+      * Ken Krugler
+
+      * Maxim Valyanskiy
+
+      * MRIT64
+
+      * Paul Borgermans
+
+      * Piotr B.
+
+      * Robert Newson
+
+      * Sascha Szott
+
+      * Ted Dunning
+
+      * Thilo Goetz
+
+      * Uwe Schindler
+
+      * Yuan-Fang Li
+
+   See {{http://tinyurl.com/yl9prwp}} for more details on these contributions.

Modified: lucene/tika/site/src/site/site.xml
URL: http://svn.apache.org/viewvc/lucene/tika/site/src/site/site.xml?rev=904893&r1=904892&r2=904893&view=diff
==============================================================================
--- lucene/tika/site/src/site/site.xml (original)
+++ lucene/tika/site/src/site/site.xml Sat Jan 30 21:24:59 2010
@@ -39,11 +39,13 @@
       <item name="Introduction" href="index.html"/>
       <item name="Download" href="download.html"/>
     </menu>
-    <menu name="Tika 0.5">
-      <item name="Getting Started" href="0.5/gettingstarted.html"/>
-      <item name="Documentation" href="0.5/documentation.html"/>
-      <item name="Supported Formats" href="0.5/formats.html"/>
-      <item name="API Documentation" href="0.5/api/"/>
+    <menu name="Documentation">
+      <item name="Tika 0.5" href="0.5/index.html">
+        <item name="Getting Started" href="0.5/gettingstarted.html"/>
+        <item name="Documentation" href="0.5/documentation.html"/>
+        <item name="Supported Formats" href="0.5/formats.html"/>
+        <item name="API Documentation" href="0.5/api/"/>
+      </item>
     </menu>
     <menu ref="reports"/>
   </body>