You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@tika.apache.org by ju...@apache.org on 2009/11/07 05:55:44 UTC

svn commit: r833626 - /lucene/tika/trunk/CHANGES.txt

Author: jukka
Date: Sat Nov  7 04:55:43 2009
New Revision: 833626

URL: http://svn.apache.org/viewvc?rev=833626&view=rev
Log:
Add change log entries for TIKA-209 and TIKA-275

Modified:
    lucene/tika/trunk/CHANGES.txt

Modified: lucene/tika/trunk/CHANGES.txt
URL: http://svn.apache.org/viewvc/lucene/tika/trunk/CHANGES.txt?rev=833626&r1=833625&r2=833626&view=diff
==============================================================================
--- lucene/tika/trunk/CHANGES.txt (original)
+++ lucene/tika/trunk/CHANGES.txt Sat Nov  7 04:55:43 2009
@@ -6,6 +6,16 @@
 
 The most notable changes in Tika 0.5 over the previous release are:
 
+ * A new parse context argument was added to the Parser.parse() method.
+   This context map can be used to pass things like a delegate parser or
+   other settings to the parsing process. The previous parse() method
+   signature has been deprecated and will be removed in Tika 1.0. (TIKA-275)
+
+ * A simple ngram-based language detection mechanism has been added along
+   with predefined language profiles for 18 languages. The AutoDetectParser
+   will automatically set the Metadata.LANGUAGE property if a reasonably
+   certain language profile match is detected. (TIKA-209)
+
  * The media type registry in Tika was synchronized with the MIME type
    configuration in the Apache HTTP Server. Tika now knows about 1274
    different media types and can detect 672 of those using 927 file