Posted to commits@tika.apache.org by ju...@apache.org on 2009/01/16 01:39:07 UTC

svn commit: r734874 - /lucene/tika/trunk/CHANGES.txt

Author: jukka
Date: Thu Jan 15 16:39:07 2009
New Revision: 734874

URL: http://svn.apache.org/viewvc?rev=734874&view=rev
Log:
CHANGES.txt: Added a higher level summary of some of the more notable changes in the upcoming 0.3 release.

Modified:
    lucene/tika/trunk/CHANGES.txt

Modified: lucene/tika/trunk/CHANGES.txt
URL: http://svn.apache.org/viewvc/lucene/tika/trunk/CHANGES.txt?rev=734874&r1=734873&r2=734874&view=diff
==============================================================================
--- lucene/tika/trunk/CHANGES.txt (original)
+++ lucene/tika/trunk/CHANGES.txt Thu Jan 15 16:39:07 2009
@@ -1,8 +1,27 @@
-Tika Change Log
+Apache Tika Change Log
+======================
 
 Unreleased changes (0.3)
+------------------------
+
+The most notable changes in Tika 0.3 over the previous release are:
+
+  * Tika now disables the loading of all external entities in XML files
+    that it parses as input documents. This improves security and avoids
+    problems with potentially broken references. (TIKA-185)
+
+  * Tika now replaces all invalid XML characters in the extracted text
+    content with spaces. This prevents problems when output from Tika
+    is processed with XML tools. (TIKA-180)
+
+  * The Tika CLI now correctly flushes its buffers when invoked with the
+    --text argument. This prevents the end of the text output from being
+    lost. (TIKA-179)
+
+See http://tinyurl.com/tika-0-3-changes for a list of all changes in Tika 0.3.
 
 Release 0.2 - 12/04/2008
+------------------------
 
 1.  TIKA-109 - WordParser fails on some Word files (Dave Meikle)
 
@@ -111,6 +130,7 @@
                missing white space after XHTML block tags (Uwe Schindler & Jukka Zitting)
 
 Release 0.1-incubating - 12/27/2007
+-----------------------------------
 
 1. TIKA-5 - Port Metadata Framework from Nutch (mattmann)