Posted to commits@tika.apache.org by ju...@apache.org on 2009/01/16 01:39:07 UTC
svn commit: r734874 - /lucene/tika/trunk/CHANGES.txt
Author: jukka
Date: Thu Jan 15 16:39:07 2009
New Revision: 734874
URL: http://svn.apache.org/viewvc?rev=734874&view=rev
Log:
CHANGES.txt: Added a higher-level summary of some of the more notable changes in the upcoming 0.3 release.
Modified:
lucene/tika/trunk/CHANGES.txt
Modified: lucene/tika/trunk/CHANGES.txt
URL: http://svn.apache.org/viewvc/lucene/tika/trunk/CHANGES.txt?rev=734874&r1=734873&r2=734874&view=diff
==============================================================================
--- lucene/tika/trunk/CHANGES.txt (original)
+++ lucene/tika/trunk/CHANGES.txt Thu Jan 15 16:39:07 2009
@@ -1,8 +1,27 @@
-Tika Change Log
+Apache Tika Change Log
+======================
Unreleased changes (0.3)
+------------------------
+
+The most notable changes in Tika 0.3 over the previous release are:
+
+ * Tika now disables the loading of all external entities in XML files
+ that it parses as input documents. This improves security and avoids
+ problems with potentially broken references. (TIKA-185)
+
+ * Tika now replaces all invalid XML characters in the extracted text
+ content with spaces. This prevents problems when output from Tika
+ is processed with XML tools. (TIKA-180)
+
+ * The Tika CLI now correctly flushes its buffers when invoked with the
+ --text argument. This prevents the end of the text output from being
+ lost. (TIKA-179)
+
+See http://tinyurl.com/tika-0-3-changes for a list of all changes in Tika 0.3.
Release 0.2 - 12/04/2008
+------------------------
1. TIKA-109 - WordParser fails on some Word files (Dave Meikle)
@@ -111,6 +130,7 @@
missing white space after XHTML block tags (Uwe Schindler & Jukka Zitting)
Release 0.1-incubating - 12/27/2007
+-----------------------------------
1. TIKA-5 - Port Metadata Framework from Nutch (mattmann)
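
Illustration (not part of the commit): the first two 0.3 notes above, TIKA-185 and TIKA-180, describe behaviour that can be sketched with plain JAXP/SAX. The class and method names below (XmlHygieneSketch, replaceInvalidXmlChars, extractTextOffline) are hypothetical and are not Tika's API. The empty-entity resolver is one common way to keep a parser from loading external entities, and the character check follows the XML 1.0 "Char" production. Whether Tika implements either change this way is an assumption.

```java
import java.io.IOException;
import java.io.StringReader;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper class, for illustration only; not Tika code.
final class XmlHygieneSketch {

    // True if the code point is allowed by the XML 1.0 "Char" production.
    static boolean isValidXmlChar(int cp) {
        return cp == 0x9 || cp == 0xA || cp == 0xD
                || (cp >= 0x20 && cp <= 0xD7FF)
                || (cp >= 0xE000 && cp <= 0xFFFD)
                || (cp >= 0x10000 && cp <= 0x10FFFF);
    }

    // Replaces every invalid XML character with a single space (cf. TIKA-180).
    static String replaceInvalidXmlChars(String text) {
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            i += Character.charCount(cp);
            if (isValidXmlChar(cp)) {
                out.appendCodePoint(cp);
            } else {
                out.append(' ');
            }
        }
        return out.toString();
    }

    // Parses the XML and returns its character data. Every external entity,
    // including an external DTD, resolves to an empty stream, so nothing is
    // fetched from disk or the network (cf. TIKA-185).
    static String extractTextOffline(String xml) {
        final StringBuilder text = new StringBuilder();
        DefaultHandler handler = new DefaultHandler() {
            @Override
            public InputSource resolveEntity(String publicId, String systemId) {
                return new InputSource(new StringReader(""));
            }

            @Override
            public void characters(char[] ch, int start, int length) {
                text.append(ch, start, length);
            }
        };
        try {
            SAXParserFactory.newInstance().newSAXParser()
                    .parse(new InputSource(new StringReader(xml)), handler);
        } catch (ParserConfigurationException | SAXException | IOException e) {
            throw new IllegalStateException("XML parsing failed", e);
        }
        return replaceInvalidXmlChars(text.toString());
    }
}
```

With this sketch, a document whose DOCTYPE points at an unreachable DTD still parses, because the resolver never opens the referenced URL, and control characters such as U+0000 in extracted text come out as spaces.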