You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@tika.apache.org by ta...@apache.org on 2019/12/02 19:33:48 UTC
[tika] 01/06: Update CHANGES.txt for 1.23 release.
This is an automated email from the ASF dual-hosted git repository.
tallison pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/tika.git
commit 363c9c23b402451b4463461e38fb4114eda03e08
Author: tallison <ta...@apache.org>
AuthorDate: Tue Nov 26 14:46:25 2019 -0500
Update CHANGES.txt for 1.23 release.
# Conflicts:
# CHANGES.txt
---
CHANGES.txt | 12 ++++++++++--
1 file changed, 10 insertions(+), 2 deletions(-)
diff --git a/CHANGES.txt b/CHANGES.txt
index 6b532f0..3f0146d 100644
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@ -5,7 +5,11 @@ Release 2.0.0 - ???
Other changes
-Release 1.23
+Release 1.23 - 11/26/2019
+
+ * NOTE: The PDFParser now relies on OCRDPI to render page images when
+ users configure OCR on rendered page images. This will have the effect
+ of increasing rendered image size (TIKA-2624).
* NOTE: tika-server no longer returns 415 for file types for which there
is no parser.
@@ -29,6 +33,10 @@ Release 1.23
Release 1.22 - 07/29/2019
+ * NOTE: tika-server no longer hard-codes the HtmlParser to handle
+ XML files (TIKA-2910). Users must now configure that behavior
+ via a tika-config.xml file.
+
* NOTE: Known regression: PDFBOX-4587 -- PDF passwords with codepoints
between 0xF000 and 0XF0000 will cause an exception.
@@ -36,7 +44,7 @@ Release 1.22 - 07/29/2019
JinSup Kim (ddoleye) (TIKA-2909).
* Fix order of closing streams to avoid "Failed to close temporary resource"
- exception (TIKA-2908).
+ exception in TesseractOCRParser (TIKA-2908).
* Improve AutoDetectReader performance by caching encoding
detector (TIKA-1568).