You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@tika.apache.org by ta...@apache.org on 2019/05/29 15:23:10 UTC
[tika] branch branch_1x updated (2499e60 -> 8396952)
This is an automated email from the ASF dual-hosted git repository.
tallison pushed a change to branch branch_1x
in repository https://gitbox.apache.org/repos/asf/tika.git.
from 2499e60 TIKA-2876 -- unpacker resource should fill parsecontext and process pdf/tesseract config info
add 827abf4 TIKA-2878 -- update vulnerable jackson version
add 2be18e7 swap http -> https
add 8396952 TIKA-2883 -- improve recognition of leaving the RTF header
No new revisions were added by this update.
Summary of changes:
pom.xml | 4 ++--
tika-parent/pom.xml | 2 +-
.../org/apache/tika/parser/rtf/TextExtractor.java | 15 +++++++++++++--
.../org/apache/tika/parser/rtf/RTFParserTest.java | 21 +++++++++++++++++++++
.../tika/parser/ner/opennlp/ModelGetter.groovy | 2 +-
.../resources/test-documents/testRTFTIKA_1713.rtf | 1 +
.../resources/test-documents/testRTFTIKA_2150.rtf | 6 ++++++
.../resources/test-documents/testRTFTIKA_2500.rtf | 10 ++++++++++
.../resources/test-documents/testRTFTIKA_2883.rtf | Bin 0 -> 1526 bytes
9 files changed, 55 insertions(+), 6 deletions(-)
create mode 100644 tika-parsers/src/test/resources/test-documents/testRTFTIKA_1713.rtf
create mode 100644 tika-parsers/src/test/resources/test-documents/testRTFTIKA_2150.rtf
create mode 100644 tika-parsers/src/test/resources/test-documents/testRTFTIKA_2500.rtf
create mode 100644 tika-parsers/src/test/resources/test-documents/testRTFTIKA_2883.rtf