You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@tika.apache.org by ta...@apache.org on 2019/05/29 15:23:10 UTC

[tika] branch branch_1x updated (2499e60 -> 8396952)

This is an automated email from the ASF dual-hosted git repository.

tallison pushed a change to branch branch_1x
in repository https://gitbox.apache.org/repos/asf/tika.git.


    from 2499e60  TIKA-2876 -- unpacker resource should fill parsecontext and process pdf/tesseract config info
     add 827abf4  TIKA-2878 -- update vulnerable jackson version
     add 2be18e7  swap http -> https
     add 8396952  TIKA-2883 -- improve recognition of leaving the RTF header

No new revisions were added by this update.

Summary of changes:
 pom.xml                                            |   4 ++--
 tika-parent/pom.xml                                |   2 +-
 .../org/apache/tika/parser/rtf/TextExtractor.java  |  15 +++++++++++++--
 .../org/apache/tika/parser/rtf/RTFParserTest.java  |  21 +++++++++++++++++++++
 .../tika/parser/ner/opennlp/ModelGetter.groovy     |   2 +-
 .../resources/test-documents/testRTFTIKA_1713.rtf  |   1 +
 .../resources/test-documents/testRTFTIKA_2150.rtf  |   6 ++++++
 .../resources/test-documents/testRTFTIKA_2500.rtf  |  10 ++++++++++
 .../resources/test-documents/testRTFTIKA_2883.rtf  | Bin 0 -> 1526 bytes
 9 files changed, 55 insertions(+), 6 deletions(-)
 create mode 100644 tika-parsers/src/test/resources/test-documents/testRTFTIKA_1713.rtf
 create mode 100644 tika-parsers/src/test/resources/test-documents/testRTFTIKA_2150.rtf
 create mode 100644 tika-parsers/src/test/resources/test-documents/testRTFTIKA_2500.rtf
 create mode 100644 tika-parsers/src/test/resources/test-documents/testRTFTIKA_2883.rtf