You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@tika.apache.org by ta...@apache.org on 2024/03/28 11:12:33 UTC

(tika) branch TIKA-4207 updated (8cdaff4b3 -> a8e25cd1e)

This is an automated email from the ASF dual-hosted git repository.

tallison pushed a change to branch TIKA-4207
in repository https://gitbox.apache.org/repos/asf/tika.git


    from 8cdaff4b3 TIKA-4207 -- further refactorings to simplify class structure and bring back the default ParsingEmbeddedDocumentExtractor
     add fd23e6c27 Bump io.netty:netty-bom from 4.1.107.Final to 4.1.108.Final
     add d600259c5 Merge pull request #1677 from apache/dependabot/maven/io.netty-netty-bom-4.1.108.Final
     add 8e27e31a6 Bump com.google.cloud:google-cloud-storage from 2.36.0 to 2.36.1
     add a954511bd Merge pull request #1676 from apache/dependabot/maven/com.google.cloud-google-cloud-storage-2.36.1
     add a01e3edb4 Bump aws.version from 1.12.684 to 1.12.685
     add daad9b2b1 Merge pull request #1675 from apache/dependabot/maven/aws.version-1.12.685
     add 33ac40ccf TIKA-4166: update azure-storage-blob
     add f3f8404dd Bump commons-logging:commons-logging from 1.3.0 to 1.3.1
     add 449f8d192 Merge pull request #1683 from apache/dependabot/maven/commons-logging-commons-logging-1.3.1
     add fce53f9df Bump aws.version from 1.12.685 to 1.12.686
     add 27f1d87e5 Merge pull request #1682 from apache/dependabot/maven/aws.version-1.12.686
     add ba51ff3b6 Bump de.thetaphi:forbiddenapis from 3.6 to 3.7
     add 39b5c8a7b Merge pull request #1681 from apache/dependabot/maven/de.thetaphi-forbiddenapis-3.7
     add c51ab337d Bump org.ow2.asm:asm from 9.6 to 9.7
     add 40bf35574 Merge pull request #1680 from apache/dependabot/maven/org.ow2.asm-asm-9.7
     add b9ab4813e TIKA-4171 -- fix regression when field names are missing in the XFAExtractor (#1679)
     add a559906db TIKA-4219 -- improve epub handling of encrypted non-text-containing items (#1684)
     add 36e3ba8cd TIKA-4225 -- add detection for amf (#1688)
     add 3ffbc04f7 TIKA-4224 -- add detection for 3mf (#1689)
     add c5693624c TIKA-4222 -- add openscad glob (#1690)
     add b6bfe78d9 Bump aws.version from 1.12.686 to 1.12.687
     add 035c18461 Merge pull request #1692 from apache/dependabot/maven/aws.version-1.12.687
     add 9d45b69da TIKA-4223 -- add detection of stl (#1691)
     add e88be05ad TIKA-4219 -- clean up...do not include font names in main package
     add afc05ee4b Bump com.fasterxml.woodstox:woodstox-core from 6.6.1 to 6.6.2
     add e5511a043 Merge pull request #1693 from apache/dependabot/maven/com.fasterxml.woodstox-woodstox-core-6.6.2
     add 25badd98b Bump aws.version from 1.12.687 to 1.12.688
     add 07f1f4f24 Merge pull request #1694 from apache/dependabot/maven/aws.version-1.12.688
     add 1fb5b2622 Bump aws.version from 1.12.688 to 1.12.689
     add 4f5dff9a1 Merge pull request #1696 from apache/dependabot/maven/aws.version-1.12.689
     add f8c6750c9 Bump com.github.luben:zstd-jni from 1.5.5-11 to 1.5.6-1
     add b1f8e430f Merge pull request #1697 from apache/dependabot/maven/com.github.luben-zstd-jni-1.5.6-1
     new a8e25cd1e Merge remote-tracking branch 'origin/main' into TIKA-4207

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 .../org/apache/tika/mime/tika-mimetypes.xml        |  34 +++-
 .../java/org/apache/tika/TikaDetectionTest.java    |   2 +-
 tika-parent/pom.xml                                |  18 +-
 .../detect/microsoft/ooxml/OPCPackageDetector.java |  47 +++--
 .../apache/tika/parser/epub/EncryptionParser.java  |  88 ----------
 .../org/apache/tika/parser/epub/EpubParser.java    | 193 ++++++++++++++++-----
 .../org/apache/tika/parser/pdf/XFAExtractor.java   |   3 +
 .../org/apache/tika/parser/pdf/PDFParserTest.java  |   2 +-
 .../tika/detect/TestContainerAwareDetector.java    |   5 +
 .../java/org/apache/tika/mime/TestMimeTypes.java   |   6 +
 .../src/test/resources/test-documents/test3mf.3mf  | Bin 0 -> 28243 bytes
 .../resources/test-documents/testSTL-ascii.stl     |  16 ++
 .../resources/test-documents/testSTL-binary.stl    | Bin 0 -> 160 bytes
 13 files changed, 255 insertions(+), 159 deletions(-)
 delete mode 100644 tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-miscoffice-module/src/main/java/org/apache/tika/parser/epub/EncryptionParser.java
 create mode 100644 tika-parsers/tika-parsers-standard/tika-parsers-standard-package/src/test/resources/test-documents/test3mf.3mf
 create mode 100644 tika-parsers/tika-parsers-standard/tika-parsers-standard-package/src/test/resources/test-documents/testSTL-ascii.stl
 create mode 100644 tika-parsers/tika-parsers-standard/tika-parsers-standard-package/src/test/resources/test-documents/testSTL-binary.stl


(tika) 01/01: Merge remote-tracking branch 'origin/main' into TIKA-4207

Posted by ta...@apache.org.
This is an automated email from the ASF dual-hosted git repository.

tallison pushed a commit to branch TIKA-4207
in repository https://gitbox.apache.org/repos/asf/tika.git

commit a8e25cd1ed82aecccb12ecbe1fe5d74690d311e9
Merge: 8cdaff4b3 b1f8e430f
Author: tallison <ta...@apache.org>
AuthorDate: Thu Mar 28 07:12:12 2024 -0400

    Merge remote-tracking branch 'origin/main' into TIKA-4207

 .../org/apache/tika/mime/tika-mimetypes.xml        |  34 +++-
 .../java/org/apache/tika/TikaDetectionTest.java    |   2 +-
 tika-parent/pom.xml                                |  18 +-
 .../detect/microsoft/ooxml/OPCPackageDetector.java |  47 +++--
 .../apache/tika/parser/epub/EncryptionParser.java  |  88 ----------
 .../org/apache/tika/parser/epub/EpubParser.java    | 193 ++++++++++++++++-----
 .../org/apache/tika/parser/pdf/XFAExtractor.java   |   3 +
 .../org/apache/tika/parser/pdf/PDFParserTest.java  |   2 +-
 .../tika/detect/TestContainerAwareDetector.java    |   5 +
 .../java/org/apache/tika/mime/TestMimeTypes.java   |   6 +
 .../src/test/resources/test-documents/test3mf.3mf  | Bin 0 -> 28243 bytes
 .../resources/test-documents/testSTL-ascii.stl     |  16 ++
 .../resources/test-documents/testSTL-binary.stl    | Bin 0 -> 160 bytes
 13 files changed, 255 insertions(+), 159 deletions(-)