- [GitHub] [tika] robinderat commented on pull request #246: TIKA-2696 Add support for OSD output, contributed by @4U6U57 - posted by GitBox <gi...@apache.org> on 2021/11/01 15:14:42 UTC, 0 replies.
- [jira] [Commented] (TIKA-2696) Support output of Tesseract OSD output for psm mode 0 - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/11/01 15:15:00 UTC, 3 replies.
- [GitHub] [tika] tballison commented on pull request #246: TIKA-2696 Add support for OSD output, contributed by @4U6U57 - posted by GitBox <gi...@apache.org> on 2021/11/03 11:41:04 UTC, 2 replies.
- [jira] [Created] (TIKA-3587) Couldn't find setter: setStatus for object class org.apache.tika.server.core.TikaServerConfig - posted by "hillar aarelaid (Jira)" <ji...@apache.org> on 2021/11/03 13:51:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3164) Upgrade to POI 5.0.0 when available - posted by "PJ Fanning (Jira)" <ji...@apache.org> on 2021/11/03 15:23:00 UTC, 1 replies.
- [jira] [Created] (TIKA-3588) Fix thread starvation in PipesClient after numerous restarts - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/03 18:29:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3587) Couldn't find setter: setStatus for object class org.apache.tika.server.core.TikaServerConfig - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/08 14:19:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3587) Couldn't find setter: setStatus for object class org.apache.tika.server.core.TikaServerConfig - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/08 14:19:00 UTC, 0 replies.
- Virtual hands-on tika-eval workshop tomorrow - posted by Tim Allison <ta...@apache.org> on 2021/11/08 14:30:31 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3588) Fix thread starvation in PipesClient after numerous restarts - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/08 18:10:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3589) Two trivial bugs in tika-app's commandline interpretation for tika-batch - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/08 19:01:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3589) Two trivial bugs in tika-app's commandline interpretation for tika-batch - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/08 19:09:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3588) Fix thread starvation in PipesClient after numerous restarts - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/11/08 19:30:00 UTC, 1 replies.
- [jira] [Commented] (TIKA-3589) Two trivial bugs in tika-app's commandline interpretation for tika-batch - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/11/08 20:35:00 UTC, 0 replies.
- Proposed topics for next Tika meetups? - posted by Tim Allison <ta...@apache.org> on 2021/11/09 19:00:51 UTC, 7 replies.
- [jira] [Commented] (TIKA-3561) Tika throwing java.lang.OutOfMemoryError - posted by "Abha (Jira)" <ji...@apache.org> on 2021/11/10 20:01:00 UTC, 1 replies.
- [jira] [Created] (TIKA-3590) OSX DMG files wrong MIME type detection (wrong MediaType and Supertype) - posted by "Tetiana Tvardovska (Jira)" <ji...@apache.org> on 2021/11/11 16:45:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3590) OSX DMG files wrong MIME type detection (wrong MediaType and Supertype) - posted by "Tetiana Tvardovska (Jira)" <ji...@apache.org> on 2021/11/11 19:11:00 UTC, 0 replies.
- [GitHub] [tika] danielin917 opened a new pull request #455: [TIKA-860] Allow for AutoDetectParser to specify SecureContentHandler parameters. - posted by GitBox <gi...@apache.org> on 2021/11/12 21:51:33 UTC, 0 replies.
- [jira] [Commented] (TIKA-860) Make ZIP bomb detection configureable - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/11/12 21:52:00 UTC, 8 replies.
- [jira] [Created] (TIKA-3591) The Import-Package of commons.io is wrong in MANIFEST.MF - posted by "Per Kristian Söreide (Jira)" <ji...@apache.org> on 2021/11/16 10:42:00 UTC, 0 replies.
- JDK 18 Early-Access builds 23 are available - posted by da...@oracle.com on 2021/11/16 11:07:25 UTC, 0 replies.
- [jira] [Commented] (TIKA-3591) The Import-Package of commons.io is wrong in MANIFEST.MF - posted by "Per Kristian Söreide (Jira)" <ji...@apache.org> on 2021/11/16 11:50:00 UTC, 13 replies.
- [jira] [Created] (TIKA-3592) Need to add bcutil-jdk15on to embedded dependencies in tika-bundle-standard - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/16 15:26:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3592) Need to add bcutil-jdk15on to embedded dependencies in tika-bundle-standard - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/16 15:27:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3592) Need to add bcutil-jdk15on to embedded dependencies in tika-bundle-standard - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/16 15:27:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-3591) The Import-Package of commons.io is wrong in MANIFEST.MF - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/16 15:36:00 UTC, 3 replies.
- [GitHub] [tika] tballison commented on pull request #455: [TIKA-860] Allow for AutoDetectParser to specify SecureContentHandler parameters. - posted by GitBox <gi...@apache.org> on 2021/11/16 15:42:05 UTC, 1 replies.
- [jira] [Created] (TIKA-3593) Remove XMLParser from service loading in 2.x - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/16 15:46:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3593) Remove XMLParser from service loading in 2.x - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/16 15:48:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3590) OSX DMG files wrong MIME type detection (wrong MediaType and Supertype) - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/16 15:55:00 UTC, 3 replies.
- [GitHub] [tika] tballison merged pull request #453: [TIKA-3559] Add MIME type for .webmanifest files - posted by GitBox <gi...@apache.org> on 2021/11/16 16:51:45 UTC, 0 replies.
- [jira] [Commented] (TIKA-3559) Add MIME type for .webmanifest files - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/11/16 16:52:00 UTC, 1 replies.
- [GitHub] [tika] tballison commented on pull request #452: TIKA-3551 TikaConfig: unspecified attribute of "xml-reader-utils" breaks configuration file parser - posted by GitBox <gi...@apache.org> on 2021/11/16 16:52:19 UTC, 0 replies.
- [jira] [Commented] (TIKA-3551) TikaConfig: unspecified attribute of "xml-reader-utils" breaks configuration file parser - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/11/16 16:53:00 UTC, 4 replies.
- [jira] [Commented] (TIKA-3593) Remove XMLParser from service loading in 2.x - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/11/16 17:19:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3592) Need to add bcutil-jdk15on to embedded dependencies in tika-bundle-standard - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/11/16 17:19:00 UTC, 0 replies.
- [GitHub] [tika] danielin917 opened a new pull request #456: [TIKA-860] Allow for AutoDetectParser to specify SecureContentHandler parameters. - posted by GitBox <gi...@apache.org> on 2021/11/16 18:05:04 UTC, 0 replies.
- [GitHub] [tika] danielin917 commented on pull request #455: [TIKA-860] Allow for AutoDetectParser to specify SecureContentHandler parameters. - posted by GitBox <gi...@apache.org> on 2021/11/16 18:05:39 UTC, 1 replies.
- [GitHub] [tika] tballison merged pull request #456: [TIKA-860] Allow for AutoDetectParser to specify SecureContentHandler parameters. - posted by GitBox <gi...@apache.org> on 2021/11/16 19:27:32 UTC, 0 replies.
- [jira] [Updated] (TIKA-3594) Improve configurability of the SecureContentHandler - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/16 19:50:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3594) Improve configurability of the SecureContentHandler - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/16 19:50:00 UTC, 0 replies.
- [GitHub] [tika] danielin917 closed pull request #455: [TIKA-860] Allow for AutoDetectParser to specify SecureContentHandler parameters. - posted by GitBox <gi...@apache.org> on 2021/11/16 20:01:13 UTC, 0 replies.
- [GitHub] [tika] sebastian-nagel commented on pull request #452: TIKA-3551 TikaConfig: unspecified attribute of "xml-reader-utils" breaks configuration file parser - posted by GitBox <gi...@apache.org> on 2021/11/17 13:04:55 UTC, 0 replies.
- [jira] [Commented] (TIKA-3594) Improve configurability of the SecureContentHandler - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/11/17 16:17:00 UTC, 1 replies.
- [GitHub] [tika] tballison merged pull request #452: TIKA-3551 TikaConfig: unspecified attribute of "xml-reader-utils" breaks configuration file parser - posted by GitBox <gi...@apache.org> on 2021/11/17 16:44:54 UTC, 0 replies.
- Next 2.x release? - posted by Tim Allison <ta...@apache.org> on 2021/11/18 14:25:11 UTC, 0 replies.
- [jira] [Created] (TIKA-3595) Avoid importing embedded dependencies in tika-bundle-standard - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/18 14:51:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3595) Avoid importing embedded dependencies in tika-bundle-standard - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/18 15:41:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3595) Avoid importing embedded dependencies in tika-bundle-standard - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/11/18 16:36:00 UTC, 0 replies.
- [GitHub] [tika-docker] wjwilson-ibm commented on pull request #4: [TIKA-3417] Running tika-docker as non-root user - posted by GitBox <gi...@apache.org> on 2021/11/18 18:03:31 UTC, 1 replies.
- [jira] [Commented] (TIKA-3417) Running tika-docker as non-root user - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/11/18 18:04:00 UTC, 6 replies.
- [jira] [Resolved] (TIKA-3561) Tika throwing java.lang.OutOfMemoryError - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/18 18:57:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3575) Cannot use loadErrorHandler="ignore" in tika config - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/18 18:59:00 UTC, 1 replies.
- [jira] [Resolved] (TIKA-3575) Cannot use loadErrorHandler="ignore" in tika config - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/18 18:59:00 UTC, 0 replies.
- [GitHub] [tika-docker] dameikle commented on pull request #4: [TIKA-3417] Running tika-docker as non-root user - posted by GitBox <gi...@apache.org> on 2021/11/18 21:45:59 UTC, 1 replies.
- [GitHub] [tika-docker] dameikle merged pull request #4: [TIKA-3417] Running tika-docker as non-root user - posted by GitBox <gi...@apache.org> on 2021/11/18 21:51:42 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-3575) Cannot use loadErrorHandler="ignore" in tika config - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/18 23:26:00 UTC, 0 replies.
- Behavior unpack vs rmeta endpoints - posted by ju...@francelabs.com on 2021/11/19 10:31:35 UTC, 4 replies.
- [jira] [Comment Edited] (TIKA-3590) OSX DMG files wrong MIME type detection (wrong MediaType and Supertype) - posted by "Tetiana Tvardovska (Jira)" <ji...@apache.org> on 2021/11/19 16:54:00 UTC, 0 replies.
- [jira] [Assigned] (TIKA-3596) Detect corrupted XML files as application/xml instead of text/plain - posted by "Luís Filipe Nassif (Jira)" <ji...@apache.org> on 2021/11/20 19:24:00 UTC, 2 replies.
- [jira] [Created] (TIKA-3596) Detect corrupted XML files as application/xml instead of text/plain - posted by "Luís Filipe Nassif (Jira)" <ji...@apache.org> on 2021/11/20 19:24:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3596) Detect corrupted XML files as application/xml instead of text/plain - posted by "Luís Filipe Nassif (Jira)" <ji...@apache.org> on 2021/11/20 19:25:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3596) Detect corrupted XML files as application/xml instead of text/plain - posted by "Luís Filipe Nassif (Jira)" <ji...@apache.org> on 2021/11/20 19:26:00 UTC, 3 replies.
- [jira] [Created] (TIKA-3597) Tika Does Not Work As Documented in Gradle Project - posted by "John Midgley (Jira)" <ji...@apache.org> on 2021/11/22 18:27:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3598) The new external parser should write stdout to a tmp file if no output file pattern is found on the commandline - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/22 18:50:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3598) The new external parser should write stdout to a tmp file if no output file pattern is found on the commandline - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/22 18:51:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3598) The new external parser should write stdout to a tmp file if no output file pattern is found on the commandline - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/22 18:51:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3598) The new external parser should write stdout to a tmp file if no output file pattern is found on the commandline - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/11/22 19:16:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3597) Tika Does Not Work As Documented in Gradle Project - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/22 19:45:00 UTC, 1 replies.
- [jira] [Commented] (TIKA-3324) Add checkstyle checker - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/11/22 21:18:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3488) Security issue XXE in TIKA due to JDOM - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/11/22 23:17:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3599) Command line tika extracts encoding of file in eml - posted by "GIOELE PERIN (Jira)" <ji...@apache.org> on 2021/11/23 09:13:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-3596) Detect corrupted XML files as application/xml instead of text/plain - posted by "Luís Filipe Nassif (Jira)" <ji...@apache.org> on 2021/11/23 17:07:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3596) Detect truncated/bad encoded XML files as application/xml instead of text/plain - posted by "Luís Filipe Nassif (Jira)" <ji...@apache.org> on 2021/11/23 17:32:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3596) Detect truncated/bad encoded XML files as application/xml instead of text/plain - posted by "Luís Filipe Nassif (Jira)" <ji...@apache.org> on 2021/11/23 17:35:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3596) Detect truncated/bad encoded XML files as application/xml instead of text/plain - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/11/23 18:32:00 UTC, 1 replies.
- [GitHub] [tika] mahozad opened a new pull request #457: Fix some broken links in a javadoc - posted by GitBox <gi...@apache.org> on 2021/11/24 11:58:46 UTC, 0 replies.
- [jira] [Commented] (TIKA-3506) please fix multipile CVE in commons-compress for tika-parsers 1.x too - posted by "Colm O hEigeartaigh (Jira)" <ji...@apache.org> on 2021/11/24 12:56:00 UTC, 2 replies.
- [GitHub] [tika] tballison merged pull request #457: Fix some broken links in a javadoc - posted by GitBox <gi...@apache.org> on 2021/11/24 19:06:15 UTC, 0 replies.
- Fully embracing the java module system -- discussion on Lucene's jira - posted by Tim Allison <ta...@apache.org> on 2021/11/24 19:11:16 UTC, 0 replies.
- [jira] [Created] (TIKA-3600) Upgrade gson version in tika-app - posted by "Shubhangi Raut (Jira)" <ji...@apache.org> on 2021/11/24 23:00:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3601) Update JSoup to fix CVE-2021-37714 - posted by "Colm O hEigeartaigh (Jira)" <ji...@apache.org> on 2021/11/25 08:01:00 UTC, 0 replies.
- [GitHub] [tika] coheigea opened a new pull request #458: TIKA-3601 - Update JSoup to fix CVE-2021-37714 - posted by GitBox <gi...@apache.org> on 2021/11/25 08:05:44 UTC, 0 replies.
- [jira] [Commented] (TIKA-3601) Update JSoup to fix CVE-2021-37714 - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/11/25 08:06:00 UTC, 4 replies.
- [jira] [Updated] (TIKA-3600) Upgrade gson version in tika-app - posted by "Shubhangi Raut (Jira)" <ji...@apache.org> on 2021/11/25 08:15:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3602) Content Detection web page still recommends Metadata.RESOURCE_NAME_KEY - posted by "Adam Rauch (Jira)" <ji...@apache.org> on 2021/11/26 17:32:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2796) Update GoogleTranslator to use google-cloud-translate Java API - posted by "Lewis John McGibbney (Jira)" <ji...@apache.org> on 2021/11/28 14:38:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3602) Content Detection web page still recommends Metadata.RESOURCE_NAME_KEY - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/29 14:43:00 UTC, 0 replies.
- [GitHub] [tika] tballison merged pull request #458: TIKA-3601 - Update JSoup to fix CVE-2021-37714 - posted by GitBox <gi...@apache.org> on 2021/11/29 14:43:36 UTC, 0 replies.
- [jira] [Updated] (TIKA-3599) Command line tika extracts encoding of file in eml - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/29 14:59:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3599) Command line tika extracts encoding of file in eml - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/29 15:05:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2781) Numbering is removed from headings when converting docx - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/29 20:09:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-2781) Numbering is removed from headings when converting docx - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/29 20:10:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3602) Content Detection web page still recommends Metadata.RESOURCE_NAME_KEY - posted by "Adam Rauch (Jira)" <ji...@apache.org> on 2021/11/29 20:23:00 UTC, 1 replies.
- [jira] [Created] (TIKA-3603) Upgrade testcontainers and small cleanup in opensearch integration tests - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/30 20:46:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3603) Upgrade testcontainers and small cleanup in opensearch integration tests - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/11/30 20:50:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3603) Upgrade testcontainers and small cleanup in opensearch integration tests - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/11/30 22:30:00 UTC, 0 replies.