You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] [Commented] (TIKA-3751) General upgrades for 2.4.1 - posted by "Hudson (Jira)" <ji...@apache.org> on 2022/06/01 05:07:00 UTC, 8 replies.
- [jira] [Commented] (TIKA-3710) HTML document detected incorrect as message/rfc822 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/01 15:11:00 UTC, 3 replies.
- [jira] [Created] (TIKA-3783) Filename detection misses when a # and several . are in a filename - posted by "Alexander (Jira)" <ji...@apache.org> on 2022/06/02 06:06:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3783) Filename detection misses when a # and several . are in a filename - posted by "Alexander (Jira)" <ji...@apache.org> on 2022/06/02 07:22:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3710) HTML document detected incorrect as message/rfc822 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/02 09:54:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3783) Filename detection misses when a # and several . are in a filename - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/02 09:57:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3783) Filename detection misses when a # and several . are in a filename - posted by "Hudson (Jira)" <ji...@apache.org> on 2022/06/02 12:11:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3784) Detector return "application/x-x509-key" when scanning a .p12 file - posted by "Matthias Hofbauer (Jira)" <ji...@apache.org> on 2022/06/02 12:31:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3784) Detector returns "application/x-x509-key" when scanning a .p12 file - posted by "Matthias Hofbauer (Jira)" <ji...@apache.org> on 2022/06/02 13:32:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3784) Detector returns "application/x-x509-key" when scanning a .p12 file - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/02 16:09:00 UTC, 2 replies.
- [jira] [Updated] (TIKA-3782) Improve logging in pipes-iterator-jdbc - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/03 13:04:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3785) Align pipes-iterator-csv with pipes-iterator-jdbc - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/03 13:06:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3782) Improve logging in pipes-iterator-jdbc - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/03 13:20:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3785) Align pipes-iterator-csv with pipes-iterator-jdbc - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/03 13:20:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3785) Align pipes-iterator-csv with pipes-iterator-jdbc - posted by "Hudson (Jira)" <ji...@apache.org> on 2022/06/03 15:09:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3782) Improve logging in pipes-iterator-jdbc - posted by "Hudson (Jira)" <ji...@apache.org> on 2022/06/03 15:09:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3695) LimitingMetadataFilter - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/03 19:52:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3780) General upgrades for 1.28.4 - posted by "Hudson (Jira)" <ji...@apache.org> on 2022/06/03 20:26:00 UTC, 6 replies.
- [jira] [Commented] (TIKA-3768) message/rfc822 does not include Headers in extracted text - posted by "Nick Burch (Jira)" <ji...@apache.org> on 2022/06/05 14:17:00 UTC, 2 replies.
- [GitHub] [tika] dependabot[bot] opened a new pull request, #591: Bump jaxb-runtime from 2.3.6 to 4.0.0 - posted by GitBox <gi...@apache.org> on 2022/06/06 10:30:06 UTC, 0 replies.
- [GitHub] [tika] dependabot[bot] commented on pull request #532: Bump jaxb-runtime from 2.3.6 to 3.0.2 - posted by GitBox <gi...@apache.org> on 2022/06/06 10:30:14 UTC, 0 replies.
- [GitHub] [tika] dependabot[bot] closed pull request #532: Bump jaxb-runtime from 2.3.6 to 3.0.2 - posted by GitBox <gi...@apache.org> on 2022/06/06 10:30:14 UTC, 0 replies.
- Text extraction performance - posted by Tim Allison <ta...@apache.org> on 2022/06/06 18:44:52 UTC, 1 replies.
- [jira] [Created] (TIKA-3786) Allow pass-through of content-length to metadata in TikaResource - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/07 13:32:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3786) Allow pass-through of content-length to metadata in TikaResource - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/07 13:41:00 UTC, 1 replies.
- [jira] [Resolved] (TIKA-3786) Allow pass-through of content-length to metadata in TikaResource - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/07 14:11:00 UTC, 0 replies.
- next releases -- 2.4.1 and 1.28.4 - posted by Tim Allison <ta...@apache.org> on 2022/06/07 15:08:47 UTC, 5 replies.
- [jira] [Created] (TIKA-3787) Keep processing on write limit reached - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/07 15:50:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3788) Allow embedded exceptions and warnings to percolate to the parent's metadata - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/08 16:16:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3788) Allow embedded exceptions and warnings to percolate to the parent's metadata - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/08 20:35:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3787) Keep processing on write limit reached - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/08 20:35:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3788) Allow embedded exceptions and warnings to percolate to the parent's metadata - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/08 20:36:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3788) Allow embedded exceptions and warnings to percolate to the parent's metadata - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/08 20:39:00 UTC, 1 replies.
- [jira] [Commented] (TIKA-3787) Keep processing on write limit reached - posted by "Hudson (Jira)" <ji...@apache.org> on 2022/06/08 22:27:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3789) Allow parsers to pass embedded metadata to container file's metadata - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/09 12:43:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3789) Allow parsers to pass embedded metadata to container file's metadata - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/09 12:44:00 UTC, 1 replies.
- [jira] [Commented] (TIKA-3789) Allow parsers to pass embedded metadata to container file's metadata - posted by "Hudson (Jira)" <ji...@apache.org> on 2022/06/09 15:09:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3790) Actually implement tika server client - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/10 13:20:00 UTC, 0 replies.
- [jira] [Assigned] (TIKA-3779) Temp file leftover in PDFParser.parse() - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/11 10:53:00 UTC, 0 replies.
- JDK 19: Rampdown Phase 1 + EA builds 26 & JDK 20: EA builds 1 - posted by David Delabassee <da...@oracle.com> on 2022/06/13 13:41:17 UTC, 0 replies.
- [jira] [Created] (TIKA-3791) Implement bulk updates for opensearch emitter - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/13 14:10:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3791) Implement bulk updates for opensearch emitter - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/13 14:43:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3790) Actually implement tika server client - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/13 14:49:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3779) Temp file leftover in PDFParser.parse() - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/13 15:11:00 UTC, 2 replies.
- [jira] [Resolved] (TIKA-3779) Temp file leftover in PDFParser.parse() - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/13 15:46:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3779) Temp file leftover in PDFParser.parse() - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/13 15:46:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3780) General upgrades for 1.28.4 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/13 16:02:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3790) Actually implement tika server client - posted by "Hudson (Jira)" <ji...@apache.org> on 2022/06/13 16:15:00 UTC, 1 replies.
- [jira] [Commented] (TIKA-3791) Implement bulk updates for opensearch emitter - posted by "Hudson (Jira)" <ji...@apache.org> on 2022/06/13 16:15:00 UTC, 0 replies.
- [VOTE] Release Apache Tika 1.28.4 Candidate #1 - posted by Tim Allison <ta...@apache.org> on 2022/06/13 17:52:54 UTC, 3 replies.
- [jira] [Created] (TIKA-3792) AutoDetectParser should not decorate content handlers more than once - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/14 14:36:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3792) AutoDetectParser should not decorate content handlers more than once - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/14 16:10:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3736) Fix flaky Solr PipesIterator test - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/14 17:20:00 UTC, 0 replies.
- [VOTE] Release Apache Tika 2.4.1 Candidate #1 - posted by Tim Allison <ta...@apache.org> on 2022/06/14 17:45:45 UTC, 3 replies.
- [jira] [Commented] (TIKA-3792) AutoDetectParser should not decorate content handlers more than once - posted by "Hudson (Jira)" <ji...@apache.org> on 2022/06/14 18:15:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3479) UniversalCharsetDetector in 2.x is misidentifying windows-1250 as ISO-8859-1 - posted by "Ostico (Jira)" <ji...@apache.org> on 2022/06/16 16:56:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-3479) UniversalCharsetDetector in 2.x is misidentifying windows-1250 as ISO-8859-1 - posted by "Ostico (Jira)" <ji...@apache.org> on 2022/06/16 16:57:00 UTC, 7 replies.
- [RESULT][VOTE] Release Apache Tika 1.28.4 Candidate #1 - posted by Tim Allison <ta...@apache.org> on 2022/06/16 18:46:13 UTC, 3 replies.
- [jira] [Updated] (TIKA-3793) General upgrades for 1.28.5 - posted by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2022/06/17 17:42:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3793) General upgrades for 1.28.5 - posted by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2022/06/17 17:42:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3751) General upgrades for 2.4.1 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/17 18:10:00 UTC, 0 replies.
- [RESULT][VOTE] Release Apache Tika 2.4.1 Candidate #1 - posted by Tim Allison <ta...@apache.org> on 2022/06/17 18:11:04 UTC, 1 replies.
- [jira] [Commented] (TIKA-3793) General upgrades for 1.28.5 - posted by "Hudson (Jira)" <ji...@apache.org> on 2022/06/17 18:33:00 UTC, 3 replies.
- [jira] [Created] (TIKA-3794) ocrImageType is not configurable via headers in tika-server - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/17 20:44:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3795) General upgrades for 2.4.2 - posted by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2022/06/18 01:54:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3795) General upgrades for 2.4.2 - posted by "Hudson (Jira)" <ji...@apache.org> on 2022/06/18 03:29:00 UTC, 6 replies.
- [ANNOUNCE] Apache Tika 1.28.4 released - posted by Tim Allison <ta...@apache.org> on 2022/06/18 12:21:41 UTC, 0 replies.
- [ANNOUNCE] Apache Tika 2.4.1 released - posted by Tim Allison <ta...@apache.org> on 2022/06/18 12:23:39 UTC, 0 replies.
- Re: What may have changed in ODT parser in Tika 2 - posted by Sergey Beryozkin <sb...@gmail.com> on 2022/06/18 16:37:56 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3794) ocrImageType is not configurable via headers in tika-server - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/20 19:16:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3571) Add an interface for rendering engines - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/20 19:17:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3754) Allow customization of image graphics engine in PDFParser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/20 19:17:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3796) IncludeHeadersAndFooters is not being passed through via tika-config to the MSOffice parser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/20 20:13:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3796) IncludeHeadersAndFooters is not being passed through via tika-config to the MSOffice parser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/20 20:17:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3794) ocrImageType is not configurable via headers in tika-server - posted by "Hudson (Jira)" <ji...@apache.org> on 2022/06/20 21:23:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3796) IncludeHeadersAndFooters is not being passed through via tika-config to the MSOffice parser - posted by "Hudson (Jira)" <ji...@apache.org> on 2022/06/20 22:45:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3797) Tika's ServiceLoader should ignore duplicate classes - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/21 16:44:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3797) Tika's ServiceLoader should ignore duplicate classes - posted by "Hudson (Jira)" <ji...@apache.org> on 2022/06/21 19:20:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3798) Tika hangs up with some RAR archives - posted by "Mikhail Gushinets (Jira)" <ji...@apache.org> on 2022/06/22 04:44:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3798) Tika hangs up with some RAR archives - posted by "Nick Burch (Jira)" <ji...@apache.org> on 2022/06/22 08:43:00 UTC, 9 replies.
- [jira] [Updated] (TIKA-3798) Tika hangs up with some RAR archives - posted by "Mikhail Gushinets (Jira)" <ji...@apache.org> on 2022/06/22 08:53:00 UTC, 1 replies.
- [jira] [Comment Edited] (TIKA-3798) Tika hangs up with some RAR archives - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/22 14:19:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3799) Refactor FuzzingCLI to use PipesParser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/22 14:51:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3799) Refactor FuzzingCLI to use PipesParser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/22 14:52:00 UTC, 1 replies.
- [jira] [Resolved] (TIKA-3799) Refactor FuzzingCLI to use PipesParser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/22 21:46:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3800) Consider wrapping 'unrar' commandline executable as a parser to handle rar v5 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/23 19:12:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3800) Consider wrapping 'unrar' commandline executable as a parser to handle rar v5 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/23 19:14:00 UTC, 0 replies.
- remove jdk14 build from ci - posted by Tilman Hausherr <TH...@t-online.de> on 2022/06/24 03:29:04 UTC, 1 replies.
- [jira] [Resolved] (TIKA-3800) Consider wrapping 'unrar' commandline executable as a parser to handle rar v5 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/24 21:45:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3800) Consider wrapping 'unrar' commandline executable as a parser to handle rar v5 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/24 21:46:00 UTC, 1 replies.
- [FINAL CALL] - Travel Assistance to ApacheCon New Orleans 2022 - posted by Gavin McDonald <gm...@apache.org> on 2022/06/27 08:01:21 UTC, 0 replies.
- Pdf Parse - posted by Muhammad Shahzeb Ali <mu...@purelogics.net> on 2022/06/27 11:45:42 UTC, 1 replies.
- [jira] [Created] (TIKA-3801) Integrate unrar and junrar parsers - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/27 15:33:00 UTC, 0 replies.
- CVE-2022-33879: Apache Tika: Incomplete fix and new regex DoS in StandardsExtractingContentHandler - posted by Tim Allison <ta...@apache.org> on 2022/06/27 20:30:57 UTC, 0 replies.
- Re: regarding the data bank of test PDF files (pdfs_202011) . . . - posted by Tim Allison <ta...@apache.org> on 2022/06/29 10:20:28 UTC, 0 replies.
- [jira] [Created] (TIKA-3806) Remove deprecated ContentHandlerDecoratorFactory method in 2.5 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/29 18:13:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3807) Improve configurability of content handlers - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2022/06/30 18:53:00 UTC, 0 replies.