- Re: 1.27? - posted by Tim Allison <ta...@apache.org> on 2021/07/01 15:30:09 UTC, 1 replies.
- [GitHub] [tika] tballison commented on a change in pull request #446: [TIKA-3418] DefaultZipContainerDetector does not support loading of ZipContainerDetectors in an OSGi enviroment - posted by GitBox <gi...@apache.org> on 2021/07/02 16:33:07 UTC, 1 replies.
- [jira] [Commented] (TIKA-3418) DefaultZipContainerDetector does not support loading of ZipContainerDetectors in an OSGi enviroment - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/07/02 16:34:00 UTC, 1 replies.
- [GitHub] [tika] tballison closed pull request #335: TIKA-307 truncated zip - posted by GitBox <gi...@apache.org> on 2021/07/02 17:02:51 UTC, 0 replies.
- [GitHub] [tika] tballison commented on pull request #335: TIKA-307 truncated zip - posted by GitBox <gi...@apache.org> on 2021/07/02 17:02:55 UTC, 0 replies.
- [jira] [Commented] (TIKA-307) Better handling of partial/truncated input data to parsers - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/07/02 17:03:00 UTC, 2 replies.
- Re: [VOTE] Release Apache Tika 1.27 Candidate #1 - posted by Tilman Hausherr <TH...@t-online.de> on 2021/07/02 18:21:33 UTC, 3 replies.
- [jira] [Resolved] (TIKA-3440) Add emitter for OpenSearch - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/02 20:48:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3461) Create sub modules in tika-pipes-integration tests - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/02 20:49:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3440) Add emitter for OpenSearch - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/02 21:57:00 UTC, 0 replies.
- [RESULT][VOTE] Release Apache Tika 1.27 Candidate #1 - posted by Tim Allison <ta...@apache.org> on 2021/07/06 11:07:06 UTC, 0 replies.
- [jira] [Updated] (TIKA-3400) Use equals for Object and String Comparison Instead of == - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/06 14:46:00 UTC, 1 replies.
- [jira] [Updated] (TIKA-3368) Add Bill of Materials (BOM) artifact (Tika 1.x) - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/06 14:46:00 UTC, 1 replies.
- [ANNOUNCE] Apache Tika 1.27 released - posted by Tim Allison <ta...@apache.org> on 2021/07/06 15:20:10 UTC, 0 replies.
- Re: 2.0.0? - posted by Tim Allison <ta...@apache.org> on 2021/07/06 15:52:35 UTC, 4 replies.
- refactoring tika-pipes integration tests -- TIKA-3461 - posted by Tim Allison <ta...@apache.org> on 2021/07/06 15:54:45 UTC, 1 replies.
- [jira] [Updated] (TIKA-3385) POST to /tika/form endpoint on tika-server fails on Java11 - posted by "Gary Taylor (Jira)" <ji...@apache.org> on 2021/07/07 10:48:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3385) POST to /tika/form endpoint on tika-server fails on Java11 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/07 13:28:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3385) POST to /tika/form endpoint on tika-server fails on Java11 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/07 13:28:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3462) Clean up module names - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/07 14:52:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3463) Add FileListIterator as a pipes-iterator - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/07 14:54:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3464) Is it possible to extract individual pdf pages using Tika Server? - posted by "Sal (Jira)" <ji...@apache.org> on 2021/07/07 15:07:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3464) Is it possible to extract individual pdf pages using Tika Server? - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/07 15:12:00 UTC, 5 replies.
- [jira] [Resolved] (TIKA-3462) Clean up module names - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/07 15:14:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3465) Move FileSystemPipesIterator to o.a.t.pipes.fs name space - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/07 15:27:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3465) Move FileSystemPipesIterator to o.a.t.pipes.pipesiterator.fs name space - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/07 15:29:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3466) Cannot detect mimetype of xhtml file when script is first node instead of html - posted by "Packiaraj Sakkanan (Jira)" <ji...@apache.org> on 2021/07/07 15:47:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3466) Cannot detect mimetype of xhtml file when script is first node instead of html - posted by "Packiaraj Sakkanan (Jira)" <ji...@apache.org> on 2021/07/07 15:49:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3461) Create sub modules in tika-pipes-integration tests - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/07 15:57:00 UTC, 1 replies.
- [jira] [Closed] (TIKA-3464) Is it possible to extract individual pdf pages using Tika Server? - posted by "Sal (Jira)" <ji...@apache.org> on 2021/07/07 16:03:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3466) Cannot detect mimetype of xhtml file when script is first node instead of html - posted by "Nick Burch (Jira)" <ji...@apache.org> on 2021/07/07 16:42:00 UTC, 13 replies.
- [jira] [Commented] (TIKA-3465) Move FileSystemPipesIterator to o.a.t.pipes.pipesiterator.fs name space - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/07 17:04:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3462) Clean up module names - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/07 17:04:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3463) Add FileListIterator as a pipes-iterator - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/07 18:26:00 UTC, 1 replies.
- [jira] [Resolved] (TIKA-3461) Create sub modules in tika-pipes-integration tests - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/07 18:27:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3463) Add FileListIterator as a pipes-iterator - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/07 18:27:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3462) Clean up module names - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/07 18:28:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-3466) Cannot detect mimetype of xhtml file when script is first node instead of html - posted by "Packiaraj Sakkanan (Jira)" <ji...@apache.org> on 2021/07/07 19:07:00 UTC, 4 replies.
- [jira] [Created] (TIKA-3467) Clean up poms in main - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/07 20:01:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3463) Add FileListIterator as a pipes-iterator - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/07 20:13:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3468) Add java 11 github action for main - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/07 21:50:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3467) Clean up poms in main - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/07 21:54:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3467) Clean up poms in main - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/07 21:55:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2680) Email attachments to an email are not extracted - posted by "Abha (Jira)" <ji...@apache.org> on 2021/07/07 22:50:00 UTC, 2 replies.
- [jira] [Commented] (TIKA-3468) Add java 11 github action for main - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/07 23:51:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3469) Add 'ready' ping to forked pipes processor - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/08 13:21:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3469) Consume bytes until 'ready' ping to forked pipes processor - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/08 13:36:00 UTC, 2 replies.
- [jira] [Resolved] (TIKA-3469) Consume bytes until 'ready' ping to forked pipes processor - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/08 13:50:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3470) Push jpeg2000 warning to trigger only when necessary - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/08 14:25:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3469) Consume bytes until 'ready' ping to forked pipes processor - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/08 14:54:00 UTC, 1 replies.
- [jira] [Commented] (TIKA-3470) Push jpeg2000 warning to trigger only when necessary - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/08 15:04:00 UTC, 1 replies.
- [jira] [Resolved] (TIKA-3470) Push jpeg2000 warning to trigger only when necessary - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/08 15:24:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3471) Some ideas - posted by "Ivan Radovanovic (Jira)" <ji...@apache.org> on 2021/07/09 12:04:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2680) Email attachments to an email are not extracted - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/09 14:19:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3472) SimpleDateFormat is not threadsafe - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/09 15:04:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3471) Some ideas - posted by "Kenneth William Krugler (Jira)" <ji...@apache.org> on 2021/07/09 20:56:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3471) Some ideas - posted by "Kenneth William Krugler (Jira)" <ji...@apache.org> on 2021/07/09 20:56:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3473) Upgrade OpenSearch -- 1.0 GA is now available - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/13 13:44:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3474) tika-eval in 2.x should handle the exception key from 1.x - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/13 13:46:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3475) General upgrades for 2.0.0 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/13 13:51:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3474) tika-eval in 2.x should handle the exception key from 1.x - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/13 14:05:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3473) Upgrade OpenSearch -- 1.0 GA is now available - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/13 14:05:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3475) General upgrades for 2.0.0 - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/13 15:57:00 UTC, 1 replies.
- [jira] [Commented] (TIKA-3473) Upgrade OpenSearch -- 1.0 GA is now available - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/13 15:57:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3472) SimpleDateFormat is not threadsafe - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/13 15:57:00 UTC, 1 replies.
- [jira] [Commented] (TIKA-3474) tika-eval in 2.x should handle the exception key from 1.x - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/13 15:57:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3476) Remove tag reports from default tika-eval reports - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/13 20:14:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3476) Remove tag reports from default tika-eval reports - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/13 20:25:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3477) Fix new closed channel exception in MSOffice files in 2.x - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/13 20:27:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3476) Remove tag reports from default tika-eval reports - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/13 21:57:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3477) Fix new closed channel exception in MSOffice files in 2.x - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/14 14:39:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3478) Extract - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/14 14:39:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3478) Extract "desc" metadata field from AppleUserBox in MP4 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/14 14:40:00 UTC, 4 replies.
- [jira] [Created] (TIKA-3479) UniversalCharsetDetector in 2.x is misidentifying windows-1252(?) as ISO-8859-1 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/14 15:18:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3479) UniversalCharsetDetector in 2.x is misidentifying windows-1252(?) as ISO-8859-1 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/14 15:18:00 UTC, 1 replies.
- [jira] [Updated] (TIKA-3479) UniversalCharsetDetector in 2.x is misidentifying windows-1250 as ISO-8859-1 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/14 15:25:00 UTC, 2 replies.
- [jira] [Commented] (TIKA-3479) UniversalCharsetDetector in 2.x is misidentifying windows-1250 as ISO-8859-1 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/14 15:40:00 UTC, 4 replies.
- [jira] [Resolved] (TIKA-3478) Extract "desc" metadata field from AppleUserBox in MP4 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/14 15:43:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-3479) UniversalCharsetDetector in 2.x is misidentifying windows-1250 as ISO-8859-1 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/14 16:15:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3477) Fix new closed channel exception in MSOffice files in 2.x - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/14 17:50:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3478) Extract "desc" metadata field from AppleUserBox in MP4 - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/14 17:50:00 UTC, 1 replies.
- [VOTE] Release Apache Tika 2.0.0 Candidate #1 - posted by Tim Allison <ta...@apache.org> on 2021/07/14 18:16:21 UTC, 4 replies.
- [jira] [Resolved] (TIKA-3472) SimpleDateFormat is not threadsafe - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/14 20:42:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3480) Add pipesClientId to pipes forked process for better logging - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/15 16:37:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3481) sqlite3 module should rely on tika-core:provided - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/15 16:43:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3480) Add pipesClientId to pipes forked process for better logging - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/15 16:43:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3481) sqlite3 module should rely on tika-core:provided - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/15 16:45:00 UTC, 1 replies.
- [jira] [Commented] (TIKA-3481) sqlite3 module should rely on tika-core:provided - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/15 17:57:00 UTC, 1 replies.
- [jira] [Commented] (TIKA-3480) Add pipesClientId to pipes forked process for better logging - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/15 17:58:00 UTC, 0 replies.
- JDK 17 is now in Rampdown Phase Two - posted by Rory O'Donnell <ro...@oracle.com> on 2021/07/15 19:33:08 UTC, 0 replies.
- [jira] [Created] (TIKA-3482) Improve handling of FetchException in pipes processor - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/15 22:09:00 UTC, 0 replies.
- [GitHub] [tika-helm] bynare opened a new pull request #5: Implement a network policy - posted by GitBox <gi...@apache.org> on 2021/07/16 02:55:45 UTC, 0 replies.
- [jira] [Created] (TIKA-3483) Implement a network policy - posted by "Lewis John McGibbney (Jira)" <ji...@apache.org> on 2021/07/16 15:50:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3483) Implement a network policy for Helm Chart - posted by "Lewis John McGibbney (Jira)" <ji...@apache.org> on 2021/07/16 15:51:00 UTC, 1 replies.
- [GitHub] [tika-helm] lewismc commented on pull request #5: [TIKA-3483] Implement a network policy for Helm Chart - posted by GitBox <gi...@apache.org> on 2021/07/16 15:51:37 UTC, 0 replies.
- [jira] [Commented] (TIKA-3483) Implement a network policy for Helm Chart - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/07/16 15:52:00 UTC, 5 replies.
- [GitHub] [tika-helm] lewismc commented on a change in pull request #5: [TIKA-3483] Implement a network policy for Helm Chart - posted by GitBox <gi...@apache.org> on 2021/07/16 15:55:01 UTC, 0 replies.
- [GitHub] [tika-helm] lewismc edited a comment on pull request #5: [TIKA-3483] Implement a network policy for Helm Chart - posted by GitBox <gi...@apache.org> on 2021/07/16 15:56:22 UTC, 0 replies.
- [jira] [Created] (TIKA-3484) TikaPipesOpenSearchTest: java.lang.IllegalArgumentException: "basePath" directory does not exist - posted by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2021/07/16 17:45:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3484) TikaPipesOpenSearchTest: java.lang.IllegalArgumentException: "basePath" directory does not exist - posted by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2021/07/16 18:01:00 UTC, 0 replies.
- [GitHub] [tika] peterkronenberg opened a new pull request #447: TIKA-3361 Make ocrStrategy=Auto more intelligent - posted by GitBox <gi...@apache.org> on 2021/07/16 18:13:25 UTC, 0 replies.
- [jira] [Commented] (TIKA-3361) Improve intelligence of OCRStrategy=AUTO - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/07/16 18:14:00 UTC, 3 replies.
- [jira] [Created] (TIKA-3485) testBadJVMArgs fails on Windows - posted by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2021/07/16 18:54:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3485) testBadJVMArgs fails on Windows - posted by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2021/07/16 18:55:00 UTC, 2 replies.
- [jira] [Commented] (TIKA-3482) Improve handling of FetchException in pipes processor - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/16 19:35:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3484) TikaPipesOpenSearchTest: java.lang.IllegalArgumentException: "basePath" directory does not exist - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/16 19:35:00 UTC, 0 replies.
- [GitHub] [tika] tballison merged pull request #447: TIKA-3361 Make ocrStrategy=Auto more intelligent - posted by GitBox <gi...@apache.org> on 2021/07/16 19:42:52 UTC, 0 replies.
- [jira] [Commented] (TIKA-3485) testBadJVMArgs fails on Windows - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/16 20:57:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3361) Improve intelligence of OCRStrategy=AUTO - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/16 21:21:00 UTC, 0 replies.
- [RESULT][VOTE] Release Apache Tika 2.0.0 Candidate #1 - posted by Tim Allison <ta...@apache.org> on 2021/07/19 11:19:20 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3475) General upgrades for 2.0.0 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/19 11:57:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3410) Clean up logging in PipesServer - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/19 11:59:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3297) Simplify parser configuration in 2.x - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/19 11:59:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3253) improve "attachments" tika-eval report directory - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/19 12:00:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2943) Modularize tika-parsers - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/19 12:00:05 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2944) TikaConfig should support the parameters without XML type attribute - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/19 12:01:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2385) Tesseract OCR rotation.py not run - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/19 12:03:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1706) Bring back commons-io to tika-core - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/19 12:04:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3486) Update miredot key for 2.0.0 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/19 12:21:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-1706) Bring back commons-io to tika-core - posted by "Yaniv Kunda (Jira)" <ji...@apache.org> on 2021/07/19 15:18:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3487) Timezones inappropriately set to GMT - posted by "Peter Winckles (Jira)" <ji...@apache.org> on 2021/07/19 17:04:00 UTC, 0 replies.
- [GitHub] [tika-helm] bynare commented on a change in pull request #5: [TIKA-3483] Implement a network policy for Helm Chart - posted by GitBox <gi...@apache.org> on 2021/07/20 01:22:23 UTC, 2 replies.
- [jira] [Commented] (TIKA-3224) Stackoverflow with Embedded PDF in DOCX document - posted by "David Pilato (Jira)" <ji...@apache.org> on 2021/07/20 14:05:00 UTC, 3 replies.
- [jira] [Resolved] (TIKA-3224) Stackoverflow with Embedded PDF in DOCX document - posted by "David Pilato (Jira)" <ji...@apache.org> on 2021/07/20 15:16:00 UTC, 0 replies.
- [ANNOUNCE] Apache Tika 2.0.0 released - posted by Tim Allison <ta...@apache.org> on 2021/07/20 18:49:41 UTC, 0 replies.
- [jira] [Created] (TIKA-3488) Security issue XXE in TIKA due to JDOM - posted by "Arvind Jagtap (Jira)" <ji...@apache.org> on 2021/07/21 11:19:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3489) Robots.txt files frequently identified as message/rfc822 - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2021/07/21 13:25:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3489) Robots.txt files frequently identified as message/rfc822 - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2021/07/21 13:41:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3489) Robots.txt files frequently identified as message/rfc822 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 13:50:00 UTC, 8 replies.
- [jira] [Commented] (TIKA-3153) Text File identified as message/rfc822 - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2021/07/21 13:58:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2443) Plain text file identified as rfc822 and which can cause StackOverflowError - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2021/07/21 14:01:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2443) Plain text file identified as rfc822 and which can cause StackOverflowError - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 14:44:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3153) Text File identified as message/rfc822 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 14:45:00 UTC, 0 replies.
- Interesting PDF on stackoverflow - posted by Tim Allison <ta...@apache.org> on 2021/07/21 16:21:45 UTC, 1 replies.
- [jira] [Updated] (TIKA-3490) Fix serialization in opensearch emitter for embedded documents - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:13:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3490) Fix serialization in opensearch emitter for embedded documents - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:13:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3104) Detection of memgraph files exported from Xcode - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-3348) Improve the workflow for extracting and returning images from PDFs and other containers using Tika Server.. - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-3270) Render non-text in PDFs for OCR - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-3454) Facilitate configuration of translation and transcription impls in tika-server/tika-docker/tika-helm - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-3003) Remove unused dependencies - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-3452) java.nio.file.FileSystemException Read-only file system in 2.0.0-BETA tika-docker - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-3314) Treat soft hyphens like hyphens - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-3404) Rearchitect GoogleTranslator to use https://github.com/googleapis/java-translate - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-3420) Set tesseract ocr langauges as docker build args - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-3367) Add Bill of Materials (BOM) artifact - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:07 UTC, 1 replies.
- [jira] [Updated] (TIKA-2758) Possible error charset detection - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-2945) AutoDetectParser should skip the content type detection if Metadata already has it - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-2596) Make PDF2XHTML and AbstractPDF2XHTML public classes - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-2369) Define a clean Recogniser interface: for objects from binary data; and for text classification - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:08 UTC, 1 replies.
- [jira] [Updated] (TIKA-2623) get embedded resources in PDF/doc files - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-2711) When parsing a UNIX text file apostrophes are rendered as ? - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-2558) Add a new pid api to Tika - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-2794) Tika extracts text from pdf on MacBook, but not windows server., - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-2542) Support in tika-server for getting plain text and metadata at the same time - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-2492) Remove pdfdebugger from tika - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-2720) A parser to output universal sentence encodings to text - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-2611) Tika mistakenly determines mimetype of .js file as application/x-elc - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-2701) Text is not extracted properly from WMF files - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-2565) Upgrade edu.ucar dependencies to 4.6.11 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-2946) Review how TikaConfig can avoid parsing XML itself - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-2346) Allow Office format parsers to exclude parsing shapes - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:08 UTC, 1 replies.
- [jira] [Updated] (TIKA-2639) Update freedesktop.org shared-mime-info-spec hyperlink in MimeTypesReader.java - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-2796) Update GoogleTranslator to use google-cloud-translate Java API - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1724) Create parser for .obo file format. - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:09 UTC, 1 replies.
- [jira] [Updated] (TIKA-1808) Head section closed too eager - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:09 UTC, 1 replies.
- [jira] [Updated] (TIKA-1709) Tika Server doesn't handle multi-part attachments or form-encoded inputs - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:09 UTC, 2 replies.
- [jira] [Updated] (TIKA-1738) ForkClient does not always delete temporary bootstrap jar - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:09 UTC, 1 replies.
- [jira] [Updated] (TIKA-1953) tika-server NullPointerException while processing rtfs - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:09 UTC, 1 replies.
- [jira] [Updated] (TIKA-1829) org.apache.tika.parser.ocr.TesseractOCRParser.getSupportedTypes(TesseractOCRParser.java:92) NPE - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:09 UTC, 1 replies.
- [jira] [Updated] (TIKA-1800) MediaType#parse does not decode escaped special characters - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:09 UTC, 1 replies.
- [jira] [Updated] (TIKA-1988) Age Detection Tika Recogniser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:09 UTC, 1 replies.
- [jira] [Updated] (TIKA-2071) Tika 2.0 - DefaultParser and CompositeParser does not filter excludedParsers from dynamic ServiceLoader Parsers - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:09 UTC, 2 replies.
- [jira] [Updated] (TIKA-1840) No way to link slide notes to slide in PPT output. - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:09 UTC, 1 replies.
- [jira] [Updated] (TIKA-2340) Add explicit deps to tika-parsers which are currently used from transitive scope - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:09 UTC, 1 replies.
- [jira] [Updated] (TIKA-2312) [Mp3Parser] expose fields form ID3TagsAndAudio - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:09 UTC, 1 replies.
- [jira] [Updated] (TIKA-1952) Access Date is getting modified while capturing the MetaData information using AutoDetectParser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:09 UTC, 1 replies.
- [jira] [Updated] (TIKA-1616) Tika Parser for GIBS Metadata - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:10 UTC, 1 replies.
- [jira] [Updated] (TIKA-1688) Tika Version in Metadata - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:10 UTC, 1 replies.
- [jira] [Updated] (TIKA-1540) New Tika plugin for image based feature extraction using computer vision techniques - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:10 UTC, 1 replies.
- [jira] [Updated] (TIKA-1518) Docker with Tika Server - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:10 UTC, 1 replies.
- [jira] [Updated] (TIKA-1672) Integrate tika-java7 component - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:10 UTC, 1 replies.
- [jira] [Updated] (TIKA-1697) Parser Implementation for AkomaNtoso Legal XML Documents - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:10 UTC, 1 replies.
- [jira] [Updated] (TIKA-1609) Leverage Google's LibPhonenumber for enhanced phone number extraction and metadata modeling - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:10 UTC, 1 replies.
- [jira] [Updated] (TIKA-1607) Introduce new arbitrary object key/values data structure for persistence of Tika Metadata - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:10 UTC, 1 replies.
- [jira] [Updated] (TIKA-1705) Update ASM dependency to 5.0.4 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:10 UTC, 1 replies.
- [jira] [Updated] (TIKA-1640) Make ExternalParser support aliases for key names in extracted metadata - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:10 UTC, 1 replies.
- [jira] [Updated] (TIKA-1674) Add example to show how to extract embedded files - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:10 UTC, 1 replies.
- [jira] [Updated] (TIKA-1577) NetCDF Data Extraction - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:10 UTC, 1 replies.
- [jira] [Updated] (TIKA-1598) Parser Implementation for Streaming Video - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:10 UTC, 1 replies.
- [jira] [Updated] (TIKA-1465) Implement extraction of non-global variables from netCDF3 and netCDF4 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:11 UTC, 1 replies.
- [jira] [Updated] (TIKA-1505) chmparser breaks down when extracting from file of CHM format v3 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:11 UTC, 1 replies.
- [jira] [Updated] (TIKA-1395) Create embedded image extraction example - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:11 UTC, 1 replies.
- [jira] [Updated] (TIKA-1454) Extracting as HTML loses links in xlsx, ppt, and pptx files - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:11 UTC, 1 replies.
- [jira] [Updated] (TIKA-1425) Automatic batching of Microsoft service calls - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:11 UTC, 1 replies.
- [jira] [Updated] (TIKA-1436) improvement to PDFParser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:11 UTC, 1 replies.
- [jira] [Updated] (TIKA-1456) Visual Sentiment API parser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:11 UTC, 1 replies.
- [jira] [Updated] (TIKA-1417) Create Extract Embedded Images from PDFs Example - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:11 UTC, 1 replies.
- [jira] [Updated] (TIKA-1308) Support in memory parse mode(don't create temp file): to support run Tika in GAE - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:12 UTC, 1 replies.
- [jira] [Updated] (TIKA-1390) Create tika-example module - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:12 UTC, 1 replies.
- [jira] [Updated] (TIKA-1318) Use of Deprecated Word6Extractor.getParagraphText() Method - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:12 UTC, 1 replies.
- [jira] [Updated] (TIKA-1329) Add RecursiveParserWrapper aka Jukka's (and Nick's) RecursiveMetadataParser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:12 UTC, 1 replies.
- [jira] [Updated] (TIKA-1366) Update some of Tika Server services to support JAX-RS 2.0 AsyncResponse - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:12 UTC, 1 replies.
- [jira] [Updated] (TIKA-1328) Translate Metadata and Content - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:12 UTC, 1 replies.
- [jira] [Updated] (TIKA-1379) error in Tika().detect for xml files with xades signature - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:12 UTC, 1 replies.
- [jira] [Updated] (TIKA-1295) Make some Dublin Core items multi-valued - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:13 UTC, 1 replies.
- [jira] [Updated] (TIKA-1220) Parser implementation for IFC files - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:13 UTC, 1 replies.
- [jira] [Updated] (TIKA-1276) Missing embedded dependencies in tika-bundle - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:14:13 UTC, 1 replies.
- [jira] [Updated] (TIKA-987) Embedded drawing (SHAPE MERGEFORMAT) sometimes not extracted - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:15:04 UTC, 1 replies.
- [jira] [Updated] (TIKA-819) Make Option to Exclude Embedded Files' Text for Text Content - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:15:04 UTC, 1 replies.
- [jira] [Updated] (TIKA-1208) Migrate Any23 mime contributions to Tika - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:15:04 UTC, 1 replies.
- [jira] [Updated] (TIKA-974) No longer return charset info in Metadata's CONTENT_ENCODING - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:15:04 UTC, 1 replies.
- [jira] [Updated] (TIKA-988) We don't extract a placeholder for a Word document embedded in an Excel document - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:15:04 UTC, 1 replies.
- [jira] [Updated] (TIKA-980) MicrodataContentHandler for Apache Tika - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:15:04 UTC, 1 replies.
- [jira] [Updated] (TIKA-928) Separation of Tika Core Properties From Metadata Processing - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:15:04 UTC, 1 replies.
- [jira] [Updated] (TIKA-894) Add webapp mode for Tika Server, simplifies deployment - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:15:04 UTC, 1 replies.
- [jira] [Updated] (TIKA-1059) Better Handling of InterruptedException in ExternalParser and ExternalEmbedder - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:15:04 UTC, 1 replies.
- [jira] [Updated] (TIKA-985) Support for HTML5 elements - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:15:04 UTC, 1 replies.
- [jira] [Updated] (TIKA-776) ExifTool Embedder - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:15:04 UTC, 1 replies.
- [jira] [Updated] (TIKA-891) Use POST in addition to PUT on method calls in tika-server - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:15:04 UTC, 1 replies.
- [jira] [Updated] (TIKA-1108) Represent individual slides in pptx - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:15:04 UTC, 1 replies.
- [jira] [Updated] (TIKA-774) ExifTool Parser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:15:05 UTC, 1 replies.
- [jira] [Updated] (TIKA-770) New ODF metadata keys - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:15:05 UTC, 1 replies.
- [jira] [Updated] (TIKA-539) Encoding detection is too biased by encoding in meta tag - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:15:05 UTC, 1 replies.
- [jira] [Updated] (TIKA-715) Some parsers produce non-well-formed XHTML SAX events - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/21 22:15:05 UTC, 1 replies.
- [jira] [Created] (TIKA-3491) Upgrade version for TPS: commons-compress to 1.21 in tika-bundle - posted by "Shubhangi Raut (Jira)" <ji...@apache.org> on 2021/07/22 07:34:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3492) Upgrade version for TPS: rome to 1.16.0 in tika-bundle - posted by "Shubhangi Raut (Jira)" <ji...@apache.org> on 2021/07/22 07:41:00 UTC, 0 replies.
- Access to Tika Wiki - posted by David Pilato <da...@pilato.fr> on 2021/07/22 08:26:10 UTC, 1 replies.
- [jira] [Commented] (TIKA-3348) Improve the workflow for extracting and returning images from PDFs and other containers using Tika Server.. - posted by "Simon Lucy (Jira)" <ji...@apache.org> on 2021/07/22 11:50:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3493) dcterms:created date depends on the current TimeZone in RTF documents - posted by "David Pilato (Jira)" <ji...@apache.org> on 2021/07/22 11:50:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3493) dcterms:created date depends on the current TimeZone in RTF documents - posted by "David Pilato (Jira)" <ji...@apache.org> on 2021/07/22 11:51:00 UTC, 2 replies.
- [jira] [Commented] (TIKA-3493) dcterms:created date depends on the current TimeZone in RTF documents - posted by "David Pilato (Jira)" <ji...@apache.org> on 2021/07/22 11:59:00 UTC, 2 replies.
- [jira] [Comment Edited] (TIKA-3493) dcterms:created date depends on the current TimeZone in RTF documents - posted by "David Pilato (Jira)" <ji...@apache.org> on 2021/07/22 12:00:00 UTC, 0 replies.
- JIRA...sorry - posted by Tim Allison <ta...@apache.org> on 2021/07/22 12:21:57 UTC, 3 replies.
- [jira] [Created] (TIKA-3494) Allow legacy combined doc extract in pipes module - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/22 12:29:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3494) Allow legacy combined doc extract in pipes module - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/22 12:30:00 UTC, 2 replies.
- [jira] [Assigned] (TIKA-3493) dcterms:created date depends on the current TimeZone in RTF documents - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/22 12:33:00 UTC, 0 replies.
- TesseractOCRConfig.setTesseractPath moved to TesseractOCRParser - posted by David Pilato <da...@pilato.fr> on 2021/07/22 13:51:54 UTC, 1 replies.
- [jira] [Commented] (TIKA-3490) Fix serialization in opensearch emitter for embedded documents - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/22 15:59:00 UTC, 1 replies.
- [jira] [Comment Edited] (TIKA-3490) Fix serialization in opensearch emitter for embedded documents - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/22 18:39:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3490) Fix serialization in opensearch emitter for embedded documents - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/22 20:28:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3494) Allow legacy combined doc extract in pipes module - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/22 20:29:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3494) Allow legacy combined doc extract in pipes module - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/22 21:55:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3495) parent-child in solr emitter doesn't seem to include parent id (_nest_parent_) - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/23 15:29:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3495) parent-child in solr emitter doesn't seem to include parent id (_nest_parent_) - posted by "David Eric Pugh (Jira)" <ji...@apache.org> on 2021/07/23 15:38:00 UTC, 17 replies.
- [jira] [Comment Edited] (TIKA-3495) parent-child in solr emitter doesn't seem to include parent id (_nest_parent_) - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/23 15:43:00 UTC, 7 replies.
- [jira] [Updated] (TIKA-3495) parent-child in solr emitter doesn't seem to include parent id (_nest_parent_) - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/23 15:46:00 UTC, 5 replies.
- [jira] [Created] (TIKA-3496) Dates should have a timezone? - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/23 17:27:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3496) Dates should have a timezone? - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/23 17:36:00 UTC, 3 replies.
- [jira] [Resolved] (TIKA-3496) Dates should have a timezone? - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/23 21:23:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3496) Allow users to specify a default timezone when a file format doesn't store the tz - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/23 21:25:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3496) Allow users to specify a default timezone when a file format doesn't store the tz - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/23 23:05:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3497) Update README for installing Tika Server as a service for 2.0 release - posted by "David Eric Pugh (Jira)" <ji...@apache.org> on 2021/07/24 16:55:00 UTC, 0 replies.
- [GitHub] [tika] epugh opened a new pull request #448: TIKA-3497 bump versions in example commands for installing as a service. - posted by GitBox <gi...@apache.org> on 2021/07/24 16:55:34 UTC, 0 replies.
- [jira] [Commented] (TIKA-3497) Update README for installing Tika Server as a service for 2.0 release - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/07/24 16:56:00 UTC, 1 replies.
- [GitHub] [tika] nddipiazza opened a new pull request #449: TIKA-3495 - verify embedded docs work - posted by GitBox <gi...@apache.org> on 2021/07/24 23:51:51 UTC, 0 replies.
- [GitHub] [tika] dadoonet closed pull request #269: Update soap-api to 1.4.0 and stax-ex to 1.8.1 - posted by GitBox <gi...@apache.org> on 2021/07/26 14:25:22 UTC, 0 replies.
- [GitHub] [tika] dadoonet commented on pull request #269: Update soap-api to 1.4.0 and stax-ex to 1.8.1 - posted by GitBox <gi...@apache.org> on 2021/07/26 14:25:22 UTC, 0 replies.
- [jira] [Created] (TIKA-3498) Fix dependency convergence and other import issues in scientific vs standard package - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/26 14:47:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3498) Fix dependency convergence and other import issues in scientific vs standard package - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/26 15:14:00 UTC, 1 replies.
- [GitHub] [tika] tballison merged pull request #448: TIKA-3497 bump versions in example commands for installing as a service. - posted by GitBox <gi...@apache.org> on 2021/07/26 15:18:39 UTC, 0 replies.
- [GitHub] [tika] tballison merged pull request #449: TIKA-3495 - verify embedded docs work - posted by GitBox <gi...@apache.org> on 2021/07/26 15:23:50 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3492) Upgrade version for TPS: rome to 1.16.0 in tika-bundle - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/26 15:42:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3498) Fix dependency convergence and other import issues in scientific vs standard package - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/26 15:45:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3492) Upgrade version for TPS: rome to 1.16.0 in tika-bundle - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/26 16:31:00 UTC, 1 replies.
- 2.0.1? - posted by Tim Allison <ta...@apache.org> on 2021/07/26 21:02:44 UTC, 4 replies.
- [DISCUSS] Support Elasticsearch in the tika-pipes module? - posted by Tim Allison <ta...@apache.org> on 2021/07/26 21:08:18 UTC, 2 replies.
- [jira] [Created] (TIKA-3499) [junit5] Prepare migration - posted by "Aurélien Marocco (Jira)" <ji...@apache.org> on 2021/07/27 07:10:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3499) [junit5] Prepare migration - posted by "Aurélien Marocco (Jira)" <ji...@apache.org> on 2021/07/27 08:59:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3499) [junit5] Prepare migration - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/27 09:41:00 UTC, 4 replies.
- [jira] [Commented] (TIKA-3488) Security issue XXE in TIKA due to JDOM - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/27 13:11:00 UTC, 1 replies.
- [jira] [Comment Edited] (TIKA-3488) Security issue XXE in TIKA due to JDOM - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/27 13:17:00 UTC, 1 replies.
- [jira] [Created] (TIKA-3500) junit5 for tika-core - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/27 13:44:00 UTC, 0 replies.
- [jira] [Assigned] (TIKA-3500) junit5 for tika-core - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/27 14:27:00 UTC, 0 replies.
- [jira] [Assigned] (TIKA-3501) junit5 for tika-parsers module - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/27 20:14:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3501) junit5 for tika-parsers module - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/27 20:14:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3500) junit5 for tika-core - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/27 21:45:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3501) junit5 for tika-parsers module - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/27 21:46:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3501) junit5 for tika-parsers module - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/27 22:59:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3500) junit5 for tika-core - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/27 22:59:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3502) General upgrades for 2.0.1 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/28 13:36:00 UTC, 0 replies.
- surefire and system.exit - posted by Tim Allison <ta...@apache.org> on 2021/07/28 14:18:34 UTC, 4 replies.
- [jira] [Created] (TIKA-3503) Figure out how to upgrade dl4j - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/28 14:44:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3502) General upgrades for 2.0.1 - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/28 15:41:00 UTC, 2 replies.
- [jira] [Created] (TIKA-3504) Convert org.testcontainers in OpenSearch and Solr to junit5 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/28 16:15:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1436) improvement to PDFParser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/28 16:28:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3505) Move maxForEmitBatchBytes into PipesConfigBase - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/28 19:56:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3505) Move maxForEmitBatchBytes into PipesConfigBase - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/28 20:05:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3505) Move maxForEmitBatchBytes into PipesConfigBase - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/28 21:59:00 UTC, 1 replies.
- [jira] [Created] (TIKA-3506) please fix multiple CVEs in commons-compress for tika-parsers 1.x too - posted by "Stefan Seide (Jira)" <ji...@apache.org> on 2021/07/29 10:09:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3507) Add an optional reporter to the AsyncProcessor - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/29 15:44:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3506) please fix multiple CVEs in commons-compress for tika-parsers 1.x too - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/29 16:44:00 UTC, 1 replies.
- [jira] [Comment Edited] (TIKA-3506) please fix multiple CVEs in commons-compress for tika-parsers 1.x too - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/29 16:54:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3508) Trivial cleanups in opensearch and Solr emitters - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/29 18:03:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3508) Trivial cleanups in opensearch and Solr emitters - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/29 18:49:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3507) Add an optional reporter to the AsyncProcessor - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/29 18:55:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3508) Trivial cleanups in opensearch and Solr emitters - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/07/29 20:02:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3495) parent-child in solr emitter doesn't seem to include parent id (_nest_parent_) - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/29 20:28:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3508) Trivial cleanups in opensearch and Solr emitters - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/29 21:08:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3509) tika-parser-microsoft-module trigger a dependency on log4j-core and log4j-slf4j-impl - posted by "Thomas Mortagne (Jira)" <ji...@apache.org> on 2021/07/30 16:02:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3509) tika-parser-microsoft-module trigger a dependency on log4j-core and log4j-slf4j-impl - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/07/30 17:58:00 UTC, 0 replies.