You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] [Commented] (TIKA-3303) Broken link to Getting Started page on https://tika.apache.org/ - posted by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2021/04/01 18:23:00 UTC, 0 replies.
- OpenJDK 17 Early Access build 16 is now available - posted by Rory O'Donnell <ro...@oracle.com> on 2021/04/02 07:34:19 UTC, 0 replies.
- [jira] [Created] (TIKA-3344) @POST methods does not accept same X-Tika headers as their @PUT counterpart - posted by "Subhajit Das (Jira)" <ji...@apache.org> on 2021/04/02 14:19:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3345) Tika Server prints wrong error message for invalid X-Tika header - posted by "Subhajit Das (Jira)" <ji...@apache.org> on 2021/04/02 14:23:00 UTC, 0 replies.
- [GitHub] [tika] Subhajitdas298 opened a new pull request #422: [TIKA-3344] [TIKA-3345] - posted by GitBox <gi...@apache.org> on 2021/04/02 14:49:21 UTC, 0 replies.
- [jira] [Commented] (TIKA-3344) @POST methods does not accept same X-Tika headers as their @PUT counterpart - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/04/02 14:50:00 UTC, 12 replies.
- [jira] [Commented] (TIKA-3345) Tika Server prints wrong error message for invalid X-Tika header - posted by "Subhajit Das (Jira)" <ji...@apache.org> on 2021/04/02 15:03:00 UTC, 4 replies.
- [GitHub] [tika] thammegowda commented on pull request #419: fix for TIKA-3329 contributed by Thamme Gowda - posted by GitBox <gi...@apache.org> on 2021/04/03 08:54:14 UTC, 0 replies.
- [jira] [Commented] (TIKA-3329) RTG Translator with many-to-eng translation - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/04/03 08:55:00 UTC, 5 replies.
- [GitHub] [tika] nddipiazza commented on pull request #412: TIKA-3317 - add a solr fetch iterator to tika pipes - posted by GitBox <gi...@apache.org> on 2021/04/03 18:09:29 UTC, 0 replies.
- [jira] [Commented] (TIKA-3317) Tika Pipes - add a solr fetch iterator - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/04/03 18:10:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 - posted by "Shmuel Krakower (Jira)" <ji...@apache.org> on 2021/04/04 09:36:00 UTC, 6 replies.
- [jira] [Created] (TIKA-3346) Parsers should only appear once in the "parsed by" metadata value - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/05 15:59:00 UTC, 0 replies.
- [GitHub] [tika] tballison merged pull request #422: [TIKA-3344] [TIKA-3345] - posted by GitBox <gi...@apache.org> on 2021/04/05 19:36:23 UTC, 0 replies.
- [jira] [Commented] (TIKA-3340) LanguageProfile for Myanmar - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/05 19:44:00 UTC, 9 replies.
- [jira] [Created] (TIKA-3347) Upgrade to PDFBox 3.x when available - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/06 01:00:07 UTC, 0 replies.
- [jira] [Commented] (TIKA-3347) Upgrade to PDFBox 3.x when available - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/06 01:03:00 UTC, 8 replies.
- [GitHub] [tika] PeterAlfredLee opened a new pull request #423: Add back dependency jaxb-runtime for tika-parser-pdf-module - posted by GitBox <gi...@apache.org> on 2021/04/06 07:16:54 UTC, 0 replies.
- [jira] [Updated] (TIKA-3340) LanguageProfile for Myanmar - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/06 13:14:00 UTC, 3 replies.
- [jira] [Created] (TIKA-3348) Improve the workflow for extracting and returning images from PDFs and other containers using Tika Server.. - posted by "Simon Lucy (Jira)" <ji...@apache.org> on 2021/04/06 16:52:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2917) Extract metadata from inline images in PDFs - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/06 19:05:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-3347) Upgrade to PDFBox 3.x when available - posted by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2021/04/06 19:16:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3348) Improve the workflow for extracting and returning images from PDFs and other containers using Tika Server.. - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/06 19:25:00 UTC, 3 replies.
- [jira] [Comment Edited] (TIKA-3348) Improve the workflow for extracting and returning images from PDFs and other containers using Tika Server.. - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/06 19:25:00 UTC, 1 replies.
- [GitHub] [tika] PeterAlfredLee merged pull request #423: Add back dependency jaxb-runtime for tika-parser-pdf-module - posted by GitBox <gi...@apache.org> on 2021/04/07 12:32:41 UTC, 0 replies.
- [GitHub] [tika] Subhajitdas298 opened a new pull request #424: [TIKA-3344] [TIKA-3345] main - posted by GitBox <gi...@apache.org> on 2021/04/07 19:04:28 UTC, 0 replies.
- [GitHub] [tika] Subhajitdas298 commented on pull request #424: [TIKA-3344] [TIKA-3345] main - posted by GitBox <gi...@apache.org> on 2021/04/07 22:03:03 UTC, 1 replies.
- [jira] [Created] (TIKA-3349) Dark themes in Ubuntu cause Tika app GUI to render white text on white background - posted by "Ross Spencer (Jira)" <ji...@apache.org> on 2021/04/08 11:07:00 UTC, 0 replies.
- [VOTE] Accept tika-helm source code into the Apache Tika project - posted by Lewis John McGibbney <le...@apache.org> on 2021/04/09 03:10:28 UTC, 2 replies.
- [jira] [Created] (TIKA-3350) Tika's PDFParser should use the underlying file via TikaInputStream if it exists - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/09 13:08:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3350) Tika's PDFParser should use the underlying file via TikaInputStream if it exists - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/04/09 14:41:00 UTC, 1 replies.
- [GitHub] [tika] tballison merged pull request #424: [TIKA-3344] [TIKA-3345] main - posted by GitBox <gi...@apache.org> on 2021/04/09 16:17:03 UTC, 0 replies.
- [jira] [Updated] (TIKA-3343) Move Tika's legacy lang id to its own module - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/09 17:33:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3343) Move Tika's legacy lang id to its own module - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/09 17:35:00 UTC, 1 replies.
- [jira] [Updated] (TIKA-3343) Move Tika's legacy lang id to its own submodule for Tika 2.0 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/09 17:37:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3343) Move Tika's legacy lang id to its own submodule for Tika 2.0 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/09 17:37:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-3340) LanguageProfile for Myanmar - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/09 19:13:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3343) Move Tika's legacy lang id to its own submodule for Tika 2.0 - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/04/09 19:13:00 UTC, 1 replies.
- [jira] [Commented] (TIKA-3290) Extension reading it as eml instead of txt - posted by "Vamsi Molli (Jira)" <ji...@apache.org> on 2021/04/11 15:28:00 UTC, 0 replies.
- [GitHub] [tika-docker] Subhajitdas298 opened a new pull request #3: Update README.md - posted by GitBox <gi...@apache.org> on 2021/04/12 11:57:49 UTC, 0 replies.
- RE: TikaServer-UsingprebuiltDockerimage is incorrect - posted by Subhajit Das <su...@live.com> on 2021/04/12 12:16:59 UTC, 0 replies.
- Re: Prometheus exporter for TikaServer - posted by Tim Allison <ta...@apache.org> on 2021/04/12 13:00:29 UTC, 1 replies.
- [jira] [Created] (TIKA-3351) Make list of parsers in metadata unique - posted by "Peter Kronenberg (Jira)" <ji...@apache.org> on 2021/04/12 16:28:00 UTC, 0 replies.
- [GitHub] [tika] peterkronenberg opened a new pull request #425: TIKA-3351 Don't allow dups in Parsed-By metadata - posted by GitBox <gi...@apache.org> on 2021/04/12 16:47:17 UTC, 0 replies.
- [jira] [Commented] (TIKA-3351) Make list of parsers in metadata unique - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/04/12 16:48:00 UTC, 6 replies.
- [GitHub] [tika-docker] lewismc commented on pull request #3: Update README.md - posted by GitBox <gi...@apache.org> on 2021/04/13 02:38:24 UTC, 0 replies.
- [RESULT] WAS Re: [VOTE] Accept tika-helm source code into the Apache Tika project - posted by Lewis John McGibbney <le...@apache.org> on 2021/04/13 02:43:57 UTC, 0 replies.
- [IP CLEARANCE] Apache Tika - tika-helm donation - posted by lewis john mcgibbney <le...@apache.org> on 2021/04/13 03:03:16 UTC, 0 replies.
- [GitHub] [tika] kieraCurtis commented on a change in pull request #419: fix for TIKA-3329 contributed by Thamme Gowda - posted by GitBox <gi...@apache.org> on 2021/04/13 03:14:03 UTC, 2 replies.
- [GitHub] [tika] thammegowda commented on a change in pull request #419: fix for TIKA-3329 contributed by Thamme Gowda - posted by GitBox <gi...@apache.org> on 2021/04/13 03:38:14 UTC, 0 replies.
- [GitHub] [tika-docker] dameikle merged pull request #3: Update README.md - posted by GitBox <gi...@apache.org> on 2021/04/13 08:51:31 UTC, 0 replies.
- [GitHub] [tika-docker] dameikle commented on pull request #3: Update README.md - posted by GitBox <gi...@apache.org> on 2021/04/13 08:52:23 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3340) LanguageProfile for Myanmar - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/13 16:09:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3352) Add a handler for json output from the /tika endpoint - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/13 18:58:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3352) Add a handler for json output from the /tika endpoint - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/13 18:59:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3345) Tika Server prints wrong error message for invalid X-Tika header - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/14 11:55:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3352) Add a handler for json output from the /tika endpoint - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/04/14 13:22:00 UTC, 1 replies.
- [jira] [Resolved] (TIKA-3352) Add a handler for json output from the /tika endpoint - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/14 13:55:00 UTC, 0 replies.
- [jira] [Reopened] (TIKA-3327) Simple server metrics monitoring - posted by "Subhajit Das (Jira)" <ji...@apache.org> on 2021/04/14 15:37:00 UTC, 1 replies.
- [jira] [Resolved] (TIKA-3327) Simple server metrics monitoring - posted by "Subhajit Das (Jira)" <ji...@apache.org> on 2021/04/14 15:38:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3344) @POST methods does not accept same X-Tika headers as their @PUT counterpart - posted by "Subhajit Das (Jira)" <ji...@apache.org> on 2021/04/14 15:38:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3353) Tika Server Production ready monitoring (Prometheus and JMX) - posted by "Subhajit Das (Jira)" <ji...@apache.org> on 2021/04/14 15:55:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3353) Tika Server Production ready monitoring (Prometheus and JMX) - posted by "Subhajit Das (Jira)" <ji...@apache.org> on 2021/04/14 15:56:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3345) Tika Server prints wrong error message for invalid X-Tika header - posted by "Subhajit Das (Jira)" <ji...@apache.org> on 2021/04/14 15:56:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3344) @POST methods does not accept same X-Tika headers as their @PUT counterpart - posted by "Subhajit Das (Jira)" <ji...@apache.org> on 2021/04/14 15:56:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3354) [tika-parsers] Wrong commons-io version imported - posted by "Arnaud MERGEY (Jira)" <ji...@apache.org> on 2021/04/14 16:02:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3354) [tika-parsers] Wrong commons-io version imported - posted by "Arnaud MERGEY (Jira)" <ji...@apache.org> on 2021/04/14 16:03:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3353) Tika Server Production ready monitoring (Prometheus and JMX) - posted by "Subhajit Das (Jira)" <ji...@apache.org> on 2021/04/14 17:45:00 UTC, 6 replies.
- [jira] [Commented] (TIKA-3354) [tika-parsers] Wrong commons-io version imported - posted by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2021/04/14 19:05:00 UTC, 1 replies.
- [jira] [Created] (TIKA-3355) Integrate fakeload into MockParser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/14 20:15:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3355) Integrate fakeload into MockParser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/14 20:29:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3355) Integrate fakeload into MockParser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/14 20:30:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3355) Integrate fakeload into MockParser - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/04/14 21:42:00 UTC, 3 replies.
- [jira] [Comment Edited] (TIKA-3354) [tika-parsers] Wrong commons-io version imported - posted by "Arnaud MERGEY (Jira)" <ji...@apache.org> on 2021/04/15 07:19:00 UTC, 0 replies.
- Test failure - posted by Peter Kronenberg <pe...@torch.ai> on 2021/04/15 14:04:02 UTC, 3 replies.
- [GitHub] [tika] peterkronenberg opened a new pull request #426: Fix up exception handling for invalid config - posted by GitBox <gi...@apache.org> on 2021/04/15 14:06:59 UTC, 0 replies.
- [GitHub] [tika] peterkronenberg commented on pull request #426: Fix up exception handling for invalid config - posted by GitBox <gi...@apache.org> on 2021/04/15 14:07:54 UTC, 0 replies.
- [jira] [Created] (TIKA-3356) Broken xhtml with extractImages in 2.x - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/15 15:50:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3356) Broken xhtml with extractImages in 2.x - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/15 15:56:00 UTC, 0 replies.
- [GitHub] [tika] tballison merged pull request #426: Fix up exception handling for invalid config - posted by GitBox <gi...@apache.org> on 2021/04/15 15:57:08 UTC, 0 replies.
- [jira] [Created] (TIKA-3357) Remove ambiguity in request handlers - posted by "Subhajit Das (Jira)" <ji...@apache.org> on 2021/04/15 18:40:00 UTC, 0 replies.
- [GitHub] [tika] Subhajitdas298 opened a new pull request #427: [TIKA-3357] removes ambiguity by choosing handler based on produce type - posted by GitBox <gi...@apache.org> on 2021/04/15 18:45:15 UTC, 0 replies.
- [jira] [Commented] (TIKA-3357) Remove ambiguity in request handlers - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/04/15 18:46:00 UTC, 3 replies.
- [jira] [Commented] (TIKA-3263) WriteLimitReachedException is not public - posted by "Aaron Weber (Jira)" <ji...@apache.org> on 2021/04/15 20:47:00 UTC, 1 replies.
- [jira] [Comment Edited] (TIKA-3263) WriteLimitReachedException is not public - posted by "Aaron Weber (Jira)" <ji...@apache.org> on 2021/04/15 20:56:00 UTC, 1 replies.
- [jira] [Created] (TIKA-3358) Extract swf from PDFs - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/16 12:18:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3359) Extract swf from PDFs - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/16 12:20:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3358) Extract swf from PDFs - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/16 13:11:00 UTC, 1 replies.
- [jira] [Reopened] (TIKA-3358) Extract swf from PDFs - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/16 14:23:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3359) Extract swf from PDFs - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/16 14:23:00 UTC, 1 replies.
- [jira] [Created] (TIKA-3360) Retrospective release of tika-helm for Tika 1.26 and 2.0.0-ALPHA - posted by "Lewis John McGibbney (Jira)" <ji...@apache.org> on 2021/04/16 15:13:00 UTC, 0 replies.
- Introducing tika-helm; a Helm chart to deploy Apache Tika on Kubernetes. - posted by lewis john mcgibbney <le...@apache.org> on 2021/04/16 15:19:02 UTC, 1 replies.
- [jira] [Commented] (TIKA-3359) Extract swf from PDFs - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/04/16 16:07:00 UTC, 5 replies.
- [GitHub] [tika] lewismc commented on pull request #419: fix for TIKA-3329 contributed by Thamme Gowda - posted by GitBox <gi...@apache.org> on 2021/04/17 02:39:37 UTC, 0 replies.
- [jira] [Commented] (TIKA-3360) Retrospective release of tika-helm for Tika 1.26 and 2.0.0-ALPHA - posted by "Lewis John McGibbney (Jira)" <ji...@apache.org> on 2021/04/17 03:49:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3361) Improve intelligence of OCRStrategy=AUTO - posted by "Peter Kronenberg (Jira)" <ji...@apache.org> on 2021/04/17 16:14:00 UTC, 0 replies.
- [GitHub] [tika] peterkronenberg opened a new pull request #428: TIKA-3361 Make ocrStrategy=Auto more intelligent - posted by GitBox <gi...@apache.org> on 2021/04/17 16:14:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-3361) Improve intelligence of OCRStrategy=AUTO - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/04/17 16:15:00 UTC, 2 replies.
- [GitHub] [tika] lfcnassif commented on pull request #364: Fix TIKA-3196 - posted by GitBox <gi...@apache.org> on 2021/04/18 01:35:27 UTC, 0 replies.
- [jira] [Commented] (TIKA-3196) PackageParser should attempt to parse entries from zip files with STORED entries with data descriptor - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/04/18 01:36:00 UTC, 5 replies.
- [GitHub] [tika] lfcnassif edited a comment on pull request #364: Fix TIKA-3196 - posted by GitBox <gi...@apache.org> on 2021/04/18 01:40:02 UTC, 1 replies.
- [jira] [Reopened] (TIKA-3196) PackageParser should attempt to parse entries from zip files with STORED entries with data descriptor - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/19 13:35:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3196) PackageParser should attempt to parse entries from zip files with STORED entries with data descriptor - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/19 13:36:00 UTC, 0 replies.
- [GitHub] [tika] tballison commented on pull request #364: Fix TIKA-3196 - posted by GitBox <gi...@apache.org> on 2021/04/19 13:36:22 UTC, 0 replies.
- [GitHub] [tika] tballison merged pull request #427: [TIKA-3357] removes ambiguity by choosing handler based on produce type - posted by GitBox <gi...@apache.org> on 2021/04/19 13:38:04 UTC, 0 replies.
- [GitHub] [tika] Subhajitdas298 opened a new pull request #429: [TIKA-3353] Prometheus and JMX monitoring over micrometer - posted by GitBox <gi...@apache.org> on 2021/04/19 16:29:17 UTC, 0 replies.
- [GitHub] [tika] Subhajitdas298 commented on pull request #429: [TIKA-3353] Prometheus and JMX monitoring over micrometer - posted by GitBox <gi...@apache.org> on 2021/04/19 16:35:24 UTC, 1 replies.
- [jira] [Commented] (TIKA-3164) Upgrade to POI 5.0.0 when available - posted by "Andreas Beeker (Jira)" <ji...@apache.org> on 2021/04/19 22:52:00 UTC, 1 replies.
- [jira] [Created] (TIKA-3362) AsyncParser and EmitterResource have handler type hardcoded to text - posted by "Giovanni De Stefano (Jira)" <ji...@apache.org> on 2021/04/20 08:13:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3362) AsyncParser and EmitterResource have handler type hardcoded to text - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/20 09:20:00 UTC, 5 replies.
- [jira] [Updated] (TIKA-3362) AsyncParser and EmitterResource have handler type hardcoded to text - posted by "Giovanni De Stefano (Jira)" <ji...@apache.org> on 2021/04/20 09:29:00 UTC, 3 replies.
- JDK 17 Early Access build 18 is available - posted by Rory O'Donnell <ro...@oracle.com> on 2021/04/20 10:04:36 UTC, 0 replies.
- [jira] [Reopened] (TIKA-3359) Extract swf from PDFs - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/20 14:28:00 UTC, 0 replies.
- Re: high level parser module names in 2.x - posted by Tim Allison <ta...@apache.org> on 2021/04/20 14:57:24 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-3361) Improve intelligence of OCRStrategy=AUTO - posted by "Peter Kronenberg (Jira)" <ji...@apache.org> on 2021/04/20 15:26:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3362) AsyncParser and EmitterResource have handler type hardcoded to text - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/20 20:35:00 UTC, 0 replies.
- [jira] [Closed] (TIKA-3362) AsyncParser and EmitterResource have handler type hardcoded to text - posted by "Giovanni De Stefano (Jira)" <ji...@apache.org> on 2021/04/21 12:10:00 UTC, 0 replies.
- Fwd: Title extraction question in Tika - posted by Nicholas DiPiazza <ni...@gmail.com> on 2021/04/21 15:45:20 UTC, 1 replies.
- [jira] [Updated] (TIKA-3360) Retrospective release of tika-helm for tika-docker 1.26 and 1.26-full - posted by "Lewis John McGibbney (Jira)" <ji...@apache.org> on 2021/04/21 16:06:00 UTC, 1 replies.
- [jira] [Commented] (TIKA-3360) Retrospective release of tika-helm for tika-docker 1.26 and 1.26-full - posted by "Lewis John McGibbney (Jira)" <ji...@apache.org> on 2021/04/21 16:08:00 UTC, 2 replies.
- [GitHub] [tika-helm] lewismc opened a new pull request #1: TIKA-3360 Retrospective release of tika-helm for tika-docker 1.26 and 1.26-full - posted by GitBox <gi...@apache.org> on 2021/04/21 16:26:33 UTC, 0 replies.
- [GitHub] [tika-helm] lewismc merged pull request #1: TIKA-3360 Retrospective release of tika-helm for tika-docker 1.26 and 1.26-full - posted by GitBox <gi...@apache.org> on 2021/04/21 16:26:55 UTC, 0 replies.
- Should the async queue be persisted? What about support for ? - posted by Giovanni De Stefano <gi...@servisoft.be> on 2021/04/21 17:10:51 UTC, 3 replies.
- [jira] [Resolved] (TIKA-3360) Retrospective release of tika-helm for tika-docker 1.26 and 1.26-full - posted by "Lewis John McGibbney (Jira)" <ji...@apache.org> on 2021/04/21 19:27:00 UTC, 0 replies.
- [GitHub] [tika] Subhajitdas298 opened a new pull request #430: [TIKA-3357] Remove ambiguity in request handlers - main - posted by GitBox <gi...@apache.org> on 2021/04/22 04:24:16 UTC, 0 replies.
- [jira] [Updated] (TIKA-3357) Remove ambiguity in request handlers - posted by "Subhajit Das (Jira)" <ji...@apache.org> on 2021/04/22 04:56:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3327) Simple server metrics monitoring - posted by "Subhajit Das (Jira)" <ji...@apache.org> on 2021/04/22 04:57:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3327) Simple server metrics monitoring (server status over JMX) - posted by "Subhajit Das (Jira)" <ji...@apache.org> on 2021/04/22 04:58:00 UTC, 0 replies.
- [GitHub] [tika] tballison merged pull request #429: [TIKA-3353] Prometheus and JMX monitoring over micrometer - posted by GitBox <gi...@apache.org> on 2021/04/22 14:43:05 UTC, 0 replies.
- [GitHub] [tika] tballison commented on pull request #429: [TIKA-3353] Prometheus and JMX monitoring over micrometer - posted by GitBox <gi...@apache.org> on 2021/04/22 14:44:29 UTC, 0 replies.
- [jira] [Commented] (TIKA-3312) Support Log4j2 jar in Tika-app.jar - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/04/22 19:47:00 UTC, 2 replies.
- [jira] [Created] (TIKA-3363) Have tika-docker artifacts start in spawn mode (configurable) - posted by "Lewis John McGibbney (Jira)" <ji...@apache.org> on 2021/04/22 20:59:00 UTC, 0 replies.
- [RELEASE] tika-helm 1.26 and 1.26-full - posted by lewis john mcgibbney <le...@apache.org> on 2021/04/22 21:04:40 UTC, 0 replies.
- [GitHub] [tika-docker] philipsoutham opened a new pull request #4: Running as non-root user - posted by GitBox <gi...@apache.org> on 2021/04/23 03:46:24 UTC, 0 replies.
- [GitHub] [tika-docker] philipsoutham commented on pull request #4: Running as non-root user - posted by GitBox <gi...@apache.org> on 2021/04/23 04:13:21 UTC, 1 replies.
- [jira] [Created] (TIKA-3364) PDF Content is extracted twice - posted by "David Pilato (Jira)" <ji...@apache.org> on 2021/04/23 10:27:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-3364) PDF Content is extracted twice - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/23 14:09:00 UTC, 8 replies.
- [jira] [Commented] (TIKA-3364) PDF Content is extracted twice - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/23 14:09:00 UTC, 7 replies.
- [jira] [Updated] (TIKA-3364) PDF Content is extracted twice - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/23 14:13:00 UTC, 1 replies.
- [jira] [Commented] (TIKA-3324) Add checkstyle checker - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/04/23 14:39:00 UTC, 0 replies.
- CFP for ApacheCon 2021 closes in ONE WEEK - posted by Rich Bowen <rb...@apache.org> on 2021/04/23 15:00:01 UTC, 0 replies.
- [jira] [Created] (TIKA-3365) RTFParser to XMLContentHandler incorrectly interprets en dash. - posted by "Gordon Allen (Jira)" <ji...@apache.org> on 2021/04/23 15:42:00 UTC, 0 replies.
- [INVITATION] Apache Tika container orchestration meetup - posted by lewis john mcgibbney <le...@apache.org> on 2021/04/23 17:45:23 UTC, 0 replies.
- [jira] [Created] (TIKA-3366) Retrospective release of tika-docker 2.0.0-ALPHA - posted by "Lewis John McGibbney (Jira)" <ji...@apache.org> on 2021/04/23 18:37:00 UTC, 0 replies.
- [jira] [Closed] (TIKA-3363) Have tika-docker artifacts start in spawn mode (configurable) - posted by "Lewis John McGibbney (Jira)" <ji...@apache.org> on 2021/04/23 18:38:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3363) Have tika-docker artifacts start in spawn mode (configurable) - posted by "Lewis John McGibbney (Jira)" <ji...@apache.org> on 2021/04/23 18:38:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3367) Add Bill of Materials (BOM) artifact - posted by "Konstantin Gribov (Jira)" <ji...@apache.org> on 2021/04/23 23:49:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3367) Add Bill of Materials (BOM) artifact - posted by "Konstantin Gribov (Jira)" <ji...@apache.org> on 2021/04/23 23:49:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3368) Add Bill of Materials (BOM) artifact (Tika 1.x) - posted by "Konstantin Gribov (Jira)" <ji...@apache.org> on 2021/04/23 23:49:00 UTC, 0 replies.
- [GitHub] [tika] grossws opened a new pull request #431: [TIKA-3367] Add Bill of Materials (BOM) - posted by GitBox <gi...@apache.org> on 2021/04/23 23:54:50 UTC, 0 replies.
- [jira] [Commented] (TIKA-3367) Add Bill of Materials (BOM) artifact - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/04/23 23:55:00 UTC, 0 replies.
- [GitHub] [tika] grossws opened a new pull request #432: [TIKA-3368] Add tika-bom module - posted by GitBox <gi...@apache.org> on 2021/04/24 00:06:17 UTC, 0 replies.
- [jira] [Commented] (TIKA-3368) Add Bill of Materials (BOM) artifact (Tika 1.x) - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/04/24 00:07:00 UTC, 0 replies.
- [RFC] Tika BOMs/platforms - posted by Konstantin Gribov <gr...@gmail.com> on 2021/04/24 00:42:58 UTC, 1 replies.
- [jira] [Created] (TIKA-3369) Flaky Tesseract OCR confirmMultiPageTiffHandling test - posted by "Konstantin Gribov (Jira)" <ji...@apache.org> on 2021/04/24 01:20:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3369) Flaky Tesseract OCR confirmMultiPageTiffHandling test - posted by "Konstantin Gribov (Jira)" <ji...@apache.org> on 2021/04/24 01:21:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3149) Tikka 1.18 not working with tess4j 3.4.8 on linux - posted by "Konstantin Gribov (Jira)" <ji...@apache.org> on 2021/04/24 01:28:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3149) Tikka 1.18 not working with tess4j 3.4.8 on linux - posted by "Konstantin Gribov (Jira)" <ji...@apache.org> on 2021/04/24 01:29:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3149) Tikka 1.18 not working with tess4j 3.4.8 on linux - posted by "Konstantin Gribov (Jira)" <ji...@apache.org> on 2021/04/24 01:29:00 UTC, 0 replies.
- [jira] [Closed] (TIKA-3149) Tikka 1.18 not working with tess4j 3.4.8 on linux - posted by "Konstantin Gribov (Jira)" <ji...@apache.org> on 2021/04/24 01:30:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3370) Refactor the AsyncProcessor in 2.x - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/24 11:32:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3371) Add "id" to FetchEmitTuple - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/24 11:39:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3371) Add "id" to FetchEmitTuple - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/24 11:39:00 UTC, 0 replies.
- Tika Server writeLimit header - posted by ju...@francelabs.com on 2021/04/26 16:39:13 UTC, 0 replies.
- Re: Tika Server writeLimit header - posted by Tim Allison <ta...@apache.org> on 2021/04/26 18:06:13 UTC, 1 replies.
- [jira] [Created] (TIKA-3372) Fix writelimit in recursiveparserhandler - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/26 19:17:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3372) Fix writelimit in recursiveparserhandler - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/26 19:18:00 UTC, 1 replies.
- [jira] [Comment Edited] (TIKA-3372) Fix writelimit in recursiveparserhandler - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/26 21:31:00 UTC, 2 replies.
- [jira] [Commented] (TIKA-3372) Fix writelimit in recursiveparserhandler - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/26 21:31:00 UTC, 10 replies.
- [jira] [Created] (TIKA-3373) add "yml" as extension - posted by "Caleb Cushing (Jira)" <ji...@apache.org> on 2021/04/27 11:50:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3373) add "yml" as extension - posted by "Caleb Cushing (Jira)" <ji...@apache.org> on 2021/04/27 12:03:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3373) add "yml" as extension - posted by "Nick Burch (Jira)" <ji...@apache.org> on 2021/04/27 12:12:00 UTC, 5 replies.
- [GitHub] [tika-helm] philipsoutham opened a new pull request #2: Locking down the Tika environment - posted by GitBox <gi...@apache.org> on 2021/04/27 15:33:15 UTC, 0 replies.
- [jira] [Created] (TIKA-3374) Non-Unicode archive entry name is garbled - posted by "Ryan Liu (Jira)" <ji...@apache.org> on 2021/04/28 02:23:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3374) Non-Unicode archive entry name is garbled - posted by "Ryan Liu (Jira)" <ji...@apache.org> on 2021/04/28 02:24:00 UTC, 0 replies.
- [GitHub] [tika] Ryan421 opened a new pull request #433: [TIKA-3374] Apply charset detection for archive entry name - posted by GitBox <gi...@apache.org> on 2021/04/28 03:07:40 UTC, 0 replies.
- [jira] [Commented] (TIKA-3374) Non-Unicode archive entry name is garbled - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/04/28 03:08:00 UTC, 13 replies.
- [jira] [Created] (TIKA-3375) Release new version - posted by "Chris Dressen (Jira)" <ji...@apache.org> on 2021/04/28 15:17:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3375) Release new version - posted by "Chris Dressen (Jira)" <ji...@apache.org> on 2021/04/28 15:30:00 UTC, 3 replies.
- [jira] [Commented] (TIKA-3370) Refactor the AsyncProcessor in 2.x - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/28 15:48:00 UTC, 1 replies.
- [jira] [Resolved] (TIKA-2787) Make WriteLimitReachedException public and not subclass of SAXException - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/28 16:18:00 UTC, 0 replies.
- Release 1.27? - posted by Tim Allison <ta...@apache.org> on 2021/04/28 16:21:55 UTC, 3 replies.
- [jira] [Commented] (TIKA-2787) Make WriteLimitReachedException public and not subclass of SAXException - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/04/28 17:49:00 UTC, 0 replies.
- [GitHub] [tika-docker] dameikle commented on pull request #4: Running as non-root user - posted by GitBox <gi...@apache.org> on 2021/04/28 18:00:27 UTC, 0 replies.
- [GitHub] [tika] tballison commented on a change in pull request #433: [TIKA-3374] Apply charset detection for archive entry name - posted by GitBox <gi...@apache.org> on 2021/04/28 20:59:24 UTC, 1 replies.
- [GitHub] [tika] Ryan421 commented on a change in pull request #433: [TIKA-3374] Apply charset detection for archive entry name - posted by GitBox <gi...@apache.org> on 2021/04/29 02:41:32 UTC, 1 replies.
- [GitHub] [tika] Ryan421 commented on pull request #433: [TIKA-3374] Apply charset detection for archive entry name - posted by GitBox <gi...@apache.org> on 2021/04/29 06:18:04 UTC, 0 replies.
- [jira] [Updated] (TIKA-3164) Upgrade to POI 5.0.0 when available - posted by "Konstantin Gribov (Jira)" <ji...@apache.org> on 2021/04/29 08:10:00 UTC, 0 replies.
- [GitHub] [tika] tballison merged pull request #433: [TIKA-3374] Apply charset detection for archive entry name - posted by GitBox <gi...@apache.org> on 2021/04/29 13:30:47 UTC, 0 replies.
- [jira] [Created] (TIKA-3376) Improve handling of write limit reached in new /tika json endpoint - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/29 17:06:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3376) Improve handling of write limit reached in new /tika json endpoint - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/04/29 18:31:00 UTC, 1 replies.
- [jira] [Created] (TIKA-3377) Remove pipes components from TikaConfig in 2.x - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/30 11:26:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3378) Move tika-langdetect-commons to tika-langdetect-test-commons in 2.x - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/30 20:56:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3378) Move tika-langdetect-commons to tika-langdetect-test-commons in 2.x - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/04/30 21:40:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3377) Remove pipes components from TikaConfig in 2.x - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/04/30 21:51:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3371) Add "id" to FetchEmitTuple - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/04/30 21:51:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3378) Move tika-langdetect-commons to tika-langdetect-test-commons in 2.x - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/04/30 22:36:00 UTC, 0 replies.