You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] [Created] (TIKA-2682) Upgrade jempbox to 1.8.15 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/02 14:45:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2682) Upgrade jempbox to 1.8.15 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/02 15:29:00 UTC, 0 replies.
- [jira] [Assigned] (TIKA-2680) Email attachments to an email are not extracted - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/02 15:31:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2672) Upgrade dl4j to 1.0.0-beta - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/02 16:07:00 UTC, 24 replies.
- [jira] [Assigned] (TIKA-2669) Tika JAX-RS PDF parser option / custom config issue - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/02 16:17:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2682) Upgrade jempbox to 1.8.15 - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/07/02 16:19:00 UTC, 2 replies.
- [jira] [Created] (TIKA-2683) Missing space and inappropriate new-line in Boilerpipe extracted text - posted by "Karanjeet Singh (JIRA)" <ji...@apache.org> on 2018/07/02 17:51:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2669) Tika JAX-RS PDF parser option / custom config issue - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/02 19:22:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2669) Tika JAX-RS PDF parser option / custom config issue - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/07/02 20:01:00 UTC, 2 replies.
- [jira] [Assigned] (TIKA-2675) OpenDocumentParser should fail on invalid zip files - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/03 11:53:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-874) Identify FITS (Flexible Image Transport System) files - posted by "Susan (JIRA)" <ji...@apache.org> on 2018/07/03 14:38:00 UTC, 3 replies.
- [jira] [Created] (TIKA-2684) Tika does not extract *.fits header text, just file level metadata - posted by "Susan (JIRA)" <ji...@apache.org> on 2018/07/03 17:55:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2684) Tika does not extract *.fits header text, just file level metadata - posted by "Susan (JIRA)" <ji...@apache.org> on 2018/07/03 19:01:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-874) Identify FITS (Flexible Image Transport System) files - posted by "Susan (JIRA)" <ji...@apache.org> on 2018/07/03 19:09:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2684) Tika does not extract *.fits header text, just file level metadata - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/03 20:09:00 UTC, 7 replies.
- [jira] [Created] (TIKA-2685) Email attached to an undeliverable email report are not extracted - posted by "Yury Kats (JIRA)" <ji...@apache.org> on 2018/07/05 14:31:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2685) Email attached to an undeliverable email report are not extracted - posted by "Yury Kats (JIRA)" <ji...@apache.org> on 2018/07/05 14:35:00 UTC, 7 replies.
- [jira] [Updated] (TIKA-2685) Email attached to an undeliverable email report are not extracted - posted by "Yury Kats (JIRA)" <ji...@apache.org> on 2018/07/05 14:37:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-2684) Tika does not extract *.fits header text, just file level metadata - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2018/07/05 15:41:00 UTC, 2 replies.
- [jira] [Resolved] (TIKA-2684) Tika does not extract *.fits header text, just file level metadata - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/06 12:49:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2675) OpenDocumentParser should fail on invalid zip files - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/07/06 12:58:00 UTC, 3 replies.
- [jira] [Resolved] (TIKA-2675) OpenDocumentParser should fail on invalid zip files - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/06 13:24:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2673) HtmlEncodingDetector doesn't follow the specification - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/06 15:27:00 UTC, 12 replies.
- [jira] [Comment Edited] (TIKA-2672) Upgrade dl4j to 1.0.0-beta - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/06 15:31:00 UTC, 0 replies.
- Tika 1.19? - posted by Tim Allison <ta...@apache.org> on 2018/07/06 15:40:07 UTC, 1 replies.
- [jira] [Assigned] (TIKA-2685) Email attached to an undeliverable email report are not extracted - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/06 16:45:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-2685) Email attached to an undeliverable email report are not extracted - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/06 20:01:00 UTC, 2 replies.
- [jira] [Commented] (TIKA-2680) Email attachments to an email are not extracted - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/06 20:48:00 UTC, 3 replies.
- [jira] [Comment Edited] (TIKA-2680) Email attachments to an email are not extracted - posted by "Yury Kats (JIRA)" <ji...@apache.org> on 2018/07/06 21:08:00 UTC, 0 replies.
- image recognition...how do the parts play together? - posted by Tim Allison <ta...@apache.org> on 2018/07/06 21:39:00 UTC, 3 replies.
- [jira] [Updated] (TIKA-2680) Email attachments to an email are not extracted - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/06 22:10:00 UTC, 2 replies.
- [jira] [Commented] (TIKA-2648) mime detection based on resource name detects resources as "text/x-php" instead of "text/html" - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2018/07/08 19:53:00 UTC, 2 replies.
- Register now for ApacheCon and save $250 - posted by Rich Bowen <rb...@apache.org> on 2018/07/09 14:31:16 UTC, 0 replies.
- [jira] [Created] (TIKA-2686) pdfbox fontbox 2.0.8 has security vulnerability CVE-2018-8036 and should be upgraded to 2.0.11 - posted by "Abhijit Rajwade (JIRA)" <ji...@apache.org> on 2018/07/10 14:52:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2686) pdfbox fontbox 2.0.8 has security vulnerability CVE-2018-8036 and should be upgraded to 2.0.11 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/11 16:47:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-2673) HtmlEncodingDetector doesn't follow the specification - posted by "Gerard Bouchar (JIRA)" <ji...@apache.org> on 2018/07/13 09:29:00 UTC, 4 replies.
- [jira] [Created] (TIKA-2687) Avoid potential to overwrite attachments - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/13 20:15:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2687) Avoid potential to overwrite attachments - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/13 21:25:00 UTC, 0 replies.
- tika-2.x-windows - Build # 284 - Failure - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/07/13 21:40:31 UTC, 0 replies.
- [jira] [Commented] (TIKA-2683) Missing space and inappropriate new-line in Boilerpipe extracted text - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/07/16 19:29:00 UTC, 17 replies.
- [jira] [Commented] (TIKA-1982) Add language (and possibly other fields) to /rmeta endpoint - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2018/07/17 05:17:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2578) Mails not recognized when unknown X-headers are present - posted by "Yury Kats (JIRA)" <ji...@apache.org> on 2018/07/17 22:01:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2688) MBOX not recognized when unknown X-headers are present - posted by "Yury Kats (JIRA)" <ji...@apache.org> on 2018/07/17 22:09:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2688) MBOX not recognized when unknown X-headers are present - posted by "Yury Kats (JIRA)" <ji...@apache.org> on 2018/07/17 22:10:00 UTC, 1 replies.
- [jira] [Created] (TIKA-2689) *.ai type (Adobe illustrator ) files are not detected correctly. - posted by "Amit Pandey (JIRA)" <ji...@apache.org> on 2018/07/18 12:46:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2689) *.ai type (Adobe illustrator ) files are not detected correctly. - posted by "Amit Pandey (JIRA)" <ji...@apache.org> on 2018/07/18 12:53:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2690) Exclude commons-logging & commons-logging-api from uimafit-core - posted by "Hans Brende (JIRA)" <ji...@apache.org> on 2018/07/18 13:32:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2690) Exclude commons-logging & commons-logging-api from uimafit-core - posted by "Hans Brende (JIRA)" <ji...@apache.org> on 2018/07/18 13:37:00 UTC, 0 replies.
- [jira] [Assigned] (TIKA-2683) Missing space and inappropriate new-line in Boilerpipe extracted text - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2018/07/18 13:58:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2683) Missing space and inappropriate new-line in Boilerpipe extracted text - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2018/07/18 14:02:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2690) Exclude commons-logging & commons-logging-api from uimafit-core - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/07/18 14:25:00 UTC, 4 replies.
- [jira] [Commented] (TIKA-2688) MBOX not recognized when unknown X-headers are present - posted by "Yury Kats (JIRA)" <ji...@apache.org> on 2018/07/18 22:27:00 UTC, 3 replies.
- [jira] [Created] (TIKA-2691) Can't create an RPM - posted by "Celpan Valeria (JIRA)" <ji...@apache.org> on 2018/07/19 13:06:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2691) Can't create an RPM - posted by "Celpan Valeria (JIRA)" <ji...@apache.org> on 2018/07/20 06:54:00 UTC, 0 replies.
- [jira] [Assigned] (TIKA-2691) Can't create an RPM - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/24 21:46:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2691) Can't create an RPM - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/25 18:30:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2690) Exclude commons-logging & commons-logging-api from uimafit-core - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/25 18:37:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2689) *.ai type (Adobe illustrator ) files are not detected correctly. - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/25 18:57:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2688) MBOX not recognized when unknown X-headers are present - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/25 19:12:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2691) Can't create an RPM - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/07/25 19:45:00 UTC, 2 replies.
- tika-2.x-windows - Build # 287 - Failure - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/07/25 19:46:32 UTC, 0 replies.
- Tika dependencies audit - posted by Maxim Solodovnik <so...@gmail.com> on 2018/07/26 07:11:53 UTC, 1 replies.
- [jira] [Updated] (TIKA-2691) Can't create a RPM - posted by "Celpan Valeria (JIRA)" <ji...@apache.org> on 2018/07/26 07:46:00 UTC, 0 replies.
- improving Tika for web contents - posted by gbouchar <gb...@protonmail.com.INVALID> on 2018/07/26 09:38:14 UTC, 2 replies.
- [jira] [Created] (TIKA-2692) Blanket upgrades in prep for 1.19 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/26 12:14:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2693) Tika 1.17 uses the wrong classloader for reflection - posted by "Karl Wright (JIRA)" <ji...@apache.org> on 2018/07/26 13:07:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2694) "From" headers is not always extracted correctly on msg mails - posted by "Celpan Valeria (JIRA)" <ji...@apache.org> on 2018/07/26 14:16:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2694) "From" headers is not always extracted correctly on msg mails - posted by "Celpan Valeria (JIRA)" <ji...@apache.org> on 2018/07/26 14:17:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2695) Upgrade Lucene in tika-eval and tika-example - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/26 14:25:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2368) Clean up SentimentParser dependencies - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/26 14:34:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2692) Blanket upgrades in prep for 1.19 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/26 14:37:00 UTC, 8 replies.
- [jira] [Commented] (TIKA-2694) "From" headers is not always extracted correctly on msg mails - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/26 14:57:00 UTC, 3 replies.
- [jira] [Comment Edited] (TIKA-2694) "From" headers is not always extracted correctly on msg mails - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/26 15:03:00 UTC, 1 replies.
- tika-2.x-windows - Build # 288 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/07/26 15:31:33 UTC, 0 replies.
- tika-2.x-windows - Build # 289 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/07/26 17:26:05 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-2692) Blanket upgrades in prep for 1.19 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/26 18:51:00 UTC, 0 replies.
- tika-2.x-windows - Build # 290 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/07/26 20:18:11 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2692) Blanket upgrades in prep for 1.19 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/26 20:35:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2693) Tika 1.17 uses the wrong classloader for reflection - posted by "Andreas Beeker (JIRA)" <ji...@apache.org> on 2018/07/26 22:26:00 UTC, 3 replies.
- [jira] [Created] (TIKA-2696) Support output of Tesseract OSD output for psm mode 0 - posted by "August Valera (JIRA)" <ji...@apache.org> on 2018/07/26 23:52:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2697) exception during xlsx parsing - posted by "Peter Farkas (JIRA)" <ji...@apache.org> on 2018/07/27 06:50:01 UTC, 0 replies.
- [jira] [Commented] (TIKA-2697) exception during xlsx parsing - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2018/07/27 11:38:00 UTC, 4 replies.
- [jira] [Updated] (TIKA-2697) exception during xlsx parsing - posted by "Peter Farkas (JIRA)" <ji...@apache.org> on 2018/07/27 11:45:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2462) Add a parser for sas7bdat - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/27 14:18:00 UTC, 1 replies.
- [jira] [Commented] (TIKA-2629) Add image/x-dpx media-type detection - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/27 14:19:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2479) Handle empty cells in tables uniformly - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/27 14:19:00 UTC, 1 replies.
- [jira] [Commented] (TIKA-1288) Epub's content extracted partially - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/27 14:19:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2641) Unit test for consistency between tabular/columnar formats - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/07/27 15:08:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2628) Add image/aces media-type detection - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/07/27 15:08:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2698) Issue in Crawling using FSCRawler 2.4 - posted by "Neel Gagan (JIRA)" <ji...@apache.org> on 2018/07/30 11:52:00 UTC, 0 replies.
- [jira] [Closed] (TIKA-2697) exception during xlsx parsing - posted by "Peter Farkas (JIRA)" <ji...@apache.org> on 2018/07/31 09:51:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2699) Security: Sonatype Nexus scan is reporting multiple vulnearbilities on the bouncy castle version used by Apache Tika - posted by "Abhijit Rajwade (JIRA)" <ji...@apache.org> on 2018/07/31 10:22:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2699) Security: Sonatype Nexus scan is reporting multiple vulnearbilities on the bouncy castle version used by Apache Tika - posted by "Abhijit Rajwade (JIRA)" <ji...@apache.org> on 2018/07/31 10:24:00 UTC, 5 replies.
- [jira] [Comment Edited] (TIKA-2699) Security: Sonatype Nexus scan is reporting multiple vulnearbilities on the bouncy castle version used by Apache Tika - posted by "Abhijit Rajwade (JIRA)" <ji...@apache.org> on 2018/07/31 10:27:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2699) Security: Sonatype Nexus scan is reporting multiple vulnearbilities on the bouncy castle version used by Apache Tika - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/31 10:42:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-2693) Tika 1.17 uses the wrong classloader for reflection - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/31 11:17:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2700) The HTML parser should parse the contents of the title tag as raw text, not HTML - posted by "Gerard Bouchar (JIRA)" <ji...@apache.org> on 2018/07/31 13:22:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2700) The HTML parser should parse the contents of the title tag as raw text, not HTML - posted by "Gerard Bouchar (JIRA)" <ji...@apache.org> on 2018/07/31 15:41:00 UTC, 1 replies.
- [jira] [Comment Edited] (TIKA-2700) The HTML parser should parse the contents of the title tag as raw text, not HTML - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/07/31 17:40:00 UTC, 0 replies.