- FW: Tika DjVu? - posted by Chris Mattmann <ma...@apache.org> on 2018/08/01 06:38:05 UTC, 0 replies.
- [jira] [Commented] (TIKA-2700) The HTML parser should parse the contents of the title tag as raw text, not HTML - posted by "Gerard Bouchar (JIRA)" <ji...@apache.org> on 2018/08/01 09:42:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2701) Text is not extracted properly from WMF files - posted by "Grigoriy Alekseev (JIRA)" <ji...@apache.org> on 2018/08/01 09:56:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2701) Text is not extracted properly from WMF files - posted by "Grigoriy Alekseev (JIRA)" <ji...@apache.org> on 2018/08/01 09:57:00 UTC, 7 replies.
- [jira] [Created] (TIKA-2702) Different behavior between TIKA and pdfbox - posted by "Lior (JIRA)" <ji...@apache.org> on 2018/08/01 12:06:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2702) Different behavior between TIKA and pdfbox - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/01 12:57:00 UTC, 3 replies.
- [jira] [Updated] (TIKA-2552) Upgrade to POI 4.0.0 when available - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/01 13:01:00 UTC, 2 replies.
- [jira] [Commented] (TIKA-2552) Upgrade to POI 4.0.0 when available - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/01 13:02:00 UTC, 0 replies.
- [jira] [Assigned] (TIKA-2552) Upgrade to POI 4.0.0 when available - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/01 13:14:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2703) Error indexing a xlsx file - posted by "Mario Bisonti (JIRA)" <ji...@apache.org> on 2018/08/01 13:58:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2703) Error indexing a xlsx file - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/01 14:19:00 UTC, 16 replies.
- [jira] [Comment Edited] (TIKA-2703) Error indexing a xlsx file - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/03 14:20:00 UTC, 2 replies.
- [jira] [Resolved] (TIKA-2673) HtmlEncodingDetector doesn't follow the specification - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/03 15:38:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2648) mime detection based on resource name detects resources as "text/x-php" instead of "text/html" - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/08/03 15:41:00 UTC, 3 replies.
- [jira] [Commented] (TIKA-2673) HtmlEncodingDetector doesn't follow the specification - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/03 15:59:00 UTC, 9 replies.
- [jira] [Resolved] (TIKA-2702) Different behavior between TIKA and pdfbox - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/03 17:07:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2704) MPEGStream should throw an EOF if appropriate in skipFrame - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/03 17:48:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2704) MPEGStream should throw an EOF if appropriate in skipFrame - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/03 17:51:01 UTC, 0 replies.
- [jira] [Commented] (TIKA-2704) MPEGStream should throw an EOF if appropriate in skipFrame - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/08/03 18:43:01 UTC, 2 replies.
- [jira] [Comment Edited] (TIKA-2673) HtmlEncodingDetector doesn't follow the specification - posted by "Gerard Bouchar (JIRA)" <ji...@apache.org> on 2018/08/07 07:27:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2705) Allow configuration of TesseractOCRParser as we do for other parsers - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/07 16:22:00 UTC, 0 replies.
- tika-2.x-windows - Build # 294 - Failure - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/08/07 17:35:30 UTC, 0 replies.
- [jira] [Commented] (TIKA-2672) Upgrade dl4j to 1.0.0-beta - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/08/07 17:36:00 UTC, 3 replies.
- [jira] [Resolved] (TIKA-2705) Allow configuration of TesseractOCRParser as we do for other parsers - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/07 17:56:00 UTC, 0 replies.
- tika-2.x-windows - Build # 295 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/08/07 18:33:15 UTC, 0 replies.
- [jira] [Commented] (TIKA-2705) Allow configuration of TesseractOCRParser as we do for other parsers - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/08/07 18:34:00 UTC, 2 replies.
- Re: JDK 11, JDK 12 and JDK 8u192 Early Access builds are available on jdk.java.net - posted by Tim Allison <ta...@apache.org> on 2018/08/08 12:14:45 UTC, 1 reply.
- [jira] [Commented] (TIKA-2693) Tika 1.17 uses the wrong classloader for reflection - posted by "Karl Wright (JIRA)" <ji...@apache.org> on 2018/08/08 13:04:00 UTC, 10 replies.
- [jira] [Created] (TIKA-2706) Store exceptions from VBAMacroReader as we do other embedded exceptions - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/08 17:21:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2706) Store exceptions from VBAMacroReader as we do other embedded exceptions - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/08 17:32:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2706) Store exceptions from VBAMacroReader as we do other embedded exceptions - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/08/08 18:12:00 UTC, 2 replies.
- [jira] [Comment Edited] (TIKA-2693) Tika 1.17 uses the wrong classloader for reflection - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/08 19:37:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2672) Upgrade dl4j to 1.0.0-beta2 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/09 16:04:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2672) Upgrade dl4j to 1.0.0-beta2 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/09 16:05:00 UTC, 7 replies.
- [jira] [Resolved] (TIKA-2695) Upgrade Lucene in tika-eval and tika-example - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/09 19:01:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2695) Upgrade Lucene in tika-eval and tika-example - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/09 19:01:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2695) Upgrade Lucene in tika-eval and tika-example - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/08/09 19:43:00 UTC, 2 replies.
- [jira] [Commented] (TIKA-2696) Support output of Tesseract OSD output for psm mode 0 - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/08/13 17:48:00 UTC, 1 reply.
- [jira] [Updated] (TIKA-2696) Support output of Tesseract OSD output for psm mode 0 - posted by "August Valera (JIRA)" <ji...@apache.org> on 2018/08/13 18:08:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2611) Tika mistakenly determines mimetype of .js file as application/x-elc - posted by "Umut Saribiyik (JIRA)" <ji...@apache.org> on 2018/08/14 14:58:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-2611) Tika mistakenly determines mimetype of .js file as application/x-elc - posted by "Umut Saribiyik (JIRA)" <ji...@apache.org> on 2018/08/14 14:59:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2672) Upgrade dl4j to 1.0.0-beta2 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/14 15:53:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2667) Upgrade jmatio to 1.4 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/14 15:54:00 UTC, 0 replies.
- tika-2.x-windows - Build # 301 - Failure - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/08/14 16:16:50 UTC, 0 replies.
- [jira] [Commented] (TIKA-2667) Upgrade jmatio to 1.4 - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/08/14 16:17:00 UTC, 2 replies.
- tika-2.x-windows - Build # 302 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/08/14 17:16:39 UTC, 0 replies.
- [jira] [Created] (TIKA-2707) Upgrade to commons-compress 1.18 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/16 15:17:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2707) Upgrade to commons-compress 1.18 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/16 15:33:01 UTC, 0 replies.
- [jira] [Commented] (TIKA-2707) Upgrade to commons-compress 1.18 - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/08/16 16:30:00 UTC, 2 replies.
- tika-2.x-windows - Build # 303 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/08/16 18:47:45 UTC, 0 replies.
- [jira] [Created] (TIKA-2708) Bold text is omitted from PDF documents - posted by "Dmytro Sadovnychyi (JIRA)" <ji...@apache.org> on 2018/08/16 19:51:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2709) Invalid handling of tags - posted by "Gerard Bouchar (JIRA)" <ji...@apache.org> on 2018/08/17 12:16:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2709) Invalid handling of tags - posted by "Gerard Bouchar (JIRA)" <ji...@apache.org> on 2018/08/17 12:21:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2710) Set Tika to OSGi Execution Environment JavaSE-1.8 - posted by "Bob Paulin (JIRA)" <ji...@apache.org> on 2018/08/17 18:22:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2692) Blanket upgrades in prep for 1.19 - posted by "Bob Paulin (JIRA)" <ji...@apache.org> on 2018/08/17 18:24:00 UTC, 1 reply.
- [jira] [Created] (TIKA-2711) When parsing a UNIX text file apostrophes are rendered as ? - posted by "Ichbiah (JIRA)" <ji...@apache.org> on 2018/08/17 19:03:09 UTC, 0 replies.
- [jira] [Commented] (TIKA-2710) Set Tika to OSGi Execution Environment JavaSE-1.8 - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/08/18 03:03:00 UTC, 2 replies.
- tika-2.x-windows - Build # 304 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/08/18 03:16:38 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2710) Set Tika to OSGi Execution Environment JavaSE-1.8 - posted by "Bob Paulin (JIRA)" <ji...@apache.org> on 2018/08/18 03:17:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2712) List start element is not being output - posted by "Dave Kincaid (JIRA)" <ji...@apache.org> on 2018/08/19 21:30:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2689) *.ai type (Adobe illustrator ) files are not detected correctly. - posted by "Amit Pandey (JIRA)" <ji...@apache.org> on 2018/08/20 07:41:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2689) *.ai type (Adobe illustrator ) files are not detected correctly. - posted by "Amit Pandey (JIRA)" <ji...@apache.org> on 2018/08/20 07:43:00 UTC, 2 replies.
- [jira] [Created] (TIKA-2713) Setting classpath for tika-server and opennlp processor models - posted by "Badger (JIRA)" <ji...@apache.org> on 2018/08/20 16:40:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2713) Setting classpath for tika-server and opennlp processor models - posted by "Badger (JIRA)" <ji...@apache.org> on 2018/08/20 16:43:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2713) Setting classpath for tika-server and opennlp processor models - posted by "Badger (JIRA)" <ji...@apache.org> on 2018/08/20 17:28:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2713) Setting classpath for tika-server and opennlp processor models - posted by "Badger (JIRA)" <ji...@apache.org> on 2018/08/20 17:29:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2714) Tika Parse Errors for certain attachments - posted by "Suman Moorthy (JIRA)" <ji...@apache.org> on 2018/08/20 22:48:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2715) Detection Encoding Problem - posted by "Sébastien Nicouleau (JIRA)" <ji...@apache.org> on 2018/08/21 09:20:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2716) Sonatype Nexus auditor is reporting that spring framework version used by Tika 1.18 is vulnerable - posted by "Abhijit Rajwade (JIRA)" <ji...@apache.org> on 2018/08/22 09:33:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2717) Sonatype Nexus auditor is reporting that Jackson databind version used by Apache Tika is vulnerable - posted by "Abhijit Rajwade (JIRA)" <ji...@apache.org> on 2018/08/22 09:48:00 UTC, 0 replies.
- JDK 11: First Release Candidate available - posted by Rory O'Donnell <ro...@oracle.com> on 2018/08/24 09:33:16 UTC, 0 replies.
- [jira] [Commented] (TIKA-1880) Attribute number-columns-repeated not correctly used in ODS documents - posted by "Michael Standfuss (JIRA)" <ji...@apache.org> on 2018/08/24 17:55:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2718) encrypted rar file can't throw EncryptedDocumentException and extract encrypted content as file content - posted by "YongZhao (JIRA)" <ji...@apache.org> on 2018/08/27 06:14:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2718) encrypted rar file can't throw EncryptedDocumentException and extract encrypted content as file content - posted by "YongZhao (JIRA)" <ji...@apache.org> on 2018/08/27 06:34:00 UTC, 3 replies.
- [jira] [Commented] (TIKA-2714) Tika Parse Errors for certain attachments - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/27 17:19:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2718) encrypted rar file can't throw EncryptedDocumentException and extract encrypted content as file content - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/27 17:25:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2717) Sonatype Nexus auditor is reporting that Jackson databind version used by Apache Tika is vulnerable - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/27 17:33:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2715) Detection Encoding Problem - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/27 17:42:00 UTC, 0 replies.
- Help with extracting attached rfc822 - posted by Tim Allison <ta...@apache.org> on 2018/08/28 14:08:10 UTC, 0 replies.
- [jira] [Commented] (TIKA-2680) Email attachments to an email are not extracted - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/08/28 15:06:00 UTC, 1 reply.
- [jira] [Created] (TIKA-2719) Java 9: Requiring tika-parsers from module-info.java fails with "module not found" - posted by "James Baker (JIRA)" <ji...@apache.org> on 2018/08/30 13:45:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2719) Java 9: Requiring tika-parsers from module-info.java fails with "module not found" - posted by "Bob Paulin (JIRA)" <ji...@apache.org> on 2018/08/30 16:59:00 UTC, 4 replies.