dev@tika.apache.org, 2019-01

- [jira] [Created] (TIKA-2803) Apache Tika not properly extracting text from PDF for Indian languages - posted by "Subramanian (JIRA)" <ji...@apache.org> on 2019/01/01 06:15:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2803) Apache Tika not properly extracting text from PDF for Indian languages - posted by "Subramanian (JIRA)" <ji...@apache.org> on 2019/01/01 06:22:00 UTC, 1 reply.
- [jira] [Commented] (TIKA-2749) OCR on PDFs should "just work" out of the box - posted by "Markus Mandalka (JIRA)" <ji...@apache.org> on 2019/01/02 13:12:00 UTC, 2 replies.
- [jira] [Commented] (TIKA-2801) Tika includes 2 vulnerable components - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2019/01/03 16:20:00 UTC, 3 replies.
- [jira] [Created] (TIKA-2804) Blanket dependency upgrades for next release cycle - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2019/01/03 16:51:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2804) Blanket dependency upgrades for next release cycle - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2019/01/03 17:17:00 UTC, 3 replies.
- [jira] [Commented] (TIKA-2787) Make WriteLimitReachedException public and not subclass of SAXException - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2019/01/03 17:42:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-2787) Make WriteLimitReachedException public and not subclass of SAXException - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2019/01/03 17:45:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2803) Apache Tika not properly extracting text from PDF for Indian languages - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2019/01/03 17:55:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2802) Out of memory issues when extracting large files (pst) - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2019/01/03 18:30:00 UTC, 15 replies.
- tika-2.x-windows - Build # 369 - Failure - posted by Apache Jenkins Server <je...@builds.apache.org> on 2019/01/03 20:16:56 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-2765) Regression extracting text from corrupted docx files - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2019/01/03 20:33:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2765) Regression extracting text from corrupted docx files - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2019/01/03 20:33:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2726) Handle truncated ooxml more robustly - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2019/01/03 20:34:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2765) Regression extracting text from corrupted docx files - posted by "Hudson (JIRA)" <ji...@apache.org> on 2019/01/03 21:00:00 UTC, 5 replies.
- JDK 12 Early Access build 26 & JDK 13 Early Access builds available - posted by Rory O'Donnell <ro...@oracle.com> on 2019/01/04 10:22:42 UTC, 0 replies.
- [jira] [Updated] (TIKA-2802) Out of memory issues when extracting large files (pst) - posted by "Caleb Ott (JIRA)" <ji...@apache.org> on 2019/01/04 16:32:00 UTC, 2 replies.
- [jira] [Created] (TIKA-2805) Should the HTML parser by default just ignore the

[jira] [Updated] (TIKA-2805) Should the HTML parser by default just ignore the