You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] [Commented] (TIKA-2322) Video labeling using existing ObjectRecognition - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2017/05/01 03:18:04 UTC, 38 replies.
- [jira] [Reopened] (TIKA-2311) Preserve "x-tika-ooxml" mime value for truncated ooxml files - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/01 14:02:04 UTC, 0 replies.
- [jira] [Updated] (TIKA-2311) Preserve "x-tika-ooxml" mime value for truncated ooxml files - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/01 14:02:04 UTC, 0 replies.
- [jira] [Created] (TIKA-2348) Improve error reporting in wmf/emf - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/01 15:24:04 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2348) Improve error reporting in wmf/emf - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/01 15:25:04 UTC, 0 replies.
- [jira] [Created] (TIKA-2349) Try to match digests when finding equivalent embedded files in tika-eval Compare - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/01 16:13:04 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2349) Try to match digests when finding equivalent embedded files in tika-eval Compare - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/01 16:22:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2349) Try to match digests when finding equivalent embedded files in tika-eval Compare - posted by "Hudson (JIRA)" <ji...@apache.org> on 2017/05/01 16:40:04 UTC, 2 replies.
- [jira] [Commented] (TIKA-2348) Improve error reporting in wmf/emf - posted by "Hudson (JIRA)" <ji...@apache.org> on 2017/05/01 17:41:04 UTC, 1 replies.
- [jira] [Commented] (TIKA-2016) A parser that combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text. - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2017/05/01 18:43:04 UTC, 12 replies.
- [jira] [Created] (TIKA-2350) Add catch block when opening Action on document open in PDFParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/01 19:04:04 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2350) Add catch block when opening Action on document open in PDFParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/01 19:24:04 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2311) Preserve "x-tika-ooxml" mime value for truncated ooxml files - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/01 19:24:04 UTC, 0 replies.
- [jira] [Updated] (TIKA-2350) Add catch block when opening Action on document open in PDFParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/01 19:24:04 UTC, 0 replies.
- RE: Tika 1.15 - posted by "Allison, Timothy B." <ta...@mitre.org> on 2017/05/01 19:42:42 UTC, 19 replies.
- [jira] [Commented] (TIKA-2350) Add catch block when opening Action on document open in PDFParser - posted by "Hudson (JIRA)" <ji...@apache.org> on 2017/05/01 19:49:04 UTC, 2 replies.
- [jira] [Commented] (TIKA-2311) Preserve "x-tika-ooxml" mime value for truncated ooxml files - posted by "Hudson (JIRA)" <ji...@apache.org> on 2017/05/01 19:49:04 UTC, 2 replies.
- [jira] [Commented] (TIKA-2342) Broken words - posted by "Nino Skopac (JIRA)" <ji...@apache.org> on 2017/05/02 11:32:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2344) Cannot sub to mailing list - posted by "Nino Skopac (JIRA)" <ji...@apache.org> on 2017/05/02 11:36:04 UTC, 0 replies.
- [jira] [Closed] (TIKA-2344) Cannot sub to mailing list - posted by "Nino Skopac (JIRA)" <ji...@apache.org> on 2017/05/02 11:36:04 UTC, 0 replies.
- [jira] [Closed] (TIKA-2342) Broken words - posted by "Nino Skopac (JIRA)" <ji...@apache.org> on 2017/05/02 11:37:04 UTC, 0 replies.
- [jira] [Created] (TIKA-2351) Getting error while parsing documents - posted by "VENU (JIRA)" <ji...@apache.org> on 2017/05/02 12:38:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2351) Getting error while parsing documents - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2017/05/02 12:41:04 UTC, 5 replies.
- [jira] [Updated] (TIKA-2351) Getting error while parsing documents - posted by "VENU (JIRA)" <ji...@apache.org> on 2017/05/02 12:51:04 UTC, 1 replies.
- [jira] [Created] (TIKA-2352) Incorrect EOF error in WordPerfect parser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/02 13:21:04 UTC, 0 replies.
- [jira] [Updated] (TIKA-2352) Incorrect EOF exception in WordPerfect parser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/02 13:22:04 UTC, 2 replies.
- [jira] [Resolved] (TIKA-2322) Video labeling using existing ObjectRecognition - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/02 13:53:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2352) Incorrect EOF exception in WordPerfect parser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/02 19:50:04 UTC, 17 replies.
- [jira] [Comment Edited] (TIKA-2352) Incorrect EOF exception in WordPerfect parser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/02 19:52:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-1334) Add presentation layer for results of each run - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/03 11:28:04 UTC, 15 replies.
- [jira] [Comment Edited] (TIKA-1334) Add presentation layer for results of each run - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/03 11:29:04 UTC, 5 replies.
- [jira] [Created] (TIKA-2353) How to fetch document creator/author/last-modified - posted by "VENU (JIRA)" <ji...@apache.org> on 2017/05/03 11:41:04 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2353) How to fetch document creator/author/last-modified - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2017/05/03 11:47:04 UTC, 0 replies.
- Re: Apache Tika - posted by Chris Mattmann <ma...@apache.org> on 2017/05/03 13:58:09 UTC, 0 replies.
- OSGI expert help from Bob/others: TIKA-2016 - posted by Chris Mattmann <ma...@apache.org> on 2017/05/03 18:54:12 UTC, 3 replies.
- [jira] [Resolved] (TIKA-2352) Incorrect EOF exception in WordPerfect parser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/03 20:26:04 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2343) --text-main in tika-server - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/04 01:32:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2343) --text-main in tika-server - posted by "Hudson (JIRA)" <ji...@apache.org> on 2017/05/04 01:50:04 UTC, 6 replies.
- [jira] [Created] (TIKA-2354) Missing many embedded images in .doc files - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/04 02:21:04 UTC, 0 replies.
- [jira] [Updated] (TIKA-2354) Missing many embedded images in .doc files - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/04 02:22:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2354) Missing many embedded images in .doc files - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/04 02:23:04 UTC, 3 replies.
- [jira] [Resolved] (TIKA-2354) Missing many embedded images in .doc files - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/04 02:36:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2293) Tess4jOCRParser - A simpler Java version of TesseractOCRParser - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2017/05/04 14:55:05 UTC, 5 replies.
- Welcome Thejan Wijesinghe GSoC 2017 student! - posted by Chris Mattmann <ma...@apache.org> on 2017/05/04 17:19:38 UTC, 8 replies.
- [jira] [Created] (TIKA-2355) Cache trained mode while running ObjectRecognition server from Docker builds - posted by "Madhav Sharan (JIRA)" <ji...@apache.org> on 2017/05/04 22:47:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2355) Cache trained mode while running ObjectRecognition server from Docker builds - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2017/05/04 23:01:04 UTC, 2 replies.
- [jira] [Resolved] (TIKA-2293) Tess4jOCRParser - A simpler Java version of TesseractOCRParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/05 00:13:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-2293) Tess4jOCRParser - A simpler Java version of TesseractOCRParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/05 00:15:04 UTC, 0 replies.
- [jira] [Created] (TIKA-2356) Temporarily prevent duplication of sheets in some xlsx POI-61034 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/05 13:59:04 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2356) Temporarily prevent duplication of sheets in some xlsx POI-61034 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/05 14:06:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2336) Upgrade to POI 3.17-beta1 when available - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/05 14:07:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2356) Temporarily prevent duplication of sheets in some xlsx POI-61034 - posted by "Hudson (JIRA)" <ji...@apache.org> on 2017/05/05 15:05:04 UTC, 0 replies.
- [jira] [Created] (TIKA-2357) Allow Tesseract PSM up to 13 - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2017/05/08 12:55:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2357) Allow Tesseract PSM up to 13 - posted by "Hudson (JIRA)" <ji...@apache.org> on 2017/05/08 15:10:04 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2357) Allow Tesseract PSM up to 13 - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2017/05/08 15:21:04 UTC, 0 replies.
- TODO: Reminder: Thamme & Chris to document DL4J vision/Tika-DL - posted by Chris Mattmann <ma...@apache.org> on 2017/05/09 14:27:24 UTC, 5 replies.
- [jira] [Commented] (TIKA-2318) Improve reports for Compare option in tika-eval - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/09 18:58:04 UTC, 2 replies.
- dl4j blows tika-app up to 270MB - posted by "Allison, Timothy B." <ta...@mitre.org> on 2017/05/09 19:26:59 UTC, 3 replies.
- [jira] [Created] (TIKA-2358) Avoid bundling dl4j with tika-app and tika-server - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/09 19:45:04 UTC, 0 replies.
- [jira] [Updated] (TIKA-2358) Avoid bundling dl4j with tika-app and tika-server - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/09 19:51:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2358) Avoid bundling dl4j with tika-app and tika-server - posted by "Hudson (JIRA)" <ji...@apache.org> on 2017/05/09 21:06:04 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2358) Avoid bundling dl4j with tika-app and tika-server - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/10 00:15:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-1867) Tika external parsers cannot be turned off without patching the tika-app-XX.jar - posted by "Daniel Conn (JIRA)" <ji...@apache.org> on 2017/05/10 17:00:09 UTC, 1 replies.
- [jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2017/05/10 18:48:04 UTC, 7 replies.
- [jira] [Updated] (TIKA-2359) Extreme slow parsing on the attachment attached - posted by "Eugen Mayer (JIRA)" <ji...@apache.org> on 2017/05/11 13:50:04 UTC, 0 replies.
- [jira] [Created] (TIKA-2359) Extreme slow parsing on the attachment attached - posted by "Eugen Mayer (JIRA)" <ji...@apache.org> on 2017/05/11 13:50:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2359) Extreme slow parsing on the attachment attached - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/11 15:46:04 UTC, 27 replies.
- [jira] [Comment Edited] (TIKA-2359) Extreme slow parsing on the attachment attached - posted by "Eugen Mayer (JIRA)" <ji...@apache.org> on 2017/05/12 07:04:04 UTC, 4 replies.
- Tika talk next week - help needed! - posted by Nick Burch <ni...@apache.org> on 2017/05/14 15:34:21 UTC, 8 replies.
- [jira] [Commented] (TIKA-390) Missing Header/Footer text for ODT documents - posted by "Ulf Dittmer (JIRA)" <ji...@apache.org> on 2017/05/15 09:51:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-1768) Document headers and footers in metadata - posted by "Ulf Dittmer (JIRA)" <ji...@apache.org> on 2017/05/15 09:55:04 UTC, 0 replies.
- [jira] [Created] (TIKA-2360) Handle SentimentParser resource failure more robustly - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/15 14:43:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2360) Handle SentimentParser resource failure more robustly - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/15 14:49:04 UTC, 11 replies.
- [jira] [Comment Edited] (TIKA-2360) Handle SentimentParser resource failure more robustly - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/15 15:22:04 UTC, 1 replies.
- [jira] [Created] (TIKA-2361) Upgrade to PDFBox 2.0.6 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/16 02:38:04 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2361) Upgrade to PDFBox 2.0.6 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/16 10:09:04 UTC, 0 replies.
- [jira] [Created] (TIKA-2362) Skipping Header and Footer data from documents - posted by "Mujahid Ateeb Khan (JIRA)" <ji...@apache.org> on 2017/05/16 10:45:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2361) Upgrade to PDFBox 2.0.6 - posted by "Hudson (JIRA)" <ji...@apache.org> on 2017/05/16 11:49:04 UTC, 0 replies.
- [jira] [Created] (TIKA-2363) Skip image recognition test if network call fails - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/16 12:14:04 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2360) Handle SentimentParser resource failure more robustly - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/16 12:19:04 UTC, 1 replies.
- [jira] [Updated] (TIKA-2360) Handle SentimentParser resource failure more robustly - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/16 12:19:04 UTC, 0 replies.
- [jira] [Assigned] (TIKA-2362) Skipping Header and Footer data from documents - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/16 12:21:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2362) Skipping Header and Footer data from documents - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/16 12:22:04 UTC, 6 replies.
- [jira] [Created] (TIKA-2364) Clean up printstacktrace - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/16 12:23:04 UTC, 0 replies.
- [jira] [Created] (TIKA-2365) Signer's Information doesn't match issue - posted by "Mujahid Ateeb Khan (JIRA)" <ji...@apache.org> on 2017/05/16 12:49:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2363) Skip image recognition test if network call fails - posted by "Hudson (JIRA)" <ji...@apache.org> on 2017/05/16 12:49:04 UTC, 0 replies.
- [jira] [Updated] (TIKA-2365) Signer's Information doesn't match issue - posted by "Mujahid Ateeb Khan (JIRA)" <ji...@apache.org> on 2017/05/16 12:51:04 UTC, 1 replies.
- [jira] [Created] (TIKA-2366) Add image cropping functionality to TesseractOCRParser - posted by "Zachary Lee Jones (JIRA)" <ji...@apache.org> on 2017/05/16 14:20:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2364) Clean up printstacktrace - posted by "Hudson (JIRA)" <ji...@apache.org> on 2017/05/16 15:12:04 UTC, 0 replies.
- [jira] [Created] (TIKA-2367) Avoid npe in wmf - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/16 18:56:04 UTC, 0 replies.
- [jira] [Updated] (TIKA-2364) Clean up printstacktrace - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/16 18:58:04 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2364) Clean up printstacktrace - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/16 18:58:04 UTC, 0 replies.
- [jira] [Reopened] (TIKA-2360) Handle SentimentParser resource failure more robustly - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/16 19:00:05 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2367) Avoid npe in wmf - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/16 19:02:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2367) Avoid npe in wmf - posted by "Hudson (JIRA)" <ji...@apache.org> on 2017/05/16 19:46:04 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-2362) Skipping Header and Footer data from documents - posted by "Mujahid Ateeb Khan (JIRA)" <ji...@apache.org> on 2017/05/17 04:18:04 UTC, 1 replies.
- [jira] [Created] (TIKA-2368) Clean up SentimentParser dependencies - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/17 12:16:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2365) Signer's Information doesn't match issue - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2017/05/17 13:45:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2368) Clean up SentimentParser dependencies - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/18 01:59:04 UTC, 3 replies.
- [jira] [Created] (TIKA-2369) Define a clean Recogniser interface: for objects from binary data; and for text classification - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/18 02:06:04 UTC, 0 replies.
- Re: TikaInputStream parse the content and write to OutputStream - posted by Chris Mattmann <ma...@apache.org> on 2017/05/18 02:11:30 UTC, 2 replies.
- [jira] [Created] (TIKA-2370) Close Font in TrueTypeParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/18 10:16:04 UTC, 0 replies.
- [jira] [Updated] (TIKA-2370) Close Font in TrueTypeParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/18 10:28:05 UTC, 1 replies.
- [jira] [Resolved] (TIKA-2370) Close Font in TrueTypeParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/18 10:28:05 UTC, 0 replies.
- [jira] [Updated] (TIKA-2368) Clean up SentimentParser dependencies - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/18 10:37:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2370) Close Font in TrueTypeParser - posted by "Hudson (JIRA)" <ji...@apache.org> on 2017/05/18 11:17:04 UTC, 0 replies.
- [commons-text] Regarding code consolidation. - posted by Rob Tompkins <ch...@apache.org> on 2017/05/18 13:46:13 UTC, 0 replies.
- [jira] [Created] (TIKA-2371) Check properties presence - PDFParser - posted by "Julien Massiera (JIRA)" <ji...@apache.org> on 2017/05/18 14:00:19 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2368) Clean up SentimentParser dependencies - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/18 14:14:04 UTC, 0 replies.
- [jira] [Created] (TIKA-2372) OSX DMG support - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2017/05/18 19:29:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2372) OSX DMG support - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2017/05/18 19:35:12 UTC, 2 replies.
- Tika App, Extract (-z) and Inline PDF Images? - posted by Nick Burch <ni...@apache.org> on 2017/05/18 21:02:24 UTC, 2 replies.
- [jira] [Updated] (TIKA-1705) Update ASM dependency to 5.0.4 - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-2340) Add explicit deps to tika-parsers which are currently used from transitive scope - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1697) Parser Implementation for AkomaNtoso Legal XML Documents - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1059) Better Handling of InterruptedException in ExternalParser and ExternalEmbedder - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-894) Add webapp mode for Tika Server, simplifies deployment - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-891) Use POST in addition to PUT on method calls in tika-server - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-987) Embedded drawing (SHAPE MERGEFORMAT) sometimes not extracted - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1436) improvement to PDFParser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1395) Create embedded image extraction example - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1366) Update some of Tika Server services to support JAX-RS 2.0 AsyncResponse - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-988) We don't extract a placeholder for a Word document embedded in an Excel document - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-819) Make Option to Exclude Embedded Files' Text for Text Content - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-1220) Parser implementration for IFC files - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-1390) Create tika-example module - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-1456) Visual Sentiment API parser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-2016) A parser that combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text. - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:09 UTC, 1 replies.
- [jira] [Updated] (TIKA-1724) Create parser for .obo file format. - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-1208) Migrate Any23 mime contributions to Tika - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-1367) Tika documentation should list tika-parsers parser dependencies - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-539) Encoding detection is too biased by encoding in meta tag - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-1609) Leverage Google's LibPhonenumber for enhanced phone number extraction and metadata modeling - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-776) ExifTool Embedder - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-1577) NetCDF Data Extraction - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:10 UTC, 0 replies.
- [jira] [Updated] (TIKA-2338) Change Scope of Jai-ImageIO-Core dependency - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:10 UTC, 0 replies.
- [jira] [Updated] (TIKA-1465) Implement extraction of non-global variables from netCDF3 and netCDF4 - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:10 UTC, 0 replies.
- [jira] [Updated] (TIKA-1328) Translate Metadata and Content - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:10 UTC, 0 replies.
- [jira] [Updated] (TIKA-1840) No way to link slide notes to slide in PPT output. - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:10 UTC, 0 replies.
- [jira] [Updated] (TIKA-1616) Tika Parser for GIBS Metadata - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:10 UTC, 0 replies.
- [jira] [Updated] (TIKA-1308) Support in memory parse mode(don't create temp file): to support run Tika in GAE - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:10 UTC, 0 replies.
- [jira] [Updated] (TIKA-1276) Missing embedded dependencies in tika-bundle - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:10 UTC, 0 replies.
- [jira] [Updated] (TIKA-1709) Tika Server doesn't handle multi-part attachments or form-encoded inputs - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:10 UTC, 0 replies.
- [jira] [Updated] (TIKA-1598) Parser Implementation for Streaming Video - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:10 UTC, 0 replies.
- [jira] [Updated] (TIKA-1688) Tika Version in Metadata - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:10 UTC, 0 replies.
- [jira] [Updated] (TIKA-1505) chmparser breaks down when extracting from file of CHM format v3 - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:10 UTC, 0 replies.
- [jira] [Updated] (TIKA-1607) Introduce new arbitrary object key/values data structure for persistence of Tika Metadata - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:10 UTC, 0 replies.
- [jira] [Updated] (TIKA-1540) New Tika plugin for image based feature extraction using computer vision techniques - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:10 UTC, 0 replies.
- [jira] [Updated] (TIKA-1425) Automatic batching of Microsoft service calls - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:10 UTC, 0 replies.
- [jira] [Updated] (TIKA-2346) Allow Office format parsers to exclude parsing shapes - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:10 UTC, 0 replies.
- [jira] [Updated] (TIKA-1295) Make some Dublin Core items multi-valued - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:10 UTC, 0 replies.
- [jira] [Updated] (TIKA-2312) [Mp3Parser] expose fields form ID3TagsAndAudio - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:10 UTC, 0 replies.
- [jira] [Updated] (TIKA-1953) tika-server NullPointerException while processing rtfs - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:10 UTC, 0 replies.
- [jira] [Updated] (TIKA-1318) Use of Deprecated Word6Extractor.getParagraphText() Method - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:11 UTC, 0 replies.
- [jira] [Updated] (TIKA-1738) ForkClient does not always delete temporary bootstrap jar - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:11 UTC, 0 replies.
- [jira] [Updated] (TIKA-1640) Make ExternalParser support aliases for key names in extracted metadata - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:11 UTC, 0 replies.
- [jira] [Updated] (TIKA-1329) Add RecursiveParserWrapper aka Jukka's (and Nick's) RecursiveMetadataParser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:11 UTC, 0 replies.
- [jira] [Updated] (TIKA-774) ExifTool Parser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:11 UTC, 0 replies.
- [jira] [Updated] (TIKA-1301) Establish TikaServer on Apache hosted VM - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:11 UTC, 0 replies.
- [jira] [Updated] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:11 UTC, 0 replies.
- [jira] [Updated] (TIKA-985) Support for HTML5 elements - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:11 UTC, 0 replies.
- [jira] [Updated] (TIKA-1454) Extracting as HTML loses links in xlsx, ppt, and pptx files - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:11 UTC, 0 replies.
- [jira] [Updated] (TIKA-1808) Head section closed too eager - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:11 UTC, 0 replies.
- [jira] [Updated] (TIKA-1417) Create Extract Embedded Images from PDFs Example - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:11 UTC, 0 replies.
- [jira] [Updated] (TIKA-1815) Text content from parser is empty when NamedEntityParser is enabled - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:11 UTC, 0 replies.
- [jira] [Updated] (TIKA-1674) Add example to show how to extract embedded files - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:12 UTC, 0 replies.
- [jira] [Updated] (TIKA-1672) Integrate tika-java7 component - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:12 UTC, 0 replies.
- [jira] [Updated] (TIKA-1706) Bring back commons-io to tika-core - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:12 UTC, 0 replies.
- [jira] [Updated] (TIKA-715) Some parsers produce non-well-formed XHTML SAX events - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:12 UTC, 0 replies.
- [jira] [Updated] (TIKA-1829) org.apache.tika.parser.ocr.TesseractOCRParser.getSupportedTypes(TesseractOCRParser.java:92) NPE - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:12 UTC, 0 replies.
- [jira] [Updated] (TIKA-1800) MediaType#parse does not decode escaped special characters - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:12 UTC, 0 replies.
- [jira] [Updated] (TIKA-1952) Access Date is getting modified while capturing the MetaData information using AutoDetectParser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:12 UTC, 0 replies.
- [jira] [Updated] (TIKA-980) MicrodataContentHandler for Apache Tika - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:12 UTC, 0 replies.
- [jira] [Updated] (TIKA-1379) error in Tika().detect for xml files with xades signature - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:12 UTC, 0 replies.
- [jira] [Updated] (TIKA-1108) Represent individual slides in pptx - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:12 UTC, 0 replies.
- [jira] [Updated] (TIKA-1106) CLAVIN Integration - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:13 UTC, 1 replies.
- [jira] [Updated] (TIKA-1518) Docker with Tika Server - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:40:13 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2016) A parser that combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text. - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:41:04 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1815) Text content from parser is empty when NamedEntityParser is enabled - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:42:04 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1106) CLAVIN Integration - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2017/05/21 15:46:04 UTC, 0 replies.
- [jira] [Created] (TIKA-2373) Fix licenses via rat before 1.15 release - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/22 14:35:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2373) Fix licenses via rat before 1.15 release - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/22 14:37:04 UTC, 2 replies.
- [jira] [Updated] (TIKA-1334) Add presentation layer for results of each run - posted by "Stephen Downie (JIRA)" <ji...@apache.org> on 2017/05/22 15:01:04 UTC, 0 replies.
- [jira] [Created] (TIKA-2374) Tika App -z should extract PDF inline images by default - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2017/05/22 16:53:04 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2373) Fix licenses via rat before 1.15 release - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/22 17:34:04 UTC, 0 replies.
- [VOTE] Release Apache Tika 1.15 Candidate #1 - posted by Tim Allison <ta...@apache.org> on 2017/05/22 19:25:07 UTC, 9 replies.
- [VOTE] Release Apache Tika 1.15 Candidate #2 - posted by Tim Allison <ta...@apache.org> on 2017/05/24 01:22:43 UTC, 6 replies.
- Integrating Tika with Apache Beam - posted by Sergey Beryozkin <sb...@gmail.com> on 2017/05/24 10:41:54 UTC, 4 replies.
- [jira] [Created] (TIKA-2375) Add tika-eval to release artifacts - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/24 13:13:04 UTC, 0 replies.
- [jira] [Created] (TIKA-2376) Avoid org.json dependency - posted by "Claus Ibsen (JIRA)" <ji...@apache.org> on 2017/05/25 07:30:04 UTC, 0 replies.
- [jira] [Updated] (TIKA-2376) Avoid org.json dependency - posted by "Claus Ibsen (JIRA)" <ji...@apache.org> on 2017/05/25 07:30:05 UTC, 2 replies.
- [jira] [Commented] (TIKA-2376) Avoid org.json dependency - posted by "Claus Ibsen (JIRA)" <ji...@apache.org> on 2017/05/25 07:31:04 UTC, 6 replies.
- [jira] [Created] (TIKA-2377) Remove org.json from TEIParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/25 19:03:04 UTC, 0 replies.
- [jira] [Created] (TIKA-2378) Error extracting text from application/x-msaccess mime type - posted by "Steve Reynolds (JIRA)" <ji...@apache.org> on 2017/05/25 23:02:04 UTC, 0 replies.
- [jira] [Updated] (TIKA-2378) Error extracting text from application/x-msaccess mime type - posted by "Steve Reynolds (JIRA)" <ji...@apache.org> on 2017/05/25 23:03:04 UTC, 1 replies.
- [jira] [Commented] (TIKA-2378) Error extracting text from application/x-msaccess mime type - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2017/05/25 23:12:04 UTC, 4 replies.
- [jira] [Commented] (TIKA-2262) Supporting Image-to-Text (Image Captioning) in Tika for Image MIME Types - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2017/05/27 16:29:04 UTC, 0 replies.
- Jirasearch for Tika, Lucene, Solr, Infra issues: you can now drill down by attachment type - posted by Michael McCandless <lu...@mikemccandless.com> on 2017/05/29 16:52:58 UTC, 0 replies.
- Release process git tag - posted by "Allison, Timothy B." <ta...@mitre.org> on 2017/05/30 13:12:46 UTC, 1 replies.
- location for javadocs? - posted by "Allison, Timothy B." <ta...@mitre.org> on 2017/05/30 15:45:11 UTC, 2 replies.
- [ANNOUNCE] Apache Tika 1.15 released - posted by Tim Allison <ta...@apache.org> on 2017/05/30 16:17:45 UTC, 3 replies.
- [jira] [Commented] (TIKA-1804) Tika use no free json.org - posted by "Hudson (JIRA)" <ji...@apache.org> on 2017/05/30 19:09:04 UTC, 0 replies.
- [jira] [Updated] (TIKA-1804) Tika use no free json.org - posted by "Mattmann, Chris A (388J) (JIRA)" <ji...@apache.org> on 2017/05/30 19:10:04 UTC, 1 replies.
- [jira] [Created] (TIKA-2379) tika-bundle 1.1.5 has wrong import of org.sfl4j.event package which does not exists - posted by "Claus Ibsen (JIRA)" <ji...@apache.org> on 2017/05/31 07:59:04 UTC, 0 replies.
- [jira] [Updated] (TIKA-2379) tika-bundle 1.15 has wrong import of org.sfl4j.event package which does not exists - posted by "Claus Ibsen (JIRA)" <ji...@apache.org> on 2017/05/31 08:00:18 UTC, 1 replies.
- [jira] [Commented] (TIKA-2379) tika-bundle 1.15 has wrong import of org.sfl4j.event package which does not exists - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/31 12:47:04 UTC, 1 replies.
- [jira] [Assigned] (TIKA-2379) tika-bundle 1.15 has wrong import of org.sfl4j.event package which does not exists - posted by "Bob Paulin (JIRA)" <ji...@apache.org> on 2017/05/31 14:12:04 UTC, 0 replies.
- [jira] [Created] (TIKA-2380) Upgrade to Jackcess 2.1.8 when available - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/31 14:14:04 UTC, 0 replies.
- experiences with Tika in Docker - posted by "Allison, Timothy B." <ta...@mitre.org> on 2017/05/31 19:33:00 UTC, 1 replies.
- [jira] [Created] (TIKA-2381) Include tika-eval artifact in release - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/31 20:16:04 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2381) Include tika-eval artifact in release - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/05/31 20:46:04 UTC, 0 replies.