- [jira] [Commented] (TIKA-2187) Align default behavior of experimental docx parser with that of doc parser in handling delText - posted by "Luis Filipe Nassif (JIRA)" <ji...@apache.org> on 2016/12/01 00:46:58 UTC, 4 replies.
- [jira] [Resolved] (TIKA-2090) Extract javascript from PDActions in PDFs - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/01 00:48:58 UTC, 0 replies.
- tika-2.x-windows - Build # 81 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/12/01 01:19:27 UTC, 0 replies.
- [jira] [Commented] (TIKA-2090) Extract javascript from PDActions in PDFs - posted by "Hudson (JIRA)" <ji...@apache.org> on 2016/12/01 01:19:59 UTC, 2 replies.
- RE: FW: ApacheCon Miami is coming in May. - posted by "Allison, Timothy B." <ta...@mitre.org> on 2016/12/01 13:49:56 UTC, 2 replies.
- [jira] [Commented] (TIKA-2152) NullPointerException on a valid Word file - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/01 15:23:59 UTC, 0 replies.
- [jira] [Commented] (TIKA-2180) Multiple requests on Tika to extract text slows down - posted by "Ashish Basran (JIRA)" <ji...@apache.org> on 2016/12/01 21:21:58 UTC, 11 replies.
- [jira] [Created] (TIKA-2188) Illegal SAXException when using cTAKESParser - posted by "Alan Simmons (JIRA)" <ji...@apache.org> on 2016/12/01 21:23:58 UTC, 0 replies.
- [jira] [Created] (TIKA-2189) Default value mismatch for "enableImageProcessing" in TesseractOCRConfig.properties and TesseractOCRConfig.java - posted by "Bipul Kumar (JIRA)" <ji...@apache.org> on 2016/12/02 13:17:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-2189) Default value mismatch for "enableImageProcessing" in TesseractOCRConfig.properties and TesseractOCRConfig.java - posted by "Bipul Kumar (JIRA)" <ji...@apache.org> on 2016/12/02 13:20:58 UTC, 3 replies.
- [GitHub] tika pull request #139: [TIKA-2189] fix for Default value mismatch for "enab... - posted by dasbipulkumar <gi...@git.apache.org> on 2016/12/02 13:28:58 UTC, 1 reply.
- [jira] [Commented] (TIKA-819) Make Option to Exclude Embedded Files' Text for Text Content - posted by "Björn Decker (JIRA)" <ji...@apache.org> on 2016/12/02 13:41:58 UTC, 0 replies.
- [jira] [Created] (TIKA-2190) Add "preserve_interword_spaces" option of tesseract - posted by "Bipul Kumar (JIRA)" <ji...@apache.org> on 2016/12/02 15:31:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-2190) Add "preserve_interword_spaces" option of tesseract - posted by "Bipul Kumar (JIRA)" <ji...@apache.org> on 2016/12/02 15:33:58 UTC, 8 replies.
- [jira] [Comment Edited] (TIKA-2180) Multiple requests on Tika to extract text slows down - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/02 20:36:59 UTC, 0 replies.
- Re: [ANNOUNCE] Welcome Luis Filipe Nassif and Thamme Gowda as Apache Tika PMC members and committers - posted by Tyler Bui-Palsulich <tb...@gmail.com> on 2016/12/04 20:18:43 UTC, 0 replies.
- [jira] [Created] (TIKA-2191) Apply current .docx unit tests to experimental SAX parser and fix or document as necessary - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/05 12:11:58 UTC, 0 replies.
- [GitHub] tika pull request #140: update - posted by xexes <gi...@git.apache.org> on 2016/12/05 15:30:41 UTC, 1 reply.
- [jira] [Updated] (TIKA-2192) Extract embedded files from headers, footers, footnotes, etc from docx/m - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/06 01:55:58 UTC, 0 replies.
- [jira] [Created] (TIKA-2192) Extract embedded files from headers, footers, footnotes, etc from docx - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/06 01:55:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-2191) Apply current .docx unit tests to experimental SAX parser and fix or document as necessary - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/06 14:19:58 UTC, 1 reply.
- [jira] [Commented] (TIKA-2191) Apply current .docx unit tests to experimental SAX parser and fix or document as necessary - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/06 14:22:58 UTC, 7 replies.
- [jira] [Commented] (TIKA-2192) Extract embedded files from headers, footers, footnotes, etc from docx/m - posted by "Hudson (JIRA)" <ji...@apache.org> on 2016/12/06 14:47:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-2188) Illegal SAXException when using cTAKESParser (Docker configuration) - posted by "Alan Simmons (JIRA)" <ji...@apache.org> on 2016/12/07 21:02:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-2188) Illegal SAXException when using cTAKESParser (in Docker container) - posted by "Alan Simmons (JIRA)" <ji...@apache.org> on 2016/12/07 21:03:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-2188) Illegal SAXException when using cTAKESParser (in Docker container) - posted by "Alan Simmons (JIRA)" <ji...@apache.org> on 2016/12/07 21:03:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-2180) Multiple requests on Tika to extract text slows down - posted by "Ashish Basran (JIRA)" <ji...@apache.org> on 2016/12/08 01:49:58 UTC, 3 replies.
- [jira] [Created] (TIKA-2193) java.io.NotSerializableException while using ForkParser - posted by "Michal Hlavac (JIRA)" <ji...@apache.org> on 2016/12/09 21:06:58 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2193) java.io.NotSerializableException while using ForkParser - posted by "Michal Hlavac (JIRA)" <ji...@apache.org> on 2016/12/09 21:14:58 UTC, 0 replies.
- [jira] [Closed] (TIKA-2193) java.io.NotSerializableException while using ForkParser - posted by "Michal Hlavac (JIRA)" <ji...@apache.org> on 2016/12/09 21:40:59 UTC, 0 replies.
- [jira] [Created] (TIKA-2194) matlab files detected as 'text/plain' - posted by "Mihai Glont (JIRA)" <ji...@apache.org> on 2016/12/11 16:05:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-2194) matlab files detected as 'text/plain' - posted by "Mihai Glont (JIRA)" <ji...@apache.org> on 2016/12/11 17:49:58 UTC, 2 replies.
- [jira] [Commented] (TIKA-2099) Tar files without magic bytes are sporadically detected as text - posted by "Robin Schimpf (JIRA)" <ji...@apache.org> on 2016/12/12 07:54:58 UTC, 0 replies.
- RE: got docx? - posted by "Allison, Timothy B." <ta...@mitre.org> on 2016/12/12 14:57:55 UTC, 0 replies.
- [jira] [Created] (TIKA-2195) Consolidate MockParser's service loading file and custom-mimetype entry in tika-core - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/13 01:17:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-2195) Consolidate MockParser's service loading file and custom-mimetype entry into tika-core's tests jar - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/13 01:29:58 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2195) Consolidate MockParser's service loading file and custom-mimetype entry into tika-core's tests jar - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/13 01:29:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-2195) Consolidate MockParser's service loading file and custom-mimetype entry into tika-core's tests jar - posted by "Hudson (JIRA)" <ji...@apache.org> on 2016/12/13 02:06:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-2173) Add extractInlineImages to PDFParser to enable parameter setting via config - posted by "Hudson (JIRA)" <ji...@apache.org> on 2016/12/13 04:02:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-2196) IllegalArgumentException on a valid Excel file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 17:19:58 UTC, 0 replies.
- [jira] [Created] (TIKA-2196) IllegalArgumentException on a valid Excel file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 17:19:58 UTC, 0 replies.
- [jira] [Created] (TIKA-2197) TikaException from invalid URL in an Excel document - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 17:32:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-2197) TikaException from invalid URL in an Excel document - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 17:32:58 UTC, 1 reply.
- [jira] [Created] (TIKA-2198) NullPointerException on a valid Word file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 17:51:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-2198) NullPointerException on a valid Word file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 17:52:59 UTC, 0 replies.
- [jira] [Updated] (TIKA-2199) RecordFormatException on a valid Excel file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 18:15:58 UTC, 0 replies.
- [jira] [Created] (TIKA-2199) RecordFormatException on a valid Excel file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 18:15:58 UTC, 0 replies.
- [jira] [Created] (TIKA-2200) XML schema mismatch error on a valid Word document - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 18:27:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-2200) XML schema mismatch error on a valid Word document - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 18:27:58 UTC, 0 replies.
- [jira] [Created] (TIKA-2201) OutOfMemoryError on a reasonably sized document - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 19:28:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-2201) OutOfMemoryError on a reasonably sized document - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 19:34:59 UTC, 0 replies.
- [jira] [Created] (TIKA-2202) StringIndexOutOfBoundsException on a valid Word document - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 19:39:13 UTC, 0 replies.
- [jira] [Updated] (TIKA-2202) StringIndexOutOfBoundsException on a valid Word document - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 19:39:16 UTC, 1 reply.
- [jira] [Updated] (TIKA-2203) InvalidOperationException on a valid Word file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 20:01:58 UTC, 0 replies.
- [jira] [Created] (TIKA-2203) InvalidOperationException on a valid Word file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 20:01:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-2204) IndexOutOfBoundsException on a valid Powerpoint file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 21:04:58 UTC, 0 replies.
- [jira] [Created] (TIKA-2204) IndexOutOfBoundsException on a valid Powerpoint file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 21:04:58 UTC, 0 replies.
- [jira] [Created] (TIKA-2205) IllegalArgumentException on a valid Excel file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 21:22:59 UTC, 0 replies.
- [jira] [Updated] (TIKA-2205) IllegalArgumentException on a valid Excel file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 21:23:58 UTC, 1 reply.
- [jira] [Created] (TIKA-2206) RecordFormatException on a valid Excel file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 21:44:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-2206) RecordFormatException on a valid Excel file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 21:45:59 UTC, 0 replies.
- [jira] [Created] (TIKA-2207) ArrayIndexOutOfBoundsException on a valid Excel file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 21:55:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-2207) ArrayIndexOutOfBoundsException on a valid Excel file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/13 21:55:58 UTC, 0 replies.
- [jira] [Created] (TIKA-2208) Catch missing libraires - posted by "David Pilato (JIRA)" <ji...@apache.org> on 2016/12/14 08:29:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-2208) Catch missing libraires - posted by "Ryan Ernst (JIRA)" <ji...@apache.org> on 2016/12/14 08:36:58 UTC, 26 replies.
- [jira] [Resolved] (TIKA-2202) StringIndexOutOfBoundsException on a valid Word document - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/14 12:02:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-2201) OutOfMemoryError on a reasonably sized document - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/14 12:13:58 UTC, 3 replies.
- [jira] [Comment Edited] (TIKA-2208) Catch missing libraires - posted by "David Pilato (JIRA)" <ji...@apache.org> on 2016/12/16 08:16:58 UTC, 6 replies.
- [jira] [Closed] (TIKA-1352) Upgrade to PDFBox 1.8.6 - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 15:57:59 UTC, 0 replies.
- [jira] [Closed] (TIKA-446) Upgrade to PDFBox 1.3.1 - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 15:58:58 UTC, 0 replies.
- [jira] [Closed] (TIKA-810) Upgrade to PDFbox 1.7.0 as available - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 15:58:58 UTC, 0 replies.
- [jira] [Created] (TIKA-2209) Update PDFBox to 2.0.4 - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 16:01:01 UTC, 0 replies.
- [jira] [Closed] (TIKA-1830) Upgrade to PDFBox 1.8.11 when available - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 16:04:59 UTC, 0 replies.
- [jira] [Closed] (TIKA-1442) Upgrade to PDFBox 1.8.8 - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 16:05:58 UTC, 0 replies.
- [jira] [Closed] (TIKA-1575) Upgrade to PDFBox 1.8.9 when available - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 16:05:58 UTC, 0 replies.
- [jira] [Closed] (TIKA-1285) Upgrade to PDFBox 2.0.0 when available - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 16:05:58 UTC, 0 replies.
- [jira] [Closed] (TIKA-1588) Upgrade to PDFBox 1.8.10 when available - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 16:06:59 UTC, 0 replies.
- [jira] [Closed] (TIKA-1290) Upgrade to PDFBOX 1.8.5 - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 16:06:59 UTC, 0 replies.
- [jira] [Closed] (TIKA-2082) Upgrade to PDFBox 2.0.3 - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 16:06:59 UTC, 0 replies.
- [jira] [Closed] (TIKA-1104) Upgrade to PDFBox 1.8.1 - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 16:06:59 UTC, 0 replies.
- [jira] [Closed] (TIKA-1049) Upgrade to PDFBox 1.7.1 - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 16:06:59 UTC, 0 replies.
- [jira] [Closed] (TIKA-393) Upgrade to PDFBOX 1.1.0 - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 16:06:59 UTC, 0 replies.
- [jira] [Closed] (TIKA-2051) Upgrade to PDFBox 2.0.3 when available - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 16:07:58 UTC, 0 replies.
- [jira] [Closed] (TIKA-1959) Upgrade to PDFBox 2.0.1/JempBox 1.8.12 - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 16:07:58 UTC, 0 replies.
- [jira] [Closed] (TIKA-380) Upgrade to PDFBox 1.0.0 - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 16:07:58 UTC, 0 replies.
- [jira] [Closed] (TIKA-1996) Upgrade to PDFBox 2.0.2 when available - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 16:07:58 UTC, 0 replies.
- [jira] [Closed] (TIKA-1419) Upgrade to PDFBox 1.8.7 - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 16:07:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-2209) Update PDFBox to 2.0.4 - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 16:08:58 UTC, 1 reply.
- [jira] [Resolved] (TIKA-2209) Update PDFBox to 2.0.4 - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 17:55:58 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2117) NullPointerException on PDF (fixed in PDFBox) - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 17:56:59 UTC, 0 replies.
- [jira] [Commented] (TIKA-2209) Update PDFBox to 2.0.4 - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2016/12/16 17:56:59 UTC, 0 replies.
- tika-2.x-windows - Build # 82 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/12/16 18:22:19 UTC, 0 replies.
- [jira] [Created] (TIKA-2210) Add experimental SAX/Streaming XSLF/pptx extractor - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/17 00:42:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-2210) Add experimental SAX/Streaming XSLF/pptx extractor - posted by "Hudson (JIRA)" <ji...@apache.org> on 2016/12/17 01:47:58 UTC, 0 replies.
- Fwd: Action recommended: Migrate Microsoft Translator API to Azure—limited access via Azure DataMarket starting January 1, 2017 - posted by lewis john mcgibbney <le...@apache.org> on 2016/12/17 03:12:02 UTC, 0 replies.
- [jira] [Created] (TIKA-2211) ePub formatting instructions appear in plain text output - posted by "Adam Carroll (JIRA)" <ji...@apache.org> on 2016/12/18 15:09:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-2211) ePub formatting instructions appear in plain text output - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/19 11:55:58 UTC, 6 replies.
- [jira] [Created] (TIKA-2212) Add mime for .potm to OOXMLParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/19 13:02:59 UTC, 0 replies.
- [jira] [Commented] (TIKA-2212) Add mime for .potm to OOXMLParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/19 13:07:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-2212) Update mimes for OOXMLParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/19 13:08:58 UTC, 1 reply.
- [jira] [Comment Edited] (TIKA-2212) Update mimes for OOXMLParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/19 13:15:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-2212) Update mimes for OOXMLParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/19 13:16:58 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2107) Old MS Word files give error while indexing - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/19 13:41:58 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2094) Error parsing .doc file with visio embed - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/19 13:42:58 UTC, 0 replies.
- [jira] [Created] (TIKA-2213) ArrayIndexOutOfBoundsException on a valid Word file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/19 15:42:01 UTC, 0 replies.
- [jira] [Updated] (TIKA-2213) ArrayIndexOutOfBoundsException on a valid Word file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/19 15:42:58 UTC, 0 replies.
- [jira] [Created] (TIKA-2214) ArrayIndexOutOfBoundsException on a valid Word file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/19 15:49:59 UTC, 0 replies.
- [jira] [Updated] (TIKA-2214) ArrayIndexOutOfBoundsException on a valid Word file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/19 15:49:59 UTC, 0 replies.
- [jira] [Updated] (TIKA-2215) TikaException about "Invalid embedded resource" on a valid PPT file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/19 15:56:58 UTC, 0 replies.
- [jira] [Created] (TIKA-2215) TikaException about "Invalid embedded resource" on a valid PPT file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/19 15:56:58 UTC, 0 replies.
- [jira] [Created] (TIKA-2216) ArrayIndexOutOfBoundsException on a valid Word file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/19 16:09:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-2216) ArrayIndexOutOfBoundsException on a valid Word file - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/19 16:10:59 UTC, 2 replies.
- [jira] [Created] (TIKA-2217) RuntimeException on a PPT with a movie - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/19 16:21:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-2217) RuntimeException on a PPT with a movie - posted by "Seva Alekseyev (JIRA)" <ji...@apache.org> on 2016/12/19 16:23:58 UTC, 1 reply.
- [jira] [Created] (TIKA-2218) Add a few more places where PPTX relationships might include an attachment - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/19 21:00:59 UTC, 0 replies.
- tika-2.x-windows - Build # 83 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/12/19 21:18:53 UTC, 0 replies.
- [jira] [Commented] (TIKA-2218) Add a few more places where PPTX relationships might include an attachment - posted by "Hudson (JIRA)" <ji...@apache.org> on 2016/12/19 21:18:58 UTC, 2 replies.
- [jira] [Resolved] (TIKA-2218) Add a few more places where PPTX relationships might include an attachment - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/19 21:38:58 UTC, 0 replies.
- [jira] [Created] (TIKA-2219) CharsetDetector no longer detects windows-1252 charset - posted by "Pascal Essiembre (JIRA)" <ji...@apache.org> on 2016/12/19 22:12:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-2219) CharsetDetector no longer detects windows-1252 charset - posted by "Pascal Essiembre (JIRA)" <ji...@apache.org> on 2016/12/19 22:17:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-2219) CharsetDetector no longer detects windows-1252 charset - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/20 14:46:58 UTC, 10 replies.
- [jira] [Created] (TIKA-2220) Refactor/merge new experimental docx/pptx components - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/20 15:03:58 UTC, 0 replies.
- Re: Apache Tika issue review (TIKA-2190 & TIKA-2189) - posted by Chris Mattmann <ma...@apache.org> on 2016/12/20 15:22:16 UTC, 0 replies.
- [jira] [Created] (TIKA-2221) poi.EncryptedDocumentException not wrapped in tika.exception.EncryptedDocumentException - posted by "Matthew Caruana Galizia (JIRA)" <ji...@apache.org> on 2016/12/20 17:04:58 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2220) Refactor/merge new experimental docx/pptx components - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/20 18:16:58 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2221) poi.EncryptedDocumentException not wrapped in tika.exception.EncryptedDocumentException - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/20 18:30:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-2220) Refactor/merge new experimental docx/pptx components - posted by "Hudson (JIRA)" <ji...@apache.org> on 2016/12/20 19:03:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-2221) poi.EncryptedDocumentException not wrapped in tika.exception.EncryptedDocumentException - posted by "Hudson (JIRA)" <ji...@apache.org> on 2016/12/20 19:03:58 UTC, 2 replies.
- tika-2.x-windows - Build # 84 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/12/20 19:19:09 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2219) CharsetDetector no longer detects windows-1252 charset - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/20 19:26:58 UTC, 0 replies.
- tika-2.x - Build # 183 - Failure - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/12/20 19:49:53 UTC, 0 replies.
- tika-2.x-windows - Build # 85 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/12/20 20:19:51 UTC, 0 replies.
- [jira] [Assigned] (TIKA-2190) Add "preserve_interword_spaces" option of tesseract - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/20 20:29:58 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2190) Add "preserve_interword_spaces" option of tesseract - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/20 21:37:58 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2189) Default value mismatch for "enableImageProcessing" in TesseractOCRConfig.properties and TesseractOCRConfig.java - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/20 21:41:58 UTC, 0 replies.
- [GitHub] tika pull request #141: New WordPerfect and QuattroPro parsers for TIKA-1946... - posted by essiembre <gi...@git.apache.org> on 2016/12/20 21:44:53 UTC, 1 reply.
- [jira] [Commented] (TIKA-1946) Add mime detection and parser for WordPerfect - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2016/12/20 21:44:58 UTC, 49 replies.
- [jira] [Comment Edited] (TIKA-1946) Add mime detection and parser for WordPerfect - posted by "Pascal Essiembre (JIRA)" <ji...@apache.org> on 2016/12/20 21:51:58 UTC, 9 replies.
- [jira] [Comment Edited] (TIKA-2219) CharsetDetector no longer detects windows-1252 charset - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/20 21:51:58 UTC, 0 replies.
- tika-2.x-windows - Build # 86 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/12/20 22:19:26 UTC, 0 replies.
- [jira] [Closed] (TIKA-2094) Error parsing .doc file with visio embed - posted by "wangruochan (JIRA)" <ji...@apache.org> on 2016/12/21 03:03:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-1788) message/rfc822 parser doesn't identify attachment filenames from Content-Disposition header - posted by "Derek Hardison (JIRA)" <ji...@apache.org> on 2016/12/21 04:22:58 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-1788) message/rfc822 parser doesn't identify attachment filenames from Content-Disposition header - posted by "Derek Hardison (JIRA)" <ji...@apache.org> on 2016/12/21 04:24:58 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-2211) ePub formatting instructions appear in plain text output - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/21 14:05:58 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2211) ePub formatting instructions appear in plain text output - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/21 14:11:58 UTC, 0 replies.
- tika-2.x-windows - Build # 87 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/12/21 14:19:40 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1946) Add mime detection and parser for WordPerfect - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/21 17:31:58 UTC, 1 reply.
- [jira] [Created] (TIKA-2222) Contributing a XFDL Parser - posted by "Pascal Essiembre (JIRA)" <ji...@apache.org> on 2016/12/21 18:10:00 UTC, 0 replies.
- tika-2.x-windows - Build # 88 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/12/21 18:18:50 UTC, 0 replies.
- [jira] [Created] (TIKA-2223) Extra ß characters in some WordPerfect files - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/21 18:21:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-2223) Extra ß characters in some WordPerfect files - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/21 18:22:58 UTC, 2 replies.
- [jira] [Comment Edited] (TIKA-2223) Extra ß characters in some WordPerfect files - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/21 18:24:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-2223) Extra ß characters in some WordPerfect files - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/21 18:24:58 UTC, 0 replies.
- [jira] [Reopened] (TIKA-1946) Add mime detection and parser for WordPerfect - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/21 20:34:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-1946) Add mime detection and parser for WordPerfect - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/21 21:47:58 UTC, 2 replies.
- [jira] [Created] (TIKA-2224) Mime magic for OneNote formats - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2016/12/22 01:02:43 UTC, 0 replies.
- [jira] [Commented] (TIKA-2224) Mime magic for OneNote formats - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2016/12/22 01:32:58 UTC, 8 replies.
- [jira] [Created] (TIKA-2225) Parse DOCX file due to NullPointerException on POI code - posted by "Jorge Spinsanti (JIRA)" <ji...@apache.org> on 2016/12/22 13:23:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-2225) Parse DOCX file due to NullPointerException on POI code - posted by "Jorge Spinsanti (JIRA)" <ji...@apache.org> on 2016/12/22 13:24:58 UTC, 0 replies.
- [jira] [Created] (TIKA-2226) Add UnsupportedFormatException (extends TikaException) - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/22 14:09:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-2226) Add UnsupportedFormatException (extends TikaException) - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/22 14:12:58 UTC, 1 reply.
- [jira] [Resolved] (TIKA-2226) Add UnsupportedFormatException (extends TikaException) - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/22 16:34:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-2224) Mime magic for OneNote formats - posted by "Krishnan Narayan (JIRA)" <ji...@apache.org> on 2016/12/22 16:51:58 UTC, 1 reply.
- tika-2.x-windows - Build # 89 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/12/22 17:18:55 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-2224) Mime magic for OneNote formats - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/12/22 18:19:58 UTC, 0 replies.
- [jira] [Created] (TIKA-2227) Replacement of MSOffice#KEYWORDS for RTF and ODT docs - posted by "David Pilato (JIRA)" <ji...@apache.org> on 2016/12/22 20:21:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-2227) Replacement of MSOffice#KEYWORDS for RTF and ODT docs - posted by "David Pilato (JIRA)" <ji...@apache.org> on 2016/12/22 20:35:58 UTC, 0 replies.
- [jira] [Closed] (TIKA-2227) Replacement of MSOffice#KEYWORDS for RTF and ODT docs - posted by "David Pilato (JIRA)" <ji...@apache.org> on 2016/12/22 20:36:58 UTC, 0 replies.
- tika-2.x-windows - Build # 90 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/12/23 04:18:48 UTC, 0 replies.
- [jira] [Created] (TIKA-2228) WordPerfect parser update to support 5.x - posted by "Pascal Essiembre (JIRA)" <ji...@apache.org> on 2016/12/23 19:56:58 UTC, 0 replies.
- [GitHub] tika pull request #142: Update to WordPerfect parser to support 5.x for TIKA... - posted by essiembre <gi...@git.apache.org> on 2016/12/23 20:02:34 UTC, 0 replies.
- [jira] [Commented] (TIKA-2228) WordPerfect parser update to support 5.x - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2016/12/23 20:02:58 UTC, 0 replies.
- [GitHub] tika pull request #143: New XFDL parser for TIKA-2222 contributed by pascal.... - posted by essiembre <gi...@git.apache.org> on 2016/12/24 04:00:36 UTC, 0 replies.
- [jira] [Commented] (TIKA-2222) Contributing a XFDL Parser - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2016/12/24 04:01:01 UTC, 0 replies.
- [jira] [Commented] (TIKA-1804) Tika use no free json.org - posted by "Thamme Gowda (JIRA)" <ji...@apache.org> on 2016/12/25 10:42:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-2094) Error parsing .doc file with visio embed - posted by "Jorge Spinsanti (JIRA)" <ji...@apache.org> on 2016/12/26 18:39:58 UTC, 0 replies.
- [jira] [Commented] (TIKA-2091) regression: Zip bomb detected! for HTML file - posted by "Varun Thacker (JIRA)" <ji...@apache.org> on 2016/12/27 23:16:58 UTC, 0 replies.