You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] [Commented] (TIKA-1191) ForkParser / ClassLoaderProxy does not define package - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/01/01 21:40:03 UTC, 5 replies.
- [jira] [Commented] (TIKA-2462) Add a parser for sas7bdat - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2018/01/02 17:52:00 UTC, 0 replies.
- Re: Not-yet-broken breaking changes for Tika 2? - posted by Nick Burch <ap...@gagravarr.org> on 2018/01/02 19:20:20 UTC, 0 replies.
- [jira] [Updated] (TIKA-2509) TesseractOCRParser ignores configured ImageMagickPath in processImage method - posted by "Richard Jones (JIRA)" <ji...@apache.org> on 2018/01/03 13:22:03 UTC, 0 replies.
- [jira] [Created] (TIKA-2534) Current juniversalchardet version is EOL - posted by "Richard Jones (JIRA)" <ji...@apache.org> on 2018/01/03 13:44:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2535) Use latest org.opengis:geoapi to avoid rejected/EOL'd jsr-275 dependency - posted by "Richard Jones (JIRA)" <ji...@apache.org> on 2018/01/03 15:03:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2536) Move to later edu.ucar.grib version to avoid EOL bzip2 dependency - posted by "Richard Jones (JIRA)" <ji...@apache.org> on 2018/01/03 15:56:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2535) Use latest org.opengis:geoapi to avoid rejected/EOL'd jsr-275 dependency - posted by "Richard Jones (JIRA)" <ji...@apache.org> on 2018/01/03 15:58:01 UTC, 0 replies.
- [jira] [Created] (TIKA-2537) com.googlecode.json-simple is EOL, moved to github - posted by "Richard Jones (JIRA)" <ji...@apache.org> on 2018/01/04 15:52:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2538) com.github.junrar is EOL, many forks exist - posted by "Richard Jones (JIRA)" <ji...@apache.org> on 2018/01/04 16:12:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2538) com.github.junrar is EOL, many forks exist - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2018/01/04 16:31:00 UTC, 3 replies.
- [jira] [Created] (TIKA-2539) TagSoup HTML parser is project EOL - posted by "Richard Jones (JIRA)" <ji...@apache.org> on 2018/01/05 09:14:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2539) TagSoup HTML parser is project EOL - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2018/01/05 14:19:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2536) Move to later edu.ucar version to avoid EOL dependencies - posted by "Richard Jones (JIRA)" <ji...@apache.org> on 2018/01/05 15:39:00 UTC, 3 replies.
- [jira] [Created] (TIKA-2540) Referenced version of org.ow2.asm:asm is branch EOL - posted by "Richard Jones (JIRA)" <ji...@apache.org> on 2018/01/05 15:44:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2541) Referenced version of Apache SIS (org.apache.sis) is branch EOL - posted by "Richard Jones (JIRA)" <ji...@apache.org> on 2018/01/05 16:07:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2542) Support in tika-server for getting plain text and metadata at the same time - posted by "Manolo Caracuel (JIRA)" <ji...@apache.org> on 2018/01/05 22:12:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2542) Support in tika-server for getting plain text and metadata at the same time - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/01/06 02:03:00 UTC, 5 replies.
- [jira] [Issue Comment Deleted] (TIKA-2542) Support in tika-server for getting plain text and metadata at the same time - posted by "Manolo Caracuel (JIRA)" <ji...@apache.org> on 2018/01/06 02:05:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2543) No content extraction for application/x-webarchive format - posted by "Rafael Ferreira (JIRA)" <ji...@apache.org> on 2018/01/07 06:17:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2544) Docx Numbering Issue - posted by "Manish (JIRA)" <ji...@apache.org> on 2018/01/08 07:34:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2543) No content extraction for application/x-webarchive format - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2018/01/08 10:20:00 UTC, 1 replies.
- [jira] [Commented] (TIKA-2196) IllegalArgumentException on a valid Excel file - posted by "Vinay Kawade (JIRA)" <ji...@apache.org> on 2018/01/09 19:36:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2196) IllegalArgumentException on a valid Excel file - posted by "Vinay Kawade (JIRA)" <ji...@apache.org> on 2018/01/09 19:38:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2545) RereadableInputStream backing byte array not constructed properly - posted by "Eugene Hart (JIRA)" <ji...@apache.org> on 2018/01/10 01:44:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2545) RereadableInputStream backing byte array not constructed properly - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2018/01/10 06:44:01 UTC, 1 replies.
- [jira] [Resolved] (TIKA-1191) ForkParser / ClassLoaderProxy does not define package - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2018/01/11 06:49:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2546) com.pff:java-libpst is branch EOL - posted by "Richard Jones (JIRA)" <ji...@apache.org> on 2018/01/12 11:28:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2546) com.pff:java-libpst is branch EOL - posted by "Luis Filipe Nassif (JIRA)" <ji...@apache.org> on 2018/01/12 14:14:02 UTC, 0 replies.
- [jira] [Created] (TIKA-2547) RFC822 w multipart/mixed first text element should be treated as body, not attachment - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/01/12 19:55:00 UTC, 0 replies.
- [jira] [Assigned] (TIKA-2509) TesseractOCRParser ignores configured ImageMagickPath in processImage method - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2018/01/15 11:44:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2509) TesseractOCRParser ignores configured ImageMagickPath in processImage method - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2018/01/15 11:47:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2548) Add Python Path configuration to TesseractOCRParser - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2018/01/15 11:49:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2509) TesseractOCRParser ignores configured ImageMagickPath in processImage method - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2018/01/15 11:51:01 UTC, 0 replies.
- [jira] [Created] (TIKA-2549) NoSuchMethodException "CTPictureBaseImpl.(org.apache.xmlbeans.SchemaType, boolean)" parsing certain .docx files - posted by "Adam Rauch (JIRA)" <ji...@apache.org> on 2018/01/16 04:12:02 UTC, 0 replies.
- [jira] [Commented] (TIKA-2549) NoSuchMethodException "CTPictureBaseImpl.(org.apache.xmlbeans.SchemaType, boolean)" parsing certain .docx files - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2018/01/16 06:16:00 UTC, 3 replies.
- [jira] [Updated] (TIKA-2549) NoSuchMethodException "CTPictureBaseImpl.(org.apache.xmlbeans.SchemaType, boolean)" parsing certain .docx files - posted by "Adam Rauch (JIRA)" <ji...@apache.org> on 2018/01/16 07:12:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2543) No content extraction for application/x-webarchive format - posted by "Rafael Ferreira (JIRA)" <ji...@apache.org> on 2018/01/22 02:37:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-1961) OutOfMemory when parsing shapes xml from xlsx files with multi-byte Unicode characters - posted by "Andrei Rebegea (JIRA)" <ji...@apache.org> on 2018/01/22 10:40:00 UTC, 2 replies.
- [jira] [Comment Edited] (TIKA-1961) OutOfMemory when parsing shapes xml from xlsx files with multi-byte Unicode characters - posted by "Andrei Rebegea (JIRA)" <ji...@apache.org> on 2018/01/22 10:41:00 UTC, 2 replies.
- [jira] [Reopened] (TIKA-1961) OutOfMemory when parsing shapes xml from xlsx files with multi-byte Unicode characters - posted by "Andrei Rebegea (JIRA)" <ji...@apache.org> on 2018/01/22 10:48:00 UTC, 0 replies.
- [jira] [Assigned] (TIKA-1961) OutOfMemory when parsing shapes xml from xlsx files with multi-byte Unicode characters - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/01/22 14:17:13 UTC, 0 replies.
- [jira] [Created] (TIKA-2550) ToTextHandler includes element content - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/01/22 20:15:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2551) TIka Server uses HtmlParser for XML no matter what config is given, even if XML is disabled in Config - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2018/01/23 16:13:00 UTC, 0 replies.
- [jira] [Assigned] (TIKA-2535) Use latest org.opengis:geoapi to avoid rejected/EOL'd jsr-275 dependency - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/01/23 16:15:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2535) Use latest org.opengis:geoapi to avoid rejected/EOL'd jsr-275 dependency - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/01/23 16:22:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-2535) Use latest org.opengis:geoapi to avoid rejected/EOL'd jsr-275 dependency - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/01/23 16:25:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2551) TIka Server uses HtmlParser for XML no matter what config is given, even if XML is disabled in Config - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/01/23 16:28:00 UTC, 2 replies.
- [jira] [Commented] (TIKA-2536) Move to later edu.ucar version to avoid EOL dependencies - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/01/23 17:09:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2541) Referenced version of Apache SIS (org.apache.sis) is branch EOL - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/01/23 17:10:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2552) Upgrade to POI 4.0.0 when available - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/01/23 19:41:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2553) Upgrade compiler definition to Java 8 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/01/23 19:43:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2553) Upgrade compiler definition to Java 8 - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/01/23 21:48:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2535) Use latest org.opengis:geoapi to avoid rejected/EOL'd jsr-275 dependency - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/01/23 21:48:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2527) Typos in tika-mimetypes.xml - posted by "Andreas Meier (JIRA)" <ji...@apache.org> on 2018/01/24 12:24:00 UTC, 3 replies.
- [jira] [Commented] (TIKA-2527) Typos in tika-mimetypes.xml - posted by "Andreas Meier (JIRA)" <ji...@apache.org> on 2018/01/24 12:45:04 UTC, 3 replies.
- [jira] [Commented] (TIKA-1974) Tika 2.0 - remove deprecated metadata properties - posted by "Tim Allison (JIRATEST)" <ji...@apache.org> on 2018/01/24 14:39:00 UTC, 2 replies.
- [jira] [Comment Edited] (TIKA-1974) Tika 2.0 - remove deprecated metadata properties - posted by "Tim Allison (JIRATEST)" <ji...@apache.org> on 2018/01/24 14:47:00 UTC, 1 replies.
- [jira] [Created] (TIKA-2554) Subtypes for common text formats currently included in text/plain - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2018/01/25 14:04:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2554) Subtypes for common text formats currently included in text/plain - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2018/01/25 15:20:00 UTC, 1 replies.
- [jira] [Created] (TIKA-2555) Text with [underline] + [another format] in word document generates overlapping html tags. - posted by "Serban Alexe (JIRA)" <ji...@apache.org> on 2018/01/25 17:08:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2556) org.json package clash - posted by "Andrei Rebegea (JIRA)" <ji...@apache.org> on 2018/01/26 10:26:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2556) org.json package clash - posted by "Andrei Rebegea (JIRA)" <ji...@apache.org> on 2018/01/26 10:29:00 UTC, 1 replies.
- [jira] [Created] (TIKA-2557) .mbox detected as text/html - posted by "Andreas Meier (JIRA)" <ji...@apache.org> on 2018/01/26 13:23:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2556) org.json package clash - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2018/01/26 15:28:00 UTC, 5 replies.
- [jira] [Comment Edited] (TIKA-2556) org.json package clash - posted by "Andrei Rebegea (JIRA)" <ji...@apache.org> on 2018/01/26 16:03:00 UTC, 0 replies.
- ***UNCHECKED*** [jira] [Commented] (TIKA-1974) Tika 2.0 - remove deprecated metadata properties - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/01/26 18:43:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-1974) Tika 2.0 - remove deprecated metadata properties - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/01/26 18:44:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2551) TIka Server uses HtmlParser for XML no matter what config is given, even if XML is disabled in Config - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/01/26 18:55:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2558) Add a new pid api to Tika - posted by "Stefan Sveen (JIRA)" <ji...@apache.org> on 2018/01/30 09:16:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2558) Add a new pid api to Tika - posted by "Stefan Sveen (JIRA)" <ji...@apache.org> on 2018/01/30 13:57:00 UTC, 1 replies.
- [jira] [Closed] (TIKA-1961) OutOfMemory when parsing shapes xml from xlsx files with multi-byte Unicode characters - posted by "Andrei Rebegea (JIRA)" <ji...@apache.org> on 2018/01/30 16:49:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2559) Expose language metadata from PDF documents - posted by "Matt Sheppard (JIRA)" <ji...@apache.org> on 2018/01/31 02:30:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2559) Expose language metadata from PDF documents - posted by "Matt Sheppard (JIRA)" <ji...@apache.org> on 2018/01/31 02:36:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-2559) Expose language metadata from PDF documents - posted by "Matt Sheppard (JIRA)" <ji...@apache.org> on 2018/01/31 02:37:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2559) Expose language metadata from PDF documents - posted by "Matt Sheppard (JIRA)" <ji...@apache.org> on 2018/01/31 02:37:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2560) doclint-java8-disable profile with liferay - posted by "Fabio (JIRA)" <ji...@apache.org> on 2018/01/31 09:38:00 UTC, 0 replies.
- [jira] [Closed] (TIKA-2560) doclint-java8-disable profile with liferay - posted by "Fabio (JIRA)" <ji...@apache.org> on 2018/01/31 10:09:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1974) Tika 2.0 - remove deprecated metadata properties - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/01/31 14:44:01 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2556) org.json package clash - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/01/31 14:54:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2547) RFC822 w multipart/mixed first text element should be treated as body, not attachment - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/01/31 20:18:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2547) RFC822 w multipart/mixed first text element should be treated as body, not attachment - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/01/31 20:22:01 UTC, 0 replies.
- Use of java.util.logging in Tika - posted by Ken Krugler <kk...@transpac.com> on 2018/01/31 21:41:19 UTC, 0 replies.
- [jira] [Created] (TIKA-2561) Tika Parser includes oudated/vulnerable version of JSoup - posted by "Asela (JIRA)" <ji...@apache.org> on 2018/01/31 22:29:00 UTC, 0 replies.