You are viewing a plain text version of this content. The canonical link for it is here.
- [tika] branch master updated (8289d0b -> a8e0a54) - posted by dm...@apache.org on 2018/03/04 13:09:34 UTC, 0 replies.
- [tika] 01/02: TIKA-1518: Updated CHANGES file to include description - posted by dm...@apache.org on 2018/03/04 13:09:35 UTC, 0 replies.
- [tika] 02/02: Merge branch 'dameikle-TIKA-1518' - posted by dm...@apache.org on 2018/03/04 13:09:36 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-1518: Add local docker build based on dockerfile-maven-plugin - posted by dm...@apache.org on 2018/03/04 13:17:42 UTC, 0 replies.
- [tika] branch master updated: TIKA-1518: Updated the README and changed image name to tika-server for clarity - posted by dm...@apache.org on 2018/03/04 13:32:41 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-1518: Updated the README and changed image name to tika-server for clarity - posted by dm...@apache.org on 2018/03/04 13:34:31 UTC, 0 replies.
- [tika] branch master updated: TIKA-2598 -- add enforcerplugin to fail on dependency convergence problems, and fix dependency conflicts where possible. - posted by ta...@apache.org on 2018/03/06 20:18:24 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-2598 -- add enforcerplugin to fail on dependency convergence problems, and fix dependency conflicts where possible. - posted by ta...@apache.org on 2018/03/06 20:20:57 UTC, 0 replies.
- [tika] branch master updated: TIKA-2576 -- Upgrade commons compress and add detection and parsing of zstd (if user provides com.github.luben:zstd-jni... via Andreas Meier - posted by ta...@apache.org on 2018/03/06 20:50:17 UTC, 0 replies.
- [tika] branch master updated: TIKA-2598 -- unbreak the build (sorry!), fix problems after tika-app - posted by ta...@apache.org on 2018/03/07 00:52:48 UTC, 0 replies.
- [tika] branch branch_1x updated (4eb8ae1 -> cf0348d) - posted by ta...@apache.org on 2018/03/07 00:54:29 UTC, 0 replies.
- [tika] 01/02: TIKA-2576 -- Upgrade commons compress and add detection and parsing of zstd (if user provides com.github.luben:zstd-jni... via Andreas Meier - posted by ta...@apache.org on 2018/03/07 00:54:30 UTC, 0 replies.
- [tika] 02/02: TIKA-2598 -- unbreak the build (sorry!), fix problems after tika-app - posted by ta...@apache.org on 2018/03/07 00:54:31 UTC, 0 replies.
- [tika] branch master updated: TIKA-2598 -- unbreak the build (sorry, again!), fix missing javacpp dependency. - posted by ta...@apache.org on 2018/03/07 13:27:02 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-2598 -- unbreak the build (sorry, again!), fix missing javacpp dependency. - posted by ta...@apache.org on 2018/03/07 13:28:09 UTC, 0 replies.
- [tika] branch master updated: turn off debug in powerpointparsertest - posted by ta...@apache.org on 2018/03/07 13:45:58 UTC, 0 replies.
- [tika] branch master updated: TIKA-2600 -- remove md5 checksum, and switch sha-1 to sha-512 for release artifacts - posted by ta...@apache.org on 2018/03/07 14:43:11 UTC, 0 replies.
- [tika] branch master updated: TIKA-2594 -- improve eml detection for those starting with Subject: and containing html - posted by ta...@apache.org on 2018/03/07 18:14:31 UTC, 0 replies.
- [tika] branch master updated: TIKA-2592 -- ignore charsets not supported by IANA in html meta-headers via Andreas Meier. - posted by ta...@apache.org on 2018/03/07 18:47:50 UTC, 0 replies.
- [tika] branch master updated: TIKA-2591 -- Add workaround to identify TIFFs that might confuse commons-compress's tar detection via Daniel Schmidt - posted by ta...@apache.org on 2018/03/07 19:55:18 UTC, 0 replies.
- [tika] branch master updated (462ee47 -> cadeb1d) - posted by ta...@apache.org on 2018/03/07 19:59:21 UTC, 0 replies.
- [tika] 01/01: Merge pull request #225 from grigoriy/TIKA-2590 - posted by ta...@apache.org on 2018/03/07 19:59:22 UTC, 0 replies.
- [tika] branch master updated: TIKA-2590 update Changes.txt - posted by ta...@apache.org on 2018/03/07 20:09:15 UTC, 0 replies.
- [tika] branch branch_1x updated (8163b59 -> a9b4b36) - posted by ta...@apache.org on 2018/03/07 20:19:41 UTC, 0 replies.
- [tika] 01/06: turn off debug in powerpointparsertest - posted by ta...@apache.org on 2018/03/07 20:19:42 UTC, 0 replies.
- [tika] 02/06: TIKA-2600 -- remove md5 checksum, and switch sha-1 to sha-512 for release artifacts - posted by ta...@apache.org on 2018/03/07 20:19:43 UTC, 0 replies.
- [tika] 03/06: TIKA-2594 -- improve eml detection for those starting with Subject: and containing html - posted by ta...@apache.org on 2018/03/07 20:19:44 UTC, 0 replies.
- [tika] 04/06: TIKA-2592 -- ignore charsets not supported by IANA in html meta-headers via Andreas Meier. - posted by ta...@apache.org on 2018/03/07 20:19:45 UTC, 0 replies.
- [tika] 05/06: TIKA-2591 -- Add workaround to identify TIFFs that might confuse commons-compress's tar detection via Daniel Schmidt - posted by ta...@apache.org on 2018/03/07 20:19:46 UTC, 0 replies.
- [tika] 06/06: TIKA-2590 -- revert listenForAllRecords = false thanks to Grigoriy Alekseev - posted by ta...@apache.org on 2018/03/07 20:19:47 UTC, 0 replies.
- [tika] branch master updated: TIKA-2527 -- Various new mimes and typo fixes in tika-mimetypes.xml via Andreas Meier. - posted by ta...@apache.org on 2018/03/07 20:37:57 UTC, 0 replies.
- [tika] branch branch_1x updated (a9b4b36 -> 33f756f) - posted by ta...@apache.org on 2018/03/07 20:38:50 UTC, 0 replies.
- [tika] 01/02: TIKA-2590 update Changes.txt - posted by ta...@apache.org on 2018/03/07 20:38:51 UTC, 0 replies.
- [tika] 02/02: TIKA-2527 -- Various new mimes and typo fixes in tika-mimetypes.xml via Andreas Meier. - posted by ta...@apache.org on 2018/03/07 20:38:52 UTC, 0 replies.
- [tika] branch master updated: TIKA-2594 improve eml detection via Luis Filipe Nassif - posted by ta...@apache.org on 2018/03/07 20:43:29 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-2594 improve eml detection via Luis Filipe Nassif - posted by ta...@apache.org on 2018/03/07 20:44:05 UTC, 0 replies.
- [tika] branch master updated: TIKA-1518: Detach docker file build from build phase in Maven execution - posted by dm...@apache.org on 2018/03/07 21:15:38 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-1518: Detach docker file build from build phase in Maven execution - posted by dm...@apache.org on 2018/03/07 21:25:07 UTC, 0 replies.
- [tika] branch master updated: TIKA-1518 -- turn dockerfile-maven-plugin back on. Accidentally commented it out. Doh! - posted by ta...@apache.org on 2018/03/07 21:29:00 UTC, 0 replies.
- [tika] branch branch_1x updated (42aa774 -> c996d01) - posted by lf...@apache.org on 2018/03/08 11:30:43 UTC, 0 replies.
- [tika] branch master updated: TIKA-2568: detection of full encrypted 7z files - posted by lf...@apache.org on 2018/03/08 11:31:40 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-2338 support for tif in pdfs - posted by lf...@apache.org on 2018/03/08 12:14:25 UTC, 0 replies.
- [tika] branch master updated: TIKA-2338 support for tif in pdfs - posted by lf...@apache.org on 2018/03/08 12:15:19 UTC, 0 replies.
- [tika] branch master updated: TIKA-2338 -- fix imageio version conflict in tika-dl - posted by ta...@apache.org on 2018/03/08 18:57:46 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-2338 -- fix imageio version conflict in tika-dl - posted by ta...@apache.org on 2018/03/08 18:59:00 UTC, 0 replies.
- [tika] branch master updated: TIKA-2530 -- temporary workaround -- check for zero length byte array in rtf body to avoid buffer underflow from POI, via Pascal Essiembre. - posted by ta...@apache.org on 2018/03/08 19:10:46 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-2530 -- temporary workaround -- check for zero length byte array in rtf body to avoid buffer underflow from POI, via Pascal Essiembre. - posted by ta...@apache.org on 2018/03/08 19:11:47 UTC, 0 replies.
- [tika] branch master updated: TIKA-2591 -- prevent AIOOBE when haystack shorter than needle - posted by ta...@apache.org on 2018/03/08 19:56:26 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-2591 -- prevent AIOOBE when haystack shorter than needle - posted by ta...@apache.org on 2018/03/08 19:57:00 UTC, 0 replies.
- [tika] branch master updated: TIKA-2604 -- properly escape (or not) class path in windows and linux environments. - posted by ta...@apache.org on 2018/03/09 16:50:56 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-2604 -- properly escape (or not) class path in windows and linux environments. - posted by ta...@apache.org on 2018/03/09 16:51:39 UTC, 0 replies.
- [tika] branch multiple-parsers updated (bc8a75e -> 348bfb9) - posted by ni...@apache.org on 2018/03/13 18:15:23 UTC, 0 replies.
- [tika] 01/13: Name sample config files based on issue number - posted by ni...@apache.org on 2018/03/13 18:15:24 UTC, 0 replies.
- [tika] 02/13: Add TODOs for code to be shared/copied with other areas - posted by ni...@apache.org on 2018/03/13 18:15:25 UTC, 0 replies.
- [tika] 03/13: Ignore vim temp files - posted by ni...@apache.org on 2018/03/13 18:15:26 UTC, 0 replies.
- [tika] 04/13: Pull out deep Metadata clone to a utils method for re-use - posted by ni...@apache.org on 2018/03/13 18:15:27 UTC, 0 replies.
- [tika] 05/13: Prepare to track metadata between parsers - posted by ni...@apache.org on 2018/03/13 18:15:28 UTC, 0 replies.
- [tika] 06/13: Fix exception handling - posted by ni...@apache.org on 2018/03/13 18:15:29 UTC, 0 replies.
- [tika] 07/13: Pull common "Real Parser" identification logic out to utils - posted by ni...@apache.org on 2018/03/13 18:15:30 UTC, 0 replies.
- [tika] 08/13: Use utils for recording details of the parser used - posted by ni...@apache.org on 2018/03/13 18:15:31 UTC, 0 replies.
- [tika] 09/13: Move logic for recording embedded parser failures in the metadata to utils, and use for multiple parsers - posted by ni...@apache.org on 2018/03/13 18:15:32 UTC, 0 replies.
- [tika] 10/13: TODO updates, enforce allowed policies - posted by ni...@apache.org on 2018/03/13 18:15:33 UTC, 0 replies.
- [tika] 11/13: Bring over stream reset logic from ParserDecorator and update comments - posted by ni...@apache.org on 2018/03/13 18:15:34 UTC, 0 replies.
- [tika] 12/13: Implement some metadata policies for merging values from multiple parsers - posted by ni...@apache.org on 2018/03/13 18:15:35 UTC, 0 replies.
- [tika] 13/13: More metadata handling between parsers, start on unit testing - posted by ni...@apache.org on 2018/03/13 18:15:36 UTC, 0 replies.
- [tika] branch multiple-parsers updated (348bfb9 -> 6a39214) - posted by ni...@apache.org on 2018/03/14 07:07:59 UTC, 0 replies.
- [tika] 01/03: Start on a multiple parser that would try several text encodings, pick the best and use that, to ensure it would be possible - posted by ni...@apache.org on 2018/03/14 07:08:00 UTC, 0 replies.
- [tika] 02/03: Give parserCompleted the ParseContext, use that to pass around for the pick-best-text case what charsets to try next and what text we got from them - posted by ni...@apache.org on 2018/03/14 07:08:01 UTC, 0 replies.
- [tika] 03/03: Some (currently failing) Supplemental Parser tests - posted by ni...@apache.org on 2018/03/14 07:08:02 UTC, 0 replies.
- [tika] branch multiple-parsers updated (6a39214 -> 12a98b6) - posted by ni...@apache.org on 2018/03/14 17:35:14 UTC, 0 replies.
- [tika] 01/04: Correct Metadata merging by policy, and get (incomplete) unit tests to pass - posted by ni...@apache.org on 2018/03/14 17:35:15 UTC, 0 replies.
- [tika] 02/04: Further unit tests - posted by ni...@apache.org on 2018/03/14 17:35:16 UTC, 0 replies.
- [tika] 03/04: Optionally use a new Handler for each Parser, if a factory was given - posted by ni...@apache.org on 2018/03/14 17:35:17 UTC, 0 replies.
- [tika] 04/04: Keep all implemented and unit test - posted by ni...@apache.org on 2018/03/14 17:35:18 UTC, 0 replies.
- [tika] branch multiple-parsers updated (12a98b6 -> 50f8591) - posted by ni...@apache.org on 2018/03/19 10:44:31 UTC, 0 replies.
- [tika] 01/02: Remove un-used reference - posted by ni...@apache.org on 2018/03/19 10:44:32 UTC, 0 replies.
- [tika] 02/02: Fix test references to embedded exception property definition - posted by ni...@apache.org on 2018/03/19 10:44:33 UTC, 0 replies.
- [tika] branch multiple-parsers updated: All obvious places that need changing have, alias back in the original name for compatibility - posted by ni...@apache.org on 2018/03/19 14:33:32 UTC, 0 replies.
- [Tika Wiki] Update of "API Bindings for Tika" by ChrisMattmann - posted by Apache Wiki <wi...@apache.org> on 2018/03/21 02:09:02 UTC, 0 replies.
- [tika] branch master updated (1df10c3 -> 682c38d) - posted by ni...@apache.org on 2018/03/21 08:29:13 UTC, 0 replies.
- [tika] branch multiple-parsers updated (f80fc23 -> 6514a00) - posted by ni...@apache.org on 2018/03/21 08:29:16 UTC, 0 replies.
- [tika] 01/01: Merge commit '682c38d' into multiple-parsers - posted by ni...@apache.org on 2018/03/21 08:29:17 UTC, 0 replies.
- [tika] branch multiple-parsers updated (6514a00 -> 54477aa) - posted by ni...@apache.org on 2018/03/21 08:40:29 UTC, 0 replies.
- [tika] 01/02: ParserUtils methods for handling the reset/re-read of the stream - posted by ni...@apache.org on 2018/03/21 08:40:30 UTC, 0 replies.
- [tika] 02/02: Simplify stream resetting logic by using new ParserUtil methods for it - posted by ni...@apache.org on 2018/03/21 08:40:31 UTC, 0 replies.
- [tika] branch master updated: TIKA-2579 and TIKA-2607: Upgrade PDFBox to 2.0.9 and include new jbig2-imageio from org.apache.pdfbox - posted by ta...@apache.org on 2018/03/28 14:25:48 UTC, 0 replies.
- [tika] branch master updated: TIKA-2614 -- treat simple body inline, not as an attachment - posted by ta...@apache.org on 2018/03/28 15:33:27 UTC, 0 replies.
- [tika] branch master updated: TIKA-2616 -- preserve message/news - posted by ta...@apache.org on 2018/03/28 15:46:41 UTC, 0 replies.
- [tika] branch master updated: TIKA-2617 -- handle new IOOBE on streams now parsed as npoifs in ppt embedded streams as any other IOException on an embedded stream - posted by ta...@apache.org on 2018/03/28 16:28:29 UTC, 0 replies.
- [tika] branch branch_1x updated: fix cherry-pick conflict - posted by ta...@apache.org on 2018/03/28 16:43:16 UTC, 0 replies.
- [tika] branch branch_1x updated (029715d -> c5cf55f) - posted by ta...@apache.org on 2018/03/28 16:46:27 UTC, 0 replies.
- [tika] 01/03: TIKA-2614 -- treat simple body inline, not as an attachment - posted by ta...@apache.org on 2018/03/28 16:46:28 UTC, 0 replies.
- [tika] 02/03: TIKA-2616 -- preserve message/news - posted by ta...@apache.org on 2018/03/28 16:46:29 UTC, 0 replies.
- [tika] 03/03: TIKA-2617 -- handle new IOOBE on streams now parsed as npoifs in ppt embedded streams as any other IOException on an embedded stream - posted by ta...@apache.org on 2018/03/28 16:46:30 UTC, 0 replies.
- [tika] branch master updated: Update forbiddenapis to version 2.5 and remove commons-io hack from pom.xml - posted by tp...@apache.org on 2018/03/28 18:34:51 UTC, 0 replies.
- [tika] branch master updated: TIKA-2618 -- avoid overwriting labels - posted by ta...@apache.org on 2018/03/28 19:12:14 UTC, 0 replies.
- [tika] branch branch_1x updated (c5cf55f -> ca9c2f5) - posted by ta...@apache.org on 2018/03/28 19:13:40 UTC, 0 replies.
- [tika] 01/02: Update forbiddenapis to version 2.5 and remove commons-io hack from pom.xml - posted by ta...@apache.org on 2018/03/28 19:13:41 UTC, 0 replies.
- [tika] 02/02: TIKA-2618 -- avoid overwriting labels - posted by ta...@apache.org on 2018/03/28 19:13:42 UTC, 0 replies.
- [tika] branch branch_1x updated: update CHANGES.txt because of conflict in cherry-pick - posted by ta...@apache.org on 2018/03/29 11:14:49 UTC, 0 replies.
- [tika] branch master updated: TIKA-2621 -- add support for brotli - posted by ta...@apache.org on 2018/03/29 17:50:15 UTC, 0 replies.
- [tika] branch master updated (2cb195a -> 3ecccf1) - posted by ta...@apache.org on 2018/03/29 18:58:13 UTC, 0 replies.
- [tika] 01/02: Merge branch 'TIKA-2613' of https://github.com/ewanmellor/tika into ewanmellor-TIKA-2613 - posted by ta...@apache.org on 2018/03/29 18:58:14 UTC, 0 replies.
- [tika] 02/02: Merge branch 'ewanmellor-TIKA-2613' - posted by ta...@apache.org on 2018/03/29 18:58:15 UTC, 0 replies.
- [tika] branch branch_1x updated (f9910e2 -> 04225d2) - posted by ta...@apache.org on 2018/03/29 19:15:20 UTC, 0 replies.
- [tika] 01/04: Fix for TIKA-2582 contributed by ewanmellor. - posted by ta...@apache.org on 2018/03/29 19:15:21 UTC, 0 replies.
- [tika] 02/04: Fix for TIKA-2584 contributed by ewanmellor. - posted by ta...@apache.org on 2018/03/29 19:15:22 UTC, 0 replies.
- [tika] 03/04: Fix for TIKA-2613 contributed by ewanmellor. - posted by ta...@apache.org on 2018/03/29 19:15:23 UTC, 0 replies.
- [tika] 04/04: TIKA-2621 -- add support for brotli - posted by ta...@apache.org on 2018/03/29 19:15:24 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-2621 -- add support for brotli - update CHANGES.txt - posted by ta...@apache.org on 2018/03/29 19:16:06 UTC, 0 replies.
- [tika] branch master updated: TIKA-2620 allow configuration of setting KCMS - posted by ta...@apache.org on 2018/03/29 19:39:26 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-2620 allow configuration of setting KCMS - posted by ta...@apache.org on 2018/03/29 19:40:17 UTC, 0 replies.
- [tika] branch branch_1x updated: fix cherry-picked version clash for TIKA-2621 - posted by ta...@apache.org on 2018/03/29 20:00:30 UTC, 0 replies.