You are viewing a plain text version of this content. The canonical link for it is here.
- [tika] branch main created (now 62fe4ad) - posted by ta...@apache.org on 2020/07/08 20:44:40 UTC, 0 replies.
- [tika-docker] branch main created (now 798ed1a) - posted by dm...@apache.org on 2020/07/13 20:11:45 UTC, 0 replies.
- [tika] branch main updated: writeLimit and maxEmbeddedResources for recursive parsing - add header (#326) - posted by ta...@apache.org on 2020/07/15 17:37:28 UTC, 0 replies.
- [tika] branch main updated (e2250e0 -> f53eeaf) - posted by ta...@apache.org on 2020/07/15 17:51:46 UTC, 0 replies.
- [tika] branch main updated: TIKA-3134 -- fix bug and add unit tests - posted by ta...@apache.org on 2020/07/15 18:27:50 UTC, 0 replies.
- [tika] branch branch_1x updated (9b42784 -> 941a150) - posted by ta...@apache.org on 2020/07/15 18:59:37 UTC, 0 replies.
- [tika] 01/05: TIKA-3130 -- add ICC prefix - posted by ta...@apache.org on 2020/07/15 18:59:38 UTC, 0 replies.
- [tika] 02/05: TIKA-3135 -- no need to spool the file for the metadata extractor's HeifParser - posted by ta...@apache.org on 2020/07/15 18:59:39 UTC, 0 replies.
- [tika] 03/05: writeLimit and maxEmbeddedResources for recursive parsing - add header (#326) - posted by ta...@apache.org on 2020/07/15 18:59:40 UTC, 0 replies.
- [tika] 04/05: TIKA-3134 -- fix bug and add unit tests - posted by ta...@apache.org on 2020/07/15 18:59:41 UTC, 0 replies.
- [tika] 05/05: fix merge conflicts and unit test - posted by ta...@apache.org on 2020/07/15 18:59:42 UTC, 0 replies.
- [tika] branch main updated: TIKA-3131 -- swap default values of averageCharTolerance and spacingTolerance to match PDFBox defaults (#325) - posted by ta...@apache.org on 2020/07/15 19:08:08 UTC, 0 replies.
- [tika] branch main updated: TIKA-3088 - fix NPE in OpenDocumentContentParser caused by com.sun.org.apache.xml.internal.serializer.ToHTMLStream - posted by ta...@apache.org on 2020/07/16 14:12:31 UTC, 0 replies.
- [tika] branch branch_1x updated (941a150 -> d8d4af1) - posted by ta...@apache.org on 2020/07/16 14:14:15 UTC, 0 replies.
- [tika] 01/02: TIKA-3131 -- swap default values of averageCharTolerance and spacingTolerance to match PDFBox defaults (#325) - posted by ta...@apache.org on 2020/07/16 14:14:16 UTC, 0 replies.
- [tika] 02/02: TIKA-3088 - fix NPE in OpenDocumentContentParser caused by com.sun.org.apache.xml.internal.serializer.ToHTMLStream - posted by ta...@apache.org on 2020/07/16 14:14:17 UTC, 0 replies.
- [tika] branch main updated (44a30b3 -> d6257f4) - posted by ta...@apache.org on 2020/07/16 17:00:23 UTC, 0 replies.
- [tika] branch branch_1x updated: fix for TIKA-3139 contributed by wiwi (#328) - posted by ta...@apache.org on 2020/07/16 19:33:53 UTC, 0 replies.
- [tika] branch TIKA-3137 created (now 3bdcd97) - posted by ta...@apache.org on 2020/07/16 19:58:23 UTC, 0 replies.
- [tika] 01/01: TIKA-3137 -- first pass, need to add unit tests for tika-batch - posted by ta...@apache.org on 2020/07/16 19:58:24 UTC, 0 replies.
- [tika] branch TIKA-3140 created (now 78e5b9a) - posted by ta...@apache.org on 2020/07/16 21:28:43 UTC, 0 replies.
- [tika] branch TIKA-3140 updated (78e5b9a -> eb6e07e) - posted by ta...@apache.org on 2020/07/17 15:56:07 UTC, 0 replies.
- [tika] branch TIKA-3140 updated (eb6e07e -> 4971e2e) - posted by ta...@apache.org on 2020/07/17 16:56:59 UTC, 0 replies.
- [tika] branch main updated: TIKA-3129 -- add a status endpoint to report server status. Users must turn it on via the commandline -status option. - posted by ta...@apache.org on 2020/07/17 17:06:38 UTC, 4 replies.
- [tika] branch main updated (23329a6 -> bf224dc) - posted by ta...@apache.org on 2020/07/17 17:17:18 UTC, 0 replies.
- [tika] 01/01: Merge remote-tracking branch 'origin/TIKA-3140' into main - posted by ta...@apache.org on 2020/07/17 17:17:19 UTC, 0 replies.
- [tika] branch branch_1x updated (6686a6f -> d2aa1ac) - posted by ta...@apache.org on 2020/07/17 17:47:03 UTC, 0 replies.
- [tika] 01/06: TIKA-3129 -- add a status endpoint to report server status. Users must turn it on via the commandline -status option. - posted by ta...@apache.org on 2020/07/17 17:47:04 UTC, 0 replies.
- [tika] 02/06: TIKA-3137 -- first pass, need to add unit tests for tika-batch - posted by ta...@apache.org on 2020/07/17 17:47:05 UTC, 0 replies.
- [tika] 03/06: TIKA-3140 -- initial commit - posted by ta...@apache.org on 2020/07/17 17:47:06 UTC, 0 replies.
- [tika] 04/06: fix merge conflicts - posted by ta...@apache.org on 2020/07/17 17:47:07 UTC, 0 replies.
- [tika] 05/06: TIKA-3137 add a list type for Param/configuration to avoid the comma-delimited lists which will get huge and ugly and were a bad idea. - posted by ta...@apache.org on 2020/07/17 17:47:08 UTC, 0 replies.
- [tika] 06/06: fix merge conflicts - posted by ta...@apache.org on 2020/07/17 17:47:09 UTC, 0 replies.
- [tika] branch main updated: TIKA-3140 -- add the tika-eval metadata filter to a service file so that it loads automatically - posted by ta...@apache.org on 2020/07/17 19:22:37 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-3140 -- add the tika-eval metadata filter to a service file so that it loads automatically - posted by ta...@apache.org on 2020/07/17 19:28:30 UTC, 0 replies.
- [tika] branch main updated: Improve unit test to ensure that the CompressorParser is not called - posted by ta...@apache.org on 2020/07/20 20:57:36 UTC, 3 replies.
- [tika] branch main updated (839d318 -> ed0c91f) - posted by ta...@apache.org on 2020/07/24 21:13:08 UTC, 0 replies.
- [tika] 01/03: TIKA-3146 -- add Nutch's TextProfileSignature to tika-eval - posted by ta...@apache.org on 2020/07/24 21:13:09 UTC, 1 replies.
- [tika] 02/03: TIKA-3145 -- add TextSha256Signature - posted by ta...@apache.org on 2020/07/24 21:13:10 UTC, 1 replies.
- [tika] 03/03: TIKA-3146 -- clean up text profile signature and add unit test for cjk - posted by ta...@apache.org on 2020/07/24 21:13:11 UTC, 1 replies.
- [tika] branch branch_1x updated (499394e -> 1a0314f) - posted by ta...@apache.org on 2020/07/24 21:20:23 UTC, 0 replies.
- [tika] branch main updated: TIKA-3147 -- strip punctuation before language id; fix bug that omitted filters on text before language id. - posted by ta...@apache.org on 2020/07/27 16:54:25 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-3147 -- strip punctuation before language id; fix bug that omitted filters on text before language id. - posted by ta...@apache.org on 2020/07/27 16:57:30 UTC, 0 replies.
- [tika] branch main updated: TIKA-3147 -- drop tokens below quant value. - posted by ta...@apache.org on 2020/07/27 19:30:00 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-3147 -- drop tokens below quant value. - posted by ta...@apache.org on 2020/07/27 19:31:09 UTC, 0 replies.
- [tika] branch main updated (ca4852d -> 64b429c) - posted by ta...@apache.org on 2020/07/31 17:41:30 UTC, 0 replies.