You are viewing a plain text version of this content. The canonical link for it is here.
- [tika] branch branch_1x updated: [TIKA-3344] [TIKA-3345] (#422) - posted by ta...@apache.org on 2021/04/05 19:36:25 UTC, 0 replies.
- [tika] branch TIKA-3347 created (now 588b92d) - posted by ta...@apache.org on 2021/04/06 13:54:20 UTC, 0 replies.
- [tika] 01/02: Merge remote-tracking branch 'origin/main' into main - posted by ta...@apache.org on 2021/04/06 13:54:21 UTC, 0 replies.
- [tika] 02/02: TIKA-3346 -- parsers should only appear once in x-parsed-by - posted by ta...@apache.org on 2021/04/06 13:54:22 UTC, 0 replies.
- [tika] branch TIKA-3347 updated: TIKA-3347 -- first attempt at upgrading to PDFBox 3.x - posted by ta...@apache.org on 2021/04/06 13:54:47 UTC, 0 replies.
- [tika] branch TIKA-3347 updated: TIKA-3347 -- small improvement to PDFMarkedContent2XHTML - posted by ta...@apache.org on 2021/04/06 14:22:24 UTC, 0 replies.
- [tika] branch TIKA-3347 updated: TIKA-3347 -- fix logic in PDFMarkedContent2XHTML - posted by ta...@apache.org on 2021/04/06 14:58:21 UTC, 0 replies.
- [tika] branch main updated: add back dependency jaxb-runtime for tika-parser-pdf-module - posted by pe...@apache.org on 2021/04/07 12:32:37 UTC, 0 replies.
- [tika] branch TIKA-3347 updated: TIKA-3347 -- merge 3.0.0-SNAPSHOT - posted by ta...@apache.org on 2021/04/07 15:10:58 UTC, 0 replies.
- [tika] branch main updated: don't close readers/writers in json serialization - posted by ta...@apache.org on 2021/04/07 16:23:15 UTC, 0 replies.
- [tika] branch TIKA-3347 updated (2b1353e -> f336c59) - posted by ta...@apache.org on 2021/04/08 21:47:54 UTC, 0 replies.
- [tika] 01/02: Merge remote-tracking branch 'origin/main' into TIKA-3347 - posted by ta...@apache.org on 2021/04/08 21:47:55 UTC, 1 replies.
- [tika] 02/02: Actually use the underlying file if it is there... - posted by ta...@apache.org on 2021/04/08 21:47:56 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-3350 - posted by ta...@apache.org on 2021/04/09 13:45:53 UTC, 0 replies.
- [tika] branch main updated (4bd278f -> 149e991) - posted by ta...@apache.org on 2021/04/09 15:04:37 UTC, 0 replies.
- [tika] 01/04: fix jdbcfetchiterator unit test to block on adding to queue - posted by ta...@apache.org on 2021/04/09 15:04:38 UTC, 0 replies.
- [tika] 02/04: close json writer in tikacli - posted by ta...@apache.org on 2021/04/09 15:04:39 UTC, 0 replies.
- [tika] 03/04: close json writer in tika-server - posted by ta...@apache.org on 2021/04/09 15:04:40 UTC, 0 replies.
- [tika] 04/04: TIKA-3350 use the file if supplied via TikaInputStream - posted by ta...@apache.org on 2021/04/09 15:04:41 UTC, 0 replies.
- [tika] branch main updated: [TIKA-3344] [TIKA-3345] main (#424) - posted by ta...@apache.org on 2021/04/09 16:19:16 UTC, 0 replies.
- [tika] branch main updated: cleanup dependencies in tika-eval - posted by ta...@apache.org on 2021/04/09 16:19:58 UTC, 0 replies.
- [tika] branch main updated: TIKA-3343 -- move Tika's legacy lang detector to its own submodule in tika-langdetect - posted by ta...@apache.org on 2021/04/09 17:35:45 UTC, 0 replies.
- [tika] branch main updated (4069025 -> a632354) - posted by ta...@apache.org on 2021/04/10 11:09:01 UTC, 0 replies.
- [tika] 01/02: TIKA-3343 -- move Tika's legacy lang detector to its own submodule in tika-langdetect -- git add lang models - posted by ta...@apache.org on 2021/04/10 11:09:02 UTC, 0 replies.
- [tika] 02/02: update comparison reports for extract exception comparisons - posted by ta...@apache.org on 2021/04/10 11:09:03 UTC, 0 replies.
- [tika] branch main updated (a632354 -> 5c3050b) - posted by ta...@apache.org on 2021/04/12 15:51:34 UTC, 0 replies.
- [tika] 01/03: Merge remote-tracking branch 'origin/main' into main - posted by ta...@apache.org on 2021/04/12 15:51:35 UTC, 0 replies.
- [tika] 02/03: set defaul eol=lf for better consistency with style checker - posted by ta...@apache.org on 2021/04/12 15:51:36 UTC, 0 replies.
- [tika] 03/03: Merge remote-tracking branch 'origin/main' into main - posted by ta...@apache.org on 2021/04/12 15:51:37 UTC, 1 replies.
- [tika] branch main updated: set defaul eol=lf for better consistency with style checker, second try - posted by ta...@apache.org on 2021/04/12 15:55:46 UTC, 0 replies.
- [tika] branch main updated: set defaul eol=lf for better consistency with style checker, third try - posted by ta...@apache.org on 2021/04/12 15:58:09 UTC, 0 replies.
- [tika-docker] branch master updated: Update README.md - posted by dm...@apache.org on 2021/04/13 08:51:22 UTC, 1 replies.
- [tika] branch main updated: Tika's OpenNLPDetector now covers 148 languages and language-script pairs (TIKA-3340). - posted by ta...@apache.org on 2021/04/13 16:02:56 UTC, 0 replies.
- [tika] branch main updated: update changes in main branch after 1.26 release - posted by ta...@apache.org on 2021/04/13 16:04:09 UTC, 0 replies.
- [tika] branch main updated: TIKA-3351 -- prevent multiple "parsed by" entries - posted by ta...@apache.org on 2021/04/13 16:24:40 UTC, 0 replies.
- [tika] branch main updated: TIKA-3351 -- prevent multiple "parsed by" entries...might need to revert this approach... - posted by ta...@apache.org on 2021/04/13 16:31:12 UTC, 0 replies.
- [tika] branch main updated: fix gitignore and git add actual language model - posted by ta...@apache.org on 2021/04/13 18:15:34 UTC, 0 replies.
- [tika] branch main updated: TIKA-3352 -- add a json option for the /tika endpoint - posted by ta...@apache.org on 2021/04/14 10:12:18 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-3352: Add json output for /tika endpoint in tika-server - posted by ta...@apache.org on 2021/04/14 13:52:19 UTC, 0 replies.
- [tika] branch main updated: TIKA-3355 -- integrate fakeload library into MockParser - posted by ta...@apache.org on 2021/04/14 20:22:39 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-3355 -- integrate fakeload library into MockParser - posted by ta...@apache.org on 2021/04/14 20:28:46 UTC, 0 replies.
- [tika] branch main updated: bump line length in checkstyle...coz who doesn't have a 120 column CRT nowadays... - posted by ta...@apache.org on 2021/04/15 00:16:04 UTC, 0 replies.
- [tika] branch TIKA-3347 updated (f336c59 -> 473acc1) - posted by ta...@apache.org on 2021/04/15 13:59:03 UTC, 0 replies.
- [tika] 02/02: Few more required updates... - posted by ta...@apache.org on 2021/04/15 13:59:05 UTC, 0 replies.
- [tika] branch main updated: TIKA-3355 -- include dependencies in test-jar - posted by ta...@apache.org on 2021/04/15 14:58:44 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-3355 -- include dependencies in test-jar - posted by ta...@apache.org on 2021/04/15 15:01:34 UTC, 0 replies.
- [tika] branch main updated: Fix up exception handling for invalid config (#426) - posted by ta...@apache.org on 2021/04/15 15:57:07 UTC, 0 replies.
- [tika] branch main updated: Fix build on Windows... sorry :( - posted by ta...@apache.org on 2021/04/15 16:23:34 UTC, 0 replies.
- [tika] branch main updated: TIKA-3359 -- extract rich media from PDFs - posted by ta...@apache.org on 2021/04/16 13:58:15 UTC, 0 replies.
- [tika-helm] branch main created (now 28d3b6f) - posted by le...@apache.org on 2021/04/16 15:05:02 UTC, 0 replies.
- [tika-helm] 01/05: Initial chart creation - posted by le...@apache.org on 2021/04/16 15:05:03 UTC, 0 replies.
- [tika-helm] 02/05: Add README and LICENSE - posted by le...@apache.org on 2021/04/16 15:05:04 UTC, 0 replies.
- [tika-helm] 03/05: First working implementation and documentation - posted by le...@apache.org on 2021/04/16 15:05:05 UTC, 0 replies.
- [tika-helm] 04/05: Update version to latest-full - posted by le...@apache.org on 2021/04/16 15:05:06 UTC, 0 replies.
- [tika-helm] 05/05: README update - posted by le...@apache.org on 2021/04/16 15:05:07 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-3196 -- multithreading issue in package parser - posted by ta...@apache.org on 2021/04/19 13:35:37 UTC, 0 replies.
- [tika] branch branch_1x updated: [TIKA-3357] removes ambiguity by choosing handler based on produce type (#427) - posted by ta...@apache.org on 2021/04/19 13:38:02 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-3357 -- add unit test - posted by ta...@apache.org on 2021/04/19 13:46:14 UTC, 0 replies.
- [tika] branch main updated: TIKA-3359 -- extract rich media from PDFs -- broaded the search for /EF - posted by ta...@apache.org on 2021/04/20 14:37:16 UTC, 0 replies.
- [tika] branch main updated: TIKA-3362 -- enable configuration of content type, writelimit and max embedded resources for async, FetchEmitTuple - posted by ta...@apache.org on 2021/04/20 20:29:25 UTC, 0 replies.
- [tika-helm] branch main updated: TIKA-3360 Retrospective release of tika-helm for tika-docker 1.26 and 1.26-full - posted by le...@apache.org on 2021/04/21 16:27:14 UTC, 0 replies.
- [tika-helm] annotated tag v1.26-full updated (f5dce91 -> 34e0b16) - posted by le...@apache.org on 2021/04/21 16:29:48 UTC, 0 replies.
- [tika-helm] branch main updated: Pin development version on 'latest-full' to optimize releasse management procedure - posted by le...@apache.org on 2021/04/21 17:03:37 UTC, 0 replies.
- [tika-helm] annotated tag v1.26 updated (50aed45 -> 731a6da) - posted by le...@apache.org on 2021/04/21 17:06:49 UTC, 0 replies.
- [tika-helm] branch main updated: Update README.md - posted by le...@apache.org on 2021/04/21 18:56:42 UTC, 2 replies.
- [tika-helm] branch main updated: Update values.yaml - posted by le...@apache.org on 2021/04/21 21:07:11 UTC, 1 replies.
- [tika] branch branch_1x updated: [TIKA-3353] Prometheus and JMX monitoring over micrometer (#429) - posted by ta...@apache.org on 2021/04/22 14:42:57 UTC, 0 replies.
- [tika] branch main updated: TIKA-3312 -- upgrade to log4j2 throughout - posted by ta...@apache.org on 2021/04/22 18:10:05 UTC, 0 replies.
- [tika] branch main updated: checkstyle fixups - posted by ta...@apache.org on 2021/04/23 14:22:02 UTC, 0 replies.
- [tika] branch add-bom created (now 4b64d29) - posted by gr...@apache.org on 2021/04/23 23:49:10 UTC, 0 replies.
- [tika] 01/02: Add tika-bom module - posted by gr...@apache.org on 2021/04/23 23:49:11 UTC, 1 replies.
- [tika] 02/02: Add dependencyManagement to tika-pipes (make it BOM too) - posted by gr...@apache.org on 2021/04/23 23:49:12 UTC, 1 replies.
- [tika] branch add-bom updated (4b64d29 -> 3868979) - posted by gr...@apache.org on 2021/04/23 23:50:00 UTC, 0 replies.
- [tika] branch add-bom updated (3868979 -> 1f46702) - posted by gr...@apache.org on 2021/04/23 23:54:25 UTC, 0 replies.
- [tika] 01/02: [TIKA-3367] Add tika-bom module - posted by gr...@apache.org on 2021/04/23 23:54:26 UTC, 0 replies.
- [tika] 02/02: [TIKA-3367] Add dependencyManagement to tika-pipes (make it BOM too) - posted by gr...@apache.org on 2021/04/23 23:54:27 UTC, 0 replies.
- [tika] branch add-bom-1x created (now 8b70320) - posted by gr...@apache.org on 2021/04/24 00:05:29 UTC, 0 replies.
- [tika] 01/01: [TIKA-3368] Add tika-bom module - posted by gr...@apache.org on 2021/04/24 00:05:30 UTC, 0 replies.
- [tika] branch main updated: Update Tika site urls to https (in safe places) - posted by gr...@apache.org on 2021/04/24 00:45:03 UTC, 0 replies.
- [tika] branch main updated: Use https for xsd maven schema locations - posted by gr...@apache.org on 2021/04/24 00:57:29 UTC, 0 replies.
- [tika] branch main updated: TIKA-3373 Add the *.yml extension for YAML, which is commonly used, along with aliases for popular alternate mimetypes for it - posted by ni...@apache.org on 2021/04/27 12:05:48 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-3373 Add the *.yml extension for YAML, which is commonly used, along with aliases for popular alternate mimetypes for it - posted by ni...@apache.org on 2021/04/27 12:12:22 UTC, 0 replies.
- [tika] branch main updated: TIKA-3370 -- refactor AsyncProcessor - posted by ta...@apache.org on 2021/04/28 15:45:29 UTC, 0 replies.
- [tika] branch branch_1x updated (f7d5119 -> f414fe4) - posted by ta...@apache.org on 2021/04/28 16:04:57 UTC, 0 replies.
- [tika] 01/05: update changes for new development in 1.27...please may it never happen... 2.0.0 here we come! - posted by ta...@apache.org on 2021/04/28 16:04:58 UTC, 0 replies.
- [tika] 02/05: Merge remote-tracking branch 'origin/branch_1x' into branch_1x - posted by ta...@apache.org on 2021/04/28 16:04:59 UTC, 0 replies.
- [tika] 03/05: Merge remote-tracking branch 'origin/branch_1x' into branch_1x - posted by ta...@apache.org on 2021/04/28 16:05:00 UTC, 0 replies.
- [tika] 04/05: TIKA-3372 -- fix write limit in recursive parser handler - posted by ta...@apache.org on 2021/04/28 16:05:01 UTC, 0 replies.
- [tika] 05/05: Merge remote-tracking branch 'origin/branch_1x' into branch_1x - posted by ta...@apache.org on 2021/04/28 16:05:02 UTC, 0 replies.
- [tika] branch main updated: TIKA-2787 -- make WriteLimitReachedException public for Tika 2.x - posted by ta...@apache.org on 2021/04/28 16:28:29 UTC, 0 replies.
- [tika] branch main updated: TIKA-3374 -- apply charset detection for archive entry name (#433) - posted by ta...@apache.org on 2021/04/29 13:30:42 UTC, 0 replies.
- [tika] branch main updated: TIKA-3374 -- fix up to encoding detection in package parser - posted by ta...@apache.org on 2021/04/29 15:44:12 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-3374 add encoding detection to zip entry names via Ryan Liu. - posted by ta...@apache.org on 2021/04/29 17:17:32 UTC, 0 replies.
- [tika] branch main updated (fbac00b -> 9ac7e75) - posted by ta...@apache.org on 2021/04/29 17:26:06 UTC, 0 replies.
- [tika] 01/02: fix logic in digest key matcher in ExtractComparer - posted by ta...@apache.org on 2021/04/29 17:26:07 UTC, 0 replies.
- [tika] 02/02: TIKA-3376 -- improve write limit reached handling in new /tika json output - posted by ta...@apache.org on 2021/04/29 17:26:08 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-3376 improve handling of write limit reached in json output from /tika endpoint - posted by ta...@apache.org on 2021/04/29 17:31:41 UTC, 0 replies.
- [tika] branch branch_1x updated: fix logic in ExtractComparer - posted by ta...@apache.org on 2021/04/29 17:36:25 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-3372 -- fix write limit handling in the PDFParser - posted by ta...@apache.org on 2021/04/30 18:51:16 UTC, 0 replies.
- [tika] branch main updated (9ac7e75 -> 3420a79) - posted by ta...@apache.org on 2021/04/30 20:37:50 UTC, 0 replies.
- [tika] 01/03: TIKA-3371 -- add id to fetchtuple - posted by ta...@apache.org on 2021/04/30 20:37:51 UTC, 0 replies.
- [tika] 02/03: TIKA-3377 -- avoid loading the fetchers, emitters and other pipes components in TikaConfig as default. Still more cleanup necessary... - posted by ta...@apache.org on 2021/04/30 20:37:52 UTC, 0 replies.
- [tika] branch main updated (3420a79 -> 5cf12bb) - posted by ta...@apache.org on 2021/04/30 20:56:26 UTC, 0 replies.
- [tika] 01/02: TIKA-3374 -- allow users to turn off charset detection - posted by ta...@apache.org on 2021/04/30 20:56:27 UTC, 0 replies.
- [tika] 02/02: TIKA-3378 -- mv tika-langdetect-commons to tika-langdetect-test-commons - posted by ta...@apache.org on 2021/04/30 20:56:28 UTC, 0 replies.
- [tika] branch branch_1x updated: TIKA-3374 -- allow users to turn off charset detection of entry names in ZipArchives - posted by ta...@apache.org on 2021/04/30 21:00:32 UTC, 0 replies.
- [tika] branch main updated: TIKA-3372 -- fix writelimit in PDFs - posted by ta...@apache.org on 2021/04/30 21:39:50 UTC, 0 replies.