- [tika] branch master updated: TIKA-2478 -- rfc822 parser should handle alternative parts as the Outlook parser does. Added parameter to allow for legacy behavior in RFC822Parser and a parameter to "include all alternatives" to the OutlookParser. - posted by ta...@apache.org on 2017/11/02 12:32:47 UTC, 0 replies.
- [tika] branch master updated: TIKA-2485 -- Allow configuration of markLimit in EncodingDetectors via tika-config.xml - posted by ta...@apache.org on 2017/11/02 13:41:18 UTC, 0 replies.
- [tika] branch master updated: TIKA-2489 -- upgrade to PDFBox 2.0.8 - posted by ta...@apache.org on 2017/11/03 13:37:40 UTC, 0 replies.
- [tika] branch master updated (93411f4 -> 88a5e51) - posted by ta...@apache.org on 2017/11/03 19:31:02 UTC, 0 replies.
- [tika] 01/02: allow for greater leniency in failure to load resources from the network - posted by ta...@apache.org on 2017/11/03 19:31:03 UTC, 0 replies.
- [tika] 02/02: TIKA-2490 and TIKA-2491 -- turn off initializable problem stderr warnings in tika-app, confirm that configuration of initializable problems works from an input file and allow for a tika-config.xml file without specifying a classloader - posted by ta...@apache.org on 2017/11/03 19:31:04 UTC, 0 replies.
- [Tika Wiki] Update of "TikaEvalOnVM" by TimothyAllison - posted by Apache Wiki <wi...@apache.org> on 2017/11/03 20:14:52 UTC, 6 replies.
- [tika] branch master updated: TIKA-2492 -- exclude pdfbox debugger - posted by ta...@apache.org on 2017/11/06 20:58:28 UTC, 0 replies.
- [tika] branch master updated: TIKA-2492 -- exclude pdfbox debugger, but get it right this time. - posted by ta...@apache.org on 2017/11/06 21:42:23 UTC, 0 replies.
- [tika] branch master updated: TIKA-2492 -- exclude pdfbox debugger from tika-bundle - posted by ta...@apache.org on 2017/11/08 15:38:37 UTC, 0 replies.
- [tika] branch master updated: TIKA-2488 -- catch potential npe in getting attachment's inputstream - posted by ta...@apache.org on 2017/11/13 15:29:07 UTC, 0 replies.
- [tika] branch master updated (9c2e1b9 -> 780ab0c) - posted by ta...@apache.org on 2017/11/13 17:49:08 UTC, 0 replies.
- [tika] 01/02: Upgrade to Jackson 2.9.2 (TIKA-2501). - posted by ta...@apache.org on 2017/11/13 17:49:09 UTC, 0 replies.
- [tika] 02/02: * Upgrade to OpenNLP 1.8.3 (TIKA-2502). - posted by ta...@apache.org on 2017/11/13 17:49:10 UTC, 0 replies.
- [tika] branch master updated: TIKA-2503. Need to confirm this doesn't break anything - posted by ta...@apache.org on 2017/11/13 18:23:13 UTC, 0 replies.
- [tika] branch master updated: TIKA-2486 upgrade metadata-extractor to avoid CVE in xmp-core to 2.10.1 - posted by ta...@apache.org on 2017/11/13 18:39:35 UTC, 0 replies.
- [tika] branch master updated: remove unused dependency - posted by ta...@apache.org on 2017/11/13 21:04:03 UTC, 0 replies.
- [tika] branch master updated: TIKA-2502 -- rollback until we can figure out how to get the upgrade working with our OSGi bundle. - posted by ta...@apache.org on 2017/11/13 21:26:25 UTC, 0 replies.
- [tika] branch master updated: TIKA-2483 -- revert loading of mime repository in PackageParser from TIKA-2311 to avoid NPE in ForkParser - posted by ta...@apache.org on 2017/11/14 15:48:19 UTC, 0 replies.
- [tika] branch master updated: TIKA-2034 upgrade xmpcore - posted by ta...@apache.org on 2017/11/14 15:57:55 UTC, 0 replies.
- [tika] branch master updated: TIKA-2504 exclude dependency on old vfs2 to remove vulnerability from plexus-utils - posted by ta...@apache.org on 2017/11/14 16:26:35 UTC, 0 replies.
- [tika] branch master updated: TIKA-2502 - Upgrade opennlp-tools to 1.8.3 maven-bundle-plugin to 3.3.0 - posted by bo...@apache.org on 2017/11/17 23:11:55 UTC, 0 replies.
- [tika] branch master updated: TIKA-2506 - Check config for null during DL4J Test. - posted by bo...@apache.org on 2017/11/18 01:25:10 UTC, 0 replies.
- [tika] branch master updated (1e8008c -> 91ef9a9) - posted by ma...@apache.org on 2017/11/21 17:04:07 UTC, 0 replies.
- [tika] 01/01: Merge pull request #208 from ThejanW/master - posted by ma...@apache.org on 2017/11/21 17:04:08 UTC, 0 replies.
- [tika] branch master updated: Remove docker files now present in https://github.com/USCDataScience/tika-dockers - posted by ma...@apache.org on 2017/11/21 17:15:26 UTC, 0 replies.
- [Tika Wiki] Update of "TikaAndVision" by ChrisMattmann - posted by Apache Wiki <wi...@apache.org> on 2017/11/22 00:02:37 UTC, 4 replies.
- [Tika Wiki] Update of "ImageCaption" by ChrisMattmann - posted by Apache Wiki <wi...@apache.org> on 2017/11/22 03:56:23 UTC, 1 reply.
- [Tika Wiki] Update of "TikaAndVisionVideo" by ChrisMattmann - posted by Apache Wiki <wi...@apache.org> on 2017/11/22 05:30:25 UTC, 0 replies.
- [tika] branch master updated: Update changes with TIKA-2400 / GH-208 - posted by ma...@apache.org on 2017/11/22 05:50:59 UTC, 0 replies.
- [tika] branch master updated (946614b -> 639f3bf) - posted by dm...@apache.org on 2017/11/24 01:08:47 UTC, 0 replies.
- [tika] 01/03: Fix for TIKA-2347 Adds underline extraction from word documents - posted by dm...@apache.org on 2017/11/24 01:08:48 UTC, 0 replies.
- [tika] 02/03: TIKA-2347 - Added extraction of element in DOCX files - posted by dm...@apache.org on 2017/11/24 01:08:49 UTC, 0 replies.
- [tika] 03/03: TIKA-2347 - Add underline extraction from Word documents (doc/docx) from Stuart Hendren as well as strikethrough extraction in docx. - posted by dm...@apache.org on 2017/11/24 01:08:50 UTC, 0 replies.
- [tika] branch master updated: TIKA-2347 - Add underline extraction from Word documents (doc/docx) from Stuart Hendren as well as strikethrough extraction in docx. - posted by dm...@apache.org on 2017/11/24 01:13:44 UTC, 0 replies.
- [tika] branch master updated: TIKA-2510 -- Extract media files from ooxml - posted by ta...@apache.org on 2017/11/27 18:02:52 UTC, 0 replies.
- [tika] branch master updated: TIKA-2511 Cache TikaConfig in EmbeddedDocumentUtil for faster processing of files with lots of embedded files. - posted by ta...@apache.org on 2017/11/27 19:53:56 UTC, 0 replies.
- [tika] branch master updated: clean up imports, update unit tests to use assertContains, and confirm that in xhtml doesn't add spaces in extracted text. - posted by ta...@apache.org on 2017/11/27 20:54:13 UTC, 0 replies.
- [tika] branch master updated: TIKA-2512 add underline/strikethrough extraction for docx and pptx in SAX-based parsers - posted by ta...@apache.org on 2017/11/28 13:17:50 UTC, 0 replies.
- [tika] branch master updated (ef3fc7b -> 72c4e33) - posted by ta...@apache.org on 2017/11/28 13:30:24 UTC, 0 replies.
- [tika] 01/03: Merge branch 'fix-oom-when-parsing-large-pdfs' of https://github.com/shrike/tika into shrike-fix-oom-when-parsing-large-pdfs - posted by ta...@apache.org on 2017/11/28 13:30:25 UTC, 0 replies.
- [tika] 02/03: Merge branch 'shrike-fix-oom-when-parsing-large-pdfs' - posted by ta...@apache.org on 2017/11/28 13:30:26 UTC, 0 replies.
- [tika] 03/03: Update test and add note in release notes. Many thanks, shrike! This closes 213. - posted by ta...@apache.org on 2017/11/28 13:30:27 UTC, 0 replies.
- [tika] branch master updated: TIKA-2510, correct fix. Only add to seen/handledTarget _after_ processing. - posted by ta...@apache.org on 2017/11/28 14:20:12 UTC, 0 replies.
- [tika] branch TIKA-2385 updated: Merge branch 'TIKA-2835' of https://github.com/pmweiss/tika into TIKA-2385 - posted by dm...@apache.org on 2017/11/30 23:43:11 UTC, 0 replies.