You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by Nick Burch <ni...@alfresco.com> on 2011/12/21 01:50:01 UTC
Re: svn commit: r1221323 - in /tika/trunk: tika-core/src/main/resources/org/apache/tika/mime/
tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ tika-parsers/src/test/java/org/apache/tika/detect/
tika-parsers/src/test/java/org/apache/tika/mime/ t...
On 20/12/11 15:55, amylka@apache.org wrote:
> <mime-type type="application/vnd.ms-works">
> +<magic priority="50">
> +<match value="0xd0cf11e0a1b11ae1" type="string" offset="0:8">
> +<match value="M\x00a\x00t\x00O\x00S\x00T" type="string" offset="1152:4096" />
> +</match>
> +</magic>
> <glob pattern="*.wps"/>
> --- tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/microsoft/POIFSContainerDetector.java (original)
> +++ tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/microsoft/POIFSContainerDetector.java Tue Dec 20 15:55:48 2011
> @@ -164,6 +164,9 @@ public class POIFSContainerDetector impl
> return VSD;
> } else if (names.contains("\u0001Ole10Native")) {
> return OLE10_NATIVE;
> + } else if (names.contains("MatOST")) {
> + // this occurs on older Works Word Processor files (versions 3.0 and 4.0)
> + return WPS;
> } else if (names.contains("CONTENTS")&& names.contains("SPELLING")) {
Can you check your indenting settings? Only there seems to be something
a bit odd with the indenting on several bits of this commit
Cheers
Nick