You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Nick Burch (JIRA)" <ji...@apache.org> on 2014/06/02 18:47:01 UTC

[jira] [Commented] (TIKA-1168) The IWork NumbersContentHandler returns unsupported Metadata PropertyType

    [ https://issues.apache.org/jira/browse/TIKA-1168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14015538#comment-14015538 ] 

Nick Burch commented on TIKA-1168:
----------------------------------

I've just tried with a recent svn snapshot build, and tika is able to extract the metadata from your test file without errors

Any chance you could re-test, to see if the metadata work since 1.4 has solved it for you too?

> The IWork NumbersContentHandler returns unsupported Metadata PropertyType
> -------------------------------------------------------------------------
>
>                 Key: TIKA-1168
>                 URL: https://issues.apache.org/jira/browse/TIKA-1168
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.4
>            Reporter: Jake Gilley
>         Attachments: testMultipleSheets.numbers
>
>
> The resolveMetadataKey function in org.apache.tika.parser.iwork.NumbersContentHandler returns a Property with PropertyType "SIMPLE" by default. This is causing a PropertyTypeException("SIMPLE is not supported") when adding the Property using Metatadata.add(Property, String). The exception is being generated by line 92 in org.apache.tika.parser.iwork.NumbersContentHandler.
> metadata.add(metadataKey, attributes.getValue("sfa:string"));
> Caused by: org.apache.tika.metadata.PropertyTypeException: SIMPLE is not supported
> 	at org.apache.tika.metadata.Metadata.add(Metadata.java:373)
> 	at org.apache.tika.parser.iwork.NumbersContentHandler.startElement(NumbersContentHandler.java:92)
> 	at org.apache.tika.sax.ContentHandlerDecorator.startElement(ContentHandlerDecorator.java:126)
> 	at org.apache.xerces.parsers.AbstractSAXParser.startElement(Unknown Source)
> 	at org.apache.xerces.parsers.AbstractXMLDocumentParser.emptyElement(Unknown Source)
> 	at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanStartElement(Unknown Source)
> 	at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(Unknown Source)
> 	at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source)
> 	at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
> 	at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
> 	at org.apache.xerces.parsers.XMLParser.parse(Unknown Source)
> 	at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
> 	at org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown Source)
> 	at org.apache.xerces.jaxp.SAXParserImpl.parse(Unknown Source)
> 	at javax.xml.parsers.SAXParser.parse(Unknown Source)
> 	at org.apache.tika.parser.iwork.IWorkPackageParser.parse(IWorkPackageParser.java:209)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
> 	... 16 more



--
This message was sent by Atlassian JIRA
(v6.2#6252)