Posted to dev@tika.apache.org by "Nick Burch (JIRA)" <ji...@apache.org> on 2014/06/02 18:47:01 UTC
[jira] [Commented] (TIKA-1168) The IWork NumbersContentHandler
returns unsupported Metadata PropertyType
[ https://issues.apache.org/jira/browse/TIKA-1168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14015538#comment-14015538 ]
Nick Burch commented on TIKA-1168:
----------------------------------
I've just tried with a recent svn snapshot build, and Tika is able to extract the metadata from your test file without errors.
Any chance you could re-test, to see whether the metadata work done since 1.4 has solved it for you too?
> The IWork NumbersContentHandler returns unsupported Metadata PropertyType
> -------------------------------------------------------------------------
>
> Key: TIKA-1168
> URL: https://issues.apache.org/jira/browse/TIKA-1168
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.4
> Reporter: Jake Gilley
> Attachments: testMultipleSheets.numbers
>
>
> The resolveMetadataKey function in org.apache.tika.parser.iwork.NumbersContentHandler returns a Property with PropertyType "SIMPLE" by default. This causes a PropertyTypeException("SIMPLE is not supported") when the Property is added using Metadata.add(Property, String). The exception is raised by line 92 in org.apache.tika.parser.iwork.NumbersContentHandler:
> metadata.add(metadataKey, attributes.getValue("sfa:string"));
> Caused by: org.apache.tika.metadata.PropertyTypeException: SIMPLE is not supported
> at org.apache.tika.metadata.Metadata.add(Metadata.java:373)
> at org.apache.tika.parser.iwork.NumbersContentHandler.startElement(NumbersContentHandler.java:92)
> at org.apache.tika.sax.ContentHandlerDecorator.startElement(ContentHandlerDecorator.java:126)
> at org.apache.xerces.parsers.AbstractSAXParser.startElement(Unknown Source)
> at org.apache.xerces.parsers.AbstractXMLDocumentParser.emptyElement(Unknown Source)
> at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanStartElement(Unknown Source)
> at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(Unknown Source)
> at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source)
> at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
> at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
> at org.apache.xerces.parsers.XMLParser.parse(Unknown Source)
> at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
> at org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown Source)
> at org.apache.xerces.jaxp.SAXParserImpl.parse(Unknown Source)
> at javax.xml.parsers.SAXParser.parse(Unknown Source)
> at org.apache.tika.parser.iwork.IWorkPackageParser.parse(IWorkPackageParser.java:209)
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
> ... 16 more
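
For readers without the Tika 1.4 sources at hand, the failure described in the report can be sketched with a simplified, self-contained model. Every class below (PropertyTypeSketch, Property, Metadata, PropertyTypeException, PropertyType) is a hypothetical stand-in written for illustration, not Tika's actual code, and the check inside add() is an assumption based only on the exception message quoted above.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class PropertyTypeSketch {

    // Hypothetical stand-in for the property types mentioned in the report.
    enum PropertyType { SIMPLE, BAG, SEQ }

    // Hypothetical stand-in for org.apache.tika.metadata.PropertyTypeException.
    static class PropertyTypeException extends IllegalArgumentException {
        PropertyTypeException(String message) {
            super(message);
        }
    }

    // A metadata key together with its declared type.
    static final class Property {
        final String name;
        final PropertyType type;

        Property(String name, PropertyType type) {
            this.name = name;
            this.type = type;
        }
    }

    // Simplified metadata store: add() appends a value, set() overwrites.
    static class Metadata {
        private final Map<String, List<String>> values = new HashMap<>();

        // Overwrites any existing values; allowed for every type in this model.
        void set(Property property, String value) {
            List<String> single = new ArrayList<>();
            single.add(value);
            values.put(property.name, single);
        }

        // Assumption: appending is rejected for SIMPLE properties, which
        // reproduces the message quoted in the report.
        void add(Property property, String value) {
            if (property.type == PropertyType.SIMPLE) {
                throw new PropertyTypeException(property.type + " is not supported");
            }
            values.computeIfAbsent(property.name, k -> new ArrayList<>()).add(value);
        }

        List<String> get(Property property) {
            return values.getOrDefault(property.name, new ArrayList<>());
        }
    }

    public static void main(String[] args) {
        Metadata metadata = new Metadata();
        // Mirrors a key resolved with the default SIMPLE type.
        Property key = new Property("example-key", PropertyType.SIMPLE);

        try {
            metadata.add(key, "value");
        } catch (PropertyTypeException e) {
            System.out.println("add failed: " + e.getMessage());
        }

        // In this model, set() accepts a SIMPLE property.
        metadata.set(key, "value");
        System.out.println("set stored: " + metadata.get(key));
    }
}
```

In this model the failure comes from appending to a single-valued key, so either a multi-valued property type or a call to set() avoids it. Whether the metadata work done in Tika since 1.4 took either route is not stated in this thread.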
--
This message was sent by Atlassian JIRA
(v6.2#6252)